Data Extraction, Web Harvesting, Data Entry, Web Crawler Project
I need data extracted from approximately 650 town websites, with specific fields collected from each one. The winning bidder will need to visit each website and locate the section where Public Bids or Requests for Proposals (RFPs) are posted. This may require some searching on each site, but most have a link on the main page.
If a site has no page where these are listed, the bidder must enter “N/A”. In addition, some sites may require opening a PDF or other file to get additional information.
Most of this information can be copied and pasted into an Excel sheet, but feel free to use harvesting, data extraction, a web spider, etc.
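For bidders who prefer a scripted approach, the step of locating the bid/RFP page could be sketched roughly as below. This is only a minimal illustration using the Python standard library: the keyword list, the `LinkCollector` class, and the `find_bid_page` function are hypothetical names, not part of the project materials, and a simple substring match will produce false positives (for example "bid" inside "forbidden"), so every result still needs a manual check.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical keyword list; real town sites may use different wording.
BID_KEYWORDS = ("bid", "rfp", "request for proposal", "procurement", "purchasing")


class LinkCollector(HTMLParser):
    """Collects (href, link text) pairs from anchor tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an anchor with an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = " ".join("".join(self._text).split())
            self.links.append((self._href, text))
            self._href = None


def find_bid_page(home_url, html):
    """Return the first link that looks like a bids/RFP page, or "N/A"."""
    parser = LinkCollector()
    parser.feed(html)
    for href, text in parser.links:
        haystack = f"{href} {text}".lower()
        if any(keyword in haystack for keyword in BID_KEYWORDS):
            return urljoin(home_url, href)
    return "N/A"
```

The function takes HTML that has already been downloaded, so it can be tried on a saved copy of a town's main page before being run against live sites.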
I will provide:
1.) A website listing the target website addresses
2.) The Excel sheet format
I need several fields from each bid/RFP listed:
1. Town
2. Title (of the bid or proposal)
3. OpenDate
4. CloseDate (sometimes referred to as due date)
5. Description
6. Category
7. URL
8. MainBid page URL
9. Date Accessed (when you visited the site)
Note: the URL is for the specific individual bid, which is often a PDF link. The MainBid page URL is where all available bids are listed (see the example). If the site requires a login for additional information, please note that in the Description field; there is no need to create a login.
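As an illustration only, the nine fields above could be represented as one record per bid/RFP and written out in a form Excel can open. The sketch below assumes CSV output and assumes that "N/A" is an acceptable placeholder for any missing field; the actual sheet layout is the one supplied with the project, and the names `BidRecord`, `COLUMNS`, and `write_records` are hypothetical.

```python
import csv
import io
from dataclasses import astuple, dataclass, field
from datetime import date

# Column headers taken from the field list; the real sheet format may differ.
COLUMNS = [
    "Town", "Title", "OpenDate", "CloseDate", "Description",
    "Category", "URL", "MainBid page URL", "Date Accessed",
]


@dataclass
class BidRecord:
    """One row per bid/RFP; fields are declared in column order."""
    town: str
    title: str = "N/A"
    open_date: str = "N/A"
    close_date: str = "N/A"       # sometimes called the due date
    description: str = "N/A"      # note here if the site requires a login
    category: str = "N/A"
    url: str = "N/A"              # link to the individual bid, often a PDF
    main_bid_page_url: str = "N/A"
    date_accessed: str = field(default_factory=lambda: date.today().isoformat())


def write_records(records, out):
    """Write a header row followed by one CSV row per record."""
    writer = csv.writer(out)
    writer.writerow(COLUMNS)
    for record in records:
        writer.writerow(astuple(record))


if __name__ == "__main__":
    buffer = io.StringIO()
    # A town with no bid page: every field except Town and Date Accessed is "N/A".
    write_records([BidRecord(town="Exampleville")], buffer)
    print(buffer.getvalue())
```

Keeping the dataclass fields in the same order as the columns means each record maps directly onto a row without a separate lookup table.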
A good understanding of the English language is important.
I have attached an example. Also, please complete the SAMPLE sheet in the Excel file to demonstrate your ability to do the project. Four sites are listed.
Hi, I am very interested in working on this project. I am confident that I can do this job efficiently, with the required accuracy and on time. Please refer to my reviews, qualifications, and experience.