I need a program to extract information from Craigslist postings, with the following specifications:
1.) Categories: Extract the information posted in any category, in any combination, on Craigslist (including a specific group of 10 categories that I will supply).
2.) Cities: Extract the information from any and all current cities, in any combination, on Craigslist (including a specific grouping of all USA cities). There should be an easy way to add cities to the program as Craigslist adds them. All cities and the subcategories of each city shall be included. For example, Boston shall include gbs, nwb, bmw, nos, and sob.
3.) Time Frame: Extract the information for all posting dates, with the added ability to limit the results to 1, 2, 3, or 4 specific dates.
4.) Keyword Searches: Use keywords to limit the information that is returned. This must include the ability to search on a single word or on multiple words, matching whole words only. For example, a search on “excel” would not return “excellent”. A search on “Bike Mountain” would return only listings that contain both words, not listings that contain only “Bike” or only “Mountain”.
5.) Negative Keywords: Ability to use negative keywords to further limit the extracted information. Postings containing these words would not be retrieved.
6.) Database: The resulting information will be in Excel format.
7.) Scan for and purge entries with duplicate email addresses that may appear in the database.
8.) Email Addresses: The ability to select either the normal public email addresses or the “…. @[url removed, login to view]” email type.
9.) Extracted information: The information retrieved shall be:
a) Date the listing was posted
b) Email address
c) URL of posting
d) City of posting
e) Category of posting
10.) Source code must be provided.
11.) Ability to set up and delete the Excel spreadsheet used by the program.
12.) Show a running count of the postings extracted as the program records them, along with the city and category currently being processed.
13.) All postings without email addresses shall be skipped.
14.) Start and Stop buttons shall be provided.
15.) It must be possible to run multiple copies of the program simultaneously from different folders.
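
As a rough illustration of items 4, 5, 7, and 13, the filtering rules could be sketched in Python as follows. This is only a sketch under stated assumptions: the function names, the dictionary layout of a posting (`email` and `text` keys), the sample data, and the choice to ignore letter case are all hypothetical and are not part of the specification above.

```python
import re


def contains_word(text, word):
    """Return True if `word` occurs in `text` as a whole word.

    Assumption: matching ignores case. Lookarounds are used instead of
    \\b so that keywords with punctuation still match as whole tokens.
    """
    pattern = r"(?<!\w)" + re.escape(word) + r"(?!\w)"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None


def matches(text, keywords, negative_keywords=()):
    """All keywords must be present and no negative keyword may be present."""
    if any(contains_word(text, w) for w in negative_keywords):
        return False
    return all(contains_word(text, w) for w in keywords)


def filter_postings(postings, keywords, negative_keywords=()):
    """Keep matching postings, skipping missing and duplicate emails."""
    seen_emails = set()
    kept = []
    for posting in postings:
        email = (posting.get("email") or "").strip().lower()
        if not email:
            continue  # item 13: no email address, skip the posting
        if not matches(posting.get("text", ""), keywords, negative_keywords):
            continue  # items 4 and 5: keyword rules not satisfied
        if email in seen_emails:
            continue  # item 7: duplicate email address
        seen_emails.add(email)
        kept.append(posting)
    return kept


# Hypothetical sample data, for illustration only.
postings = [
    {"email": "a@example.com", "text": "Mountain bike, excellent condition"},
    {"email": "A@example.com", "text": "Another mountain bike"},
    {"email": "b@example.com", "text": "Road bike"},
    {"email": "", "text": "Mountain bike, no contact given"},
    {"email": "c@example.com", "text": "Broken mountain bike for parts"},
]
kept = filter_postings(postings, ["Bike", "Mountain"], ["broken"])
```

With this sample data, only the first posting is kept: the second repeats an email address already seen, the third lacks “Mountain”, the fourth has no email address, and the fifth contains the negative keyword.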