CraigsList "Free" section Scraper.

I need a script that will scrape [url removed, login to view] and save the information in an xml file. Only the United States "Free" section needs to be scraped and I need to ability to specify the US city (e.g. chicago), area (e.g. sox) and neighborhood (neighborhood should be optional since most cities don't offer this; see "sf bayarea", choose "sby", click "free" and notice the neighborhood dropdown). If a neighborhood is not specified, the script should parse the listings and pull the neighborhood listed next to the title (ignore postings without neighborhoods).

Here's the list of information that is needed from the scrape:

URL of Post

URL(s) of photos (if available)

Title of Post

"Reply to" Email Address

Post Description

Date Posted



This script can be coded in Python (preferred) or Ruby

Here are a couple of example URLs:

[url removed, login to view] (no neighborhood specified in URL)

[url removed, login to view];neighborhood=109 (neighborhood specified in the URL)

Kemahiran: Perl, Python, Ruby on Rails

Lihat lagi: craigslist org chicago, craigslist org boston, chicago craigslist, craigslist us, xml scrape, SF, scrape python, save the date, python perl, preferred free, parse email, neighborhood, list pull, email list pull, chicago, Boston, perl post url, python post xml, cities file, section script, parse search, python script parse xml file, xml quot, date scraper, quot email address quot

Tentang Majikan:
( 0 ulasan ) Milpitas, United States

ID Projek: #283681

3 pekerja bebas membida secara purata $140 untuk pekerjaan ini


Please check PMB. Thanks.

$120 USD dalam 3 hari
(73 Ulasan)

Hi, Please read my message.

$150 USD dalam 3 hari
(8 Ulasan)

Although, I am new to [login to view URL], but I am an experienced Ruby/Python developer and I can do this pretty quickly. Please message me back, if you are interested.

$150 USD dalam sehari
(0 Ulasan)