I am looking for someone to write a scraper for the UK directory sites Thomson Local, Yell and BT.
I would like this to be written using Node.js and PhantomJS and am looking for good code that I will be able to maintain myself.
I would like the scraper system to only require 1 input: Type Of Business
I would like it to scrape the following: Name of business, Location (for using on Google Maps), Address, Telephone Number, Contact Email (if available), Website URL (if available), Description, Keywords (e.g. keywords from Yell or 'categories' from other sites OtherData - put other data in here that you think might be useful), Logo (image url for logo if available), screenshot of url
As stated, I would like node.js to be used as this is what I am using for all my scraping projects. Please do not apply if you wish to use different technology.