In Progress

A Node.js & PhantomJS scraper for Thomson Local, Yell and BT - repost

I am looking for someone to write a scraper for the UK directory sites Thomson Local, Yell and BT.

I would like this to be written using Node.js and PhantomJS and am looking for good code that I will be able to maintain myself.

I would like the scraper system to require only one input: the type of business.

I would like it to scrape the following: Name of business, Location (for use on Google Maps), Address, Telephone Number, Contact Email (if available), Website URL (if available), Description, Keywords (e.g. keywords from Yell or 'categories' from other sites), OtherData (put any other data in here that you think might be useful), Logo (image URL for the logo, if available), Screenshot of the website URL.
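The field list above could be captured as one uniform record per business, so that the Thomson Local, Yell and BT scrapers all return the same shape. The following is only a minimal sketch: the field names, the `toRecord` helper and the sample data are assumptions made for illustration, since the posting lists the data points but does not define a schema.

```javascript
// Hypothetical field names; the posting lists data points, not a schema.
const FIELDS = [
  'name', 'location', 'address', 'telephone', 'email', 'website',
  'description', 'keywords', 'otherData', 'logoUrl', 'screenshotPath',
];

// Build a uniform record from whatever a site-specific scraper extracted.
function toRecord(source, raw) {
  const record = { source };
  for (const field of FIELDS) {
    const value = raw[field];
    // Fields marked "if available" become null when missing or empty.
    record[field] = value === undefined || value === '' ? null : value;
  }
  // Keywords are always an array, so Yell keywords and other sites'
  // categories can be treated the same way downstream.
  record.keywords = Array.isArray(raw.keywords) ? raw.keywords : [];
  // Free-form bucket for anything else that looks useful.
  record.otherData = raw.otherData || {};
  return record;
}

// Made-up sample data, not taken from any real listing.
const example = toRecord('yell', {
  name: 'Example Plumbing Ltd',
  address: '1 Example Street, Edinburgh',
  telephone: '0131 000 0000',
  keywords: ['plumbers'],
});
console.log(example);
```

Keeping optional fields as explicit `null` values, rather than omitting them, would make it easier to spot which listings lack an email, website or logo when the results from the three sites are merged.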

As stated, I would like node.js to be used as this is what I am using for all my scraping projects. Please do not apply if you wish to use different technology.

Skills: node.js, Software Engineering, Web Scraping

About the Employer:
(13 reviews) Edinburgh, United Kingdom

Project ID: #4036287

Awarded to:

estliberitas

Hello, I'm not so familiar with PhantomJS, but I have experience in headless browsing with Zombie, so it won't be a problem to adapt to some-new-API with the same functionality. Also, ha-ha, right now I'm doing another site scra…

£200 GBP in 3 days
(3 Reviews)
4.0

2 freelancers are bidding an average of £225 for this job

mantislin

Hi sir, please check PM, thx Kimi.

£250 GBP in 5 days
(65 Reviews)
6.1