For this exciting project you will be scraping a large content website.
Our target price for this project is $100.
We will give you the address of a website containing many content pages
For all of the normal content/article pages, you will need to:
1) Scrape the content
2) Result should be presented as a UTF-8 CSV file
3) Parse content and save the following fields: title, body, category
4) Remove specific string patterns that we define.
The resulting content must be free of any images and html tags, but must maintain spaces and paragraph indicator.
We are looking to complete this project quickly – 5 days from start.
We will ask you to show us a few scraped records from our site before we accept you to do the work,
Please use the phrase 'super-scraper' in your response, so we know you have read this description.
We expect to have additional work like this.
29 pekerja bebas membida secara purata $91 untuk pekerjaan ini
'super-scraper'.. Easy Task but need to be done quickly & carefully according to your instruction..I can work 10-14 hours per day for 5 days..Please check pm for details....Thanks
super-scraper, I hope can help you to deliver the articles in csv. Need to check the website structure and how large it is first. Please provide the website name. Thanks in advance.