The goal of this project is to deliver a python script capable of scraping the GRI website: [login to view URL] The script should be able to scrape all the details about all the orgs(15558).
For each org we need to have at least the following fields
-Stock listing code
---Downoad all the reports in a separate location
---The reports should be easily traceable back to the organisation
-Datetime of scraping
-Mandatory Technical requirements
-The script should be written in python
-Last step of the scripts, should be pandas dataframe(s) dropping the data to csvs.
-Outputs must always have the same numbers of columns (columns migbt be empty)
-The script should allow for stopping and restart from lastest completed scrape (via table of content for example).
-The script should allow for the usage of proxies
-Script should be commented
>>Important technical requirements<<
-Script should used some “stealth” techniques (sleep, user-agent switches, …)
-Api scraping is preferred
>>Nice to have technical requirements<<
-Possibility to do parallel sraping with different proxies
-1(or more) Python scripts
-Csvs containing the data for the scripts scrapped (for verification)
32 pekerja bebas membida secara purata €160 untuk pekerjaan ini
Hi, I am Python script developer with 10 years of experience. I can scrape required website by python script/bot with your instructions very short time. Can we discuss please? Thanks.
Hello, I can write python script to scrape all 15558 records from the GRI website with following all of your requirements, The delivery will be fully functional script and CSV files with all records. Thank you
Hello I am a python developer and have rich experience in python scraping tools I can scrape your site with proper proxies (I know of sort of free pool) and can provide data and scripts in limited time