Given csv file:
I would need a multi-threaded python script that goes through each csv line, grab the IP and port on each line, and scrape the TITLE of each webpage. (Each IP address with the port links to a website).
After it grabs the title, It would need to print the results in a new CSV file like this:
To prevent the code from running for hours we’ll need to setup a timeout, if a website doesn’t respond in say 12 seconds, print that ip and port to another file.
Also, for each IP, I’ll need to check both HTTP and HTTPS results. If HTTP doesn’t load a title or timeouts, check HTTPS. Vice Versa.
Please only bid if you are capable of completing the project fully.
If using selenium, you would need to use chrome / chromium as I'm running on this on a linux box (Kali or Ubuntu)
For use of chrome/Chromium you would need to use --ignore-certificate-errors tag.
22 pekerja bebas membida secara purata $51 untuk pekerjaan ini
Hi Sir, I can complete this project within few hours as I am expert in python scrapping via HTTP and Via headless and head full browsers. Please let me know if you are interested in ..
I have experience with scraper scripts, in past I have to migrate some data from 10 Gb excel .xls to Odoo (python framework) . I'll be glad to help with that Regards, Rafael Lima