
Closed
Posted
Paid on delivery
Optimize High-Recall Indeed Job Scraper (Python, Multithreading) Description: I have a Python script that scrapes job listings from Indeed USA using ZIP codes as input. It’s built around the external library and reads ZIPs from an .xlsx file covering all US states. The scraping logic is solid—it fetches a high volume of jobs posted in the last 24 hours with excellent accuracy. I’ve tested it for California only: 8,000+ jobs fetched from 163 ZIP codes Took 6–7 hours to complete California has 1,200+ ZIPs, and I haven’t even touched other states yet What I Need: I’m looking for a Python expert to optimize the script for speed without compromising the current scraping logic. The goal is to make it scalable across all states and ZIP codes. Optimization Goals: Implement multithreading or multiprocessing to parallelize ZIP-level and/or page-level requests Maintain the current adaptive radius and job-type split logic Support proxy rotation if needed (I dont have proxies now) Preserve deduplication and error handling Ideally stream results to CSV shards or a single output file I will share the code, once we finalize
Project ID: 39745143
12 proposals
Remote project
Active 8 mos ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
12 freelancers are bidding on average ₹2,000 INR for this job

Hi there, I have reviewed your requirements and believe that my expertise aligns perfectly with them. It seems like something we can easily accomplish Please check my profile to learn about my expertise: https://www.freelancer.com/u/AliGhazanfar2 Looking forward to the opportunity to work together. Regards Ali
₹2,001 INR in 1 day
5.3
5.3

With 8 years of experience in Python development, I am confident that I am the best fit to optimize the High-Recall Indeed Job Scraper script. I have the relevant skills and expertise to enhance the speed of the script without compromising its accuracy. How I will be completing this project: - Implement multithreading or multiprocessing to parallelize ZIP-level and/or page-level requests - Maintain the current adaptive radius and job-type split logic - Support proxy rotation if needed - Preserve deduplication and error handling - Stream results to CSV shards or a single output file for efficiency What tech stack I will be following: - Python for scripting - Multithreading or multiprocessing for optimization - External libraries for proxy rotation if required Having worked on similar solutions in the past, I understand the importance of scalability and speed in job scraping. My goal is to ensure that the script can handle job listings from all states and ZIP codes efficiently. I am committed to delivering a high-quality, optimized solution that meets your requirements.
₹2,000 INR in 7 days
5.2
5.2

With 7 years of experience in Python development, I am the best fit to optimize the Indeed Job Scraper script. I have the relevant skills and expertise to meet your requirements efficiently. How I will be completing this project: - Implement multithreading or multiprocessing to parallelize ZIP-level and/or page-level requests - Maintain the current adaptive radius and job-type split logic - Support proxy rotation if needed - Preserve deduplication and error handling - Stream results to CSV shards or a single output file What tech stack I will be following: - Python - Multithreading - Pandas - Requests I have worked on similar solutions in the past, ensuring successful optimization. Trust me to deliver a scalable and efficient job scraper tailored to your needs. Let's discuss further details once we finalize.
₹2,000 INR in 7 days
4.0
4.0

I understand you already have a solid Indeed scraper that works correctly but needs optimization for speed and scalability. I can help optimize your Python script by: Implementing multithreading / multiprocessing for ZIP- and page-level parallelization Adding proxy rotation support for reliability (optional if needed later) Preserving your adaptive radius and job-type split logic Maintaining deduplication and error handling Streaming results efficiently into CSV shards or a single output file for large-scale runs I have strong experience in Python, data scraping, and performance optimization, so I’ll ensure the script scales seamlessly across all US ZIP codes without losing accuracy. Looking forward to collaborating and making your scraper production-ready. Best regards, Haseeb
₹2,000 INR in 3 days
2.0
2.0

Hi there! I know it is frustrating when your scraper works accurately but takes hours to finish, making scaling across all US ZIP codes slow and inefficient. I will optimize your Python scraper by implementing multithreading or multiprocessing to parallelize ZIP-level and page-level requests. I will keep your existing adaptive radius, job-type split, deduplication, and error handling intact. The results can be streamed efficiently to CSV files, and I can add optional proxy rotation for future scalability. This will make the scraper fast and ready for all states. Do you want me to start with parallelizing ZIP-level requests first to measure speed improvements? Start chat now and let us optimize your Indeed job scraper.
₹2,001 INR in 7 days
0.0
0.0

I TREAT EVERY PROJECT AS IF IT WERE MY OWN — WITH CARE, PRECISION AND ACCOUNTABILITY Looking for speed, reliability, and quality? That’s exactly what I bring. I understand the need to optimize the Indeed job scraper script for speed and accuracy across all US states. With expertise in Python, multithreading, and external library integration, I can implement parallel processing, adaptive radius adjustments, and error handling enhancements while ensuring seamless data output. While I am relatively new to freelancer, I have tons of experience with other projects off site. If you're looking for someone who treats your project like their own and delivers beyond expectations, I'd love to discuss further about your project! Regards, Praven
₹2,000 INR in 14 days
0.0
0.0

I’ll optimize your Indeed scraper for maximum speed with no loss of accuracy. Add multithreading/multiprocessing to parallelize ZIP/page scraping Future-ready proxy rotation and robust error handling Stream results to CSV to handle large datasets Preserve adaptive radius, job-type split, and deduplication logic You’ll get a scalable, production-ready scraper that processes all US ZIP codes much faster. Ready to start as soon as you share the code. I will be waiting for your response.
₹2,001 INR in 4 days
0.0
0.0

Hello, I’m an experienced Python developer and can optimize your Indeed job scraper for maximum speed without compromising accuracy. I will implement multithreading/multiprocessing for ZIP-level and page-level parallelization, maintain adaptive radius and job-type logic, ensure deduplication and error handling, and stream results efficiently to CSV. I have a full team ready, so I can deliver fast, reliable, and scalable results across all US ZIP codes. Ready to start immediately and ensure high-quality output. Best regards, Jamilur Rahman
₹2,000 INR in 1 day
0.0
0.0

I have the necessary skills for this project. It doesn't seem to be very complicated. I could have it ready in less than 7 days.
₹2,001 INR in 7 days
0.0
0.0

My skills and experience align perfectly with what you're looking for. Optimizing your Python job scraper script to fetch job listings faster across all US states is a challenge I'm ready to tackle head-on. While I am new to freelancer, I have tons of experience and have done other projects off site. I specialize in implementing multithreading for speed, maintaining current logic, and preserving error handling in Python scripts. I’d be happy to discuss your project in more detail! Regards, Marinus Botha
₹2,000 INR in 14 days
0.0
0.0

Hi, I can optimize your Indeed job scraper for speed, scalability, and reliability while keeping high recall intact. I’ll implement multithreading/multiprocessing to parallelize ZIP/page-level requests, add proxy rotation support, and preserve adaptive radius, job-type split, deduplication, and error handling. To handle scale, I’ll stream results into CSV shards or a single output. My approach: review and profile your script, apply concurrency optimizations, integrate scalable output, and validate nationwide across ZIPs. Tools may include Requests, aiohttp, concurrent.futures. Delivery in 4–6 days with full documentation and testing.
₹2,001 INR in 4 days
0.0
0.0

I will optimize your Indeed job scraping script for speed and scalability without affecting the current logic. I’ll implement multi-threading to make it run much faster, add proxy support for future use, and ensure clean, deduplicated CSV outputs. You’ll get a highly efficient and reliable solution ready for all U.S. states.
₹2,000 INR in 2 days
0.0
0.0

Mangalore, India
Payment method verified
Member since Sep 15, 2024
₹2000-2001 INR
₹600-601 INR
₹1500-1501 INR
₹7000-7001 INR
₹1500-1501 INR
$10-30 USD
$30-250 USD
₹37500-75000 INR
$250-750 AUD
$10-50 USD
₹12500-37500 INR
$30-250 USD
$30-250 USD
£20-250 GBP
$15-25 USD / hour
€8-30 EUR
min £36 GBP / hour
$30-250 USD
$30-250 USD
₹1500-12500 INR
₹12500-37500 INR
$30-250 USD
€30-250 EUR
$250-750 USD
$750-1500 AUD