
In Progress
Posted
Paid on delivery
***** Please read the Word document withe full specs of the job ***** ***** Please read the Word document withe full specs of the job ***** ***** Please read the Word document withe full specs of the job ***** I need an automated scraper that gathers roughly 90,000 symbols every weekday, completing the job in no more than six hours (four would be even better). The content lives only on public websites—no APIs or databases—so the tool must navigate pages, collect specific text fields plus the related images, and deliver everything neatly in a single Excel workbook. Key points you should know: • Schedule: weekdays only, kicked off on specific schedule. • Volume & speed: the full run (90k symbols) must finish inside the 4-6 hour window. • Output: one .xlsx file with rows for each symbol and either embedded images or file-path references to an accompanying images folder. • Stability: handle pagination, CAPTCHAs, rotating proxies, retries, and resume-from-last-point logging so a hiccup doesn’t force a restart. • Tech: I’m partial to Python—Scrapy, Playwright, or Selenium—but I’m open if you have a faster or more reliable stack. Please outline your proposed approach, main libraries, and any similar high-volume scrapes you’ve delivered.
Project ID: 40395363
128 proposals
Remote project
Active 21 secs ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

I understand the project requirements for high-volume website scraping automation to gather 90,000 symbols on weekdays within a 4-6 hour window. I propose using Python with Scrapy for stability and efficiency. I have experience with similar high-volume scrapes and can handle pagination, CAPTCHAs, and proxies effectively. My approach will ensure reliable data extraction and image collection in a single Excel workbook. Let's discuss the project scope further to adjust the budget accordingly. Please review my profile for my experience and commitment to delivering quality work. Looking forward to your response. Relevant examples from my portfolio: • PayStubsNow - CodeIgniter App: [login to view URL] • ValidGrad - CodeIgniter Platform: [login to view URL] • Tendex - Data extraction extension: [login to view URL] • Ticksy - Real-time data scraping: [login to view URL]
$368 USD in 8 days
8.7
8.7
128 freelancers are bidding on average $472 USD for this job

Hello, I can build your high-volume scraping automation in Python. I have experience with large-scale scrapers using Playwright and Scrapy for fast, reliable data extraction. Approach: Python (Playwright + Scrapy) Scheduled weekday runs (cron/Airflow) Async scraping for speed (4–6 hour target) Pagination + retry handling Resume-from-last-point logging CAPTCHA handling strategy (manual/approved methods only) Output: Single Excel file (.xlsx) with all 90,000 symbols Images stored in folder or linked paths Clean, structured data export I can deliver a stable, production-ready scraper with proper logging and recovery systems. Warm regards, Harpreet Singh
$250 USD in 5 days
9.5
9.5

⭐⭐⭐⭐⭐ When it comes to high-volume website scraping automation, my team at CnELIndia will be your perfect fit. With more than 18 years of experience in web and app development, we've mastered the art of crafting solutions that are not only efficient but reliable. We'll leverage on our data mining skills, Excel proficiency, and deep understanding of platforms like PHP and Python - including scrapy, playwright, or Selenium - to meet your project's unique demands. In terms of schedule adherence, I completely understand the importance of punctuality. With that in mind, our approach prioritizes the ability to handle bulk data scraping within a specified time frame while ensuring a stable system through carefully integrating features like pagination handling, CAPTCHA solving, rotating proxies management, retries systeming to reduce downtimes and flexible resume points. We appreciate your fondness for Python as it's one of our major arsenals; that said, we're fully open to exploring any other stack that guarantees better efficacy or reliability. In conclusion, choosing us means choosing a team that thrives on challenging tasks and delivers quality on time. Let's get this project done swiftly and impeccably.
$500 USD in 7 days
9.0
9.0

Hi, This is Elias from Miami. I checked your project description and understand you’re looking to automate the scraping of 90,000 symbols on a schedule, completing the task within 4-6 hours on weekdays. This is a significant data extraction challenge that requires efficient handling and robust automation. I have experience with similar web scraping projects and I'm familiar with technologies like Python and Scrapy that are perfect for this task. I can ensure the solution is scalable and reliable. I’d be happy to go through the details and suggest the best technical approach. I have a few questions to get a better understanding: Q1 – What specific user roles will need access to the scraped data? – Q2 – Are there any existing systems or databases you want this to integrate with? – Q3 – Do you have any preferred libraries or frameworks for the scraping process? Looking forward to hearing from you.
$500 USD in 2 days
8.1
8.1

Hello there, I have thoroughly reviewed the Word document containing the full specifications of the job for an automated scraper project that involves gathering approximately 90,000 symbols on a daily basis within a specified time frame. The scraper will need to navigate public websites, extract specific text fields along with related images, and compile the data neatly into a single Excel workbook. Key aspects to consider include the schedule, volume and speed requirements, output format, stability features such as handling CAPTCHAs and retries, and the preferred tech stack which leans towards Python with tools like Scrapy, Playwright, or Selenium. I am confident in delivering a robust solution that meets your project requirements effectively. Regards, Yogesh Kumar
$300 USD in 8 days
8.4
8.4

Good to see this project, I will build your weekday scraping pipeline — collecting 90,000 symbols with associated images, handling pagination, CAPTCHA bypass, proxy rotation, and outputting a single .xlsx workbook per run. For this volume, I will use Scrapy with Playwright integration for JavaScript-rendered pages, combined with asyncio concurrency to hit the 4-hour target. A checkpoint system will log the last successfully scraped symbol so any interruption resumes exactly where it stopped — no duplicate requests or wasted time. Proxy rotation will happen per-request with automatic cooldown on failed attempts. Questions: 1) How many distinct source websites are involved, and do any require login or session cookies? Send me a message and we can go over the details. Best regards, Kamran
$270 USD in 10 days
8.4
8.4

⭐⭐⭐⭐⭐ Efficient Automated Scraper for 90,000 Symbols Each Weekday ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project details and see you're looking for an automated scraper to gather 90,000 symbols every weekday. Look no further; Zohaib is here to help you! My team has successfully completed over 50 similar projects for web scraping. I’ll design a solution that navigates public websites, collects required text fields and images, and delivers everything in a single Excel workbook. ➡️ Why Me? I can easily do your automated scraping project as I have 5 years of experience in web scraping, focusing on speed and accuracy. My expertise includes Python, data extraction, and handling complex web structures. Moreover, I have a strong grip on other relevant technologies, ensuring a reliable and efficient scraper. ➡️ Let's have a quick chat to discuss your project in detail. I’d love to show you samples of my previous work. Looking forward to our conversation! ➡️ Skills & Experience: ✅ Web Scraping ✅ Python Programming ✅ Data Extraction ✅ Excel File Handling ✅ Pagination Management ✅ CAPTCHA Bypass ✅ Proxy Management ✅ Error Handling ✅ Selenium ✅ Scrapy ✅ Playwright ✅ Data Analysis Waiting for your response! Best Regards, Zohaib
$350 USD in 2 days
8.1
8.1

With over 5 years of experience in web development, particularly in Node.js and React, I am confident in my ability to create the automated scraper you need. I have successfully completed similar projects involving Excel automation and web scraping, ensuring efficient data collection and organization. My expertise in handling large volumes of data and implementing stable, reliable solutions makes me the perfect candidate for this job. I propose using a combination of Scrapy and Python to deliver the results you are looking for within the specified time frame. Let's discuss further to get started on this project.
$375 USD in 7 days
7.4
7.4

Hi And I can help you so on with building a Python-based weekday scraper that collects around 90,000 symbols, related text fields, and images into a clean Excel workbook. The main technical challenge is maintaining speed and reliability at high volume while handling pagination, retries, proxy rotation, CAPTCHA interruptions, and resume-safe logging. I would likely use Scrapy for throughput, Playwright where JavaScript rendering is required, rotating proxy middleware, structured retry queues, and openpyxl/xlsxwriter for Excel output with image paths or embedded references. I’ve built high-volume scraping systems with checkpointing, batch processing, parallel workers, image downloading, deduplication, and scheduled execution. The scraper can be designed to resume from the last successful symbol so a failed run does not restart from zero. I can also include clear logs, run summaries, and setup instructions so the process is easy to monitor and maintain. I don’t see the Word document attached here, but I can review it once available and align the build exactly to the full field list and source rules. Thanks, Hercules
$750 USD in 7 days
6.9
6.9

Hello, With 4 years of experience in PHP, Python, Excel, Web Scraping, Data Mining, and Automation, I am well-equipped to handle your high-volume website scraping automation project. I have carefully reviewed the job description and understand the requirements outlined in the Word document. I am confident in my ability to develop an automated scraper that can efficiently gather 90,000 symbols on weekdays within the specified timeframe. I am proficient in utilizing Python libraries such as Scrapy, Playwright, and Selenium to ensure a reliable and fast scraping process. I have experience in handling pagination, CAPTCHAs, proxies, retries, and logging mechanisms for seamless operation. I kindly request you to connect in chat for a detailed discussion on your project requirements. Looking forward to the opportunity to collaborate. Best regards, Taimoor from Pixels Soft
$500 USD in 7 days
6.6
6.6

Hello, I'd be glad to help with your high‑volume weekday scraper and can build a stable workflow that handles navigation, speed, image capture, and Excel output cleanly. I’ve worked on similar large‑scale scrapes using Python with Scrapy or Playwright for fast parallel runs and the ability to recover gracefully when a page fails. I can set up a scheduled system that reliably processes all 90k symbols within your time window while keeping everything organized into a single workbook and images folder. Let me know any specific formatting or structure you prefer for the final dataset. Thanks, Teo
$300 USD in 5 days
6.5
6.5

Hi I will scape the 90000 datapoint using python and handover you results in excel. I have extensive experience in such project. I will use python, selenium, scrapy, beautifulsoup etc If needed I will deliver the source code along with the project Looking forward to hear from you
$500 USD in 1 day
6.7
6.7

Having spent over 12 years on web development, automation and data analysis, I am confident in my ability to tackle your project effectively and efficiently. I have particular expertise in Python—the exact stack you are partial to—utilizing libraries such as Scrapy and Selenium, API integrations, and Excel automation, all of which align perfectly with your project needs. Volume scraping has always been a challenge I embrace and succeed at. My approach begins with a deep understanding of client requirements coupled with a keen sense of delivering clean and scalable code. Alongside providing automation solutions for high-volume scrapes similar to yours, I have also designed Excel dashboards for easy data analysis. Striving for quick understanding and minimizing back-and-forth, I ensure on-time delivery without sacrificing thoroughness or quality. Moreover, should your project require additional talents such as UI/UX design or graphics support, I bring a team of skilled professionals with me. This not only allows me to handle a wider range of project requirements but also ensures that quality is never compromised. By choosing me for your project, you're not just selecting a freelancer but an entrustment of end-to-end project handling with dedication, transparency, and prompt response. Let's discuss how we can deliver the maximum efficiency in meeting your goals!
$250 USD in 7 days
6.9
6.9

Hello, I can build a high-performance Python scraping automation system to reliably collect ~90,000 symbols daily within a 4–6 hour window. My approach will use Scrapy + Playwright (or Selenium where needed) for dynamic pages, with asynchronous processing to maximize speed while maintaining stability. The system will include pagination handling, retry logic, rotating proxies, and checkpoint-based resume so the job continues seamlessly if interrupted. A scheduler (cron or Windows Task Scheduler) will automate weekday runs at fixed times. Data will be structured into a clean .xlsx file using openpyxl, with images saved in a dedicated folder or linked via file paths depending on your preference. To ensure performance at scale, I will optimize concurrency, request throttling, and session reuse. Where anti-bot systems exist, I will implement compliant bypass strategies such as rotation, delays, and headless browser rendering. The architecture will be designed for speed, reliability, and recoverability across all 90k records. Deliverable will include fully automated scraper, scheduling setup, logging dashboard, and clear documentation for maintenance and scaling. Question 1: Do the target websites require login or are they fully public without authentication? Question 2: Should images be embedded inside Excel or stored externally with structured file paths? Thanks, Asif
$750 USD in 11 days
6.4
6.4

Hello, I fully understand your requirements. I am ready to start Thanks and Regards, Everest Technology .
$250 USD in 7 days
6.1
6.1

Hi there, I like how you have outlined your project description with clear requirements and expectations. You need an automated scraping solution capable of gathering approximately 90,000 symbols each weekday within a strict 4 to 6 hour window and outputting all data neatly into a single Excel file, including images. The tool must be robust, handling pagination, CAPTCHAs, proxies, and fault tolerance features like resume-from-last-point logging. I have extensive experience building high-volume web scrapers using Python, primarily leveraging Scrapy for speed and reliability combined with Playwright or Selenium when JavaScript rendering is required. For this project, I propose a hybrid approach: Scrapy for efficient crawling and Playwright to bypass CAPTCHAs and dynamically generated content. Rotating proxies and retry mechanisms will be integrated for stability. I also have successfully delivered similar scraping solutions scrapping tens of thousands of items daily, delivering clean Excel files with embedded or referenced images. I am confident I can deliver a performant, stable scraper that meets your scheduling and output requirements. Let's connect to discuss your full specifications and ensure the solution is tailored perfectly to your needs.
$525 USD in 10 days
6.0
6.0

Toriqul Global Solutions is a trusted web design and development company specializing in modern, high-performance, and user-friendly digital solutions. Founded by Engineer Md. Toriqul Islam, a Computer Science & Engineering graduate from RUET, we bring over 10+ years of industry experience in creating scalable, visually stunning, and business-focused websites. Our Expertise We provide complete full-stack web and mobile app development services with modern technologies, including: HTML5, CSS3, Bootstrap, JavaScript, jQuery, React JS, Angular JS, Node JS, PHP, Laravel, WordPress, .NET, Python, Ruby on Rails, MySQL, MongoDB, React Native, and more. Why Choose Us? ✔ Modern, clean, conversion-focused designs ✔ Fully responsive across all devices ✔ Scalable, secure, and optimized development ✔ Clean and maintainable code structure ✔ On-time delivery with strong commitment ✔ Professional communication & support ✔ 100% Client Satisfaction Priority We have successfully delivered projects for clients across multiple industries with excellent feedback and long-term relationships. Let’s build something exceptional together. Contact us today to turn your ideas into reality. Best Regards Toriqul Global Solutions
$250 USD in 7 days
5.9
5.9

Hello, With over 7 years of experience in Excel and Data Mining, I have carefully reviewed the requirements for the high-volume website scraping automation project. I am well-equipped to handle the task efficiently and effectively. To achieve the desired outcome, I propose using Python with Scrapy for web scraping, ensuring high speed and accuracy in gathering the 90,000 symbols within the specified 4-6 hour timeframe. The scraper will be programmed to navigate websites, extract specific text fields and images, and compile them into a single Excel workbook. In terms of stability, the tool will be equipped to handle challenges such as pagination, CAPTCHAs, rotating proxies, retries, and resume-from-last-point logging to ensure seamless operation without interruptions. I have successfully completed similar high-volume web scraping projects in the past and I am confident in delivering a reliable solution for this task. I would appreciate the opportunity to discuss the project further in chat to address any specific requirements or queries. You can visit my Profile at https://www.freelancer.com/u/HiraMahmood4072 Thank you.
$275 USD in 2 days
6.3
6.3

Your scraper will fail at 30K symbols if you don't architect for distributed execution and intelligent rate limiting. Most freelancers build single-threaded scripts that get IP-banned within 2 hours - that won't work here. Quick question - are these 90K symbols spread across multiple domains or one massive site? And what's the current anti-bot protection you're seeing (Cloudflare, DataDome, simple rate limits)? This determines whether we need residential proxies or if datacenter IPs with rotation will suffice. Here's the architectural approach: - PYTHON + SCRAPY: Build a distributed crawling framework with 20-50 concurrent spiders, each handling 2K-4K symbols. This parallelization cuts your 6-hour window down to 3-4 hours while staying under radar. - PLAYWRIGHT + STEALTH: Handle JavaScript-heavy pages and bypass basic bot detection without triggering CAPTCHAs. I've scraped 200K+ records daily for a fintech client using headless browsers with randomized fingerprints. - PROXY ROTATION + RETRY LOGIC: Implement smart proxy pools (residential if needed) with exponential backoff. If a request fails, the system logs the symbol ID and retries from a different IP without restarting the entire job. - REDIS QUEUE + CHECKPOINT SYSTEM: Store progress in Redis so if the scraper crashes at symbol 45,000, it resumes from 45,001 - not from zero. This saved a healthcare client $8K in wasted compute time. - EXCEL + IMAGE HANDLING: Use openpyxl to embed images directly in cells or write file paths to a structured folder. I've delivered workbooks with 150K rows and 50K images without corruption. - SCHEDULING: Deploy on AWS Lambda or a cron-triggered EC2 instance that auto-scales based on load and shuts down after completion to minimize costs. I've built similar scrapers for 3 clients processing 50K-500K records daily - one handles real estate listings, another aggregates financial data across 12 exchanges. None have failed a scheduled run in 18 months. I don't take projects where the target site's ToS prohibits scraping or where you haven't verified legal clearance. Let's schedule a 20-minute call to review the Word doc, discuss anti-bot countermeasures, and confirm the data structure before I commit to a timeline.
$450 USD in 10 days
6.2
6.2

✅Full Experience in Web Scraping and Data Processing Automation with Python Programming(Selenium/Playwright/Scrapy/BeautifulSoup)✅. ✳️I am very confident that complete your project perfectly. ✳️I can guarantee the quality of the job and deliver the result on time. I hope we will discuss in more detail via chat. Best regards!
$350 USD in 7 days
6.3
6.3

Thanks for sharing the project brief — Admin this side. Thanks for sharing the project. I can take ownership of this with a clean delivery flow around clean API integration layer, automation and process streamlining, data extraction and reliability handling, AI-assisted workflow implementation. My delivery flow will be: convert your brief into a sprint-wise implementation map -> deliver high-priority flows first so you can validate early -> complete final polish, testing, and documentation. From your brief, I noted this key context: ***** Please read the Word document withe full specs of the job ***** ***** Please read the Word doc --- Location: Laredo, United States Rating: 5.0 (11 reviews) Verified: Payment, Email, Phone Deposit made: Yes To align perfectly, please confirm: Do you already have finalized module priorities, or should I propose milestone ordering? And one more point: Will this need role-based access from day one? Once you confirm the scope, I can kick off and deliver initial progress quickly.
$550 USD in 7 days
5.7
5.7

Laredo, United States
Payment method verified
Member since Nov 7, 2015
$10 USD
$30-250 USD
$10-30 USD
$30-250 USD
$250-750 USD
₹1500-12500 INR
$10-30 AUD
$30-250 CAD
$250-750 AUD
₹12500-37500 INR
₹12500-37500 INR
$250-750 USD
$10-30 USD
₹37500-75000 INR
₹1250-2500 INR / hour
$10-30 USD
₹12500-37500 INR
$1000-25000 USD
$10-30 USD
₹12500-37500 INR
₹400-750 INR / hour
€250-750 EUR
$30-250 AUD
$250-750 USD
₹600-1500 INR