
Closed
Posted
I have roughly ten thousand ISBN-13 codes and I need a production-ready Python pipeline that can take those codes, pull the corresponding book details from [login to view URL], [login to view URL], and a small set of external APIs, then push the cleaned results straight into a Google Sheets workbook. The pipeline must • survive Amazon’s throttling, bot checks, and page format changes without manual babysitting, • finish a full run on 10 k titles in a single session without crashing or silently skipping rows, and • give me fields that are already matched and normalised so downstream staff can link them to our catalogue instantly. Architecture is up to you: Scrapy, Playwright, headless Chrome, rotating residential proxies, Selenium, or a custom HTTP solution—whichever mix keeps the request footprint human-like and maximises up-time. What matters is that the codebase is clean, well-documented, and easy for an internal engineer to extend later. Deliverables 1. Fully annotated Python source (PEP 8 compliant) packaged so I can run it with one command. 2. A Google Sheets connector that inserts or updates rows atomically, preserving formulas already in place. 3. README with environment setup, proxy configuration, and step-by-step deployment instructions for macOS and Ubuntu. 4. Brief test report showing a run on at least 300 sample ISBNs, including elapsed time, success rate, and any retries triggered. Acceptance will be based on: • ≥ 98 % scrape success on the 300-item test set, • no Amazon “bot detected” blocks during that run, and • correctly populated Google Sheets in the agreed format. If you have proven experience scraping Amazon at scale and piping results into Google Sheets, I’m ready to review your plan and timeline.
Project ID: 40427630
10 proposals
Remote project
Active 9 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
10 freelancers are bidding on average ₹910 INR/hour for this job

With over a decade of experience in web technologies, I specialize in large-scale data scraping and pipeline orchestration. I am proficient in Python and API integration, ensuring efficient code that handles extensive tasks while maintaining data integrity. I have a strong track record of implementing robust architectures that bypass throttling and bot checks, delivering reliable results without manual intervention. My approach involves using a mix of Scrapy, Selenium, and rotating residential proxies to simulate human-like traffic patterns. Additionally, I am skilled in integrating Google Sheets, allowing for seamless row updates without disrupting formulas. I’m excited about the opportunity to collaborate on a bespoke scraper pipeline that connects book details to your catalog efficiently. Let’s bring this vision to life!
₹1,000 INR in 40 days
1.3
1.3

In my role at Mitraa Technology, I've had extensive experience working on projects just like yours, developing data pipelines and scraping solutions that can handle large volumes of data from various sources. Our team has specialized in Python development and I adhere strictly to the PEP8 coding standards, ensuring a clean, well-documented codebase that is easy to understand and extend for internal engineers. Specifically in terms of scraping Amazon and leveraging Google Sheets, we have a wealth of experience and ample tricks up our sleeve to overcome challenges such as throttling, bot checks, and format changes. We've successfully implemented headless Chrome, Selenium to mitigate bot detection and subsequently devised HTTP solution mix to maximize uptime while keeping requests human-like. In terms of project management my team works on agile methodologies ensuring clear communication and deliver products on-time within budget. I look forward to not only meeting your specs but exceeding your expectations with a successful run on your 10K-title database or more in a single session while preserving normalized data perfect for downstream staff linking. I meticulously test my codes which would be highlighted in the brief report giving you full visibility of the project’s success rate. Given our skillset and approach, I am confident we can achieve the desired acceptance criteria for you
₹850 INR in 40 days
0.0
0.0

As an experienced Full-Stack Digital Developer, I have ample experience in handling API Integration and Python, two key skills required for your project. I understand your need for a clean-coded and well-documented pipeline that can scrape Amazon at scale and push the data to Google Sheets with minimal babysitting. Not only can I deliver such a pipeline but also ensure it survives Amazon's throttling, bot checks, and page format changes. Over the years, I've built high-performance websites and scalable applications that can handle substantial data like your 10k ISBN codes without crashing or silently skipping rows. Moreover, my understanding of databases and scalable system architecture will enable an easy link between the scraped data and your catalogue for immediate productivity. My proficiency extends beyond the technicalities. Drawing from my background in design, motion graphics, video editing, VFX, and post-production, I understand the significance of visually engaging platforms. Combining my technical strength with creative advantage assures you a pipeline that is not only functional but also visually sleek and user-friendly. It would be a pleasure to lend my skills towards transforming your idea into a robust reality that drives results. Let's craft a powerful solution together!
₹750 INR in 40 days
0.0
0.0

With my 3+ years of experience as a backend-focused Python developer and my expertise in building scalable API-driven platforms, I believe I'm perfectly suited for this project. I'm not just familiar with Django/DRF, PostgreSQL, and async processing but also JWT/OAuth auth flows which will be vital in ensuring the smooth execution of this Python pipeline project. In the past, I've implemented secure authentication and containerized services with Docker, both of which are key skills needed for this project as they enhance security and stability. Additionally, I have worked extensively with Google Sheets so I can assure you that the Google Sheets integration component of this project will be done flawlessly. Lastly, I have a history of quickly ramping up on unfamiliar codebases which speaks to my ability to adapt and learn fast. Furthermore, being comfortable in cloud-based deployments and AWS-based infrastructure, you can rely on me to contribute effectively in a high-availability SaaS environment. Let's discuss your plan and timeline further to create an optimal solution empowering your downstream staff to instantaneously link book details to your catalogue. (Passing or remaining character count: 282 characters)
₹1,250 INR in 30 days
0.0
0.0

Hi, I can help with this Python, Web Scraping, Software Architecture, Data Mining project. I’ll first confirm the exact inputs, current workflow, and success criteria, then deliver a working, tested result with concise handoff notes. I prefer starting with a small verified sample or first milestone so you can confirm the direction before I complete the full scope.
₹750 INR in 7 days
0.0
0.0

Chhatrapati Sambhajinagar, India
Member since May 5, 2026
₹12500-37500 INR
₹12500-37500 INR
$30-250 USD
$30-250 AUD
$1500-3000 USD
₹12500-37500 INR
€750-1500 EUR
$250-750 USD
$1500-3000 CAD
₹1500-12500 INR
$10-30 USD
₹75000-150000 INR
$30-250 USD
$30-250 USD
₹12500-37500 INR
$250-750 USD
£2-5 GBP / hour
₹50000-75000 INR
₹12500-37500 INR
$30-250 USD
₹600-666 INR
$250-750 USD