
Ditutup
Disiarkan
Dibayar semasa penghantaran
I have an existing Python script whose sole purpose is data extraction: it turns PDF documents into CSV files. Right now the results are incomplete and the code is a little fragile. I need it tightened up so every piece of data in each PDF is captured and written to a clean, well-structured CSV. What I already have • A working—but imperfect—Python script • Sample PDFs that show the range of layouts the tool must handle • A sample CSV that illustrates the column order I expect What needs to improve • Reliable parsing across multiple pages and varied table structures • Accurate capture of every field, not just the obvious text blocks • Clear, readable code with comments so future tweaks are simple • A straightforward command-line call such as python [login to view URL] [login to view URL] [login to view URL] Useful libraries are entirely up to you—pdfplumber, PyPDF2, tabula-py, Camelot, pandas, or a combo—so long as the final script runs on standard Python 3 and requires only pip-installable packages. Deliverables 1. Updated script (single .py file or a small module) 2. [login to view URL] listing any external dependencies 3. One example CSV generated from my sample PDFs to prove full data coverage 4. Brief README with run instructions Acceptance Criteria • Running the script on my test PDFs produces a CSV that matches the source data exactly, column for column and row for row. • No hard-coded file paths; everything is parameterised. • Code executes without warnings or errors on Python 3.10 under Windows and Linux. Please keep the focus on robust extraction—the project’s primary goal—so I can drop new PDFs in and get accurate CSVs every time.
ID Projek: 40301625
10 cadangan
Projek jarak jauh
Aktif 1 bulan yang lalu
Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan
10 pekerja bebas membida secara purata ₹1,134 INR untuk pekerjaan ini

Hello I have several years of experience with Python coding and processing PDF files Also, I completed several similar projects recently. Could you share sample of PDF files to process? Thanks
₹1,047 INR dalam 1 hari
8.1
8.1

Hi, I can improve your existing Python PDF-to-CSV extraction script to ensure accurate parsing across multiple pages and varied table structures. I’ll make the code more robust, readable, and parameterized with proper error handling and CLI usage. Thanks Anshuman
₹1,200 INR dalam 2 hari
6.4
6.4

I have done a similar project a week ago. I am sure you will give me more projects after this. I am interested to do this project too and ready to complete this within the timeline. Kindly check my profile to see all rating and reviews given by clients. Hoping to hear from you soon. Payment after completion.
₹1,500 INR dalam 3 hari
0.0
0.0

I will transform your fragile PDF extractor into a resilient, production-ready pipeline. Instead of just "fixing" the current script, I will implement a schema-validated extraction process that ensures 100% data integrity for every CSV row. You will receive: 1. A robust, non-fragile Python script. 2. Complete extraction of all missing fields. 3. Zero-loss data conversion guarantee. Ready to deliver the final functional script within 24 hours.
₹1,000 INR dalam 1 hari
0.0
0.0

Matunga East, India
Ahli sejak Mac 2, 2023
€30-250 EUR
$30-250 USD
$250-750 USD
$3000-5000 USD
₹600-1000 INR
$250-750 USD
€250-750 EUR
₹1500-12500 INR
$750-1500 USD
$250-750 AUD
$25-50 USD / jam
£250-750 GBP
$15 USD
$250-750 USD
$30-250 CAD
₹750-1250 INR / jam
$10-30 USD
$10-30 AUD
$250-750 USD
₹1500-12500 INR