
Closed
Posted
I have a designated Windows folder that keeps filling up with new PDF reports. Every table or other structured block of data inside those PDFs must land in an Excel workbook automatically, without me pressing a button. Here is the core workflow I need built: • The moment a PDF is dropped into (or removed from) the monitored folder, the solution scans it, pulls every table or structured dataset, and appends the results to an ever-growing database area in Excel. No data should be overwritten. • Each row (or logical record) must include the source-file name and timestamp so I can trace anything back to its original PDF. • The Excel layout itself—columns, headers, any calculated fields, summaries, or pivot tables—will be finalised together once the extraction logic is proven. • The solution has to run quietly in the background. A macro-enabled workbook, a small Python script called from Power Query, or another lightweight approach is fine—as long as the update feels instantaneous and the workbook stays open-friendly for everyday users. Acceptance check: drop a fresh PDF with tables into the folder and watch Excel add the new rows within seconds, without affecting existing data. Let’s get this set up so the workbook becomes my live dashboard, not yet another manual task.
Project ID: 40444374
18 proposals
Remote project
Active 3 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
18 freelancers are bidding on average ₹969 INR/hour for this job

Hi I have gone through the requirements. Good experience in creating custom scripts. Will do this script using PHP/Python to update data from pdf to excel in background. I'm available to start now. Please contact for further discussions. Regards, Mohan
₹800 INR in 40 days
5.4
5.4

Hello there, we are a senior team of Automation engineers, Full Stack Web and Mobile App Developers and we can do this project in no time. Thanks Ashish Kumar.
₹1,000 INR in 40 days
5.4
5.4

Your PDF-to-Excel pipeline will fail the moment a report contains merged cells or nested tables - most extraction libraries treat these as plain text, which means your data lands in the wrong columns. This breaks traceability and forces manual cleanup, defeating the entire automation goal. Before architecting the solution, I need clarity on two things: What's the average file size and table complexity you're dealing with - are we talking simple 5-column invoices or multi-page financial statements with subtotals? And do these PDFs follow a consistent template, or does the structure vary wildly between reports? Here's the architectural approach: - PYTHON + WATCHDOG: Deploy a filesystem watcher that triggers extraction the instant a PDF hits the folder, using tabula-py for structured tables and pdfplumber as a fallback for edge cases like scanned documents. - OPENPYXL + XLWINGS: Write directly to Excel without opening the file, appending rows with atomic operations to prevent corruption if someone has the workbook open during an update. - DATA VALIDATION LAYER: Implement column-mapping logic that detects table structure variations and flags anomalies (missing headers, unexpected column counts) in a separate audit sheet instead of silently corrupting your dataset. - VBA REFRESH TRIGGER: Embed a lightweight macro that auto-refreshes pivot tables and calculated fields whenever new data arrives, so your dashboard updates without manual intervention. I've built similar real-time extraction systems for 4 clients processing 500+ PDFs daily, including one that handled inconsistent vendor invoices with 12 different layouts. Let's schedule a 15-minute call to review sample PDFs and confirm the table structures before I write a single line of code - I don't take on projects where the data format is a moving target.
₹900 INR in 30 days
5.7
5.7

As an experienced and results-oriented Python developer, I am confident in my ability to meet and exceed your needs for the Live PDF to Excel Extractor project. My expertise lies specifically in automation using Python, this includes foundational skills in data structures and file handling which are particularly important for manipulating and extracting data from PDFs. Additionally, my proficiency in industry-standard practices such as using Git for version control and writing clear documentation will ensure that your final solution is not only effective but maintainable. Moreover, I have applied my Python skills to various industries including S100D Document Processing, Defense and Aerospace, where accuracy and efficiency are paramount. I have used machine learning for fine-tuning projects as well as creating dashboards from live data using automation. This experience has honed my problem-solving abilities and equipped me with the knowledge needed to architect a solution that can handle the different file types you need (such as JSON, XML, CSV, etc) ensuring a flexible and robust implementation.
₹1,000 INR in 40 days
5.6
5.6

I read your project requirements and would be thrilled to collaborate with you. With expertise in Web Scraping and Data Extraction using Python, I specialize in navigating complex data structures and deliver efficient results and scalable solutions. Let’s connect to discuss further
₹1,000 INR in 40 days
4.0
4.0

My solution will use a combination of Python, Power Query, and VBA to create an automated PDF-to-Excel reporting workflow. The approach will continuously monitor a designated folder for new or updated PDF files and extract structured table-based data automatically. Extracted data, along with source metadata and processing logs, will be exported into a centralized CSV data pool (or alternatively stored within a lightweight and transparent SQLite database). A logging mechanism will also be included to capture processing issues, failed files, and extraction errors for traceability and support. A dedicated Excel Master Dashboard workbook will serve as the reporting layer, hosting the Power Query connections, formulas, summaries, and pivot tables. Power Query will consolidate and transform the extracted datasets into a live reporting structure, while VBA will support controlled refresh actions and user-friendly interaction within the workbook. The initial implementation will focus on digitally generated PDFs that do not require OCR processing, ensuring faster and more reliable extraction accuracy. If required, the solution can later be extended to support scanned or image-based PDFs through OCR-enabled processing. The refresh strategy will rely on: Automatic refresh when the workbook is opened A VBA-triggered refresh button for manual updates
₹1,200 INR in 44 days
3.5
3.5

@SahyadriTech #SahyadriTech Completed Projects: 1. Hotel Booking Management & Tracking System (Excel + VBA Script) 2. Appsheet: ERP system for textile business 3. MNGL Gas Incident Tracker & Dashboard (Google Sheets + Google Data Studio) 4. Daily Expenses Tracker (Google Sheets) 5. Option Scalping Strategy Automation (Excel VBA) 6. Nifty50 Live Option Chain Dashboard (Google Sheets) 7. Binary Trading Sheet (Google Sheets) 8. Customer Data Cleaning & Sorting Tool (Excel VBA & Python) 9. Historical Stock Closing Price Analysis for 2,600 Stocks (Excel & Python) 10. Power Bi dashboard for F1 Car racing insights 11. GOLD Loan tracking in Google sheet and Google Data studio 12. Local Taxi Tracking System (Google Sheets + Google Data Studio) 13. Appsheet Milk Drivers wages and attendance system (Appsheet + Google sheet ) Key Highlights: 1. Pay only if satisfied with the work 2. Expert in Power BI, Excel, VBA Macros, Google Sheets, Google Apps Script, and Python 3. Experience in 3 American MNCs 4. Skilled in Data Analytics, Automation, and Visualization 5. Proficient in Statistical Analysis 6. Offer Long-Term Support for all projects 7. Quick Delivery with multiple revisions I can deliver any project related to Data Analytics, Automation, and Reporting with precision and reliability.
₹1,000 INR in 40 days
3.1
3.1

Hello, Yes, I’m confident I can build this automated PDF-to-Excel workflow professionally. I have experience working with Excel automation, PDF data extraction, Power Query, Python-based workflows, and structured reporting systems designed to reduce manual processing completely. I can create a lightweight solution that continuously monitors your designated Windows folder, extracts tables and structured data from newly added PDFs, and appends the results into an organized Excel database without overwriting existing records. I will also ensure every imported row includes the source filename and timestamp for complete traceability and validation. The workflow can be implemented using a macro-enabled workbook, Python automation, Power Query integration, or another stable background approach depending on your preferred setup. My focus will be on building a reliable, easy-to-maintain solution that updates quickly while keeping the Excel workbook user-friendly for everyday use. I’m available to start immediately once the sample PDFs and workbook structure are shared, and I look forward to working with you.
₹750 INR in 40 days
3.2
3.2

✨ Hi, I can build the live PDF to Excel extraction system so every new PDF dropped into your Windows folder is automatically processed and added to your workbook without overwriting existing data. I have experience with Python, Excel automation, Power Query, VBA, folder monitoring, PDF table extraction, structured data parsing, timestamp logging, and background automation. My first step would be to test a few sample PDFs, confirm whether the tables are text based or scanned, then build the watcher script to extract tables, append rows, and include source file name plus processing time for traceability. I’ll keep the workbook easy for daily users, with clean database rows, headers, calculated fields, and optional summaries or pivot views once the extraction is stable. The final setup will run quietly in the background and update Excel whenever a PDF is added or removed. Best regards Ankit ✨
₹1,000 INR in 40 days
2.5
2.5

Hello! My suggestion would be small CRM/mini SaaS tool to do all the heavy loading work. Share your Excel File Sample. I can easily code this better.
₹1,000 INR in 40 days
1.7
1.7

I focus on delivering work that’s done properly, clear, polished, and aligned with exactly what you need. I’m focused on building my reputation, so I offer competitive rates while putting in extra effort to ensure high quality results, reliable communication, and work I stand behind. My extensive experience with Python and data management makes me an ideal fit for your project. I can easily build a solution that monitors your designated Windows folder, scans and extracts structured data from new PDFs, and appends the extracted data to an Excel workbook without overwriting any existing data. Each row in the workbook will also include the source file name and timestamp for easy traceability. Utilizing my skills in software architecture, I can create a lightweight solution like a small Python script called from Power Query that runs quietly in the background. This approach ensures instantaneous updates so your Excel workbook functions as a live dashboard. Additionally, my proficiency with Excel will ensure that we design columns, headers, calculated fields, summaries, or pivot tables according to your specific needs once the extraction logic is proven.
₹1,000 INR in 40 days
0.0
0.0

Your PDF folder can become unreliable quickly if new files are appended without source tracking, duplicate checks, and a clear extraction log. The key is not just pulling tables into Excel, but making sure every row can be traced back to the original PDF and timestamp. Before starting, I have two quick questions: Are the PDFs text-based reports, or are some scanned/image-based PDFs? Do the PDF tables follow a consistent layout, or do different report types use different structures? Execution plan: FOLDER MONITORING: Build a lightweight Windows-based workflow that watches the target folder and detects newly added PDF files. PDF TABLE EXTRACTION: Use Python-based extraction logic to pull structured tables from each PDF and prepare them for Excel import. APPEND-ONLY DATABASE: Add extracted rows to an Excel database area without overwriting existing records, including source file name and timestamp. DATA QUALITY CHECKS: Flag extraction errors, missing fields, duplicate files, or PDFs that need manual review. EXCEL REPORTING LAYER: Once extraction is proven, structure headers, calculated fields, summaries, and pivot/dashboard areas for daily use. I have experience with Python, Excel, ERP records, data cleaning, reporting, and operational data workflows. I can start by testing on a few sample PDFs first, then build the automated folder-to-Excel workflow based on the confirmed PDF format.
₹1,000 INR in 40 days
0.0
0.0

I’m Nayan, an accomplished data professional with a wealth of experience in exactly the kind of work you need. My skills with Excel, knowledge of Power Query and my ability to integrate macros can make your vision a reality. I have a strong foundation in dealing with technical tools for processing data; MATLAB proficiency will assist any complex processing. Having performed similar conversions from PDF to Excel at length, I can assure you that my work is not only rapid but accurate too; not one row will be overwritten or omitted unintentionally. Your need for sourced-file name and timestamp will also be met meticulously. Being aware of data sensitivity, I maintain strict confidentiality and keep your data secure. With me as your collaborator, the workbook becomes your efficient live dashboard rather than another manual task up your alley. My services focus on flexibility to adapt to specific project requirements like yours. Data entry, cleaning, research, conversion- you name it and it'll be done at the highest quality, on-time and with 100% accuracy. Need a reliable, IDOS_HOUR_16:57_DETECT_SENTIMENTIDOS_DAY_0JOB filler? Let’s bring our talents together and build a robust Live PDF to Excel Extractor that will remove that manual burden for good!
₹750 INR in 40 days
0.0
0.0

Hi - I can build this end-to-end. This is a straightforward Python automation project. My approach: 1. Folder Monitoring: Python watchdog library to detect new PDFs in real-time 2. PDF Table Extraction: pdfplumber (primary) + camelot-py (fallback) to extract every structured dataset 3. Excel Output: openpyxl to append data to an ever-growing workbook with source filename and timestamp 4. Runs as Windows Service: background service that starts on boot Deliverables: - Python script with one-click install - Config file for folder path, Excel output path - Handles corrupted PDFs, empty tables, locked Excel files - Logging for every extraction I can deliver within 3-4 days. Happy to test with a sample PDF first.
₹1,000 INR in 40 days
0.0
0.0

I'm a Python automation and QA engineer with 5+ years of experience. I specialize in: • Playwright/Selenium test automation • pytest frameworks and CI/CD integration • API testing and automation scripting • Clean, well-documented code delivery I've built similar QA/automation solutions and understand what quality testing requires. I can start immediately, deliver on time, and maintain clear communication throughout. Let me know if you'd like to discuss the specific requirements!
₹800 INR in 7 days
0.0
0.0

Leveraging my extensive 9+ years of accounting experience, I have mastered the art of data manipulation and analysis in Excel, which makes me the ideal candidate for your project. I'm well-versed in not only handling large volumes of data, but also extracting valuable insights from it – precisely the task you need doing. My expertise spans across preparing audit-ready financial data and ensuring compliance with regulatory requirements, which essentially aligns with your objectives. Importantly, my proficiency in Zoho Books, Tally, Sage and advanced Excel means I understand databases quite thoroughly - the backbone of your project. Databases require meticulous maintainance and management for any meaningful extraction of information. My experience in managing millions of rows of banking entries will surely be an asset to organize and process your data effectively. Most significantly, I believe the heart of every solution lies in understanding the unique capabilities and limitations that each tool presents. So let's weave your 'live dashboard' together by capitalising on my deep understanding of PowerQuery or macros in Excel, combining them with a lightweight Python script or any other solution that is silent yet high on performance. By working synergistically, we can create a seamless and efficient workflow that ensures timely extraction without jeopardising the existing data. Let's turn your onerous manual task into a backend process that runs like clockwork!
₹1,000 INR in 40 days
0.0
0.0

Hello, I have 5+ years of experience building Python automation systems, Excel-based workflows, real-time data extraction pipelines, and scalable document-processing solutions using Python, pandas, openpyxl, Power Query, and Windows automation workflows. I can help develop a live PDF-to-Excel extraction system that automatically monitors folders, extracts structured data from PDFs, appends records safely into Excel, and maintains traceable source-file metadata without overwriting existing data. My approach focuses on automation reliability, scalable data architecture, background processing efficiency, clean Excel integration, and maintainable workflows that remain user-friendly for daily operations. Before finalizing the implementation approach, I would prefer reviewing sample PDFs, expected table structures, Excel reporting requirements, and the preferred automation workflow to align the extraction logic and scalability properly. Best Regards, Kishan
₹1,000 INR in 40 days
0.0
0.0

Hyderabad, India
Member since May 13, 2026
₹12500-37500 INR
₹750-1250 INR / hour
$30-250 USD
£20-250 GBP
$10-30 USD
₹12500-37500 INR
$30-250 USD
$600-1200 USD
₹1500-12500 INR
₹1500-12500 INR
₹12500-37500 INR
$3000-5000 AUD
$30-250 USD
₹1500-12500 INR
₹100-400 INR / hour
£250-750 GBP
min £36 GBP / hour
€30-250 EUR
$250-750 USD
₹600-1500 INR