
Ditutup
Disiarkan
Dibayar semasa penghantaran
Remember this should work offline environment I have a batch of English-language PDFs that were scanned as images. Each file contains a mix of cleanly typed passages and more challenging handwritten notes in the margins. I need every legible word pulled out with the highest accuracy you can achieve and stored directly in a MySQL database, not as flat files. Accuracy matters more than speed; feel free to combine engines such as Tesseract, Google Vision, or AWS Textract—whatever blend gives you the best recognition rate on both printed and cursive text. Pre-processing for skew, noise, and contrast is expected so the handwriting is captured as reliably as the typed sections. The database is already provisioned; I will share connection details and a simple schema suggestion (doc_id, page_no, original_block, extracted_text, confidence_score). If you would rather propose a better structure, I’m open to it as long as each text block can be traced back to its page and position. Deliverables • A script or small application (Python, Java, or PHP are all fine) that ingests each PDF, performs OCR, and inserts results into MySQL. • SQL dump or migration file that recreates any additional tables you introduce. • Brief read-me explaining setup, dependencies, and how to rerun the process on future files. • Sample run on three provided PDFs demonstrating the expected accuracy and table population. I’ll test by spot-checking handwritten lines and running keyword searches across the stored text. Payment releases once the sample set passes those checks and the code runs cleanly on my machine. Remember this should work offline environment
ID Projek: 40232431
23 cadangan
Projek jarak jauh
Aktif 28 hari yang lalu
Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan
23 pekerja bebas membida secara purata ₹1,833 INR untuk pekerjaan ini

Hello! I fully understand your requirement to extract legible words from scanned PDFs using high-accuracy OCR techniques, all while ensuring compatibility with an offline environment. My plan includes employing a combination of Tesseract and AWS Textract, along with necessary pre-processing to enhance both printed and handwritten text recognition. I will deliver a robust Python application that ingests each PDF, executes the OCR, and accurately populates your MySQL database according to your specified schema. Additionally, I’ll provide a SQL dump for any enhancements made, along with a comprehensive read-me file for easy setup and future use. Please check my profile for relevant work samples demonstrating my capability. Regards, Davide
₹1,165 INR dalam 1 hari
5.2
5.2

With your project requiring accurate and high-quality OCR work, paired with a deep understanding of MySQL management, I am confident that I am the freelancer you are looking for. I have extensive experience in handling backend tasks that require complex problem-solving skills. I've incorporated translated data from a variety of sources into MySQL in previous projects, including scanned images with handwriting, through utilizing tools like Tesseract and Google Vision to ensure the highest accuracy possible. Moreover, my proficiency in Laravel allows me to design and implement appropriate database structures to accommodate complex data while maintaining optimal efficiency. I can easily adapt your suggested schema or provide an alternative approach if necessary to ensure each text block can be traced back effortlessly. As demonstrated in my robust portfolio, I am highly skilled at creating scripts and small applications that address specific requirements. You can rely on me to deliver a straightforward, well-documented process along with a reliable SQL dump for future use. Let me handle the technical aspect of your project so you can focus on reviewing the results. Let’s connect and discuss how we can best bring your vision to fruition.
₹600 INR dalam 1 hari
4.9
4.9

Hi, I can write a simple Java/python application to solve this problem using AWS Textract which based on my experience is the best when extracting text from images , even handwritten text. Please let me know if we have a deal and I can ship a working script in less than 24h. Looking forward working with you, Ioan
₹10,000 INR dalam 7 hari
3.1
3.1

As an experienced full-stack developer and data analyst, I believe I possess unique skills that configure with your project exceptionally. Through leveraging these skills, I promise an offline-friendly script or small application for you that ingests each PDF and performs OCR to the highest level of accuracy you seek. Being an enthusiastic advocate of business and data intelligence, I have a knack to make your murky scanned PDFs shine in your MySQL database. My proficiency in Python will be invaluable for this project. I am well-versed with Python libraries such as Pandas to manipulate the data effectively enabling efficient integration with MySQL. Additionally, my expertise in using cloud technologies can ensure smooth yet secured migration of any additional tables. Dedicating a specific database schema tailored to your need where each text block can be traced back to its page and position is an assurance from my end. In summary, by combing excellent technical prowess, extensive experience in data analysis, and my deep dexterity in MySQL databases, I’m confident about producing a clean code on-time exceeding your expectations. Rest assured, when it comes to quality and precision, there will be no comprise on my end because what I do best is deliver accurately. Let's bring those heavily scribbled notes out of oblivion into a structured SQL environment. When would you like us to commence work?
₹1,050 INR dalam 7 hari
2.5
2.5

Hello, I have just read the job description carefully and I can do this job quickly. The core part of this job is to convert your scanned pdf to high resolution images, then cleaned using OpenCV. please share with me the pdf so I can start the work. looking forward to working with you.
₹1,000 INR dalam 3 hari
1.1
1.1

Hello, I’m Dinesh Kumar With 14+ years of experience across multiple platforms, I’ve helped build numerous startups through dedication and hard work. I’m committed to delivering high quality work that ensures 100% client satisfaction. Your success is my priority, and I focus on building long term relationships based on trust and excellence. Expertise: Web & App Development – React.js, Node.js, JavaScript, PHP, MySQL, WordPress, Magento, CodeIgniter, Shopify, .NET, Flutter, FoxPro Strong knowledge of frameworks, software design, and development methodologies Proven ability to deliver custom, scalable, and reliable solutions for diverse industries I work with clients globally, providing end to end solutions that meet unique project needs while maintaining the highest quality standards.
₹1,050 INR dalam 7 hari
0.9
0.9

Hello, I understand you require an offline-capable solution that extracts every legible word from image-based PDFs, preserves page/block traceability, and inserts structured results into your existing MySQL schema. I will build a Python-based OCR pipeline using Tesseract (offline-optimized) with advanced pre-processing (deskewing, noise reduction, contrast enhancement) to maximize handwritten and printed text accuracy. Each text block will be mapped to doc_id, page_no, position, extracted_text, and confidence_score, with an improved schema suggestion if needed. You’ll receive a clean, reusable script, SQL migration file, and a clear README for future batch runs, plus a verified sample run on your three PDFs. I focus on accuracy over speed and will ensure the solution runs smoothly on your machine before final delivery. Let’s start with the sample files and database schema so I can demonstrate precision from day one.
₹1,000 INR dalam 7 hari
0.4
0.4

Hello, I am a skilled Full Stack Developer specializing in OCR solutions and database management. Your project of extracting text from scanned PDFs and storing it accurately in a MySQL database aligns perfectly with my expertise. I propose utilizing a blend of OCR engines like Tesseract and Google Vision for optimal accuracy on both printed and handwritten text. I will ensure pre-processing for quality improvement and develop a script/application in Python, Java, or PHP to handle the OCR extraction and database insertion efficiently. I am open to discussing any adjustments to the database schema for better organization. Upon completion, I will provide a detailed read-me for easy setup and future use. I look forward to working on this challenging project and delivering high-quality results for you. Best regards, Gajanan P.
₹1,100 INR dalam 7 hari
0.0
0.0

I have read the explanation of your project in details. If you hire me for this project, you will have a chance to get to know another talent and trustworthy guy. I have a good idea to make this project better. I have a good solution to make this project better. Although you don’t choose me, I want you to meet another talent and experienced developer to complete this project perfectly. You’ll get to know another trustworthy and talent developer by choosing me and I will be honored to complete another task perfectly. In particular, I will be honored to make this project successful for a nice customer. I guarantee the qualification of my result and try hard to satisfy customers.
₹8,000 INR dalam 2 hari
0.0
0.0

With deep expertise in Artificial Intelligence and MySQL, I'm well-prepared to tackle your OCR project head-on. I understand that accuracy is paramount for this task, especially for processing handwritten notes, which often pose significant barriers to traditional OCR engines. My comprehensive skill set allows me to deploy a blend of the most sophisticated OCR engines like Tesseract, Google Vision, or AWS Textract to achieve the best recognition rate on both printed and cursive text. Moreover, my Python proficiency ideally suits the task at hand. I assure you that your unique schema requirements will be meticulously adhered to as I structure each text block with corresponding doc_id, page_no, original_block, extracted_text, and confidence_score. For any additional tables introduced during the process, I'll provide a SQL dump or migration file for easy and efficient control. In addition, I pride myself on building reliable systems designed for extensive use in offline environments. Working offline should not limit productivity or performance, hence my commitment to creating a script or small application that ingests your PDFs and performs OCR within the offline environment you require. With a milestone-oriented payment approach, your satisfaction with the accuracy of the data and clean code on your machine comes first. Let's be intentional about building properly - together!
₹1,050 INR dalam 7 hari
0.0
0.0

Bro i am ready to work 500 rupees per day because i want my first project, but this is a very simple task for me, being an Ai Engineer, these are my day to day tasks, I world convince you to get me this project All the deliverables will be delivered in 2 days with 500 per day you can contact me directly if you want we can exchange contacts
₹600 INR dalam 2 hari
0.0
0.0

Hi, I can convert your scanned PDFs into MySQL accurately and quickly. I have strong experience in OCR (Tesseract) and MySQL database insertion, ensuring your data is structured, indexed, and ready for use.
₹1,050 INR dalam 7 hari
0.0
0.0

I am Karan, a professional Full-Stack and Flutter Developer with experience in delivering scalable, secure, and performance-driven digital solutions. I specialize in web application development, cross-platform mobile applications for Android and iOS, and custom software solutions. My technical expertise includes AI-based systems, chatbot development, workflow automation, and seamless third-party integrations. I follow industry best practices, write clean and maintainable code, and focus on modern architecture and optimized performance. I work closely with clients to understand business requirements and deliver reliable, user-centric solutions with timely execution and long-term technical support.
₹600 INR dalam 2 hari
0.0
0.0

Hi, I analyzed the other proposals. Most freelancers are offering AWS Textract or Google Vision. Problem: Those require an Internet connection. Your project says "work offline environment". The others suggest Tesseract. Problem: Tesseract fails on handwritten notes (accuracy < 50%). My Solution: PaddleOCR I will implement a customized Python script using PaddleOCR (Deep Learning). Handwriting Accuracy: Beats Tesseract by 40%+. Matches Cloud quality. 100% Offline: Runs locally on your machine (No AWS/Google API required). MySQL Ready: I will map the coordinates (page, block) directly to your schema. Proof: Send the 3 sample PDFs. I will process them locally and send you the SQL dump. Best, Max
₹1,500 INR dalam 4 hari
0.0
0.0

I am experienced in deploying ML models for OCR and i can assure the product would be state-of-the art tech which would be reliable for years to come
₹1,050 INR dalam 3 hari
0.0
0.0

I have hands-on experience working on a live, data-driven web platform where I manage structured information, database updates, and data validation processes on a regular basis. My responsibilities include maintaining MySQL tables, ensuring data accuracy, preventing duplicates, and handling backend updates efficiently. Due to confidentiality agreements, I am unable to publicly disclose full project details, but I can discuss my technical responsibilities and workflow privately if required. This experience has strengthened my attention to detail, data integrity practices, and ability to work with sensitive information in a secure and professional manner while maintaining high accuracy standards.
₹1,000 INR dalam 7 hari
0.0
0.0

Leveraging my extensive expertise in Database Development, Management, and Programming, I can provide you with a unique approach to your project. My comprehensive understanding of MySQL combined with my proficiency in Python makes me an ideal fit for this job. My specialty in handling large-scale data and ensuring high-performance processing aligns seamlessly with your requirement for storing scanned PDF information into the database accurately. Having worked on diverse projects requiring a meticulous eye for detail and accuracy, I can carry out complex data extraction from handwritten lines to typed documents effectively. Using advanced techniques including those from Tesseract, Google Vision, and AWS Textract to improve recognition rates on printed and cursive text is well within my realm of skills. My experience with preprocessing noisy data will ensure extraction from challenging handwriting as reliably as typed sections.
₹1,000 INR dalam 31 hari
0.0
0.0

With 20 years of experience as a Full-Stack Developer and specialized expertise in offline OCR solutions, I offer a professional, high-accuracy tool for your PDF-to-MySQL pipeline. My Technical Approach: 100% Offline Standalone App: I will deliver a self-contained executable (.exe). This ensures the tool runs perfectly in your isolated environment without requiring Python installations or external API calls. All processing happens locally on your CPU/GPU. Hybrid OCR Strategy: To capture both clean print and handwritten marginalia, I will use a combination of Tesseract and EasyOCR. My custom pre-processing scripts (de-skewing and noise reduction) are designed to maximize recognition rates for challenging text. Secure Local Integration: The tool assumes a local MySQL architecture. It will handle the entire process: ingesting PDFs, performing OCR, and inserting structured data (including confidence scores) directly into your local database with zero data leakage. Reliable Delivery: I provide clean, documented code and a "plug-and-play" experience. While handwriting accuracy depends on original legibility, my dual-engine approach ensures the highest possible reliability. Deliverables: Standalone OCR-to-MySQL executable, SQL migrations, and documentation for a seamless hand-off. I am ready to process your sample files and prove the accuracy of this tailored offline solution.
₹1,500 INR dalam 2 hari
0.0
0.0

Hello, I can deliver a complete offline OCR-to-MySQL solution for your scanned PDFs, extracting both typed text and handwritten margin notes with maximum accuracy. My workflow includes OpenCV preprocessing (deskew, denoise, contrast) + Tesseract LSTM OCR, storing every text block directly into MySQL with page/block traceability and confidence scores. Deliverables ✅ Python script/app for PDF OCR ingestion ✅ MySQL schema/SQL dump (if needed) ✅ ReadMe + rerun instructions ✅ Sample run on 3 PDFs with verified table population Delivery within 7 days. Ready to start immediately once files and DB access are shared. Best regards, Arun Thangaiah
₹1,300 INR dalam 7 hari
0.0
0.0

I can build a fully offline OCR processing system optimized for both printed and handwritten English text. The solution will: - Extract high-resolution images from PDFs - Apply advanced preprocessing (adaptive thresholding, noise reduction, contrast enhancement) - Detect layout regions and isolate handwritten marginal notes Use a hybrid OCR strategy: - Tesseract LSTM for printed text - Deep learning handwriting recognition model (TrOCR or PaddleOCR) deployed locally - Insert structured, position-traceable results directly into MySQL along with confidence score The system will run entirely offline and can be rerun on future batches without modification.
₹1,500 INR dalam 10 hari
0.0
0.0

Belagavi, India
Ahli sejak Mei 23, 2019
₹12500-37500 INR
₹1500-12500 INR
₹1500-12500 INR
₹600-1500 INR
₹600-1500 INR
₹750-1250 INR / jam
£18-36 GBP / jam
£3000-5000 GBP
₹12500-37500 INR
$10-30 USD
₹12500-37500 INR
$250-750 USD
₹37500-75000 INR
$15-25 USD / jam
$250-750 USD
$20000-50000 USD
€80-150 EUR
€8-30 EUR
$6000-12000 HKD
$15-25 USD / jam
₹1500-12500 INR
$15-25 USD / jam
$30-250 USD
₹250000-500000 INR
$250-750 USD