Find Jobs
Hire Freelancers

Python ML Engineer for PDF Data Extraction

€250-750 EUR

Ditutup
Disiarkan sekitar 1 bulan yang lalu

€250-750 EUR

Dibayar semasa penghantaran
Dear developers, I am looking for a skilled python engineer that is able to create a ML engine which is capable to extract specific datapoints from the pdf files and save them as per set of rules to sql table. system has to run offline, so no online solutions will be accepted as they cannot be used for this project. current problems: 1. large part of documents are scanned, but data is digital. essentially form was printed and scanned upon completion. 2. scanned files are tilted. 3. scanned files have noise which require removal of it. 4. few scanned documents require quality enhancement. 5. some scanned documents have different zoom. 6. all files have checkboxes. 7. specific datapoints can change location within the file, so engine must search for it and find it. there is master list at front page that indicates coa checkboxes as to what data points will be in the file. 8. few datapoints will have multiple entries, that can be true and all instances have to be entered into SQL segregating them by unique identifiers. we also need dashboard, that allows training of OCR engine using ML such as llama. to repeat, system has to run completely offline. bonus task: integrated ML that is able to scan the documents per group, and combined with extracted data is able to answer questions about the text within such documents, per group, bot mixing up information with other groups.
ID Projek: 37985648

Tentang projek

116 cadangan
Projek jarak jauh
Aktif 1 bulan yang lalu

Ingin menjana wang?

Faedah membida di Freelancer

Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan
116 pekerja bebas membida secara purata €478 EUR untuk pekerjaan ini
Avatar Pengguna
Hi Good morning , I have read the brief details on your job listing . I see you have been looking for someone experienced with Django, Machine Learning (ML), MySQL, Software Architecture and Python. Its been 8 years since I have been working on freelancer.com, I have 9 years of experience doing similar jobs. I would request you to check my profile and review projects, feedbacks of projects related to those skills. Questions: 1. These are all the requirements of your job or do you have more? If yes, Please provide detailed requirements in chat and let me review and get back with queries. 2. Do you currently have anything done or this job has to be done from scratch? 3. What is the timeline to get this job done? 4. Are you open to use 3rd party APIS for it even if they are paid? Why Choose Me? 1. I have done more than 250 major projects only on freelancer.com. 2. I have not received a single bad feedback since last 5-6 years. 3. You will find 5 star feedback on last 100+ major projects which shows my clients are happy with my work. Portfolio: https://www.freelancer.com/u/AwaisChaudhry Timings: 9am - 9pm Eastern Time (I work as a full time freelancer) Please initiate the chat so we could discuss it in detail and we will continue from there. Thanks! Awais
€750 EUR dalam 12 hari
4.9 (114 ulasan)
8.6
8.6
Avatar Pengguna
Hi there I am confident in my ability to create the required ML engine for extracting specific datapoints from scanned PDF files and saving them to an SQL table according to set rules. I will address the challenges including tilted files, noise removal, quality enhancement, varying zoom levels, and changing locations of datapoints. I will incorporate OCR training using ML such as llama and ensure all operations run completely offline. Additionally, I am capable of integrating ML for document grouping and answering questions related to each group accurately. Let's create a powerful and efficient solution together. I can provide links to similar works from my portfolio. Please go through my profile its 15 years old see the work I did over the years. ---> No Win No Fee means that your satisfaction is my utmost priority. <---- Lets discuss the job details. Moreover, I am willing to start the job and perform tasks without even being hired; it is just to show my commitment to this project. Looking forward to hear from you. Regards Shah
€488 EUR dalam 7 hari
4.9 (100 ulasan)
7.7
7.7
Avatar Pengguna
Top 1% in Freelancer.com Hi, Greetings! ✅checked your project details: ✅Completed Time: In project deadline We have worked on 900 + Projects. I have 6 + years of the experience in same kind of projects. If you are looking for a true Freelancer, I am the Right person for you. I am available almost 24-7 and am very responsive. I feel proud that I am a trusted Freelancer who pleases almost every single client. You can rest assure, your work will be delivered well in advance of others, with passion and accuracy. I guarantee you instant communication & responses when you need me. Why choose me? I think every client is the reason for my success. I only take projects which I am sure I can do quickly. My Portfolio Items: https://www.freelancer.com/u/schoudhary1553 I would really like to work with you on this project. If interested, Kindly contact me via chat for further details and discussion. Thank you Sandeep
€300 EUR dalam 5 hari
4.9 (200 ulasan)
7.6
7.6
Avatar Pengguna
Hi, As an experienced and skilled Full Stack Developer, I believe my abilities in Python, Django, & MySQL make me the perfect fit for your project. Over the years, I have tackled a wide range of development challenges, some of which align perfectly with your specific problems. I have developed solutions for data extraction, handling scanned documents, and enhancing data quality while ensuring the system remains offline. Building a system that intelligently finds specific data points by analyzing master lists is just one among many challenges I have relished in recent projects. Moreover, I have successfully integrated Machine Learning (ML) into processes to enhance data extraction accuracy and automation. My knowledge in different ML models, including the use of llama, can provide your system with significant enhancements. I would not only focus on the solitary task of data extraction but offer an efficient dashboard solution for training your OCR engine; all while maintaining complete offline functionality. By partnering together on this project, we can create an end-to-end solution - from extracting specific data points from the tilted documents to answering questions about the extracted texts without mixing information between groups. Let's bring value to your organization by turning your digital dreams into a reality. Thanks & Regards
€500 EUR dalam 7 hari
4.7 (26 ulasan)
7.1
7.1
Avatar Pengguna
Hey Mate, I am a skilled Python engineer to build a powerful ML engine that extracts specific data points from PDF files and saves them to an SQL table. We can also help to handle scanned documents with digital data, noise removal, quality enhancement, and searching for variable data points. Let's discuss further! Please note that this is a placeholder proposal, we can be more specific once we get all the requirements and information required to execute the project.
€450 EUR dalam 5 hari
4.9 (56 ulasan)
7.1
7.1
Avatar Pengguna
Hi, I hope you are doing fine. I have almost 10 years of experience in machine learning algorithms. I can implement various types of artificial intelligence algorithms including yours with Matlab, Python and etc. I have PhD from Tohoku University and have several journal publications on the subjects. You can see portfolio for my previous projects. I read about your project and am interested in working with you. Please send me a message so that we can discuss more. Best regards.
€500 EUR dalam 7 hari
5.0 (46 ulasan)
6.8
6.8
Avatar Pengguna
With my extensive experience of over 8 years in Python development and data extraction projects, I am confident in my ability to meet the unique challenges posed by your PDF extraction project. I understand that a major part of your documents is scanned and the data is digital. This, in turn, necessitates the utilization of offline-based solutions which aligns well with my skill set. Due to working with various digital sources, I am well-equipped to handle the issues that you've mentioned involving tilted, noisy and inconsistently zoomed scans. Moreover, I have considerable proficiency in building ML engines and using OCR tools like llama for document processing tasks - vital skills for addressing checkbox identification, searching for data points and handling dynamically changing locations. Additionally, my expertise extends not only to managing extracted data but also storing them systematically into SQL tables with accurate segregation using unique identifiers for duplicates. Creating a user-friendly dashboard for training the ML engine is also well within my capabilities. As a diligent developer who prizes punctuality and quality in delivery, I am confident in your satisfaction with my work on this project. Rather than complicating things unnecessarily, I will focus on keeping the solution streamlined while maintaining its efficiency and effectiveness. Let's bring your vision to life together!
€500 EUR dalam 2 hari
4.9 (74 ulasan)
6.7
6.7
Avatar Pengguna
I can do it I have 5+ years of experience with Python, ML, DL and Artificial Intelligence. I can share my CV upon your request. Waiting for your message to discuss project further Burak - Owner of AImpact Analytics
€500 EUR dalam 7 hari
5.0 (30 ulasan)
6.6
6.6
Avatar Pengguna
Dear Hiring Manager, I am a seasoned Python ML Engineer with expertise in Django, Software Architecture, Machine Learning, and MySQL. I have a proven track record of successfully developing ML engines for data extraction tasks. I am confident in my ability to create a robust ML engine that can accurately extract specific data points from scanned PDF files and store them in an SQL table based on predefined rules. I understand the challenges posed by scanned documents such as tilting, noise, quality issues, and varying zoom levels. My experience in handling checkboxes, searching for data points, and dealing with multiple entries will ensure seamless extraction and organization of data. Additionally, I can develop a user-friendly dashboard for training the OCR engine using ML algorithms like llama. I am committed to delivering a high-quality solution that operates entirely offline, meeting all your project requirements effectively. Let's discuss how I can bring your vision to life.
€750 EUR dalam 7 hari
4.8 (28 ulasan)
6.5
6.5
Avatar Pengguna
Hello, As a seasoned full-stack web and mobile developer with expertise in Python and MySQL, I am confident that I have the skills required to tackle the challenges presented by your project. I have extensive experience in automating data extraction processes using powerful machine learning algorithms and feel comfortable handling all of the specific problems you've listed, from dealing with scanned files to removing noise and enhancing quality. Additionally, I am familiar with the LMAA ML library, which is an asset that can be deployed for training the OCR engine you need for your dashboard. One of my core strengths as a developer is my adaptability and resourcefulness--traits that are particularly crucial for building an offline system like you require. My knowledge spans multiple tech stacks, so not only will I be able to deliver a solution that aligns precisely with your needs, but I can also guarantee its reliability and performance. Finally, as an individual who has completed a wide range of projects for various clients, I am well-versed in ensuring adherence to instructions and executing tasks with meticulousness. Client satisfaction has always been paramount to me, which will continue to drive me as we collaborate on this project. Together, let's create an innovative offline PDF data extraction system that revolutionizes your workflow. Thanks
€500 EUR dalam 7 hari
5.0 (14 ulasan)
5.8
5.8
Avatar Pengguna
Hi , I'm sure that I can do this job. I'm artificial intelligence engineer experienced in Data Science, Machine/Deep Learning. I have accomplished many projects like yours, Also I will arrange with you to have a session to provide full illustration for you. Feel free to contact me for further details because I am looking forward working with you. Thanks
€500 EUR dalam 7 hari
4.9 (47 ulasan)
5.8
5.8
Avatar Pengguna
Hi, How are you? Very happy to bid on your project because my skills fit your project. I am a senior software engineer with 20 years of experience in Python, Java, C++, and C#. I am very familiar with developing ML engines for offline data extraction from PDF files and integration with SQL databases. If you award me, the project will be done perfectly. I will do my best to provide the results you are looking for. If you send the message, we can discuss the project more. Thanks.
€500 EUR dalam 7 hari
5.0 (25 ulasan)
5.5
5.5
Avatar Pengguna
the various steps included in this project will be <pre process of pdf <feauture extraction <training ML Model <offline deployment <integation <testing and optimisation
€650 EUR dalam 7 hari
4.9 (11 ulasan)
5.4
5.4
Avatar Pengguna
Hello, I have read through the project requirements and I am confident in my ability to create the ML engine for extracting specific datapoints from the PDF files and saving them to an SQL table as per the set of rules. With my wealth of experience in Python and machine learning, I am well-suited to tackle the challenges mentioned in the project. I have extensive experience in developing ML models for data extraction, working with scanned documents, and enhancing document quality. My expertise also includes working with OCR engines and implementing dashboard solutions for data visualization and model training. I have successfully completed similar projects in the past, which makes me confident in delivering high-quality results. I am dedicated to working hard and efficiently to complete the project within the specified timeframe. I am committed to ensuring that the ML engine meets all the requirements and functions effectively offline. Looking forward to the opportunity to work on this project. Best regards, Ioannis
€500 EUR dalam 7 hari
5.0 (6 ulasan)
5.4
5.4
Avatar Pengguna
Hello, Yes, we can help you with this project. I have few questions related to your project specification. Lets discuss over chat. Our specialty:- Angular, React, Nodejs, WordPress, PHP, Laravel, CodeIgniter, CakePHP, MySQL, Mongodb, PostgreSQL, HTML5, Bootstrap, JavaScript, etc... Why choose us: 1. Over 130 projects & 5 out of 5 star rating 2. No bad review yet 3. Most importantly 39% repeat hire rate [It's hard to find that kind of percentage here in freelancer] Thank you Sukanta
€500 EUR dalam 17 hari
5.0 (5 ulasan)
5.2
5.2
Avatar Pengguna
Hello Jonas, I will show you my recent projects related to PDF Data Extraction with accuracy using proper OCR, NLP and AI then we will move forward. So it's surety for you to get perfect solutions from my side. Also, if you want demo-type things or initial work for your project, then I will show you, and after that, we will finalize our project deal and payment milestones. I am from India, GMT +5:30, and I am available from 8:00 a.m. to 11:00 p.m. We have 16+ years of experience in software development. We have developed over 600 projects and research papers in the fields of machine learning, artificial intelligence, image processing (GIS), network, and SEO-based web and mobile apps. We have successfully completed the projects ChatGPT, Deep Learning, Computer Vision, Natural Language Processing (NLP), Encryption Decryption, Face Detection, UML Diagram, OCR, Big Data, Data Mining, Data Analysis, Statistics, Trading, Text, Image, Multiclass Classification Using Azure ML, Tensorflow, R Programming, OpenCV, Matlab, Hadoop, Artificial Intelligence Program Using PROLOG, Robotics Software, TCP-UDP Networking Project, Cloud Computing, etc. Note: The project has QA, testing, and comments in the code, so it's easy to understand the flow of the project.
€700 EUR dalam 7 hari
4.9 (17 ulasan)
5.3
5.3
Avatar Pengguna
Dear developers, I have a project that requires a skilled Python engineer to develop a machine learning engine for extracting specific data points from PDF files and storing them in a SQL table based on predefined rules. The system must operate offline, ruling out online solutions. Key challenges include handling scanned documents with tilt, noise, varying zoom levels, checkboxes, and changing data point locations. Bonus task involves integrating ML to scan and analyze text within document groups without mixing up information. Looking forward to discussing this exciting project further with interested candidates.
€250 EUR dalam 7 hari
5.0 (2 ulasan)
4.6
4.6
Avatar Pengguna
Hi there Ready to start work for now , i want to discuss about project as soon as posible . thankyou
€500 EUR dalam 7 hari
4.9 (24 ulasan)
4.9
4.9
Avatar Pengguna
Hi Jonas! As a developer with rich experiences in python, I excited to discuss this project details for clear understanding. Let's connect and get started. Alexandr
€300 EUR dalam 3 hari
5.0 (4 ulasan)
4.8
4.8
Avatar Pengguna
Hi... Nice to meet you. I am have full experiences in extraction numeric data from pdf or scanned image and convert this to csv file or txt file format using python automatically. In this project, we have to use OCR engine for extraction character and numerical data from scanned image. I am sure your project and i can deliver good result with high quality. I will wait your message to discuss project in more details. Thanks.
€300 EUR dalam 5 hari
4.7 (15 ulasan)
5.2
5.2

Tentang klien

Bendera LUXEMBOURG
K, Luxembourg
5.0
19
Kaedah pembayaran disahkan
Ahli sejak Okt 15, 2017

Pengesahan Klien

Terima kasih! Kami telah menghantar pautan melalui e-mel kepada anda untuk menuntut kredit percuma anda.
Sesuatu telah berlaku semasa menghantar e-mel anda. Sila cuba lagi.
Pengguna Berdaftar Jumlah Pekerjaan Disiarkan
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Memuatkan pratonton
Kebenaran diberikan untuk Geolocation.
Sesi log masuk anda telah luput dan telah dilog keluar. Sila log masuk sekali lagi.