
Ditutup
Disiarkan
Dibayar semasa penghantaran
The project is to build an automated pipeline that ingests batches of academic articles delivered as DOCX and outputs publication-ready EPUB3 files. The workflow must handle advanced formatting requirements—complex layouts, live internal navigation, clean styling, and support for equations, figures, tables, and a fully generated table of contents. Core workflow • Parse incoming DOCX normalise tags, and map semantic elements to EPUB3 structures. • Transform mathematical expressions into MathML (or high-resolution SVG fallback) so equations render crisply on all major readers. • Place figures, captions, and call-outs accurately with responsive behaviour. • Auto-build a hierarchical TOC that syncs with heading levels and in-text references. • Embed interactive footnotes and hyperlinks; keep architecture open so additional widgets (audio, quizzes, etc.) can be slotted in later without refactoring. • Package, compress, and validate every file with EPUBCheck before delivery. Implementation guidance Python, Node.js, or another scripting language is fine as long as the code is clean, documented, and callable from the command line so it can be wired into our CI environment. Leveraging tools such as Pandoc, Calibre, or custom XSLT is welcome, provided the final output meets EPUB3 spec and passes automated tests. Acceptance criteria 1. Given a sample batch of 50 articles, the tool produces 50 compliant EPUB3 files within one run. 2. All equations render via MathML (or agreed fallback) and survive EPUBCheck without warnings. 3. Figures, tables, and hyperlinks maintain correct placement and functionality on Apple Books, Kindle Previewer, and Thorium Reader. 4. Navigation (TOC, landmarks, page-list) is fully functional and reflects document hierarchy. 5. Script, config files, and brief setup guide are supplied so the pipeline can be reproduced on a fresh machine. Ship the finished scripts, a README, and the test EPUBs; once they pass the criteria above we are good to move to larger volumes.
ID Projek: 40329141
34 cadangan
Projek jarak jauh
Aktif 15 hari yang lalu
Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan
34 pekerja bebas membida secara purata ₹22,577 INR untuk pekerjaan ini

Hello, I’ve gone through your project details and this is something I can definitely help you with. With over 10 years of experience in software development, I specialize in building automated pipelines and transforming data formats. The project you described for an automated academic EPUB converter aligns perfectly with my expertise in Python and Node.js. I understand the importance of handling complex formatting, equations, and creating a clean, navigable output. My approach would focus on ensuring that all mathematical expressions convert seamlessly into MathML or SVG for cross-reader compatibility, while also maintaining the integrity of figures and tables. I will adhere to best practices in coding and documentation, ensuring that the workflow is efficient and reproducible. After developing the pipeline, I'll validate each EPUB file against EPUBCheck to meet your quality requirements. Here is my portfolio: https://www.freelancer.in/u/ixorawebmob I’m interested in your project and would love to discuss specifics to ensure we align perfectly. Could you clarify: 1. Are there any specific tools or libraries you prefer for the conversion process? Let’s discuss over chat! Regards, Arpit Jaiswal
₹27,750 INR dalam 1 hari
6.4
6.4

Hi, As per my understanding: You need an automated pipeline that converts batches of DOCX academic articles into fully compliant EPUB3 files with precise formatting, MathML equations, responsive figures/tables, and accurate navigation (TOC, links, footnotes). The system must be scalable, CI-compatible, and produce validated outputs across major readers. Implementation approach: I will build a CLI-based pipeline (Python preferred) using Pandoc for base conversion, enhanced with custom filters/XSLT for semantic mapping, layout control, and EPUB3 structuring. Math expressions will be converted to MathML with SVG fallback where required. I’ll implement parsing logic to normalize DOCX styles into consistent tags, ensuring correct placement of figures, tables, and captions. TOC and navigation landmarks will be auto-generated from heading hierarchy. EPUB packaging will include validation via EPUBCheck. The system will be modular, documented, and easily integrable into CI workflows. A few quick questions: 1. Do your DOCX files follow a consistent style/template? 2. Any preference between strict MathML vs hybrid fallback strategy? 3. Should we include support for Kindle-specific formatting constraints?
₹12,500 INR dalam 7 hari
5.6
5.6

Hi, I will build your automated EPUB3 pipeline — XML/HTML ingestion, MathML equation rendering with SVG fallback, responsive figure/table placement, hierarchical TOC generation, and EPUBCheck validation — all callable from the CLI for CI integration. I will structure the parser with a plugin architecture so footnotes, audio, and quizzes slot in as independent modules without touching the core transform logic. Questions: 1) Do the source XML files follow a consistent DTD/schema, or do formats vary across publishers? 2) Is Pandoc acceptable as a base, or do you prefer a fully custom XSLT approach? Send me a message and we can go over the details. Best regards, Faizan
₹24,746 INR dalam 10 hari
5.4
5.4

Leveraging my five-year experience in full stack web and mobile app development, your project's unique needs align perfectly with my skill set, particularly when it comes to HTML and Node.js. I've made a commendable progress in the areas of parsing, mapping and transforming data- that is fundamental to the core workflow you requested. My fluency in working with various languages, frameworks and APIs will enable me replicate a user-friendly academic ecosystem fitting your specification. My strong background in API integration will also contribute to embedding new widgets such as audio or quizzes without much disheveling. I excel at creating clean optimized and maintainable codes - an essential for a complex project like this. Beyond coding, I bring an understanding that end-to-end development support and on-time delivery with a clear communication are integral to connecting all segments of the pipeline effectively. Together, we can create an automated workflow which ingests your academic articles seamlessly and translates them into publication-ready EPUB3 files without any concerns
₹25,000 INR dalam 7 hari
5.3
5.3

Hello, Having reviewed the requirements of your project, I believe I possess expertise and experience that make me an ideal fit for this job. With a long-standing proficiency in technical writing, I can assure you that all specified elements of the documents will be aptly translated into EPUB3 structures with suitable formatting. Moreover, my strong grasp on various scripting languages such as PHP, Python, and Node.js, along with my familiarity with Pandoc, Calibre, and other relevant tools would enable me to provide a thoroughly documented code callable straight from the CLI for seamless integration into your CI environment. Meanwhile, my previous encounters with EPUB3 validation using EPUBCheck ensures the produced files will successfully pass your automated tests. You'll find my attention to detail valuable as every equation will render correctly via MathML or a suitable fallback mechanism. Figures, tables, and hyperlinks, crucial to navigating well in an article will be appropriately placed and retain their proper functionality across different reading platforms (Kindle Previewer, Thorium Reader etc.). Best regards BHarti
₹22,000 INR dalam 3 hari
5.1
5.1

I build automation pipelines in Python regularly, so this is a natural fit. For academic PDF/XML-to-EPUB conversion I'd use a combination of pandoc, ebooklib, and lxml — the pipeline would handle batch ingestion, structure parsing (sections, references, figures), and produce EPUB 3-compliant output with proper metadata. A few questions before I quote precisely: are the source articles PDFs, XMLs (JATS/NLM?), or both? And do you need figures/tables preserved, or is text-only acceptable? Based on what you've described — batch ingestion, automated conversion, structured output — I'd estimate 3–5 days depending on source format complexity. I've worked on document processing pipelines before and can deliver clean, maintainable code with basic test coverage. Happy to share a small proof-of-concept on a sample article before we lock scope if that helps.
₹15,000 INR dalam 5 hari
3.9
3.9

Delivering publication-ready EPUB3 files from complex academic XML and HTML content requires a meticulous approach to ensure fidelity in formatting, navigation, and rendering across diverse reading platforms. This project’s focus on advanced elements like MathML-rendered equations, responsive figures, and an interactive, hierarchical table of contents aligns perfectly with the critical need for precision in academic publishing workflows. Understanding the nuances of semantic tagging and EPUB3 specifications is essential to achieving a seamless conversion that preserves the integrity of the original scholarly articles. The pipeline will be engineered using Python or Node.js, leveraging robust parsing libraries and transformation tools such as custom XSLT and Pandoc to normalize and map source elements accurately into EPUB3 structures. Equations will be converted into MathML with a high-resolution SVG fallback to guarantee crisp rendering on all major EPUB readers. The workflow will incorporate automated EPUBCheck validation to ensure compliance, while the modular architecture will support embedding interactive components without disrupting the core pipeline. Command-line operability will enable smooth integration into your CI environment, facilitating batch processing and reproducibility. Commitment to delivering a fully tested, documented, and maintainable solution is paramount. The final submission will include all scripts, configuration files, and a concise setup guide to enable straightforward deployment on fresh systems. Sample EPUB outputs will be provided for your review against acceptance criteria, ensuring functionality across Apple Books, Kindle Previewer, and Thorium Reader. I look forward to discussing how this pipeline can streamline your academic publishing process with precision and scalability.
₹33,750 INR dalam 7 hari
3.1
3.1

If you're seeking a highly skilled and experienced developer who can transform your academic articles into polished, publication-ready EPUB3 files, then look no further. With my extensive background in SaaS development and proficiency in Python, Node.js, and HTML, I'm confident in creating the automated pipeline you need. The technologies such as Pandoc, Calibre, or custom XSLT would be well within my wheelhouse and I'll ensure that the final output is not only compliant with the EPUB3 spec but also passes all crucial automated tests including EPUBCheck. One of the major strengths I bring to the table is my capability to understand complex requirements and deliver solutions accordingly. For this project, I will not only parse and map semantic elements to EPUB3 structures but also expertly handle formatting specifics like layouts, equations, figures, tables, footnotes, hyperlinks and more. I'm committed to building an open architecture that can accommodate additional widgets without unnecessary refactoring. Furthermore, my focus on writing clean, documented code callable from the command line makes my solutions easily integrated with your existing CI environment. Rest assured that I will provide you not just with finished scripts but also with a detailed README file and setup guide to make sure your pipeline can be efficiently replicated on a fresh machine. Let's get started and exceed your acceptance criteria on this project together!
₹12,599 INR dalam 7 hari
2.9
2.9

Hi, I can build a clean, automated pipeline to convert your XML/HTML articles into fully compliant EPUB3 files with proper formatting and validation. Solution Approach: Python-based pipeline using lxml + XSLT + Pandoc XML/HTML normalization → semantic mapping to EPUB3 structure Math handling via MathML conversion (with SVG fallback for compatibility) Figures, tables, and captions preserved with responsive styling Auto-generated TOC, landmarks, and navigation files EPUB packaging + validation using EPUBCheck Key Features: Batch processing (50+ articles/run) Clean CSS for consistent rendering across Apple Books, Kindle, Thorium Modular design (easy to extend for audio/widgets later) CLI-based execution (CI/CD ready) Reliability: I’ll include validation checks, error logs, and format consistency tests Cross-reader testing to ensure layout + navigation accuracy You’ll get fully working scripts, sample EPUBs, and a clear setup guide. Let’s build this pipeline properly from day one. Shubham Sharma
₹20,000 INR dalam 15 hari
2.4
2.4

As someone with 20 years of experience in web development, I've honed my skills in HTML, Python, and XML that make me well-equipped to meet the demands of your project head-on. Throughout my career, I've successfully executed numerous projects requiring advanced formatting and precise styling, both of which are crucial to ensuring that the conversion process is publication-ready. My deep understanding of semantic elements and document structure will aid in normalizing tags and mapping them flawlessly to EPUB3 structures. Moreover, my technical prowess extends to manipulating equations, figures, and tables - important elements in academic documents. I have extensive experience working with tools such as Pandoc and Calibre, which are recommended for this project. This familiarity would enable me to ensure clean equations rendering consistently throughout diverse readers and maintain the correct placement of figures, tables, and hyperlinks on Apple Books, Kindle Previewer, Thorium Reader. Another essential aspect of this project is the adherence to the EPUB3 specification. Over my career span, I have built an exceptional eye-for-detail approach. As a result, all my outputs pass stringent validations effortlessly. Furthermore, I can guarantee not only the production of 50 compliant EPUB3 files from 50 input articles within a single run but also a functional Ebook that reflects document hierarchy with synchronized TOCs and working navigation.
₹25,000 INR dalam 7 hari
2.0
2.0

Hello, I can build a production-ready XML/HTML to EPUB3 pipeline that meets your formatting and validation requirements. Approach: Use Pandoc + custom filters/XSLT for robust semantic mapping to EPUB3 Normalize input → map headings, figures, tables, references cleanly Convert equations to MathML with SVG fallback where needed Generate fully synced TOC, landmarks, and navigation Ensure responsive placement of figures, captions, and tables Package + validate using EPUBCheck Pipeline design: CLI-based Python for CI integration Modular stages, parse to transform to enrich to package to validate Extensible for future widgets Deliverables: Clean, documented scripts + config README setup + run instructions Test batch of 50 EPUB3 files passing validation I focus on accuracy, standards compliance, and scalability ready for larger volumes. And ready to start immediately.
₹28,000 INR dalam 4 hari
1.6
1.6

Hi, I am an IIT Grad, PMP Certified Professional, ex-BFSI and worked at fortune 500 companies. I will make it a reality for you. As a Freelance EPUB Converter Specialist, I can build an automated pipeline using Python, XSLT, and EPUB3 tools like Calibre and PyPDF2 to parse XML/HTML, transform mathematical expressions, and generate a publication-ready EPUB3 file with advanced formatting and features. Kindly click on the chat button so we can discuss and get started. Will share you my prior projects done and my resume too. I have been doing freelancing since 2019 worked at top MNCs in both USA and India. Lets connect
₹12,500 INR dalam 7 hari
1.6
1.6

Hello there, I’ve carefully reviewed your project details and fully understand your requirements. I’m confident that I can deliver high-quality results that meet your expectations within the given timeframe. I’d be happy to discuss your project further and get started right away. Best regards, Thanks
₹12,500 INR dalam 7 hari
0.7
0.7

Hello, I have experience building document-processing pipelines and EPUB generation workflows. I can create a CLI-based pipeline (Python/Node) that parses XML/HTML, maps to EPUB3, and handles MathML, figures, tables, TOC, and navigation correctly. ✔ XML/HTML → EPUB3 structured conversion ✔ MathML (with SVG fallback) ✔ Clean styling + responsive layout ✔ Auto TOC + links/footnotes ✔ EPUBCheck validation I can also integrate tools like Pandoc/XSLT where needed for accuracy and speed. You’ll get scripts, config, README, and test EPUBs ready for CI integration. Ready to start.
₹12,500 INR dalam 2 hari
0.0
0.0

Hello, I am interested in working on this project. I can help build a script-based workflow to convert XML / HTML academic articles into EPUB3 files with support for key formatting and navigation requirements. I understand the project involves handling: structured XML / HTML parsing equations figures and captions tables hyperlinks and footnotes TOC / navigation generation EPUB3 packaging and validation I would approach this using Python and XML/HTML processing tools, with a focus on creating a clean and reproducible command-line workflow that can process multiple files in one run. Deliverables: Conversion script / pipeline Batch processing support EPUB3 output generation Validation-ready structure README / setup guide Test outputs I am confident I can build this in a structured way and follow your listed requirements carefully.
₹27,778 INR dalam 15 hari
0.0
0.0

Hi! Python automation specialist. I read your project, I can help. Fast, reliable, clear communication. What is your timeline? Available to start immediately!
₹12,500 INR dalam 5 hari
0.0
0.0

Hi, I specialize in Python document processing pipelines and can build exactly this XML/HTML to EPUB3 converter. My approach: 1) Parser: lxml/BeautifulSoup to ingest XML/HTML, normalize tags, and map to semantic EPUB structure. Handle equations (MathML/LaTeX to MathJax), figures, tables, and footnotes. 2) EPUB3 generation: Using ebooklib with custom templates. Clean CSS styling, responsive layouts, live internal navigation (cross-references, citations), and auto-generated TOC from heading hierarchy. 3) Batch pipeline: CLI tool that processes a folder of articles, validates output against EPUB3 spec (epubcheck), and generates a report. 4) Quality: Proper metadata injection, accessibility tags, cover page generation, consistent typography across articles. Tech: Python, lxml, ebooklib, MathJax, epubcheck validation. I can deliver a working single-article converter in 3 days, full batch pipeline with all edge cases in 7. Can you share a sample XML/HTML input so I can validate my parser approach? Best, Kevin
₹25,000 INR dalam 7 hari
0.0
0.0

Hi, I carefully read your project requirements and understand that you need an automated pipeline to convert XML/HTML academic content into EPUB3 with proper formatting, MathML support, and navigation. I have experience working with Python, data processing, and structured data formats, and I am confident in building a clean, modular pipeline for this workflow. I am comfortable using tools like Pandoc and can implement transformations for content normalization, TOC generation, and EPUB validation. For this project, I will: ✔ Parse and normalize XML/HTML content ✔ Convert structured data into EPUB3 format ✔ Ensure proper handling of figures, tables, and hyperlinks ✔ Implement TOC and navigation structure ✔ Validate output using EPUBCheck I focus on writing clean, well-documented code that can easily integrate into CI/CD pipelines. I am also committed to testing thoroughly to ensure high-quality output across different readers. I am highly dedicated, quick to adapt, and ready to invest the effort needed to deliver this project successfully. Looking forward to working with you. Best regards, Sajeed Ahmed
₹20,000 INR dalam 10 hari
0.0
0.0

Hi, I can build a robust, automated pipeline to convert your XML/HTML articles into fully compliant EPUB3 files with accurate formatting, navigation, and validation. With strong experience in content transformation workflows, I’ll design a CLI-driven solution (Python + XSLT/Pandoc stack) that cleanly parses, normalizes, and maps semantic structures into EPUB3 standards. The pipeline will support MathML (with SVG fallback) for equations, precise handling of figures/tables, dynamic TOC generation, and interactive elements like footnotes and links. Each output will be packaged, validated via EPUBCheck, and tested across Apple Books, Kindle Previewer, and Thorium Reader to ensure consistency. I’ll deliver clean, well-documented scripts, config files, and a reproducible setup guide ready for CI integration. Looking forward to working together. Ashok
₹25,000 INR dalam 7 hari
0.0
0.0

Project Proposal: Automated XML/HTML to EPUB3 Pipeline This project aims to build an automated pipeline that converts batches of academic articles (XML/HTML) into publication-ready EPUB3 files. The system will ensure clean formatting, standards compliance, accurate rendering of complex elements, and compatibility across major e-readers. The pipeline will parse and normalize incoming XML/HTML, map semantic elements to EPUB3 structure, and transform content into valid XHTML. Mathematical expressions will be converted into MathML with SVG fallback to ensure consistent rendering. Figures, tables, and captions will be positioned accurately with responsive behavior. A hierarchical table of contents will be generated automatically based on document structure. Internal navigation, hyperlinks, and interactive footnotes will be embedded. The architecture will remain modular to support future extensions such as audio or interactive widgets. The final output will be packaged as EPUB3 files with proper manifest and spine, and each file will be validated using EPUBCheck to ensure compliance. The system will be implemented using Python (preferred) or Node.js, leveraging tools such as Pandoc, XSLT, or Calibre where appropriate. It will be accessible via a command-line interface and configurable for integration into CI/CD pipelines. Deliverables include the complete pipeline scripts, configuration files, sample EPUB outputs, and a setup guide to reproduce the workflow on a fresh system.
₹15,000 INR dalam 14 hari
0.0
0.0

New Delhi, India
Kaedah pembayaran disahkan
Ahli sejak Mac 21, 2026
₹12500-37500 INR
₹12500-37500 INR
$10-30 CAD
$30-250 USD
$15-25 USD / jam
₹37500-75000 INR
₹12500-37500 INR
$30-250 USD
$30-250 USD
$15-25 USD / jam
₹600-1500 INR
$8-15 USD / jam
$30-250 USD
₹750-1250 INR / jam
$30-250 USD
₹12500-37500 INR
$30-250 USD
$10-30 USD
₹1500-12500 INR
$10-30 AUD
$30-250 USD
€30-250 EUR