
Closed
Posted
I have a collection of Hindi-language documents supplied as PDFs that need to be annotated for Part-of-Speech. Your task is to extract the text from each PDF and tag every token with the correct POS label, following the standard Hindi tag set (for example, NN, VB, JJ, PRP, etc.). Accuracy is essential because the data will feed a downstream NLP model. If you already use tools such as spaCy, Stanza, or custom tagging interfaces, feel free to integrate them, but the final output must be a clean, human-checked file, not just an auto-tagged draft.

Deliverables
• For each input PDF: a UTF-8 text file containing the original sentence order alongside its POS tags (tab-separated or CoNLL-style).
• A brief log noting any unreadable sections or encoding issues you encounter.

I will share the PDFs and a short style guide once we start. Let me know your estimated turnaround time per 1,000 words and any previous work you’ve done with Hindi linguistic annotation.
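
As a rough illustration of the CoNLL-style deliverable described above, here is a minimal Python sketch that writes already-tagged sentences as one `token<TAB>tag` pair per line, with a blank line between sentences, in UTF-8. The function names (`to_conll`, `write_conll`), the sample sentences, and the tags shown are assumptions made for this example only. The actual tag set and column layout would come from the style guide, and the tagging and human review happen before this step.

```python
from pathlib import Path

# Hypothetical sample: each sentence is a list of (token, tag) pairs.
# The tags are placeholders drawn from the examples in the posting,
# not a verified annotation.
SAMPLE = [
    [("वह", "PRP"), ("अच्छा", "JJ"), ("लड़का", "NN"), ("है", "VB")],
    [("मैं", "PRP"), ("घर", "NN"), ("गया", "VB")],
]


def to_conll(sentences):
    """Render tagged sentences as token<TAB>tag lines.

    Sentences keep their original order and are separated by one
    blank line. The result ends with a single newline.
    """
    blocks = []
    for sentence in sentences:
        rows = []
        for token, tag in sentence:
            # A token containing whitespace would break the column layout.
            if not token or any(ch.isspace() for ch in token):
                raise ValueError(f"invalid token: {token!r}")
            rows.append(f"{token}\t{tag}")
        blocks.append("\n".join(rows))
    return "\n\n".join(blocks) + "\n"


def write_conll(path, sentences):
    """Write the rendered text to `path` as UTF-8."""
    Path(path).write_text(to_conll(sentences), encoding="utf-8")
```

Calling `write_conll("doc1.txt", SAMPLE)` would produce one output file for one input PDF; the file name here is only an example.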
Project ID: 40308706
2 proposals
Remote project
Active 30 days ago
2 freelancers are bidding an average of ₹350 INR/hour for this job

Hi, I am applying in response to your requirement for tagging Hindi PDF documents. Please share the relevant details along with the style guide. You can refer to my profile to learn more about my skills. Do let me know if you have any questions. Regards, Rashmi
₹400 INR in 40 days

Hello, I can help with Hindi POS annotation for your documents. I have a background in Artificial Intelligence and Data Science with experience in Natural Language Processing and text processing tasks. I’m comfortable working with Hindi text and can carefully tag tokens using the standard Hindi POS tag set. For this project, I will extract the text from the PDFs, tokenize the sentences, and annotate each token with the appropriate POS tag. I will also manually review the tags to ensure accuracy and provide the final output in a clean UTF-8 text file in the requested format (tab-separated or CoNLL-style). Any unreadable sections or encoding issues will be documented in a brief log. Estimated turnaround: around 1,000 words per day depending on the document quality. I’d be happy to review the sample PDFs and the tagging style guide before starting. Best regards.
₹300 INR in 30 days

Bengaluru, India
Member since Aug 13, 2025
₹400-750 INR / hour