Need a program that can read PDF files stored in a folder and extract the Aadhaar pages (Aadhaar is an identification document) into a different PDF file.
Basically, our requirement is to hide the first 8 digits of the Aadhaar number shown on the Aadhaar pages. There could be many Aadhaar pages in a PDF.
So the first step will be to identify the Aadhaar pages in the PDF,
then identify the Aadhaar number,
then hide/whiten/mask the first 8 digits of the Aadhaar number (please see the sample attachments),
then save the PDF file again with the same name.
The resulting PDF file should be the same in every respect, except that the first 8 digits of the Aadhaar number are masked. That's it.
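A minimal sketch of the "identify the number, then mask the first 8 digits" steps, working on text that has already been pulled out of a page (by OCR or a PDF text extractor, neither of which is shown here). It assumes a 12-digit number written as three groups of four, optionally separated by a space or hyphen, with a first digit of 2 to 9, and it uses the Verhoeff checksum (which Aadhaar numbers are commonly described as carrying) to skip other 12-digit strings. The names `AADHAAR_RE`, `verhoeff_valid` and `mask_aadhaar` are made up for this illustration. Masking the actual page image would additionally need the position of the digits on the page, which this sketch does not cover.

```python
import re

# Standard Verhoeff multiplication table (dihedral group D5).
_D = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 2, 3, 4, 0, 6, 7, 8, 9, 5],
    [2, 3, 4, 0, 1, 7, 8, 9, 5, 6],
    [3, 4, 0, 1, 2, 8, 9, 5, 6, 7],
    [4, 0, 1, 2, 3, 9, 5, 6, 7, 8],
    [5, 9, 8, 7, 6, 0, 4, 3, 2, 1],
    [6, 5, 9, 8, 7, 1, 0, 4, 3, 2],
    [7, 6, 5, 9, 8, 2, 1, 0, 4, 3],
    [8, 7, 6, 5, 9, 3, 2, 1, 0, 4],
    [9, 8, 7, 6, 5, 4, 3, 2, 1, 0],
]

# Standard Verhoeff permutation table.
_P = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 5, 7, 6, 2, 8, 3, 0, 9, 4],
    [5, 8, 0, 3, 7, 9, 6, 1, 4, 2],
    [8, 9, 1, 6, 0, 4, 3, 5, 2, 7],
    [9, 4, 5, 3, 1, 2, 6, 8, 7, 0],
    [4, 2, 8, 6, 5, 7, 3, 9, 0, 1],
    [2, 7, 9, 3, 8, 0, 6, 4, 1, 5],
    [7, 0, 4, 6, 9, 1, 3, 2, 5, 8],
]

# Assumed layout: 4+4+4 digits, optional space or hyphen between groups,
# first digit 2-9, and no extra digit directly before or after.
AADHAAR_RE = re.compile(
    r"(?<!\d)([2-9]\d{3})([ -]?)(\d{4})([ -]?)(\d{4})(?!\d)"
)


def verhoeff_valid(digits: str) -> bool:
    """Return True if the digit string passes the Verhoeff check."""
    c = 0
    for i, ch in enumerate(reversed(digits)):
        c = _D[c][_P[i % 8][int(ch)]]
    return c == 0


def mask_aadhaar(text: str, mask_char: str = "X") -> str:
    """Replace the first 8 digits of each checksum-valid candidate."""

    def _mask(m):
        number = m.group(1) + m.group(3) + m.group(5)
        if not verhoeff_valid(number):
            return m.group(0)  # leave unrelated 12-digit strings alone
        # Keep the original separators and the last 4 digits.
        return (mask_char * 4 + m.group(2) + mask_char * 4
                + m.group(4) + m.group(5))

    return AADHAAR_RE.sub(_mask, text)
```

The checksum test is a design choice to reduce false positives such as phone or account numbers; with noisy OCR output it may also reject genuine numbers that were misread, so it could be made optional.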
The Aadhaar image can appear as a rotated document, a photocopy of an Aadhaar card, etc.
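One possible way to cope with rotated scans is to try the page at several angles and keep the first one where the text contains something shaped like an Aadhaar number. The sketch below only shows that control flow: `ocr` and `rotate` are hypothetical callables that the caller would supply from whatever OCR and imaging libraries are chosen, and the pattern assumes a 4+4+4 digit layout with a first digit of 2 to 9.

```python
import re
from typing import Callable, Iterable, Optional, Tuple, TypeVar

Page = TypeVar("Page")

# Assumed shape of a candidate number: 4+4+4 digits, optional space/hyphen.
CANDIDATE_RE = re.compile(r"(?<!\d)[2-9]\d{3}[ -]?\d{4}[ -]?\d{4}(?!\d)")


def find_number_any_rotation(
    page: Page,
    ocr: Callable[[Page], str],           # hypothetical: page image -> text
    rotate: Callable[[Page, int], Page],  # hypothetical: rotate by degrees
    angles: Iterable[int] = (0, 90, 180, 270),
) -> Optional[Tuple[int, str]]:
    """Return (angle, matched_text) for the first angle whose OCR text
    contains a candidate number, or None if no angle yields one."""
    for angle in angles:
        text = ocr(rotate(page, angle))
        match = CANDIDATE_RE.search(text)
        if match:
            return angle, match.group(0)
    return None
```

Only right-angle rotations are tried here; skewed photocopies would need a deskew step before OCR, which is outside this sketch.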
Attached is the sample PDF file.
22 freelancers are bidding an average of ₹8872 for this job
I have done the same type of work earlier, so I can handle this PDF work and perform the masking with an Aadhaar algorithm. I have around 15 years of experience and very good exposure to PDF, Word, and Excel in Microsoft .NET technology.
I've worked on projects related to data extraction and data manipulation using Python. I also have an extensive background in machine learning and AI through my current job.