Resume parse should be an application written in .NET (C# preferred) and should have the following input/output:
Input: Resume files in the following formats: WORD, PDF, TEXT, TIF
•XML format files of the resume (XML template attached) when all the words from resume are located in the correct tag of the XML.
•Problematic fields list (see more details in the additional details section)
Success criteria: The parsing mistake should be no more than 4% in average for 1000 resumes - the mistake percentage defined as the number of wrongly parsed fields divided by the total number of fields.
•The application should support mass processing of more than one resume file in different formats, the output should be more than one XML file for that case
•The application should contain 2 main modules:
[url removed, login to view] converter – Each file format will be translated by this module to text format
[url removed, login to view] engine – This engine should receive a text file and return an XML file
The separation is needed in order to allow additional file formats in the future.
•The application should return in addition to the XML a set of fields which might be problematic:
[url removed, login to view] mandatory fields in the resume file (list of mandatory fields will be provided at a later stage)
[url removed, login to view] which might has a mistake
•The application should contain a log file
•The application should provide an API which can be called by other .NET consumers, this application shouldn't have any user interface
•The process time per resume should be no more than 0.5 second per resume.
39 pekerja bebas membida secara purata $4343 untuk pekerjaan ini
I have the experience of development that convert pdf, doc files to text files, and the experience of convert text files to xml files. So I can do this job for you. Please contact me.