I have extensive experience processing PDF files: generating PDFs, extracting data, rasterizing pages, and creating thumbnails. If a PDF contains actual text, no OCR is needed; if it contains only images of text (for example, scanned pages), OCR is required. I have a little experience with two OCR engines: MODI (Microsoft Office Document Imaging) and Tesseract.
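To illustrate the text-versus-image decision, here is a minimal Python sketch, not a definitive implementation. It assumes the third-party packages `pypdf`, `pdf2image` (which needs Poppler installed), and `pytesseract` (which needs the Tesseract binary) are available. The helper names and the `min_chars` threshold of 20 are my own illustrative choices.

```python
def needs_ocr(page_text, min_chars=20):
    """Return True when a page's text layer is empty or too sparse to trust.

    min_chars is an arbitrary, hypothetical threshold; tune it for your files.
    """
    visible = "".join((page_text or "").split())  # drop all whitespace
    return len(visible) < min_chars


def pages_needing_ocr(page_texts, min_chars=20):
    """Return the 0-based indexes of pages that should be sent to OCR."""
    return [i for i, text in enumerate(page_texts) if needs_ocr(text, min_chars)]


def extract_page_texts(pdf_path):
    """Read the embedded text layer of every page (assumes pypdf is installed)."""
    from pypdf import PdfReader  # imported lazily so the helpers above need no extras

    reader = PdfReader(pdf_path)
    return [page.extract_text() or "" for page in reader.pages]


def ocr_page(pdf_path, page_index, dpi=300):
    """Rasterize one page and OCR it with Tesseract.

    Assumes pdf2image and pytesseract are installed; page_index is 0-based,
    while pdf2image counts pages from 1.
    """
    from pdf2image import convert_from_path
    import pytesseract

    images = convert_from_path(
        pdf_path, dpi=dpi, first_page=page_index + 1, last_page=page_index + 1
    )
    return pytesseract.image_to_string(images[0])


def pdf_to_text(pdf_path):
    """Use the text layer where it exists and fall back to OCR elsewhere."""
    texts = extract_page_texts(pdf_path)
    for i in pages_needing_ocr(texts):
        texts[i] = ocr_page(pdf_path, i)
    return texts
```

Calling `pdf_to_text("scan.pdf")` (a hypothetical file name) would return one string per page. A character-count check is only a heuristic: a scanned page that already carries a hidden OCR text layer will pass it, and a page with very little real text will be sent to OCR unnecessarily.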