Hi, I'm Enrique!
I'm a programmer since for many years, and I have made several projects similar to what you are searching. In Python there is a library called “tabula” that allows me to extract tables from PDF files, convert them to Pandas DataFrame, clean (remove empty and unnecessary columns and rows) and using Pandas to save that in an Excel document (“* .xlsx”). I also see that you added: "Web Scraping." I also work Web Scraping (using several libraries such as BeautifulSoup, requests, robobrowser, etc.), in case the information is found within a web page. If you want, we can discuss it and see how it is ;-)
Attentively, Enrique Mora :-)