I need a C# application that will allow me to provide a file containing a list of license numbers and license names and will screen-scrape this site [login to view URL] and return the license number, license status, address, license expiration date and list of DBA's for each license number/name combination. This information will be written to a new file. I would prefer that HTMLAgilityPack be used for the screen-scraping as it's being used elsewhere (your C# code will eventually become part of a larger application)
From this point on it's helpful if you follow along here:
[login to view URL]
As an example, one line in the file might contain 1274404, POTTER HERBERT JAMES. Your code will do a search on the license number, then "click on the link for POTTER HERBERT JAMES", then retrieve the list of DBA's (CENTRAL JAIL BAIL BONDS, SOS BAIL BONDS) and the Business Address, Status, and Expiration Date. This data would be written to appended to an output file. It's possible that there is no match on license number and/or license name so just write the license number, license name, and the reason you couldn't find data (eg, license number not found or license name not found).
Note that this site uses reCaptcha so you need to bypass it.
I've enclosed a small sample test file as well as a file showing what I expect the results to be. The output file contains one entry per line, with the items tab separated (the DBA's are comma separated) Note that for license number 1215644 there are no DBA's.
14 pekerja bebas membida secara purata $215 untuk pekerjaan ini
Hello!! I have read your requirements. I have done many tasks of scraping. Would like to share that I have 4+ years of experience. Looking to hear from you. Imtiyaz
Hi Dear. I am a senior C# developer. I have good experience in web scraping development. I am sure I can provide PERFECT result. I can start now. I hope to discuss with you. Thanks. From Paul.