I am looking for someone who is experienced in developing an e-mail extractor. I want to compile a large number of contact e-mails from a number of websites.
The e-mail extractor that I'm looking for must be capable of compiling at least 200,000 e-mails an hour. The extractor should have the ability to target specific domains (.edu, .org. .us, .k12) and also have the ability to target specific URL addresses (directory, staff, faculty, administration, etc).
The email extractor shoud also have the ability to extract emails in keyword search mode and also URL direct extraction mode. With regard to URL direct exctraction mode, it needs to have the ability to load multiple URL addresses from a text list and then be able to extract
e-mails from these URLs at the same time.
The e-mail extractor should have the ability to use connect with multiple search engines at the same time in order to speed up the collection process.
The email extractor should also have the ability to collect e-mail addresses from multiple scripts, such as .html, .asp, .cgi, .php, etc).
Finally, the extractor should have the ability to delete any duplicate email addresses right after the run is completed.
I am willing to pay for additional development and testing after the prototype is delivered because I know additional tweaking will be necessary.