Apache Solr 3.x (a Drupal module is already available: [url removed, login to view])
Apache Nutch 2.x
1) A Drupal 7.x module that lets me configure Nutch 2.x
a) Must at least allow setting the maximum number of hops
b) Must allow setting the seed URLs
c) Must crawl only when certain criteria set in the Drupal 7.x module are met (for example, only crawl websites about trains, cars, news, or alternative news)
d) Must use MySQL as the database
2) Use Solr to search the crawls done by Nutch 2.x
a) There is a Drupal 7.x module ready ([url removed, login to view])
b) This module must connect to the Nutch index, and the index must be searchable through the Drupal 7.x search
3) The ranking of the search results returned by Solr must be based on PageRank, using the simplest of methods ([url removed, login to view])
a) I want to be able to change the PageRank criteria/algorithm
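To illustrate what "the simplest of methods" in requirement 3 could look like, here is a minimal sketch of PageRank computed by power iteration over a small link graph. It is written in Python only to show the logic; the actual module would be PHP. The function name `pagerank`, the parameters `damping` and `iterations`, and the sample graph are assumptions for illustration and do not come from Solr or Nutch. Exposing such parameters as module settings would be one way to keep the ranking criteria changeable.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Basic PageRank by power iteration.

    links maps each page to the list of pages it links to.
    damping and iterations are hypothetical tunable settings.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    if not pages:
        return {}

    n = len(pages)
    ranks = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page starts each round with the "random jump" share.
        new_ranks = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's rank evenly over its outgoing links.
                share = damping * ranks[page] / len(targets)
                for target in targets:
                    new_ranks[target] += share
            else:
                # A page without outgoing links spreads its rank over all pages.
                share = damping * ranks[page] / n
                for other in pages:
                    new_ranks[other] += share
        ranks = new_ranks
    return ranks


# Hypothetical sample graph: "a" and "b" link to each other, "c" links to "a".
sample_links = {"a": ["b"], "b": ["a"], "c": ["a"]}
sample_ranks = pagerank(sample_links)
```

In this sample, page "a" ends up with the highest score because it receives links from both other pages, and "c" ends up lowest because nothing links to it.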
I want to index only websites that match my criteria. For example, I want to index websites that are oriented towards the automotive sector. I want to be able to enter these criteria in the Drupal module.
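As a rough sketch of the settings described above (maximum hops, seed URLs, and topic criteria), the following Python example shows one possible shape for the logic. It is only an illustration; the real module would be written in PHP for Drupal 7.x. The class name `CrawlSettings`, its fields, and the `min_hits` threshold are hypothetical. The only Nutch-related assumption is that the seed list is a plain text file with one URL per line. Keyword matching here is deliberately simple: single-word keywords are compared against whole words in the page text.

```python
import re
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class CrawlSettings:
    """Hypothetical container for the values entered in the module form."""
    seed_urls: list
    max_hops: int = 2
    topic_keywords: list = field(default_factory=list)

    def validate(self):
        """Raise ValueError if the settings cannot be used for a crawl."""
        if self.max_hops < 0:
            raise ValueError("max_hops must be zero or greater")
        if not self.seed_urls:
            raise ValueError("at least one seed URL is required")
        for url in self.seed_urls:
            parts = urlparse(url)
            if parts.scheme not in ("http", "https") or not parts.netloc:
                raise ValueError("invalid seed URL: " + url)

    def seed_file_text(self):
        """Return the seed list as text, one URL per line."""
        return "\n".join(self.seed_urls) + "\n"

    def matches_topic(self, page_text, min_hits=1):
        """Return True if the page mentions enough of the topic keywords."""
        if not self.topic_keywords:
            return True  # no criteria configured: accept every page
        words = set(re.findall(r"[a-z0-9]+", page_text.lower()))
        hits = sum(1 for keyword in self.topic_keywords
                   if keyword.lower() in words)
        return hits >= min_hits


# Hypothetical example values for an automotive-oriented crawl.
settings = CrawlSettings(
    seed_urls=["http://example.com/"],
    max_hops=3,
    topic_keywords=["cars", "automotive"],
)
settings.validate()
```

With these example values, `settings.matches_topic("New automotive trends")` accepts the page, while a page that never mentions "cars" or "automotive" as a whole word is rejected.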
Please do not hesitate to contact me if anything is not clear.
Must have experience with PHP, Drupal 7.x module development, Solr 3.x, and Nutch 2.x.