Solr + Nutch + Hadoop integration

This project is strictly for people who are highly skilled in Nutch, Hadoop and Solr; integrating the three shouldn't take more than an hour for someone who knows the job. After this, I will have more search engine development work, as I plan to run large-scale searches.

For now, I need a Nutch, Solr and Hadoop integration that meets the following requirements:

1. Hadoop will be configured on more than 2 machines, and it should be easy to add another machine to scale out the existing configuration.

2. Nutch will be used for indexing. It will pick up URLs from a flat file, read its configuration from a central settings file, and start indexing, using Hadoop to spread the indexing across the other machines. URLs that are already indexed should not be followed again unless a reindex flag is set in the settings file.

3. The Nutch output will go to Solr, and I should be able to search the indexed websites using Solr. Solr will also be integrated with Hadoop to run clustered searches.
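
The reindex rule in item 2 can be sketched as follows. This is only a minimal illustration in Python, not a Nutch feature: the function name `select_seeds`, the `already_indexed` set and the `reindex` flag are all hypothetical, and in a real setup Nutch keeps its own record of fetched URLs in its crawl database. The sketch only shows the intended behaviour: skip known URLs unless the flag is set.

```python
def select_seeds(seed_lines, already_indexed, reindex):
    """Return the URLs to crawl from a flat seed file (hypothetical helper).

    seed_lines      -- iterable of raw lines from the flat URL file
    already_indexed -- set of URLs known to be indexed already (assumed input)
    reindex         -- value of the reindex flag from the settings file
    """
    seeds = []
    seen = set()
    for line in seed_lines:
        url = line.strip()
        # Skip blank lines and comment lines in the seed file.
        if not url or url.startswith("#"):
            continue
        # Skip duplicates inside the seed file itself.
        if url in seen:
            continue
        seen.add(url)
        # Known URLs are crawled again only when the reindex flag is set.
        if reindex or url not in already_indexed:
            seeds.append(url)
    return seeds
```

With `reindex` set to `False`, only URLs missing from `already_indexed` are returned; with `True`, every distinct URL in the file is returned.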

Initially, we will have a central server and 2 sub-servers across which we can distribute search and indexing.
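
One possible way to spread a search over the central server and the two sub-servers is Solr's own distributed search, where a request lists the participating servers in the `shards` parameter. Note that this is a Solr mechanism and does not itself involve Hadoop, so it may or may not match what is wanted here. The sketch below only builds the request URL; the host names, port and core name are assumptions for illustration.

```python
from urllib.parse import urlencode


def build_distributed_query(coordinator, shard_hosts, core, text, rows=10):
    """Build a Solr select URL that fans a query out to several shards.

    coordinator -- host:port of the server receiving the request (assumed)
    shard_hosts -- list of host:port values holding parts of the index (assumed)
    core        -- Solr core name, hypothetical here
    text        -- the user's search text
    """
    # Solr expects a comma-separated list of shard addresses without a scheme.
    shards = ",".join(f"{host}/solr/{core}" for host in shard_hosts)
    params = {"q": text, "shards": shards, "rows": rows, "wt": "json"}
    return f"http://{coordinator}/solr/{core}/select?{urlencode(params)}"
```

For example, `build_distributed_query("central:8983", ["sub1:8983", "sub2:8983"], "nutch", "hadoop")` produces a URL that asks the central server to merge results from both sub-servers.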

If you can also suggest ways to change ranking dynamically, I would be interested.
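
One way ranking could be changed dynamically is at query time, using Solr's extended DisMax parser: the `qf` parameter weights fields and `bq` adds a boost query, so the weights can change per request without reindexing. The following is a sketch under assumptions: the base URL, the core name and the field names `title` and `content` are placeholders, and the function `build_ranked_query` is hypothetical.

```python
from urllib.parse import urlencode


def build_ranked_query(base_url, text, field_weights, boost_query=None, rows=10):
    """Build a Solr select URL with per-request ranking weights.

    base_url      -- URL of the Solr core, e.g. http://host:8983/solr/core (assumed)
    text          -- the user's search text
    field_weights -- dict mapping field name to boost, e.g. {"title": 3}
    boost_query   -- optional extra query whose matches are ranked higher
    """
    # edismax reads field boosts as "field^weight" pairs separated by spaces.
    qf = " ".join(f"{field}^{weight}" for field, weight in field_weights.items())
    params = {"q": text, "defType": "edismax", "qf": qf, "rows": rows, "wt": "json"}
    if boost_query:
        params["bq"] = boost_query
    return f"{base_url}/select?{urlencode(params)}"
```

Because the weights are ordinary request parameters, they could be read from the same central settings file and adjusted without touching the index.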

Let me know.


Skills: Apache Solr, PHP, Software Engineering


About the Employer:
( 19 reviews ) Mumbai, India

Project ID: #4065263

3 freelancers are bidding an average of $283 for this job


We will do an excellent job for you.

$250 USD in 10 days
(251 Reviews)

I have very good knowledge of Hadoop and can help you.

$100 USD in 10 days
(0 Reviews)

Chris, I am certified for Hadoop by Cloudera.

$500 USD in 3 days
(0 Reviews)