This project is for building a search engine using the Apache Lucene open source project.
The winner of this project should be able to provide everything needed to make this site work according to the following specs:
- site should crawl specific sites and store a cache in a DB.
- cache should contain title,description,movie length,movie animated gif or a short flv movie (5 seconds)
- results should be indexed and returned to users
- site should direct the users to the specific page on the web at the specific screen position of the main content that the user searched for
- when a user opens the site he wishes to enter, the site will be displayed in a wrapper like in google images where you can see that the top bar is showing the google images result.
- builder should pay strong attention to building the site in a way that will be secured and will not result in blocking the crawler crawling the specific site needed to be crawled
- site should contain categories view beside search results and should be able build in a way that is easy to maintain and add/remove site crawled.
- grapich design is ready so no design is needed
- site CMS needed
- the Lucene project is using the Apache Tomcat server
- winning bidder should install the working site on my server
- NDU needed.
please do not bid if you have no experience with crawlers and search
Please PM me for any questions and more specs