What i want done is an search engine in java using classes provided by [url removed, login to view] that will search throw a large txt that inside includes description of articles with their corresponding author, must be completed in a month
1) [url removed, login to view] is a file with description of articles that must be seperated into 3204 seperate txt, every new txt will begin with .I(cappital i) as you can see if you open it (that code is already written and i can provide it, though i figure its easy)
2)Next create an indexer of lucene( will have an indexwritter,reading the files and indexing them in a directory, while holding 3 fields , title, author and synpsis)
3)A porter analyzer which will use the common [url removed, login to view] and tokenize .
4)A searcher which will read a search entry , search the 3 fields from the indexer and list the matching hits
5)A revelance feedback
p.s : its obvious that it is an asignment , i have worked with it but after a week i saw no progress in the complicated parts, so it got posted here(comments during the code with general exaplation are there for required for my understanding). Waiting offers about this.
[url removed, login to view]: usefull lucene classes TermPossitonVector,TopDocSearch,Directory,IndexSearcher,Query, MultiFieldQueryParser
[url removed, login to view] Porter analyzer refers specifically to porter stemming algorithm that extracts the grammatical roots of words