Font Size: a A A

Research And Implementation Of Personal Search Engine Based On Lucene

Posted on:2012-03-02Degree:MasterType:Thesis
Country:ChinaCandidate:Z G DingFull Text:PDF
GTID:2248330374495969Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The sharp increment of information quantity of technology knowledge brings many puzzles to the peoples who take in the industry which required strong technology. Most urgent need to solve the problem is how to find real useful information resource from the huge technology information. However the domestic and overseas traditional search engines always just make matching finding according to the keywords that user inputs. This mode has show many shortages in the environment of information blast increase, which mainly exhibits as below:the information quantity returned by the search engine is still huge. Users still need make another filter according to their personal requirements. For improving the weakness of traditional research engine, the paper researches and designs a searching algorithm based on user property and implements an individuation search engine.This paper researches how to use individuation service technology to provide full, unify and centralized information search solving scheme for the user who need find some information fast. The individuation service in this paper studies users interest and behavior by collect and analyze user information, and distinguish information requirement of different users, then provides individuation service for different users which improves the precision of information service. Further more, this paper has established a technology data support and help system which has a core of individuation search engine, to provide a complete and unify individuation services.Firstly, this paper analyzes the development status in quo and future trend of individuation service and search engine, then discusses the key technology of how to construction individuation search engine, and points out the key problem of individuation search engine which collects information based on users interest, namely individuation sorting technology, how to fast and effectively find the required information. This paper designs a searching algorithm based on user property by using vector space model. It first setups a series of user model, every kind user model corresponds a search strategy. When user uses the search services, it judges which kind of user model the user belongs according to the user property firstly, then uses this model corresponding search strategy to filter and resort the information, to get the individuation search result. Based on upper research fruit, this paper implements a search engine system based on Lucene open fountain platform which applies for military police army inner website. For increasing search precision of the system, this paper adds the tracing and studying of user behavior, and takes the user behavior custom as the extending of user property, then realizes the function of incremental collection, automatically decomposing words and settling reverse index of information. It is able to effectively eliminate "Information Island", improve the search frank of correctness and completion in military police army information website, then make the most of information resource.
Keywords/Search Tags:Search engine, Personality, Query expansion, Lucene
PDF Full Text Request
Related items