Font Size: a A A

The Design And Implementation Of Full Text Search Engine Based On Lucene

Posted on:2017-11-15Degree:MasterType:Thesis
Country:ChinaCandidate:Z HuangFull Text:PDF
GTID:2428330518496675Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the speed of development of the Internet network information network for the majority of Internet users to provide great freedom of expression and information available anywhere to bring a lot of convenience,people can get a variety of information around the world through a network.However,the information on the Internet site is an open,distributed information flow,rapid sprawl of information for Internet users,the lack of unified and effective means of information flow management.To find the information you need,users spend a lot of time and effort on the Internet,but can not be spent,the overall effectiveness of the information explosion as the amount of information compromised.Therefore,in order to let each user in the vast Internet to find the information they need,using traditional principle to achieve full text retrieval search engine system.Search engine that allows users to increase the capacity to collect and positioning information.By finding as much network information,and then to a certain strategy collected and processed and management,and ultimately to provide efficient,fast full-text search service.With the Internet technology becomes more mature,evolving open source technology,the site establishment costs are increasingly reduced,but can be a good showcase for its variety of information,almost every national government agency,institutions,business units have established their own portal.Over time,the accumulation of more and more information on the website,the user can not stand to spend a lot of time and effort to find pages of information,general search engines via the navigation bar,such as google,Baidu and other search engines can not meet the user to search precise positioning We need,in order to solve this problem,which requires the establishment of its own full-text search engine in the site.This paper discusses the background and significance of this paper;then a brief search engines background,history,information retrieval and future direction of development,focusing on detailed studies determine search engine performance three key technologies:Chinese word segmentation,indexing and retrieval technology;followed by detailed full-text search engine requirements analysis,system design and overall outline outline the main module design;then combine Lucene development framework to achieve a full-text search engine,to achieve a web crawler,data analysis,indexers,retrieval and user interface five modules;Finally,the full-text search engine system design deployed to the server,and then functional and performance testing,and the test results were summarized and proposed to improve the correlation algorithm improvements significantly improve the accuracy of the search engine,and ultimately It allows users to search for the station immediately precise information you need to obtain the page.Finally,the paper summarizes the implementation Lucene full-text search engine system based on further research and future work.
Keywords/Search Tags:search-engine, retrieval-model, index-model, lucene, relevance
PDF Full Text Request
Related items