Font Size: a A A

Based On Research And Optimization Lucene Inverted Index Performance

Posted on:2014-05-02Degree:MasterType:Thesis
Country:ChinaCandidate:B ZhangFull Text:PDF
GTID:2268330401473349Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Nowadays,Internet technology has been developing quickly,at the same time,the amount of information is increasing by geometric multiplication, The human society has entered the information age, People enjoy the convenience that the Internet brings,meanwhile, How to retrieve useful information on their own in the vast amounts of knowledge and information has become the Internet important issues that needs to be resolved. Today, the Internet generates, updates, or disappears various Web pages every day. It is because of the birth of the search engine technology, the complicated situation of Internet is challenged. People can easily use the search engine tool like maze lighthouse that helps thousands of people find important information. Search engine technology is the use of a certain strategy, the use of the the network spider the Internet to collect information, and then these information processes Search engine technology is the use of a certain strategy, the use of the the network spider the Internet to collect information, and then processes these information, stores in the host server, then provides search services to network users.The search engine is a complex technology, it relates to the technology of data mining, information retrieval, natural language processing, and distributed storage. Its core technology has been in the hands of commercial companies, ordinary people is difficult to contact with search engine technology. The emergence of the Lucene has broken the situation. Lucene is a free open-source Java package for full-text search. It is not a full text search engine, but a full-text search architecture, adding full-text search functionality for a variety of small and medium-sized applications, providing search engine services.This article research and analysis based on full-text search tool package of Lucene framework research and analysis. Analyzes the performance of the Lucene and its optimization and improvement.(1)Analysis of full-text inverted index technology, Inverted index-based full-text retrieval superior performance verified by experiments, compared with the traditional string matching search, Lucene with inverted index can implement full-text retrieval more quickly and more accurately.(2)Through research and analysis on the Lucene, Propose a new parallel processing inverted index. and proves that this indexing technology improves the performance of full-text search to some extent through experiments.
Keywords/Search Tags:Internet, Search engine, Full-text search, Lucene, Inverted index
PDF Full Text Request
Related items