Lucene-Based Web Search Engine System Inside Web Site

Posted on: 2006-12-19
Degree: Master
Type: Thesis
Country: China
Candidate: P B Liu
Full Text: PDF
GTID: 2168360152998580
Subject: Computer software and theory
Abstract/Summary:
Along with the rapid growth of information on the Web, it has become increasingly difficult for Web users to find useful information among the enormous volume of Web content. Research on Web search engines, now active at many research institutes, addresses this problem. A Web search engine is a special kind of web site for Internet information retrieval. It collects web pages through robots called crawlers, analyzes the original pages, and stores the resulting information in databases. When a user enters keywords, the search engine searches the indexes in its database and returns the relevant web pages. Since 1994, to satisfy the growing demand for Web information search, Web search engines have evolved through three stages: centralized search, distributed search, and intelligent search. Current work focuses mainly on automated search, smart classification, and intelligent analysis. In the future, research will expand into areas such as multimedia search, specialized search, and cross-language search to meet Web users' varied requirements. This thesis first introduces the basic principles of a Web search engine. Second, it discusses in depth how to improve a search engine's accuracy. Third, it introduces Lucene, a Java-based full-text search engine software package. Finally, it builds a Web search engine system using Lucene and improves the search engine's accuracy by means of an advanced result-ordering algorithm...
Keywords/Search Tags: Web, Search Engine, Accuracy, Lucene
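
The collect, index, and search cycle described in the abstract can be illustrated with a minimal sketch in plain Java. This is not the thesis's system and it does not use the Lucene API: the class name TinyInvertedIndex, its methods, and the sample pages are all hypothetical, chosen only for illustration. It assumes the crawler has already delivered each page's URL and text, and it returns every page containing all query terms without any ordering of results, so the thesis's ordering algorithm is not represented here.

```java
import java.util.*;

// Hypothetical illustration only: a tiny in-memory inverted index.
public class TinyInvertedIndex {
    // term -> ids of the documents that contain that term
    private final Map<String, Set<Integer>> postings = new HashMap<>();
    // document id -> URL, in the order pages were added
    private final List<String> urls = new ArrayList<>();

    // Lowercase the text and split on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // Index one page; a real crawler would supply the url and text.
    public void add(String url, String text) {
        int docId = urls.size();
        urls.add(url);
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
    }

    // Return the URLs of pages containing every query term (AND semantics),
    // in the order the pages were added. No relevance ordering is applied.
    public List<String> search(String query) {
        Set<Integer> result = null;
        for (String term : tokenize(query)) {
            Set<Integer> docs = postings.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        List<String> hits = new ArrayList<>();
        if (result != null) {
            for (int id : result) {
                hits.add(urls.get(id));
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        TinyInvertedIndex index = new TinyInvertedIndex();
        index.add("/a.html", "Lucene is a full-text search library");
        index.add("/b.html", "A crawler collects web pages");
        index.add("/c.html", "Search engines index crawled pages");
        System.out.println(index.search("search"));
        System.out.println(index.search("pages crawler"));
    }
}
```

The index maps each term to the set of pages containing it, so a keyword query becomes a set intersection instead of a scan over every stored page. A production engine such as one built on Lucene would add text analysis, persistent storage, and scoring on top of this idea.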