Font Size: a A A

Design And Implementation Of Search Engine Based On Lucene And HTML Parser

Posted on:2009-05-15Degree:MasterType:Thesis
Country:ChinaCandidate:C S NiuFull Text:PDF
GTID:2178360272478293Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The amount of information on the Internet play growth, and the contents also changes of redundancy and complications, in the case search engine to become more and more to be popular with people and it has become a kind of necessity of tool to obtain the information on the net. However the traditionally search engine's data quantity is also extremely huge enormous, in order to solve the problem the professional search engine develop on the base of traditionally search engine technique.Based on"the mobile phone product information-related search engine"research on the following three key issues are explored in the study. The first HTMLParser use of robot technology to crawl through the network; web site text of the high-efficiency analysis, the information once again integration, targeted at field extracted the required data processing, and then to return to some form users. The second is the index of data, optimization, and the sort of problems. The unit word stock of inquiry key words which corresponds based on the Lucene technology Established, to solve the slow peed question. The third is the framework of the System Spring through a systematic framework for the management of the background to ensure that the search engine system operational stability.At present, this professional search engine system has run and the resu1t is exce1lent.This System has reached its goal. To a certain extent to achieve the purpose of search optimization, improved information retrieval efficiency compared to General search engines.
Keywords/Search Tags:The professional search engine, Search engine, Net Robot
PDF Full Text Request
Related items