The Design And Realization Of The Vertical Search Engine On The Basis Of Java

Posted on: 2010-10-26
Degree: Master
Type: Thesis
Country: China
Candidate: S J Zhang
Full Text: PDF
GTID: 2178360278479704
Subject: Computer application technology
Abstract/Summary:
With the explosive growth of information on the Internet, people have become increasingly dependent on search engines, which are now the key tool for finding and acquiring knowledge. As the volume of information keeps growing, users want more accurate, more detailed, and more deeply classified information, a demand that search engines will have to meet in the future. This thesis first introduces the history, internal structure, and working principles of search engines. After analyzing the problems of general-purpose search engines, it introduces the concept of the vertical search engine and describes its characteristics and prospects. It then explains the design ideas behind the open-source projects Heritrix and Lucene and proposes a vertical search engine for mobile products built on this existing open-source code. Walking through the program code step by step, it covers crawling pages, extracting product parameter information, generating the product word library, constructing the product index, and saving the information to a database. Finally, it builds the query interface of the web system, completing the construction of the whole system. The system implements all the functions defined in the summary design. The design ideas and implementation methods presented here offer reference value both for research on vertical search technology and for building a practical vertical search engine.
Keywords/Search Tags: vertical search engine, web spider, Lucene, Heritrix, website information extraction, Java
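
The indexing and query steps named in the abstract can be illustrated with a minimal sketch. This is not the thesis's Lucene-based implementation: the class ProductIndexSketch, its methods, and the sample product records are all hypothetical, and a plain in-memory inverted index stands in for Lucene so that the example runs with the JDK alone.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

// Hypothetical illustration: a toy inverted index over product parameter text.
final class ProductIndexSketch {
    // term -> sorted ids of the products whose text contains that term
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();

    // Index one product record (assumed to be already extracted from a crawled page).
    void add(int id, String name, String parameters) {
        for (String term : tokenize(name + " " + parameters)) {
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(id);
        }
    }

    // AND query: return ids of products that contain every query term.
    List<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : tokenize(query)) {
            TreeSet<Integer> ids = postings.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(ids);
            } else {
                result.retainAll(ids);
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }

    // Lowercase the text and split it on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        ProductIndexSketch index = new ProductIndexSketch();
        // Made-up sample records, not data from the thesis.
        index.add(1, "Phone A", "GSM, 2.4-inch screen, 3.2-megapixel camera");
        index.add(2, "Phone B", "CDMA, 3.0-inch touch screen, Bluetooth");
        System.out.println(index.search("screen"));
        System.out.println(index.search("touch screen"));
    }
}
```

In a system like the one described, Lucene would take over tokenization, index storage, and relevance ranking, and the records would come from pages crawled by Heritrix rather than from hard-coded strings.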