Research Of Vertical Search Engine Based On Web

Posted on: 2013-07-23
Degree: Master
Type: Thesis
Country: China
Candidate: S J Huang
Full Text: PDF
GTID: 2248330374951523
Subject: Signal and Information Processing
Abstract/Summary:
With the rapid development of modern networks, the amount of information on the web is growing at an alarming rate, and the demands placed on search engines keep rising. Vertical search engines emerged to meet users' needs more closely. A vertical search engine serves a specific industry or a specific group of users, addresses some deficiencies of general-purpose search engines, and therefore has advantages over them. As information becomes industrialized, demand for professionally oriented search is also growing, and building a vertical search engine system for a specific domain has become one of the hot issues in search engine research. On the basis of a study of the key technologies of vertical search engines, this paper analyzes and designs the processes of a professional web spider, indexing, and retrieval. It uses a professional web spider search strategy to implement information collection, indexing, and querying, and thereby constructs a vertical search engine system.
The main research contents of this paper are as follows:
(1) Professional web spider: the paper analyzes professional web spider technology and designs the spider's search strategy and process. It studies two kinds of search strategy, one based on webpage content and one based on link structure, and combines the two to design and implement the web spider, which is the core part of the system.
(2) Indexing and Chinese word segmentation: the paper analyzes and designs the indexing and Chinese word segmentation algorithms. It uses a Chinese word segmentation algorithm based on a maximization segmentation strategy, indexes the segmented information, and stores the resulting inverted index data in a database.
(3) Information retrieval: the paper studies the structure of the retrieval framework. A similarity matching algorithm is used to rank webpage information; the user submits a query through the retrieval interface, and the results are sorted and displayed to the user.
(4) System design and implementation: through discussion and study of the key technologies of vertical search engines, each module is analyzed and designed, namely the specialized information acquisition module, the index module, and the information query module, to implement a vertical search engine system. The system is designed with professional and personalized features to meet users' information retrieval requirements.
Finally, the technologies related to vertical search engines are summarized, the areas where the technology is not yet mature and needs further optimization are discussed, and rough directions and goals for future study are proposed, so that the system can be improved step by step and vertical search services made more professional.
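Items (2) and (3) above can be illustrated with a minimal Python sketch. It is not the thesis implementation, and it rests on several assumptions. The "maximization segmentation strategy" is read here as forward maximum matching against a word dictionary. The inverted index is kept in memory rather than in a database. The unspecified "similarity matching algorithm" is replaced by cosine similarity over raw term frequencies. All function names, the `max_len` parameter, and the sample dictionary and documents are hypothetical.

```python
import math
from collections import Counter, defaultdict


def forward_max_match(text, dictionary, max_len=4):
    """Greedy forward maximum matching (assumed reading of the abstract).

    At each position, take the longest dictionary word of at most max_len
    characters; fall back to a single character if nothing matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


def build_inverted_index(docs, dictionary):
    """Map each term to {doc_id: term frequency} (in memory, not a database)."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        counts = Counter(forward_max_match(text, dictionary))
        for term, freq in counts.items():
            index[term][doc_id] = freq
    return index


def search(query, index, dictionary):
    """Rank documents by cosine similarity of term-frequency vectors.

    Cosine similarity is an illustrative stand-in; the abstract does not
    name the similarity measure it uses.
    """
    q_vec = Counter(forward_max_match(query, dictionary))
    # Squared vector length of every indexed document.
    doc_norm_sq = defaultdict(int)
    for postings in index.values():
        for doc_id, freq in postings.items():
            doc_norm_sq[doc_id] += freq * freq
    # Dot product between the query and each document sharing a term.
    dot = defaultdict(int)
    for term, q_freq in q_vec.items():
        for doc_id, freq in index.get(term, {}).items():
            dot[doc_id] += q_freq * freq
    q_norm = math.sqrt(sum(f * f for f in q_vec.values()))
    scores = {
        doc_id: value / (math.sqrt(doc_norm_sq[doc_id]) * q_norm)
        for doc_id, value in dot.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical sample dictionary and documents, for illustration only.
    words = {"搜索", "引擎", "搜索引擎", "垂直", "中文", "分词", "索引"}
    pages = {1: "垂直搜索引擎", 2: "中文分词索引", 3: "搜索引擎索引"}
    idx = build_inverted_index(pages, words)
    for doc_id, score in search("搜索引擎索引", idx, words):
        print(doc_id, round(score, 3))
```

Forward maximum matching is simple and fast but can mis-segment ambiguous strings, and raw term frequencies ignore how common a term is across the collection. A fuller system would likely weight terms (for example with TF-IDF) and persist the postings to a database, as the abstract describes.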
Keywords/Search Tags: Search engine, Professional web spider, Chinese word segmentation, Index