The thesis studied the key technologies to the realization of vertical search,research and analysis the two core modules, the theme spiders and indexing technology from details in-depth. In the design of spider, through theme relevance search strategy combined of content and link analysis, and improve theme-related degrees through content-based evaluation, improve the theme resources coverage though the evaluation of link structure, so as to improve performance of spider effectively; from structure of the index file itself, adopting a classification inverted table to index of the organizational structure, and improve the traditional implementation program of inverted indexing table,improved index created efficiency; Finally, combine indexing technology of open source project Nutch, design and implement of a digital information theme-based vertical search engine, through the relevant tests, verify the system can provide users with more complete and accurate the theme information inquiring services. |