Design And Implementation Of Vertical Search Engine Based On The Subject Resources Of Web

Posted on: 2013-06-13 | Degree: Master | Type: Thesis
Country: China | Candidate: M Zhang | Full Text: PDF
GTID: 2248330371491499 | Subject: Education Technology
Abstract/Summary:
With the rapid development of Internet technology and applications, the volume of information on the web is growing rapidly. It includes a great deal of valuable subject resources for teaching and scientific research, which have brought great convenience to schools' teaching and research. However, it is difficult to find these resources quickly and accurately among massive, heterogeneous information using general-purpose search engines such as Google and Baidu. A vertical search engine meets this need: it is a specialized search engine targeted at a particular field, and it can provide users with higher-quality subject resources.

This paper takes the subject resources of the field of Educational Technology as an example. Based on an analysis of search engine methods and algorithms, it builds an initial architecture for a vertical search engine based on the subject resources of the web. Following that architecture, it explains the more important modules in detail and then implements the system by extending open source components such as Lucene and Heritrix, together with a domain ontology of subject resources and text-classification techniques. The purpose of the system is to help the majority of users in the subject field obtain valuable subject resources more quickly and more accurately. The main work of this paper is as follows.

(1) This paper studies the theory related to vertical search engines and analyzes their overall architecture.

(2) This paper studies the key issues of a vertical search engine system based on the subject resources of the web. It builds an ontology of subject resources, which is used in the information-collection and information-retrieval modules to improve the recall and precision of the search engine. It classifies the search results with a new text-classification technique, which helps users obtain valuable subject resources more efficiently. It proposes a new information-filtering algorithm based on web links and relevance to the theme. It visualizes the search results using Java AWT and ontology-access techniques, which presents implicit, internally linked knowledge to users.

(3) This paper studies the design and implementation of a vertical search engine based on the subject resources of the web. It first analyzes the general idea of the system, then designs the system architecture, functional modules, and database in detail, and finally implements the main functional modules.

The special features of this paper are the ontology of subject resources used in the information-collection and information-retrieval modules, the new text-classification technique for search results, the new information-filtering algorithm based on web links and relevance to the theme, and the visualization of search results with Java AWT and ontology-access techniques.
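The abstract does not spell out the information-filtering algorithm, so the following is only an illustrative sketch of the general idea of filtering links by topical relevance, not the thesis's method. It assumes a small hypothetical term set (`TOPIC_TERMS`) standing in for the subject ontology, a simple term-fraction relevance measure, and hypothetical `alpha` and `threshold` parameters that weight the linking page's content against the link's anchor text.

```python
import re

# Hypothetical topic vocabulary; in the thesis this role is played by the
# subject ontology, whose contents are not given in the abstract.
TOPIC_TERMS = {"educational", "technology", "instructional",
               "design", "e-learning", "multimedia"}


def tokenize(text):
    """Split text into lowercase word tokens (hyphens kept inside words)."""
    return re.findall(r"[a-z][a-z\-]*", text.lower())


def topic_relevance(text, terms=TOPIC_TERMS):
    """Fraction of tokens in `text` that are topic terms (0.0 for empty text)."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in terms)
    return hits / len(tokens)


def link_score(page_text, anchor_text, alpha=0.5):
    """Weighted mix of the linking page's relevance and the anchor text's relevance."""
    return (alpha * topic_relevance(page_text)
            + (1 - alpha) * topic_relevance(anchor_text))


def filter_links(page_text, links, threshold=0.5, alpha=0.5):
    """Keep the URLs of (url, anchor_text) pairs whose score reaches the threshold."""
    return [url for url, anchor in links
            if link_score(page_text, anchor, alpha) >= threshold]
```

In a focused crawler, a filter of this kind would sit between link extraction and the crawl frontier, so that only links judged on-topic are queued for download. A real system would likely replace the term-fraction measure with a weighted or ontology-aware similarity.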
Keywords/Search Tags: vertical search engine, ontology, text categorization, information filtering, visualization