Font Size: a A A

Intelligent Search Engine Based On Thematic Information Technology Research,

Posted on:2005-08-07Degree:MasterType:Thesis
Country:ChinaCandidate:Y H XuFull Text:PDF
GTID:2208360152957197Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
With the rapid development of the Internet, especially with the astonishingly rapid development of the World Wide Web technology, computer network has become the largest distributed information and knowledge database in the world. This development has on the one hand provided a good platform for the co-construction and sharing of information resources, but on the other hand greatly increased the cost for information retrieving and knowledge acquiring, making it more and more difficult to search and get information, and has even caused such problems and phenomena as "rich data, but poor information", "getting lost in information", "information overloading", etc. As a tool for information searching on the net, searching engines, after many years of development and improvement, have brought great convenience for science workers, especially for professional information servers in retrieving and acquiring information on the Internet. Meanwhile, they are doubtlessly facing great challenges. To meet them, researches and studies on the key technologies of intelligent searching engines have been conducted in this article, which have focused on the problems and correspondent solutions in developing such intelligent searching engines as used for special topic information service to meet the requirement of science workers and of the work done in special scientific libraries.The current situation and characteristics of network resources, deficiencies of searching engines, and breakthroughs needing to be made in information service are first analyzed here. It is pointed out that to develop individually-specified, actively-serving, topic-oriented intelligent searching engines is the trend for developing a new generation of searching engines. Then we mainly discuss current hot technologies concerned in developing intelligent searching engines and their recent advancement, which include web information searching technology, web information extracting technology, web information retrieving technology, web clustering technology and searching engine evaluation technology, etc. An intelligent searching engine framework has been for information service center on special topics by referring to serving practice and the development trends of future searching engines. Centering on the intelligent system, special topic knowledge base, user knowledge base and information recommending modules have been added to strengthen individually-specified, actively-serving functions of searching engines.Based on the characteristics of special topic searching engines, the updating strategies those engines should adopt is described here, and an information searching strategy that is proper for special topic searching engine has been proposed, the purpose of which is to ensure that network robots will be able to download and update webpage information with high efficiency under existing hardware conditions. Meanwhile, with the practical work in mind, we have focused on webpage information extracting technology, which is based on semantic structure, and introduced with great detail two kinds of webpage information extracting technologies, one based upon formatted semantic structure, and the other upon text file semantic structure.
Keywords/Search Tags:Search engine, Web information retrieval, Special topic information service, Information extraction, Web data mining
PDF Full Text Request
Related items