
Research On Algorithms Library In Information Retrieval System

Posted on: 2007-10-25
Degree: Master
Type: Thesis
Country: China
Candidate: X L Hao
Full Text: PDF
GTID: 2178360212480002
Subject: Computer application technology
Abstract/Summary:
With the rapid development of the World Wide Web, the way in which people acquire useful information has changed drastically. More and more people use the Web to live, work and study, and information retrieval has become an essential element of the Web. When searching with a search engine, people want to obtain the information most relevant to them by eliminating the irrelevant, and what they obtain should also be the most valuable to them. These needs gave rise to Web data mining. However, there are many types of algorithms, each suited to a specific case, so choosing the best one for processing the data becomes a problem.

In this paper, we develop an algorithms library system that provides algorithm support for the information retrieval system and supplies the framework for function calls and management. Based on reflection and a meta-object protocol, we separate the concerns of specific functions from those of system control. Using the meta-object protocol, new meta-objects can be added to the system, which makes it easy to extend.

When choosing the algorithms to implement, we analyze the characteristics of the data to be processed in our project and propose the concept of segment matching to address the defects of existing methods. We use this concept to calculate the similarity between two trees. Throughout the clustering process, we equip each cluster with an XML cluster representative, which subsumes the most typical structural features of a set of XML documents. We also give the algorithm for constructing the representative. Clustering is then accomplished by comparing cluster representatives and updating the representatives as soon as new clusters are detected.

In addition, this paper describes a GUI system developed by the author, which is used to display a demo call process directly.
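
The representative-based clustering described in the abstract can be illustrated with a minimal sketch. The abstract does not define segment matching or the representative-construction algorithm, so this sketch substitutes simpler stand-ins and should not be read as the author's method: tree similarity is approximated by Jaccard similarity over root-to-node tag paths, and a cluster representative is the set of paths that occur in at least a given fraction of the cluster's members. All names here (`label_paths`, `similarity`, `Cluster`, `cluster_documents`, `threshold`, `min_support`) are hypothetical.

```python
import xml.etree.ElementTree as ET


def label_paths(xml_text):
    """Return the set of root-to-node tag paths of an XML document."""
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        path = prefix + "/" + node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def similarity(a, b):
    """Jaccard similarity of two path sets (a stand-in for segment matching)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


class Cluster:
    """A cluster of documents summarized by a structural representative."""

    def __init__(self, paths):
        self.members = [paths]
        self.representative = set(paths)

    def add(self, paths, min_support=0.5):
        """Add a document and rebuild the representative (assumed rule:
        keep paths present in at least min_support of the members)."""
        self.members.append(paths)
        counts = {}
        for member in self.members:
            for p in member:
                counts[p] = counts.get(p, 0) + 1
        need = min_support * len(self.members)
        self.representative = {p for p, c in counts.items() if c >= need}


def cluster_documents(docs, threshold=0.5):
    """Assign each document to the most similar representative,
    or open a new cluster when no representative is similar enough."""
    clusters = []
    for text in docs:
        paths = label_paths(text)
        best, best_sim = None, 0.0
        for cluster in clusters:
            s = similarity(paths, cluster.representative)
            if s > best_sim:
                best, best_sim = cluster, s
        if best is not None and best_sim >= threshold:
            best.add(paths)  # representative is updated here
        else:
            clusters.append(Cluster(paths))
    return clusters
```

Comparing each incoming document only against cluster representatives, rather than against every member, keeps the number of similarity computations proportional to the number of clusters. The threshold and support values above are arbitrary illustration values.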
Keywords/Search Tags: Web Data Mining, Document Clustering, Document Classification, Information Retrieval, Reflection, Meta-Object Protocol