Font Size: a A A

The Research Of Personalized Information Retrieval System Based On XML

Posted on:2008-01-25Degree:MasterType:Thesis
Country:ChinaCandidate:G M CaiFull Text:PDF
GTID:2178360215986669Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
At present, people are confronting with the problems of inefficient inquiry in acquiring information and 'information bewilderment' in Internet, and their personalized requirements are growing day by day. To deal with them, this paper proposes a personalized information retrieval model based on XML for Web and researches the key algorithm in this field. This research is an important issue in information retrieval and is of important theory significance and practical significance.At first, a great number of search engine system and main algorithms at home and abroad are researched in the paper. Then the main structure and existed problems of search engine system are analyzed, based on these researches, the primary algorithms and technologies of the personalized information retrieval system are researched. In order to improve the performance of search engine, the three aspects around building of user's model and the system structure of the personalized search engine are studied as follows:(1)According to statistically analyzing the user's behavior features from the log file of TianWang(e.pku.edu.cn), the search word and the search process are pointed out to be relatively stable, then the user's model based on behavior features and the relevant algorithms are proposed.(2)Based on analyzing the basic structure of search engine, the basic structure of personalized system realization is proposed, and the key technologies of personalized system realization are analyzed.(3)In the process of constructing the personalized engine prototype, combining the statistical rules, the achieving method to increase the rate of search accuracy is determined, the information retreating strategies are improved, the page cleaning and reducing-repetition algorithms are optimized, and a new method with single word to construct Chinese words library is proposed. Meanwhile, combining the user's model, the relevant correlation analysis methods are improved, and which application area are broadened.This prototype system is proved to be feasible and effective by theoretical analysis and experimental results.
Keywords/Search Tags:personalized, user's model, information retrieval, relevant analyzing
PDF Full Text Request
Related items