Font Size: a A A

Study On Web Data Mining

Posted on:2004-07-02Degree:MasterType:Thesis
Country:ChinaCandidate:C M ZhangFull Text:PDF
GTID:2168360095961968Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Data Mining is fairly a new communicational technology that has been developed with the technology of database and Artificial Intelligence. Data Mining tries to extract the unknown, effective and useful knowledge from data. On one hand. Data Mining technology has a close relationship with Database technology, statistics and KDD; On the other hand, they are quite different. Data Mining mainly studies on research Generalization Knowledge, Association Knowledge, Classification Knowledge, Clustering Knowledge, Prediction Knowledge, and Deviation Knowledge. In the data mining, the technologies of associative analysis, classification, clustering have been used.As Web information is of great amount, strong orderlessness, high repeatability, people cannot get the information they need from Web quickly and conveniently. Web mining is the traditional data mining technology used in Web, attempting to find implicative, unknown, and non-trivial schema which has potential application from the innumerable Web file assembly and the data information which can be gotten when the user browse Web. Web using schema mining gets the interesting schema from the data the user browsed, and apprehend the user's browse interest behavior, in order to improve the Website's structure or provide individual service for the user.This paper is dedicated to Web schema mining's data acquisition mode, the measurement and expressing of user's browse interest, and the main tasks are as follows:1.Analysing the present data acquisition fashion of Web schema mining, pointing out the shortage of the present data acquisition fashion, For example, because the non-state link of HTTP, it is difficult to get exact information of user's browse from Web log; proposing a method which comprehensively use the service log file and the client end data to get the user's browse information.2.The interest is the selectivity attitude of objective matter of a person, and measuring user's browse interest exactly is the base of Web schema mining. According to the filed of Web usage schema mining, this paperanalyses the present the shortage of the style of measure and expresses the browsing interest of user. For instance, the too simple measure fashion often leads to difficulty of distribution which is the user interested in or not; not considering the page information amount's influence on the user's browse time and so on. As a result, point out a method based on user's browse behavior to measure the user's browse interest.3.One of the direction of using mode dining studying in Web is how to express user' browse interest effectively. In this paper, we gives a kind of expressing user' browser interest mode which is based on tree-type structure.The method based on user's browse behavior and expressing the user's browse interest in this paper improves the shortage of indigenous measurement and expresses the mode in data collection, interest measuring and interest expressing aspects, it can prepared for the further mining work better.
Keywords/Search Tags:data mining, Web mining, browse interest, personal recommendation.
PDF Full Text Request
Related items