Font Size: a A A

The Research And Implement Of Methods On Web Usage Mining

Posted on:2005-10-07Degree:MasterType:Thesis
Country:ChinaCandidate:X D WangFull Text:PDF
GTID:2168360152455977Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Web mining is the hot research issue which combines various technologies and methods between data mining and WWW. In general, Web mining includes three research domain: Web Content Mining, Web Structure Mining and Web Usage Mining. In these areas, web usage mining aims at the rule discovery of sites' visitors browsing behaviors, the improvement of sites' structure and the linkage structure among pages, the enhancement on the quality of web services and the decision support on client relationship management of the e-commerce. On the basis of the introduction of the development survey of web usage mining, the thesis discusses the procedure of web usage mining and some technologies relevant to each phrase in web usage mining. The main work and novel ideas of the thesis are showed as following: The description of the definition, taxonomy and classification of web mining, and main content in each research area of web mining; The description of the definition, procedure of web usage mining and the exploration of the research content and related technologies in every phrase of web usage mining; We give a novel session-constructed method, which is the Time-and-Referrer-based Heuristic Method. It not only uses the time characteristic of session between users and web sites, but also considers the users' browsing characteristic. Thus, it facilitates the mining of users' frequent access patterns to some extent; In Chapter Four, we put forward a revised algorithm, which is the FAP-Mining algorithm, based on the FP-pattern growth algorithm to mine frequent access patterns; The algorithm can be used to discover access patterns of all types of users and frequent access patterns according to the support threshold value decided by experts; The design and development of web usage mining experimental system.This system consists of four function modules: Data Cleaning Module, Session Construction Module, Web Traffic Analysis, and Access Pattern Mining Module. Session construction module implements the Time-and-Referrer-based heuristic method and compares it to other popular session-constructed heuristic methods; Web traffic module analyses the general access profile of a web site; Access pattern mining module fulfills the FAP-Mining algorithm.
Keywords/Search Tags:Knowledge Discovery in Databases, Data Mining, Web Mining, Web Usage Mining, Session Construction, Frequent Access Pattern
PDF Full Text Request
Related items