Font Size: a A A

The Research Of User Traversal Sequential Patterns Based On Web Log

Posted on:2010-07-25Degree:MasterType:Thesis
Country:ChinaCandidate:D ZuoFull Text:PDF
GTID:2178360275489593Subject:Computing applications technology
Abstract/Summary:PDF Full Text Request
As the Internet grows, Web became an effective platform on which people communicate and manage. A mass of data is stored in it. Because of enormous information, it has become more and more difficult to discover useful information to every user.In order to solve that problem, application of data mining techniques to Internet, Web mining emerges. Web usages mining is one of the most important research directions in the web mining research field. The aim of it is to find out user traversal patterns of web sites. It will help us to improve the site's structure and provide the better service to the users.This paper researches how to mine the user traversal patterns based on web log. To avoid the huge of candidate patterns during user access pattern mining, we present a new algorithm UAP-miner (User Access Pattern mining) for user access pattern mining. The algorithm facilitates the tedious support counting and candidate generating operations in the mining procedure. UAP-tree (user access pattern tree) is used to register user access sequence and corresponding counts, so that the tedious support counting can be avoided. Once the UAP-tree is built, all the remaining mining processing is based on the UAP-tree. The original access database is not needed any more; an efficient recursive algorithm is proposed to find user access pattern from UAP-tree. No candidate generation is required in the mining procedure.In the end, the algorithm is to validate using the trial data.
Keywords/Search Tags:Web mining, Web usage mining, user access sequence pattern
PDF Full Text Request
Related items