
Study On Web Log Mining Based On Associated Rule

Posted on: 2008-07-30
Degree: Master
Type: Thesis
Country: China
Candidate: X R Wang
GTID: 2178360242971533
Subject: Computer software and theory
Abstract/Summary:
Data mining is the process of extracting implicit, previously unknown, and useful knowledge from large amounts of data. The growth and rapid spread of the World Wide Web have made it clear how vast the available data is. Faced with such enormous data resources, people urgently need new techniques and automated tools to turn the data into useful information and knowledge. Such techniques should capture not only the surface-level information in the data but also, on the basis of a full understanding of the data, the implicit information and the inherent relations between data attributes, that is, the important knowledge. Web mining offers a powerful means of transforming vast amounts of data into useful information and knowledge.

This thesis focuses on how to use Web mining to analyze logs and obtain users' access patterns for a website, and then proposes a recommendation technique that helps users access the site efficiently and helps improve the site's topology.

The main research work of this thesis is as follows:
i) So that the Web log file provides an accurate data source for the various mining algorithms, the thesis studies data preprocessing and discusses some of the problems involved.
ii) To improve the preprocessing quality of raw Web logs, the thesis discusses techniques for extracting website structure information.
iii) Building on the Apriori algorithm, the thesis studies candidate sequences and presents a graph-based algorithm for generating frequent patterns from candidate sequences, called SCG.
iv) A prototype Web log mining system is designed and implemented, which verifies the feasibility and applicability of the SCG algorithm.

This thesis applies Web log mining to extract users' access patterns from the access log and turns the mined knowledge into website intelligence. Research on user access patterns helps improve the quality of a site's information services and promotes the development of intelligent information processing, and it is significant in both theory and practice.
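The abstract names Apriori as the starting point for the SCG algorithm but does not describe SCG itself. As a rough illustration of the underlying idea only, the following is a minimal sketch of generic Apriori-style frequent page-set mining over user sessions. It is not the thesis's SCG algorithm: it ignores the order of page visits, and the function name `frequent_page_sets`, the `min_support` parameter, and the assumption that sessions are already preprocessed into sets of page URLs are all hypothetical choices made for this example.

```python
from collections import Counter
from itertools import combinations


def frequent_page_sets(sessions, min_support):
    """Return {page set: support count} for all sets meeting min_support.

    `sessions` is an iterable of page collections, one per user session
    (assumed to come from an already-preprocessed Web log).
    """
    sessions = [frozenset(s) for s in sessions]

    # Level 1: count how many sessions contain each single page.
    counts = Counter(frozenset([page]) for s in sessions for page in s)
    current = {c: n for c, n in counts.items() if n >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: merge pairs of frequent (k-1)-sets that form a k-set.
        candidates = {a | b for a, b in combinations(current, 2)
                      if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(sub) in current
                             for sub in combinations(c, k - 1))}
        # Count support of the surviving candidates against the sessions.
        counts = {c: sum(1 for s in sessions if c <= s) for c in candidates}
        current = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(current)
        k += 1
    return frequent
```

Called with a list of sessions such as `[{"/", "/a"}, {"/", "/a", "/b"}]` and `min_support=2`, the function returns each page set that appears in at least two sessions together with its count. A sequence-based method such as the one the thesis describes would additionally take the order of page visits into account.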
Keywords/Search Tags: Data mining, Data preprocessing, Frequent item, Web log mining, User access patterns