The Research On Self-adaptive Websites Recommendation System Framework Based-on Web Log Mining

Posted on:2006-02-11

Degree:Master

Type:Thesis

Country:China

Candidate:J X Dai

Full Text:PDF

GTID:2178360182970143

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

As Websites continue growing in size and complexity, a large number of web log records have been collected on WWW server, the results of web log mining have become critical for a number of applications, such as the design of websites system, business and marketing decision support, and the personalization of websites. Self-Adaptive Websites Recommendation System is a application of data mining techniques to Web log data in order to make the Website easy to browse. It can predict the quantity and hobby of future user, and offer the basis of making a serials of wise policy for e-commerce enterprises. In this paper, a Self-Adaptive Websites Recommendation System framework named SAWRS(Self-Adaptive Websites Recommendation System) is presented. It is divided into two parts: the offline part and online part, the former deal with the data collection, pre-process, the frequently accessing mode mining;the latter produces a recommendation collection according to existed mining rule of the offline part and current user accessing behavior, then realizing the adaptive online recommendation services;the main work is as follows:Analyzes data pre-process based on the server log, it is the most essential part in the web mining process, its result directly affects the accuracy and confidence level of the mining algorithm processing results, mainly including below stage: Data Cleaning, User Identification, Session Identificationand and Path Completion, Transaction Identification. This paper realizes the web log data pre-process using the highly effective algorithm of the newest pre-process research results.The Apriori algorithm using iterate searching method by the level, each time iterates produces massive k-item candidate collection, it affected the algorithm efficiency to carry out badly, this article has put forward a optimization method aim at this shortcoming of the algorithm, its core thought is judges its k-1 dimension subset whether existing in k-1 dimension frequent itemset when produced the K-dimension candidate collection, If it existing in it then counts adds 1, otherwise, deletes it directly, if its final counts amount to k then it is the k-dimension frequent itemset. so only need to search k-1 item collection one time when produced a K-item collection every time, it enhanced the algorithm efficiency greatly. The theoretical analysis and the experiment proves this method obviously surpasses the original algorithm.In order to enables the system to automatically realizing self-adaptive recommendation, this paper gain current user accessing path by using the sliding window method, then the utilization association rule algorithm bases on gathers tree's to produce association rule set, after obtains association rule set, the theoretical analysis and experimental result indicated this method is effective and feasible.

Keywords/Search Tags:

Data Mining, Web log Mining, Online Recommendation, Association Rules, Frequent Item Set

PDF Full Text Request

Related items

1	Research Of Paralleled Frequent Subgragh Mining Algorithm PG-Miner Based On Claster Environment
2	Search Of Algorithms For Mining Maximum Frequent Item-sets
3	Researches And Applications Of The Association Rule Mining Algorithms Based On Spleen Feebleness Diagnosis
4	Research On Correlative Algorithms Of Association Rule Mining
5	Study Of Data Mining Based On The Information System In Goverment
6	Research On Improving Algorithms Of Association Rules Mining
7	Research And Application Of A New Algorithm Based On Several Algorithms Of Mining Association Rules
8	Based On The Maximum Frequent Set Data Mining Association Rules Algorithm
9	The Research On The Algorithm Of Mining Quantitative Association Rules
10	Research Of Personalized Recommendation Model Based On Web Log Mining And Association Rules