Application Research Of Association Rules In Cross Selling Of Banks

Posted on:2017-01-31

Degree:Master

Type:Thesis

Country:China

Candidate:D D Li

Full Text:PDF

GTID:2348330503468125

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Association rule is one of the most important research directions in data mining. With the continuous development of the internet and data storage technology, association rule is getting more attention and wide used in various fields. Apriori is the classic algorithm in the mining of associate rule. It was proposed by R.Agrawal in 1994 and achieved by an iterative method based on layer search, which contains two steps of connection and pruning. The algorithm is simple and easy to implement, which can get different associate rules by different support degree and confidence degree. However there are some disadvantages of Apriori algorithm. A large number of candidate sets will be generated in the connection step, thus the database need to be scanned multiple times to calculate the support level, which will affect the efficiency and adaptability of the algorithm when the data itself is big or the amount of the association rules is master.In this paper, we have studied the association rules and Apriori algorithm based on the background of the cross selling in banks. And finally we get an improved combination algorithm. The algorithm mainly consists of two parts, which are data preprocessing and generation of association rule. The first of data preprocessing should do is to process the data which is incomplete, noisy and inconsistent into another form that meets the standards of data mining. The basic methods of data preprocessing are introduced in this paper, due to the data we used here has been processed, what we should do is to make the data become discrete and digital according to the characteristics of the property of the data. Data normalization is only part of the data preprocessing, and in the big data set, not all information is useful. Then the next we will introduce an iterative algorithm based on the approximate mass of collections to reduce the attribute set. In every iteration, we should ensure the approximate mass is not smaller than before, until we get the smallest reduction sets. During the generation of association rules, we used a method of vertical format data distribution, which were added an idea of greedy in this paper. In this way, the number of intersection in the process of getting frequent item sets is reduced, accordingly, the storage is reduced.The main advantage of the algorithm in this paper is to obtain the association rules with the smaller attribute set and less storage space, which doesn't affect the ability of classification decision. It's obvious in the analysis of the experimental results that there is no difference in the number of the association rules between the algorithm in this paper and the Apriori algorithm, while in the efficiency of the execution time, the algorithm is better than Apriori algorithm.

Keywords/Search Tags:

Association Rules, Cross-selling, Attribute reduction, Data preprocessing, Apriori algorithm

PDF Full Text Request

Related items

1	The Analysis And Optimization Of Association Rules Algorithm In Data Mining
2	The Research On The Algorithm And Application Of Association Rules Based On Logistics Information
3	The Study Of Parallel Algorithm For Mining Association Rules And Application In Medicine Selling System
4	The Applied Research Of Association Rules Mining Based On Colony Algorithm In Marketing
5	The Research On The Fast Class Association Rules Algorithm
6	The Research And Implementation Of Association Rules Algorithms-Apriori Based On Cloud Computing
7	Income-selling-store Management System Based On The Cmmi Specification And Ssh Framework
8	Research And Application Of Apriori Mining Algorithm Based On Association Rules In Colleges Management System
9	Model Design, Based On The Apriori Algorithm And Olap Association Rules Mining
10	Research On The Apriori Algorithms For Meteorological Data Association Rules Analysis Based On Cloud Computing