Research On The Problem Of Association Rule Mining In Incomplete Relational Database

Posted on:2002-06-09

Degree:Master

Type:Thesis

Country:China

Candidate:W Xiong

Full Text:PDF

GTID:2168360032453454

Subject:Circuits and Systems

Abstract/Summary:

Data Mining is a field of increasing interest combining database, artificial intelligence and machine learning. The purpose of data mining is to facilitate understanding large amounts of data by intelligently and automatically discovering useful information and knowledge. Discovery of Association Rules is an interesting subfield of data mining. Historically, the problem of association rule mining was first introduced by Agrawal etc.in 1993, the motivation for it came from the desire to analyze large amounts of supermarket basket data(et customer transaction database ) and find some regularities for business decisions. The information maintained for the different transactions is the sets of items bought by each customer. In this case, it may be desirable to find how the purchase behavior of one item affects the purchase behavior of another. Association rules help in find such relationships accurately. Many researchers have shown great interest in the problem. They have made deeply research in the area. Meanwhile, the problem of association rule mining has been extended and applied to many other field, such as telecommunication, finance etc, which all have achieved good effect. Missing value have not been considered of interest since the problem of association rule mining was first introduced because association rules have been first developed to explore transaction database where the problem of missing values does not practically exist. However this problem becomes important if we try to find associations between values of different attributes in relational database where missing values are often inevitable. It is not obvious how to compute association rules from such incomplete database. In this paper, several typical algorithms of bool association rule mining, MS algorithm, SETM algorithm, Apriori algorithm and DIC algorithm, are introduced in detail and compared; the classical flI ~ ~t7~ THESIS method and key techniques including the discretion of quantitative attribute, the interestingness of association rules, and the processing of counting support of candidates during scanning database, are also introduced in detail; a new method of mining association rule in relational database based on equivalence class in rough set theory is presented, in which the problem of association rule mining, the support and confidence of the association rule are redefined; by investigating properties of incomplete relational database, the association rule induced from an incomplete relational database and how to estimate support and confidence of it are introduced; Meanwhile, the definitions of expected support and confidence of an association rule is presented, which guarantee some required properties of association rules mining; several previous solutions to the problem of missing attribute values have been discussed, by analyzing the shortcoming of these solutions and combining the defines above , a new define and method of association rule mining in incomplete database are presented finally.

Keywords/Search Tags:

knowledge discovery, data mining, association rule, transaction database, relational database, equivalence class, missing values

Related items

1	The Applied Research In Bibliographic Retrieval System Base On Knowledge Discovery In Database
2	Algorithm Optimization Research And Application Of Association Rule In Data Mining
3	Research And Application Of Association Rule In Data Mining
4	Association Rules In Relational Database Mining Algorithm
5	Study On Parallel For Association Rules Mining
6	Research The Application Of Data Mining On Military Cadres' Culturing
7	The Design And Application Of Association Mining Matrix Algorithm Based On Equivalence Class
8	The Research Of Mining Algorithms About Association Rules Based On Relational Database
9	Research On Key Technologies About Data Mining In CIMS Database
10	Research Of Association Rules Mining Algorithms Based On Relational Database