The discovery of multiple-level profile association rules

Posted on:2003-01-16

Degree:Ph.D

Type:Dissertation

University:The University of Mississippi

Candidate:Bland, Charles Earl

Full Text:PDF

GTID:1468390011481912

Subject:Computer Science

Abstract/Summary:

Knowledge discovery in databases (KDD) or data mining is the field of study concerned with developing methods capable of efficiently analyzing very large datasets. Our research is focused on an area of data mining known as association discovery. A commonly used method for identifying associations is association rules. An association rule is a rule of the form A → B, where A and B are sets of items. This rule implies that when A occurs in a dataset B will also occur, with a certain probability. The traditional association rule problem can be extended to finding associations in that they present different views of data, giving insight that may not be possible with traditional association rules.; Techniques for discovering association rules have traditionally focused on identifying relationships between items describing some aspect of human behavior, usually buying behavior for determining items that customers purchase together. More than ever before, organizations are collecting personal information (profile information) associated with customer behavior. Considering this trend, in this study we take on the problem of incorporating profile information into association rule discovery. In addition, we study this problem at multiple In generating multiple-level profile association rules, we were faced with two major problems: (1) representing knowledge at multiple levels mining dense datasets, which result from the inclusion of profile items. The first problem was addressed using a markup language known as XML to partition data hierarchically according to some user-specified categorization of dataset items. We addressed the second problem by introducing a new method for compressing transactional data using a bit representation. Our compression technique allowed fairly large datasets to fit into memory. This eliminated the need for multiple dataset scans for discovering association rules, resulting in faster processing time. We tested our design on two real-world datasets. Our design resulted in a significant reduction of dataset size and faster generation of association rules. We also demonstrated that multiple-level profile association rules are a useful way of understanding data.

Keywords/Search Tags:

Association rules, Discovery, Data

Related items

1	The Research And Realization Of Association Rules Teachniques Facing The ERP System
2	The Research & Implement For Mining Association Rules Of Definite Semanteme
3	Data Mining Techniques And Algorithms For Mining Association Rules
4	Research On Algorithm Of Mining Association Rules
5	Study On Hybrid-Recommended Techniques For Papers Based On Community Discovery And Association Rules
6	Association Rules Detecting Based On Attribute Topology
7	Study Of Maintenance Algorithm For Association Rule
8	The Research Of The Frequent Item Sets Discovery Algorithm Of Association Rules Data Mining
9	Research On The Optimization Of Association Rules
10	Research On The Knowledge Discovery In Conceptual Hierarchy Knowledge Base Based On The Multiple-Level Association Rules