Transaction Data Attached To The Problem Of Clustering Research

Posted on:2004-06-26

Degree:Master

Type:Thesis

Country:China

Candidate:Q R Fan

Full Text:PDF

GTID:2208360125952070

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

Clustering is the process of grouping the data into classes or clusters so that objects within a cluster have high similarity in comparison to one another, but are very dissimilar to objects in other clusters. Clustering analysis has been studied extensively, and many methods were found to solve various kinds of problems. Clustering algorithms can be divided into hard or fuzzy. A hard clustering algorithm allocates each pattern to a single cluster during its operation and in its output. A fuzzy clustering method assigns degrees of membership in several clusters to each input patterns, the degrees of membership are between 0 and 1. A fuzzy clustering can be converted to a hard clustering by assigning each pattern to the cluster with the largest measure of membership.But in some cases, a pattern can be allocated to more than one cluster. In this thesis we call it multi-subjected clustering. For numerical data, fuzzy clustering algorithms can be used to solve this kind of problems, but new algorithms need to be developed to solve the problem of multi-subjected clustering on transaction data or categorical data.With focus on transaction data, three algorithms were developed to solve multi-subjected clustering problems. There are frequent-items based algorithm, SLR-based algorithm and link-based algorithm. These algorithms can also be used on categorical data if the data were preprocessed.

Keywords/Search Tags:

clustering, data mining, multi-subjected clustering, transaction data, frequent itemsets, SLR, link

PDF Full Text Request

Related items

1	Research On Key Algorithms For Mining Frequent Patterns In Data Streams And Their Application In Simulation System
2	Research On Key Algorithms For Mining Frequent Patterns In Data Streams And Their Application
3	The Research And Implementation Of Mining Frequent Itemsets Algorithm Over Streaming Data
4	Study On Key Technologies Of Frequent Items Mining And Clustering On Data Streams
5	FP-Tree Based Mining Frequent Itemsets Over Data Streams
6	Frequent Itemsets Mining Algorithm And Its Application In Data Flow
7	Research On Multi-stream Frequent Item Set Mining Algorithm
8	Research On Algorithm For Mining Frequent Itemsets Of Uncertain Data
9	Research And Implementation On Frequent Itemsets Mining Algorithms In Uncertain Data Streams Environment
10	Research Of Frequent Itemsets Mining Algorithm With Differential Privacy For Large-scale Data