Study of document clustering using the k-means algorithm

Posted on:2007-03-14

Degree:M.S

Type:Thesis

University:University of Nevada, Las Vegas

Candidate:Gummuluru, Meghna Sharma

Full Text:PDF

GTID:2448390005974884

Subject:Computer Science

Abstract/Summary:


Document clustering, also called unsupervised document classification, is one of the most commonly used data mining techniques. It groups documents according to a document similarity function. This thesis addresses research issues in categorizing documents with the k-means clustering algorithm, which partitions objects into K groups based on document representations and similarities. The hypothesis of this thesis is that unsupervised clustering of a set of documents produces results similar to those of their supervised categorization.
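The idea in the abstract can be illustrated with a small sketch. The abstract does not say which document representation, similarity function, or evaluation measure the thesis uses, so everything below is an assumption made for illustration: unit-length term-frequency vectors, cosine similarity, seed documents chosen by hand, and purity as one common way to compare clusters against supervised category labels. The function names (`tf_vector`, `kmeans`, `purity`) and the toy documents are hypothetical.

```python
import math
from collections import Counter


def tf_vector(text):
    """Represent a non-empty document as a unit-length term-frequency vector."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {term: c / norm for term, c in counts.items()}


def cosine(a, b):
    """Cosine similarity of two unit-length sparse vectors (their dot product)."""
    return sum(w * b.get(term, 0.0) for term, w in a.items())


def centroid(vectors):
    """Sum the member vectors and rescale the result to unit length."""
    total = Counter()
    for v in vectors:
        for term, w in v.items():
            total[term] += w
    norm = math.sqrt(sum(w * w for w in total.values()))
    return {term: w / norm for term, w in total.items()}


def kmeans(vectors, seeds, max_iter=100):
    """Cluster vectors into K = len(seeds) groups.

    seeds holds the indices of the documents used as initial centroids
    (an assumption of this sketch; random initialisation is also common).
    """
    centroids = [vectors[i] for i in seeds]
    assignment = None
    for _ in range(max_iter):
        # Assignment step: each document joins its most similar centroid.
        new_assignment = [
            max(range(len(centroids)), key=lambda c: cosine(v, centroids[c]))
            for v in vectors
        ]
        if new_assignment == assignment:
            break  # no document changed cluster, so the result is stable
        assignment = new_assignment
        # Update step: recompute each centroid from its current members.
        for c in range(len(centroids)):
            members = [v for v, a in zip(vectors, assignment) if a == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = centroid(members)
    return assignment


def purity(assignment, labels):
    """Fraction of documents that carry the majority label of their cluster."""
    by_cluster = {}
    for a, y in zip(assignment, labels):
        by_cluster.setdefault(a, Counter())[y] += 1
    return sum(max(c.values()) for c in by_cluster.values()) / len(labels)


# Toy corpus with hand-assigned (supervised) categories, invented for this sketch.
docs = [
    "stock market shares trading",
    "market shares investors stock",
    "football match goal team",
    "team goal football league",
]
labels = ["finance", "finance", "sport", "sport"]

vectors = [tf_vector(d) for d in docs]
assignment = kmeans(vectors, seeds=[0, 2])
print(assignment, purity(assignment, labels))
```

On this toy corpus the clusters coincide with the hand-assigned categories, which is the kind of agreement the hypothesis is about. Real collections would normally need a weighting scheme such as TF-IDF, stop-word removal, and several random restarts, none of which are shown here.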

Keywords/Search Tags:

Document clustering, k-means


Related items

1	Research On Document Clustering Algorithm Based On K-means
2	Study of document clustering using the k-means algorithm
3	Multi-document Summarization Based On Improved Fuzzy C-means Clustering Algorithm
4	A Distributed Indexing Method Of Large Scale Document Set Based On Clustering
5	Research Of Document Clustering For User Interest
6	Document Topic Clustering Analysis Based On Improved K-means Method
7	Research On Efficient Document Clustering Using Improvised Sub-Document Based Framework
8	A Study Of Chinese Multi-document Summarization Based On Adaptive Clustering Algorithm
9	The Application Research Of Incremental Clustering For Document Update Summarization
10	Research On Web Document Clustering Approaches Based On Phrase Features