With the development of information technology, electronic texts have become more and more popular as source of information. People need some tool to find resource and knowledge from the Web urgently. Text mining had become a new promising research subject, especially in Text clustering, in recent years.This paper makes deeply research on the theory of Clustering first. In this section the paper discusses the basic concepts in Clustering in the form of mathematics and then introduces several useful Clustering algorithms such as K-means, DASCAN, SOM and makes a compare in theory.Then this paper is dedicated in studying Text Clustering, which is a special application of Clustering in Text mining. In this section the paper discusses the techniques to change the unorganized text data to organized data and some Text clustering algorithm based on feature vector.At last, this paper presents a Text Clustering model and then make a simple design to realize it based on a kind of K-means clustering algorithm.
|