Font Size: a A A

Research And Application Of WaveCluster

Posted on:2007-10-14Degree:MasterType:Thesis
Country:ChinaCandidate:L BiFull Text:PDF
GTID:2178360212457547Subject:Software engineering
Abstract/Summary:PDF Full Text Request
Along with the development of global communication, the number of storage of database suddenly increase, in order to mine the important information hiding behind the large data, the study of database mining technology become a instant require. Clustering analysis, which is an important data mining problem, groups the data into classes or clusters so that objects within a cluster have high similarity in comparison to one another, but are very dissimilar to objects in other clusters. It has been widely used in numerous applications, including pattern recognition, data analysis and image processing. Clustering analysis technology can help people rapidly find the required information, it is very significant in real world.Traditional clustering methods can work efficiently in low dimensional data. In high dimensional data, however, efficiency and effect of traditional clustering methods are not well because of data sparsity, distance similarity and more outlier in the data. In this paper, based on analyzing every kinds of classical clustering method, it is emphasized on studying and improving the wavelet-based clustering technology (short for WaveCluster). The proposed approach is very efficient, the computational complexity of detecting clusters in our method is O(N). The results are not affected by noise and the method is not sensitive to the order of input objects to be processed. WaveCluster is well capable of finding arbitrary-shape clusters with complex structures at different scales, and does not assume any specific shape for the clusters. A priori knowledge about the exact number of clusters is not required in WaveCluster. However, an estimation of expected number of clusters helps in choosing the appropriate result of clusters. Then, in allusion to the low dimensional property of WaveCluster, improve the method. The improved method can using in high dimensional clustering, and then analysis the time complexity and space complexity. The result of experiment can prove the effectivity of improved method.In the end, it is using the improved method in the Intrusion Detection System, in allusion to the detection rate, the false alarm rate, and provided with computer simulations. The experiment result proves that the improved method can effectively enhance the detection rate and reduce the false alarm rate.
Keywords/Search Tags:Data mining, Clustering analysis, Wavelet transform, Intrusion Detection
PDF Full Text Request
Related items