Mining Research, Based On The Integration Algorithm Of The K-nearest Neighbor Classification

Posted on:2011-04-18

Degree:Master

Type:Thesis

Country:China

Candidate:L Y Sun

Full Text:PDF

GTID:2208360305959817

Subject:Computer application technology

Abstract/Summary:

With the rapid development of database technology and Internet technology, data mining technology has been further development and widespread concern. Meanwhile, classification data mining as an important research content has been widely used in pattern recognition, artificial intelligence and knowledge engineering. Therefore, researching into the subject not only has important theoretical significance, but also has important applications in reality.The thesis contains the following aspects:1. An overview of classification technology and analysis of the main classification algorithm, focuses on the principle of k nearest neighbor classification algorithm and the development present.2. An improved combination k nearest neighbors method based on simulation annealing is proposed, which introduce the simulated annealing technology to achieve random feature subset selection, and then use Vote Act to decide the final output of combination classifier. It is shown that the classification performance is better than the traditional k nearest neighbor algorithm from the simulation experiment.3. In view of the search process of simulated annealing algorithm is random, the classic simulated annealing algorithm stopping criterion does not ensure the quality of solutions, the improved simulated annealing algorithm is introduced. On the basis, the combination k nearest neighbors method based on improved simulated annealing is further proposed. The simulation experimental shows that the combination k nearest neighbors method based on improved simulated annealing has better classification performance than the one of the combination k nearest neighbors method based on traditional simulation annealing.4. A new fast k nearest neighbor algorithm based on the fuzzy-rough sets is proposed, taking into account fuzzy and rough uncertainty due to the overlapping classes and the attribute insufficiency, introducing p-tree data structure to improve the traditional k nearest neighbor method. With the traditional k nearest neighbor method and fuzzy k neighbor classifier comparison shows that the method can not only improve the classification performance, but also can improve the classifier speed. The simulation experiment has proven the method's validity and feasibility.

Keywords/Search Tags:

Classification, K Nearest Neighbor, Simulated Annealing, Fuzzy Set, Rough Set

Related items

1	Study On Generalized Nearest Neighbor Pattern Classification
2	Evolutionary Extreme Learning Machine Based Feature Weighted Nearest Neighbor Classification Algorithm
3	Prediction Of Moving Objects' K-Nearest Neighbor Based On Fuzzy-Rough Sets
4	Research On Feature Selection And Classification Method Under Multiple Kernel Fuzzy Rough Set
5	Research On Several Pattern Classification Methods Based On K-nearest Neighbor Criterion
6	Research On Trojan Horse Behavior Detection Technology Based On Speed-up K Nearest Neighbor Algorithm
7	Data Distribution Guided Fuzzy-rough Nearest Neighbour Algorithm
8	Text Classification Model Based On Fuzzy-Rough Sets Theory
9	Nearest Neighbor Classification Improved Algorithm
10	Study On Classification Algorithm Based On Natural Nearest Neighbor