Comparison Of Data Mining Methods Based On The Data From A Home Environment Test Of Patients With Parkinsonâ€™s Disease

Posted on:2015-01-29

Degree:Master

Type:Thesis

Country:China

Candidate:B W Yang

Full Text:PDF

GTID:2308330464951878

Subject:Statistics

Abstract/Summary:

This paper is to apply the data mining methods to predict the symptom severity and cause in data from a test battery for Parkinson patients and compare them in order to find the best model. We have two data sets, one is the cause data set, and the other is severity data set. We apply classification methods on cause data set, while we apply prediction methods on severity data set.We apply four different data mining methods for classification. They are Decision Trees, Random Forests, SVM and KNN method. Decision Trees only use 9 input variables from total 18 input variables and have high accuracy rate. Random Forests use all 18 input variables and its accuracy rate is always higher than Decision Trees. SVM have high accuracy rate, but it is still lower than Decision Trees and Random Forests. The accuracy rate of KNN method is the lowest among the four data mining methods. The accuracy rate of Random Forests is the highest among the four data mining methodsWe apply four different data mining methods for prediction. They are Decision Trees, Random Forests, GLM and MLP method. Decision Trees only use 6 input variables from total 18 input variables and have high accuracy rate. Random Forests use all 18 input variables and its accuracy rate is always higher than Decision Trees. GLM is statistical modeling method and have high accuracy rate. MLP is very famous artificial intelligence method and the accuracy rates are always high. The accuracy rate of MLP is the highest among the four data mining methods...

Keywords/Search Tags:

Data mining, Decision tree, Support vector machine, Parkinsonâ€™s disease

Related items

1	The Research And Application Of The Assessment System Of Suppliers Based On The SVM And Decision Tree Theory
2	Analysis And Application Of Telecommunications Data Based On Support Vector Machine And Decision Tree
3	Exporation And Research Of ODM Data Mining To Forest Management In Tahe
4	Application Of Data Mining In The Analysis Of Family Insurance Purchase Behavior In China
5	The Research On Maintenance And Decision Support System For Electric Power Plant Equipments Based On Data Mining
6	Using K-Mean And SVM To Build Hybrid Methodology To Classify Diseases
7	Study On Application Credit Scorecards In The Retail Bank Based On Data Mining Technology
8	Data Analysis Technology In The Power Plant Unit In The Process Of Excellence Selection Research And Application
9	Research On Classification Method Of Random Support Vector Machine And Its Application
10	The Application Of SVM To Decision Tree Induction