Class-imbalance Issue In Applying Multi-label Learning To The Study Of Parkinson In Traditional Chinese Medicine Diagnosis

Posted on:2017-03-29

Degree:Master

Type:Thesis

Country:China

Candidate:Y Q Xiao

Full Text:PDF

GTID:2308330485460895

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Parkinsonâ€™s Disease (PD) is a common chronic central nervous system degeneration disease in middle aged and elderly people. Traditional Chinese Medicine (TCM) has a long history of research on Parkinsonâ€™s disease and also has different opinions about Parkinsonâ€™s disease syndrome. Through years of experience in TCM, provisions of Parkinsonâ€™s disease described by the five kinds of syndromes. Chinese medicine believes that Parkinson patients with up to two types of syndrome which contains the primary and secondary points. In order to standardize the process of traditional Chinese medicine diagnosis of Parkinsonâ€™s disease, the modern Chinese medicine has put forward the Parkinson TCM scale, which covers the clinical symptoms associated with Parkinsonâ€™s disease. However, for how to infer the syndrome from the scale of the symptoms, the Chinese medicine is still unable to reach a consensus, the diagnosis is still dominated by experience.In this paper, multi label learning is applied to the diagnosis of Parkinson in Chinese medicine, to explore the relationship between symptoms and syndromes, trying to provide assistance for the diagnosis process. The main work of this paper:1). In order to apply multi-label learning to the field of traditional Chinese medicine diagnosis of Parkinson, the inquiry of the scale treated as attributes, corresponding syndromes as label. In the process, there is serious imbalance problem in Parkinson data set.2). Aiming at the problem of lack of data representation of little class sample in multi-label imbalance issue, an ADAptive Multi-Label Synthetic Minority Over-sampling Technique(ADA-MLSMOTE) is proposed base on ideas of the distinct contribution of the sample and the abnormal data sample filter. The algorithm solves multi-label imbalance problem well from the data level.3). Aiming at the impact of label relevance to the multi-label imbalance issue, an algorithm named Random K-labelset Under Random Ensemble sample (RKUE) is proposed, which based on the ideas of dividing the whole label set into a plurality of sub sets and integrating multi under-sampling data sets. The experimental results show that compared with the existing multi-label algorithm, the algorithm can better solve the unbalanced phenomenon.

Keywords/Search Tags:

multi-label classification, multi-label class imbalance, over-sample algorithm, multi-label ensemble algorithm

PDF Full Text Request

Related items

1	Study Of The Classification Method Of Imbalanced Multi-Label Data Based On Label Correlation
2	Multi-label Prediction Model Based On Ontology Database And Data Mining In Bio-medicine
3	Multi-label Classification Research Based On Label-specific Features And Label Correlation
4	A Multi-label Classification Algorithm Based On Label Correlation And Class Imbalance
5	Research On Multi-label Classification Algorithm Based On Label Relationship
6	Research On Data Imbalance Of Multi Label Classification
7	Imbalanced Multi-label Learning Algorithm Based On Density Label Space
8	Research On Multi-label Classification Algorithms Based On Samples And Property Analysis
9	Research On Multi-label Classification Algorithm And Its Application In Compound Facial Expression Recognition
10	Research On The Multi-label Feature Selection And Classification Methods With The Label Correlations