Performance measures of machine learning

Posted on:2007-11-24

Degree:Ph.D

Type:Thesis

University:The University of Western Ontario (Canada)

Candidate:Huang, Jin

Full Text:PDF

GTID:2458390005990166

Subject:Computer Science

Abstract/Summary:

This thesis investigates some fundamental issues of performance measures of machine learning.;We first formally propose criteria to compare performance measures. These criteria focus on studying the consistency relationship between two measures, and whether one measure has more discriminatory power than the other. Based on the proposed criteria, we theoretically and empirically compare two most popular measures: accuracy and AUC (Area Under the ROC Curve). We show that AUC is statistically consistent and more discriminant than accuracy, which indicates that AUC should be preferred over accuracy in evaluating learning algorithms. We also compare ranking measures and give a preference order to use these measures in comparing ranking performance.;Based on the comparison criteria, we propose two general approaches to construct new measures from existing measures. We formally prove that the new measures are consistent and more discriminant than the existing ones. We also compare the learning models of artificial neural networks trained with the newly constructed measures and existing measures. The experiments show that the model trained with the newly constructed measure outperforms the models trained with the existing measures.;Finally, we explore model selection tasks using measures. We show that generally we should use different measures as model selection goal and evaluation measures. We show that a measure's model selection ability is stable to model selection goal and class distributions. We find that some measures perform better than others in the model selection tasks.;Performance measures (or evaluation measures) play important roles in machine learning. They are not only used as the the criteria to evaluate learning algorithms, but also used as the heuristics to construct learning models. However, little work has been done to thoroughly explore the characteristics of performance measures.;In summary, this thesis addresses several fundamental issues of machine learning measures. The research results are very useful in real world applications. It provides the guidance on how to select suitable measures to evaluate learning algorithms. Furthermore, it also presents general approaches to construct new measures efficiently and effectively, which provides new approaches in building learning models.;Keywords. performance measures, threshold measures, ranking measures, probability-based measures, comparison criteria, constructing measures, model selection.

Keywords/Search Tags:

Measures, Machine learning, Model selection, Criteria, Trained with the newly constructed, Fundamental issues, Compare, Evaluate learning algorithms

Related items

1	Dimension Reduction In A Number Of Issues
2	Semi-analytical method for analyzing models and model selection measures
3	Research On Model Selection For Machine Learning
4	Development Of A Machine Learning Algorithm To Compare RAMS Standards Documents
5	Analysing Correctness Of Implementations Of Machine Learning Algorithms By Machine Learning
6	A Research On Several Fundamental Problems And Core Algorithms Of Manifold Learning
7	Research On Feature Selection Algorithms In Machine Learning
8	Research On Internet Financial Risk Control Model Based On Machine Learning Algorithms
9	The Research On Language Analysis Of Social E-commerce Based On Machine Learning
10	Research And Realization Of Digitial Information Processing Technology Based On Machine Learning