Cost-sensitive performance of probability estimation-based classifiers: Analysis and practice

Posted on:2007-11-18

Degree:Ph.D

Type:Dissertation

University:Stanford University

Candidate:Robinson, Deirdre B. O'Brien

GTID:1458390005980880

Subject:Statistics

Abstract/Summary:

Many schemes for classification are built using probability estimators that estimate the probability that a sample belongs to each of a given set of classes. For tasks that are cost-sensitive, in the sense that different classification errors carry different costs, probability estimation-based classifiers provide an elegant means of incorporating misclassification costs into the design of algorithms. Very often these probability estimators rely on both assumptions and simplifications that lead to inaccuracies in the estimates. Nonetheless, high classification accuracy can be achieved using inaccurate estimates of probabilities. However, for cost-sensitive classification it is the expected loss rather than the expected number of errors that is of primary concern, and the inaccuracies in probability estimation can adversely impact cost-sensitive classifier performance. When classification involves just two classes, there are simple but effective schemes to reduce the effects of these inaccuracies.

I describe how inaccurate probability estimates affect cost-sensitive classification in multi-class problems. I present an extension of bias/variance decomposition to multi-class cost-sensitive classification tasks. From this I show how two-class bias reduction schemes can be extended to multi-class problems. I also explore the effects of uncertainties in misclassification costs on the expected loss and show how to improve classifier performance when the exact misclassification costs are not known at training time. The schemes presented here improve classification for both two-class and multi-class tasks.
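
The way a probability estimation-based classifier incorporates misclassification costs can be illustrated with the standard minimum-expected-cost decision rule. The following is a minimal sketch under stated assumptions, not code or data from the dissertation: the function name `min_expected_cost_class`, the cost matrix, and the probability estimates are all hypothetical, and the convention that `cost[i, j]` is the cost of predicting class `i` when the true class is `j` is assumed here for illustration.

```python
import numpy as np


def min_expected_cost_class(probs, cost):
    """Pick the class with the smallest expected misclassification cost.

    probs : estimated P(true class = j | x), shape (n_classes,)
    cost  : cost[i, j] = cost of predicting i when the truth is j
            (assumed convention for this sketch)
    """
    probs = np.asarray(probs, dtype=float)
    cost = np.asarray(cost, dtype=float)
    # expected[i] = sum_j cost[i, j] * probs[j]
    expected = cost @ probs
    return int(np.argmin(expected)), expected


# Hypothetical three-class problem: missing class 1 is ten times
# as costly as any other error.
cost = np.array([
    [0.0, 10.0, 1.0],
    [1.0, 0.0, 1.0],
    [1.0, 10.0, 0.0],
])
probs = [0.60, 0.15, 0.25]  # hypothetical probability estimates

most_probable = int(np.argmax(probs))
chosen, expected = min_expected_cost_class(probs, cost)
print("most probable class:", most_probable)
print("minimum expected cost class:", chosen)
print("expected costs:", expected)
```

In this example the most probable class is 0, but the cost-sensitive rule selects class 1 because its expected cost (0.85) is lower than that of class 0 (1.75) or class 2 (2.10). Since the decision depends on the estimated probabilities themselves and not only on their ranking, errors in those estimates can change the chosen class even when the most probable class is identified correctly.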

Keywords/Search Tags:

Probability, Classification, Cost-sensitive, Schemes, Performance, Multi-class

Related items

1	Multi-class Cost-sensitive Learning Based On Decision-making Rough Set Model
2	Cost Sensitive Classification Algorithm Based On Progep
3	Research On Multi-View Classification With Cost-Sensitive
4	Study On Class Imbalance Problem In Multi-Label Image Classification
5	Fast Multi-label Text Classification Algorithm Based On Cost Sensitive
6	Research On Cost-sensitive Multi-label Classification Algorithms And Applications To Tag Recommendations
7	D-MetaCost: An Efficient Multi-class Cost-sensitive Algorithm
8	Research Of Ensemble Classification Methods For Class-imbalance And Cost-sensitive Datasets
9	Research And Implementation Of Cost-sensitive Multi-class Malicious Web Page Identification System
10	Study Of Multi-label Class Imbalance Classification Based On Extreme Learning Machine