
A comparison of methods for learning cost-sensitive classifiers

Posted on: 2011-04-12
Degree: M.S
Type: Thesis
University: University of California, San Diego
Candidate: Green, Michael T
Full Text: PDF
GTID: 2448390002951718
Subject: Artificial Intelligence
Abstract/Summary:
There is a significant body of research in machine learning on techniques for solving classification problems where the sole objective is to minimize the error rate (i.e., the costs of misclassification are assumed to be symmetric). More recent research has proposed a variety of approaches for classification problems where the costs of misclassification are not uniform. Many of these approaches make algorithm-specific modifications to algorithms that previously focused only on minimizing the error rate. Others are general methods that transform an arbitrary error-rate-focused classifier into a cost-sensitive classifier. While prior research has demonstrated that many of these general approaches improve the performance of arbitrary algorithms over their cost-insensitive counterparts, there has been relatively little examination of how well they perform relative to one another. We describe and categorize three general methods for converting a cost-insensitive classifier into a cost-sensitive one. Each method is capable of example-dependent cost-sensitive classification. We then present an empirical comparison of their performance on the KDD98 and DMEF2 data sets. Our results show that costing, a technique that uses the misclassification cost of individual examples to create re-weighted training data subsets, appears to outperform the alternative methods on the DMEF2 data when an increased number of re-sampled subsets is used. However, the differences between the methods are not statistically significant on either data set.
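The abstract's description of costing — using each example's misclassification cost to draw re-weighted training subsets, then training a base learner on each subset — can be sketched as cost-proportionate rejection sampling. The sketch below is an illustration under stated assumptions, not the thesis's implementation: the function names, the majority-vote aggregation, and the toy majority-class base learner are all placeholders for whatever base classifier is actually used.

```python
import numpy as np

def costing_resample(X, y, costs, rng):
    # Rejection sampling: accept example i with probability costs[i] / max(costs),
    # so high-cost examples are over-represented in the subset.
    z = costs.max()
    keep = rng.random(len(costs)) < costs / z
    return X[keep], y[keep]

def costing_ensemble(X, y, costs, fit_base, n_subsets=10, seed=0):
    # Train one base classifier per cost-proportionate subset; because the cost
    # re-weighting happens in the sampling, the final vote is unweighted.
    rng = np.random.default_rng(seed)
    models = [fit_base(*costing_resample(X, y, costs, rng))
              for _ in range(n_subsets)]

    def predict(Xq):
        votes = np.mean([m(Xq) for m in models], axis=0)
        return (votes >= 0.5).astype(int)

    return predict

def fit_majority(Xs, ys):
    # Deliberately trivial base learner (predicts the subset's majority class);
    # in practice this would be any error-rate-minimizing classifier.
    label = int(ys.mean() >= 0.5)
    return lambda Xq: np.full(len(Xq), label)
```

With a rare class whose misclassification cost is 50 times that of the common class, each resampled subset is dominated by the rare class, so the ensemble flips its prediction even though the plain base learner would always predict the majority class. This shows the mechanism: cost-sensitivity is achieved purely through the sampling, with no change to the base algorithm.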
Keywords/Search Tags: Methods, Cost-sensitive, Data