
Methods for cost-sensitive learning

Posted on: 2003-07-18
Degree: Ph.D.
Type: Dissertation
University: Oregon State University
Candidate: Margineantu, Dragos Dorin
Full Text: PDF
GTID: 1468390011985391
Subject: Computer Science
Abstract/Summary:
Many approaches for achieving intelligent behavior of automated (computer) systems involve components that learn from past experience. This dissertation studies computational methods for learning from examples, for classification and for decision making, when the decisions have different non-zero costs associated with them. Many practical applications of learning algorithms, including transaction monitoring, fraud detection, intrusion detection, and medical diagnosis, have such non-uniform costs, and there is a great need for new methods that can handle them.

This dissertation discusses two approaches to cost-sensitive classification: input data weighting and conditional density estimation. The first method assigns a weight to each training example in order to force the learning algorithm (which is otherwise unchanged) to pay more attention to examples with higher misclassification costs. The dissertation discusses several different weighting methods and concludes that a method that gives higher weight to examples from rarer classes works quite well. Another algorithm that gave good results was a wrapper method that applies Powell's gradient-free algorithm to optimize the input weights.

The second approach to cost-sensitive classification is conditional density estimation. In this approach, the output of the learning algorithm is a classifier that estimates, for a new data point, the probability that it belongs to each of the classes. These probability estimates can be combined with a cost matrix to make decisions that minimize the expected cost. The dissertation presents a new algorithm, bagged lazy option trees (B-LOTs), that gives better probability estimates than any previous method based on decision trees.

In order to evaluate cost-sensitive classification methods, appropriate statistical procedures are needed. The dissertation presents two new ones: BCOST provides a confidence interval on the expected cost of a classifier, and BDELTACOST provides a confidence interval on the difference in expected costs of two classifiers. These procedures are applied to a large set of experimental studies to evaluate and compare the cost-sensitive methods presented in this dissertation.

Finally, the dissertation describes the application of B-LOTs to the problem of predicting the stability of river channels. In this study, B-LOTs were shown to be superior to other methods in cases where the classes have very different frequencies, a situation that arises frequently in cost-sensitive classification problems.
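As a concrete illustration of two ideas summarized above, the minimum-expected-cost decision rule (combining class-probability estimates with a cost matrix) and cost-proportional weighting of examples from rare classes, the following Python sketch may be helpful. It is illustrative only: the cost matrix values, the rarity_weights formula, and the function names are assumptions made for this example, not the dissertation's actual algorithms or the B-LOT implementation.

    import numpy as np

    # Hypothetical cost matrix C[true_class, predicted_class]; the values are
    # illustrative, not taken from the dissertation.
    C = np.array([
        [0.0, 10.0],   # true class 0: correct costs 0, false positive costs 10
        [50.0, 0.0],   # true class 1: false negative costs 50, correct costs 0
    ])

    def min_expected_cost_predict(probs, cost_matrix):
        """Pick, for each example, the class with the lowest expected cost.

        probs: (n_examples, n_classes) class-probability estimates from any
               probabilistic classifier (the dissertation uses bagged lazy
               option trees; any estimator works for this sketch).
        cost_matrix: (n_classes, n_classes) with cost_matrix[true, predicted].
        """
        # expected_cost[i, k] = sum_j probs[i, j] * cost_matrix[j, k]
        expected_cost = probs @ cost_matrix
        return expected_cost.argmin(axis=1)

    def rarity_weights(y, cost_matrix):
        """Weight each training example by its class's worst misclassification
        cost divided by the class frequency, so rare and expensive classes get
        more attention. This is one plausible weighting scheme in the spirit of
        the abstract; the exact formula is an assumption.
        """
        classes, counts = np.unique(y, return_counts=True)
        freq = dict(zip(classes, counts / len(y)))
        worst = {c: cost_matrix[c].max() for c in classes}
        w = np.array([worst[c] / freq[c] for c in y])
        return w / w.mean()   # normalize so the average weight is 1

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        probs = rng.dirichlet([2.0, 1.0], size=5)   # fake probability estimates
        print(min_expected_cost_predict(probs, C))  # cost-sensitive decisions
        y_train = np.array([0, 0, 0, 0, 1])         # imbalanced toy labels
        print(rarity_weights(y_train, C))           # class 1 gets larger weight

Any probabilistic classifier can supply the probs array; the dissertation's contributions lie in how those probabilities are estimated (B-LOTs) and how the resulting costs are evaluated statistically (BCOST and BDELTACOST), neither of which is reproduced here.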
Keywords/Search Tags: Methods, Cost-sensitive, Dissertation