Improving large margin classifiers using relationships among samples

Posted on:2010-06-04

Degree:Ph.D

Type:Thesis

University:Northeastern University

Candidate:Vural, Volkan

Full Text:PDF

GTID:2445390002475728

Subject:Engineering

Abstract/Summary:

Support vector machine (SVM) is a powerful supervised classification algorithm that has been successful in many real-world problems such as text categorization, face recognition, and applications in bioinformatics and computer-aided diagnosis. Although SVM is popular and accurate, it has some limitations as well. In this thesis, we focus on three major limitations of SVM and introduce various algorithms that utilize the relationships among samples to overcome these issues.;Firstly, a limitation of SVM and that of supervised learning algorithms in general is that they only learn from labeled data. However, in many domains, labeled instances are typically costly to obtain. This is particularly true for the medical domains that motivate our research, where labels are assigned via time-consuming manual review by physicians. We introduce a number of methods where we take advantage of the relationships between labeled and unlabeled data (also known as semi-supervised learning) and incorporate the information hidden in the unlabeled data into SVM.;Secondly, most classification systems assume that the data used to train and test the classifier are drawn from an independent and identically distributed (i.i.d.) underlying distribution. Nevertheless, this assumption is commonly violated in many real-life problems where sub-groups of samples have a high degree of correlation amongst both their features and their labels. Here, we introduce approaches that relax the i.i.d. assumption in support vector machines.;Finally, another limitation of standard SVM is that it is designed for binary classification. Yet, many real-world applications have more than two categories. In this thesis, we design different algorithms to extend SVM to multi-class problems pursuing the following two goals: (1) efficiency in terms of training and testing times, and (2) increased accuracy by exploiting the information hidden in inter-class relationships.

Keywords/Search Tags:

SVM, Relationships

Related items

1	The Influence Of Parent-Child Relationships And Peer Relationships On Senior High School Students' Romantic Relationships
2	Parent-young Relationships And Self Differentiation: Moderating Of Peer Relationships And Romantic Relationships
3	The Research Of The Effects Of College Students' Realistic Interpersonal Relationships On Their Mental Health: From A Perspective Of Internet Interpersonal Relationships
4	Wired to Bond: The Influence of Computer-Mediated Communication on Relationships
5	Essays on ongoing buyer-seller relationships in business markets
6	Tracing social-ecologial relationships: Ha`ena, Kaua`i, Hawai`
7	Emotional Blackmail Within Couple Relationships in Hong Kon
8	Effects of job type and culture on relationships between job characteristics and worker outcomes: A multilevel analysis
9	Meeting online friends: Personal relationships in the 21st century
10	Our shared kingdom at risk: Human-lion relationships in the 21st century (Tanzania)