Font Size: a A A

On L(1)-norm multi-class support vector machines: Methodology, theory and applications

Posted on:2007-06-16Degree:Ph.DType:Dissertation
University:University of MinnesotaCandidate:Wang, LifengFull Text:PDF
GTID:1458390005484933Subject:Statistics
Abstract/Summary:
Binary Support Vector Machines have proven to deliver high performance. In multi-class classification, however, issues remain with respect to variable selection. One challenging issue is classification and variable selection in presence of a large number of variables in the magnitude of thousands, which greatly exceeds the size of training sample. This often occurs in genomics classification. To meet the challenge, we propose a novel multi-class support vector machine, which performs classification and variable selection simultaneously through an L1-norm penalized sparse representation. The proposed methodology, together with the developed regularization solution path, permits variable selection in such a situation. For the proposed methodology, a statistical learning theory is developed to quantify the generalization error, where the number of variables is allowed to grow much faster than the sample size. The operating characteristics of the methodology are examined via both simulated and benchmark data, and are compared against some competitors in terms of accuracy of prediction. The numerical results suggest that the proposed methodology is highly competitive.
Keywords/Search Tags:Support vector, Methodology, Multi-class, Variable selection, Classification
Related items