
Selected machine learning reductions

Posted on: 2015-02-10
Degree: Ph.D.
Type: Thesis
University: Columbia University
Candidate: Choromanska, Anna
Full Text: PDF
GTID: 2478390017499708
Subject: Computer Science
Abstract/Summary:
Machine learning is a field of science that aims to extract knowledge from data. Optimization lies at the core of machine learning, since many learning problems are formulated as optimization problems in which the goal is to minimize or maximize an objective function. More complex machine learning problems are then often solved by reducing them to simpler sub-problems that known optimization techniques can handle. This dissertation addresses two elements of the machine learning system 'pipeline': designing efficient basic optimization tools tailored to specific learning problems, that is, to optimizing a specific objective function, and creating more elaborate learning tools whose sub-blocks are essentially optimization solvers equipped with such basic tools. In the first part of this thesis we focus on a very specific learning problem in which the objective function, whether convex or non-convex, involves minimizing the partition function, the normalizer of a distribution, as is the case in conditional random fields (CRFs) and log-linear models. We propose a tight quadratic bound on the partition function whose parameters are easily recovered by a simple algorithm that we also propose. The bound gives rise to a family of new optimization learning algorithms based on bound majorization (we developed batch variants, both full-rank and low-rank, as well as a semi-stochastic variant) with a linear convergence rate that compete successfully with state-of-the-art techniques, among them gradient descent methods and Newton and quasi-Newton methods such as L-BFGS. The only constraint we introduce is that the number of classes be finite and enumerable. The bound majorization method is also the first reduction scheme discussed in this thesis; throughout, by 'reduction' we mean a learning approach or algorithmic technique that converts a complex machine learning problem into a set of simpler problems (a set that can be as small as a single problem).

In the second part we develop two more sophisticated machine learning tools for solving harder learning problems. These tools are built from basic optimization sub-blocks tailored to simpler optimization sub-problems. We first address the multi-class classification problem in which the number of classes is very large; we reduce it to a set of simpler sub-problems that we solve using basic optimization methods performing additive updates on the parameter vector. We then address the problem of learning a data representation when the data are unlabeled for any classification task; we again reduce it to a set of simpler sub-problems solved with basic optimization methods, but this time the parameter vector is updated multiplicatively. In both problems we assume that the data arrive in a stream, which may even be infinite. The thesis then gives a more specific description of each of these problems and of our approach to solving them.
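To make the bound-majorization idea concrete, the following minimal Python sketch (illustrative only, not code from the thesis) shows how a quadratic upper bound on the partition function of a log-linear model over finitely many classes can be assembled class by class and then used in a single majorization step for maximum-likelihood estimation. The tanh-based curvature recursion is modeled on the published Jebara-Choromanska partition-function bound; all function and variable names here are hypothetical.

import numpy as np

def quadratic_bound(theta_tilde, features, h=None, eps=1e-300):
    # Returns (z, mu, Sigma) such that, for all theta,
    #   Z(theta) <= z * exp((theta - theta_tilde)^T mu
    #                       + 0.5 * (theta - theta_tilde)^T Sigma (theta - theta_tilde)),
    # where Z(theta) = sum_y h(y) * exp(theta^T f(y)) over a finite class set.
    n, d = features.shape
    if h is None:
        h = np.ones(n)
    z = eps                      # running partition value (starts near zero)
    mu = np.zeros(d)             # running first-order (gradient) term
    Sigma = np.zeros((d, d))     # running second-order (curvature) term
    for y in range(n):
        f = features[y]
        alpha = h[y] * np.exp(f @ theta_tilde)
        l = f - mu
        r = np.log(alpha) - np.log(z)
        # tanh(r/2) / (2r) has a removable singularity at r = 0 with limit 1/4
        c = 0.25 if abs(r) < 1e-8 else np.tanh(0.5 * r) / (2.0 * r)
        Sigma += c * np.outer(l, l)
        mu += (alpha / (z + alpha)) * l
        z += alpha
    return z, mu, Sigma

def majorization_step(theta, features, y_obs, h=None):
    # One bound-majorization step for the negative log-likelihood
    #   -log p(y_obs | theta) = log Z(theta) - theta^T f(y_obs):
    # minimize the quadratic surrogate in closed form.
    z, mu, Sigma = quadratic_bound(theta, features, h)
    Sigma = Sigma + 1e-8 * np.eye(len(theta))   # small ridge for stability
    return theta - np.linalg.solve(Sigma, mu - features[y_obs])

Iterating majorization_step drives the likelihood monotonically upward, since each step minimizes an upper bound on the objective that is tight at the current iterate; this is the basic mechanism behind the batch variants mentioned above.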
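The contrast between the two update styles used in the second part can likewise be illustrated schematically: an additive (gradient-descent style) step, as in the large-multi-class reduction, versus a multiplicative (exponentiated-gradient style) step renormalized to the probability simplex, as in the representation-learning reduction. This is a generic sketch of the two update rules, not the thesis algorithms themselves; all names are hypothetical.

import numpy as np

def streaming_updates(grad_stream, d, lr=0.1):
    # w_add: unconstrained parameter vector, updated additively
    # w_mul: nonnegative normalized vector, updated multiplicatively
    w_add = np.zeros(d)
    w_mul = np.full(d, 1.0 / d)
    for g in grad_stream:              # the stream may, in principle, be infinite
        w_add -= lr * g                # additive update
        w_mul *= np.exp(-lr * g)       # multiplicative update...
        w_mul /= w_mul.sum()           # ...projected back onto the simplex
    return w_add, w_mul

The multiplicative rule preserves nonnegativity and normalization by construction, which is why it suits parameter vectors that represent distributions or partitions, whereas the additive rule leaves the parameter vector unconstrained.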
Keywords/Search Tags: Machine learning, Optimization, Problem, Specific, Data