Non-convex Optimization in Machine Learning: Provable Guarantees Using Tensor Methods

Posted on:2017-04-30

Degree:Ph.D

Type:Dissertation

University:University of California, Irvine

Candidate:Janzamin, Majid

Full Text:PDF

GTID:1458390008452967

Subject:Computer Science

Abstract/Summary:

PDF Full Text Request

In the last decade, machine learning algorithms have been substantially developed and they have gained tremendous empirical success. But, there is limited theoretical understanding about this success. Most real learning problems can be formulated as non-convex optimization problems which are difficult to analyze due to the existence of several local optimal solutions. In this dissertation, we provide simple and efficient algorithms for learning some probabilistic models with provable guarantees on the performance of the algorithm. We particularly focus on analyzing tensor methods which entail non-convex optimization. Furthermore, our main focus is on challenging overcomplete models. Although many existing approaches for learning probabilistic models fail in the challenging overcomplete regime, we provide scalable algorithms for learning such models with low computational and statistical complexity.;In probabilistic modeling, the underlying structure which describes the observed variables can be represented by latent variables. In the overcomplete models, these hidden underlying structures are in a higher dimension compared to the dimension of observed variables. A wide range of applications such as speech and image are well-described by overcomplete models. In this dissertation, we propose and analyze overcomplete tensor decomposition methods and exploit them for learning several latent representations and latent variable models in the unsupervised setting. This include models such as multiview mixture model, Gaussian mixtures, Independent Component Analysis, and Sparse Coding (Dictionary Learning). Since latent variables are not observed, we also have the identifiability issue in latent variable modeling and characterizing latent representations. We also propose sufficient conditions for identifiability of overcomplete topic models. In addition to unsupervised setting, we adapt the tensor techniques to supervised setting for learning neural networks and mixtures of generalized linear models.

Keywords/Search Tags:

Tensor, Non-convex optimization, Models

PDF Full Text Request

Related items

1	Research On Tensor Robust Principal Component Algorithm
2	Research On Weighted Non-convex Regularized Low Rank Tensor Recovery Model
3	Research On Nonconvex Models,Algorithms And Error Bounds For Low-Rank And Sparse Tensor Completion
4	Research On Theories And Applications Of Nonconvex Low-tubal-rank Tensor Model
5	Research On Modeling And Algorithm Of High-dimensional Image Processing Based On Tensor Decomposition
6	Convex Optimization Algorithms and Recovery Theories for Sparse Models in Machine Learning
7	Distributed Parallel Optimization Of Tensor Networks Based On Cloud Computin
8	Models And Algorithms For Image Restoration And Low Rank Tensor Approximation
9	Research On Models And Algorithms For Low-tubal-rank Tensor Recovery
10	Intonation recognition models based on convex optimizations