High Dimensional Learning with Structure Inducing Constraints and Regularizers

Posted on:2018-07-07

Degree:Ph.D

Type:Thesis

University:University of Minnesota

Candidate:Asiaeetaheri, Amir Asiaee

GTID:2478390020457664

Subject:Computer Science

Abstract/Summary:

Explosive growth in data generation through science and technology calls for new computational and analytical tools. For the statistical machine learning community, one major challenge is data sets whose dimension exceeds the number of samples. This low-sample, high-dimension regime violates the core assumption of most traditional learning methods. To address this challenge, many high-dimensional learning algorithms have been developed over the past decade.

One significant high-dimensional problem in machine learning is linear regression in which the number of features is greater than the number of samples. Initially, the high-dimensional linear regression literature focused primarily on estimating sparse coefficients through l1-norm regularization. In a more general framework, one can assume that the underlying parameter has an intrinsic "low dimensional complexity" or structure. Recently, researchers have looked at structures beyond sparsity that are induced by any norm used as the regularizer or constraint.

In this thesis, we focus on two variants of the high-dimensional linear model, namely data sharing and errors-in-variables, where the structure of the parameter is captured with a suitable norm. We introduce estimators for these models and study their theoretical properties. We characterize the sample complexity of our estimators and establish non-asymptotic, high-probability error bounds for them. Finally, we use dictionary learning and sparse coding to perform Twitter sentiment analysis as an application of high-dimensional learning.

Some discrete machine learning problems can also be posed as constrained set function optimization, where the constraints induce a structure over the solution set. In the second part of the thesis, we investigate a prominent set function optimization problem, social influence maximization, under the novel "heat conduction" influence propagation model. We formulate the problem as submodular maximization with cardinality constraints and provide an efficient algorithm for it. Through extensive experiments on several large real and synthetic networks, we show that our algorithm outperforms well-studied methods from the influence maximization literature.
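
As a rough illustration of submodular maximization under a cardinality constraint, the sketch below runs the standard greedy heuristic on a toy coverage objective. It is not the thesis's algorithm and does not implement the "heat conduction" model; the names `greedy_max`, `reach`, and `coverage` are hypothetical, and the small `reach` table is made-up data standing in for the set of users each seed could influence.

```python
from typing import Callable, Iterable, List


def greedy_max(ground_set: Iterable[str],
               f: Callable[[List[str]], float],
               k: int) -> List[str]:
    """Standard greedy heuristic: pick at most k elements, each time adding
    the element with the largest marginal gain in f (illustrative only)."""
    selected: List[str] = []
    remaining = set(ground_set)
    current = f(selected)
    for _ in range(min(k, len(remaining))):
        best, best_gain = None, 0.0
        # sorted() makes tie-breaking deterministic
        for v in sorted(remaining):
            gain = f(selected + [v]) - current
            if gain > best_gain:
                best, best_gain = v, gain
        if best is None:  # no element improves the objective
            break
        selected.append(best)
        remaining.remove(best)
        current += best_gain
    return selected


# Hypothetical toy data: which users each candidate seed can reach.
reach = {"a": {1, 2, 3}, "b": {3, 4}, "c": {4, 5, 6, 7}, "d": {1, 7}}


def coverage(seeds: List[str]) -> int:
    """Number of distinct users reached by the seed set (a submodular function)."""
    return len(set().union(*(reach[s] for s in seeds)))


seeds = greedy_max(reach, coverage, k=2)
print(seeds, coverage(seeds))
```

For monotone submodular objectives such as this coverage function, the greedy rule is known to achieve at least a (1 - 1/e) fraction of the optimal value, which is why it is a common baseline in influence maximization.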

Keywords/Search Tags:

Structure, Dimensional, Constraints

Related items

1	The Application Of System Inclusion Principle Under Information Structure Constraints Variation
2	A Study On Unsupervised Feature Selection Algorithms For High Dimensional Data
3	Theories And Matrix-Based Algorithms For The Constrained Minimax Design Of Two-Dimensional FIR Digital Filters
4	Research On Automatic Test Data Generation For Dynamic Data Structure
5	Three-dimensional Thermal Simulation And Structure Design Of Phase Change Access Random Memory
6	Research On The Nesting Algorithm Of Two-dimensional Irregular Parts Under Process Constraints
7	Scalable algorithms for Boolean satisfiability enabled by problem structure
8	Multi-sensor Information Fusion, With State Constraints
9	Reconstructing Three-dimensional Object Structure Chart To Organize And Correction Technology Research
10	Study On The Structure And Electronic Properties Of The New Low-dimensional Group ? Nitrides