Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data

Posted on: 2008-05-14

Degree: Ph.D.

Type: Dissertation

University: University of California, Berkeley

Candidate: Banerjee, Onureena

GTID: 1440390005470990

Subject: Operations Research

Abstract/Summary:

We consider the problem of estimating the parameters of a Gaussian or binary distribution in such a way that the resulting undirected graphical model is sparse. Our approach is to solve a maximum likelihood problem with an added ℓ1-norm penalty term. The problem as formulated is convex, but the memory requirements and complexity of existing interior point methods are prohibitive for problems with more than tens of nodes. We present two new algorithms for solving problems with at least a thousand nodes in the Gaussian case. Our first algorithm uses block coordinate descent and can be interpreted as recursive ℓ1-norm penalized regression. Our second algorithm, based on Nesterov's first-order method, yields a complexity estimate with a better dependence on problem size than existing interior point methods. Using a log-determinant relaxation of the log partition function (Wainwright and Jordan [2006]), we show that these same algorithms can be used to solve an approximate sparse maximum likelihood problem for the binary case. We test our algorithms on synthetic data, as well as on gene expression and Senate voting records data.
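
For reference, the Gaussian case of the penalized problem described above is commonly written as follows. The symbols are assumptions introduced here for illustration and do not appear in the abstract: $S$ is the sample covariance matrix, $X$ is the estimated inverse covariance (precision) matrix, and $\lambda > 0$ is the penalty weight.

```latex
\hat{X} \;=\; \arg\max_{X \succ 0} \; \log\det X \;-\; \operatorname{tr}(S X) \;-\; \lambda \, \|X\|_1,
\qquad
\|X\|_1 \;=\; \sum_{i,j} |X_{ij}|.
```

In this form, a zero entry $\hat{X}_{ij}$ corresponds to a missing edge between nodes $i$ and $j$ in the undirected graphical model, so a larger $\lambda$ tends to give a sparser graph.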

Keywords/Search Tags:

Binary, Maximum likelihood, Gaussian, Problem, Sparse

Related items

1	Maximum Likelihood Estimation For A Gaussian Process With Special Covariance Functions
2	Maximum likelihood estimation of an unknown change-point in the parameters of a multivariate Gaussian series with applications to environmental monitoring
3	Research On Penalized Maximum Likelihood Estimation For Gaussian Graphical Mixture Model
4	Sparse Principal Component Regression Of Binary Data
5	Research On Sparse Bayes Model Applied In Classification And Regression
6	Maximum likelihood estimation in Gaussian AMP chain graph models and Gaussian ancestral graph models
7	Research On The Movement Direction Decoding Of Animals Based On Maximum Likelihood Estimation
8	Research On The Maximum Lq Likelihood Estimation Problem Of Logistic Regression Model
9	Genetic algorithms and maximum likelihood estimation
10	Research On Correct Convergence Of The EM Algorithm For Gaussian Mixtures