Font Size: a A A

Topics in information theory, probability, and statistics

Posted on:2007-04-04Degree:Ph.DType:Dissertation
University:Brown UniversityCandidate:Madiman, MokshayFull Text:PDF
GTID:1459390005982644Subject:Mathematics
Abstract/Summary:
This dissertation explores three topics at the intersection of information theory, probability and statistics.;The first, titled "The Minimum Description Length Principle in Lossy Data Compression", develops a theoretical framework for lossy source coding that treats it as a statistical problem, in analogy to the approach to universal lossless coding suggested by Rissanen's Minimum Description Length (MDL) principle. Two methods for selecting efficient compression algorithms are proposed, based on lossy variants of the Maximum Likelihood and MDL principles. Their theoretical performance is analyzed, and it is shown that the MDL approach to universal lossy coding identifies the optimal model class of lossy codes.;The second part, titled "Measure concentration for Compound Poisson distributions", considers the phenomenon of measure concentration for Lipschitz functions, of non-negative integer valued (i.e., Z+ -valued) compound Poisson random variables. New concentration inequalities with power law decay are proved when the Levy measure has finite (1 + delta)th moment for some delta > 0. In addition to a new modification of the Herbst argument, a new, elementary proof of a logarithmic Sobolev inequality for compound Poisson distributions [Wu (2000)] is presented.;The third part, titled "An information-theoretic approach to compound Poisson approximation" develops an information-theoretic approach to obtaining compound Poisson approximation bounds for sums of Z+ -valued random variables. First, a general and simple relative entropy bound for approximating an arbitrary function of possibly dependent Z+ -valued random variables using independent compound Poissons is proved. This yields a simple approximation bound for sums. In order to obtain refined bounds, three useful "local information quantities" are defined and analyzed. In particular, one of these quantities is used to obtain optimal-order total variation bounds for the case of "nearly homogeneous portfolios".
Keywords/Search Tags:Information, Compound poisson
Related items