Property Testing and Probability Distributions: New Techniques, New Models, and New Goals

Posted on:2018-09-19

Degree:Ph.D

Type:Thesis

University:Columbia University

Candidate:Canonne, Clément L

GTID:2448390005953744

Subject:Computer Science

Abstract/Summary:

In order to study the real world, scientists (and computer scientists) develop simplified models that attempt to capture the essential features of the observed system. Understanding the power and limitations of these models, and when they apply or fail to fully capture the situation at hand, is therefore of the utmost importance.

In this thesis, we investigate the role of some of these models in property testing of probability distributions (distribution testing), as well as in related areas. We introduce natural extensions of the standard model (which only allows access to independent draws from the underlying distribution), in order to circumvent some of its limitations or draw new insights about the problems these models aim to capture. Our results are organized in three main directions:

(i) We provide systematic approaches to tackle distribution testing questions. Specifically, we provide two general algorithmic frameworks that apply to a wide range of properties, and yield efficient and near-optimal results for many of them. We complement these by introducing two methodologies to prove information-theoretic lower bounds in distribution testing, which enable us to derive hardness results in a clean and unified way.

(ii) We introduce and investigate two new models of access to the unknown distributions, which both generalize the standard sampling model in different ways and allow testing algorithms to achieve significantly better efficiency. Our study of the power and limitations of algorithms in these models shows how these could lead to faster algorithms in practical situations, and yields a better understanding of the underlying bottlenecks in the standard sampling setting.

(iii) We then leave the field of distribution testing to explore areas adjacent to property testing. We define a new algorithmic primitive of sampling correction, which in some sense lies in between distribution learning and testing and aims to capture settings where data originates from imperfect or noisy sources. Our work sets out to model these situations in a rigorous and abstracted way, in order to enable the development of systematic methods to address these issues.

Keywords/Search Tags:

Models, Testing, Distribution, New, Order

Related items

1	Research On Order Batching Problem In A Distribution Center Based On Fixed Time Point And Line Distribution
2	Research On Order Batching Policy Optimization Problem In Distribution Center
3	Sparse Optimization And Network Compression Methods Based On The Second-order Information Of Models
4	Design And Implementation Of Intelligent Order Management And Distribution System For Supermarket
5	Research On The Optimization Of Order Picking In E-commerce Enterprise Distribution Center
6	Learning discrete hidden Markov models from state distribution vectors
7	Research Of Combinatorial Testing Based On Parameter Order
8	High order discrete-time models with applications to multirate control
9	Optimization On Online Supermarket Order Splitting And Order Consolidation And Package Distribution Based On Time-Space Network
10	Research On Unmanned Distribution System Based On Mobile Robots In Campus