Image parsing by data-driven Markov chain Monte Carlo

Posted on:2003-11-05

Degree:Ph.D

Type:Thesis

University:The Ohio State University

Candidate:Tu, Zhuowen

Full Text:PDF

GTID:2468390011980847

Subject:Computer Science

Abstract/Summary:

It's a scientific dream that we could build machines which understand the contents of natural images. The realization of this dream is challenged by the enormous complexity of natural images. The difficulty lies in two major aspects: (1) the modeling problem-how to model abundant patterns that emerge in images such as texture, lighting, shape, motion etc. (2) the computing problem-how to make inference of these patterns. Although the modeling problem has become increasingly clear by a number of recent developments [6, 49, 61, 65, 93, 96, 98], etc., designing a general and efficient computing framework remains a big challenge. Previous attempts in designing such a framework failed either because they are not general enough to deal with problems of high complexity or due to efficiency problem. We attack the computing problem by presenting a new paradigm called Data-Driven Markov Chain Monte Carlo (DDMCMC) in which exploration of the heterogeneous solution space is guided by importance proposals probabilistically in a efficient and robust manner. We make the contributions as follows: (1) We propose a stochastic computing paradigm called DDMCMC for visual inference and summarize many traditional vision problems for example, image segmentation, object recognition, perceptual organization etc., into a general problem called image parsing. It decomposes images into their natural patterns such as texture regions, curves, faces, etc. (2) In our first attempt to build an image parsing system, we use the DDMCMC paradigm and device a novel algorithm called image segmentation by DDMCMC wherein patterns of interest are regions of unknown parameters from seven known types. This algorithm has been tested extensively for a large set of images. (3) We then introduce two generative curve models to extend the system to parse images into regions and curves. (4) Two high level patterns of structure, parallel curves and trees curves, are further incorporated into the system so that procedures of perceptual organization are carried out in an early stage of image parsing. (5) At the end of this thesis, we also provide some proofs of how to achieve efficient MCMC algorithms.

Keywords/Search Tags:

Image, DDMCMC

Related items

1	Image parsing by data-driven Markov chain Monte Carlo
2	Based On Data-driven Markov Chain Monte Carlo Model Of The Dynamic Texture Analysis
3	Research On Key Technologies Of Image Automatic Mosaic On Image Space
4	Research On Pixel-Level Image Fusion Algorithms
5	Based On The Multi-scale Image Decomposition Of The Image Filter
6	Research On Web Image Retrieval Based On Web Information And Image Feature
7	Subpixel Image Registration And Multiband Image Synthesis Of True Color Map Algorithm Research
8	Research On Multi-source Image Fusion
9	Based On The Regional Characteristics Of Image Mosaicing Technique Research
10	The Research, Validating And Optimization Of Digital Image Haze Removal