Visual object recognition using generative models of images

Posted on:2011-07-07

Degree:Ph.D

Type:Thesis

University:University of Toronto (Canada)

Candidate:Nair, Vinod

GTID:2448390002955276

Subject:Artificial Intelligence

Abstract/Summary:

Visual object recognition is one of the key human capabilities that we would like machines to have. The problem is the following: given an image of an object (e.g. someone's face), predict its label (e.g. that person's name) from a set of possible object labels. The predominant approach to solving the recognition problem has been to learn a discriminative model, i.e. a model of the conditional probability P(l|υ) over possible object labels l given an image υ.

Here we consider an alternative class of models, broadly referred to as generative models, that learns the latent structure of the image so as to explain how it was generated. This is in contrast to discriminative models, which dedicate their parameters exclusively to representing the conditional distribution P(l|υ). Making finer distinctions among generative models, we consider a supervised generative model of the joint distribution P(υ, l) over image-label pairs, an unsupervised generative model of the distribution P(υ) over images alone, and an unsupervised reconstructive model, which includes models such as autoencoders that can reconstruct a given image, but do not define a proper distribution over images. The goal of this thesis is to empirically demonstrate various ways of using these models for object recognition. Its main conclusion is that such models are not only useful for recognition, but can even outperform purely discriminative models on difficult recognition tasks.

We explore four types of applications of generative/reconstructive models for recognition: 1) incorporating complex domain knowledge into the learning by inverting a synthesis model, 2) using the latent image representations of generative/reconstructive models for recognition, 3) optimizing a hybrid generative-discriminative loss function, and 4) creating additional synthetic data for training more accurate discriminative models. Taken together, the results for these applications support the idea that generative/reconstructive models and unsupervised learning have a key role to play in building object recognition systems.
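
As a reading aid, the distributions named in the abstract are connected by standard probability identities, with υ denoting an image and l a label. The lines below are a general sketch of how a supervised generative model of the joint distribution can be used for recognition. They are not taken from the thesis, and the symbol \hat{l} for the predicted label is introduced here for illustration only.

```latex
\begin{align}
  % Conditional (discriminative) distribution recovered from the joint by Bayes' rule
  P(l \mid \upsilon) &= \frac{P(\upsilon, l)}{\sum_{l'} P(\upsilon, l')} \\
  % Unsupervised image distribution obtained by marginalizing out the label
  P(\upsilon) &= \sum_{l} P(\upsilon, l) \\
  % Predicted label: the denominator above does not depend on l,
  % so maximizing the joint is equivalent to maximizing the conditional
  \hat{l} &= \arg\max_{l} P(\upsilon, l)
\end{align}
```

A reconstructive model such as an autoencoder has no counterpart in these identities, since it does not define a proper distribution over images.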

Keywords/Search Tags:

Recognition, Models, Image, Using

Related items

1	Visual object recognition using generative models of images
2	Research On The Acoustic Models And Implemention On The Keyword Recognition System
3	Researches About Generative And Discriminative Models Applied To Face Recognition
4	Models Recognition Based On Image Feature
5	Image Representation, Matching And Recognition With Graph Theory And Sparse Constraint Models
6	Speaker Recognition Based On Factor Analyzed Probability Statistic Models
7	Researches About Image Recognition Based On Supervised Pretraining NIN And Deep ELM Models
8	Research On Models Of Three Granularity Image Recognition Based On Convolutional Neural Networks
9	Geometric Image Models And Application In Medical Image Processing
10	Statistical models on human shapes with application to Bayesian image segmentation and gait recognition