
Visual attention and object categorization: From psychophysics to computational models

Posted on: 2005-12-28
Degree: Ph.D
Type: Thesis
University: California Institute of Technology
Candidate: Peters, Robert J
Full Text: PDF
GTID: 2458390008986775
Subject: Biology
Abstract/Summary:
This thesis is arranged in two main parts. Each part combines psychophysics and computational modeling to bring abstract, high-level theories of vision closer to a concrete neurobiological foundation.

The first part addresses visual object categorization. Previous studies using high-level models of categorization have left unresolved issues of neurobiological relevance, including how features are extracted from the image and what role memory capacity plays in categorization performance. We compared the ability of a comprehensive set of models to match the categorization performance of human observers while explicitly accounting for each model's number of free parameters. The most successful models did not require a large memory capacity, suggesting that a sparse, abstracted representation of category properties may underlie categorization performance. This type of representation, distinct from classical prototype abstraction, could also be extracted directly from two-dimensional images by a biologically plausible early vision model, rather than relying on experimenter-imposed features.

The second part addresses visual attention in its bottom-up, stimulus-driven form. Previous research showed that a model of bottom-up visual attention can partially account for the locations fixated by humans while free-viewing complex natural and artificial scenes. We used a similar framework to quantify how much the predictive ability of such a model is enhanced by new components based on specific mechanisms within the functional architecture of the visual system. These components included richer interactions among orientation-tuned units, both at short range (for clutter reduction) and at long range (for contour facilitation). Subjects free-viewed naturalistic and artificial images while their eye movements were recorded, and the resulting fixation locations were compared with the models' predicted salience maps. We found that each new model component was important in attaining a strong quantitative correspondence between model and behavior. Finally, we compared the model predictions with the spatial locations obtained from a task that relied on mouse clicking rather than eye tracking. As these models become more accurate at predicting behaviorally relevant salient locations, they become useful for a range of applications in computer vision and human-machine interface design.
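To make the second part's model-behavior comparison concrete, the sketch below (Python with NumPy; not the thesis code, and the function name, the z-scoring step, and the uniform random-control baseline are illustrative assumptions) scores a salience map by the normalized salience it assigns to fixated pixels relative to randomly sampled control pixels. A model whose maps consistently peak where observers actually fixate scores well above zero; a map unrelated to the fixations scores near zero.

    import numpy as np

    def salience_at_fixations(salience_map, fixations, n_controls=1000, seed=0):
        """Mean normalized salience at fixated pixels minus a random-control baseline.

        salience_map : 2-D array of model salience values for one image.
        fixations    : iterable of (row, col) pixel coordinates of human fixations.
        """
        rng = np.random.default_rng(seed)
        # Normalize to zero mean, unit variance so scores are comparable
        # across images and across model variants.
        s = (salience_map - salience_map.mean()) / (salience_map.std() + 1e-12)
        fixated = np.array([s[r, c] for r, c in fixations])
        # Baseline: salience at uniformly sampled control locations.
        ctrl = s[rng.integers(0, s.shape[0], n_controls),
                 rng.integers(0, s.shape[1], n_controls)]
        return float(fixated.mean() - ctrl.mean())

Averaging such a score over images and observers would yield one number per model variant, which is the kind of quantitative correspondence between model and behavior that the abstract describes.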
Keywords/Search Tags: Model, Visual attention, Categorization, Part, Locations