Active object recognition in theory and practice

Posted on:2012-10-08

Degree:Ph.D

Type:Thesis

University:York University (Canada)

Candidate:Andreopoulos, Alexander

Full Text:PDF

GTID:2458390008991989

Subject:Engineering

Abstract/Summary:

This thesis examines the problem of actively searching for an object in a 3D environment, subject to a cost constraint. The thesis shows that different variants of this problem are NP-Hard. The tradeoffs of localizing vs. detecting a target object, using single-view and multiple-view recognition, under imperfect dead-reckoning, and an imperfect recognition algorithm are explored. The effects that finite computational resources, input noise, occlusion, and the related object representation class complexities have in terms of localizing all objects present in the search region are investigated. Various bounds relating the feature detection noise-rate to the Vapnik-Chervonenkis-dimension of an object representable by an architecture satisfying the given computational constraints are presented. These prove that under certain conditions, the corresponding classes of object localization and recognition problems are efficiently learnable in the presence of noise and under a purposive learning strategy. Under this formulation, the existence of a number of emergent relations are proven, which demonstrate that selective attention is not only necessary due to computational complexity constraints, but it is also necessary as a noise-suppression mechanism and as a mechanism for efficient object class learning. An implementation of an active 3D object localization system on a state-of-the-art visually guided humanoid robot is presented. This involves a target probability updating scheme providing an efficient solution to the best next viewpoint selection problem. It employs a recognition architecture, for attending to the view-tuned units at the proper intrinsic scales and for purposively controlling the sensor's coordinate frame, giving control of the extrinsic image scale and achieving the proper sequence of informative views of the scene. This demonstrates the feasibility of using state of the art vision-based systems for efficient and reliable object localization in an indoor 3D environment. The thesis concludes by demonstrating that under certain conditions, the effects of dead-reckoning errors can be effectively addressed by a visually-guided agent. It is argued that reliable vision systems that are non-camera specific must purposively control all sensor parameters. This suggests ways of improving the evaluation techniques of vision algorithms and motivates topics for future research.

Keywords/Search Tags:

Object, Recognition

Related items

1	Research On Recognition And Tracking Technology Of Flying Objects In Complex Weather
2	The Research On Road Object Recognition Based On Driving Video And Its Android App Development
3	Research On Autonomous Object Recognition And Grasping Technology Of Robotic Arm
4	Research On Multi-view Indoor Object Recognition Algorithm
5	Research And Implementation Of DSP-based General Object Recognition
6	Articulated Object Detection And Recognition Applications
7	Research On Feature Based Representation For Point Cloud Object Recognition
8	Rotation-invariant 3-D Object Recognition Based On Structured Illumination
9	Study On Object Recognition Based On Contour Feature
10	Object Recognition And Contour Extraction For Interested Object Based On Over-Segmentation