Font Size: a A A

Active object recognition in theory and practice

Posted on:2012-10-08Degree:Ph.DType:Thesis
University:York University (Canada)Candidate:Andreopoulos, AlexanderFull Text:PDF
GTID:2458390008991989Subject:Engineering
Abstract/Summary:
This thesis examines the problem of actively searching for an object in a 3D environment, subject to a cost constraint. The thesis shows that different variants of this problem are NP-Hard. The tradeoffs of localizing vs. detecting a target object, using single-view and multiple-view recognition, under imperfect dead-reckoning, and an imperfect recognition algorithm are explored. The effects that finite computational resources, input noise, occlusion, and the related object representation class complexities have in terms of localizing all objects present in the search region are investigated. Various bounds relating the feature detection noise-rate to the Vapnik-Chervonenkis-dimension of an object representable by an architecture satisfying the given computational constraints are presented. These prove that under certain conditions, the corresponding classes of object localization and recognition problems are efficiently learnable in the presence of noise and under a purposive learning strategy. Under this formulation, the existence of a number of emergent relations are proven, which demonstrate that selective attention is not only necessary due to computational complexity constraints, but it is also necessary as a noise-suppression mechanism and as a mechanism for efficient object class learning. An implementation of an active 3D object localization system on a state-of-the-art visually guided humanoid robot is presented. This involves a target probability updating scheme providing an efficient solution to the best next viewpoint selection problem. It employs a recognition architecture, for attending to the view-tuned units at the proper intrinsic scales and for purposively controlling the sensor's coordinate frame, giving control of the extrinsic image scale and achieving the proper sequence of informative views of the scene. This demonstrates the feasibility of using state of the art vision-based systems for efficient and reliable object localization in an indoor 3D environment. The thesis concludes by demonstrating that under certain conditions, the effects of dead-reckoning errors can be effectively addressed by a visually-guided agent. It is argued that reliable vision systems that are non-camera specific must purposively control all sensor parameters. This suggests ways of improving the evaluation techniques of vision algorithms and motivates topics for future research.
Keywords/Search Tags:Object, Recognition
Related items