Font Size: a A A

Motion and Shape from Apparent Flow

Posted on:2014-01-17Degree:Ph.DType:Thesis
University:The Chinese University of Hong Kong (Hong Kong)Candidate:Hui, Tak WaiFull Text:PDF
GTID:2458390008460872Subject:Computer Science
Abstract/Summary:
Determination of general camera motion and reconstructing depth map from a captured video of the imaged scene relative to a camera is important for computer vision and various robotics tasks including visual control and autonomous navigation. The determination of the relative geometry between the camera frame and the end-effector frame which is commonly referred as hand-eye calibration is essential to proper operation in visual control. Determining the relative geometry of multiple cameras is also important to various applications requiring the use of multi-camera rig.;The relative motion between an observer and the imaged scene induces apparent flow in the video. The difficulty of the problem lies in the flow pattern observable in the video is not the full flow field induced by the motion, but only partial information of the spatial image intensity profile. The partial flow field is known as the normal flow field. This thesis addresses several important problems in computer vision: determination of camera motion, recovery of depth map, and performing handeye calibration from the apparent flow (normal flow) pattern itself in the video data directly but not from the full flow interpolated from the apparent flow. It does not require interpolating the flow field and in turn does not demand the imaged scene to be smooth. In contrast to optical flow, no sophisticated optimization procedures that account for handling flow discontinuities are required, and such techniques are generally computational expensive. In this thesis, several direct methods are proposed to determine camera motion using three different types of imaging systems, namely monocular camera, stereo camera, and multi-camera rig.;This thesis begins with the Apparent Flow Positive Depth (AFPD) constraint to determine the motion parameters using all observable normal flows from a monocular camera. The constraint presents itself as an optimization problem to estimate the motion parameters. An iterative process in a constrained dual coarse-to-fine voting framework on the motion parameter space is used to exploit the constraint.;This thesis proposes two constraints: one related to the direction component of the normal flow field -- the Apparent Flow Direction (AFD) constraint, and the other to the magnitude component of the field -- the Apparent Flow Magnitude (AFM) constraint, to determine motion. The first constraint presents itself as a system of linear inequalities to bind the direction of motion parameters; the second one uses the globality of rotational magnitude to all image positions to constrain the motion parameters further. A twostage iterative process in a coarse-to-fine framework on the motion parameter space is used to exploit the two constraints.;Normal flow is only raw information extracted locally that generally suffers from flow extraction error arisen from finiteness of the image resolution and video sampling rate. This thesis explores the visual field of the imaging system by fixating a number of cameras together to form an approximate spherical eye. With a substantially widened visual field, the normal flow data points would be in a much greater number to combat the local flow extraction error at each image point. More importantly, the directions of translation and rotation components in general motion can be estimated with the Apparent Flow Separation (AFS) and Extended Apparent Flow Separation (EAFS) constraints.;Stereo vision contributes another visual clue to determine magnitude of translation and depth map without the problem of arbitrarily scaling of the magnitude. This thesis explores two direct methods to recover the complete camera motion from the stereo system without the explicit point-to-point correspondences matching. The first method extends the AFD and AFM constraints to stereo camera, and provides a robust geometrical method to determine translation magnitude. The second method which requires the stereo image pair to have a large overlapped field of view provides a closed-form solution, requiring no iterative computation. Once the motion parameters are here, depth map can be reconstructed without any difficulty. The depth map resulted from normal flows is generally sparse in nature. We can interpolate the depth map and then utilizing it as an initial estimate in a conventional TV-L1 framework. The result is not only a better reconstruction performance, but also a faster computation time.;Calibration of hand-eye geometry is based on feature correspondences. This thesis presents an alternative method that uses normal flows generated from an active camera system to perform self-calibration. In order to make the method more robust to noise, the strategy is to use the direction component of the flow field which is more noise-immune to recover the direction part of the hand-eye geometry first. The final solution is refined using a robust method.
Keywords/Search Tags:Motion, Flow, Depth map, Camera, Imaged scene, Method, Video, Direction
Related items