Font Size: a A A

The Binocular Cognitive Stereo Model

Posted on:2009-12-09Degree:MasterType:Thesis
Country:ChinaCandidate:L WangFull Text:PDF
GTID:2178360272457786Subject:Biomedical engineering
Abstract/Summary:PDF Full Text Request
Binocular stereo vision is the senior visual function of mankind that through the convergence eye movement man acquires the three-dimensional shape of objects and accurately estimate the location of three-dimensional objects. In the past one and a half century, the area of stereo vision research has been very active. In the 1960s, Julesz was the first man to do three-dimensional visual research with computer designed random-dot stereogram(RDS), and found the sense of depth can only caused by the binocular disparity without the need of the perception of form, so it became the milestone of stereo vision. On this basis, the researchers did a series of studies on the disparity calculation and have made remarkable progress psychological in the areas of psychophysics and neurophysiology, but there also existed difficulties that cannot be resolved.Julesz proposed a cooperative stereo vision model based on the research of random-dot stereogram. Julesz assumed in his model that the corresponding points on the cortex of stimulus on the retina of both eyes has the same character as magnetic dipole, and can match like the manner of magnetic dipole interaction. The disparity of corresponding points can be represented by the rotation of magnetic dipole. In the 1970s, Marr established a stereo theory based on calculation, the theory figured that the three major constraints (the exclusive correspondence, compatibility and continuity), zero-crossing primitives and the multi-channel collaborative manipulation were the important components of three-dimensional vision. Since then, the researchers also put forward a number of stereo models based on the physiological processes. Including the representative model that is called the energy model. This model considered that a group of complex disparity sensitive cells can effectively calculated the disparity of a pair of corresponding points.However, there is a common fundamental problems hard to be resolved, named the correspondence problem (parallax Paradox) in these stereo models of perception. In the past literatures, the disparity was defined as the space distance between the two corresponding retinal positions projected by one stimulus. According to this definition, the experimental result from random-dot stereogram study (stereo perception is before the sense of form) is paradox: if the disparity definition is right, as long as the existence of disparity can create a sense of depth, it needs to find the accurate corresponding points, but to find accurate corresponding points need to identify the points, thus it is contradictive to the conclusion of the RDS study.In order to solve the difficulties of the stereo perception model, on the basis of the reality of the process of the human's vision This article cited the results of research on neural anatomy and introduced the relevance between the eye movement and stereo perception in the stereo cognitive mechanism, and brought forward an innovative stereo model of perception, the main content is:1.The model was based on the neural anatomy and the actual human vision process. a. The model cited the results of the research on neural anatomy: the binocular cells on the visual cortex cells can only receive two signals of the corresponding physiological points on the retina, rather than signals of the corresponding physical stimulus points. It can be concluded that it is very difficult for the existing stereo models to calculate the physical space distance of the corresponding stimulus points on retina. Therefore, it is reasonably that the human binocular vision achieved through manipulating the corresponding signals of the physiological points on the retina, rather than achieved by calculating the physical disparity of the corresponding physical points which explained in the existing model. b. The model investigated the actual process of human stereo perception in normal circumstances: when a person with normal stereo vision observed objects, the visual system always tried to superpose the retinal graphs to eliminate differences. The form of the stereo vision not only needed the wholesome optic system, but also the coordination of the ocular muscle, and the convergence eye movement would be a critical and necessary condition. If the ocular muscle can not coordinate, and the eyes can not gaze or convergent, then the image of objects can not be projected on the central retina, then the binocular stereo vision can not be achieved.2. On the basis of the neural anatomy and the actual human vision process, the model proposed that to represent one stereo world by two separated planar worlds in the two eyes, it needed to "mix" these two planar worlds into a single world by eye movement. "Mix" was a complex process, which included the peripheral and the central process. Simply speaking, "mix" was dominated by the peripheral eye movement regulation and process of sensory memory.3. The model considered that the stereo vision was dominated by the peripheral eye movement regulation and processing of sensory memory. The minus of optical signal of the retina physical corresponding points arose the focus and convergence of eyes; the sum of optical signal of the retina physical corresponding points transmitted to the central executive system to lead the continuous perception. The processing of stereo perception was the processing of eyes dynamic feedback regulation, the sense of depth was determined by the tensity of eye muscle in stability.The stereo cognitive model in this paper not only solved the correspondence problem in the existing model, furthermore, it explained a lot of three-dimensional perception phenomenon. Compared to the current stereo perception study, different theories explain a large number of different stereo phenomena of perception. Based on our stereo model of perception, the process of brain manipulating the information from two eyes is more concise and effective than what we imagine to be, and we do not need to have any insight on how to complete these operations.
Keywords/Search Tags:stereoscopic vision, binocular disparity, correspondence problem, eye movement
PDF Full Text Request
Related items