Understanding And Application Of RGB-D Visual Content

Posted on:2016-08-19

Degree:Master

Type:Thesis

Country:China

Candidate:H Luo

Full Text:PDF

GTID:2308330470971092

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

Compared with 2D visual content,3D visual content contains more particularity and challenge. RGB-D image is one of 3D visual content, and it contains color information (RGB image) and spatial information (depth map). Depth data provide new clue for the research of 2D image. Compared to the point cloud data, depth data has low data volume, however, it also has low accuracy. So it is suitable for detecting and tracking in small scene. In recent years some RGB-D image acquisition devices with low cost and good robustness come out such as Microsoft Kinect, which makes researcher get RGB-D image easily. Since the research and application of RGB-D become more and more important, how to obtain the depth map quickly and accurately using existing images has become a worth problem to study.This paper explores the way to obtain depth information of image collection based on research of it. In a set of images of different perspectives on the same scene, these images have semantic similarity, we can think of their depth distribution is also similar.Firstly, SIFT Flow algorithm is used to obtain depth information of a single image. We input a set of images, and use the GIST features and KNN algorithm to retrieve candidate images from the RGB-D database which are similar with the input images. The SIFT flow algorithm is used to match the input images and candidates. SIFT flow use SIFT descriptor to match pixels in different images refer to optical flow. A three layers pyramid of classical SIFT Flow Algorithm is established by down-sampling to deal with the large computation problem, we find a rough position in the coarse layer and then refine the position in the accuracy layer. After matching, we can transfer the depth from candidates to the input, and estimate depth map of the input images.On the basis of depth information of a single image, image collection is used to refine depth data.In order to select the most accurate data from the candidates to make the depth image smooth, we use the space regularization method adopted by iterative calculation to refine preliminary estimation. Thesis uses homogenized Harris corner as a point of interest in feature collection. Both Haar wavelet transform and KNN algorithm are used to find the overlapping region of the image. The overlapping region is refined by Bilinear Interpolation algorithm. Experiment result shows that this method is effective partly.

Keywords/Search Tags:

depth estimation, semantic similarity, SIFT flow, depth transfer

PDF Full Text Request

Related items

1	Research On Monocular Image Depth Estimation
2	Research On Depth Estimation Algorithms For Monocular Image
3	Video Semantic Segmentation Based On Bidirectional Optical Flow Fusion And Image Depth Estimation
4	Monocular Image Depth Estimation Based On Transfer Learning
5	Research On 2D To 3D Conversion Methods Based On GIST Feature Matching And SIFT Flow
6	Monocular Depth Estimation And Depth Completion Based On Convolutional Neural Network
7	Research On Image Depth Estimation Algorithm
8	Research On High Quality Depth Maps Acquisition For RGB-D Data
9	Monocular Depth Estimation Based On Convolutional Neural Network
10	Recovery Of Depth Maps With High Resolution And High Accuracy