Research On Saliency Detection Of RGB-D Image

Posted on:2021-09-30

Degree:Doctor

Type:Dissertation

Country:China

Candidate:D Liu

Full Text:PDF

GTID:1488306290985709

Subject:Photogrammetry and Remote Sensing

Abstract/Summary:

When the human vision system perceives the surrounding scene,the information obtained not only includes the RGB information generated by the object reflecting different spectra,but also includes the depth information provided by the stereo vision formed by two eyes.The wide applications of RGB-D sensors can improve the ability of computer to perceive objects in the real environment,and the high availability of depth sensor provides valuable supplementary information for the modeling and applications of RGB-D image processing.Based on the depth related visual characteristics of human visual system,this paper discusses the mechanism of visual attention in stereo vision.Aiming at the two task scenarios of visual saliency prediction and saliency target detection of rgb-d image,this paper focuses on the deep saliency feature expression and multi-modal feature fusion in the saliency detection of RGB-D image.The research content of this paper includes the following three aspects:1)The mechanism of depth feature’s influence on visual attention in stereo vision is studied.A depth saliency feature based on disparity tuning mechanism and an RGB-D viusal saliency prediction algorithm are proposed.According to the parallax selection characteristics of the visual nerve in MT area of the cerebral cortex,the neural excitation response is modeled as a depth feature to predict visual saliency.This method explains the effect of depth feature on visual attention from the perspective of physiology.In the three largest visual saliency prediction datasets for RGB-D images,the result of this method is better than that of the existing methods.2)A two-stream refinement network for RGB-D salient object detection is proposed.The fusion based refinement network(FRN)and propagation based refinement network(PRN)are designed.The function of FRN is to fuse the multi-modal RGB and depth features as well as the multi scale intermediate convolution features in manner of intermediate fusion.As a result,the internal correlation between the RGB and depth map features is mined,and the high-resolution shallow convolution features are adopted to refine the boundary of the detected object.The function of PRN is to use a shallow network to learn the affinity matrix of each pixel,and further refine the result of preliminary detection.3)The fusion method of RGB and depth features in stereo vision attention is studied.An attentive cross modal feature-fusion method and an RGB-D image salient object detection algorithm based on deep neural network are proposed.This method uses the idea of residual connection,introduces the attention guidance mechanism,and realizes the optimization and efficient fusion of multimodal features.This method has achieved the best results on the eight public datasets for salient object detection.

Keywords/Search Tags:

RGB-D image, Human visual characteristics, Visual saliency prediction, Salient object detection, Disparity-tuning, Cross-modal attention

Related items

1	Research On Key Techniques Of Visual Saliency Detection
2	Fundamental Research On Image Saliency Detection Moedls And Methods
3	Object Detection Methods Based On Biologically Visual Mechanism
4	Researches On Visual Saliency Computation And Its Applications
5	Stereoscopic Visual Saliency And Its Application On Stereoscopic Image Disparity Control
6	Research On Visual Attention Detection And Salient Object Segmentation
7	Researches On RGB-D Visual Salient Object Detection Algorithms Based On Feature Fusion
8	The Study Of Salient Object Detection Algorithm Based On The Visual Perception Mechanism
9	Research On Visual Attention Mechanism Modeling And Its Applications
10	Research Of Salient Regions Detection Based On Bottom-Up Visual Attention Model