Research On Salient Object Detection Algorithm For Cross-modal Collaborative Compensation

Posted on:2022-11-28

Degree:Master

Type:Thesis

Country:China

Candidate:J X Yang

Full Text:PDF

GTID:2518306755494094

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

In the fast-paced and efficient real life,human beings can quickly focus on the salient areas of interest by using visual attention.In order to deal with the impact of massive image data in the era of big data,human beings can extract the salient regional information of images in large quantities by using the computer to simulate the visual attention mechanism of human eyes.Therefore,Salient Object Detection(SOD)has become one of the hotspots in the research field of computer vision.As human eyes like positioning detection technology,SOD algorithm accurately extracts the key and salient areas contained in the image,which lays a solid foundation for more advanced computer vision tasks such as Object Segmentation,Object Classification and Object Recognition.In recent years,the research on SOD algorithm based on RGB modal extracts salient objects through attributes such as color contrast and texture change,but there is still the phenomenon of fuzzy edge information in the division of foreground and background areas.In order to overcome this challenge,edge information is introduced as an important cooperative compensation modal,so as to improve SOD algorithm,increase the accuracy of the model and refine the edge of salient objects.However,in the face of challenges such as relatively complex scenes and illumination changes,the detection effect of SOD model based on RGB modal is not ideal and its robustness is poor.Tracing back to its source,human vision can also perceive the depth distance of the visual scene.The difference of depth distance can make the salient object highlight from the challenges such as complex background or illumination changes.Therefore,the SOD model can be further enhanced through the cooperative compensation of depth modal information.Aiming at different modal information(Edge,Depth),this paper studies the SOD algorithm of cross-modal cooperative compensation.The specific work is as follows:(1)In view of the unsatisfactory effect of the existing SOD algorithm on the edge perception of salient objects,and considering that the high-level features contain advanced semantic information and the low-level features contain rich texture information in the deep neural network,this paper designs a deep network composed of stacked edge-aware modules,that is,the SOD network based on edge modal cooperative compensation.Firstly,Res2 Ne Xt-50 is used as the backbone network to transform the image into multi-object and multi-scale initial salient features.Then,the edge-aware modules are stacked sequentially to fuse the highlevel and low-level information of the image in an asymmetric way,so as to highlight the salient object position.Among them,the network leads out the branch forcing model to learn the edge modal features and performs cooperative compensation in the edge-aware module.Finally,SOD results are outputted by the network.Experiments show that this algorithm can enrich the edge details of salient objects and further accurately locate the salient object areas due to the cooperative compensation of edge modal.(2)In order to overcome the challenges of complex scenes and illumination changes,SOD network based on depth modal cooperative compensation is proposed in this paper.Firstly,Res2 Ne Xt-50 is also used as the backbone network to transform the input image into rich salient features.Then,considering the gradient and smoothness of the saliency of each region in the image,the Soft Attention Module is designed to optimize the rough features extracted by the backbone network.At the same time,in order to deal with the multi-scale problem of salient objects,the Multi-scale Texture Aware Module is proposed to further improve the feature quality.Secondly,the Depth Cooperative Compensator is constructed,which uses the depth modal features of forced network learning to assist in optimizing the salient features in the way of cooperative compensation.Finally,the salient features are fused through the Pyramid Aggregation Module to predict the final SOD result.In addition,this paper also uses this network to demonstrate the necessity of introducing depth modal in the research direction of SOD based on RGB modal,and in order to expand into a general network framework,this paper demonstrates the universality of cross-modal cooperative compensation in the research direction of SOD based on RGB-T modal.Experiments show the effectiveness of the proposed algorithm in three different research directions of salient objection detection.

Keywords/Search Tags:

cross-modal cooperative compensation, edge-aware module, soft attention module, multi-scale texture aware module, depth cooperative compensator, pyramid aggregation module

PDF Full Text Request

Related items

1	Research On Motion Image Deblurring Based On Attention And Adversarial Network
2	Research And Realization Of AIS Base Station System Based On VC++
3	Design And Implementation Of Transform And Manage Module Of V.35/E1Based On STM32
4	Pyramid Pooling And Spatial Attention Optimized Deep Semantic Segmentation
5	The Research Of MUX/DEMUX Module
6	Development And Application Based On Dual CPU Configurale Of The GPS Module And GSM Module
7	Design Of LXI Multi-functional Test Module
8	Research On Image Panoptic Segmentation Based On Spatial Clustering Module And Multi-layer Feature Fusion
9	The Design And Implementation Of Order Module And Seckill Module In Electronic Shopping Mall System
10	Research On Top-down Multi-person Pose Estimation Algorithm