Image Semantic Segmentation Based On Pyramid Pooling And Attention Mechanism

Posted on:2022-07-20

Degree:Master

Type:Thesis

Country:China

Candidate:H Shi

Full Text:PDF

GTID:2518306539481194

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Image semantic segmentation is one of the main tasks in the field of computer vision.Its purpose is to classify each pixel in the image and predict its corresponding semantic label.With the developing of deep learning research,the semantic segmentation algorithm based on convolutional neural network directly extracts more expressive target feature information from the image through the multi-layer cascaded complex structure,which improves the accuracy of segmentation.In order to extract the feature information of the image more effectively to enhance the segmentation effect,this paper studies the semantic segmentation algorithm based on the deep learning technology of multi-layer feature fusion from the perspective of efficiently using the feature of each level of the convolutional neural network.The main work is as follows:Firstly,a multi-layer feature fusion semantic segmentation method DSPN based on pyramid pooling is proposed.This method uses the high-resolution shallow features extracted by the convolutional neural network to retain more local information and deep features with high semantic information.It is conducive to the characteristics of accurate classification,and effectively solves the problem of loss of segmentation accuracy caused by the loss of detailed information in the process of multiple downsampling of feature maps.The DSPN model uses the lightweight network Mobile Net V2 as the basic network to extract features,and uses the pyramid pooling structure as the last layer of the coding network to extract the global context information of the image.In the decoding stage,the deep features are continuously upsampled and merged layer by layer with the shallow semantic features of the corresponding size,which effectively combines the deep semantic features and shallow local features in the network to improve the accuracy of segmentation.Secondly,this paper further improves DSPN,and proposes a semantic segmentation method A-DSPN based on pyramid pooling and hybrid attention mechanism.This method introduces a hybrid attention mechanism,which combines channel attention and spatial attention to assign weights to different parts of the feature map,enhances the expression of features,improves the global perception of features.At the same time,an auxiliary loss function is added in the training process to optimize the learning process,to ensure the stable convergence of the network and improve the performance of the network.Finally,the method proposed in this paper is verified and evaluated on two public datasets Cam Vid and PASCAL VOC 2012.The experimental results show that the segmentation method proposed in this paper can effectively improve the segmentation performance of the network while reducing the amount of model parameters.

Keywords/Search Tags:

semantic segmentation, feature fusion, pyramid pooling, attention mechanism

PDF Full Text Request

Related items

1	Pyramid Pooling And Spatial Attention Optimized Deep Semantic Segmentation
2	Research On Image Semantic Segmentation Algorithm Based On Deep Learning
3	Image Semantic Segmentation Method Based On Attention Mechanism
4	Image Semantic Segmentation Based On Multi-level Feature Fusion And Attention Mechanism
5	Semantic Segmentation Of Codec Image Based On Multi-scale And Attention Mechanism
6	Research On Semantic Segmentation-Oriented Attention Mechanism And Multi-Scale Feature Cross-Layer Fusion
7	Recursive Network Based On Multiscale Pooling Attention And Hierarchical Feature Fusion For Image Super-Resolution
8	Image Semantic Segmentation Algorithms Based On Feature Fusion And Non-local Features
9	Research And Implementation Of Semantic Segmentation Of Urban Street View Images
10	Research On Real-Time Image Semantic Segmentation Using Deep Learning