Research On Real-time Semantic Segmentation Network Technology

Posted on:2022-04-27

Degree:Master

Type:Thesis

Country:China

Candidate:Y Zhang

Full Text:PDF

GTID:2518306731492644

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Semantic segmentation is one of the three major tasks of computer vision,which is based on classification and classifies each pixel of the image to produce a more detailed segmentation map than target detection.In recent years,semantic segmentation has been widely used in pedestrian detection and face recognition.In particular,autonomous driving has higher requirements for real-time semantic segmentation technology.However,there are still a series of problems in the current research methods,such as the inconsistency within the class and the indifference between the classes,that is,one objects are divided into two categories,different kinds of objects next to each other are divided into the same category,the spatial information recovery is not detailed enough,and the balance between speed and accuracy needs to be further solved and improved.Based on real-time semantic segmentation technology,this thesis proposed a two-times fusion pyramid semantic segmentation network,which can be divided into two parts: feature extraction and pyramid fusion.The feature extraction part uses lightweight network as the backbone,and the network is divided into three levels:shallow,medium and deep module according to the resolution and information of feature map.Iterative connections are carried out within each level to enhance feature reuse,which can keep the network Lightweight while enhancing feature extraction capabilities.Some networks do not make full use of the output characteristics of different levels.In the part of pyramid fusion,the main purpose of this part is designed to solve these problems.In order to effectively integrate these different levels of feature information,we use a two-times fusion method to integrate the different levels of spatial and semantic information.We add a semantic augmentation module between the pyramid levels to further improve the segmentation effect and to some extent make up for the loss of spatial information of pyramid pooling module,achieved the effective fusion of spatial details and advanced semantics.This makes it possible to improve the segmentation effect as much as possible while keeping the model light so as to achieve a balance between speed and accuracy.Finally,this article conducts experiments on many datasets such as CIFAR10,CIFAR100,Cityscapes and CamVid to verify the effectiveness and versatility of the model,which shows that our network achieves a balance between speed and accuracy.

Keywords/Search Tags:

Semantic segmentation, Lightweight model, Pyramid two-times fusion, Semantic augmentation module

PDF Full Text Request

Related items

1	Research On Image Semantic Segmentation Algorithm Based On Deep Learning
2	Research On Lightweight Semantic Segmentation Algorithm Based On Semantic Relocation
3	Research On 3D Semantic Surface Reconstruction Of Large Scale Scenes Based On RGB-D Video Sequence
4	Research On Semantic Segmentation Of Cityscape Based On Deep Neural Network
5	Research On Key Techniques Of Image Semantic Fusio
6	Research And Implementation Of Semantic Segmentation Of Indoor Scene Image Based On Embedded Platform
7	Research On Real-time Semantic Segmentation Algorithm Based On Deep Learning
8	Pyramid Pooling And Spatial Attention Optimized Deep Semantic Segmentation
9	Image Semantic Segmentation Based On Pyramid Pooling And Attention Mechanism
10	Research On Semantic Segmentation Algorithm Based On Feature Fusion And Deep Learning