Research On Video Semantic Segmentation In Compressed Domain

Posted on:2021-12-06

Degree:Master

Type:Thesis

Country:China

Candidate:T Yang

Full Text:PDF

GTID:2518306050954679

Subject:Precision instruments and machinery

Abstract/Summary:

PDF Full Text Request

Semantic segmentation is a basic task in the field of computer vision.It aims to assist the computer to understand the scene by classifying the pixels of the image.It has a very wide range of applications in real life.In recent years,breakthroughs have been made in image semantic segmentation tasks.Image semantic segmentation algorithms based on deep learning methods have made unprecedented progress in segmentation accuracy and inference delay.However,in most application scenarios,the image acquisition unit collects a continuous image sequence,and this sequence contains important space-time correlation information.Directly using image semantic segmentation algorithms to process image sequences will ignore this correlation,bring huge computing requirements and high inference delay,and affect the application of semantic segmentation algorithms in practical problems.Aiming at the problem that processing the original image sequence directly leads to the complexity of the algorithm calculation,this thesis proposes an algorithm for video semantic segmentation in the video compression domain.The algorithm is based on video compression stream,and the semantic segmentation results are obtained by partial decoding of video stream.According to the characteristics of video stream structure,the video frame is divided into key frame and non-key frame,and different methods are adopted.When processing the key frame,the corresponding video stream is fully decoded to obtain the original image,and then the semantic segmentation network is used to segment the image.For non-keyframe,this thesis obtains the motion vector from the code stream,and generates the non-keyframe feature image by feature propagation combined with the keyframe feature image,and finally generates the corresponding segmentation image based on the nonkeyframe feature image.Since the motion vector obtained from the video bitstream contains noise,this thesis uses the method of median filtering to process the motion vector.In order to verify the performance of the proposed compressed domain video semantic segmentation method,experiments carried out on Cityscapes dataset and Cam Vid dataset.The experimental results show that,compared with the frame-by-frame model,the method proposed in this thesis has a large increase in reasoning speed without a significant decrease in the segmentation accuracy.Compared with DFF,which adopts optical flow to carry out feature propagation,the proposed method has performance advantages in both precision and speed of segmentation.The experimental result strongly demonstrate that the proposed method can utilize video compressed domain information to realize semantic segmentation of video reasonably and effectively.

Keywords/Search Tags:

Video semantic segmentation, Video compressed domain, Feature propagation

PDF Full Text Request

Related items

1	Research On Video Segmentation And Feature Extraction In The H.264 Compressed Domain
2	Research On Compressed Domain Video Summarization Technology Based On HEVC
3	Feature-based Video Retrieval In Compressed Domain
4	Study And Application About Key Frame Extraaction Algorithm Of Video Based On Compressed Domain
5	Study On Shot Detection System In MPEG Compressed Video
6	Low-latency Video Semantic Segmentation Algorithm Based On Deep Learning
7	Research On Video Semantic Segmentation Algorithm Based On Deep Learning
8	The Study Of Image/Video Signal Processing Methods In The Compressed Domain Based On Compressed Sensing
9	Research On Several Issues In Video Semantic Annotation
10	Spatio-temporal segmentation in the compressed domain