Research On Convolutional Neural Network Based Distortion Prediction For Image Compression

Posted on:2021-05-24

Degree:Master

Type:Thesis

Country:China

Candidate:S H Wang

Full Text:PDF

GTID:2428330629985311

Subject:Photogrammetry and Remote Sensing

Abstract/Summary:

PDF Full Text Request

In signal compression,distortion information is significant for rate distortion optimization.In this paper,we propose a convolutional neural network(CNN)to predict distortion information for H.265/HEVC.With the strong representation power of CNN,structural similarity(SSIM)maps can be predicted directly in an end-to-end,pixel-to-pixel way.Different from traditional CNNs which focus on learning one-to-one mappings from input to output,we show that our CNN model can predict SSIM maps conditioned on quantization parameters(QPs),and realize one-to-many mappings.To construct our CNN network,QP labels are designed as conditions to feed the CNN model.Four residual blocks are used for better feature extraction.We also apply symmetrical network architecture and multi-level feature fusion method to ensure our network can utilize both high-level semantic features and low-level structure features.With an exactly symmetrical structure,our network can fuse the features of symmetrical layers easily since they have the same feature size.Symmetrical structure also ensures the prediction accuracy because we don't have to scale the feature to the target size for feature fusion,which will change the original value because of interpolation.The experiments on MS COCO database demonstrate the effectiveness of our CNNbased method for SSIM prediction.We show that SSIM maps can be predicted conditioned on QP labels without training models separately for each QP.Our multiple QP prediction model even outperforms the single QP prediction model,which demonstrates the robustness and effectiveness of the proposed CNN model.In addition,the proposed QP label enables our CNN model to train a unified model for all QPs,to solve time-consuming problem in training process.A unified model is also more suitable in practical applications with QP label as the condition to decide the outputs of distortion maps.We can also seek different input labels that can be the conditions of our CNN model to realize more generalized distortion prediction in one unified CNN model.In this way,our model can not only predict SSIM maps conditioned on QP labels,but also conditioned on any other parameters related to the distortion information.

Keywords/Search Tags:

SSIM, distortion prediction, convolutional neural network, H.265/HEVC, feature fusion, QP label

PDF Full Text Request

Related items

1	Research On Rate-distortion Optimization For HEVC Video Coding
2	Research Of HEVC Optimization Based On Texture Features And Visual Attention
3	Facial Beauty Prediction Research Based On Multi-Feature Fusion
4	The Research On Fast CU Partition And Intra Mode Decision Algorithm For HEVC
5	Research On Single Color Image Object Detection Method Based On Convolutional Neural Network
6	Gait Recognition Research Based On Feature Fusion Convolutional Neural Network
7	A Large-scale Facial Beauty Prediction Research Based On Deep Feature
8	Research On Algorithm Of Neural Network Data Label Generation
9	Research On Saliency Prediction Model And Its Application Based On Neural Network
10	Research On Fast Algorithm Of High Efficiency Video Coding Based On HEVC Standard