Research On Video Interpolation Based On Deep Learning

Posted on:2020-01-07

Degree:Master

Type:Thesis

Country:China

Candidate:Q Zhang

Full Text:PDF

GTID:2428330590474465

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

The video frame rate conversion technique is a technique for reconstructing an intermediate frame by using correlation information between adjacent two frames in the video and applying interpolation.Since the technology can remove redundant information in encoding and reduce the frame rate during video transmission,and reduce the amount of data transmitted by the video network,it can be applied to video compression or enhance video continuity.The traditional video interpolation method mainly includes two steps,namely optical flow estimation and pixel synthesis.In this method,the effect of video interpolation technique often depends on the quality of optical flow estimation,and the process of optical flow estimation is susceptible to obvious errors caused by occlusion and blur.With the development of deep learning,the video interpolation technology based on deep learning has also made new breakthroughs.Some researchers have made some success in using video convolutional neural networks to try video interpolation.In this thesis,the optical flow estimation and the deep learning in the traditional method are combined,and an end-to-end convolutional neural network model is proposed,which combines motion estimation and occlusion processing.We first use the improved FCN network model to calculate the bidirectional optical flow between the input images,and obtain two warped images based on the estimated bidirectional optical flow information and the input image wrap operation.To solve the occlusion problem,we use another GridNet network.The model re-estimates the bidirectional optical flow information of the image and predicts the visibility of the pixels of the interpolated frame,and finally linearly fuses the estimated information with the original image to form an intermediate frame.In this thesis,we also tried a variety of loss functions,and finally determined the loss function that weighted various loss functions such as L1 loss,perceptual loss,wrap loss,and smoothness loss.The experimental results show that the video frame interpolation network structure proposed in this thesis can effectively improve the quality of optical flow estimation and improve the occlusion problem,and can generate intermediate frames with realistic,natural and better quality.

Keywords/Search Tags:

frame interpolation, deep learning, optical flow estimation

PDF Full Text Request

Related items

1	Research On Interpolation Method And Application Of Sequence Images
2	Video Interpolation Technology Based On Deep Learning
3	Research On Video Frame Interpolation Algorithm Based On Global Optimal Optical Flow
4	Research On Video Frame Interpolation Based On Implicit Motion Estimation
5	Research On Recursive Models And Deep Learning Methods For Video Frame Rate Up-Conversion
6	Research And Implementation Of Key Technologies In Video Compression
7	Deep Learning Based Video Frame Interpolation Method
8	Research On Deep Learning Based Video Frame Interpolation Algorithm
9	Optical Flow Estimation Based Video Frame Interpolation In Complex Motion Scenes
10	Research On Pixel Flow Estimation Deep Models Based On Flow Boundaries Detection