Research On View Synthesis Technology Based On Multiplane Image Scene Representation

Posted on:2024-06-17

Degree:Master

Type:Thesis

Country:China

Candidate:J Y Wei

GTID:2568307136488124

Subject:Signal and Information Processing

Abstract/Summary:

Novel view synthesis is an active research topic in computer vision. A multiplane image (MPI) is a camera-centered, depth-layered explicit representation of a 3D scene that can effectively describe the scene geometry. A high-quality MPI scene representation yields better synthesized views, but artifacts and distortions that are difficult to eliminate still remain. To resolve these problems, the following research has been carried out:

(1) A new MPI novel view synthesis algorithm based on dense image feature extraction is proposed. It explicitly models the dependencies between convolutional feature channels in the encoding stage of the network and uses dynamic channel feature recalibration to strengthen the representational ability of the encoder. Dense feature extraction is applied to the input view images to obtain their geometric semantic features. Experimental results show that the MPI scene representation inferred by the algorithm from the input views accurately describes the geometric semantics of the scene, which improves the quality of the synthesized novel views.

(2) An MPI-based novel view synthesis algorithm that makes use of feature connections across depth layers is proposed. 3D convolutional residual blocks capture effective spatial features across multiple depth planes, which improves the prediction of the occupied regions of the MPI depth planes and, in turn, the prediction accuracy of the geometric semantics of each depth plane. Numerical experiments show that the algorithm effectively eliminates artifacts and distortions in the synthesized novel views in both view extrapolation and view interpolation tasks. Better numerical results are still obtained when the horizontal baseline width of the reference views is doubled without increasing the number of MPI depth planes.

(3) A new MPI novel view synthesis algorithm (Trans MPI) based on global feature modeling is proposed. A self-attention mechanism is added to the network architecture of Chapter 4 to overcome the inductive bias of convolutional networks when learning global semantic information. The Trans MPI network combines the extracted local features with a Transformer encoder to model global feature representations, thereby establishing long-range dependencies between features. Experimental results show that using self-attention to learn global and local features between consecutive depth planes further improves both the inference quality of the MPI scene representation and the quality of the synthesized novel views.
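For readers unfamiliar with how a depth-layered MPI becomes an image, the following is a minimal sketch of the standard back-to-front "over" compositing commonly used with MPI representations. The abstract does not give the thesis's own rendering code, so this is only an illustration under stated assumptions: the function names `composite_pixel` and `composite_mpi` are hypothetical, the planes are assumed to be already warped into the target view, and the planes are assumed to be ordered from farthest to nearest.

```python
def composite_pixel(colors, alphas):
    """Back-to-front 'over' compositing for a single pixel.

    colors: list of (r, g, b) tuples, one per depth plane, farthest plane first.
    alphas: list of opacities in [0, 1], in the same order.
    """
    out = (0.0, 0.0, 0.0)
    for color, alpha in zip(colors, alphas):
        # A nearer plane covers what lies behind it in proportion to its alpha.
        out = tuple(alpha * c + (1.0 - alpha) * o for c, o in zip(color, out))
    return out


def composite_mpi(planes):
    """Composite an MPI given as a list of planes, farthest plane first.

    Each plane is a 2D grid (list of rows) of (r, g, b, a) tuples.
    Assumption: the planes have already been warped into the target view,
    so this function only performs the per-pixel blending step.
    """
    height, width = len(planes[0]), len(planes[0][0])
    image = []
    for y in range(height):
        row = []
        for x in range(width):
            colors = [plane[y][x][:3] for plane in planes]
            alphas = [plane[y][x][3] for plane in planes]
            row.append(composite_pixel(colors, alphas))
        image.append(row)
    return image
```

In this sketch, an opaque red far plane behind a half-transparent blue near plane blends to an even mix of the two colors. Errors in the predicted alpha of any plane show up directly in the composite, which is consistent with the abstract's focus on predicting the occupied regions of each depth plane more accurately.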

Keywords/Search Tags:

multiplane images, explicit scene representation, view synthesis, convolutional neural network, self-attention mechanism

Related items

1	Research On Panoptic Segmentation Technology Of Street Scene Images Based On Convolutional Neural Networks
2	Research On Dynamic Scene Deblurring Based On Convolutional Neural Network
3	Scene Text Detection Algorithm Based On Convolutional Neural Network
4	Research On Image Semantic Segmentation Of Road Scene Based On Convolutional Neural Network
5	View-dependent Pixel Coloring: A physically-based approach for two-dimensional view synthesis
6	Object-based Representation For Scene Classification
7	Research And Implementation Of Semantic Segmentation Of Urban Street View Images
8	Research On The Visual Attention Mechanism For 3D Scenes
9	Explicit Object Representation by Sparse Neural Codes
10	A Research On The Improvement Method Of IMNet For Single-view 3D Reconstruction Based On Implicit Representation