An End-to-end Singing Voice Synthesis Method Based On Transinger

Posted on:2021-04-28

Degree:Master

Type:Thesis

Country:China

Candidate:X D Liu

Full Text:PDF

GTID:2415330611469727

Subject:Engineering

Abstract/Summary:

Singing voice synthesis is a technology derived from speech synthesis. It is a classic application of computer technology to music: electronic singing is synthesized from information extracted from a musical score. From early sample-concatenation methods, through parametric models, to current neural network models, the performance of singing voice synthesis systems has reached a level that listeners find acceptable. Current work is based on a serial model structure, in which the lyrics, pitch, beat, and other information in the score are used to train multiple models that work together. Building such a system usually requires many specialists working together, the models are difficult to train, and projects take a long time. Moreover, a serial model is prone to error accumulation, so the final synthesis quality is unstable. Inspired by related work in speech synthesis, this study proposes Transinger, a complete end-to-end synthesis front-end model, to simplify model construction and training. Based on the characteristics of the front-end model and of singing data, existing neural network vocoder models are studied and improved, the characteristics of the acoustic features and the voice data are analyzed, and comparative experiments are carried out on the front-end model. Finally, an end-to-end singing voice synthesis method is proposed. Experimental results show that the proposed method can synthesize high-quality, natural singing, and that the pitch condition control method improves the robustness and convergence speed of the vocoder model. In addition, this study attempts to synthesize the singing voices of multiple singers. It finds that, when singer identity information is added, a single Transinger model can synthesize the singing voices of multiple singers.
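The abstract does not describe how the score's pitch and beat information is prepared for the models. As a rough illustration only, and not the thesis's method, the sketch below expands score notes into a frame-level fundamental-frequency (F0) contour of the kind that could serve as a pitch condition. The MIDI note numbering, the 10 ms frame shift, the use of pitch 0 for rests, and the function names `midi_to_hz` and `score_to_frame_f0` are all assumptions made for this example.

```python
from typing import List, Tuple


def midi_to_hz(midi_note: float) -> float:
    """Equal-temperament conversion, with A4 (MIDI note 69) tuned to 440 Hz."""
    return 440.0 * 2.0 ** ((midi_note - 69.0) / 12.0)


def score_to_frame_f0(
    notes: List[Tuple[int, int]],
    frame_shift_ms: int = 10,  # assumed frame shift, not taken from the thesis
) -> List[float]:
    """Expand (midi_pitch, duration_ms) notes into one F0 value per frame.

    A pitch of 0 is treated as a rest and mapped to 0.0 Hz (assumed convention).
    Durations are truncated to a whole number of frames.
    """
    f0: List[float] = []
    for midi_pitch, duration_ms in notes:
        n_frames = duration_ms // frame_shift_ms
        hz = midi_to_hz(midi_pitch) if midi_pitch > 0 else 0.0
        f0.extend([hz] * n_frames)
    return f0


# Hypothetical three-event score: A4 for 500 ms, a 250 ms rest, A5 for 250 ms.
contour = score_to_frame_f0([(69, 500), (0, 250), (81, 250)])
print(len(contour), contour[0], contour[50])
```

A real system would typically also model vibrato, note transitions, and timing deviations, none of which this flat, piecewise-constant contour captures.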

Keywords/Search Tags:

Singing Voice Synthesis, Neural network, End-to-end, Vocoder

Related items

1	Research On Separation Of Singing Voice And Accompaniment In Music Signal
2	Research On Neural Network Based Tibetan Speech Synthesis Technique
3	Research On Malay Speech Synthesis Technology Based On End-to-End
4	Acoustic models for the analysis and synthesis of the singing voice
5	The Application Of Singing Techniques And Application In Singing
6	Research On Movie Recommendation Algorithm Based On Convolutional Neural Network And Recurrent Neural Network
7	Study On The Separation Algorithm Of Singing Voice And Accompaniment In Single Channel Music Signal
8	Objective Evaluation Of Artistic Voice Based On Acoustic Parameters
9	Emotion Recognition Of Mouse Track Based On Neural Network
10	Breeding context-dependent neural regulation of singing behavior in male European starlings (Sturnus vulgaris)