
Research On Lanzhou-Dialect Speech Generation

Posted on: 2008-03-13  Degree: Master  Type: Thesis
Country: China  Candidate: Z Y Gan  Full Text: PDF
GTID: 2178360215968779  Subject: Circuits and Systems
Abstract/Summary:
This dissertation proposes methods for generating Lanzhou-dialect speech by speech conversion. It adopts the Pitch Target model as the intonation model, proposes generation methods for Lanzhou-dialect speech based on a Linear Modification Model (LMM) and a Gaussian Mixture Model (GMM), and proposes a speech modification method for generating Lanzhou-dialect speech with a variety of timbres. The main contributions of this dissertation are as follows:

First, the dissertation proposes an intonation modeling method for Lanzhou-dialect speech. It reviews the main existing intonation models and, based on the characteristics of Lanzhou-dialect speech, chooses the Pitch Target model as the intonation model.

Second, the dissertation proposes a generation method for Lanzhou-dialect speech based on the Linear Modification Model (LMM). In this method, model parameters are estimated for the Mandarin speech and the Lanzhou-dialect speech in the testing set, and the F0 contour of each is represented by a 7-dimensional parameter vector. A conversion function between the two sets of 7-dimensional parameters is then computed by linear regression. At the generation stage, the model parameters of the candidate Mandarin speech are estimated, the corresponding 7-dimensional parameters of the Lanzhou-dialect speech are calculated, its F0 contour is generated from them, and the Lanzhou-dialect speech is synthesized with the STRAIGHT algorithm.

Third, the dissertation proposes a generation method for Lanzhou-dialect speech based on the Gaussian Mixture Model (GMM). Because it is based on a statistical model, this method can work on a large corpus. First, the Pitch Target model is used to estimate the feature parameters of the Mandarin speech and the Lanzhou-dialect speech in the training set, and the GMM conversion parameters are trained on them. The GMM conversion parameters are then used to obtain the converted F0 contours of the Lanzhou-dialect speech, and the Lanzhou-dialect speech is synthesized with the STRAIGHT algorithm. The results show that increasing the size of the training speech set yields better synthesized speech.

Fourth, the dissertation proposes a method for generating Lanzhou-dialect speech with a variety of timbres. The parameters that influence how the speech is perceived are pitch, duration, aperiodicity index, and frequency spectrum. By modifying these four parameters of the dialect speech with the STRAIGHT algorithm, Lanzhou-dialect speech with a variety of timbres can be obtained. The results show that this method produces high-quality Lanzhou-dialect speech.
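The linear-regression step of the LMM method can be illustrated with a minimal sketch. The abstract does not say what the 7 parameters are or how the regression is set up, so the code below assumes an independent fit of y = a*x + b for each of the 7 dimensions. The function names fit_linear_map and convert are hypothetical, and the example data are synthetic, not taken from the thesis.

```python
from statistics import mean


def fit_linear_map(src_params, tgt_params):
    """Fit y = a*x + b by least squares, separately for each dimension.

    src_params, tgt_params: lists of equal-length parameter vectors
    (assumed here to be paired Mandarin and Lanzhou-dialect vectors).
    Returns one (a, b) pair per dimension.
    """
    dims = len(src_params[0])
    coeffs = []
    for d in range(dims):
        xs = [row[d] for row in src_params]
        ys = [row[d] for row in tgt_params]
        mx, my = mean(xs), mean(ys)
        var = sum((x - mx) ** 2 for x in xs)
        cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        a = cov / var if var else 0.0  # constant input: fall back to the mean
        coeffs.append((a, my - a * mx))
    return coeffs


def convert(params, coeffs):
    """Apply the fitted per-dimension map to one source parameter vector."""
    return [a * x + b for x, (a, b) in zip(params, coeffs)]


# Synthetic paired data: 5 utterances, 7 parameters each (made-up values).
mandarin = [[float(i + d) for d in range(7)] for i in range(5)]
lanzhou = [[2.0 * x + 1.0 for x in row] for row in mandarin]

coeffs = fit_linear_map(mandarin, lanzhou)
predicted = convert([10.0] * 7, coeffs)
```

In a full system the converted vector would be turned back into an F0 contour and passed to the synthesizer. That step depends on the Pitch Target parameterization and on STRAIGHT, so it is left out of this sketch.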
Keywords/Search Tags: Lanzhou dialect, Pitch Target model, GMM, speech conversion, STRAIGHT algorithm