Research On Technologies Of Voice Conversion Based On Gaussian Mixture Model

Posted on:2012-08-15

Degree:Master

Type:Thesis

Country:China

Candidate:L L Zhao

Full Text:PDF

GTID:2218330338463146

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Voice conversion is a technique that modifies a source speaker?s speech to be perceived as if a target spearker?s, so that the source speaker?s voice sounds like the target speaker?s voice. It is an exciting new branch of speech signal processing. With the improvement of the people's quality of life, we are not only request the understanding of our voice, but also emphasize the individual characters of speech. So, the research of this technology both has great significance in theory and the value of application.The main work of this paper:First, discuss the model and system of vocal sound,the basic theory and method of voice conversion techniques, and the speech parameters. Study on the Gaussian mixture model (GMM) for performing spectral conversion between speakers .Second, the converted spectral are excessively smoothed by statistical Gaussian mixture model (GMM) algorithm. In order to address this problem, we propose a conversion method (GMM-GV) considering global variance feature. The emulation demonstrates that GMM-GV algorithm can overcome the problem effectively.Third, the spectral discontinuities due to independent mapping of subsequent frames. In order to address those problems, an enhanced conversion method(GMM-GV-Viterbi) incorporating frame selection algorithm is presented.Forth, F0 contour reflect individuality of speaker also. We propose an improved method (GMM-Viterbi) to modify prosody. The experiment indicate that the performance of proposed method is better than the statistical method(GMM).

Keywords/Search Tags:

Voice Conversion, Spectral Transformation, Frame Selection, Gaussian Mixture Model

PDF Full Text Request

Related items

1	Voice Conversion Based On GMM And Codebook Mapping
2	Voice Conversion Research Based On Spectral Envelope And Super-segmental Prosody
3	The Research Of Voice Conversion Based On The Spectral Parameters Of Vocal Tract
4	The Research On Vocal Tract Spectrum And Pitch Frequency Transformation In Voice Conversion
5	Research On Methods For Voice Covnersion
6	The Research Of Extracting Of Pathological Voice's Characteristics And Recognition Based On Wavelet Transformation And Gaussian Mixture Model
7	Emotional Speech Conversion Using Spectrum-Prosody Dual Transformation
8	Research On The Chinese Voice Conversion System Based On GMM
9	Key Algorithm In High Quality Voice Conversion System
10	Voice Conversion Algorithm Based On The Acoustic Characteristics Of Personality Study