Voice Conversion Using Structured Gaussian Mixture Model In Eigen Space

Posted on:2015-03-25

Degree:Master

Type:Thesis

Country:China

Candidate:Y C Li

Full Text:PDF

GTID:2268330428499332

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

Voice conversion is an important branch in the field of speech signal processing, itstask is to keep the semantic information of voice remains unchanged, only change thespeaker’s personality, and make the personality traits of converted source speech closer tothe target speaker. Traditional voice conversion method most used parallel corpora to jointtraining source and target speaker’s voice model, and then derived the voice conversionfunction. But it is difficult in practice to obtain fully parallel corpora, and union speechmodel training cost a large amount of calculation, more importantly the voice conversionsystem between more speakers is complicated.Under the condition of non-parallel corpora without joint training, a new methodologyof voice conversion in eigen space based on structured Gaussian mixture model isproposed. For every speaker, after the cepstrum feature parameters are extracted, they arefurther mapped to the eigen space which is formed by eigen vectors of scatter matrix of thecepstrum features, then train speaker’s Structured Gaussian Mixture Model in the EigenSpace (SGMM-ES). The source and target speaker’s SGMM-ES are trained respectively，then based on Acoustic Universal Structure (AUS) principle to achieve spectrum transformfunction. Subjective and objective evaluations indicate the conversion performances arequite close to the traditional method under the parallel corpora condition. The results showthe eigen space based on structured Gaussian mixture model for voice conversion underthe non-parallel corpora is effective.This topic research content mainly includes these following aspects:1. Studied the basic principle and mathematical model of speech production, thecharacteristic parameters of voice are analyzed in detail, and skillfully mastering theSTRAIGHT analysis-synthesis platform. 2. Set up a GMM transformed voice platform with joint training under the parallel corpora,and analyzed the problems existing in the traditional voice conversion method.3. Further study of the Acoustic Universal Structure principle, and proposed a StructuredGaussian Mixture Model in the Eigen Space (SGMM-ES).4. Under the condition of non-parallel corpora without joint training, realized a voiceconversion system based on structured Gaussian mixture model in the Eigen Space.5. Using subjective and objective evaluations for the converted speech which obtained byGMM, SGMM, SGMM-ES methods, and analyzed the result in detail to verify theeffectiveness of the proposed method in this paper.

Keywords/Search Tags:

voice conversion, eigen space, non-parallel corpora, SGMM

PDF Full Text Request

Related items

1	Non-parallel Corpora Voice Conversion Based On Structured Gaussian Mixture Model Under Constraint Conditions
2	Research On Many To Many Voice Conversion Based On I-vector And Improved Variational Autoencoder For Non-parallel Corpora
3	Research On Many-to-Many Voice Conversion Based On I-vector,Variational Auto-encoder And Generative Adversarial Networks For Non-parallel Corpora
4	Non-parallel Many-to-Many Voice Conversion Based On SE-ResNet Combining Speaker Embedding
5	The Research On Voice Conversion Algorithm Based On Improved Bilinear Frequency Warping For Parallel Or Nonparallel Corpora
6	Voice Conversion Based On Isolated Speaker Model
7	Non-parallel Voice Conversion Using ACGAN And Variational Autoencoders Conditioned By Sentence Embedding
8	Research On Many-to-Many Voice Conversion Based On Multi-Scale StarGAN By Share-Learning For Non-parallel Corpora
9	High-quality Voice Conversion From Non-parallel Corpora Based On Variational Auto-encoder And Bottleneck Feature
10	Many-to-Many Voice Conversion Algorithm Based On Dense Net Star Generative Adversarial Network Combining I-vector For Non-parallel Corpora