GMM Voice Conversion System Based on Time-Length Change

Posted on:2016-06-29

Degree:Master

Type:Thesis

Country:China

Candidate:J Y Luo

Full Text:PDF

GTID:2308330464462036

Subject:Signal and Information Processing

Abstract/Summary:

A speech signal contains not only linguistic content but also information about the speaker's individual characteristics. Voice conversion modifies the personal characteristics of a source speaker's voice so that it carries the characteristics of a target speaker, while leaving the spoken content unchanged. The technology touches almost every aspect of speech signal processing and is currently one of the more active topics in the field. Prosodic features have a great influence on the naturalness and intelligibility of the speech synthesized by a conversion system.

This thesis proposes an algorithm that changes the time length of the converted prosodic features, built on the traditional Gaussian mixture model (GMM) voice conversion system. It compensates for the low naturalness of the voice produced by the traditional GMM system and also improves the intelligibility of the converted speech. We study several key technologies used in GMM-based voice conversion, and then apply both objective and subjective evaluation to the converted speech to determine how well the conversion system performs. The main work is as follows:

1. The voice conversion system based on length transformation not only meets the basic requirements of voice conversion, but also addresses the shortcoming that speech synthesized by the conversion system sounds unnatural and rough. Starting from the mechanism of voice production, we study the speech analysis model suitable for a voice conversion system, its corresponding speech parameters, and the conversion algorithm used in the system. The focus is on the main conversion algorithm of the GMM-based voice conversion system, which is implemented in simulation; subjective and objective test results are then obtained.

2. To address the low naturalness that is prevalent in traditional voice conversion systems, we propose and implement an improved voice conversion algorithm based on length change. It alters the time scale of the speech by inserting the converted parameters before and after each word. Listening tests report that the naturalness and intelligibility of the converted voice are better than before.

3. In the voice conversion system based on the improved algorithm, MFCC features are extracted because they are better suited to sound perception. 3-D MFCC diagrams and waveforms of the voices before and after conversion are given. The test results confirm that the converted speech not only approximates the characteristics of the target speaker, but is also more natural and understandable.
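As a rough illustration of the time-length change described in item 2, the sketch below pads each word with extra parameter frames. The abstract does not say which frames are inserted or how many, so everything here is an assumption made for illustration: the function name `lengthen_words`, the `pad` parameter, the choice to repeat the word's first and last frames, and the representation of word boundaries as frame-index pairs. It should not be read as the thesis's actual algorithm.

```python
from typing import List, Sequence, Tuple

Frame = List[float]  # one converted parameter vector (e.g. one MFCC frame)


def lengthen_words(
    frames: Sequence[Frame],
    words: Sequence[Tuple[int, int]],
    pad: int = 1,
) -> List[Frame]:
    """Return a longer frame sequence with padding around every word.

    ASSUMPTION: "inserting the converted parameters before and after each
    word" is modelled here as repeating the word's first frame `pad` times
    before it and its last frame `pad` times after it.

    `words` holds (start, end) frame indices, end exclusive, sorted and
    non-overlapping.
    """
    out: List[Frame] = []
    cursor = 0
    for start, end in words:
        if not (cursor <= start < end <= len(frames)):
            raise ValueError(f"bad word boundary: ({start}, {end})")
        # Frames between words are copied unchanged.
        out.extend(list(f) for f in frames[cursor:start])
        # Padding before the word.
        out.extend(list(frames[start]) for _ in range(pad))
        # The word itself.
        out.extend(list(f) for f in frames[start:end])
        # Padding after the word.
        out.extend(list(frames[end - 1]) for _ in range(pad))
        cursor = end
    # Trailing frames after the last word.
    out.extend(list(f) for f in frames[cursor:])
    return out


# Toy example: five one-dimensional frames, two words.
frames = [[0.0], [1.0], [2.0], [3.0], [4.0]]
words = [(1, 3), (3, 5)]
stretched = lengthen_words(frames, words, pad=1)
```

With `pad=1`, each word gains two frames, so the five-frame toy input grows to nine frames. A real system would presumably pick the inserted frames and the amount of lengthening from the target speaker's prosody rather than from a fixed constant.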

Keywords/Search Tags:

voice conversion system, time-scale improvement, Gaussian mixture model, target speech

Related items

1	Key Algorithm In High Quality Voice Conversion System
2	Research On Technologies Of Voice Conversion Based On Gaussian Mixture Model
3	Research On Methods For Voice Conversion
4	Research On The Chinese Voice Conversion System Based On GMM
5	Voice Conversion Based On GMM And Codebook Mapping
6	Voice Activity Detection Based On Sequential Gaussian Mixture Model
7	Voice Conversion Algorithm Based On The Acoustic Characteristics Of Personality Study
8	Research On Modeling Methods For Voice Conversion
9	The Research Of Voice Conversion Based On The Spectral Parameters Of Vocal Tract
10	The Research On Restoration Of Throat Microphone Speech