Age-Voice Conversion System Driven By Multi-Parameter

Posted on:2016-09-13

Degree:Master

Type:Thesis

Country:China

Candidate:J Z Li

Full Text:PDF

GTID:2308330467994932

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Voice Conversion is a technique used in order to change the personality characteristics of a source speakerâ€™s voice into the target speakerâ€™s and make the source speakerâ€™s voice sound like the target speakerâ€™s, while preserving the original semantic information. Age-voice Conversion which is one kind of Voice Conversion refers to change the age information of a source speakerâ€™s voice into the target speakerâ€™s and make the source speakerâ€™s voice sound like the target speakerâ€™s. This paper focus on the study of age-voice conversion based on speakerâ€™s personality characteristics. Actually, we set up the system of age-voice conversion driven by speakerâ€™s personality characteristics parameter. The main work and innovation of this thesis contains the following parts:(1)We established a little age voice database collected from a male and a femaleâ€™s corpus recorded in different ages on the internet. Among them, the maleâ€™s corpus contains176sentences which were recorded at the maleâ€™s age of12,18and23. Also the femaleâ€™s corpus contains85sentences which were recorded at the femaleâ€™s age of12and20. The length of each sentence is between5seconds and10seconds.(2)We proposed an improved cross-pole linear prediction algorithm to solve the problem that formant extraction based on linear predictive coding (LPC) cannot handle the merge peaks and false peaks resulting in the less accurate of extracted formant frequencies. This algorithm can reduce the error caused by the cross-poles by modified the formant-poleâ€™s radius, and then can enhance the accuracy of extracted formant frequencies.(3) As we all know, different voice track length is an important factor leading to sounding differences among the same speakerâ€™s voice of different ages. To ensure the test voice can sound like the voice of target age after age-voice conversion, voice track length alignment is an indispensable technique. This paper analyzed two core issues that is frequency conversion factor estimation and frequency warping function selection in voice track length alignment technique, and constructed spectrum transformation model for the age-voice conversion. On this basis, we establish an age-voice conversion system driven by multi-parameter, and achieve a good effects about keeping personality characteristics of age-voice conversion.

Keywords/Search Tags:

age-voice conversion, linear predictive coding, voice track lengthalignment, frequency conversion factor, frequency warping function

PDF Full Text Request

Related items

1	Research Of Voice Conversion Based On Frequency Warping Method
2	Research On High Quality Voice Conversion Algorithm Based On Improved GMM And Frequency Warping
3	The Research On Voice Conversion Algorithm Based On Improved Bilinear Frequency Warping For Parallel Or Nonparallel Corpora
4	Human Voice Conversion Based On Parameter Models
5	The Research And Implementation Of Voice Conversion Technology
6	Research On The Voice Conversion System
7	Emotional Voice Analysis And Conversion Based On Parallel Corpus
8	The Research On Vocal Tract Spectrum And Pitch Frequency Transformation In Voice Conversion
9	Emotional Voice Conversion Based On StyleGAN With Fundamental Frequency Difference Compensation
10	Voice Conversion Based On GMM And Codebook Mapping