Based Hmm Can Be Training Vietnamese Speech Synthesis System

Posted on:2012-02-16

Degree:Master

Type:Thesis

Country:China

Candidate:L Y He

Full Text:PDF

GTID:2218330338955885

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

Speech synthesis is the process that reappear the understandable voice signal through the special hardware equipment or the computer. The speech synthesis technology is one of key technologies in realizing the man-machine speech communication, and establishing a spoken language system which has both heard and oral ability. The research of Speech synthesis technology had more than 200 years history, but the latter-day speech synthesis technology which has the practical significance is truly develops along with the development of computer technology and the digital signal processing technology.Over the past few decades, methods of speech synthesis mainly have:Articulatory Synthesis, Source-filter Model Synthesis, Unit-Selection Synthesis and Trainable TTS. These methods have merits and drawbacks respectively, but comparatively speaking, Trainable TTS has higher degree of automation, and small dependence with the different pronunciation person, the different pronunciation style, and the different language. Based on these characteristics, this thesis has selected a trainable speech synthesis method based on the Hidden Markov Model (HMM) to carry on the construction of synthesis system.Vietnam located at the east of Indo-China Peninsula, Southeast Asia. It is bounded on Yunnan of China, which has brought the frequent exchange of language culture and related talented people of two places, as well as the region superiority of researching Vietnamese new phonetic technology. Therefore this thesis has studied the Vietnamese speech synthesis system, and hoped that the result of research can be used to practice and the Vietnamese man-machine interaction can be realized. The main jobs of this thesis are as follows:(1) Expounding the fundamental principle of HMM, and introducing the procedure of constructing a trainable speech synthesis system which based on HMM.(2) Introducing the phonetic features of Vietnamese, and reviewing the present situation of Vietnamese speech synthesis. On that basis, proceeding data preparation of vietnamese speech synthesis system. The data preparation work mainly includes: Constrution of Corpus, determination of phoneme tabulation, labelling training data as well as design of context attribute and question collection. The most important part of work is labelling the training data. In this thesis, we select a method that combines the handwork with the procedure to carry on corpus labelling.(3) In the Cygwin platform, according to the training process of STRAIGHT synthesizer, using 500 labeled sentences to accomplish the model training process and back-end synthesis part of vietnamese speech synthesis system. Generating waveform using synthesizer after rhythm labeling of synthetic sentences.The experimental result shows that the STRAIGHT synthesizer for Vietnamese speech synthesis is feasible. Subsequent work should be realized on the automatic analysis of Vietnamese texts, as well as improving the naturalness of synthetic speech.

Keywords/Search Tags:

Speech Synthesis, Vietnamese, Hidden Markov Model, Trainable Speech Synthesis, STRAIGHT Synthesizer

PDF Full Text Request

Related items

1	HMM-based Trainable Speech Synthesis For Dai Language
2	Hidden Markov Model-based Speech Synthesis Technology Research
3	A Study On Speech Synthesis And Visual Speech Synthesis Based On Neural Networks
4	Research On Method Of Unit Selection Speech Synthesis Based On Hidden Markov Model
5	Mandarin Speech Synthesis System And Rhythm Adjustment
6	Research On Statistical Parametric Speech Synthesis Integrating Speech Production Mechanisms
7	Research On Statistical Acoustic Model Based Speech Synthesis
8	Research On Acoustic Modeling Methods In Statistical Parametric Speech Synthesis
9	Research On Emotion Speech Synthesis And Building Based On HMM
10	Research On Speech Synthesis Method Integrating Subjective Evaluation And Feedback