The Research Of Mongolian Speech Synthesis System Based On Verb's Affix And Stem

Posted on:2010-08-10

Degree:Master

Type:Thesis

Country:China

Candidate:C M Bao

Full Text:PDF

GTID:2178360278467623

Subject:Computer software and theory

Abstract/Summary:

With the rapid development of computer and information technology, making human-machine interfaces more natural and friendly has become an active research topic, and spoken interaction is one of its main areas of focus. Speech synthesis is a key technology in spoken interaction, and it draws on computer science, linguistics, phonetics, speech signal processing, and psychology. Mongolian is one of the official languages of the Inner Mongolia Autonomous Region and also has some international influence. Research on and implementation of Mongolian speech synthesis therefore has both significance and practical value for advancing Mongolian information processing.

This thesis focuses on building a Mongolian text-to-speech (TTS) voice database based on the linguistic and phonetic characteristics of Mongolian. It also covers prosodic feature extraction for Mongolian speech, prosody modeling, prosody adjustment, and speech synthesis. The main contributions are as follows.

First, in designing the Mongolian voice database, we studied the characteristics of Mongolian speech and chose verb stems and affixes, nominal affixes, and other whole phrases as the basic units for synthesis. We also collected basic units from a large corpus. We then recorded and segmented these units and built the voice database.

Second, for the extraction and modeling of Mongolian prosody, we analyzed natural Mongolian speech and compiled statistics on it, and from these we summarized the rules governing prosodic variation in Mongolian. This variation shows up mainly as changes in pause, duration, accent, and tone at the word and sentence levels.

Last, for prosody adjustment and speech synthesis, we adjusted the prosody of the synthesized speech with the TD-PSOLA algorithm, which improved the naturalness of the synthesized Mongolian speech. During synthesis we combined soft concatenation and hard concatenation to achieve a better result.
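For readers unfamiliar with TD-PSOLA, the following is a minimal sketch of the general technique, not the implementation used in the thesis. It assumes a fully voiced signal with a constant, already-known pitch period, so pitch marks fall at fixed intervals; a real system has to estimate pitch marks from the recording. The function name `td_psola` and its parameters are hypothetical and chosen only for illustration.

```python
import math


def td_psola(x, period, pitch_factor=1.0, duration_factor=1.0):
    """Simplified TD-PSOLA for a voiced signal with a constant, known pitch period.

    x               : list of float samples
    period          : analysis pitch period in samples (assumed constant)
    pitch_factor    : > 1 raises the pitch, < 1 lowers it
    duration_factor : > 1 lengthens the signal, < 1 shortens it
    """
    n = len(x)
    # Periodic Hann window spanning two pitch periods. Copies of this window
    # shifted by one period sum to 1, so unmodified resynthesis is lossless.
    window = [0.5 - 0.5 * math.cos(math.pi * i / period) for i in range(2 * period)]

    # Analysis pitch marks sit at k * period; keep only marks whose
    # two-period grain lies fully inside the input.
    last_mark = n // period - 1
    if last_mark < 1:
        raise ValueError("signal must span at least two pitch periods")

    out_len = int(n * duration_factor)
    out = [0.0] * out_len
    # Spacing of synthesis marks sets the new pitch.
    new_period = max(1, round(period / pitch_factor))

    s = period  # first synthesis mark
    while s + period <= out_len:
        # Map the synthesis time back to analysis time and pick the nearest
        # analysis mark; grains are repeated or skipped to change duration.
        k = int(s / duration_factor / period + 0.5)
        k = min(max(k, 1), last_mark)
        start = (k - 1) * period
        # Overlap-add the windowed grain centred on the synthesis mark.
        for i in range(2 * period):
            out[s - period + i] += x[start + i] * window[i]
        s += new_period
    return out
```

With this sketch, `td_psola(x, 50, duration_factor=1.5)` would stretch a signal whose pitch period is 50 samples to 1.5 times its length while keeping the grain spacing, and `pitch_factor=1.2` would place grains closer together to raise the pitch. No amplitude normalization is applied when the pitch changes, which is a simplification.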

Keywords/Search Tags:

Mongolian, Speech Synthesis, TD-PSOLA Algorithm, Stem, Affix

Related items

1	The Research On Mongolian Speech Synthetical-System Based On Etyma And Affix For Finite Words
2	Study And Implement Of Uighur TTS System
3	Research And Implementation Of Mongolian Emotional Speech Synthesis System Based On Deep Learning
4	Speech Synthesis Algorithm Research And FPGA Implementation
5	An Improved Speech Synthesis Method
6	The Research Of Speech Synthesis Naturalness Based On Computer
7	Pitch Detection Algorithm And Its Application In Speech Synthesis
8	Applied Research On Dam Monitoring System Based On The Speech Synthesis Of Automatic Segmentation And Psola
9	The Research Of Mongolian Speech Synthetical Technology
10	Research On End-to-End Mongolian Speech Synthesis Method