Studies On Techniques For Chinese Speech Simulation System Based On Corpus

Posted on:2006-03-19

Degree:Master

Type:Thesis

Country:China

Candidate:J F Lu

Full Text:PDF

GTID:2178360185463328

Subject:Information and Communication Engineering

Abstract/Summary:

This thesis discusses on speech simulation technology which is based on the advanced corpus-based speech synthesis. And our research focuses on the manufacture of speech simulation system. Considering the main problems of traditional Mandarin Text-to-Speech System, in-depth research was conducted on a series of key techniques such as text prosodic level marking, corpus analysis and design, unit selection strategy and etc. We firstly take a glance back at the history of Mandarin speech synthesis technology whose defects is also indicated. It is important to manufacture the speech simulation system, so the generation of speech and the principium of speech synthesis are introduced. And the characteristic of Text-to-Speech(TTS) System is also shown.Then we introduce the text prosodic level marking in detail, analyze the prosodic level structure of sentences, point out the pause principle in sentences and the description of lexical word. We use C4.5 algorithm to label text prosodic levels automatically. Thirdly we discuss upon corpus analysis and design and determine how to select speech synthesis units.In the design of corpus, we carefully analyze the syllable distribution of corpus TH-CoSS, then classify the prosodic characters of this corpus and present out the distribution of every prosodic character. Based on prosodic character vector, we construct an error function which is used to select original corpus for simulation system, and show the distribution of prosodic characters for the original corpus. Greedy algorithm and corpus self-adaptive process are expatiated to set theoretical foundation for text material search.For the unit selection strategy, we achieve candidate units for target ones by calculating target cost based on prosodic character vector. Then we use Viterbi algorithm to select the best synthesis path for simulation speech by calculating concatenative cost of the synthetic waveform, thus a corpus-based speech simulation system comes into being, and speech simulated by this system has the original speaker's style, rather vivid.

Keywords/Search Tags:

speech simulation, corpus, prosodic word, C4.5 algorithm, Greedy algorithm, Viterbi algorithm

Related items

1	The Research Of Prosodic Control Algorithm And Realization For Chinese Speech Synthesis
2	Auto-constructing Speech Corpus With The Limited Text~2
3	Design And Implementation Of Speech Corpus
4	Recognition Of Prosodic Phrases Based On An Unlabeled Corpus And "Adhesion" Culling Strategy
5	The Research And Implementation Of Algorithm Of Isolated Word Speech Recognition
6	Research And Implementation Of The Prosodic Adjustment Algorithm For Mandarin Text-to-speech System
7	Embedded Speech Synthesis And Research And Achievement For Its Key Algorithm
8	The Research And Realization Of Uighur TTS System Using Variable Length Concatenating Units
9	Research On Syntactic Knowledge Mining And Extraction Based On English-chinese Parallel Corpus
10	Research And Application Of Search Algorithm For Continuous Speech Recognition