Constructing Of The Embedded Mandarin Speech Corpus And Unit Searching

Posted on:2009-06-16

Degree:Master

Type:Thesis

Country:China

Candidate:J L Wang

Full Text:PDF

GTID:2178360245996433

Subject:Circuits and Systems

Abstract/Summary:

PDF Full Text Request

Corpus-Based speech synthesis approach has been applied in many state-of-art speech synthesis systems because it can produce synthetic voice of high naturalness. But high-quality speech synthesis system has high computing complexity and large resource consuming, so it is limited in the application on server and desk platform. In recent years along with the embedded technology's unceasing development, the application of embedded speech synthesis technology has become the inevitable tendency. But because the storage capacity and processing power of embedded system is limited, thus limiting the application of the speech synthesis technology in embedded systems. Therefore, how to resolve the conflict between the effect of synthesis and the consumption of resources, realize the applications of speech synthesis technology in the field of embedded system has become an urgent problem.Aiming at the small capacity and the finite capability of compute of embedded equipment, a transformative CART and improved K-center cluster based reducing algorithm, which can select the most representative units from the initial speech corpus to reduce the footprint of the units inventory, and an unit selection algorithm with low complexity are described in this paper.First, we take the accent syllable as the unit, adopt transformative CART to carry on pre-classification, and carry on the statistical analysis to the large corpus about the type of syllable, the total number of syllable and so on. And then, we respectively take the accent syllable and the initial/final as the unit, with frequency and duration, use improved K-center cluster algorithm to carry on cluster reducing at different proportion to get a tailored database which covers the most prosody situations of primary corpus. Finally, we use tailored database and improved selecting algorithm to build the speech synthesis system. We validate the algorithm of cluster reducing and unit selecting by carrying out subjective listening and objective testing. The result shows that the tailored database has high intelligibility and naturalness.

Keywords/Search Tags:

Embedded Speech Synthesis, Classification and Regression Tree, Clustering Reducing, Unit Searching

PDF Full Text Request

Related items

1	A Research On Speech Synthesis Based On Statistical Modeling And Pronunciation Error Detection
2	A Study On Concatenative Speech Synthesis Based On Unit Embeddings
3	Research On Deep Learning Based Small-Sized Unit Concatenation Speech Synthesis
4	Research On HMM-Based Cross-Lingual Speech Synthesis
5	Research Of Chinese Speech Synthesis Technology Based On Speech Database
6	Speech Synthesis Algorithm And Embedded Speech Synthesis Systems Realization
7	Research On Embedded Tibetan Speech Synthesis System
8	Research On Unit Selection Concatenation Speech Synthesis Method Based On Deep Learning
9	Research On Statistical Acoustic Model Based Unit Selection Speech Synthesis Method
10	Research And Realization Of Embedded Speech Synthesis System