Reserch And Implementation Of Chrildren Speech-Triaining System Based On ASR

Posted on:2007-08-28

Degree:Master

Type:Thesis

Country:China

Candidate:K W Xu

Full Text:PDF

GTID:2178360212465621

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the progress of modern computer technology, more and more computer application is involved in everyday life. When people use computer, speech exchange with computer may be the most direct and convenient way. Therefore, Speech recognition and synthesis has become a significant mark of science and technology development, which becomes one of the important fields in computer research and development. The technology of speech recognition relates to multi-science. The achievement in these fields has contributed to the development of speech recognition. So far, most speech recognition system is still in its infancy and some problems will arise if migrated from lab, which is much far from practicality.This paper discussed various algorithms of adaptive techniques, especially focused on two classical methods:MAP (Maximum a Posteriors) and MLLR (Maximum Likelihood Linear Regression). Then, a new approach is presented in this paper, integrating MAP and MLLR for incremental adaptation. In the new approach, the simplified MLLR module uses a single globe regression class to minimize the mismatches caused by the environment and speaker anatomical differences, and provides a more accurate initial model to the MAP processing. The incremental MAP module is used for a further subtle removal of phoneme-level variations, and to ensure the asymptotic properties of the whole approach. We use the new approach to improve the Microsoft SDK, which is highly effective in our experiments. The results demonstrate that the new approach can effectively deal with both the speaker and environment variations, and is well suited for the speech recognition.Based on the above theoretical research, this paper combined the modern educational technology and the demand of children's linguistic education; it successfully developed the software of children speech education by applying the improved Microsoft voice identification engine. It has fulfilled the functions of Chinese voice identification, the communication between VC++, Flash and the voice identification engine, the voice identification of Chinese and English, the correct and error cartoons of pronunciation, TTS, etc. This software is a successful tool of children's linguistic education by its intuitive images and practical features.The paper applied the research in the automatic voice identification technology to the children's linguistic education and gained satisfactory result, which has significance on two levels: the theoretical and the practical.

Keywords/Search Tags:

ASR, Speech API, COM, Chrildren speech-training, adaptive

PDF Full Text Request

Related items

1	Research On Statistical Parametric Mandarin-Tibetan Cross-lingual Speech Synthesis
2	Research And Application Of Pronunciation Detection For Deaf Children Rehabilitation
3	Discriminative Training Based On TANDEM For Speech Assessment And Evaluation System
4	Structured Deep Learning For Adaptive Speech Recognition
5	Research On Affective Speech Synthesis
6	End-to-End Speech Synthesis Based On Multi-Language Modeling
7	Research Of Speech Recognition And Its Application In The Speech Error Identifying System
8	Research On Ultra Low Bit Rate Speech Coding
9	Research On Speech Enhancement With Adaptive Dual Data Stream
10	Research On Key Techniques Of Mono Speech Enhancement