Font Size: a A A

Reserch And Implementation Of Chrildren Speech-Triaining System Based On ASR

Posted on:2007-08-28Degree:MasterType:Thesis
Country:ChinaCandidate:K W XuFull Text:PDF
GTID:2178360212465621Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the progress of modern computer technology, more and more computer application is involved in everyday life. When people use computer, speech exchange with computer may be the most direct and convenient way. Therefore, Speech recognition and synthesis has become a significant mark of science and technology development, which becomes one of the important fields in computer research and development. The technology of speech recognition relates to multi-science. The achievement in these fields has contributed to the development of speech recognition. So far, most speech recognition system is still in its infancy and some problems will arise if migrated from lab, which is much far from practicality.This paper discussed various algorithms of adaptive techniques, especially focused on two classical methods:MAP (Maximum a Posteriors) and MLLR (Maximum Likelihood Linear Regression). Then, a new approach is presented in this paper, integrating MAP and MLLR for incremental adaptation. In the new approach, the simplified MLLR module uses a single globe regression class to minimize the mismatches caused by the environment and speaker anatomical differences, and provides a more accurate initial model to the MAP processing. The incremental MAP module is used for a further subtle removal of phoneme-level variations, and to ensure the asymptotic properties of the whole approach. We use the new approach to improve the Microsoft SDK, which is highly effective in our experiments. The results demonstrate that the new approach can effectively deal with both the speaker and environment variations, and is well suited for the speech recognition.Based on the above theoretical research, this paper combined the modern educational technology and the demand of children's linguistic education; it successfully developed the software of children speech education by applying the improved Microsoft voice identification engine. It has fulfilled the functions of Chinese voice identification, the communication between VC++, Flash and the voice identification engine, the voice identification of Chinese and English, the correct and error cartoons of pronunciation, TTS, etc. This software is a successful tool of children's linguistic education by its intuitive images and practical features.The paper applied the research in the automatic voice identification technology to the children's linguistic education and gained satisfactory result, which has significance on two levels: the theoretical and the practical.
Keywords/Search Tags:ASR, Speech API, COM, Chrildren speech-training, adaptive
PDF Full Text Request
Related items