Research On Uyghur Continuous Speech Recognition System Based On HTK

Posted on:2009-11-23

Degree:Master

Type:Thesis

Country:China

Candidate:M Tao

GTID:2178360245985508

Subject:Computer application technology

Abstract/Summary:

Speech recognition converts speech data into a text sequence and is a main component of human-computer interaction. As the technology has developed, it has progressed from the initial isolated digit recognition to speaker-independent, large-vocabulary continuous speech recognition.

Uyghur belongs to the Turkic branch of the Altaic language family. It is an agglutinative language, so a very large number of words can be produced from the same root by adding suffixes. Uyghur words are pronounced as concatenations of phonemes, governed by the language's own rules of vowel harmony and consonant harmony. Based on these characteristics, this thesis builds a Uyghur continuous speech corpus consisting of speech data from 64 speakers and studies the selection of recognition units for Uyghur continuous speech recognition. On the basis of that study, the triphone is selected as the basic recognition unit, and a triphone acoustic model is built with the Hidden Markov Model Toolkit (HTK). Several methods are used to improve the precision of the models, including decision-tree clustering, tied-state triphones, fixing the silence models, and increasing the number of Gaussian mixture components. At the word level, a statistical bigram language model is used, which suits the characteristics of Uyghur speech.

Finally, a variety of recognition experiments were carried out on test databases in the DOS environment, using the acoustic model and bigram language model built above. The experimental results show a sentence recognition rate of 68.98% and a word recognition rate of 94.65%. In addition, secondary development on top of the HTK tools in the VC2005 programming environment produced a Uyghur continuous speech recognition system, which was used for real-time speech recognition experiments. In these experiments the sentence recognition rate reached 63.31% for male speakers and 65.67% for female speakers, and the word recognition rate reached 90.25% and 91.40%, respectively.
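The statistical bigram language model mentioned in the abstract can be illustrated with a minimal sketch. It assumes plain maximum-likelihood estimation without smoothing or back-off; the abstract does not say how the thesis estimated its bigrams, and this is not the HTK tooling it used. The function names (`train_bigram`, `sentence_probability`), the sentence boundary markers `<s>` and `</s>`, and the toy corpus of placeholder tokens are all assumptions made for illustration.

```python
from collections import Counter


def train_bigram(sentences):
    """Estimate maximum-likelihood bigram probabilities P(w2 | w1).

    Hypothetical helper: no smoothing or back-off is applied, so unseen
    word pairs receive probability zero.
    """
    history_counts = Counter()
    pair_counts = Counter()
    for words in sentences:
        # Assumed boundary markers for sentence start and end.
        tokens = ["<s>"] + list(words) + ["</s>"]
        # Every token except the last can serve as a history word.
        history_counts.update(tokens[:-1])
        pair_counts.update(zip(tokens[:-1], tokens[1:]))
    return {
        pair: count / history_counts[pair[0]]
        for pair, count in pair_counts.items()
    }


def sentence_probability(model, words):
    """Multiply the bigram probabilities along one sentence."""
    tokens = ["<s>"] + list(words) + ["</s>"]
    prob = 1.0
    for pair in zip(tokens[:-1], tokens[1:]):
        prob *= model.get(pair, 0.0)
    return prob


# Toy corpus of placeholder tokens, not real Uyghur data.
corpus = [["a", "b"], ["a", "c"], ["a", "b"]]
model = train_bigram(corpus)
```

In this toy corpus the pair ("a", "b") occurs in two of the three sentences that contain "a", so its estimated probability is 2/3. A practical recognizer would normally add smoothing or back-off so that word pairs unseen in training do not get zero probability, which matters for an agglutinative language with many word forms.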

Keywords/Search Tags:

Uyghur, Hidden Markov Model, triphone, Bigram, HTK

Related items

1	Design And Building Of The Handwriting Uyghur Words Recognition Based On HTK
2	HMM-Based Recognition Research On Online Handwriting Uyghur
3	Study And Improve On The Mongolian Speech Recognition System
4	Based On Combination Strategy Online Uyghur Handwriting Recognition
5	Researching Of The Mongolian Acoustic Model Based On Speech Recognition
6	Research On Multiresolution Hidden Markov Model For Image Denoising
7	Study And Design Of Specific Character Speech Recognition Based On Embedded System
8	The Contourlet-based Statistical Models For SAR Images Denoising
9	Research On Hidden Markov Model And Its Application To Image Recognition
10	The Study On Key Technologies Of Realistic Chinese Visual Speech Synthesis