Application Research On Statistical Language Model Of Large Vocabulary Continuous Speech Recognition System

Posted on:2010-06-16

Degree:Master

Type:Thesis

Country:China

Candidate:Q Zhang

Full Text:PDF

GTID:2178360278958996

Subject:Computer application technology

Abstract/Summary:

With the widely application of speech recognition technology, which will be a very important topic in pattern recognition fields, because its development will deeply influence the future of life and work, and also it will become the main technology in next ten years. Due to large vocabulary, speaker independent and continuous speech recognition system is one of the most difficult and challenging topics Among speech recognition technologies in recent years. So more and more institutes and coporations have taken part in research and development of the technology. Although large vocabulary continuous speech recognition (LVCSR) systems based on Chinese have been quickly developed in recent years, none of them is good enough for real application. So, further research in this area is necessary and significant.The language model is extremely important in LVCSR, Its performance directly affects the application ranges and effect of the whole speech recognition system. So building and updating a reliable language model is very important for speech recognition system.Firstly, the working principle, smooth technique, evaluation criterion and related principle of language model are discussed in detail in this thesis. Secondly, the corpus is built to meet the requirement of HLM (HTK language modeling) tool in the HTK (Hidden markov toolkit), through seting up the HTK modeling platform under Linux, and using Linux commands and programming scripts programs with Bash and Perl to preprocess and divide the original data. Thirdly, a Tri-gram language model is created by using HLM to train the data in corpus continuously and to evaluate the results. Fourthly, with the continuous change of the recognition contents, the performance of language model will get down and many out of vocabulary in speech recognition appear. So, a whole solution is put forward to automatically update language model by making use of HTML Parser and PDFBox develop toolkits to extract web content and PDF document as a data source, which can improve the performance of language model.In the end, the framework of LVCSR system is introduced, and the recognition effect of the language model built in this thesis is validated through an experiment in a speech recognition system. Because the recognition result may not be completely right, and an approach that can modify and adjust the contents and time of the recognition result is discussed, which can keep the consistency between the recognition result and the speech, and improve the correctness of speech recognition.

Keywords/Search Tags:

Large varcabulary continous speech recognition, Statistical language model, Tri-gram language model, Corpus, Language model automatic update

Related items

1	Research On Statistical Language Model Of Large-Vocobulary Continuous Speech Recognition System
2	Researching Of The Mogolian Language Model Based On Speech Recognition
3	Researching And Building Of The Mongolian Large Vocabulary Independent Continuous Speech Recognition System
4	Mongolian Language Model Based On Recurrent Neural Network
5	The Design And Implement Of A Mobiles' Chinese Input System Based On Statistical Language Model
6	The Optimization And Implementation Of The Efficiency And Performance Of Chinese Language Model Based On Recurrent Neural Network
7	Research Of Statiscal Language Model N-best Reranking Algorithm
8	An Automatic Chinese Text Categorization System Based On Statistical Language Model
9	Parallel Optimization Method In Language Model For Mandarin Speech Recognition
10	Research On Language Model Corpus Expansion And Text Error Correction Algorithm For Speech Transcription