Research On Statistical Language Model Of Large-Vocobulary Continuous Speech Recognition System

Posted on:2007-12-27

Degree:Master

Type:Thesis

Country:China

Candidate:R Zou

Full Text:PDF

GTID:2178360185968216

Subject:Signal and Information Processing

Abstract/Summary:

Speech recognition is an important topic in pattern recognition field, because its development will deeply influence the future of human-computer interface. Among speech recognition technologies, large vocabulary, speaker independent, continuous speech recognition system is the most difficult and challenging one. Although many large vocabulary speech recognition systems (LVSRS) have been built up in recent years, none of them is satisfying and good enough for real applications. So, further research in this area is necessary and of great importance.Language model is a mathematics model that describes inherent disciplines of natural language. Along with the development of Corpus Linguistics, corpus-based statistical language model turn out to be an attractive alternative to manually constructed linguistic grammar, and widely applied to many research areas of natural language processing.The aim of this thesis is to construct a word-based context Chinese language model. The construction principle, evaluation criterion and current main problems of language model are discussed. Firstly a large, comprehensively sorted Chinese Corpus is constructed. The context is preprocessed and detached by word. CMU_Cam_Toolkit is used to finish the training and evaluation of language model. And a lot of experiments have been done to improve the performance of language model, such as the selecting of corpus contexts, the algorithm of preprocessing, the vocabulary file, smoothing and the adjusting the parameter of the toolkit. And these improve the word correction rate of speech recognition system greatly. The grapheme-to-phoneme conversion is a very important module for language model. It's used to connect the language model and acoustic model of speech recognition system. This thesis proposes to sort the phoneme of polyphone characters to high-frequency and low-frequency phoneme, and this method can largely reduce the labor consuming and time consuming. This thesis also proposed a data structure that can largely reduce the searching time. This model can be used in whole-word...

Keywords/Search Tags:

Speech recognition, HMM, N-gram language model, Chinese Corpus, CMU_Cam_Toolkit, smoothing algorithm, grapheme-to-phoneme conversion

Related items

1	Research On Key Technologies For Building Multilingual TTS Text Corpus
2	Research Of Uyghur N-gram Model And Smoothing Algorithm
3	Researching Of The Mogolian Language Model Based On Speech Recognition
4	Application Research On Statistical Language Model Of Large Vocabulary Continuous Speech Recognition System
5	Researching And Building Of The Mongolian Large Vocabulary Independent Continuous Speech Recognition System
6	Research And Application Of Statistical Language Model
7	A Chinese Multimodal Corpus Using Depth Information
8	Research On The Technologies Of HTK Based Uyghur Continuous Phoneme Recognition
9	Grapheme-to-phoneme conversion and its application to transliteration
10	Research On Tibetan Language Model For Continuous Speech Recognition