Font Size: a A A

Post-Processing Technique For Speech Recognition

Posted on:2009-02-12Degree:DoctorType:Dissertation
Country:ChinaCandidate:B WuFull Text:PDF
GTID:1118360278965432Subject:Signal and Information Processing
Abstract/Summary:PDF Full Text Request
Mandarin Large vocabulary Chinese continuous speech recognition has been researched for more than ten years. Although there are some achievements in continuous speech recognition research, the distance from widespread application is still very long. The post-processing of speech recognition is a processing which converts Pinyin to Chinese characters. The research shows that the post-processing of speech recognition has very important significance to improve the system performance. The experiment on hearing indicates that human can only hear 70% syllables in continuous speech and understand the remaining 30% using context knowledge. Therefore, the post-processing technique of speech recognition is paid a great attention and conducted in-depth studies.In this thesis, we will make a deep research on the post-processing technique of Mandarin large vocabulary continuous speech recognition, including language model adaptation, decoding strategy and error handling. The main contributions and innovations are described in details as follows:1. Chinese confusion network algorithmsAt the beginning, we study the minimum bayes risk decoding rule and some minimum word error rate decoding methods. According to the characteristic of Chinese linguistics, we proposed an improved algorithm of constructing Chinese word confusion network. The improved algorithm fleetly adds null arc in confusion set when the long arc with a Chinese character string is forcibly aligned in the process of constructing a confusion network. This improved algorithm were evaluated on 2005 HTRDP (863) Evaluation task, where improved word accuracy performance was observed.In general, a Chinese word consists of 1-4 single Chinese characters, so its pronunciation time length change quite large. So we proposed an novel Chinese character confusion network algorithm for the purpose of decreasing the number of null arc. Experimental result proves that this algorithm cut the character error rate of recognition results effectively.2. Research on detection and correction of decoding resultsOn the basis on analysis of decoding errors and reason, we proposed a method that we use transformation-based learning for learning error correction rules from Chinese word confusion network. Experimental result shows significant improvements over recognition results.Considering the complexity of Chinese and the limited corpus for learning error correction rules, we use statistical methods to detect and correct decoding errors. In details, we use SVM to classify the decoding results, detect the errors; then we use Chinese character confusion network to correct errors. Experimental result shows that this method can effectively detect and correct decoding errors, and reduce the character error rate.3. Study on discriminative language model of speech recognition Firstly, we study three discriminative methods of language modeladaptation, including the boosting algorithm, the perceptron algorithm and the minimum sample risk algorithm, and present comparative experimental results on the performance of using different approaches to train discriminative language model on the task of speech recognition. Then we use the perceptron algorithm with best discriminative performance to train discriminative language model for general domain Mandarin large vocabulary continuous speech recognition, and rescore the Chinese word confusion network. Experimental result shows that this method can effectively reduce the word error rate.
Keywords/Search Tags:post processing, decoding, minimum bayes risk rule, discriminative language model, error detection, error correction
PDF Full Text Request
Related items