The Key Technology Research About The Speech Recognition

Posted on:2016-09-24

Degree:Master

Type:Thesis

Country:China

Candidate:Y M Wang

Full Text:PDF

GTID:2308330473455815

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

With the rapid development of artificial intelligence, the technology of speech recognition is becoming the focus topic of research institutions around the world. People commit to make a machine that can understand human speech instruction, and also hope that we can control the machine through voice. Research and development of speech recognition will greatly convenient peopleâ€™s life in the near future.The processes of speech recognition include roughly: feature extraction, acoustic model training, linguistic model training and recognition algorithm. The whole process in this speech recognition is researched in this thesis, including feature extraction, model training and recognition algorithm. Based on the characteristics of Chinese phonetic pronunciation and computation problems such as comprehensive consideration, the Chinese voice initials and finals are selected as the basic acoustic unit. Besides this, semi-continuous HMM model is used as the acoustic model of initials and finals. Mel cepstrum coefficient is selected as the characteristic parameters in this thesis, and this paper deeply analyzes the MFCC parameters extraction process, and proposed an improved MFCC extraction algorithm that reduced calculation by nearly 50% than the traditional MFCC extraction algorithm, which greatly improved the efficiency of feature extraction.Sphinx system is a mature international system in speech recognition area. In this thesis, the acoustic model training tool sphinxtrain internal implementation and mainly relates to conduct the thorough research to the algorithm, and by adjusting the parameters to adapt the demand of Chinese speech recognition. In the end, based on CMU Sphinx this thesis builds and implements the mandarin continuous speech recognition system, and achieves pretty good effect. And the recognition rate of the numeric string comes to 98%, besides this, compare the recognition effect with different code book number, state number and feather extraction algorithm.

Keywords/Search Tags:

speech recognition, MFCC, acoustic model, Hidden Markov Model

PDF Full Text Request

Related items

1	The Study Of Feature Extraction And Acoustic Modeling In Speech Recognition System
2	The Recognition Model Research Based On Whole Acoustic Structure Features Of Speech Unit
3	Researching Of The Mongolian Acoustic Model Based On Speech Recognition
4	The Research On The Speech Recognition System In The Noisy Environment
5	VQ Combined With HMM Model Experimental Study On Speech Signal
6	Speech Recognition Speed Up Research Based On MFCC
7	Design Of Continuous Speech Recognition System Based On HMM
8	Study The Speech Recognition System Based On HMM
9	A Study On The Extraction Of Speech Depth In Tibetan Language And Its Speech Recognition
10	Study And Improve On The Mongolian Speech Recognition System