Anti-noise Speech Recognition Technology Research Based On The ARM-Linux

Posted on:2009-04-30

Degree:Master

Type:Thesis

Country:China

Candidate:Z W Wang

Full Text:PDF

GTID:2178360242991785

Subject:Control theory and control engineering

Abstract/Summary:

Speech recognition has received more and more attention recently due to the important theoretical meaning and practical value. Up to now, most speech recognition is based on conventional linear system theory, such as Hidden Markov Model (HMM) and Dynamic Time Warping(DTW). With the deep study of speech recognition, nonlinear system theory method must be introduced to it. Recently, with the development of nonlinear-system theories such as artificial neural networks (ANN), chaos and fractal, it is possible to apply these theories to speech recognition. Therefore, the research in this recognition paper is oriented on the theory and application of the mixed model HMM-ANN, and the related algorithms and model are developed.Mandarin speech recognition technology and implement approach is studied in this thesis. Introduce the basis theory of speech recognition, include the math model of speech signal, pretreatment, CASA (Computational Auditory Scene Analysis) algorithms and feature parameter extraction algorithms, Aim at the occasion where the design was applied, improved the configuration of CASA, optimized its arithmetic, debate the mandarin speech recognition technology and the fundamental of CASA. The application of CASA distilled multiple pure speech signals. The speech feature extraction algorithms was expatiated and improved.The paper indicated the merit and defect of Liner Prediction Cepstrum Coefficient and Mel Frequency Cepstrum Coefficient. Discussed the extract method and operate process of MFCC detailedly. The performance of speech recognition and application characteristic of HMM (Hidden Markov Model) and SONN (Self Organizing Neural Networks) methods used in this paper is compared. Discussed the theory and model parameters of HMM, analyzed the extract method of each parameter and resolved three basic problems, explained the basic conception of ANN, BP network and SONN. The research improvement in this paper is oriented on the theory and application of the mixed model HMM-ANN, which is formed by the combination of the Continues Hidden Markov Mode (CDHMM) and SONN, and the related algorithms and model are developed. After the high-point list of speech signal was computed by means of CDHMM, For the same state, build the same dimension's speech character vector by DTW, and affiliate it into SONN speech recognition sort. The HMM-ANN model has the ability of modeling and static state classify.The paper designed the software and hardware structure of speech recognition system, researched the CASA and HMM-ANN model arithmetic under the ARM-Linux crossed complier, Tested speech recognition rate in several occasion. As result, compared by the former HMM model method, the ameliorative CASA and HMM-ANN arithmetic improved the veracity, anti-noise ability, the stability and self-adaptability of speech recognition, indicated model performance, proved the feasibility and validity, and indicate the direction of the research improvement.

Keywords/Search Tags:

mandarin speech recognition, hidden markov mode, artificial neural networks, feature extraction, self-organized feature mapping

Related items

1	Mandarin Digit Speech Recognition Based On HMM And ANN Model
2	Study Of Speech Recognition Algorithm Based On HMM And Artificial Neural Network
3	Research Of The Speech Recognition Technology Based On HMM
4	A Hybrid Approach For Speech Recognition Based On Combination Of HMM And RBF Neural Network
5	The Research Of Feature Extraction Algorithm On The Speaker-Independent Speech Recognition
6	People Independent Chinese Speech Recognition Based On HMM And ANN
7	Research And Implementation Of Speech Recognition Based On HMM/BP
8	Research Of Speech Recognition Based On Mixture Feature Extraction And Improved Continuous Hidden Markov Model
9	Research On Key Issues Of Mandarin Speech Emotion Recognition
10	Discriminative Methodologies For Tone Problem Solving In Mandarin Speech Recognition