Multiple ANN/HMM Hybrid Used In Speech Recognition

Posted on:2009-10-12

Degree:Master

Type:Thesis

Country:China

Candidate:M Y Liu

Full Text:PDF

GTID:2178360278964584

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Speech is the most natural and familiar interactive way for human,and current days, speech recognition and speech synthesis are ascendant.In the case of adequate training samples, isolated word recognition has got very gratifying achievements. In some cases, however, the number of training samples may not see the need for training model. In order to obtain acceptable recognition rate the model must be improved further. Based on the original timing model which combines Artificial Neural Networks and Hidden Markov Model(ANN/HMM), this paper studies a multiple and hybrid recognition method to get high rate by establishing model complementary for defferent features.Artificial Neural Network (ANN) is used as model int the status class with features of anti-noise, anti-variant, adaptive, learning ability, high recognition speed,and is also the model of the basic unit of the object to be recognised. As the model of the whole pattern, Hidden Markov Model (HMM) has strong ability to deal with time-series. In this method, the combination of ANN and HMM is on the frame level. The output error of ANN is used to estimate output probability of one state of HMM. Furthermore, a method of auto-split-and-merge the state number is used to determine the state number of a model.In this method,states are automatically added or deleted on a proper position according to the training data.We split the states with low modeling precision,delete the redundant ones,and finally achieve a balance.On the basis of above model, we propose a multiple ANN/HMM hybrid model, which segments features with competitive learning mechanism and reduces the cost of storage and calculation of systems with the method of reorganizing features adaptively.This method can use the adaptive learning capacity of ANN to ensure the system's good performance.Taking speech commands for example,we compare the modeling effects between this method and traditional ones.The results show that this multiple model can improve the modeling precision and rate not with consuming resources of system massively.In order to put the research achievement into use,we developed a simple multi-mode Human-Machine Interaction system.In this system,we can speak to give orders to the computer in a more natural way. With the use of this system, it has the characteristics of a fast response and high recognition rate.

Keywords/Search Tags:

speech recognition, ANN/HMM, optimize of state number, multiple hybrid model, method of reorganizing features adaptively

PDF Full Text Request

Related items

1	Study On Distant Hybrid Speech Recognition Method
2	Research On Isolated Word Speech Recognition Based On HMM And ANN Hybrid Model
3	Local Feature Analysis,Extraction,and Model Validation For Speech Emotion Recognition
4	The Research And Realization Of Mandarin Digit Speech Recognition System Based On Optimum State Number
5	Speech Emotional Recognition Research Based On Features Extraction And Multi-modal Combination
6	Exploring deep learning methods for discovering features in speech signals
7	Research For Algorithm Of Speech Recognition Based On WD/HMM
8	Research On Robust Speech Recognition Based On Deep Learning In Adverse Environment
9	Research On The Speech Emotion Recognition Fusing Articulatory And Acoustic Features
10	Speech Recognition Technology Based On Hybrid Model Of HMM And DNN