Font Size: a A A

Multiple ANN/HMM Hybrid Used In Speech Recognition

Posted on:2009-10-12Degree:MasterType:Thesis
Country:ChinaCandidate:M Y LiuFull Text:PDF
GTID:2178360278964584Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
Speech is the most natural and familiar interactive way for human,and current days, speech recognition and speech synthesis are ascendant.In the case of adequate training samples, isolated word recognition has got very gratifying achievements. In some cases, however, the number of training samples may not see the need for training model. In order to obtain acceptable recognition rate the model must be improved further. Based on the original timing model which combines Artificial Neural Networks and Hidden Markov Model(ANN/HMM), this paper studies a multiple and hybrid recognition method to get high rate by establishing model complementary for defferent features.Artificial Neural Network (ANN) is used as model int the status class with features of anti-noise, anti-variant, adaptive, learning ability, high recognition speed,and is also the model of the basic unit of the object to be recognised. As the model of the whole pattern, Hidden Markov Model (HMM) has strong ability to deal with time-series. In this method, the combination of ANN and HMM is on the frame level. The output error of ANN is used to estimate output probability of one state of HMM. Furthermore, a method of auto-split-and-merge the state number is used to determine the state number of a model.In this method,states are automatically added or deleted on a proper position according to the training data.We split the states with low modeling precision,delete the redundant ones,and finally achieve a balance.On the basis of above model, we propose a multiple ANN/HMM hybrid model, which segments features with competitive learning mechanism and reduces the cost of storage and calculation of systems with the method of reorganizing features adaptively.This method can use the adaptive learning capacity of ANN to ensure the system's good performance.Taking speech commands for example,we compare the modeling effects between this method and traditional ones.The results show that this multiple model can improve the modeling precision and rate not with consuming resources of system massively.In order to put the research achievement into use,we developed a simple multi-mode Human-Machine Interaction system.In this system,we can speak to give orders to the computer in a more natural way. With the use of this system, it has the characteristics of a fast response and high recognition rate.
Keywords/Search Tags:speech recognition, ANN/HMM, optimize of state number, multiple hybrid model, method of reorganizing features adaptively
PDF Full Text Request
Related items