Large Vocabulary Continuous Speech Recognition Research Based On HTK

Posted on:2017-05-28

Degree:Master

Type:Thesis

Country:China

Candidate:B L Li

Full Text:PDF

GTID:2308330488465244

Subject:Electronics and Communications Engineering

Abstract/Summary:

PDF Full Text Request

Exchanging information with the outside world is one of the fastest, most effective and widely used way of exchange. Speech recognition technology through a number of disciplines, and it will be involved in many areas. For example, statistics, physiology, acoustics, information theory and computer science, digital signal processing technology, applied psychology and pattern recognition theory, etc.. In recent years, the research on continuous speech recognition, a large vocabulary,which is a difficult task. Although large vocabulary continuous speech recognition system has made a series of achievements, but with its wide application, the system also showed its shortcomings. Especially in the large-scale application of the same pronunciation of different word recognition on the recognition accuracy of the defects and the noise environment can not be very good recognition of the difficulties. To this end, the deeper research of the system is of great significance and value.In this paper, we study the large vocabulary continuous speech recognition, and research is proposed aiming at the noise environment based on Mel frequency cepstral coefficient similarity of endpoint detection methods. Contrast triphone and tone modeling methods are excellent in the process of implementation of, and based on the similarity of MFCC speech endpoint detection technology further to recognize speech. The model construction by way of contrast respectively by single phonemes and phoneme modeling, the triphone models in establishing a full consideration of the issues related to the context, so this modeling recognition rate is better than single phoneme model. Finally, when adding different noises, the validity of the method is studied by studying the similarity of Melâ€™s MFCC.Finally, six groups of contrast experiments were done, and the recognition rates of 10dB, 0dB and-10dB were obtained in the traditional endpoint detection. The recognition rates were 43.04%,12.62%, and 6.07%, respectively. In the use of the Mel MFCC coefficient of the similarity of the endpoint detection are also added 10dB, 0dB and-10dB noise, respectively, the recognition rate of the sentence:45.06%,41.19% and 29.23%. With the decrease of the ratio of signal to noise, the correlation coefficient of MFCC is also slow down, but it can still get a better detection result.

Keywords/Search Tags:

Ontinuous speech recognition, Hidden Markov models, HTK, Three, phoneme model

PDF Full Text Request

Related items

1	Speech Recognition Method Based On Hidden Markov Models
2	Research On Speech Phoneme Recognition Based On Deep Learning
3	American Sign Language recognition: Reducing the complexity of the task with phoneme-based modeling and parallel hidden Markov models
4	Research Of Speech Recognition Based On Mixture Feature Extraction And Improved Continuous Hidden Markov Model
5	Two-dimensional hidden Markov models for speech recognition
6	Recognition, Hidden Markov Model-based And Multi-class Mapping
7	Research And Implementation Of Speech Recognition Algorithm
8	Research On Speech Recognition Algorithm And Application
9	People Independent Chinese Speech Recognition Based On HMM And ANN
10	Research And Implementation Of Speech Recognition Based On HMM/BP