Reverberation Environment Of Isolated Word Speech Recognition Research

Posted on:2014-02-09

Degree:Master

Type:Thesis

Country:China

Candidate:R Kong

Full Text:PDF

GTID:2248330398962848

Subject:Detection Technology and Automation

Abstract/Summary:

PDF Full Text Request

The robustness of speech recognition system is the key whether the technology ofspeech recognition can step into the stage of practical application. But the existence ofnoise or reverberation in real environment, especially in distant-talking speech recognitionsystem, reverberation will cause the change of amplitude, the delay of phase, the offset offormant and the production of other peaks in speech signals. What’s more, the tailingreverberant part also hides the weak energy sound of the following voice. It greatly reducesthe voice intelligibility, causing the decline of recognition rate. So it is important toovercome reverberation for realizing distant-talking speech recognition.This paper proposes one method combining the complex spectrum linear filteringwith U-GMM to improve the recognition rate of isolated word speech in the reverberationenvironment, which is on the basic research of reverberation basic characteristics andspeech recognition algorithm. The main research works included is as follows:This paper mainly studies the reverberation characteristic, against the features that thereverberation voice is added by many time orders and the pure signal is weaken. Accordingto this feature that the complex spectrum of pure speech signal in the complex spectrumdomain is commonly distributed near the origin, while the complex spectrum of roomimpulse response is far away from the origin. First, we take use of the discrete Fouriertransform the speech signal into the complex spectrum domain. Then, realize the reductionof reverberation through the blind deconvolution linear filtering the amplitude and phaseinformation of reverberation. At the same time, it will reduce the distortion of speech whilenot change the information of original speech.In order to solve the problem that the reverberation affection crosses multi-continuestime frame, this paper proposes a method to adaptively adjust the mean value andcovariance value of every frame in Gaussian mixture model (GMM) according to the prior vector of characteristics. Then take use of maximum expected algorithm parameters toachieve the optimal estimation. Combing the complex spectrum linear filter, the filteredreverberation speech is input into this model for matching recognition, in order to improvespeech recognition rate in reverberation condition.We conduct the experimental simulation using this method, taking use of subjectiveand objective evaluation to judge the reverberation speech recognition. The experimentalresults show that the recognition rate in this paper is obviously higher than other commonalgorithms, better removing reverberation and guaranteeing the minimum voice distortion.At last, this paper summary the research work, the shortcomings and deficiencies ofthe proposed algorithm and also explore the direction of future research.

Keywords/Search Tags:

reverberation, complex spectrum, GMM, speech recognition

PDF Full Text Request

Related items

1	Study On MFCC And Lasso Reverberation Suppression Of Feature Extraction Algorithm Of Speech Recognition
2	Research On Deep Learning Based Speech Dereverberation Method
3	Study On Methods For Speech Enhancement Based On Microphone Array In Complex Environment
4	Research On Speech Preprocessing Of Speech Recognition For Multi-talker Conversations In Complex Acoustic Environments
5	Research On Robust Speech Recognition In Complex Environments
6	The Research On Speech Signal Dereverberation Technology
7	Research On Speech Recognition In Complex Noise Environment
8	Study On Algorithms Of Preprocessing Of Noise Robust Speech Recognition
9	The effects of pitch, reverberation, and spatial separation on the intelligibility of speech masked by speech in normal-hearing and hearing-impaired listeners
10	Far-field Speech Recognition Based On Multi-task Learning Using Phase And Reverberation Information