Study On Algorithms Of Preprocessing Of Noise Robust Speech Recognition

Posted on:2008-06-16

Degree:Master

Type:Thesis

Country:China

Candidate:J B Li

GTID:2178360218952695

Subject:Detection Technology and Automation

Abstract/Summary:

Poor noise robustness is one of the major obstacles to the commercial use of speech recognition. Although prevailing speech recognition systems achieve high accuracy on clean speech, their performance degrades rapidly in noisy environments because of the mismatch between the acoustic models and the test speech, which makes current recognizers unsuitable for practical applications. This thesis studies preprocessing for speech recognition in noisy environments, covering endpoint detection, speech enhancement and feature extraction.

First, endpoint detection is studied; it is a prerequisite for speech enhancement and for effective extraction of voice features. Detection algorithms based on short-time average energy, short-time average zero-crossing rate and spectrum variance are examined in depth. Building on an analysis of the shortcomings of these algorithms, endpoint detection algorithms based on adaptive subband spectral entropy and power entropy are proposed. Experimental results show that they perform well under different noise conditions.

Second, speech enhancement is studied; it is not only a prerequisite for effective extraction of feature parameters but also a vital step in text-to-speech and speech coding. Traditional algorithms, namely spectral subtraction, Wiener filtering and MMSE amplitude estimation, are described theoretically and validated experimentally. An improved spectral subtraction algorithm is then presented, and experimental results show that it is also effective.

Finally, feature extraction, one of the key parts of speech recognition, is studied. The common feature parameters LPCC and MFCC are described theoretically, and a novel feature, perceptual cepstral coefficients based on the minimum variance distortless response (PMCC), is proposed. Recognition experiments using the three features were carried out under different SNRs. The results indicate that the proposed feature outperforms LPCC and MFCC.
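
The abstract does not give the details of the proposed endpoint detectors, so the following is only a minimal sketch of the general idea behind spectral-entropy endpoint detection, not the thesis's adaptive subband algorithm. It assumes plain full-band entropy, non-overlapping frames, and a midpoint threshold; the function names, the frame length of 64 samples, and the synthetic test signal (a sine tone standing in for speech) are all illustrative assumptions. Broadband noise spreads its power over many frequency bins and so has high spectral entropy, while a frame dominated by a few strong components has low entropy.

```python
import cmath
import math
import random


def power_spectrum(frame):
    """Power spectrum over bins 0..N/2 via a naive DFT (fine for short frames)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(frame, eps=1e-12):
    """Shannon entropy (in nats) of the normalized power spectrum of one frame."""
    power = power_spectrum(frame)
    total = sum(power) + eps
    probs = [p / total for p in power]
    return -sum(p * math.log(p + eps) for p in probs)


def frame_entropies(signal, frame_len=64):
    """Entropy of each non-overlapping frame; a trailing partial frame is dropped."""
    count = len(signal) // frame_len
    return [spectral_entropy(signal[i * frame_len:(i + 1) * frame_len])
            for i in range(count)]


def detect_speech_frames(signal, frame_len=64, threshold=None):
    """Flag frames whose entropy is below the threshold as speech-like.

    Assumption: with no threshold given, the midpoint between the lowest and
    highest frame entropy is used. This is an illustrative rule only.
    """
    entropies = frame_entropies(signal, frame_len)
    if not entropies:
        return []
    if threshold is None:
        threshold = (min(entropies) + max(entropies)) / 2.0
    return [e < threshold for e in entropies]


def demo_signal(frame_len=64, seed=0):
    """Synthetic signal: 4 noise frames, 4 tone-plus-noise frames, 4 noise frames."""
    rng = random.Random(seed)
    span = 4 * frame_len
    lead = [rng.gauss(0.0, 0.1) for _ in range(span)]
    tone = [math.sin(2 * math.pi * 8 * t / frame_len) + rng.gauss(0.0, 0.1)
            for t in range(span)]
    tail = [rng.gauss(0.0, 0.1) for _ in range(span)]
    return lead + tone + tail
```

Calling `detect_speech_frames(demo_signal())` returns one boolean per frame, and the first and last flagged frames give the estimated endpoints. A practical detector would add overlapping windowed frames, a noise-adaptive threshold, and smoothing of the per-frame decisions.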

Keywords/Search Tags:

endpoint detection, speech enhancement, feature extraction, spectrum entropy, spectrum subtraction, minimum variance distortless response

Related items

1	Research On Speech Recognition In Noisy Environment
2	Endpoint Detection Algorithm For Speech Signal In Low SNR Environment
3	Research And Implementation Of Voice Laser Modulation Signal Enhancement Technology Based On Amplitude Spectrum Estimation
4	Research And Implementation On Noisy Speech Endpoint Detection Algorithm
5	Research For The Algorithms Of Speech Enhancement
6	Research On Speech Enhancement Based On Noise Spectrum Estimation And Signal To Noise Ratio Constraint
7	Research On Ear Speech Enhancement Algorithm Based On Subband Analysis
8	Voice Signal Front-end Processing Technology Research
9	Research On Simultaneous Speech Detection And Magnitude Squared Spectrum Estimation Approach For Speech Enhancement
10	A Speech Enhancement System Based On The Auditory Characteristics And The Speech Spectrum Characteristics