The Speech Enhancement System Based On Binary Mask And Perceptual Wavelet Packet Transform

Posted on:2012-06-19

Degree:Master

Type:Thesis

Country:China

Candidate:Y Y Shen

Full Text:PDF

GTID:2218330368492423

Subject:Detection Technology and Automation

Abstract/Summary:

PDF Full Text Request

In general,speech is often corrupted inevitably by surrounding environment or transmission medium. Interferenced speech signals can not only cause auditory fatigue, but also reduce the performance of the speech signal processing system, such as speech coding, speech recognition. In order to eliminate noise effects, it's very necessary to study on speech enhancement technique.Based on studying on the spectral subtraction methods, we proposed a new speech enhancement system based on binary mask and perceptual wavelet packet transform. The main works are as following:Aiming at the problem that methods of masking property only include the simultaneous masking, this thesis proposes a temporal masking factor to combine simultaneous masking with temporal masking. It's closer to human auditory perception characteristics. To segregate the residual noise from the speech distortion, we define a differential wavelet coefficient as the difference between the wavelet coefficients of the clean speech and the enhanced speech. We treat the differential wavelet coefficients as a linear superposition of speech distortion and residual noise, and define a cost function to combine them. According to the constraint conditions that the energy of residual noise is kept below the masking threshold, we minimize the speech distortion to optimize the gain function. And then we can get the optimal subtraction parameters to enhancing the noisy speech efficiently.Aiming at the problem that the present algorithms can cause the unvoiced speech to be damaged, we propose a speech enhancement algorithm based on binary mask. According to the computational auditory scene analysis, we segment the noisy unvoiced speech to many time-frequency units, and identify each unit as either target-dominated or masker-dominated. The target-dominated units will be retained and the masker-dominated units will be removed. Finally we synthesize the enhanced unvoiced speech and the voiced speech enhancing by the method based on perceptual wavelet packet transform to get the whole enhanced speech.Both subjective and objective evaluation criterions are conducted on the speech enhancing by the methods.Simulation results show that comparing with other methods, the proposed system can better remove background noise, restrain residual noise, and minimize speech distortion, meanwhile protecting the unvoiced speech.At last,this thesis raises the shortcomings of this method and the problems that haven't been solved,and gives the direction of further study and improving.

Keywords/Search Tags:

speech enhancement, binary mask, perceptual wavelet packet transform, unvoiced speech enhancement, masking property

PDF Full Text Request

Related items

1	Research On Speech Enhancement Algorithm Based On Wavelet Packet Adaptive Threshold
2	Research On Speech Enhancement Algorithm Based On Prior Signal-to-noise Ratio Estimation
3	The Research Of Speech Enhancement Algorithm
4	The Research Of Speech Enhancement Algorithms Based On Wavelet Transform
5	Research And Implementation Of Speech Enhancement Algorithm
6	Speech Enhancement Technique Research In Low SNR Circumstance
7	Robust Supervised Single Channel Speech Enhancement In The Wavelet Domain
8	The Algorithm Research Of Speech Enhancement Based On Wavelet
9	The Research On Speech Enhancement Based On Wavelet Packet Transform Algorithms
10	Speech Enhancement Based On Auditory Masking And Auditory Wavelet Packet Decomposition