Study On Speech Separation And Speech Enhancement Methods

Posted on:2009-10-13

Degree:Master

Type:Thesis

Country:China

Candidate:X H Liu

Full Text:PDF

GTID:2178360242467445

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

In our lives, speech signals are often disturbed by various interferences such as background noise and room reverberation. The existing of noise and reverberation not only affects human hearing, but also has influence on other steps of speech signal processing. So it is important to enhance speech using signal processing technology. In fact, besides speech enhancement, speech separation can also get rid of the influence of noise and reverberation. Blind Source Separation (BSS) is a challenging subject and becomes to be a popular research area in signal processing field in recent years. In the last two decades, a number of algorithms addressing the instantaneous BSS problems have been proposed and gained some results. However, current results are far from its mature solution. Moreover research on real-world convolutive speech signals has just begun.This thesis focuses on the methods of speech separation and speech enhancement which can be divided into three parts:Signal sparsity based undetermined blind source separation algorithm. In the case of anechoic and undetermined mixing mode, the independent component analysis based blind source separation algorithm can not achieve desired performance. To solve this problem, signal sparsity based time-frequency masking algorithm and line orientation separation technique is studied. Theory analysis and computer simulations indicate that both of the algorithms have achieved desired performance. However, there is a large number of zeros in the mask constructed by time-frequency masking algorithm. To solve this problem, some improvement has been made on it. As a result, the quality of the enhanced speech signal has been improved further.Independent component analysis with reference. In the blind source separation algorithm, people usually know something about the sources. Using this a priori information so as to extract the desired source is the subject of the algorithm, independent component analysis with reference. Experiment results indicate that the algorithm can perfectly extract the interested original signal witch has the strongest relationship with the reference signal in some sense. At the same time, comparing to the conventional independent component analysis algorithm, it has less computation consuming.Speech enhancement based on blind source separation with post-processing in subband. In the case of noise and reverberation coexisting, the whole algorithm is studied. The method has achieved high performance in depressing noise and reverberation. Regrettably, the noise was depressed at the expense of speech distortion. To solve this problem, some improvements have been made on it. This is that independent component analysis operations in low frequency subbands are replaced by the efficient time-frequency masking method. Experimental results indicate that a higher performance has been achieved through these improvements.

Keywords/Search Tags:

Speech Enhancement, Time-Frequency Masking, Subband, Blind Source Separation

PDF Full Text Request

Related items

1	Study On Speech Separation And Speech Enhancement Methods
2	Study On Frequency Domain Blind Source Separation Based Speech Enhancement Methods
3	Research On Fast Blind Source Separation Algorithm Based On Frequency Domain Independent Component Analysis
4	Study On The Methods Of Time Delay Estimation And Speech Enhancement
5	The Research And Application Of Cardiorespiratory Sounds Separation Meathod Based On Blind Source Separation
6	Research On Underdetermined Blind Speech Separation Based On Sparsity In Time-frequency Domain
7	The Research Of In-Car Speech Enhancement Algorithm Based On Blind Source Separation
8	Research On Chaotic Masking And Blind Extraction Algorithm Of Speech Signal Under Multipath Channel
9	Blind Separation Of Multiple Speech Signals
10	Research On Blind Source Separation In Frequency Domain For Convolutively Mixed Speech Signals