Speech Enhancement Based On HMM In Mel-Frequency Spectral Domain

Posted on:2016-01-20

Degree:Master

Type:Thesis

Country:China

Candidate:Z Z Gao

Full Text:PDF

GTID:2308330503950472

Subject:Electronic Science and Technology

Abstract/Summary:

PDF Full Text Request

The traditional single channel speech enhancement methods have made a lot of progress, but the enhanced speech obtained by these methods often cause speech distortion or remain ?music noiseâ€˜ and other issues when the non-stationary noise is present in real life. Therefore, how to get a better enhanced speech in non-stationary noise environments becomes a topic which needs to be solved in practical applications. In this thesis, a complete set of speech enhancement method using the Mel-frequency spectral domain hidden Markov model(MFS-HMM) is proposed.The contribution of this thesis is composed of the following three parts:First of all, based on the existing MFS-HMM speech enhancement method, an improved speech enhancement algorithm based on MFS-HMM is proposed. This method is generally considered as a weighted-sum filtering of the noisy speech. In the improved algorithm, both HMMs are parall trained in Mel-frequency spectral domain and in log-magnitude domain in order to solve the spectral distortion problem caused by inaccurate estimation of the filter. Then, the vector Taylor series(VTS) is adopted to estimate the parameters of HMM for noisy speech so that the HMM of noisy speech fits the noisy speech better and the weighted-sum filters are more suitable for noisy speech. The parall training of HMM and the usage of VTS make the background noise suppressed effectively and the subjective and objective qualities of the enhanced speech are improved greatly.Secondly, in order to balance the energy mismatch between the training data and test data, an energy adjustment method is proposed for MFS-HMM based speech enhancement. In this method, the online estimation of the log-energy adjustment factor for clean speech and noise is obtained by the iterative expectation maximization(EM) method, and the parameters of HMMs of clean speech and noise are modified online, respectively. This makes the energy of training data match to the test data better and the effect of energy mismatch on the enhanced speech is reduced efficiently. The subjective and objective qualities of the enhanced speech are obviously improved.Finally, the proposed speech enhancement algorithm is applied into ITU-T G.718 speech coder at 12 kb/s. The Objective and subjective quality test results show that the proposed method is superior to the speech enhancement method embedded in G.718 encoder, and can obtain a high quality for the output speech in noise environment.

Keywords/Search Tags:

speech enhancement, hidden Markov model, parallel model, vector Taylor series, enengy adjustment

PDF Full Text Request

Related items

1	Study On Statistical Model Based Speech Enhancement Algorithms
2	Speech Recognition Method Based On Hidden Markov Models
3	Chinese Speech Recognition Based On Vowel Length Adjustment
4	Research On Anti-Noise Of Speech Recognition Based On Continuous Hidden Markov Model
5	Mandarin Speech Synthesis System And Rhythm Adjustment
6	Research On Method Of Unit Selection Speech Synthesis Based On Hidden Markov Model
7	Research Of Speech Recognition Based On Mixture Feature Extraction And Improved Continuous Hidden Markov Model
8	VQ Combined With HMM Model Experimental Study On Speech Signal
9	HMM-based Chinese Part-of-Speech Tagging And Improvement
10	Research Of The GMM-HMM Based Acoustic Models