Font Size: a A A

Study Of Speech Enhancement And Blind Separation Algorithm With Multi-input

Posted on:2006-11-20Degree:MasterType:Thesis
Country:ChinaCandidate:S F OuFull Text:PDF
GTID:2168360155953057Subject:Signal and Information Processing
Abstract/Summary:PDF Full Text Request
In our lives, Speech is often corrupted acoustically by ambient noise whichproduces aesthetically undesirable effects on the performance of digital voiceprocessor and even diminishes communication system ability to conveyinformation across the interface. Therefore a speech enhancement system isstrongly needed whose responsibility is improving the speech quality and ensuringreliability of digital voice communication systems.Depending on the number of microphone, speech enhancement algorithmscan be divided into two categories, one is single-input and other is multiple-inputalgorithm. There is only one channel of speech signal needed in conventionalsingle-input system which is banded with the attributes of speech signals and basedon the attenuation of the varieties of noise, such as spectral magnitude subtraction,adaptive noise canceling and auditory masking and so on. These algorithms whichare easy to realize with hardware and simple for construction have been applied tomodern communication systems. While capable of improving speech quality inrestrictive environments (additive noise, no multiple channel, high to moderatesignal-to-noise ratio (SNR), single source), these approaches do not perform well inthe cases of reverberant distortions, competing sources, and severe noise conditions.In recent years, the use of microphone arrays has received considerable attenuationas a means for dramatically improving the performance of traditional singlemicrophone systems and amounts of approaches have been proposed, such as delayand sum beamformer, adaptive beamformer, and speech signal model etc..In addition, there is a very important accessory measure, namely voice activitydetection (VAD) in digital speech processing. By use of VAD, we can obtain oranew the statistical attributes of noise demanded in speech signal enhancementtechnique which is based on distinguishing the inactive interval of speech signaland after that, we can better trace the change of speech signal and improve theenhancement of speech signal at last. The essence of VAD is subtracting theattributes of the speech signal which are different from the ones of noise. In general,the parameters of VAD includes short time energy,autocorrelation of speechsignals and short time zero crossing.After general presentation of the algorithms for speech signal enhancementand VAD, based on array signal processing technique and high order statistics, avoice activity detection algorithm is proposed, which transforms convolutive mixedsignals in time domain to the instantaneous mixed in frequency domain using ashort-time discrete Fourier transform. Then voice activity detection in accordancewith certain characteristics of HOS is achieved. The proposed algorithm is not assame as other VAD algorithms with multi-input. It can be applied in a complexnoise environment without a priori knowledge on the direction and location of thesource signal. Furthermore, it is simple and effective. Simulation resultsdemonstrate that the proposed algorithm possesses good performance withstationary noise at –10dB SNR and keeps its performance in non-stationary noise at0dB SNR. An improved speech enhancement algorithm based on signal subspace withmulti-input is presented in chapter four. Through simultaneous diagonalization ofthe overall covariance matrices of clean speech and noise signal observed bymicrophone array, clean speech signal subspace is estimated without anypreassumption on the stochastic property of noise signal. The proposed methoddoes not rely on any signal model and attains the optimal estimation of speechcorrupted by colored noise, which overcomes the disadvantage of original methodonly suitable for white noise case. Simulation results demonstrate that thealgorithm possesses good performance both in objective and subjective tests. Blind signal separation (BSS) is a hot research topic in the signal processingand has been applied to lot of different fields. It can recover the source signals bymeans of the sensor signals, though there is any transcendental information forneither source signal nor transmitting channel. This proceeding can be named asindependent component analysis (ICA), too. At present, the algorithms of ICA include: second order statistics blindidentification based temporal structure of sources; learning algorithms based highorder statistics and blind source separation based on information theory. The formertwo among the three algorithms are the classical ones which use the eigenvaluedecomposition(EVD)generalized eigenvalue decomposition (GEVD) and CentralLimit Theorem as its radical principles and accomplish the blind separation ofseveral source signal under the satisfied assumption in the processing. Thealgorithms based on information theory are so popular recently that most people...
Keywords/Search Tags:Speech Enhancement, Voice Activity Detection, Signal Subspace, Independent Component Analysis, Speech Blind Separation
PDF Full Text Request
Related items