Application Of Deep Learning In Speech Enhancement

Posted on:2019-06-23

Degree:Master

Type:Thesis

Country:China

Candidate:X G Shao

Full Text:PDF

GTID:2428330545992501

Subject:Master of Engineering

Abstract/Summary:

In recent years,with the rapid development of computer and digital signal processing technology,speech communication has gradually become an important way in human-computer interaction.However,all kinds of interference noise can cause problems of delay or wrong recognition of human-machine interaction.Therefore,it has practical significance to solve noise interference and enhance speech quality.The traditional enhancement method will remain part of the noise,especially the music noise.In addition,the traditional method has made some assumptions about the relationship and characteristics of signals,which is not in line with the situation of actual environment.In this paper,firstly,the basic knowledge and related theories of speech enhancement technology and deep learning are thoroughly studied,including the preprocessing of speech signal,feature extraction and training of deep learning model.Secondly,the traditional speech enhancement framework based on deep learning is given,which takes the features of the noisy speech signal as the input of the deep neural network,the characteristic of the pure speech signal as the output of the network,and uses the powerful feature extraction and mapping ability of the deep neural network to reproduce the pure speech signal.Secondly,the spectral subtraction is a more classic method in traditional speech enhancement algorithms.Its computation is small and easy to be realized,but its sound quality is not ideal,and there is a lot of music noise in the enhanced speech signal.In this paper,the traditional spectral subtraction and deep network are used to reduce the noise.By spectral subtraction,most of the noise doped in the speech signals can be removed easily.The residual noise,especially the music noise,is filtered out by the self coding of the stack.In the experiment,4 noises are randomly selected from the database to generate 6kinds of noisy speech signals with different signal-to-noise ratios.The experimental results show that the two methods can significantly reduce the noise in the speech signal,improve the speech perception,and have a certain generalization ability and adaptability for the unknown noisy environment.

Keywords/Search Tags:

Signal processing, speech enhancement, neural network, deep learning

Related items

1	Study On Speech Enhancement Algorithm Based On Statistical Model And Deep Neural Network
2	Extremely Low Signal-to-noise Ratio Speech Enhancement Method Based On Deep Learning
3	Research On Ultra-low Bit-rate Speech Coding And Speech Enhancement
4	Research On Deep Learning Based Monaural Speech Enhancement
5	Research On Speech Enhancement And Related Technologies In Low SNR Environments
6	Research On Deep Learning Based Speech Enhancement
7	Research On Deep Learning Based Speech Enhancement
8	Single Channel Speech Enhancement Based On Deep Learning
9	Research On Deep Learning Model For Speech Enhancement And Separation In Complex Auditory Scenes
10	Robust Speech Recognition Based On Deep Learning