An End-to-end Bone-conducted Speech Enhancement Method Based On Generative Adversarial Networks

Posted on:2020-02-06

Degree:Master

Type:Thesis

Country:China

Candidate:H Z Hu

GTID:2518306518966929

Subject:Software engineering

Abstract/Summary:

With the rapid development of mobile communications, voice communication in noisy environments has become a pressing problem, and removing the impact of background noise on voice communication has attracted widespread attention. Bone-conducted speech offers an alternative approach to noise robustness. Bone-conducted speech is transmitted as vibration through the human body and is picked up by highly sensitive sensors. Because of this transmission path, bone-conducted speech is not disturbed by airborne noise, which eliminates the effect of noise to a certain extent. However, the human body and air conduct sound differently, so there are acoustic differences between bone-conducted and air-conducted speech. The high-frequency information of bone-conducted speech is severely degraded, so it sounds poor and has low intelligibility, which seriously limits its use in noise-robust applications. To address this problem, we propose an end-to-end bone-conducted speech enhancement method based on a generative adversarial network. The network takes bone-conducted speech samples as input and outputs enhanced speech samples. This end-to-end enhancement model makes better use of the internal information of the speech signal and removes the complex feature extraction and speech synthesis steps. The generator adopts a convolutional encoder-decoder architecture. Multiple dilated convolutions extract features from the encoder output at different scales, and these features are then fused to obtain a stronger representation. To further improve the enhancement results, the network loss function was modified, and adversarial training improved both the learning ability of the network and its ability to enhance bone-conducted speech. The experimental results show that the method achieves higher perceptual speech quality and intelligibility than other enhancement algorithms. The ASR recognition rate is also used as an evaluation metric, and its improvement further confirms the effectiveness of the algorithm.
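
The multi-scale step described in the abstract (dilated convolutions applied to the encoder output at several scales, then fused) can be illustrated with a minimal pure-Python sketch. This is not the thesis implementation: the function names, the shared kernel, the dilation rates (1, 2, 4), and fusion by averaging are all assumptions chosen only to keep the example small. The abstract does not state how the branches are fused or how many are used.

```python
from typing import List, Sequence


def dilated_conv1d(signal: Sequence[float], kernel: Sequence[float],
                   dilation: int) -> List[float]:
    """1-D dilated convolution (cross-correlation form) with zero padding.

    The output has the same length as the input. Hypothetical helper,
    not taken from the thesis.
    """
    # Offset that centres the dilated kernel on each output sample.
    half = (len(kernel) - 1) * dilation // 2
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = t - half + j * dilation
            if 0 <= idx < len(signal):  # samples outside the signal count as zero
                acc += w * signal[idx]
        out.append(acc)
    return out


def multi_scale_fuse(features: Sequence[float], kernel: Sequence[float],
                     dilations: Sequence[int] = (1, 2, 4)) -> List[float]:
    """Run one kernel at several dilation rates and fuse the branches.

    Assumption: branches are fused by element-wise averaging; a real model
    would more likely learn separate kernels per branch and concatenate
    or sum the resulting channels.
    """
    branches = [dilated_conv1d(features, kernel, d) for d in dilations]
    return [sum(vals) / len(branches) for vals in zip(*branches)]


if __name__ == "__main__":
    encoded = [1.0, 2.0, 3.0, 4.0, 5.0]   # stand-in for encoder output
    print(dilated_conv1d(encoded, [1.0, 0.0, 1.0], dilation=2))
    print(multi_scale_fuse(encoded, [0.25, 0.5, 0.25]))
```

A larger dilation widens the span of input samples each output sees without adding kernel weights, which is why combining several rates gives features at different time scales.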

Keywords/Search Tags:

Bone conduction speech, Speech enhancement, Generative Adversarial Network (GAN), End-to-end model

Related items

1	Research On Bone-conducted Speech Enhancement Based On Generative Adversarial Network
2	Bone Conduction Combined With Air Conduction To Structure Speech Enhancement System
3	Speech Enhancement Based On Linear Prediction And Generative Adversarial Network For Bone-conducted Speech
4	Research On Single-Channel Speech Enhancement Based On Generative Adversarial Network
5	Speech Enhancement Algorithm Based On Generative Adversarial Network
6	Research On Auto-encoders And Generative Adversarial Network Based Speech Enhancement
7	Single Channel Speech Enhancement Based On Generative Adversarial Networks
8	Research On Speech Enhancement Model Based On Improved Generative Adversarial Networks
9	Research On Speech Recognition Technology For The Elderly Living Alone
10	Research On Speech Enhancement Method Based On Generative Adversarial Networks