Reasearch Of Deep Learning In Audio Signal Processing

Posted on:2021-05-15

Degree:Master

Type:Thesis

Country:China

Candidate:Y M Xue

Full Text:PDF

GTID:2428330614963914

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Signal Denoising has always been a tough task in signal processing.Most of the image processing methods before are based on SIFT,HOG and other manual pre-extraction features.Since the values of image data are always in range of 0-255 and based on matrix computation,it is more suitable for convolution operation.But audio signals are mostly one-dimensional,whose processing method is so different from the former.In the past,audio signal processing was often performed manually by means of third-party software.It was a tough task to deal and might do harm to the operator's hearing.Following the fast development of deep learning theories and hardware,complex model structures can be effectively used for the computation of signal processing.This project is both the final viva of Queen Mary University of London and Yamaha Music Co.Japan.As a member of Yamaha Music Co's summer international internship,aims to design a series of high-performance deep learning algorithms to realize the detection and reduction process in audio signal processing.In order to solve the noise detection and reduction problems in audio signal processing,aims to propose an algorithm module,then merge the module into a auto-noise reduction system.The whole module can be divided into two parts.The first is designed for the audio noise detection.I designed two algorithms for this,the CR-DNN model and Dense-DNN model.The second is generative model used to generate the clean audio file based on Generative Adversarial Networks which I called it Denoise-GAN.The performance measure of the former is accuracy,another is using both method of noise-evaluation like PESQ and Nash equilibrium in loss function.Not only using the traditional audio feature extraction methods,but also un-structured data ways like str-vectorization and word-vectorization during the generating of dataset provided by Yamaha.The audio dataset has been devided into training,testing and development dataset for different use.The experimental results show that deep learning can greatly improve the efficiency of noise detection and the Dense-DNN achieves more than 97% in accuracy.At the same time,the Denoise-GAN model can effectively enhance the clean audio portion of the original audio and greatly reduce noise interference.The model is currently used by Yamaha Music since Sep.2019.

Keywords/Search Tags:

Deep Learning, Generative Adversarial Networks, Audio Noise Detection, Audio Noise Reduction

PDF Full Text Request

Related items

1	Abnormal Audio Detection Based On Deep Learning
2	Research And Application Of Audio Keyword Recognition Technology Based On Generative Adversarial Network
3	Design Of The Self-adaptive Audio Noise Reduction System Based On FPGA
4	Adversarial Sample Generation Against Audio Forensics
5	Nonparametric regression using wavelet transformations and noise reduction in audio signals
6	Design And Research Of Audio CAPTCHA Based On Adversarial Attack
7	Research On Digital Audio Copy-move And Splicing Detection
8	Single Channel Speech Enhancement Based On Generative Adversarial Networks
9	Audio Defect Analysis And Test Method Improvement Based On Audio Process Principle Of Handset
10	Weakly Supervised Learning For Audio Analysis