Research On Binaural Sound Source Localization Based On Convolutional Neural Network

Posted on:2021-03-02

Degree:Master

Type:Thesis

Country:China

Candidate:K Y Ma

Full Text:PDF

GTID:2492306476450864

Subject:Electronics and Communications Engineering

Abstract/Summary:

PDF Full Text Request

Most sound source localization(SSL)algorithms are based on multi-microphone arrays.However binaural sound source localization has advantages of compactness and low-cost.Traditional binaural SSL algorithms are susceptible and perform poorly in reverberation and noise environments.Convolutional Neural Network(CNN)modeling has high level of robustness,significantly improved the performance in complex environments.In order to cope with the challenges of complex environments,this thesis proposes two binaural SSL algorithms based on CNN: binaural SSL algorithm based on Densely Connected Convolutional Networks(DCCN)classifiers,and binaural SSL algorithm based on Soft-argmax regressor.(1)The binaural SSL algorithm based on DCCN classifier is developed from the traditional CNN models.This algorithm introduces a dense connection structure.The character of feature reuse makes the model have powerful fitting ability without huge parameter amount.And the 1 × 1 convolution in DCCN also effectively compresses the model size.Speech with noise and simulated reverberation is used for training.Speech with noise measured reverberation is used for testing.Through such experimental settings,the algorithm’s performance in unseen environment is verified.Experiments show that the localization accuracy of this algorithm is significantly better than traditional algorithms in complex environments.(2)The binaural SSL algorithm based on Soft-argmax regressor is developed from the neural network classifier.This algorithm introduces Soft-argmax regressor structure.The algorithm can transform any neural network classifier with Softmax into a regressor,which will improve the performance in bad environments.In experiments section,we transform the DCCN classifier into a Soft-argmax regressor.Experiments show that the root mean squared error(RMSE)of this algorithm is less than that of the classifier model and traditional algorithms in complex environments.

Keywords/Search Tags:

Binaural Sound Source Localization, Convolutional Neural Network, Dense Connection, Regression

PDF Full Text Request

Related items

1	Evaluation Research On Sound Quality Of Cooling Fan Based On Binaural Effect
2	Research On Hyperspectral Image Classification Based On 3D Convolutional Neural Networks And Attention Mechanism
3	Sound Field Fitting Method Of Spatial Point Sound Source Based On Multi-channel Standard Sound Source
4	Research On Detection Of Abnormal Sound Based On Smart Lighting Monitoring System
5	A Category-consistent Deep Network Learning For Accurate Vehicle Logo Recognition
6	GF-2 Remote Sensing Image Fusion Based On Convolutional Neural Network
7	Research On High-resolution Acoustic Source Localization Algorithm For Rotating Sound Source
8	Study On Semantic Segmentation Of Traffic Scene Images Based On Deep Neural Networks
9	Research On Ship Identification Method Based On Depth Convolution Neural Network
10	Research On Building Extraction From High-resolution Remote Sensing Images Based On Convolutional Neural Network