Acoustic Scene Classification And Sound Event Detection Based On Deep Learning

Posted on:2020-12-06

Degree:Master

Type:Thesis

Country:China

Candidate:X K Li

Full Text:PDF

GTID:2428330590484517

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

Acoustic scene classification(ASC)and sound event detection(SED)are key technologies in many fields,such as multimedia analysis and retrieval,audio monitoring,intelligent driving.In this thesis,the ASC and SED methods based on deep learning are investigated.The main work and contributions of this thesis are as follows:(1)A method for ASC based on audio feature augmentation(AFA)is proposed.This thesis mainly discusses the impacts of audio feature augmentation on the performance of ASC.Specifically,the similarities and differences of the features of two channels are obtained by performing mean and difference operations on the features of two channels audio data.Harmonic percussive source separation(HPSS)is applied to the audio spectrogram to obtain augmented audio features.When evaluated on the databases of DCASE2016 and DCASE2017,the accuracies obtained by the method based on audio feature augmentation are85.8% and 69.9% respectively,and are higher than that obtained by the methods without data augmentation.In addition,the proposed method is compared with other acoustic scene classification methods,and is better than other methods.(2)A method for detecting abnormal sound event of road based on deep feature fusion is proposed.First,the deep autoencoder network(DAN)is used to transform three shallow features,i.e.,mel frequency cepstrum coefficient(MFCC),Bark filter bank(BFB)and Gabor filter bank(GFB),to deep features.Next,the three deep features above are concatenated and then transformed to a new deep feature by a DAN.Finally,the new deep feature is fed into a long short term memory network(LSTM)for decision of ASC.The experimental results show that the proposed feature obtains detection accuracy of 92.15% and F score of 91.32%without adding background noises to the audio signal,and outperforms other features for abnormal sound event detection.After adding background noises to the audio signal for obtaining different signal-to-noise ratios(SNRs),such as 20 dB,10 dB,0 dB and-10 dB,the proposed deep feature obviously outperforms other features,and has a strong anti-noise ability.In summary,this thesis proposes a ASC method based on audio feature agumentationand a method of road abnormal sound event based on deep feature fusion.The performance of the proposed methods is analyzed by experiments from various aspects.The reliability and effectiveness of the proposed method are verified under various experimental conditions.

Keywords/Search Tags:

Deep learning, Acoustic scene classification, Sound event detection

PDF Full Text Request

Related items

1	Research On Acoustic Scene Classification Based On Convolutional Neural Network
2	Content Analysis For Natural Acoustic Scene Based On Deep Neural Network
3	Sound Detection,classification And Localization Under Noise Conditions
4	Research On Acoustic Scene Detection Based On Deep Learning
5	Research On Sound Scene Classification Based On Deep Learning
6	The Research Of Sound Event Classification And Detection On Semi-supervised Learning Method
7	A Study On Acoustic Scene Classification By Ensembling Multiple Deep Models
8	Research On Acoustic Scene Classification Using Deep Learning
9	Research On Acoustic Scene Classification
10	Research On Polyphonic Sound Event Detection With Deep Neural Network