Acoustic scene classification (ASC) aims to perceive and understand the surrounding environment by recognizing the semantic tags of a specific scene through analysis of the complex characteristics extracted from audio signals, ultimately classifying specific sound scenes. Computer-based ASC is of great significance for home automation, self-driving cars, speech recognition in complex scenes, and audio monitoring systems. However, acoustic scenes often contain a large amount of interference from non-stationary, wide-band, erratic abnormal sound signals, or from the superposition of multiple sound sources, which increases the difficulty of research in this field. To address these problems, this paper proposes a Mix-up data augmentation method based on sound pressure level, together with an acoustic scene classification method based on an attention mechanism and a multi-scale feature-fusion model. The main contributions of this paper are as follows:

(1) Extraction of multi-channel Mel energy spectrum features. This paper constructs a multi-channel Mel spectrogram feature map by concatenating the Mel energy spectrum features of the harmonic source, the percussive source, and the multi-channel fused signal, and uses it as the input feature of the proposed model.

(2) Audio data augmentation. This paper proposes a new data augmentation method. Considering the squared relationship between sound energy and amplitude, and the insensitivity of human hearing to low and high frequencies, the A-weighting and Mix-up methods are combined to mix two sound features and generate new ones.

(3) A multi-scale feature-fusion module. Built on VGG convolutional blocks, the model first takes the multi-channel Mel energy spectrum features as input; multi-scale features are then extracted by up-sampling and down-sampling followed by lateral connections, and the fused features are finally used as the input of the attention module.

(4) A new attention mechanism module. First, weights are assigned to the multi-scale fused features to obtain a probability distribution map; this map is then multiplied element-wise with the original feature map to obtain a probability feature map; the original feature map is normalized and added to the probability feature map to obtain the output of the attention module; finally, the resulting feature is fed into a Softmax classifier.

Experiments are conducted on the DCASE2019 acoustic scene development dataset and the LITIS Rouen acoustic scene dataset. Experimental results show that the recognition accuracy of the proposed method is 13.1% higher than the baseline average, demonstrating good recognition and classification performance.
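The multi-channel feature construction in contribution (1) can be sketched as follows. This is a minimal illustration, not the thesis code: it uses median-filtering harmonic/percussive separation (Fitzgerald-style HPSS) on a power spectrogram, and a crude frequency-band averaging as a stand-in for a true Mel filterbank; all function names, window sizes, and band counts are illustrative assumptions.

```python
import numpy as np
from scipy.ndimage import median_filter
from scipy.signal import stft

def hpss(power_spec, kernel=17):
    """Median-filtering HPSS: harmonic energy is smooth along time,
    percussive energy is smooth along frequency (illustrative kernel size)."""
    harm = median_filter(power_spec, size=(1, kernel))   # smooth across time
    perc = median_filter(power_spec, size=(kernel, 1))   # smooth across frequency
    total = harm + perc + 1e-10
    return power_spec * harm / total, power_spec * perc / total

def band_log_energies(power_spec, n_bands=64):
    """Stand-in for a Mel filterbank (assumption): average frequency bins
    into n_bands groups and take log energies."""
    bins = np.array_split(power_spec, n_bands, axis=0)
    return np.log(np.stack([b.mean(axis=0) for b in bins]) + 1e-10)

def multichannel_feature(signal, sr=22050, n_bands=64):
    """Stack harmonic, percussive, and original log-band energies
    into a 3-channel feature map, as described in contribution (1)."""
    _, _, Z = stft(signal, fs=sr, nperseg=1024, noverlap=512)
    P = np.abs(Z) ** 2
    H, Pc = hpss(P)
    return np.stack([band_log_energies(x, n_bands) for x in (H, Pc, P)])
```

In a real pipeline the band pooling would be replaced by an actual Mel filterbank (e.g. `librosa.feature.melspectrogram`), and the three channels would feed the convolutional front end directly.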
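One plausible reading of the sound-pressure-level Mix-up in contribution (2) is sketched below, under stated assumptions: A-weighting gains (standard IEC 61672 curve) are applied per frequency bin, and mixing is done in the power domain because energy goes with amplitude squared. The function names, the Beta-distribution mixing coefficient, and the exact placement of the weighting are illustrative, not the thesis implementation.

```python
import numpy as np

def a_weighting_db(f):
    """Standard A-weighting curve in dB for frequencies f in Hz (f > 0)."""
    f = np.asarray(f, dtype=float)
    num = (12194.0 ** 2) * f ** 4
    den = ((f ** 2 + 20.6 ** 2)
           * np.sqrt((f ** 2 + 107.7 ** 2) * (f ** 2 + 737.9 ** 2))
           * (f ** 2 + 12194.0 ** 2))
    return 20.0 * np.log10(num / den) + 2.0   # ~0 dB at 1 kHz

def spl_mixup(p1, y1, p2, y2, freqs, alpha=0.2, rng=None):
    """Mix two power spectrograms (freq x time) and their label vectors.

    Each bin is scaled by its A-weighted power gain before mixing, so the
    mix reflects perceived loudness rather than raw energy (assumption)."""
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)                          # Mix-up coefficient
    w = (10.0 ** (a_weighting_db(freqs) / 10.0))[:, None]  # power-domain gains
    p_mix = lam * (w * p1) + (1.0 - lam) * (w * p2)        # mix energies
    y_mix = lam * y1 + (1.0 - lam) * y2                    # mix labels
    return p_mix, y_mix, lam
```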
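The up-sampling/down-sampling with lateral connections in contribution (3) can be sketched with a small FPN-style top-down pathway. This is a stand-in for the thesis module: average pooling replaces the strided VGG blocks, nearest-neighbour repetition replaces learned up-sampling, and the lateral connections are plain additions.

```python
import numpy as np

def avg_pool2(x):
    """2x down-sampling by 2x2 average pooling (H and W assumed even)."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample2(x):
    """2x nearest-neighbour up-sampling."""
    return np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)

def multiscale_fuse(feat):
    """Build two coarser scales by pooling, then up-sample each and add it
    back to the next finer level (lateral connections), returning a fused
    map at the input resolution."""
    coarse = avg_pool2(feat)
    coarser = avg_pool2(coarse)
    merged_coarse = coarse + upsample2(coarser)    # lateral connection
    return feat + upsample2(merged_coarse)         # lateral connection
```

In the real model each scale would pass through convolutional blocks before fusion; only the wiring of the pathway is illustrated here.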
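The four steps of the attention module in contribution (4) can be sketched numerically as below. The score map `w` stands in for the learned weights (in the real model these would come from a convolutional layer), and standardization is assumed for the "normalize" step; both are labeled assumptions.

```python
import numpy as np

def softmax(z, axis=None):
    z = z - np.max(z, axis=axis, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / np.sum(e, axis=axis, keepdims=True)

def attention_block(feat, w):
    """Sketch of the described attention module on a 2-D feature map.

    feat : (H, W) feature map; w : (H, W) score map (assumption: learned).
    """
    # Step 1: assign weights -> probability distribution map over positions
    prob = softmax((feat * w).ravel()).reshape(feat.shape)
    # Step 2: element-wise product with the original map -> probability feature map
    prob_feat = prob * feat
    # Step 3: normalize the original map (assumption: standardization) and add
    norm = (feat - feat.mean()) / (feat.std() + 1e-8)
    return norm + prob_feat

def classify(att_out, W_cls, b_cls):
    """Step 4: global-average pool, then a Softmax classifier (illustrative)."""
    v = np.array([att_out.mean()])
    return softmax(v @ W_cls + b_cls)
```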