Abnormal Audio Detection Based On Deep Learning

Posted on:2020-08-26

Degree:Master

Type:Thesis

Country:China

Candidate:Z Q Jin

Full Text:PDF

GTID:2428330578976479

Subject:Control engineering

Abstract/Summary:

PDF Full Text Request

With the development of artificial intelligence technology,the application scenarios of deep neural networks are becoming more and more extensive,and the introduction of deep learning methods in audio data processing has become one of the research hotspots.Due to the monitoring of dead angles and object occlusion and bad weather in traditional video detection methods,video surveillance of abnormal events is prone to problems such as missed detection and false detection.In this thesis,the method of abnormal audio feature extraction is analyzed and the deep feedforward neural network is introduced into the abnormal audio detection system.The main work of this thesis is as follows:(1)The relevant thesis and literatures related to audio data processing and deep neural networks are reviewed,the research status and future development trends of related fields are analyzed and the research plan and content of this paper are clarified.(2)The audio data preprocessing method is studied.The audio data is collected from the audio acquisition hardware,and the analog signal is converted into a digital signal,and the digital signal is pre-emphasized by the audio data,and the audio noise reduction algorithms such as wavelet denoising method,homomorphic filtering method and Wiener filtering method are compared and analyzed.After the audio data is subjected to noise reduction processing,endpoint detection is performed on the continuous noise reduction frequency data,and the effective audio segment is separated from the continuous audio signal.(3)The audio data feature extraction method is studied.In the process of audio feature extraction,the pre-processed effective audio segment is first framed and windowed to have stable short-term features,and then the time domain characteristics of the audio data and the frequency of the audio data are compared and analyzed.The advantages and disadvantages of domain features and methods of feature extraction using artificial neural networks are compared and analyzed.After comparative analysis,this thesis uses the frequency domain features of audio data to extract the features of the audio signal.After the audio signal data is framed and windowed,the corresponding Mel frequency Cepstral coefficients are calculated for each audio signal frame,which has the advantages of low computational complexity and good audio data feature extraction,and classifies the extracted low latitude features to obtain the final classification result.(4)The method of classification of abnormal audio data is studied.The deep feed forward neural network is used as the classifier in this thesis.The basic unit and architecture of the deep feedforward neural network are analyzed in detail.The advantages and disadvantages between different activation functions,loss functions and optimization algorithms are compared and analyzed.Two deep feedforward neural network model architectures are designed in this thesis,,which are applied to high-performance servers with strong computational power and low-power embedded devices.Compared with the traditional algorithm,the results show that the proposed algorithm has a good classification effect.

Keywords/Search Tags:

Audio classification, Mel frequency cepstral coefficient, Audio noise reduction, Deep neural network

PDF Full Text Request

Related items

1	Reasearch Of Deep Learning In Audio Signal Processing
2	Research On Audio Scene Recognition Based On Deep Learning
3	Research Of Audio Alassification Algorithms Based On Convolutional Neural Network And Its Applications
4	Audio Classification Event Identification Based On SVM Technology
5	Audio Scene Classification Based On Deep Learning
6	Design Of The Self-adaptive Audio Noise Reduction System Based On FPGA
7	Research On Audio Signal Classification Based On Tensor Decomposition
8	Automatic Audio Tagging Method In Complex Scene
9	Nonparametric regression using wavelet transformations and noise reduction in audio signals
10	Research On Acoustic Feature Analysis In Audio Retrieval