Design And Implementation Of Sound Spectrogram Recognition System Based On Convolutional Neural Network

Posted on:2023-09-19

Degree:Master

Type:Thesis

Country:China

Candidate:Q Wu

Full Text:PDF

GTID:2568306914482504

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

In recent years,with the rapid development of various fields of machine learning,modern society has higher and higher requirements for artificial intelligence algorithms.As an important technology in the field of machine learning,voice recognition plays a key role in all aspects of the industry.Based on the techniques of data enhancement and model fusion,this paper proposes an algorithm for human voice and music detection in the context of different forms of sound spectrogram analysis tasks.In this paper,in data processing,the frequency domain and time domain information in the spectrogram converted from audio is randomly set to zero,which realizes the enhancement of data,improves the generalization ability of the model to unknown data,and prevents overfitting.In terms of network structure,the combination of convolutional neural network and gate cyclic unit is used to build a basic network suitable for processing the local features of sound spectrogram.Because of the existence of cyclic neural network,the model can also capture audio data.timing information to make better judgments.Based on the understanding of machine learning and the application of various technologies,this paper optimizes the model from the perspectives of activation function and optimizer.Through the established evaluation criteria and loss function,it aims to solve common machine learning problems such as overfitting and gradient disappearance.The model was evaluated,and finally a complete system with support for spectrogram generation,display and recognition was built,and the audio event detection task of classifying human and musical sounds in different scenarios was realized.

Keywords/Search Tags:

audio event detection, time and frequency domain zeroing, convolutional neural network, gated recurrent unit

PDF Full Text Request

Related items

1	Research On Deep Network Model Based On Sound Event Location And Detection
2	Research On Event Trigger Word Extraction Based On Convolutional Bidirectional Gated Recurrent Unit
3	Audio Scene Recognition Based On Deep Neural Network Of Multiple Optimization Mechanisms
4	Research On Emotional Tendency Classification Based On Online Video Website Reviews
5	Research On Image Description Method Based On Multimodal Recurrent Neural Networks
6	Research And Application Of Log Anomaly Detection Method Based On Gated Recurrent Unit Network
7	Research Of Audio Alassification Algorithms Based On Convolutional Neural Network And Its Applications
8	Research On SDN Anomaly Detection Based On Deep Learning
9	Research On Key Technologies Of Acoustic Event Detection In Audio Monitoring System
10	Research On Network Intrusion Detection Based On CNN-GRU And ResNet