Research On Voice Recognition In Security Monitoring

Posted on:2020-08-01

Degree:Master

Type:Thesis

Country:China

Candidate:L F Tai

Full Text:PDF

GTID:2428330590460202

Subject:Safety engineering

Abstract/Summary:

PDF Full Text Request

Human voice can well express their own emotions,so this paper studies how to use the emotions contained in human voice to monitor the safety of public places.The work is as follows:Considering that the public sound collected by the sound monitoring equipment will contain too much interference noise,the voice signal needs to be enhanced.In the traditional spectral subtraction method,the setting of the noise energy in the non-speech segment is the average energy of the noise,and the average energy is easily affected by the random impulse noise,which generally leads to the excessive average energy of the noise,and then leads to the loss of the speech signal spectrum in the spectral subtraction algorithm.In this paper,the traditional spectral subtraction speech enhancement algorithm is improved,and the noise energy ratio is introduced into the spectral subtraction algorithm.Experiments show that the improved spectral subtraction method not only reduces the risk of spectrum loss of speech signal,but also inhibits the noise in the speech signal.Considering that no research results have been made to clarify the emotions contained in public voices,this paper classifies the emotions contained in public voices into normal emotions,dangerous emotions,lively emotions and quiet and cheerless emotions.Collect emotional sounds from different scenes and build scene sound library.Combining the speech feature set and prosodic features of the international speech emotion recognition competition,four types of scene sound features are extracted.Considering the impact of penalty parameters and RBF kernel function width on the classification of c-svm model,this paper combines genetic algorithm and nonlinear programming to enhance the global and local search ability,so as to optimize the penalty parameters and kernel function width.Experimental results show that the random selection of penalty parameters and kernel width can only achieve 35% accuracy in the recognition of scene sound emotion.For the optimized selected parameters and,the recognition accuracy can reach 77%.Among them,the recognition rate of normal scene emotion and quiet scene emotion is the most stable and the highest,generally reaching 90%.The emotion of danger scene and the emotion of busy scene are easily confused,the recognition rate is between 60% and 80%,which affects the overall recognition rate of sound emotion in public places.

Keywords/Search Tags:

Sound Monitoring, Speech Enhancement, Voice Emotion, Genetic Algorithm, C-SVM

PDF Full Text Request

Related items

1	Research On The Speech Emotion Recognition Based On Voice Signal
2	Research On Speech Enhancement Algorithm And Realization Based On DSP
3	Research On Adaptive Speech Emotion Recogniton Method
4	Multi-layer SVM Speech Emotion Recognition Based On Genetic Optimization
5	Speech Enhancement, And Related Technologies
6	The Research On Speech Emotion Recognition Based On Contextual Position Enhancement And Weighted Space
7	Research Of Sound Localization And Speech Enhancement And Implementation Based On SOPC
8	The Research Of Speech Enhancement Algorithm And Implementation Based On DSP
9	Research On Key Techniques Of Speech Emotion Recognition
10	Research On Speech Emotion Recognition Technology Based On Deep Learning