Research On Speech Emotion Recognition Algorithm

Posted on:2020-11-06

Degree:Master

Type:Thesis

Country:China

Candidate:S Y Wang

Full Text:PDF

GTID:2428330590995893

Subject:Electronic and communication engineering

Abstract/Summary:

With the development of computer science,speech signal processing is widely used in all aspects of society.At present,speech emotion recognition technology has become the key of human-computer interaction system.To make human-computer interaction more convenient and humanized,researchers began to study the emotional signals of speech.By analyzing the emotions of the operators,the intelligent human-computer interaction system can be more active and accurate to achieve the requirements of the operators,and can timely adjust the form of the dialogue,make the communication more intelligent.The main work of this paper is as follows.(1)The speech emotion characteristic parameters were optimized.After the optimization and integration of the frequency cepstrum coefficient,MFCC is combined with prosodic features and sound quality features as the characteristic parameters of speech emotion recognition.The experimental results show that the new MFCC coefficients obtained from the mixture of MFCC,I-MFCC and Mid-MFCC have significantly improved the identification ability of the entire frequency band.(2)The speech emotion recognition algorithm based on F-MFCC parameters is proposed in this thesie.Fisher ratio criterion was used to combine MFCC and its derived parameters I-MFCC and Mid-MFCC to generate F-MFCC.After the generated F-MFCC is mixed with other feature parameters,the speech emotion recognition is performed by using different spectral-based feature parameters.The experimental results show that using F-MFCC as the characteristic parameter can further improve the recognition rate of the recognition model and reduce the dimension of the feature parameters to some extent.(3)A speech emotion recognition method based on the new decision model is proposed.The recognition results obtained by the BP neural network,the support vector machine and the K-nearest neighbor algorithm are passed through a voter,and the output of the voter is used as the recognition result.Experimental results show that the new decision model can reduce the probability of speech misjudgment and further improve the average recognition rate of the final speech emotion.

Keywords/Search Tags:

speech emotion recognition, feature fusion, MFCC, F-MFCC, decision model

Related items

1	The Research Of Fusion LPCC And MFCC Feature Parameters In Speech Recognition Technology
2	Speech Emotion Recognition Based On Features
3	Research On Speech Emotion Recognition Based On Multiple Feature Combination
4	The Research Of Robust Speech Recognition In Noise Environment Based On MFCC
5	Speech Emotion Recognition Based On Three-layer Model
6	Study On MFCC And Lasso Reverberation Suppression Of Feature Extraction Algorithm Of Speech Recognition
7	MFCC Feature Extraction Research Based On ICA And Its Implementation On DSP
8	Research On Speech Emotion Recognition Based On Feature And Decision Fusion
9	The Comparison And Analysis Of The Feature Extraction Algorithm Of Voiceprint Recognition System
10	Research On Speech Emotion Recognition Based On Multi-scale Feature Fusion And Decision Tree CNN