Multi-layer SVM Speech Emotion Recognition Based On Genetic Optimization

Posted on:2018-09-07

Degree:Master

Type:Thesis

Country:China

Candidate:L F Tan

Full Text:PDF

GTID:2348330518983332

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

Language is the main medium of human communication.It not only contains abundant semantic information,but also carries a wealth of emotional information.It is of great significance to study how to make the computer identify the speaker's emotional state from the speech signal,so as to realize the natural human computer interaction.Aiming at the problem of slow recognition speed and low recognition accuracy in speech emotion recognition,in this paper,we proposes a scheme of classifying multi class emotion by constructing a multi-layer SVM with binary tree structure.And we use the genetic algorithm to reduce the dimension of the feature,so as to further improve the recognition performance of the speech emotion recognition system.In this paper,we first extract the speech energy,pitch period,formant and MFCC after the speech signal is processed that incluing pre-emphasi,windowing and framing,endpoint detection.Then,the hierarchical SVM with binary tree structure is constructed for multi class emotion classification.This model first divides the easily distinguishable emotions,and then classifies the easily confused emotions.The classification of multi class emotion is realized by layer by layer.The experiment was carried out on the Berlin emotional corpus containing 7 emotions.The results show that the model has the advantages of high classification efficiency of SVM,and also has the characteristics of high calculation efficiency of binary tree structure.Different features have different ability to distinguish emotions.And it is easy to lead to overfitting in modeling when the feature dimension is too high,which leads to long modeling time and low recognition accuracy.Therefore,we can optimize the extracted emotional features and then train the classification model to further optimize the multi-level SVM with binary tree structure.In this paper,we use the genetic algorithm to reduce the feature dimension,which is to select the key features from the extracted features.This method is an adaptive global optimal search method,and does not change the value of the selected feature,therefore,a better model can be constructed.The experiment is also carried out on the Berlin emotional corpus.The results show that the use of dimensionality reduction emotional features to the classification model training,can effectively improve the recognition rate of the system.Deep belief network as a deep learning structure,it is characterized first by the use of greedy unsupervised learning to train the network layer by layer,so as to get a good starting point,and then the use of supervised learning to optimize the whole deep structure.The network has strong capability of data abstraction and classification ability.Therefore,in the end of this paper,we use the deep belief network for speech emotion recognition experiment,and get a effective classification results on the same data set.

Keywords/Search Tags:

natural human-computer interaction, speech emotion recognition, support vector machine(SVM), genetic algorithm(GA), deep belief network(DBN)

PDF Full Text Request

Related items

1	Research Of Emotion Recognition Based On SVM And DBN
2	The Research Of Emotion Recognition Integrating Text And Speech Features Based On Deep Learning
3	Design And Implementation Of MCI Human-computer Interaction System Based On Speech Recognition And 3D Emotion Expression
4	Neural Network-based Chinese Speech Emotion Recognition
5	Research And Implementation Of Speech Emotion Recognition Algorithm Based On Fusion
6	Speech Emotion Recognition Based On Deep Learning
7	Design Of Speech Recognition Algorithm For Human Computer Interaction In Machine Operation Environment
8	The Research On Speech Emotion Recognition Algorithm Based On BP Neural Network
9	Speech Emotion Recognition Based On Extended Deep Belief Network
10	Research On Speech Emotion Recognition Based On The Fusion Of ANN And GMM