Speaker Recognition With Emotional Speech

Posted on:2020-01-28

Degree:Master

Type:Thesis

Country:China

Candidate:Ahmad Faraz Hussain

Full Text:PDF

GTID:2428330590961609

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

An integrative technology,Speaker recognition manipulates vocal features of speakers to infer information about their identifications.It is the biometric branch that is used for identification,verification and categorization of particular speakers,with the ability of detection,tracing and partition by extension.Speaker recognition is the only biometry that is simply checked(verified/identified)remote via the existing infrastructures i.e.Mobile network and phone network.This makes recognition of speakers very important and with the increasing number and complexity of cellular(mobile)telephones,recognition of speakers will become more popular in the future.Speaker recognition can be potentially applied to many applications like access control,transaction authorization over mobile phone and identification of forensic suspect by his/her voice.Other biometrics require special acquisition hardware but speaker recognition needs only a microphone.Despite the fact that speaker recognition research has been ongoing for extra than four decades,the performance of speaker recognition is effected by person health,age,background noise and the speaker emotional state.So as to build an emotional speaker recognition system,this paper uses Kaldi GMM-I-vector Toolkit to design an emotional speaker recognition that is tested in clear and noisy environments.The main work and contribution of this article are the following:1.In recognition of speaker's field,I-Vector has being proved to be very efficient because of it fixed length and low dimensional feature vector.I-Vector approach will be used for emotional speaker recognition on text-dependent database in clear and noisy environments.The databases contains six different emotions like sad,angry,fear,happy,neutral and disgust.Kaldi offer CMVNs(cepstral mean variance normalization),use to better normalization of MFCC features.Whereas for the testing and training system,the Gaussian Mixture Models are used.2.For channel/session compensation,linear discriminant analysis(LDA),probabilistic linear discriminant analysis(PLDA)and within � class Covariance Normalization(WCCN)are proposed.EER is used for performance evaluation.

Keywords/Search Tags:

Speaker recognition, MFCC, GMM, I-Vector, PLDA

PDF Full Text Request

Related items

1	Speaker Recognition With Emotional Speech
2	Research On Speaker Recognition Over Short Utterance And Varying Channels
3	Research On Algorithms For Speaker Recognition
4	Research On Robustness Of Speaker Recognition In Noisy Environment
5	Research On Speaker Recognition In Noisy Environment
6	Study On Speaker Recognition Technology
7	Research On Speaker Recognition Based On Vector Quantization (VQ)
8	Research And Implementation Of Speaker Recognition System Based On VQ And HMM
9	The Research Of Speaker Recognition Algorithms Based On MFC And Vector Quantization
10	I-vector Normalized Method Based Probabilistic Linear Discrimination Analysis For Speaker Verification Research