Voiceprint Recognition Technology Research Based On GMM

Posted on:2013-01-03

Degree:Master

Type:Thesis

Country:China

Candidate:S Liu

Full Text:PDF

GTID:2248330374486628

Subject:Circuits and Systems

Abstract/Summary:

Speaker recognition technology, also known as voiceprint recognition, determines a person's identity from human biological characteristics. Speech is the most natural means of communication, and its advantages have led to its wide use in identification.

A variety of techniques exist for speaker modeling. The Gaussian mixture model (GMM), with its simplicity, good performance, and text-independent nature, is one of the most frequently used. This thesis describes the Gaussian model, its parameter estimation, and the recognition method. Because certain speech frames can affect the system's recognition rate during the recognition phase, we give a voting-based method. When a Gaussian mixture model is used for speaker recognition and the number of speakers is large, a large amount of computation is required. We combine the VQ method with the Gaussian mixture model, divide the models into two main parts, male and female, and use a dynamic time algorithm to calculate the distance between pitches, which reduces the recognition time.

At present, most work on speaker recognition technology is based on the Gaussian mixture model. To obtain a higher recognition rate, we choose better characteristic parameters of the speaker's voice and better recognition algorithms. The thesis also covers the characteristics of speaker recognition technology, feature extraction, modeling, and related stages.
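The voting idea mentioned above can be illustrated with a minimal sketch. The abstract does not specify the thesis's voting rule, so the scheme below (each frame casts one vote for the speaker model that gives it the highest likelihood) is an assumption, as are the function names and the diagonal-covariance model layout. It is contrasted with the conventional decision that sums per-frame log-likelihoods, where a single outlier frame can dominate the total.

```python
import numpy as np

def gmm_frame_loglik(frames, weights, means, variances):
    """Per-frame log-likelihood under a diagonal-covariance GMM.

    frames: (T, D); weights: (M,); means, variances: (M, D).
    """
    diff = frames[:, None, :] - means[None, :, :]                  # (T, M, D)
    log_norm = -0.5 * np.log(2.0 * np.pi * variances).sum(axis=1)  # (M,)
    log_comp = log_norm - 0.5 * (diff ** 2 / variances).sum(axis=2)
    log_comp = log_comp + np.log(weights)                          # (T, M)
    # Log-sum-exp over mixture components for numerical stability.
    peak = log_comp.max(axis=1, keepdims=True)
    return peak[:, 0] + np.log(np.exp(log_comp - peak).sum(axis=1))

def identify_by_sum(frames, models):
    """Conventional decision: largest total log-likelihood wins."""
    totals = [gmm_frame_loglik(frames, *m).sum() for m in models]
    return int(np.argmax(totals))

def identify_by_vote(frames, models):
    """Assumed voting rule: each frame votes for its best-scoring model."""
    ll = np.stack([gmm_frame_loglik(frames, *m) for m in models])  # (S, T)
    votes = np.bincount(ll.argmax(axis=0), minlength=len(models))
    return int(np.argmax(votes))
```

Here `models` is a list of `(weights, means, variances)` tuples, one per enrolled speaker. With voting, each frame contributes at most one vote, so a few badly matched frames cannot outweigh the majority.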
Recently, most speaker recognition methods have used MFCC features with a GMM model. In this thesis, another speech feature parameter, pitch, is added to counter imitation, against which MFCC alone is weak. Adding dynamic MFCC coefficients makes the feature vector more complex, so to shorten the recognition time we give a weighted MFCC based on each coefficient's contribution to the identification rate.

The experimental section forms the last part of this thesis. It examines the characteristic parameters, the order of the Gaussian mixture model, the weighted MFCC, and the recognition rate, and analyzes the experimental results. The results show that MFCC performs better than LPCC. When MFCC is combined with dynamic MFCC, the recognition rate increases noticeably. The weighted MFCC raises the recognition rate while reducing the computational complexity. Finally, we analyze the function of pitch and its effect on the recognition rate.
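As a rough illustration of the dynamic and weighted features discussed above, the sketch below computes delta (dynamic) coefficients with the commonly used regression formula and applies a per-coefficient weight vector. The abstract does not say how the thesis derives its weights or which delta window it uses, so the `contributions` input, the window size `N=2`, and both function names are hypothetical.

```python
import numpy as np

def delta(features, N=2):
    """Delta coefficients by linear regression over a window of +/- N frames.

    features: (T, D) array of static coefficients, one row per frame.
    Edge frames are handled by repeating the first and last rows.
    """
    T = features.shape[0]
    padded = np.pad(features, ((N, N), (0, 0)), mode="edge")
    denom = 2.0 * sum(n * n for n in range(1, N + 1))
    out = np.zeros_like(features, dtype=float)
    for n in range(1, N + 1):
        ahead = padded[N + n : N + n + T]    # frames t + n
        behind = padded[N - n : N - n + T]   # frames t - n
        out += n * (ahead - behind)
    return out / denom

def weight_coefficients(features, contributions):
    """Scale each coefficient by its normalized contribution score.

    contributions: hypothetical per-coefficient scores, e.g. measured from
    how much each coefficient helps the identification rate.
    """
    w = np.asarray(contributions, dtype=float)
    return features * (w / w.sum())
```

A combined static-plus-dynamic vector can then be formed with `np.hstack([mfcc, delta(mfcc)])`, which doubles the dimension per frame and shows why the added dynamic coefficients increase the computation.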

Keywords/Search Tags:

Speaker recognition, Voice print recognition, MFCC, GMM, Pitch

Related items

1	The Key Technology Research Of Voice-Print Recognition
2	Researches On Speech Feature Extraction And Implementation Of Speaker Recognition System
3	Multi-features For Speaker Recognition Based On DTW And GMM
4	Researches And Implementation On Speaker Recognition Algorithms And Systems
5	Research Of Speaker Recognition System Based On Mixed Feature Parameters And GMM-UBM
6	Research Of Speaker Recognition Algorithm
7	The Study Of Speaker Recognition System Based On MFCC
8	Study Of Extraction And Optimization Characteristic Parameters In Speaker Recognition
9	The Design Of A Speaker Recognition Software Based On Matlab
10	Study Of Speaker Recognition System Based On MFCC And GMM