Research On I-vector Based Speaker Normalization For Speech Recognition

Posted on: 2015-10-21

Degree: Master

Type: Thesis

Country: China

Candidate: Y Q Li

Full Text: PDF

GTID: 2298330431992083

Subject: Communications and signal systems

Abstract/Summary:

The primary purpose of speaker normalization in speech recognition is to reduce random differences between speakers and to filter out personal characteristics, so that the information that carries linguistic meaning is retained. A secondary effect is to eliminate differences caused by the speaking style at recording time (formal, tense, etc.).

The i-vector is a modeling approach from speaker recognition that has proven effective and become popular in recent years. It captures the individual differences between speakers well, and this property has shown its effectiveness in both speaker identification and speaker verification. These differences can be used to cluster speakers for speech recognition. Performing speaker normalization according to the resulting cluster information should then yield better speech recognition results.

Based on these ideas, this thesis applies i-vectors to speaker normalization of the acoustic features used in speech recognition. First, i-vectors are extracted from the training speech data and clustered with the unsupervised LBG algorithm; the two clusters obtained without supervision reflect the gender characteristics of male and female speakers. Then, for each cluster, a linear transform is estimated by maximum likelihood, which realizes speaker normalization through speaker adaptive training. The transformed speech features are used for training and recognition, and the experimental results show that the method improves speech recognition performance.
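The clustering step can be illustrated with a minimal sketch of LBG (Linde-Buzo-Gray) clustering. It is not the thesis's implementation: the function names (`lbg`, `mean_vector`, `sq_dist`, `nearest`), the additive split by `eps`, the convergence tolerance, and the synthetic 4-dimensional vectors standing in for i-vectors are all assumptions made for illustration. Real i-vector extraction and the per-cluster maximum likelihood linear transform are not shown.

```python
import random


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(vec, codebook):
    """Index of the codebook entry closest to vec."""
    return min(range(len(codebook)), key=lambda k: sq_dist(vec, codebook[k]))


def lbg(vectors, n_clusters=2, eps=0.01, tol=1e-6, max_iter=100):
    """Cluster vectors with LBG; n_clusters is assumed to be a power of two.

    Returns (codebook, labels), where labels[i] is the cluster of vectors[i].
    """
    # Start from a single centroid: the global mean of all vectors.
    codebook = [mean_vector(vectors)]
    while len(codebook) < n_clusters:
        # Splitting step: replace each centroid by two slightly shifted
        # copies (additive perturbation, one common variant).
        codebook = [[c + sign * eps for c in centroid]
                    for centroid in codebook for sign in (1, -1)]
        # Refinement step: Lloyd iterations until the mean distortion
        # stops improving by more than a relative tolerance.
        prev = float("inf")
        for _ in range(max_iter):
            labels = [nearest(v, codebook) for v in vectors]
            distortion = sum(sq_dist(v, codebook[k])
                             for v, k in zip(vectors, labels)) / len(vectors)
            for k in range(len(codebook)):
                members = [v for v, lab in zip(vectors, labels) if lab == k]
                if members:  # keep the old centroid if a cell is empty
                    codebook[k] = mean_vector(members)
            if prev - distortion <= tol * max(distortion, 1e-12):
                break
            prev = distortion
    labels = [nearest(v, codebook) for v in vectors]
    return codebook, labels


if __name__ == "__main__":
    # Synthetic stand-ins for i-vectors: two groups around +1 and -1.
    rng = random.Random(0)
    group_a = [[rng.gauss(1.0, 0.3) for _ in range(4)] for _ in range(50)]
    group_b = [[rng.gauss(-1.0, 0.3) for _ in range(4)] for _ in range(50)]
    codebook, labels = lbg(group_a + group_b, n_clusters=2)
    print("cluster sizes:", labels.count(0), labels.count(1))
```

In a pipeline like the one described above, the returned labels would decide which cluster's linear transform is applied to each speaker's acoustic features.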

Keywords/Search Tags:

Speech Recognition, i-vector, Maximum likelihood linear transforms, LBG algorithm

Related items

1	Research On Way Of Speaking Reliability In Voiceprint Recognition
2	Linear transforms in automatic speech recognition: Estimation procedures and integration of diverse acoustic data
3	Environment Compensation For Speech Recognition
4	Maximum Likelihood Identification Methods
5	Research On Signer Adaptation In Chinese Sign Language Recognition
6	Research On Discriminative Training In Speech Recognition
7	The Study Of CT Statistical Reconstruction Algorithm Based On Maximum Likelihood And Penalized Likelihood Estimates
8	Compensation Methods Of Different Speech Coding For Speaker Recognition
9	Research On Strategies Against Abnormal Speech In Voiceprint Recognition System
10	Research Of Parameter Estimation Based On Cos Method