Research On Robustness Of Speaker Verification Using Gaussian Mixture Models And System Implementation

Posted on:2009-07-03

Degree:Master

Type:Thesis

Country:China

Candidate:J Zhao

Full Text:PDF

GTID:2178360272474572

Subject:Instrument Science and Technology

Abstract/Summary:

Prevailing speaker recognition systems can obtain very high accuracy for clean speech, but their performance will degrade rapidly in noisy environments owing to the mismatch between the acoustic models and the testing speech. Therefore, noise robust technology is a crucial problem for the application of speaker recognition system in real life. The speech databases are mainly monolingual, and the performance of speaker verification system degrades dramatically because of the mismatch between mandarin and Sichuan dialect in training and test stages. Researches on Mel-Frequency Cepstrum Coefficients (MFCC) and Gaussian Mixture Models (GMM) have been made. Then based on this and the application of speech recognition on voice controlled system, the main contribution of our work in models training aiming at its stability, veracity and robustness are as following:1. In traditional model compensating methods, the white Gaussian noise is employed to simulate environmental noises to improve the robustness in the noise environment. In this paper, multi-noise multi-SNR is employed to train GMM. Experiments result shows these models are more robust than white Gaussian noise-added model.2. Due to the mismatch between mandarin and Sichuan dialect in training and test stages, the performance of speaker verification system degrades dramatically. To solve this problem, a combined Gaussian mixture model, which is trained by proportional pooling mandarin and Sichuan dialect, is presented in this paper. Experiments demonstrate that the introduced combined Gaussian mixture model is more robust for speech mismatching between mandarin and Sichuan dialect.3. A Voice Controlled Smart real time system is researched and developed using speaker verification and isolated words recognition. The speech recognition system tested by a small number of users, with high accuracy and good real-time performance, can basically fulfill the requirement of the application in office and home noise conditions with a small number of users.

Keywords/Search Tags:

speaker verification, Gaussian mixture models, text-independent, Mel-Frequency Cepstrum Coefficients

Related items

1	Mel Frequency Subband Coefficients Based Text Independent Speaker Recognition
2	Study On Speaker Verification Technology Related To Text And Applications
3	Study On Feature Extraction In Speaker Recognition
4	Research Of Text-Independent Speaker Recognition Based On VQ And GMM
5	Language Identification Based On Gaussian Mixture Models
6	Speaker Verification Based On Sorted GMM
7	Research On Text-Independent Speaker Verification System
8	The Research On Channel Robustness Of Text-independent Speaker Verification
9	Study On The Technologies Of Text-independent Short-duration Speaker Recognition
10	Research On Text-independent Speaker Verification Based On Deep Learning