The Research Of Feature Parameters Extraction For Speaker Recognition

Posted on:2013-08-30

Degree:Master

Type:Thesis

Country:China

Candidate:X Y Cao

Full Text:PDF

GTID:2248330395484781

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Speaker recognition as one of the biometric authentication techniques is to recognize speaker’s identity by its voice waveforms. With the rapid development of network and information technology, speaker recognition has caught more and more people’s attention for its application prospect, and has become a hot spot in the research field of biological authentication technology today. On the condition that research the feature extraction algorithms of the current speaker recognition, this thesis proposed an improved feature extraction algorithm based on MFCC and an auditory feature extraction algorithm based on gammatone filter banks. Finally, the simulation experiments were conducted in the MATLAB platform. The mainly works in this thesis are as follows:1. At first, this thesis introduces the related knowledge of speaker recognition technology in detail, such as the working principle, system structure and performance evaluation criteria and so on. Then the feature parameters extraction of speaker recognition was studied, especially for the extraction algorithm of LPC, LPCC and MFCC. Besides, the speaker recognition system based on GMM was also studied.2. An improved feature extraction algorithm based on MFCC was proposed. This thesis analyzed the advantages and disadvantages for MFCC and IMFCC. For the MFCC extraction algorithm’s shortcoming that traditional Mel triangle filter banks’computational precision insufficient in the high-frequency region, and the IMFCC extraction algorithm’s shortcoming that inverse Mel triangle filter banks’computational precision insufficient in the low-frequency region, this thesis proposed an improved filter banks that combined Mel triangle filter banks’low-frequency region with inverse Mel triangle filter banks’high-frequency region. Then in the MFCC extraction algorithm, we use the proposed filter banks to replace traditional Mel triangle filter banks, and got an improved feature NewMFCC. In addition, we also combined NewMFCC with spectral centroid, and extracted their combination feature parameter. Finally, the simulation experiments were conducted on TIMIT speech database; its results prove the effectiveness of the improved algorithm.3. An auditory feature extraction algorithm based on gammatone filter banks was proposed. This thesis analysis the composition of human ear auditory system and its working principle, and then the gammatone filter banks and the non-linear characteristics of auditory system were studied. In the new auditory feature extraction algorithm, we use gammatone filter banks to replace the traditional Mel triangle filter banks, and use exponential compression that about with frequency to replace the fixed logarithm compression; besides, we also employ half raised-sine cepstrum raised technology and got an auditory cepstrum coefficient GFCC. Finally, we compared and analyzed the performance of GFCC by simulation experiments, the results show that:compared with LPCC and MFCC, GFCC has better recognition rate and noise robustness.

Keywords/Search Tags:

Speaker Recognition, Feature Extraction, GMM, MFCC, Gammatone filter banks

PDF Full Text Request

Related items

1	The Research Of Feature Extraction Based On Fisher Criterion In Speaker Recognition
2	Research On Speaker Recognition Algorithm Based On Cepstrum Feature
3	Auditory Filter Banks In Speech Recognition System
4	The Study Of Speaker Recognition System Based On MFCC
5	Study Of Speaker Recognition System Based On MFCC And GMM
6	The Research Of Front-end Filter For Speaker Independent Robust Speech Recognition
7	Study On Speech Feature Extraction Algorithm In Speaker Recognition System
8	Research On Speaker Parameters Extraction Algorithm In Noisy Environment
9	Study On Speaker Recognition Technology
10	Studies On The Speech Signals Processing Of The Speaker Recognition System