A New Codebook Design Method Based On Genetic Algorithm For Text-Independent Speaker Identification

Posted on:2009-05-02

Degree:Master

Type:Thesis

Country:China

Candidate:L Z Chen

Full Text:PDF

GTID:2178360242967496

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Speaker recognition as one of the biometrics techniques is to recognize speaker's identity from its voice which contains physiological and behavioral characteristics specific to each individual. Speaker recognition has caught many attentions for its particularly advantage on convenience, economy and veracity and become an important and popular authentication technique in human life and work. Therefore, a more robust method for speaker recognition with high accuracy of recognition rate is the aim for researchers at home and abroad. In many speaker recognitions, paper combines VQ with GA, it uses Euclid distance to identify according to MFCC and its extended parameters.It often uses AMFCC to analyze change of every dimension parameter and AAMFCC to explain acceleration in the feature extraction. From old experiments and theories, paper introduces square sum of first-order derivative mel-cepstrum coefficient and difference which express total change of many-dimension cepstrum parameters in some frame. Along that method it appends two new feature parameters that are square sum of second-order derivative mel-cepstrum coefficient and its difference. Through experiments verified, new parameters increases the identification rate.Paper adopts the combination of VQ and GA, to avoid the possibility that classical LBG algorithm easily obtain local optimization and make up limitation of selection of initial codebook for final result. To prevent GA from getting into local optimization system not only increases iterative times, but also use a method that is combined by selecting part of optimal individuals and judging similarity every individual in the colony to change aberrance rate, it in some extent may avoid obtaining sub-optimization.From experiments, paper knows that after adopting GA recognition performance is improved comparing with LBG. And more long training speech length is, better recognition performance is.Paper finds that recognition does not always performance well while number of codeword is increasing. When the number reaches some, recognition begins to performances bad.

Keywords/Search Tags:

GA (Genetic Algorithm), VQ (Vector Quantization), Speaker Identification, MFCC (Mel-Frequency Cepstrum Coefficient)

PDF Full Text Request

Related items

1	Text-Dependent Speaker Recognition Algorithm Study Based On Improved VQ
2	A Research Based On The Technique Of Unit Identification Of The Animal Sound
3	Research On Speaker Recognition Based On Combination Of Features
4	The Research Of Speaker Recognition Under Noisy Environment
5	The Research Based On The Text Irrelevant Speaker Distinguishes
6	Research On Speaker Recognition Based On MFCC And PSO-BP Neural Network
7	Speaker Recognition Research In Noisy Environment
8	Speaker Identification Based On Independent Component Analysis And Genetic Algorithm
9	Application Study On Vector Quantization In Speaker Identification
10	Design And Implementation Of Speaker Recognition System Based On Windows Ce