Applying Hilbert-Huang Transform To Speaker Recognition

Posted on:2007-11-23

Degree:Master

Type:Thesis

Country:China

Candidate:L Li

Full Text:PDF

GTID:2178360185980767

Subject:Physical Electronics

Abstract/Summary:

PDF Full Text Request

With the rapid development of the information technology and communication technology, individual identification becomes more and more important. As a biological authentication technique, speaker recognition has many applications for its advantages such as convenient, safer, and unforgotten. Speaker recognition is to analyze the feature parameters which are extracted from series of speech signals, and to identify or verify who spoke them.Hilbert-Huang transform (HHT), its key part is empirical mode decomposition, is a newly signal processing method proposed by Huang in 1998. In this thesis, we apply HHT to speaker recognition and obtain some initial achievements. In the first part, we introduce the sketch of speaker recognition and the HHT, in the other parts, the main works are discussed.We propose using linear prediction to resolve the endpoint effect evoked in EMD. Applying linear prediction to extend the data sequence by adding some predicted points, and it is stopped when a new local maximum and minimum points are emerged. Thus the error is reduced occurring in the original end when splining envelop of the data. By comparing this method with the others, we find it is effective when dealing with the endpoint effect of the EMD.On the basis of EMD, six types of parameters are presented to describe the individual features of the speaker. These parameters are EIF, ED, IMF-MFCC, IMFMFCCW1, IMFMFCCW2 and CEI, respectively determined through different methods. Then, valid test is performed by calculating the D ratio to judge their effectivity, from the results we know CEI among the six features is the most effective.At last, we perform some speaker identification experiments based on vector quantization of the above mentioned feature parameters respectively. As is shown in the results, of the six parameters, CEI has the best effectiveness as a individual feature of the speaker, and then IMF-MFCC, IMFMFCCW2, IMFMFCCW1, while EIF and ED are not suitable to be parameters for speaker recognition alone. According to a...

Keywords/Search Tags:

Speaker Recognition, Hilbert-Huang Transform, Empirical Mode Decomposition, Feature extraction, Vector Quantization

PDF Full Text Request

Related items

1	Speaker Recognition Based On EMD
2	Hilbert Huang Transform And Its Application In Feature Extraction
3	Hilbert-Huang Transform And Its Application In Feature Extraction Of Speech
4	Hilbert-Huang Transform And Its Application In Signal Processing
5	Research On Hilbert-Huang Transform And Its Improvements
6	On The Theory And Applications Of Hilbert-Huang Transform
7	Improved Hilbert-Huang Transform And Its Application Of The Signal Process In Power System
8	Research On The Theory And Application Of Hilbert-Huang Transform
9	Research On The Theory And Application Of Hilbert-huang Transform
10	The Application Of Hilbert - Huang Transform In Noisy Speech Processing