Feature Extraction Of Speaker Identification Based On Phoneme F-ratio

Posted on:2015-02-04

Degree:Master

Type:Thesis

Country:China

Candidate:C Zhao

Full Text:PDF

GTID:2298330452958686

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

In this paper, we employed the phoneme mean F-ratio method to investigate thedifferent contributions of different frequency region from the point of view of Chinesephoneme, and apply it for speaker identification. It is found that the speaker individualinformation depending on the phonemes is distributed in different frequency regionsof speech sound. Based on the contribution rate, we extracted the new features andcombined with GMM model. Compared with the MFCC feature, the identificationerror rate with the proposed feature was reduced by32.94%.Then, we conduct morphological analysis and acoustic modeling of the nasal andparanasal cavities to investigate the effects of the nasal cavity on speakercharacteristics. Morphological analysis showed that the nasal cavity possessesrelatively large variation across speakers. Acoustic effects results showed that theinter-speaker variation of the nasal tract affects spectra in the frequency range from2kHz to4kHz, which is in agreement with the results from our previous statisticalstudies.In speech production, the function of the velum is not a binary switch of on andoff. For the nasalized vowels and voiced stops in Japanese, the radiation probablymainly results from velum vibration. two mechanical experiments were conducted toreveal the acoustic incorporation of the transvelar coupling of the yielding velum.Finally, an acoustic model was proposed to integrate the velum effect for the speechsounds.

Keywords/Search Tags:

Speaker identification, Feature extraction, Nasal tract, Paranasalsinuses, Velum vibration

PDF Full Text Request

Related items

1	Study On Production Of Chinese Plosives Based On Simultaneous Measurement Of Sound And Oral-nasal Airflow
2	Research On Feature Extraction And Model Algorithm For Speaker Recognition
3	Digital Waveguide Model And Its Application In Speaker Identification
4	Frequency warping by linear transformation, and vocal tract inversion for speaker normalization in automatic speech recognition
5	Any Text Speaker Recognition System
6	Research On Feature Extraction And Robust Technology For Speaker Identification
7	Based Text-independent Speaker Identification Technology
8	Research On Speaker Identification Based On Speech Processing
9	The Research On Vibration Source Identification Algorithm Based On Multidimensional Characteristics
10	Reseach On Perceptual Cues Of Nasal Consonants