Acoustic-Based Research On Automatic Language Identification

Posted on:2008-10-08

Degree:Master

Type:Thesis

Country:China

Candidate:H H Wang

Full Text:PDF

GTID:2178360215982728

Subject:Pattern Recognition and Intelligent Systems

Abstract/Summary:

PDF Full Text Request

This paper focuses on the research of Automatic Language Identification based on GMMs utilizing several kinds of acoustic features. Main research work includes the following:1. Building up a multiple lingual corpus including speeches of Chinese, English, German, Japanese, French and Spanish. The training set contains speech pieces from 264 speakers with duration from 60s to 300s. Fifty speech utterances with average duration of 4.5 seconds from 30-50 speakers outside the training set are chosen for each language to form the testing set.2. Primary experiment on ALID systems. GMM systems for the task of six-language recognition are established in order to investigate the relations among recognition rate, number of the GMM mixtures and quantity of the training data. The effect of RASTA and CMS on the performance of MFCC is also discussed.3. A new quadratic warping function is proposed for speaker normalization employing pitch mean based frequency warping. Comparative test with traditional linear function and piecewise linear function is performed to verify the validity of the new quadratic warping function.4. Shifted cepstra (SC) is proposed and applied to both RASTA-MFCC and RASTA-PLP in comparative tests with the prevailing SDC features. Effects on performance of different parameter configurations N-p-k for RASTA-PLP-SC are studied and the best performing path is determined through the hill-climbing method. Data fusion of RASTA-PLP-SC with other forms of vector is investigated at both feature and decision level.

Keywords/Search Tags:

Automatic Language Identification, acoustic feature, shifted cepstra, Gaussian mixture model

PDF Full Text Request

Related items

1	Research On Language IDE NT Ification
2	Footstep Identification Method Based On Gaussian Mixture Model
3	Language Identification Based On Gaussian Mixture Models
4	Research On Language Identification Based On Acoustic And Phonology
5	Research On Automatic Language Identification And Its Application
6	Identification, Based On The Language Of The Gmm-ubm Model
7	Acoustic Modeling Approach To Language Identification
8	Research On Automatic Language Identification Technology Over Telephone Channel
9	The Design And Implementation Of Automatic Language Recognition System
10	The Research Of Chinese Dialect Identification Based On Speech Feature Analysis