Acoustic Distinguishing Of Easily Confused Mandarin Phone

Posted on:2010-07-16

Degree:Master

Type:Thesis

Country:China

Candidate:C C Li

Full Text:PDF

GTID:2178360272982454

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

As the integration of world economy and the enhancement of our country's position in world economy, learning Chinese has been the urgent requirement for more and more representative people at home and abroad, because of the sharply increase of learner, Chinese teaching resources go short. Computer-assisted language learning can save Chinese teacher resources on the one hand, at the same time it can also meet the learners'requirements to learn Chinese independently at anytime and anywhere, which is an effective method to solve the problem.Nowadays the main technology of Computer-assisted Standard Mandarin learning is making use of speech recognition frame, the disadvantage of this frame is that the acoustic model has a limit discrimination ability of similar pronunciation. To make up the disadvantage of speech recognition frame, This paper discusses in detail the discrimination of easily confused mandarin phone pairs in the Computer-assisted teaching, directs towards acoustic feature of these phone pairs, three new feature extraction algorithms are proposed, Gaussian Mixture Modeling (GMM) is used to classify the new features. A new recognition algorithm of easily confused phone pairs is proposed.In the discrimination frame of easily confused phone, according to the flat tongue and raised tongue consonant with the same articulation manner and different articulation places, the paper proposes the method which adopts the energy concentrated frequency area as the divisional character. Directing towards the different energy distributing of flat tongue and raised tongue with Mel triangle filters extracting energy sum of high and low frequency, the method based on Mel Frequency Cepstrum Coefficient (MFCC) feature is proposed. Aiming at the human ears auditory model, by using the wavelet packet decomposition, describing frequency spectrum of easily confused phone pairs more accurately, the method based on Perceptual Linear Predictive (PLP) feature is proposed. The paper uses GMM to classify the new feature for phone discrimination. The experiment is done by different order of GMM, which attains the purpose of recognition the phone pairs.

Keywords/Search Tags:

speech recognition, energy concentrated frequency area, MFCC feature, PLP feature, GMM model

PDF Full Text Request

Related items

1	The Research On The Speech Recognition System In The Noisy Environment
2	Research On Key Technologies Of Speech Recognition In Tank Noise Environment
3	Structural Feature Parameters Of Speech Signal
4	Research Of The Speech Recognition Technology Based On HMM
5	Research On Speech Emotion Recognition Algorithm
6	The Research Of Robust Speech Recognition In Noise Environment Based On MFCC
7	Research On Speech Emotion Recognition Based On Multiple Feature Combination
8	MFCC Feature Extraction Research Based On ICA And Its Implementation On DSP
9	The Research Of Fusion LPCC And MFCC Feature Parameters In Speech Recognition Technology
10	The Research Of Anti-Noise Speech Recognition Feature