The Comparative Study On Acoustic Model Training Criterion For Automatic Mispronunciation Detection

Posted on:2014-12-08

Degree:Master

Type:Thesis

Country:China

Candidate:J M Wang

Full Text:PDF

GTID:2428330491955598

Subject:Communication and Information System

Abstract/Summary:

In this thesis,we research on the performance of automatic mispronunciation detection technology on Xinjiang non-native speaker's pronunciation,it is well know that mispronunciation detection technology is an important manifestation of computer-assisted language learning,Therefore,how to improve the performance of the Xinjiang non-native speaker Mandarin pronunciation error detection system is significant for the masses of ethnic minority learners.In this paper,taking Xinjiang Uygur and Kazakh speech database as an example,using Xinjiang non-native speakers small corpus,By improving and training on traditional pronunciation error detection acoustic model,the discriminative training algorithm can effectively improve the detection performance.The main work of this thesis as follows:Firstly,this paper introduces the calculation method of important index of pronunciation error detection of commonly used measurement GOP algorithm and F1-score which is an important indicator of error-detecting capability evaluation system calculation.Also introduces the maximum likelihood estimation(MLE)training criterion,and use MLE training criterion on Xinjiang non-native speech corpus,calculate the F1-score of the mispronunciation detection system,proposed for the comparison in this article laid the Foundation for the effectiveness of training criterion.Secondly,the traditional speech recognition of discriminative training criteria applied to the mispronunciation detection system,through the analysis of commonly used algorithm for discriminative training,we choose the widely used MPE discriminative training criteria,And verify the MPE discriminative training criterion can effectively improve the phoneme recognition error rate of data of experiments,but is not helpful to improve mispronunciation detection system performance,it shows that the existing discriminative training criteria can not effectively to improve mispronunciation detection system performance.Finally,based on the analysis of MLE and MPE acoustic training model,a discriminative acoustic model training method is proposed,through the establishment of Xinjiang non-native speaker speech database annotated by experts,to construct the objective function of maximize F1-score training criterion,and using to construct the weak-sense auxiliary function to update and optimization of parameters on the objective function,Through the use of threshold value iteration update method to ensure objective function improvement.The discriminative training criterion can automatically to learn standard of mispronunciation on Xinjiang non-native speakers small corpus,and the experimental results show that this criterion can effectively improve the performance of the mispronunciation detection system.

Keywords/Search Tags:

Maximum Likelihood Estimation, Minimum Phone Error, Maximum F1-score Criterion, Auxiliary Function

Related items

1	Research Of Parameter Estimation Based On Cos Method
2	A New Parameter-estimation Method Of 2PLM Based On Item Response Theory
3	Study On Sinusoidal Frequency Estimation Based On Maximum Likelihood Estimator
4	Research On Optimal Receiver In Generalized Gaussian Noise Channel Based On Maximum Likelihood Criterion
5	Maximum Likelihood Identification Of Dual Rate Sampled Hammerstein Output Error Systems
6	Improvement And Implementation Of ADC Test Algorithm Based On Maximum Likelihood Estimation
7	Maximum Likelihood Recursive Identification For Multivariate Equation-error Systems
8	Compensation Methods Of Different Speech Coding For Speaker Recognition
9	The Study Of CT Statistical Reconstruction Algorithm Algorithm Based On Maximum Likelihood And Likelihood And Penalized Likelihood Estimates
10	Maximum Likelihood Natural And Music Algorithm For Doa Estimation Of Dsp Realization