Application Of DT And DT-Adaptation In Acoustic Modeling Of ASR

Posted on:2010-09-06

Degree:Master

Type:Thesis

Country:China

Candidate:B Zhu

Full Text:PDF

GTID:2178360302959793

Subject:Signal and Information Processing

Abstract/Summary:

With the rapid development of the Automatic Speech Recognition technology whichis based on the statistical pattern recognition theories, remarkable progress has beenachieved in recent years.Discriminative Training has become one of the standard configuration methodsfor the state-of-the-art acoustic modeling and parameters optimization. Beyond that,these DT criteria have also been adopted to combine with other adaptation strategies,such as Linear Regression (LR) and Maximum A Posterior (MAP). All these methodshave been applied in some real systems, the validity of them has been proved to beuseful by the exciting improvements results. The strategy of DT based linear transformbeing applied in the feature end has also been improved to be useful. All the detail ofthese technologies is described in this thesis.Firstly, this thesis gives an overview and summary on the development history ofASR in chapter one. In the second chapter, the traditional acoustic modeling strategies,such as Maximum Likelihood Estimation(MLE), di?erent Discriminative Training cri-teria(DT) and some updating methods of acoustic model parameters are introducedseperately. The e?ectiveness of DT is obviously presented in our experiments. Wecompared the results on multiple Chinese and English recognition tasks which sharesdi?erent recognition types, the relative performance improvements are all above 15%.Secondly, the traditional adaptation methods are brie?y reviewed in the third chap-ter of this thesis.The new adaptation strategies which combine the discriminative train-ing criteria with the traditional adaptation ones are introduced in detail. They are linearregression with discriminative training(DT-LR) and maximum a posteriori with dis-criminative training(DT-MAP),we expand them with the MWCE criterion creativelyand both of DT-LR and DT-MAP are combined together to be adopted for adaptationin this thesis. The experimental results on both Chinese continuous speech recognitiontasks and English Spelling tasks are compared in detail. From the experimental results,it is obviously that the DT based adaptation strategies outperform the traditional MLEbased ones and they could be new e?ective choice to be applied for adaptation.Finally, the expand application of the discriminative training criteria for the fea-ture end's linear transform methodâ€“ feature Minimum Phone Error(fMPE), which isstrictly introduced in chapter four. This algorithm is realized based on HTK indepen- dently and it is a useful supplement of the traditional DT criteria for the updating ofacoustic model parameters. From the experimental results, some favorable improve-ments have also been achieved. It is very obviously that the MPE criterion could notonly be useful on the model parameters optimization but also on the feature end's re-finement.

Keywords/Search Tags:

Automatic Speech Recognition, Acoustic Modeling, Discriminative Training, Adaptation, Discriminative Feature Transform

Related items

1	Research On Acoustic Modeling For Spontaneous Spoken Speech Recognition
2	Research On Discriminative Techniques Of Feature Extraction And Acoustic Model Training In Continuous Speech Recognition
3	Discriminative Training Of Acoustic Models For Automatic Speech Recognition
4	Discriminative Methodologies For Tone Problem Solving In Mandarin Speech Recognition
5	Discriminative Training For Large Vocabulary Continuous Speech Recognition
6	Discriminative Training For Continuous Speech Recognition
7	Speaker Adaptation Of DNN-HMM Acoustic Model For Speech Recognition
8	Research On Discriminative Training In Speech Recognition
9	Mongolian Language Oriented Research On Acoustic Modeling For Speech Recognition
10	Research On Acoustic Model Of Speech Recognition In Educational Scene Based On Deep Learning