
Discriminative training of language models for speech recognition

Posted on: 2011-06-04
Degree: M.Sc
Type: Thesis
University: York University (Canada)
Candidate: Magdin, Vladimir
Full Text: PDF
GTID: 2448390002453663
Subject: Artificial Intelligence
Abstract/Summary:
This thesis presents a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition (LVCSR). Language models play an important role in speech recognition because they constrain the potentially vast search space of possible hypotheses. The discriminative training algorithm introduced in this thesis estimates the parameters of a standard n-gram language model so as to increase recognition rates in speech recognition tasks.

Experimental results on the Speech in Noisy Environments 1 (SPINE1) speech recognition corpus show that the proposed discriminative training method can outperform conventional discounting-based maximum likelihood estimation methods: a relative word error rate reduction of over 2.5 percent was observed on the SPINE1 task.

Two formulations of the algorithm are presented. One uses maximum mutual information estimation (MMIE) and the other uses large margin estimation (LME) to build an objective function involving a metric computed between correct transcriptions and their competing hypotheses, which are encoded as word graphs generated by the Viterbi decoding process. The nonlinear MMIE/LME objective functions are approximated by linear functions via an auxiliary function inspired by the Expectation-Maximization (EM) algorithm. After this linear approximation, the nonlinear discriminative training problem for n-gram language models becomes a linear programming problem, which can be solved efficiently with widely available convex optimization tools.
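For orientation, the MMIE criterion commonly used in discriminative training takes the following textbook form; the thesis' exact objective (scaling factors, and the margin term used in the LME variant) may differ:

    F_{\mathrm{MMIE}}(\lambda) = \sum_{r} \log \frac{P_\lambda(W_r)\, p(X_r \mid W_r)}{\sum_{W \in G_r} P_\lambda(W)\, p(X_r \mid W)}

where X_r is the acoustic observation sequence of utterance r, W_r its reference transcription, G_r the word graph of competing hypotheses, and P_\lambda the n-gram language model with parameters \lambda.

The sketch below illustrates, in Python, how one linearized update step could be posed as a linear program. It is a minimal toy example under stated assumptions, not the thesis' implementation: it assumes the auxiliary function reduces the objective's gradient with respect to each log n-gram probability to that n-gram's count in the reference transcriptions minus its expected count in the competitor word graphs, and all variable names and statistics are hypothetical.

    # Minimal sketch (hypothetical, not the thesis' code): one linearized
    # update of n-gram log-probabilities posed as a linear program.
    import numpy as np
    from scipy.optimize import linprog

    # Toy sufficient statistics for three n-grams (hypothetical values):
    ref_counts = np.array([3.0, 1.0, 0.0])   # counts in reference transcriptions
    comp_counts = np.array([1.5, 1.2, 0.8])  # expected counts in word graphs

    # Linearized objective: maximize (ref_counts - comp_counts) . x, where
    # x holds the updates to the log n-gram probabilities. linprog minimizes,
    # so the coefficient vector is negated.
    c = -(ref_counts - comp_counts)

    # Box constraints act as a trust region, keeping each update small enough
    # that the linear approximation of the nonlinear MMIE/LME objective stays
    # reasonable; the zero-sum equality is a crude stand-in for the proper
    # normalization constraints a real formulation would carry.
    delta = 0.1
    bounds = [(-delta, delta)] * len(c)
    res = linprog(c, A_eq=[[1.0] * len(c)], b_eq=[0.0], bounds=bounds)
    print(res.x)  # per-n-gram log-probability updates

The trust-region bound reflects a general property of this kind of scheme: a linear surrogate of a nonlinear objective is only trustworthy near the current model, so updates must be kept small and the linearize-and-solve step repeated.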
Keywords/Search Tags: Language models, Speech recognition, Discriminative training, Algorithm