Font Size: a A A

Latent semantic analysis as a tool to improve automatic speech recognition performance

Posted on:2006-12-20Degree:Ph.DType:Thesis
University:University of Colorado at BoulderCandidate:Coccaro, Noah BFull Text:PDF
GTID:2458390008463010Subject:Computer Science
Abstract/Summary:
This thesis explores the use of Latent Semantic Analysis to augment an N-gram language model to improve the accuracy of a large vocabulary speech recognition system. This thesis discusses possible solutions to three problems presented when integrating LSA with an N-gram model. First, two approaches to deriving a probability from a semantic distance are examined. Numerous parameters are introduced and optimal values found. Second, because the N-gram and LSA model have different strengths, it is necessary to develop confidence metrics that indicate when to rely more strongly on a particular model. Several confidence metrics are developed and used. Lastly, the problem of combining the two probability models is explored. Several different approaches to combining the models, including geometric mean and a decision tree were evaluated. Experimental results compared to a standard trigram model showed a reduction in perplexity of approximately 14%, and a significant reduction in the word error rate of a speech recognizer by 0.5%, 3.0% relative. Furthermore, the word errors that remained were on average of less semantic import than the original set of word errors.
Keywords/Search Tags:Semantic, Model, Speech
Related items