Font Size: a A A

Handling complexity of synchronous grammars for machine translation

Posted on:2009-07-29Degree:Ph.DType:Dissertation
University:University of RochesterCandidate:Zhang, HaoFull Text:PDF
GTID:1448390002991084Subject:Artificial Intelligence
Abstract/Summary:
Synchronous grammars are grammars that model two languages and their translational equivalence. They are rewriting systems extended to two dimensions. Systems based on synchronous grammars and tree transducers promise to improve the quality of statistical machine translation output, but are often very computationally intensive. We improve the efficiency of such systems, making both decoding and training fast and effective.;We devise an algorithm for factoring syntactic re-orderings by binarizing synchronous rules when possible and show that the resulting rule set significantly improves the speed and accuracy of a state-of-the-art syntax-based machine translation system.;We take a multi-pass approach to machine translation decoding when using synchronous context-free grammars as the translation model and n-gram language models: the first pass uses a bigram language model, and the resulting parse forest is used in the second pass to guide search with a trigram language model. An additional fast decoding pass maximizing the expected count of correct translation hypotheses increases the BLEU score significantly.;We combine the strengths of Bayesian modeling and synchronous grammar in unsupervised learning of basic translation phrase pairs. The structured space of a synchronous grammar is a natural fit for phrase pair probability estimation, though the search space can be prohibitively large. Therefore we explore efficient algorithms for pruning this space that lead to empirically effective results.
Keywords/Search Tags:Synchronous, Translation, Grammars, Model
Related items