
Preference Grammars and Decoding Algorithms for Probabilistic Synchronous Context Free Grammar Based Translation

Posted on: 2010-04-09
Degree: Ph.D
Type: Thesis
University: Carnegie Mellon University
Candidate: Venugopal, Ashish
Full Text: PDF
GTID: 2448390002486336
Subject: Computer Science
Abstract/Summary:
Probabilistic Synchronous Context-Free Grammars (PSCFGs) [Aho and Ullman, 1969; Wu, 1996] define weighted transduction rules that represent translation and reordering operations. When translation models use only features defined locally, on each rule, efficient dynamic programming algorithms can perform translation with these grammars [Kasami, 1965]. In general, integrating non-local features into the translation model can make translation NP-hard, requiring decoding approximations that limit the impact of these features.

In this thesis, we consider the impact of, and interaction between, two non-local features: the n-gram language model (LM) and labels on rule nonterminal symbols in the Syntax-Augmented MT (SAMT) grammar [Zollmann and Venugopal, 2006]. While these features do not make search NP-hard, applying naive dynamic programming methods with them leads to serious increases in wall-clock runtime.

We develop novel two-pass algorithms that make strong decoding approximations during a first-pass search, generating a hypergraph of sentence-spanning translation derivations. In a second pass, we use knowledge about non-local features to explore the hypergraph for alternative, potentially better translations. We use this approach to integrate the n-gram LM decoding feature as well as a non-local syntactic feature described below.

We then perform a systematic comparison to evaluate the relative impact of PSCFG methods over a strong phrase-based MT baseline, focusing on the n-gram LM and syntactic labels. This comparison addresses important questions about the effectiveness of PSCFG methods across a variety of language and resource conditions.
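To make the locality argument concrete, here is a minimal sketch of a PSCFG rule and derivation in Python. The rule format, the `#i` slot notation, and the toy grammar (a single reordering rule plus two lexical rules) are illustrative assumptions, not the thesis's actual grammar; the point it demonstrates is that when features are local to rules, a derivation's score is just a sum over its rules, which is the property dynamic programming exploits.

```python
import math
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class Rule:
    lhs: str
    src: str        # source side; "#i" marks the i-th nonterminal slot
    tgt: str        # target side; "#i" is co-indexed with the source slot
    logprob: float  # local rule score (log-probability)

@dataclass
class Derivation:
    rule: Rule
    children: Tuple["Derivation", ...] = ()

def score(d: Derivation) -> float:
    # Because local features decompose over rules, a derivation's score is
    # simply the sum of per-rule log-probabilities. This decomposition is
    # what makes efficient CKY-style dynamic-programming decoding possible.
    return d.rule.logprob + sum(score(c) for c in d.children)

def target_yield(d: Derivation) -> str:
    # Substitute each child's translation into its co-indexed target slot.
    out = d.rule.tgt
    for i, c in enumerate(d.children):
        out = out.replace(f"#{i}", target_yield(c))
    return out

# Hypothetical toy grammar: one reordering rule and two lexical rules.
r_swap  = Rule("S", "#0 #1", "#1 #0", math.log(0.5))  # swap the two slots
r_word1 = Rule("X", "voiture", "car", math.log(0.9))
r_word2 = Rule("X", "rouge", "red", math.log(0.8))

# Derivation translating source "voiture rouge" with reordering.
deriv = Derivation(r_swap, (Derivation(r_word1), Derivation(r_word2)))
```

Here `target_yield(deriv)` produces "red car", and `score(deriv)` is the sum of the three rule log-probabilities. A non-local feature such as an n-gram LM breaks this per-rule decomposition, since its value depends on the target words produced by neighboring rules.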
We learn that for language pairs that exhibit long-distance reordering, PSCFG methods deliver improvements over comparable phrase-based systems, and that SAMT labels yield additional small but consistent improvements even in conjunction with strong n-gram LMs.

Finally, we propose a novel approach to using nonterminal labels in PSCFG decoding by extending the PSCFG formalism to represent hard label constraints as soft preferences. These preferences are used to compute a new decoding feature that reflects the probability that a derivation is syntactically well formed. This feature mitigates the effect of the commonly applied maximum a posteriori (MAP) approximation and can be discriminatively trained in concert with other model features. We report modest improvements in translation quality on a Chinese-to-English translation task.
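The soft-preference idea can be sketched as follows. In this simplified picture (an assumption for illustration; the thesis's preference grammars are richer), each rule slot carries a distribution over the syntactic labels it expects, each child derivation carries a distribution over the labels it provides, and the well-formedness feature multiplies per-slot agreement probabilities instead of rejecting any hard-label mismatch:

```python
def slot_match_prob(expected: dict, provided: dict) -> float:
    # Probability that a label drawn from the rule's expectation agrees
    # with a label drawn from the child derivation's preferences.
    return sum(p * provided.get(label, 0.0) for label, p in expected.items())

def wellformedness(slot_pairs) -> float:
    # Soft syntactic feature for a derivation: the product of per-slot
    # match probabilities. A hard-label grammar would instead assign
    # probability zero to any derivation with a single mismatched slot.
    prob = 1.0
    for expected, provided in slot_pairs:
        prob *= slot_match_prob(expected, provided)
    return prob

# Example (hypothetical numbers): the rule mostly expects an NP filler,
# while the child derivation mostly prefers NP but sometimes looks like NN.
expected = {"NP": 0.7, "NN": 0.3}
provided = {"NP": 0.6, "NN": 0.1, "VP": 0.3}
```

With these numbers, `slot_match_prob(expected, provided)` is 0.7*0.6 + 0.3*0.1 = 0.45: the derivation is penalized for the partial label disagreement but not pruned outright, which is what allows the feature to soften the MAP approximation.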
Keywords/Search Tags:Translation, PSCFG, Grammars, Decoding, Features, Algorithms