Handling complexity of synchronous grammars for machine translation

Posted on:2009-07-29

Degree:Ph.D

Type:Dissertation

University:University of Rochester

Candidate:Zhang, Hao

GTID:1448390002991084

Subject:Artificial Intelligence

Abstract/Summary:

Synchronous grammars are grammars that model two languages and their translational equivalence. They are rewriting systems extended to two dimensions. Systems based on synchronous grammars and tree transducers promise to improve the quality of statistical machine translation output, but are often very computationally intensive. We improve the efficiency of such systems, making both decoding and training fast and effective.

We devise an algorithm for factoring syntactic re-orderings by binarizing synchronous rules when possible and show that the resulting rule set significantly improves the speed and accuracy of a state-of-the-art syntax-based machine translation system.

We take a multi-pass approach to machine translation decoding when using synchronous context-free grammars as the translation model and n-gram language models: the first pass uses a bigram language model, and the resulting parse forest is used in the second pass to guide search with a trigram language model. An additional fast decoding pass maximizing the expected count of correct translation hypotheses increases the BLEU score significantly.

We combine the strengths of Bayesian modeling and synchronous grammar in unsupervised learning of basic translation phrase pairs. The structured space of a synchronous grammar is a natural fit for phrase pair probability estimation, though the search space can be prohibitively large. Therefore we explore efficient algorithms for pruning this space that lead to empirically effective results.
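
The phrase "binarizing synchronous rules when possible" in the abstract can be illustrated with a small sketch. It assumes that the re-ordering in a synchronous rule is represented as a permutation of its right-hand-side nonterminals, and it uses a generic shift-reduce test for whether that permutation can be built from binary merges of adjacent spans. The function name `is_binarizable` and the permutation encoding are illustrative assumptions, not taken from the dissertation, and the dissertation's own algorithm may differ.

```python
def is_binarizable(perm):
    """Return True if the permutation can be factored into binary steps.

    perm[i] is the target-side position of the i-th source-side
    nonterminal (assumed encoding, for illustration only).
    """
    stack = []  # each entry is a (low, high) span of target positions
    for x in perm:
        stack.append((x, x))
        # Greedily merge the top two spans while they are adjacent
        # on the target side (in either order).
        while len(stack) >= 2:
            (lo2, hi2), (lo1, hi1) = stack[-2], stack[-1]
            if hi2 + 1 == lo1 or hi1 + 1 == lo2:
                stack[-2:] = [(min(lo1, lo2), max(hi1, hi2))]
            else:
                break
    # A single remaining span means every step was a binary merge.
    return len(stack) == 1


print(is_binarizable([2, 1, 4, 3]))  # two swapped pairs
print(is_binarizable([2, 4, 1, 3]))  # the classic non-binarizable pattern
```

Under this encoding, a rule with re-ordering (2, 1, 4, 3) can be split into binary rules, while (2, 4, 1, 3) and (3, 1, 4, 2) cannot, which is why binarization applies only "when possible".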

Keywords/Search Tags:

Synchronous, Translation, Grammars, Model

Related items

1	Preference Grammars and Decoding Algorithms for Probabilistic Synchronous Context Free Grammar Based Translation
2	Towards A Wide-Coverage Grammar: Graphical Abstract Categorial Grammars
3	Synchronous and multicomponent Tree-Adjoining Grammars: Complexity, algorithms and linguistic applications
4	Research On Synchronous Tree Substitution Grammar Based Statistical Machine Translation Methods
5	Research Of Phrase-based Translation Model Using Syntactic And Morphologic Information
6	Research And Implementation Of Contract Translation Based On Neural Machine Translation Model
7	A Statistical Machine Translation System Between Mongolian And Chinese
8	Research On Auto-Construction Of EBMT Translation Model
9	Efficient XML stream processing and searching
10	Research And Implementation Of Hierarchical Phrase-based Translation Model In Statistical Machine Translation