Syntactic alignment models for large-scale statistical machine translation

Posted on:2013-12-08

Degree:Ph.D

Type:Dissertation

University:University of Southern California

Candidate:Riesa, Jason A

Full Text:PDF

GTID:1458390008968648

Subject:Artificial Intelligence

Abstract/Summary:

Word alignment, the process of inferring the implicit links between words across two languages, serves as an integral piece of the puzzle of learning linguistic translation knowledge. It enables us to acquire automatically from data the rules that govern the transformation of words, phrases, and syntactic structures from one language to another. Word alignment is used in many tasks in Natural Language Processing, such as bilingual dictionary induction, cross-lingual information retrieval, and distilling parallel text from within noisy data. In this dissertation, we focus on word alignment for statistical machine translation.;We advance the state-of-the-art in search, modeling, and learning of alignments and show empirically that, when taken together, these contributions significantly improve the output quality of large-scale statistical machine translation, outperforming existing methods. We show results for Arabic-English and Chinese-English translation.;Ultimately, the work we describe herein may be used for any language-pair, supporting arbitrary and overlapping features from varied sources. Finally, our features are learned automatically without any human intervention, facilitating rapid deployment for new language-pairs.

Keywords/Search Tags:

Alignment, Statistical machine, Translation

Related items

1	Study On Word Alignment Technology And Construction Of Statistical Machine Translation Platform
2	Syntactic alignment models for large-scale statistical machine translation
3	Morphology-Processing In Chinese-Mongolian Statistical Machine Translation
4	Research On Bilingual Corpus-Based Machine Translation
5	The Study On Phrase-Based Statistical Machine Translation System
6	Bitext alignment for statistical machine translation
7	Phrase Alignment Models for Statistical Machine Translation
8	The Research On English-Chinese Name Entity Translation
9	The Research On The Technology Of Statistical-Based Chinese-English Machine Translation
10	Improved word alignments for statistical machine translation