Syntactic alignment models for large-scale statistical machine translation

Posted on: 2013-12-08
Degree: Ph.D.
Type: Dissertation
University: University of Southern California
Candidate: Riesa, Jason A
Full Text: PDF
GTID: 1458390008968648
Subject: Artificial Intelligence
Abstract/Summary:
Word alignment, the process of inferring the implicit links between words across two languages, serves as an integral piece of the puzzle of learning linguistic translation knowledge. It enables us to acquire automatically from data the rules that govern the transformation of words, phrases, and syntactic structures from one language to another. Word alignment is used in many tasks in Natural Language Processing, such as bilingual dictionary induction, cross-lingual information retrieval, and distilling parallel text from within noisy data. In this dissertation, we focus on word alignment for statistical machine translation.

We advance the state of the art in search, modeling, and learning of alignments and show empirically that, when taken together, these contributions significantly improve the output quality of large-scale statistical machine translation, outperforming existing methods. We show results for Arabic-English and Chinese-English translation.

Ultimately, the work we describe herein may be used for any language pair, supporting arbitrary and overlapping features from varied sources. Finally, our features are learned automatically without any human intervention, facilitating rapid deployment for new language pairs.
Keywords/Search Tags:Alignment, Statistical machine, Translation