A Study Of Complex Structure Alignment And Development Set Selection Strategy For Machine Translation

Posted on:2013-01-30

Degree:Master

Type:Thesis

Country:China

Candidate:C Hui

GTID:2218330362959274

Subject:Computer software and theory

Abstract/Summary:

Machine translation is the core technique for cross-language cooperation and communication in the modern world. It plays an important role in culture, science, religion, and sociology. Learning how to translate from large-scale data is called statistical machine translation. One important part of statistical machine translation is alignment, which extracts linguistic structures, such as words, phrases, syntax, and semantics, from sentence pairs in two languages to guide the translation. Another problem, which arises from machine learning, is domain adaptation. It affects the selection of the development set used to optimize model parameters: different development sets exert a large influence on translation quality. This thesis focuses on these two problems. For the first problem, it surveys the alignment modules used by single translation systems in the previous statistical machine translation workshop shared task and carries out a comparative study through experiments on the task's data, showing that phrase alignment is the mainstream approach in current statistical machine translation alignment systems. For the domain adaptation problem, it proposes two measures for selecting the development set, the difference of best translation error and BLEU-RECALL, and experiments show that translation performance improves significantly.

Keywords/Search Tags:

Statistical Machine Translation, Alignment, Phrase Syntax, Domain Adaptation, Development Set Selection

Related items

1	Domain Adaptation For Statistical Machine Translation
2	The Study On Phrase-Based Statistical Machine Translation System
3	Phrase Alignment Models for Statistical Machine Translation
4	Study On Several Key Problems In The Training Process Of Phrase-based Statistical Machine Translation
5	Research On The Key Technologies For Phrase-based Statistical Machine Translation Models
6	Research On Semantics Analysis-based Domain Adaptation Reinforcement Method For Machine Translation
7	Optimization On Translation Knowledge In Statistical Machine Translation
8	Acquisition Of Tree-To-String Alignment Based On Phrase-Syntax Structure
9	Domain Adaptation For Statistical Machine Translation
10	On Key Technologies For Phrase-Based Statistical Machine Translation