Design and Implementation of a Multi-language Bilingual Alignment Platform

Posted on: 2005-12-26
Degree: Master
Type: Thesis
Country: China
Candidate: F Wang
GTID: 2208360125967991
Subject: Computer applications
Abstract/Summary:
Bilingual corpora play an important role in Example-Based Machine Translation (EBMT), the acquisition of translation knowledge, the construction of bilingual dictionaries, and related tasks. Building a large-scale bilingual corpus is the foundation of corpus research, so how to use bilingual text obtained from the Internet to build such a corpus is an important question. As the key technology in corpus construction, bilingual alignment is receiving increasing recognition. By combining a length-based method with a lexicon-based method and splitting the bilingual text into several pieces at anchor sentence pairs, we align bilingual texts at the sentence level and improve the accuracy of the alignment. In addition, because the recall of the lexicon in the bilingual dictionary is low, we propose a new method to improve it, which results in more accurate and faster sentence alignment. Following this approach, we design and build an accurate, reliable, and practical bilingual sentence alignment system that provides strong support for the construction of bilingual corpora.
Keywords/Search Tags: bilingual corpus, alignment, bilingual sentence alignment, bilingual dictionary, accuracy, recall
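
The abstract names three ingredients: lexicon-based matching, anchor sentence pairs that split the text into pieces, and length-based alignment inside each piece. The abstract gives no implementation details, so the following Python sketch is only one plausible way to combine them, not the thesis's actual system. All function names, the bead penalties in `BEADS`, the greedy anchor search, the 0.3 threshold, and the character-length cost are illustrative assumptions.

```python
from typing import Dict, List, Set, Tuple

Bead = Tuple[List[int], List[int]]  # (source sentence indices, target sentence indices)

# Assumed penalties per bead type (source count, target count); not from the thesis.
BEADS = {(1, 1): 0.0, (1, 0): 1.0, (0, 1): 1.0, (2, 1): 0.3, (1, 2): 0.3}


def lexicon_score(src: str, tgt: str, lexicon: Dict[str, Set[str]]) -> float:
    """Fraction of source words with a dictionary translation found in the target."""
    src_words = src.lower().split()
    tgt_words = set(tgt.lower().split())
    if not src_words:
        return 0.0
    hits = sum(1 for w in src_words if lexicon.get(w, set()) & tgt_words)
    return hits / len(src_words)


def find_anchors(src, tgt, lexicon, threshold):
    """Greedily pick monotonic sentence pairs whose lexicon score reaches the threshold."""
    anchors, next_j = [], 0
    for i, s in enumerate(src):
        for j in range(next_j, len(tgt)):
            if lexicon_score(s, tgt[j], lexicon) >= threshold:
                anchors.append((i, j))
                next_j = j + 1
                break
    return anchors


def length_cost(src_len: int, tgt_len: int) -> float:
    """Normalized length difference: 0 for equal lengths, 1 when one side is empty."""
    total = src_len + tgt_len
    return abs(src_len - tgt_len) / total if total else 0.0


def align_segment(src_lens: List[int], tgt_lens: List[int], i0: int, j0: int) -> List[Bead]:
    """Length-based dynamic programming over one segment; i0/j0 offset the indices."""
    n, m = len(src_lens), len(tgt_lens)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for (di, dj), penalty in BEADS.items():
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                c = cost[i][j] + penalty + length_cost(sum(src_lens[i:ni]), sum(tgt_lens[j:nj]))
                if c < cost[ni][nj]:
                    cost[ni][nj], back[ni][nj] = c, (di, dj)
    beads, i, j = [], n, m
    while i or j:  # walk the back-pointers from the end of the segment
        di, dj = back[i][j]
        beads.append((list(range(i0 + i - di, i0 + i)), list(range(j0 + j - dj, j0 + j))))
        i, j = i - di, j - dj
    return beads[::-1]


def align(src: List[str], tgt: List[str], lexicon: Dict[str, Set[str]],
          threshold: float = 0.3) -> List[Bead]:
    """Split at anchor pairs, then align each piece between anchors by length."""
    beads: List[Bead] = []
    prev_i = prev_j = 0
    for ai, aj in find_anchors(src, tgt, lexicon, threshold):
        seg_src = [len(s) for s in src[prev_i:ai]]
        seg_tgt = [len(t) for t in tgt[prev_j:aj]]
        beads += align_segment(seg_src, seg_tgt, prev_i, prev_j)
        beads.append(([ai], [aj]))  # the anchor itself is a fixed 1-1 bead
        prev_i, prev_j = ai + 1, aj + 1
    # Align whatever remains after the last anchor.
    tail_src = [len(s) for s in src[prev_i:]]
    tail_tgt = [len(t) for t in tgt[prev_j:]]
    return beads + align_segment(tail_src, tail_tgt, prev_i, prev_j)
```

In this sketch, anchors restrict the dynamic programming to short pieces, which is one way the splitting step could make alignment faster, and a dictionary with higher recall would yield more anchors. A real system for a language pair such as Chinese and English would need proper word segmentation and a length ratio tuned to the pair, neither of which is modeled here.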