
Research On Lao-Chinese Machine Translation Method Based On Structure Tree

Posted on: 2019-04-19
Degree: Master
Type: Thesis
Country: China
Candidate: L S E N M A N Y C H A N T
GTID: 2438330566483732
Subject: Communication and Information System
Abstract/Summary:
China and Laos are friendly neighbors linked by mountains and rivers. On April 25, 1961, China and Laos formally established diplomatic relations. From the late 1970s to the mid 1980s, relations between the two countries experienced twists and turns. In 1989, relations were normalized, bilateral ties were fully restored and developed, and friendly exchanges and cooperation in the political, economic, military, cultural, and health fields continued to deepen. The two sides have maintained close coordination and cooperation in international and regional affairs. In recent years, with the gradual implementation of the "One Belt and One Road" strategy put forward by China, exchanges and cooperation between China and Laos and other Southeast Asian countries have become increasingly frequent and close, which makes the study of machine translation between Chinese and Lao more meaningful and valuable.

The paper analyzes the linguistic features of Chinese and Lao, the commonly used techniques of machine translation, rule-based translation methods, and the problems of word segmentation and sentence segmentation in Lao language processing. The Chinese-Lao corpus design is divided into several modules, and the Chinese-Lao parallel corpus is constructed using a top-down approach. The dissertation designs a Chinese-Lao translation system based on a tree structure and analyzes the system framework and grammar rules. The system allows new Chinese and Lao sentences to be added and achieves simple Chinese-to-Lao translation.

The Lao language belongs to the Zhuang language branch of the Zhuang and Dong language group of the Sino-Tibetan language family. Besides Laos, it is also spoken in northeastern and northern Thailand. Lao writing is a phonemic script and has two different forms. The older one, "Duo Tan", meaning scripture, is mostly found in the Bayan scriptures and is used in Buddhist temples; the other is called "the old", meaning old text, and is similar in shape and spelling to modern Thai. The Lao PDR is a country with its own language and its own script, both named Lao. The language originated together with the Lao nation and is one of the oldest languages in Southeast Asia. Lao vocabulary has its origins in Pali and Sanskrit: most of the letters used for communication come from Sanskrit, while the characters used in religious language come mostly from Pali. The Lao script is described as having 56 letters, and its symbols include 33 consonants, 28 vowels, 4 tones, 3 special marks, and 10 numerals.

Lao and Chinese differ considerably, and the difference shows in the wording of each sentence. A Lao sentence may become long when translated, and Chinese sentences are sometimes long as well, so translation between the two languages is difficult. For example: (?)(??????). In this sentence, the Lao has only 4 words but the Chinese has 5. One aim of the work is to help Chinese speakers learn more about Laos.

Machine translation (MT) is a sub-field of computational linguistics that investigates the use of computer software to translate text or speech from one language to another. It helps people communicate with speakers of other natural languages. MT between Lao and Chinese has been an ongoing and challenging research topic. Most research has been done on Chinese-to-Lao machine translation, but not the other way around. Developing a system that can correctly translate any sentence from one language to the other is a formidable task because of the large differences between the syntax and semantics of the two languages. This suggests that such a system should be implemented incrementally: starting with a system that can translate simple phrases and sentences, it can then be expanded with grammar rules and capabilities to translate new, more complex sentences that have not previously been translated successfully.

This research proposes a new framework for developing Lao-to-Chinese machine translation systems incrementally. It is based on the interlingua approach, where input sentences are translated into intermediate representations before they are converted into equivalent sentences in the other language. Lexical Functional Grammar (LFG), in the form of an LFG tree structure, is used as the intermediate representation in this research. The framework has been used to implement a prototype system that can translate (?).

The dissertation reviews the relevant literature and analyzes machine translation techniques, the Lao and Chinese languages, and their syntactic features. In view of the current shortage of Lao-Chinese parallel corpora and the need to improve Lao-Chinese machine translation technology, the paper studies the following:

(1) Constructing a Chinese-Lao bilingual corpus.
(2) Considering the scarcity of Lao-Chinese parallel corpora and the resulting difficulty of developing statistical machine translation methods, proposing a rule-based translation framework between the two languages, based on an analysis of their syntactic features.

The work of this thesis provides bilingual resources for further research and has a degree of innovation.
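
The interlingua idea described above (analyze the source sentence into an intermediate structure, then generate the target sentence from that structure) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the thesis: the six-word lexicon and its glosses, the two grammar rules (S -> NP V NP, NP -> head noun followed by adjectives), and the dictionary with `PRED`, `SUBJ`, `OBJ`, and `ADJ` keys, which only loosely imitates an LFG functional structure. Input is assumed to be already segmented into words.

```python
"""Toy interlingua-style Lao-to-Chinese translation (illustrative only)."""

# Lao surface forms; assumed glosses: I, eat, rice, house, big, see.
KHOY, KIN, KHAO = "ຂ້ອຍ", "ກິນ", "ເຂົ້າ"
HUEAN, NYAI, HEN = "ເຮືອນ", "ໃຫຍ່", "ເຫັນ"

# Hypothetical lexicon: Lao word -> (part of speech, language-neutral concept).
LAO_LEXICON = {
    KHOY: ("PRON", "I"),
    KIN: ("V", "eat"),
    KHAO: ("N", "rice"),
    HUEAN: ("N", "house"),
    NYAI: ("ADJ", "big"),
    HEN: ("V", "see"),
}

# Hypothetical lexicon: concept -> Chinese word.
ZH_LEXICON = {
    "I": "我", "eat": "吃", "rice": "饭",
    "house": "房子", "big": "大", "see": "看见",
}


def lookup(token):
    """Return (part of speech, concept) for a Lao word, or raise ValueError."""
    if token not in LAO_LEXICON:
        raise ValueError(f"unknown Lao word: {token!r}")
    return LAO_LEXICON[token]


def parse_np(tokens, i):
    """Rule NP -> (N | PRON) ADJ*: the modifier follows the head in Lao."""
    if i >= len(tokens):
        raise ValueError("expected a noun phrase")
    pos, concept = lookup(tokens[i])
    if pos not in ("N", "PRON"):
        raise ValueError(f"expected a noun or pronoun, got {pos}")
    node = {"PRED": concept, "ADJ": []}
    i += 1
    while i < len(tokens) and lookup(tokens[i])[0] == "ADJ":
        node["ADJ"].append(lookup(tokens[i])[1])
        i += 1
    return node, i


def analyze(tokens):
    """Rule S -> NP V NP: build the intermediate structure for a sentence."""
    subj, i = parse_np(tokens, 0)
    if i >= len(tokens) or lookup(tokens[i])[0] != "V":
        raise ValueError("expected a verb after the subject")
    pred = lookup(tokens[i])[1]
    obj, i = parse_np(tokens, i + 1)
    if i != len(tokens):
        raise ValueError("unparsed trailing words")
    return {"PRED": pred, "SUBJ": subj, "OBJ": obj}


def generate_np(node):
    """Chinese puts the adjective before the head noun."""
    return [ZH_LEXICON[a] for a in node["ADJ"]] + [ZH_LEXICON[node["PRED"]]]


def generate(structure):
    """Linearize the intermediate structure as subject, verb, object."""
    words = generate_np(structure["SUBJ"])
    words.append(ZH_LEXICON[structure["PRED"]])
    words.extend(generate_np(structure["OBJ"]))
    return "".join(words)


def translate(tokens):
    """Lao word list -> intermediate structure -> Chinese string."""
    return generate(analyze(tokens))


if __name__ == "__main__":
    print(translate([KHOY, KIN, KHAO]))          # 我吃饭
    print(translate([KHOY, HEN, HUEAN, NYAI]))   # 我看见大房子
```

Because analysis and generation meet only at the intermediate structure, such a system can be grown incrementally in the way the abstract describes: a new lexicon entry or a new rule in `parse_np` or `analyze` extends coverage without changing the generation side, and sentences outside the current rules fail with an explicit error instead of producing a wrong translation.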
Keywords/Search Tags:Lao-Chinese