The Design And Implementation Of Mongolian Word Analyzing And Correcting Based On Syllabic Statistical Language Method

Posted on:2008-11-30

Degree:Master

Type:Thesis

Country:China

Candidate:J Zhao

Full Text:PDF

GTID:2178360215491526

Subject:Computer application technology

Abstract/Summary:

With the rapid development of the information society, more and more electronic books, papers, and documents appear in our work. The automatic detection and correction of text errors has become an active focus for natural language processing (NLP) researchers. In Mongolian information processing, however, automatic correction of Mongolian text has not yet been well studied. So far, researchers have relied on dictionary-based correction. This method works well when the dictionary is small, but correction efficiency decreases as the number of words grows. The goal of this paper is to put forward a new method for Mongolian text correction. The main work in this paper includes:

First, some knowledge of Mongolian syntax is introduced, and the syllable characteristics of Mongolian words are analyzed from different perspectives, e.g. the length of a Mongolian word, the number of syllables in a word, and the position of syllables.

Second, this paper introduces some well-known language models used in natural language processing and algorithms for computing text similarity, and puts forward a method for Mongolian correction based on a 2-gram model. The design of the proofreading model, the algorithm for model learning, and the algorithm for Mongolian correction are introduced in detail. The rules of text errors are learned and displayed as directed graphs.
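The abstract does not give the details of the 2-gram correction model, so the following is only a minimal sketch of the general idea: a syllable-level bigram model is trained on correctly spelled words and then used to rank candidate spellings. The add-one smoothing, the boundary markers, the function names, and the Latin-letter toy syllables are all assumptions made for illustration. None of them is taken from the thesis, and the toy strings are placeholders rather than real Mongolian syllabification.

```python
from collections import Counter
import math

# Hypothetical word-boundary markers (an assumption of this sketch).
BOS, EOS = "<s>", "</s>"


def train_bigram(words):
    """Count syllable contexts and syllable bigrams.

    `words` is an iterable of words, each already split into a
    sequence of syllables (the syllabification step is assumed).
    """
    unigrams, bigrams = Counter(), Counter()
    for syllables in words:
        seq = [BOS, *syllables, EOS]
        unigrams.update(seq[:-1])            # every symbol that acts as a context
        bigrams.update(zip(seq, seq[1:]))    # adjacent syllable pairs
    vocab = {s for w in words for s in w} | {EOS}
    return unigrams, bigrams, len(vocab)


def log_prob(syllables, model):
    """Log-probability of a syllable sequence under add-one smoothing."""
    unigrams, bigrams, vocab_size = model
    seq = [BOS, *syllables, EOS]
    return sum(
        math.log((bigrams[(a, b)] + 1) / (unigrams[a] + vocab_size))
        for a, b in zip(seq, seq[1:])
    )


def best_candidate(candidates, model):
    """Return the candidate syllable sequence the model scores highest."""
    return max(candidates, key=lambda c: log_prob(c, model))


# Toy training data: placeholder syllables, not real Mongolian.
corpus = [("mon", "gol"), ("mon", "gol"), ("gol", "tai")]
model = train_bigram(corpus)

# Rank a seen spelling against a variant containing an unseen syllable.
suggestion = best_candidate([("mon", "glo"), ("mon", "gol")], model)
print(suggestion)
```

In a fuller system the candidate list would come from a separate step, for example syllable sequences within a small edit distance of the suspect word. Here the candidates are supplied by hand to keep the sketch short.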

Keywords/Search Tags:

automatic proofreading, n-gram model, Mongolian scripts, syllable

Related items

1	Research On Text Proofreading Method Based On The Analysis Of The Mongolian Syllable
2	Research And Realization Of Non-word Error Automatic Proofreading System In Chinese Text
3	Study On The Method Of Automatic Proofreading Of Word-level Chinese Text
4	Research On Chinese Syllable Evaluation Approach After Automatic Speech Recognition
5	A Research Of Mongolian Auto-proofreading Method Based On The Patterns
6	The Research Of The Word Of Mongolian Language Based On UNICODE And OpenType Font
7	The Research On The Automatic Proofreading Algorithm Of Recognition Flow
8	Design And Implementation Of Uyghur Words Automatic Proofreading System
9	Research On Cyrillic And Mongolian Script's Morphology And Conversion System
10	Research On Chinese Text Proofreading Algorithm Based On The Combination Of Statistical Features And Rules