Font Size: a A A

The Methodological Study Of Mongolian Woodblock To Automatic Recognition

Posted on:2016-03-18Degree:DoctorType:Dissertation
Country:ChinaCandidate:B H R L BinFull Text:PDF
GTID:1225330461980889Subject:Chinese Ethnic Language and Literature
Abstract/Summary:PDF Full Text Request
In the era of information technology and computer linguistics, one the the urgent requirement for keeping and developing our mother language, script, history, culture, intellectual heritage and the national peculiarity is to process our language and script by integrating with computer linguistics.In other words, it is essential to convert all those heritages such as sutras in mongolian script and textbooks to digital version using the modern advanced techniques and saving the time and money and generalize in public use. Taking into consideration on the social needs, the author has chosen the theme how to digitize sutras and books in woodblock, to form electronic fund of Mongolian Kangyur and to develop application programs.He has converted Mongolian woodblock Kangyur sutra from photo to digital texts taking its first chapter of Beijing xylographic version as an example and transcribed into scientific Latin transcription and formed the fund with photos of source textbook of the sutra. The sutra is a work with 396 sheets or 790 pages that 31 lines in each page with woodblock glyph.In the article, the author writes about the processing of program to textualize woodblock Kangyur sutra based on the OCR system in computer and the experiments of the program and its results as well.For the structure of article, introduction consists of aim, reason to choose, novity and significance of the work.In the first chapter, he introduced general theory, method and processing of OCR program for texts.The second chapter has introduces the main requirement of Mongolian script OCR processing, the basic structure and principle of Mongolian language and script, the survey of Mongolian woodblock printing and types and genres of woodblock books which is inherited to our generation. Also, he writes about the transcription method of the Kangyur sutra’s first chapter in Latin.In the third chapter, he focuses on sequence of program processing to recognize woodblock Kangyur sutra and its result as well as further development.
Keywords/Search Tags:Mongolian script, OCR, Woodblock
PDF Full Text Request
Related items