Design of a Modern Tibetan Ordering System Based on Unicode Encoding

Posted on: 2014-09-06
Degree: Master
Type: Thesis
Country: China
Candidate: J W Li
Full Text: PDF
GTID: 2268330401484825
Subject: Computer application technology
Abstract/Summary:
Tibetan collation is an essential foundation for Tibetan informatization. Cataloguing, searching, and ordering Tibetan names all depend on correctly sorted Tibetan text to work efficiently. Collation is also the most important problem to solve when compiling a Tibetan dictionary, and solving it underpins further Tibetan text processing.

Tibetan collation is more complex than collation in many other languages because a Tibetan syllable has a two-dimensional (2D) structure. Based on dictionary order and the syllable structure of native Tibetan, this thesis proposes converting each 2D syllable into a linear string of the form "basic consonant + prefix consonant + head consonant + vowel + first suffix consonant + second suffix consonant", with each vacant position filled by the Unicode space character. The correct order of Tibetan syllables is then obtained by comparing their sort keys.

The complete native Tibetan collation system consists of an input module, a syllable division module, a syllable judgment module, a sort key extraction and compression module, and a results display module. In the syllable judgment module, a dedicated algorithm for judging native Tibetan syllables is designed from the characteristics of native Tibetan syllables and the Tibetan Unicode encoding. After syllable judgment, the sort key is extracted from the Default Unicode Collation Element Table (DUCET) and compressed, and the resulting keys are used for comparison. The system collates native Tibetan correctly.
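
The six-slot string described in the abstract can be illustrated with a minimal Python sketch. This is not the thesis's implementation. The `Syllable` container, the `DECOMPOSED` lookup table, and the function names are hypothetical, and the decompositions are written by hand in place of the syllable judgment module. The sketch also compares raw Unicode code points instead of compressed DUCET weights, which is enough to show why a space in a vacant slot sorts before any Tibetan letter.

```python
from dataclasses import dataclass

TSHEG = "\u0f0b"   # intersyllabic mark that delimits Tibetan syllables
VACANT = "\u0020"  # space fills an empty slot, as the abstract proposes


@dataclass(frozen=True)
class Syllable:
    """Hypothetical result of syllable judgment: one letter per slot."""
    base: str
    prefix: str = ""
    head: str = ""
    vowel: str = ""
    suffix1: str = ""
    suffix2: str = ""


def split_syllables(text: str) -> list[str]:
    """Syllable division: cut the input at every tsheg, dropping empties."""
    return [s for s in text.split(TSHEG) if s]


def sort_string(s: Syllable) -> str:
    """Linearize a syllable in the slot order given in the abstract."""
    slots = (s.base, s.prefix, s.head, s.vowel, s.suffix1, s.suffix2)
    return "".join(slot or VACANT for slot in slots)


# A few Tibetan letters, written as escapes to avoid encoding ambiguity.
KA, KHA, DA, BA = "\u0f40", "\u0f41", "\u0f51", "\u0f56"
ACHUNG, RA = "\u0f60", "\u0f62"

# Hand-written decompositions (assumption: stands in for syllable judgment).
DECOMPOSED = {
    KA: Syllable(base=KA),
    DA + KA + RA: Syllable(base=KA, prefix=DA, suffix1=RA),
    BA + KA + ACHUNG: Syllable(base=KA, prefix=BA, suffix1=ACHUNG),
    KHA: Syllable(base=KHA),
}

# Unsorted input: four syllables, each followed by a tsheg.
text = TSHEG.join([KHA, BA + KA + ACHUNG, KA, DA + KA + RA]) + TSHEG
syllables = split_syllables(text)

# Python compares strings by code point, so the key order is the sort order.
ordered = sorted(syllables, key=lambda s: sort_string(DECOMPOSED[s]))
```

Because the basic consonant occupies the first slot, every syllable built on the same basic consonant is grouped together regardless of its prefix, and the bare syllable precedes its prefixed forms because the space character has a lower code point than any Tibetan letter.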
Keywords/Search Tags: Tibetan collation, Syllable judgment, DUCET