| The Xinxiu Leiyin Yinzheng Qunji Yupian (hereafter Xinxiu Yupian) is a large character dictionary compiled by Xing Zhun in the Jin Dynasty. It was based on Leiyu Pianhai and incorporated the rhymes of Guangjiyun, Shengyun, Qieyun and Guangyun. The book originally had thirty volumes, of which twenty-nine survive. It is divided into 545 parts and contains more than 50,000 characters in total. In addition to regular-script forms, it includes a large number of semi-liding (semi-Li-definite) characters, handwritten forms, characters from Buddhist scriptures and so on. These complex forms make collating the characters difficult. In building the character database of Xinxiu Yupian, 8,649 uncoded characters were found. Uncoded characters are Chinese characters that are not included in Unicode 12.0 and so have no Unicode code point. Every volume contains uncoded characters, except for the incomplete Volume 21, whose characters cannot be counted. Heterographic writing refers to "recording the same word under the same system, with the same structure and meaning, but only with different writing styles", and can be divided into two types: stroke heterographs and component heterographs. Observation of the uncoded characters in Xinxiu Yupian shows that a considerable number of them result from variation of strokes or components in writing. These characters are collectively called uncoded heterographs. Among the 66 uncoded heterographs in Xinxiu Yupian, stroke heterographs are the minority and most are component heterographs. Stroke heterographs fall into four main cases: sketch variance, stroke increase, stroke decrease and stroke connection variance. Component heterographs fall into three: component position change, component similarity and component variance. The main causes of these uncoded variants are the similarity of strokes, the pursuit of simplicity and speed in writing, differences in writing methods, and the different writing styles of the compilers. By sorting out the uncoded heterographs in Xinxiu Yupian with the help of the font database, the paper explores the causes and general rules of heterographs. This can lay a foundation for the future collation and compilation of large-scale fonts and can also support the normalization and standardization of Chinese characters. |