Font Size: a A A

Research On Intelligent Chinese Character-making Without Library Based On Topology And Statistics

Posted on:2011-09-18Degree:DoctorType:Dissertation
Country:ChinaCandidate:J P LuFull Text:PDF
GTID:1118330332472027Subject:Control theory and control engineering
Abstract/Summary:PDF Full Text Request
Research on intelligent Chinese character-making (ICC) without library has made the abundant achievement from the angle of the culture and technology. The designed software system of ICC realizes successfully the ICC experiment of 70244 characters which are specified in the Chinese character set of GB18030-2005. In order to research the inherent regularity of ICC, this paper is applied to symbolization of Chinese character prototypes, Chinese character structures and Chinese character code and researches the rationality, seriousness and stability of the Chinese character prototype theory, structure theory, code theory and ICC theory using the mathematical tools such as the topology and statistics. Therefore, it enriches and improves the ICC theory. In order to verify the effectiveness of ICC and research the entropy-dropping mechanism of ICC, the informatization efficiency evaluation of ICC is applied.The main work and achievement during the paper research period is as flows:1. Research on Chinese Character Prototype. (1) Using topological theory to describe the Chinese character prototypes: the relationship among the sets of Chinese characters, components and prototypes is analyzed; the relationship between the prototype and topological basis is established, supporting the Chinese naming of the prototype mathematical theoretically, providing the mathematic theory support for the Chinese naming of the Chinese character prototypes. (2) The mathematical theory of available to chose the prototypes is established, resolving the problem that how to choose prototype sets respectively in the different subsets of Chinese characters without causing any conflicts from each other. (3) Further, the mathematical model how to choose prototypes from the set of characters is established by using AHP (Analytic Hierarchy Process), resolving practically the mathematical problem that how to choose prototypes from the set of characters. (4) The stability of the prototypes. For the certain composition of the prototypes, and the asymptotic stability of the prototypes acquired in the experiment, using the exponential smoothing method in statistical models to predict the stability of the prototypes. The stability of the prototypes is predicted by using the nonlinear regression method that can be linearized in the statistical models.2. Research on Chinese character structures theory. (1) Using topological theory to describe the Chinese character structures: using quotient space and homotopy in modern topology to study on the structures'classes with different topological features in ICC, the mathematical descriptive theories for character structures are formed. The goal that the Chinese character is applied to mathematic description using the topology is achieved. (2) The stability of the structures. From the certain composition of the structures, the stability of the structures acquired in the experiment, and the topological properties of joining together way of characters to predict the stability of the structures.3. Research on Chinese character coding theory. As to the feature of the code of ICC including structure coding and prototype coding, (1) It states mathematically that the coding of ICC is a combinational coding with the feature"structure plus prototype". (2) It also verifies that the internal code of characters of ICC is a unique decodable code and instantaneous code from the mathematic theory. For the code experiment result in which all the 70244 Chinese characters of the GB18030-2005 have their own codes under the code platform and these codes are unique, the Chinese code theory explains the completeness and uniqueness of the internal code for ICC.4. Research on the ICC theory and the system model. Mathematical description has made to show the process of making-character, Firstly, the mathematical theory which can make character is verified from the angle of topology and the problem of the mathematical theory support of Chinese character-making is resolved. Secondly, the mathematical model of ICC is set up according to the ICC theory and the transition from qualitative description to mathematic theory description of the Chinese character-making theory is resolved. The mathematical theory which can make the Chinese character explains the realizability of the Chinese character-making and the mathematical model of ICC is set up further. Besides, the character-making experiment result also verifies the feasibility and effectiveness of the model method proposed in this chapter.5. Research on the Chinese character entropy-reducing mechanism of ICC. The present Chinese information systems all adopt the Chinese character word library, a word processing system with the most expensive expenditure but the lowest efficiency in which the Chinese character is the smallest processing unit and the average static information entropy is 9.65 bit. On the basis of analysis and research on the reason that the Chinese character system information entropy of current Chinese character word library is on the high side and the entropy-reducing mechanism, the information entropy experiment is carried out by taking the Chinese character prototypes as the Chinese character processing units and gets the information entropy with 5.29 bit which is almost near to the alphabetic writing level. This experiment indicates that the above program reduces the Chinese character information entropy effectively.
Keywords/Search Tags:Intelligent Chinese character-making(ICC), Chinese character prototypes, Chinese character structures, Chinese character code, Topological theory, Cognitive mechanism, Information entropy
PDF Full Text Request
Related items