The Formula-Syndrome relationship is an important and difficult topic in Traditional Chinese Medicine (TCM) research, and Gene Expression Programming (GEP) is a powerful new tool for knowledge discovery. To address the problem of mining the Formula-Syndrome relationship, this thesis proposes a new approach. The main contributions are:
1) It analyzes the shortcomings of the Simple Formula and Syndrome Model (SimpFSM) and proposes an effective, space-saving coding method. The coding method transforms the characters in the database into numbers in an array, where every number corresponds to a position in the Symptom, so the space cost is small. It also separates the Synstring into SynMainStr and SynMinorStr, which makes the search faster.
2) It proposes two new concepts for Formula and Syndrome, Major Homology (HMA) and Minor Homology (HMI), together with two ways of calculating HMA and HMI: Simple Homology and Relative Homology.
3) It designs fitness functions. The Simple Fitness Function and the Mean Absolute Fitness Function calculate the fitness of a single Formula-Syndrome relationship, and the Average Absolute Error Fitness Function calculates the fitness of all Formula-Syndrome relationships.
4) It implements the Mining Relationship Algorithm based on Improved Gene Expression Programming (MRAGEP). The steps of MRAGEP are almost the...