| TCM(Traditional Chinese Medicine) is a discipline which studies and exploits the macro functions of lives and diseases.During 2500 years of development and practice, TCM,based on clinical medicine,has made significant contributions to people's health not only in China but also in East Asian countries.Internet contains plenty of medical information and whose resources are still growing explosively,how to get useful medical information for TCM research has become a popular research direction. Vertical search engine is an important tool used for obtaining knowledge rapidly from the huge amount of data,thus,we can get more accurate,detailed and profound clinical medical information according to the discipline characteristics of TCM.In this paper, we designed a TCM clinical vertical search system called TCMVSE,which provides convenient information services based on TCM clinical data for public users.TCMVSE system consists of three core modules:web information collection,information extraction,information indexing and retrieval.The main research contents of this paper are as follows:(1) Based on deep study of Biclustering,we use a retrofitted version of cHawk algorithm to exploit the TCM clinical data and obtain fairly satisfaction results.The study shows that we can not only get the important information of compatibility between herbs,but also the targeted symptoms of these herbs combinations.(2) With the growing clinical data(including the case of structured electronic clinical data and bibliography of clinical),and web clinical information resources,we use mutual information to achieve the calculation and update of similarity between medical entities,which provides accurate data support for data mining.(3)We have implemented the key modules of TCM clinical vertical search system, including TCM clinical information collection and extraction from web,the conceptual structure of data processing,etc.According to system requirements,we use MALLET open source toolkit,which is used for text mining and information extraction,use Lucene for information indexing and retrieval services. |