MapReduce-based Parallel Data Mining Services For TCM

Posted on: 2011-12-18
Degree: Master
Type: Thesis
Country: China
Candidate: Y Liu
Full Text: PDF
GTID: 2178360302974686
Subject: Computer Science and Technology
Abstract/Summary:
With the development of Traditional Chinese Medicine (TCM), a growing volume of standardized data is being processed, and the data size keeps expanding. The stand-alone version of DartSpora cannot satisfy the resulting demand for parallel processing. We designed a MapReduce-based parallel service framework for TCM to offer high-performance computing capability. In the framework, we implement a visual interaction platform and provide a programming web service. It integrates several data mining methods, such as clustering and frequent pattern finding. Moreover, it has been applied in TCM research. In this framework, my contributions are: (1) Implementing a visual interaction platform and providing a programming web service. (2) In the algorithm library, I implemented: a) a pattern finding algorithm for simple graphs; b) a pointwise mutual information algorithm...
Keywords/Search Tags:Traditional Chinese Medicine, MapReduce, Paralleling Service Framework, Data Mining, Clustering, Frequent Pattern Finding
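
The abstract names a pointwise mutual information (PMI) algorithm but does not describe how it was implemented. As an illustration only, the standard definition PMI(a, b) = log2(p(a, b) / (p(a) p(b))) can be computed in a map/reduce style, as in the minimal single-process sketch below. The function names (map_phase, reduce_phase, pmi_scores) and the record layout are assumptions made for this example; they are not taken from the thesis or from DartSpora.

```python
import math
from collections import Counter
from itertools import combinations


def map_phase(record):
    """Emit (key, 1) for every distinct item and every unordered item pair in a record.

    Hypothetical helper: single items use a 1-tuple key, pairs use a sorted 2-tuple key.
    """
    items = sorted(set(record))
    for item in items:
        yield (item,), 1
    for pair in combinations(items, 2):
        yield pair, 1


def reduce_phase(emitted):
    """Sum the emitted values per key, as a MapReduce reducer would after the shuffle."""
    counts = Counter()
    for key, value in emitted:
        counts[key] += value
    return counts


def pmi_scores(records):
    """Return {(a, b): PMI} for every item pair that co-occurs in at least one record."""
    n = len(records)
    counts = reduce_phase(kv for record in records for kv in map_phase(record))
    scores = {}
    for key, joint in counts.items():
        if len(key) != 2:
            continue  # skip single-item counts
        a, b = key
        p_ab = joint / n
        p_a = counts[(a,)] / n
        p_b = counts[(b,)] / n
        scores[key] = math.log2(p_ab / (p_a * p_b))
    return scores


# Made-up records; each inner list stands for the items observed together once.
records = [["a", "b"], ["a", "b"], ["c"], ["d"]]
scores = pmi_scores(records)
```

Here map_phase and reduce_phase are kept separate so that, in a real MapReduce job, the first could run per input split and the second per key; this sketch runs both in one process and makes no claim about the framework's actual design.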