
Research On Data Mining Based On Web Services And PMML

Posted on: 2009-09-01
Degree: Master
Type: Thesis
Country: China
Candidate: Z J Zhang
Full Text: PDF
GTID: 2178360242974300
Subject: Computer software and theory
Abstract/Summary:
Data mining technology is now widely used, but it still faces major challenges. First, different vendors define data mining models differently, which hinders the sharing of models among data mining systems. Second, much data is distributed across different locations, so collecting it is costly. Furthermore, most data mining tasks require that a variety of data mining tools be used in combination. It is therefore desirable for data mining models and mining modules to be integrated and reused in an open environment.
This thesis puts forward a data mining system architecture based on Web Service and PMML. The system integrates well with existing systems and modules, and it supports model exchange and model deployment. Moreover, it is independent of platform and programming language, portable, and flexibly extensible.
First, the thesis analyzes and discusses the three most important data mining languages, and studies the PMML 3.0 specification and the applications of PMML. Second, it introduces background knowledge related to data mining and studies Web Service technology and its applications in data mining systems. It then presents a general data mining architecture that uses Web Service as the platform and PMML as the model description language. Finally, it implements a prototype data mining system that is based on Web Service and PMML and follows the B/S (browser/server) architecture.
In addition, the thesis focuses on the design and implementation of a PMML input and output module, which links mining information with mining models. The input module reads a data mining model in PMML format and extracts the model's information. Conversely, the output module accepts mining model information and outputs a PMML document. The PMML module is designed in a modular way, which keeps it loosely coupled and independent. Its implementation uses XML serialization on the .NET platform, following an object-oriented approach.
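As a rough illustration of the input and output module described above, the following sketch round-trips a small model description through a PMML-style document. It is not the thesis's implementation: the thesis uses XML serialization on .NET, whereas this sketch substitutes Python's standard `xml.etree.ElementTree`. The names `ModelInfo`, `write_pmml`, and `read_pmml` are hypothetical, the namespace URI is assumed to be the one used by PMML 3.0, and the generated document is deliberately reduced (for example, the `TreeModel` element carries no `MiningSchema` or `Node`), so it would not validate against the full schema.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field

# Assumed namespace URI for PMML 3.0 documents.
PMML_NS = "http://www.dmg.org/PMML-3_0"


@dataclass
class ModelInfo:
    """Hypothetical container for the mining information tied to a model."""
    model_name: str
    function_name: str
    # Each entry is (name, optype, dataType), mirroring DataField attributes.
    fields: list = field(default_factory=list)


def _tag(local_name: str) -> str:
    """Build a namespace-qualified tag in ElementTree's {uri}name form."""
    return f"{{{PMML_NS}}}{local_name}"


def write_pmml(info: ModelInfo) -> str:
    """Output side: turn model information into a reduced PMML document."""
    ET.register_namespace("", PMML_NS)  # serialize PMML as the default namespace
    root = ET.Element(_tag("PMML"), {"version": "3.0"})
    ET.SubElement(root, _tag("Header"), {"description": "illustrative sketch"})
    dictionary = ET.SubElement(
        root, _tag("DataDictionary"), {"numberOfFields": str(len(info.fields))}
    )
    for name, optype, data_type in info.fields:
        ET.SubElement(
            dictionary,
            _tag("DataField"),
            {"name": name, "optype": optype, "dataType": data_type},
        )
    # Reduced model element: a real TreeModel also needs MiningSchema and Node.
    ET.SubElement(
        root,
        _tag("TreeModel"),
        {"modelName": info.model_name, "functionName": info.function_name},
    )
    return ET.tostring(root, encoding="unicode")


def read_pmml(text: str) -> ModelInfo:
    """Input side: parse a PMML document and recover the model information."""
    ns = {"p": PMML_NS}
    root = ET.fromstring(text)
    fields = [
        (f.get("name"), f.get("optype"), f.get("dataType"))
        for f in root.findall("p:DataDictionary/p:DataField", ns)
    ]
    model = root.find("p:TreeModel", ns)
    return ModelInfo(model.get("modelName"), model.get("functionName"), fields)
```

Keeping the in-memory description (`ModelInfo`) separate from the XML handling reflects the loosely coupled design the abstract mentions: other parts of a system would depend only on the data class, not on the document format.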
Keywords/Search Tags: Data Mining, Mining Model, Web Service, PMML