Font Size: a A A

The Research And Implementation Of Distributed Data Mining Model Based On Globus

Posted on:2009-06-30Degree:MasterType:Thesis
Country:ChinaCandidate:S L TaoFull Text:PDF
GTID:2178360245986075Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
All things are constantly changing and developing, computer application model with the development of enterprise applications are constantly changing and developing too. Computer application model in nearly 50 years of development and changes, has experienced from centralized to distributed models. With the presence of Grid technology, computer application model become distributed again. With the development of information technology, the data produced daily by various departments within the enterprise is increasing dramatically. Explosive growth of data in the enterprise not only brings opportunities but it also brings challenges, and how to discover knowledge and how to effectively discover knowledge from these massive data is a big challenge in today's information society. The traditional centralized data mining approach to some extent, can solve a number of issues brought about by data distribution, but when faced with a mass of data the traditional way of data mining is increasingly unable to meet people's needs. Grid technology brings new opportunities to the distributed data mining. This article mainly focused on Distributed Data Mining based on Globus environment. The first problem of DDM wants to slove is the rational matching between data resources and computation resources, in order to archive a good performance. The traditional model of distributed data mining-data transfer model and code transfer model, despite their different advantages, but did not solve the matching between data resources and computation resources, they can not performance task optimization. This article presents the PDS model(Policy , task Dispatching and Scheduling based DDM model, PDS Modle) combines the advantages of data transfer model and code transfer model, and apply minimum response time as a distributed data mining tasks allocation strategy. PDS model can assign task optimization based on multiple data sets DDM. The article also presented a prediction method of DDM minimum response time model.GS model is based on the Globus Grid Service, and it is a simplified model of PDS. GS model is a way of using SOA, it packs all function of distributed data mining services to a form of Grid Service, and allow the customer to call these services. In Chapter 5 , the author developed a model of GS.
Keywords/Search Tags:grid, ddm, gt4, pds model, gs model
PDF Full Text Request
Related items