
The Applied Research Of Data Mining Technology In Distributed Selective Information Assembly Process Based On Grid Technology

Posted on: 2006-07-17
Degree: Master
Type: Thesis
Country: China
Candidate: Q X Zhu
Full Text: PDF
GTID: 2168360152489039
Subject: Computer application technology
Abstract/Summary:
With the advent and development of grid technology, application research based on the grid environment is on the increase. Regarded as a third-generation network technology, the grid is attracting growing attention and has important prospects for scientific research and application. Scholars and business organizations have begun to develop grid applications actively. As network resources continue to expand, web search and mining has become one of the basic computing techniques. The resources in future grid environments will be richer in content and more varied in form than they are now, so research into ways of searching and mining grid resources is important.

Following a technical analysis, this paper studies and designs a resource search and mining system for the grid environment that draws on recent grid search work and existing web search technologies. The system is designed for the Linux operating system and adopts a hierarchical structure in which each level provides a different function. It includes three main levels: the Globus container, the grid spider service, and storage. GT3, on which the system is built, is an open source software toolkit for building grids. The toolkit includes software services and libraries, packaged as a set of components that can be used either independently or together to develop applications. The grid spider service is developed on top of the GT3 container. The service can be deployed to more than one server at the same time, and the deployed instances respond to requests together. Finally, the service stores all data in the storage area across the network. As the storage area of the grid, the storage level can be a database, a network file system, or a distributed file system; this system uses the MySQL database.

Based on current grid technology research, this paper largely completes the grid spider service, which shows broad prospects for grid application and extension on a LAN and lays the foundation for applying data mining technology on the grid.
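The layering described above (a spider service that gathers resources and writes them to a separate storage level) can be illustrated with a minimal sketch. It is not the thesis implementation: it does not use GT3 or the Globus container, Python's built-in sqlite3 stands in for the MySQL database, and the names LinkExtractor, Storage, SpiderService, and the injected fetch callable are all hypothetical.

```python
import sqlite3
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href targets of <a> tags from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


class Storage:
    """Storage level. Assumption: sqlite3 stands in for the MySQL database."""

    def __init__(self, path=":memory:"):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, body TEXT)"
        )

    def save(self, url, body):
        self.conn.execute(
            "INSERT OR REPLACE INTO pages (url, body) VALUES (?, ?)", (url, body)
        )
        self.conn.commit()

    def urls(self):
        rows = self.conn.execute("SELECT url FROM pages ORDER BY url")
        return [row[0] for row in rows]


class SpiderService:
    """Spider level: breadth-first crawl from a seed, writing pages to storage."""

    def __init__(self, fetch, storage):
        # fetch is a hypothetical callable: url -> HTML string, or None if missing
        self.fetch = fetch
        self.storage = storage

    def crawl(self, seed, max_pages=100):
        queue, seen = deque([seed]), {seed}
        stored = 0
        while queue and stored < max_pages:
            url = queue.popleft()
            body = self.fetch(url)
            if body is None:
                continue
            self.storage.save(url, body)
            stored += 1
            parser = LinkExtractor()
            parser.feed(body)
            for href in parser.links:
                target = urljoin(url, href)  # resolve relative links
                if target not in seen:
                    seen.add(target)
                    queue.append(target)
        return stored
```

Because the fetch function is injected, the sketch can be exercised against an in-memory dictionary of pages instead of a live network. Keeping storage behind its own class mirrors the abstract's point that the storage level could equally be a database or a file system.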
Keywords/Search Tags:Grid, Data Mining, Resource Search, OGSA, GT3, Spider