Font Size: a A A

Study And Implementation Of The Mining System Of Distributed Association Rule Based On CORBA

Posted on:2006-07-08Degree:MasterType:Thesis
Country:ChinaCandidate:Q LiuFull Text:PDF
GTID:2168360155960014Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Data mining acquires knowledge and rules that is connotative, unknown and having potential value for decision-making from large databases or data warehouses. It is the result that combines artificial intelligence and database. At present, data mining is one of the most advanced research direction in the field of database and information decision.Association rule mining is an active data mining research area and applys more widely than other methods. In the dissertation, it introduces the basic concepts, characters and famous algorithms of association rule mining in detail. Existing ARM algorithms and modules cater to a centralized environment, such as database or data warehouse. With the development of distributed database and network technology, they don' t meet the needs of mining rules from distributed data sets. The interest in distributed association rule mining arises from this situation.The dissertation introduces and analyses the algorithms of distributed association rule mining, especially FDM (Fast Distributed mining of association rule) algorithm. In distributed data environment, frequent itemsets computing and the costs of communication are the bottlenecks of algorithms. From the point of view, the dissertation proposes practical solutions to these problems. The applications of transactions pruning and a new data structure for storage of the candidate sets help to compute the support counts of candidate sets fast and reduce the cost of scanning database. Adding a site as data mining server is proposed to gather, compute and broadcast the result of each sites, the candidate sets pruning, control synchronization of the whole mining process and so on. It reduces the cost of communication in network. With the improvement of FDM algorithm, the technology of distributed object is introduced. A new architecture for distributed association rule mining based on the criterion of CORBA is proposed. The overall design of the system is given and the pivotal techniques...
Keywords/Search Tags:Data Mining, Distributed Database, Association Rules, FDM Algorithm, the criterion of CORBA
PDF Full Text Request
Related items