Font Size: a A A

Research On Algorithms Of Finding Rules Applied To Industry Audit

Posted on:2006-09-16Degree:DoctorType:Dissertation
Country:ChinaCandidate:G ChenFull Text:PDF
GTID:1118360212482667Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Recently, our governments attach importance to audit and ask the department of audit to strengthen to supervise the important state capitals for preventing the risks of economic. It is agreat challenge, however, to rapidly mine the useful information on audit from a vary large database system. It enforces us to find more effective auditing theories, methods and technologies. We attempt to combine the theory of industry audit and the technology of data mining to mine association pattern and industry audit assumptions from the databases of enterprises in the same industry, and then to find out the audit risks behind data. So, the paper is more useful in research and application.The main contribution of the paper are listed as follows:(1) According to the demand of industry audit, the paper presents the architecture of a data mining system AuditMiner based on distributed database environment, in which the task of mining association rules is completed together by global site and local sites.(2) Proposed an binary system based method B-Gen to generate candidate frequent itemsets and corresponding supporting counts efficiently, which needs only some operations such as"and","or"and"xor". Applying this idea in the existed association mining algorithm Apriori, FUP and FDM, the corresponding improved algorithm BApriori, BFUP and BFDM is proposed.. The theoretical analysis and experiment testify that they are effective and efficient..(3) Considering that more and more attention have been payed to the problem of association rule mining in large data set, distributed association mining is a effective method to solve this problem. The paper proposes an algorithm of distributed association mining algorithm GFDA based on the distributed architecture of the data.(4) Based on the FUP algorithm, the paper proposes several conceptions including backup support threshold, minor frequent candidates set and upper bound of support count, then presents an improved algorithm IFUP. Furthermore, incremental association rule mining in distributed environment are considered, algorithms LUDA, GUDA, LIDA2 and GUDA2 are proposed to solve this problem.(5) Propose an algorithm to mining abnormal transactions by Benford law. Present a concept of difference to compare association from abnormal transactions with global association rules for extracting more interesting rules from global association rules.(6) Develop a prototype system AuditMiner for mining distributed association rules from the customs'database system by industry audit. The algorithms presented in the paper are tested to be effective and efficient.
Keywords/Search Tags:industry audit, data mining, association rule, distributed association rule, association rules updating
PDF Full Text Request
Related items