Font Size: a A A

Research On The Performance Optimization Of Data Processing And Query Of Audit Data Center

Posted on:2017-05-26Degree:MasterType:Thesis
Country:ChinaCandidate:Q X WangFull Text:PDF
GTID:2348330518970807Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
The application of computer technology in the field of audit is becoming more and more extensive.Audit work needs the data provided by each auditee,but the data provided by them is heterogeneous.Taking a network audit system of a province as an example,the data needs to be preprocessed before entering the data center.But the data processing is time-consuming,and the network audit staff can hardly carry out audit work based on the latest data.This leads to the weak timeliness of the network audit system,and the speed to generate audit doubt is slow.This thesis focuses on the performance optimization of data preprocessing and query,the two aspects of audit data center,with the actual situation of the performance of the construction and application of the audit data center.Preprocessing aspects,in the face of the big audit data,the pre-processing cluster is constructed for concurrent processing.In order to meet the high requirements of the timeliness of the network audit platform,this thesis establishes an index system for evaluating the degree of importance of audit methods and uses the index system to calculate the degree of importance of audit methods.Then according to the degree of importance of audit methods to process the required data by the way of the priority and relevance.When the processing node is being assigned,the data processing tasks are assigned to keep the expected execution time of each node as balanced as possible,then the overall completion time of data processing tasks is reduced.So as to advance the start time and end time based on the latest data audit and enhance the timeliness of the network audit platform and improve audit efficiency.In the end,a new data processing scheduling algorithm to enhance the audit timeliness is proposed.About query optimization,use relational algebra optimization rules to rewrite the audit method and use DB2 index access mechanism to optimize the audit database,based on the characteristics of audit data.Finally,the design of the algorithm and the program was carried out in the audit data center,and the performance of data processing and query of audit data center has been improved significantly.
Keywords/Search Tags:the timeliness of audit, cluster, scheduling, query optimization
PDF Full Text Request
Related items