Automatic software performance optimization on modern architectures

Posted on:2008-07-15

Degree:Ph.D

Type:Dissertation

University:University of Illinois at Urbana-Champaign

Candidate:Jiang, Changhao

Full Text:PDF

GTID:1458390005481012

Subject:Computer Science

Abstract/Summary:

This dissertation extends automatic library generation methodology to an emerging untraditional computer architectures and to a complex application domain. Specifically, it consists of two parts of work: First, it implements an automatic matrix multiply library generator for graphics hardware---a specialized architecture with enormous computing power for graphics applications. Second, it uses machine learning techniques to automatically select the best algorithm for frequent pattern mining problems according to input characteristics.; The automatic matrix multiplication tuning system uses a parameterized code generator to generate multiple versions of matrix multiplication, whose performances are empirically evaluated by actual execution on the target platform. An ad-hoc search engine is employed to search over the implementation space for the version that yields the best performance. In contrast to similar systems on CPUs, which utilize cache blocking, register tiling, instruction scheduling tuning strategies, it identifies and exploits several tuning strategies that are unique for graphics hardware. These tuning strategies include optimizing for multiple-render-targets, SIMD instructions with data packing, overcoming limitations on instruction count and dynamic branch instruction. The generated implementations have comparable performance with expert manually tuned version in spite of the significant overhead incurred due to the use of the high-level BrookGPU language.; Frequent pattern mining is a fundamental problem in data mining and a large number of distinct algorithms have been proposed to solve it efficiently. However, no single algorithm outperforms all the others since their relative performance highly depends on the characteristics of the input data. In the dissertation, we present a machine learning based approach to select the best frequent pattern mining algorithm based on the input characteristics. Three of the fastest publicly available algorithms, FP_Growth, LCM and Eclat, were extensively evaluated using synthetic data sets. The results of these evaluations were used to train a support-vector machine (SVM) prediction system, which is then used at runtime to predict the best mining algorithm for real-world data sets. Our experiments show that the runtime prediction overhead is negligible and that the trained SVM prediction system usually identifies the best algorithm. In case of misprediction, the selected algorithm is still competitive in performance.

Keywords/Search Tags:

Performance, Automatic, Algorithm, Frequent pattern mining

Related items

1	The Research And Relization Of Mining Frequent Patterns On Business Data Straems
2	Optimization Of Frequent Pattern Mining Algorithm Based On Persistent Memory
3	Research And Application Of Frequent Pattern Mining Algorithm Based On Tissue-like P System
4	The Research On The Related Problems Of Association Rule Mining
5	Study On Frequent Pattern Mining Algorithms And Pruning Strategies
6	A Study On Algorithms Of Weighted Frequent Pattern Mining
7	Constraint-Based Frequent Pattern Mining:Novel Applications And New Techniques
8	Research On Parallel Frequent Graph Pattern Mining
9	Research On Website Optimization Strategy Based On Frequent Pattern Mining
10	A Study On Weighted Frequent Pattern Mining Algorithms