Font Size: a A A

Research On Resources Scheduling For Irregular Applications On Graphics Processing Units

Posted on:2014-11-01Degree:DoctorType:Dissertation
Country:ChinaCandidate:S MuFull Text:PDF
GTID:1228330452453597Subject:Electronic Science and Technology
Abstract/Summary:PDF Full Text Request
Recently, GPUs (Graphics Processing Units) have been widely adopted in manyscientific and engineering applications, such as graphic and image processing, scientificcomputing, multi-media applications, data mining, financial computing and so on.GPUs are inherently suitable for regular applications as it follows the SIMD (SingleInstruction Multiple Data) execution model. However, the irregular patterns that arepervasive in computation and memory operations have become the performancebottleneck of GPU applications. Such irregular patterns as unbalance workloads,divergent control flow, irregular memory access and poor data locality are exhibited inalmost all aspects of computer architecture design. Therefore, it is critical to minimizethe overhead of processing such irregular patterns for better performance. This workaims at solving the abovementioned obstacles from the perspectives of both designingefficient algorithms and optimizing micro-architectures. The contributions of this thesisare as follows:(1) We analyze and optimize three irregular applications: sparse matrix vectorproduct (SMVP), string matching and QR decomposition. For SMVP, a technique isproposed to eliminate irregular memory accesses by expanding the vector. For stingmatching, we devise two efficient techniques, data partitioning and data reordering, tosolve the irregular computation and memory access patterns simultaneously. For QRdecomposition, we exploit the pipelined parallelism by considering the inherent datadependence. Our techniques exhibit superior performance improvement, with anaverage speed-up over the CPU implementation by one order of magnitude.(2) We conduct a systematical analysis on the characteristics of GPU programs.Our analysis proves that the irregular patterns cause low utilization of GPU resources.On one hand, the unbalanced memory access latency introduced by the memory latencydivergence result in the under-utilization multiprocessors. On the other hand, currentcache management cannot adapt to the complex memory access patterns. Therefore,GPU programs cannot fully exploit the cache resources.(3) We develop a cache management policy called Effective Address BasedPriority and a memory scheduling policy called Divergence Aware Memory Scheduling, respectively. These two microarchitecture techniques can improve the cache efficiencyand reduce the impact of memory latency divergence concurrently. Experimental resultsshow that the cache miss rate can be reduced by20%and system performance can beimproved by30%.(4) For the unbalanced task workloads existed in streaming processing, we developa dynamic resources scheduling policy. Under such a policy, the workloads of each taskare monitored and the amount of data transferred between different tasks will becalculated. Therefore, the computation and cache resources can be allocated to each taskin a dynamically tuned manner. Experimental results show that our dynamic schedulingpolicy can improve the system performance by20%compared to current GPUpreemptive scheduling.
Keywords/Search Tags:GPU, SIMD, Irregular, Cache Management, Memory Scheduling
PDF Full Text Request
Related items