Efficiently and Transparently Maintaining High SIMD Occupancy in the Presence of Wavefront Irregularit

Posted on:2018-02-22

Degree:Ph.D

Type:Dissertation

University:Washington University in St. Louis

Candidate:Cole, Stephen V

Full Text:PDF

GTID:1448390002496019

Subject:Computer Science

Abstract/Summary:

Demand is increasing for high throughput processing of irregular streaming applications; examples of such applications from scientific and engineering domains include biological sequence alignment, network packet filtering, automated face detection, and big graph algorithms. With wide SIMD, lightweight threads, and low-cost thread-context switching, wide-SIMD architectures such as GPUs allow considerable flexibility in the way application work is assigned to threads. However, irregular applications are challenging to map efficiently onto wide SIMD because data-dependent filtering or replication of items creates an unpredictable data wavefront of items ready for further processing. Straightforward implementations of irregular applications on a wide-SIMD architecture are prone to load imbalance and reduced occupancy, while more sophisticated implementations require advanced use of parallel GPU operations to redistribute work efficiently among threads.;This dissertation will present strategies for addressing the performance challenges of wavefront-irregular applications on wide-SIMD architectures. These strategies are embodied in a developer framework called Mercator that (1) allows developers to map irregular applications onto GPUs according to the streaming paradigm while abstracting from low-level data movement and (2) includes generalized techniques for transparently overcoming the obstacles to high throughput presented by wavefront-irregular applications on a GPU. Mercator forms the centerpiece of this dissertation, and we present its motivation, performance model, implementation, and extensions in this work.

Keywords/Search Tags:

Irregular, SIMD, Applications, Efficiently

Related items

1	Research On Auto-Vectorization Compiling Techniques Oriented To Irregular Applications On SIMD Extension
2	Supporting Applications Involving Irregular Accesses and Recursive Control Flow on Emerging Parallel Environments
3	Research On Resources Scheduling For Irregular Applications On Graphics Processing Units
4	Supporting Applications Involving Dynamic Data Structures and Irregular Memory Access on Emerging Parallel Platforms
5	Compiling several classes of irregular applications on multithreaded architectures
6	Research On SIMD Vectorization And Optimization Of Non-Multimedia Applications
7	The High Resolution Spaceborne SAR Simulation Based On Heterogeneous SIMD
8	Locality transformations for adaptive irregular applications
9	Irregular subdivision and its applications
10	ILP-SIMD: An instruction parallel SIMD architecture with short -wire interconnects