Improving cache locality for thread-level speculation systems

Posted on:2006-02-09

Degree:M.A.Sc

Type:Thesis

University:University of Toronto (Canada)

Candidate:Fung, Stanley Lap Chiu

Full Text:PDF

GTID:2458390005497865

Subject:Engineering

Abstract/Summary:

With the advent of chip-multiprocessors (CMPs), Thread-Level Speculation (TLS) remains a promising technique for exploiting this highly multithreaded hardware to improve the performance of an individual program. However, with such speculatively-parallel execution the cache locality once enjoyed by the original uniprocessor execution is significantly disrupted: for TLS execution on a four-processor CMP, we find that the data-cache miss rates are nearly four-times those of the uniprocessor case, even though TLS execution utilizes four private data caches.; We break down the TLS cache locality problem into instruction and data cache, execution stages, and parallel access patterns, and propose methods to improve cache locality in each of these areas. We find that for parallel regions across 13 SPECint applications our simple and low-cost techniques reduce data-cache misses by 38.2%, improve performance by 12.8%, and significantly improve scalability---further enhancing the feasibility of TLS as a way to capitalize on future CMPs.

Keywords/Search Tags:

TLS, Cache locality, Improve

Related items

1	Model Driven Cache Management
2	A higher order theory of locality and its application in multicore cache management
3	Hardware techniques to improve cache efficiency
4	Resource Allocation And Structure Design For Bottom Level Caches
5	The Application Of Cache Dynamic Tuning Method In Riad System
6	Research And Implementation Of Cache Optimization Techologies Based On Group For RAID
7	Research And Implement On Three-level Cache In RAID
8	Design and evaluation of an adaptive write buffer cache for solid state drive
9	Research On In-network Cache Resolution Scheme In Information-Centric Networking
10	An Efficient Cache System For Hybrid Memory