Analyzing and managing shared cache in Chip Multi-Processors

Posted on:2009-12-26

Degree:Ph.D

Type:Dissertation

University:North Carolina State University

Candidate:Guo, Fei

GTID:1448390005456486

Subject:Engineering

Abstract/Summary:

Recently, Chip Multi-Processor (CMP), or multicore, design has become the mainstream architecture choice for major microprocessor makers. In a CMP architecture, some important on-chip platform resources are shared by all the processor cores. As will be shown in this dissertation, resource sharing may lead to low throughput for applications that fail to acquire sufficient resources to make good progress. In addition, resource sharing may also lead to a large performance variation for an individual application. Such performance variation is ill-suited for future uses of CMPs in which many applications may require a certain level of performance guarantee, which we refer to as performance Quality of Service (QoS). In this dissertation, we address the resource sharing problem from two aspects.

First, we propose an analytical model and several heuristic models that encapsulate and predict the impact of cache sharing. The models differ in complexity and prediction accuracy. We validate the models against a cycle-accurate simulation. The most accurate model achieves an average error of 3.9%. Through a case study, we found that the impact of cache sharing is largely affected by the temporal reuse behaviors of the co-scheduled applications.

Second, we investigate a framework for providing performance Quality of Service in a CMP server. We found that the ability of a CMP to partition platform resources is not, by itself, sufficient for fully providing QoS. We also need an appropriate way to specify a QoS target, and an admission control policy that accepts jobs only when their QoS targets can be satisfied. We also found that providing strict QoS often leads to a significant reduction in throughput due to resource fragmentation. We propose novel throughput optimization techniques that include (1) exploiting various QoS execution modes and (2) resource stealing. Through simulation, we found that, compared to an unoptimized scheme, throughput can be improved by up to 45%, bringing it significantly closer to that of a non-QoS CMP.
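
The admission control idea can be illustrated with a minimal sketch. It is not the dissertation's framework: it assumes, purely for illustration, that a QoS target is expressed as a number of reserved cache ways plus one dedicated core, and the names `Job`, `AdmissionController`, and `try_admit` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Job:
    name: str
    cache_ways: int  # assumed QoS target: cache ways reserved for this job


@dataclass
class AdmissionController:
    """Hypothetical admission control over cores and shared-cache ways."""
    total_cores: int
    total_cache_ways: int
    admitted: list = field(default_factory=list)

    def free_cores(self) -> int:
        # Each admitted job is assumed to occupy exactly one core.
        return self.total_cores - len(self.admitted)

    def free_cache_ways(self) -> int:
        return self.total_cache_ways - sum(j.cache_ways for j in self.admitted)

    def try_admit(self, job: Job) -> bool:
        """Accept the job only if a core and its requested ways are both free."""
        if self.free_cores() < 1 or job.cache_ways > self.free_cache_ways():
            return False
        self.admitted.append(job)
        return True


# Three jobs each request 7 of the 16 cache ways on a 4-core CMP.
controller = AdmissionController(total_cores=4, total_cache_ways=16)
results = [controller.try_admit(Job(n, w)) for n, w in [("a", 7), ("b", 7), ("c", 7)]]
```

In this toy run the third job is rejected even though two cores and two cache ways remain idle, which is one simple way to picture the resource fragmentation that strict QoS can cause.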

Keywords/Search Tags:

CMP, QoS, Throughput, Cache

Related items

1	A Study And Optimization Of Cache Level In Collaborative Computing Platform
2	Wireless Cache Design For Physical Layer Security
3	Technology impacts of CMOS scaling on microprocessor core design for hard-fault tolerance in single-core applications and optimized throughput in throughput-oriented chip multiprocessors
4	The Optimization Management Strategy Based On Multi-processor Sharing Cache
5	Design And Implementation Of Distributed Cache Management System For In-memory Columnar Database
6	Study On Cache Partition Optimization Based On Non-stacked Cache Replacement Algorithm
7	A generic system simulator with novel on-chip cache and throughput models for gigascale integration
8	Application Research Of Data Cache Technology In MIS
9	Classification-based Prefetch-Aware Cache Partition Mechanism
10	Research And Implementation Of Cache Technology Based On WWW