Off-chip bandwidth for multicore processors: Managing the next big wall

Posted on:2011-08-06

Degree:Ph.D

Type:Thesis

University:City University of New York

Candidate:Ahsan, Bushra

Full Text:PDF

GTID:2448390002465718

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

As we approach billion transistors on chip, the number of on-chip cores is skyrocketing. With the number of on-chip cores increasing, the traffic generated from these cores is also increasing. Recent studies have shown that this surge of traffic in multicores is bad news for supercomputing design. This is due to off-chip contention amongst applications running on multiple cores.;Traffic in a multicore system is divided into on-chip traffic (traffic amongst cores) and off-chip traffic (traffic from chip to memory). The off-chip traffic is mainly generated by on-chip cache hierarchy and is divided into traffic towards memory, due to writebacks, and from memory, due to read misses. There is a huge body of research on managing cache hierarchies, improving their performance and hence reducing the number of cache misses. Bandwidth requirement has always been of secondary importance. In the multicore and many-core era, this is no longer the case. The cache hierarchy designer must take into account both cache performance and traffic generated by the cache in order not to put pressure on the available bandwidth. If off-chip bandwidth is not managed, a 16 core machine will not give much performance benefit over a dual core machine.;In most processor architectures, the cache hierarchy consists of several private caches per core, followed by a shared Last-Level Cache (LLC). This LLC is the last wall before hitting off-chip and is the cause of off-chip bandwidth traffic i.e. the writebacks. LLC, therefore, is a highly important factor in off-chip traffic generation. We manage the LLC in order to attain overall off-chip bandwidth management in a multicore system. In this thesis various methods to improve bandwidth by reducing traffic towards memory are proposed. We present hardware and hybrid techniques of varying complexities that work in rhyme to manage bandwidth for multicores. All techniques proposed to save bandwidth require very little overhead and reduce off-chip traffic considerably while not effecting overall performance. By bandwidth management we come closer to the ultimate goal of supercomputer on chip.

Keywords/Search Tags:

Bandwidth, Chip, Core, Traffic, Cache, LLC, Performance

PDF Full Text Request

Related items

1	Research Of Implementation Method About Multi-core Cache Consistency On The On-Chip Network
2	Performance Modeling and Characterization of Multicore Computing Systems
3	Design And Implementation Of Cache Core Based On Secure SoC Chip
4	Co-optimization Of On-chip Interconnects And Cache Coherence For Multi/Many-core Systems Based On Multithread Application Characteristics
5	Research On Chip Cache Power Consumption Control Method Of Multicore System
6	Research On Shared Cache Access Fairness For Many-Core Processor
7	Smart Directory Cache For Multi-Many-Core Systems
8	Optimization Techniques Of Cache In Chip MultiThreading
9	Research On Key Technologies For Cache Power And Performance Optimization On Many-core Heterogeneous Architecture
10	Research On Multilevel Cache Technology Of Network On Chip And Realization Of System Interconnection