Virtual clusters: Resource management on large shared-memory multiprocessors

Posted on:2002-02-11

Degree:Ph.D

Type:Thesis

University:Stanford University

Candidate:Govil, Kinshuk

Full Text:PDF

GTID:2468390011498609

Subject:Computer Science

Abstract/Summary:

Despite the fact that large scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all of their features is still not available. These machines require system software that is scalable, supports fault containment, and provides scalable resource management. Software supporting these features is currently unavailable, mostly due to the complexity and cost of making the required changes to the operating system. One proposed alternative is to partition the hardware into small units; however, hardware partitioning limits resource sharing flexibility, rendering the system unable to adapt to dynamically changing workloads.; Virtual Clusters, an alternative approach using virtual machine technology, provides all the features necessary to support large shared-memory multiprocessors at only a small fraction of the development cost of modifying the operating system. This approach effectively turns a large scale shared-memory multiprocessor into a virtual cluster that supports fault containment and heterogeneity, while avoiding operating system scalability bottlenecks. At the same time, Virtual Clusters preserve the benefits of the underlying shared-memory multiprocessor by implementing dynamic, fine-grained resource sharing, and by allowing users to overcommit resources such as processors and memory.; In this thesis, we describe the resource management aspect of the Virtual Clusters approach, which requires a scalable resource manager that makes local decisions with limited information while still providing good global performance. The resource manager must also be aware of issues related to fault containment and virtual machines in order to support fault containment and provide good performance. We describe our experience with a prototype implementation on a 32-processor SGI Origin 2000 system. We show that execution time penalties for this approach are low, typically within 10% of the best available commercial operating system for most workloads, and that it can manage the CPU and memory resources of the machine significantly better than the hardware partitioning approach.

Keywords/Search Tags:

Resource, Shared-memory, Virtual clusters, Large, System, Approach, Fault containment

Related items

1	Research Of Fault Containment And Checkpointing Technology For Shared-Memory Multiprocessor
2	Dynamic data replication: An approach to providing fault-tolerant shared memory clusters
3	Research On High Speed Networks Of Shared Virtual Memory Clusters
4	Cables: Thread and memory extensions to support a single shared virtual memory cluster image
5	Naplus: A Software Shared Memory For Virtual Clusters
6	Architectural Support for Large-scale Shared Memory System
7	Based On The Smp Cluster Virtual Shared Storage System
8	Shared Virtual Memory Systems Based On SCI
9	The Implementation And Performance Of User-level Communication Protocol For Shared Memory Clusters
10	Studies On Shared-Memory Management And Optimization Technologies In Parallel And Distributed Operating Systems