Font Size: a A A

CNA:a Performance Optimization System For Multi-core NUMA Architecture In Virtualized Environment

Posted on:2015-02-13Degree:MasterType:Thesis
Country:ChinaCandidate:L L LiFull Text:PDF
GTID:2268330425986447Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the development and popularity of virtualization technology and multi-core NUMA architecture, the multi-core NUMA architecture commercial server based on virtualization has increasingly been widespread used. At the same time, the performance requirements of industry for virtualized environments multicore NUMA architecture servers are increasingly high. On one hand, the system’s default scheduler in the virtual environment of NUMA architecture does not get good performance, on the other hand, the traditional NUMA architecture becomes more complex due to the introduction of virtualization technology and the existing thread-based scheduling optimization algorithms can not obtain satisfactory performance.In this context, this paper designs and implements a virtualized environment for multi-core NUMA architecture to optimize system performance based on Cache-Aware and NUMA-Aware (CNA). Through real-time monitoring virtual machine threads and NUMA hardware performance, the CNA system can analysis the characteristics of VCPU thread and to achieve wiser scheduling to improve system performance.Work done in this article is mainly reflected in the following aspects:(1) VCPU thread classification;In this paper, the CNA system bases on VCPU threads’ LLC miss rate to classify them into3classes, and then we use double exponential smoothing algorithm to make the update process of class to be more stable and reliable. Compare with the system’s default scheduler, this way better take the characteristics of VCPU thread into account. In this way, the CNA system can make more accurate scheduling decisions.(2) Proposing VCPU thread to Node binding;In design of CNA_VCPU scheduling algorithm, the author proposes VCPU threads to bind Node. The reason is that this strategy can guarantee scheduling algorithm to achieve load balancing system, it can also be built by the system scheduler node to the CPU core scheduling, which not only improves the flexibility of scheduling algorithm, but also improves the scheduling efficiency. Scheduling algorithm in making scheduling decisions VCPU thread priority will be placed on the current Node, which can reduce system overhead VCPU thread scheduling.(3) For each thread, introduces the concept of Home Node;Introducing the concept of Home Node for each thread, we can increase share of LLC resources and reduce inter-thread LLC competition as well as improve system performance.(4) Designing and implementing a detailed test program. In this paper, we design and implement a comprehensive testing program to verify the CNA system to a virtual machine environment of multicore NUMA systems have a significant performance boost.
Keywords/Search Tags:Multi-core NUMA, Virtualization, Performance Improvement, LLC
PDF Full Text Request
Related items