shared resource monitoring and throughput optimization in cloud-computing datacenters
DESCRIPTION
Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters. By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and Sadagopan Srinivasan. Abstract. Datacenters employ server consolidation to maximize the efficiency of platform resource usage. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/1.jpg)
Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and Sadagopan Srinivasan
![Page 2: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/2.jpg)
Abstract• Datacenters employ server consolidation to
maximize the efficiency of platform resource usage.
• Impacts on their performance.• Focus: Use of shared resource monitoring to
Understand the resource usage. Collect resource usage and performance. Migrate VMs that are resource-constrained.
• Result : To improve overall datacenter throughput and improve Quality of Service (QoS)
![Page 3: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/3.jpg)
Focus• Monitor and address shared cache contention.• Propose a new optimization metric that captures
the priority of the VM and the overall weighted throughput of the datacenter
• Conduct detailed experiments emulating data center scenarios including on-line transaction processing workloads.
• Results: Monitoring shared resource contention is
highly beneficial to better manage throughput and QoS in a cloud-computing datacenter environment.
![Page 4: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/4.jpg)
Keyword Benchmark- TPCC, SPECjAppServer, SPECjbb, PARSEC Virtualization LLC – Last Level Cache Shared-cache CMP – Chip Multiprocessing Cache Contention Virtual Platform Architecture MPI – Misses Per Instruction IPC – Instruction Per Cycle
![Page 5: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/5.jpg)
Outline•Introduction•Background and Motivation•Proposed Approach•Simulation•Related Work •Summary and Conclusions.
![Page 6: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/6.jpg)
Introduction Evolved data center with large number of heterogeneous
applications running within virtual machines on each Platform. Vsphere
Service Level Agreements.
Key Aspects: Shared Resource Monitoring VM Migration QoS and Datacenter Throughput
![Page 7: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/7.jpg)
Contribution A simple methodology of using cache occupancy for shared
cache environment. New optimization metric that captures QoS as part of the
throughput measure of the datacenter. Detailed Experiments emulating data center scenario resulting
in improvement in QoS and throughput. Work is unique as it addresses application/VM scheduling in
the context of SLAs. Management of the shared cache occupancy. Focus on shared cache contention which has first-order impact
on performance. LLC monitoring.
![Page 8: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/8.jpg)
Typical Datacenter Platform and VM Usage
![Page 9: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/9.jpg)
• Cloud-computing virtualized datacenters of the future will have machines that are based on CMP architecture with multiple cores sharing the same LLC.
• Measured the performance of Intel’s latest Core2 Duo platform when running all 26applications (in Windows XP) from the SPEC CPU2000 benchmark suite individually and in pair-wise mode .
Background and Motivation
![Page 10: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/10.jpg)
Impact of Cache/Memory Contention
![Page 11: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/11.jpg)
Cache sensitivity of Server Workloads
![Page 12: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/12.jpg)
TPCC performance while co-running with other workloads on same shared LLC
![Page 13: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/13.jpg)
Proposed MIMe Approach
•Key components :▫Mechanism used to monitor VM resource usage
and identifying VMs that suffer due to resource contention.
▫Techniques used to identify candidate VMs for migration based on priorities and their behavior to achieve improved weighted throughput and determinism across priorities.
▫A metric that quantifies the goodness/efficiency of the datacenter as weighted throughput measure.
![Page 14: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/14.jpg)
MIMe Key components to improve the efficiency of datacenter weighted throughput
![Page 15: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/15.jpg)
Monitoring resource usage •VPA architecture •VPAID
![Page 16: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/16.jpg)
IPC sensitivity for TPCC
![Page 17: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/17.jpg)
Identifying VM candidates for migration
•Two key factors :
▫VMs priority as agreed upon by an SLA
▫Behavior. E.g. cache sensitivity .
•Example scenarios wherein an application like TPCC can exhibit a huge variation in performance depending on co-scheduled application.
![Page 18: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/18.jpg)
The basic algorithm to identify a candidate VM for migration
![Page 19: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/19.jpg)
Goal
• No VM of interest that has a higher priority but runs less efficiently than a VM of lower priority after the migrations.
• The whole process is cyclic which ensures that workload phases change or changes in SLAs with customers can be addressed with ease
![Page 20: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/20.jpg)
Metric to quantify the efficiency of a datacenter
• Measure - Total System IPC.• Benchmarking propose a Vconsolidate concept using
weights associated to workload performance.Weighted normalized performance metric
• Our New metric that would incorporate the QoS value as part of the throughput measure :
QoS-Weighted throughput performance metric
![Page 21: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/21.jpg)
RESULTS AND ANALYSIS
• Simulation based methodology that uses CMPSched$im - Parallel multi-core performance simulator.
• Utilizes the Pin binary instrumentation system to evaluate the performance of single-threaded, multi-threaded, and multi-programmed workloads on a single/multi-core processor .
• Dynamically feeds instructions and memory references to the simulator.
• Modified to be used as a trace-driven simulator. • Server workload traces for TPCC, Specjbb, SPECjAppServer,
indexing workload and parsec .• Result:In the absence of any type of enforcement mechanisms being available in the hardware to control the cache occupancy, we have to rely only on monitoring information to make scheduling decisions.
![Page 22: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/22.jpg)
TPCC IPC and Occupancy with QoS values
![Page 23: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/23.jpg)
After Migration TPCC IPC and Occupancy with QoS values
![Page 24: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/24.jpg)
Effect of minimizing contention for HP applications
![Page 25: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/25.jpg)
Mean IPC after VM migration for reducing cache contention for HP applications
![Page 26: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/26.jpg)
Mean IPC after VM migration for reducing cache contention for HP applications
![Page 27: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/27.jpg)
Experiment Result
•Logically clustering identical machines together, then applying the migration policy.
•The overall scrore increases 8% for TPCC workloads.
•With SjappServer workloads the increase is 4.5%
![Page 28: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/28.jpg)
RELATED WORK
• All other study have focused on a single machine not virtualized environments.
• Recently, a few studies like - Cherkasova and Enright Jerger and have focused on sharing in caches and for better scheduling policies.
• We show how identical machines can be logically clustered, and based on VPA monitoring how higher priority applications that we care about are always guaranteed to get more platform resources (cache) than lower priority applications.
• We also propose a new metric that incorporates QoS as the throughput measure
![Page 29: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/29.jpg)
CONCLUSION
• Problem of contention in the shared cache is a critical problem in virtualized cloud computing data centers.
• High priority applications can suffer if scheduling at a data center level is not done with cache contention in mind.
• How it can be solved without waiting for enforcement mechanisms to be available in the shared LLC .
• A very simple solution based on a VPA architecture
![Page 30: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/30.jpg)
Future Work•Incorporating memory bandwidth as part of
the VPA architecture.
•Scheduling optimizations.
•Profiling of VMs to take scheduling decisions.
•Monitoring and Enforcement for cache, memory bandwidth and also power can be very efficiently used
![Page 31: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters](https://reader033.vdocuments.us/reader033/viewer/2022051518/56813a9d550346895da29890/html5/thumbnails/31.jpg)
THANK YOU !!