Page 1: VO-Ganglia  Grid Simulator

VO-Ganglia Grid Simulator

Catalin Dumitrescu, Mike Wilde, Ian Foster

Computer Science Department
The University of Chicago

Page 2: VO-Ganglia  Grid Simulator

Talk Overview

➢Part I: The Grid-enabled Monitoring Tool

➢Part II: From Monitoring to Simulation

➢Part III: Features / Extended Model

➢Shortcomings

➢Future Work / Conclusions

Page 3: VO-Ganglia  Grid Simulator

VO-Ganglia / Grid-enabled Mon

➢P2P Reporting
  ✗implicit hierarchic infrastructures

➢Interface with Other Monitoring Tools
  ✗Nagios, MDS 2

➢Grid/Globus Specific Metrics
  ✗Gatekeeper Information / Cluster RM Status

➢Per VO Monitoring Support
  ✗Collected metrics were aggregated and VO specific as well

➢Resource Management
  ➢Preference Specifications
  ➢Usage Policy Enforcement

Page 4: VO-Ganglia  Grid Simulator

Best Snapshot (1)

Page 5: VO-Ganglia  Grid Simulator

Best Snapshot (2)

Page 6: VO-Ganglia  Grid Simulator

Why Continue on This Path?

➢Implemented Ideas
  ●VO-based Metric Reporting
  ●Usage Policy Metric Incorporation
  ●Distributed Infrastructure for Usage Policy

➢Time Spent on Development
  ●Enhanced Monitoring ~ 3 months
  ●Policy ~ 6 months
  ●Simulator ~ 3 months

➢Are There Other Alternatives Around?
  ➢MonALISA
  ➢Standard Ganglia

Page 7: VO-Ganglia  Grid Simulator

From Monitoring to Simulation

➢Acceptable Grid Testbeds Are Not Always Easy to Find

➢Deployment Takes Time

➢Computing Time Is an Issue in Production Environments

➢What Do Some Well-Known Testbeds Offer Today?
  ➢Grid3: many clusters with similar software AND Globus
  ➢PlanetLab: individual machines with similar characteristics

Page 8: VO-Ganglia  Grid Simulator

Features / Implemented Model

➢CPU Management / Task Assignment Policies

➢Disk Management / Space Assignment Policies

➢Network Management / Maximum Capacity (so far)

➢Usage Policy Specification Interface

➢Data File Management (replica selection problem)

Page 9: VO-Ganglia  Grid Simulator

Implementation Details

➢Before:
  ✗Metric collection by means of specific collectors

➢Now:
  ✗Special modules that generate metrics about different loads
  ✗Similar to a discrete simulator but integrated with a real tool

➢“How exactly?”
  ✗Periodic invocations (instead of monitoring collectors)
  ✗State management for workloads, data file migration, CPU and disk allocations, network usage
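
The idea of replacing collectors with periodically invoked, stateful load modules can be sketched roughly as follows. This is only an illustration in Python, not the tool's actual code: the class `CpuLoadModule`, the function `run`, and the metric names are hypothetical, and the random job arrivals stand in for whatever workload model a real module would use.

```python
import random


class CpuLoadModule:
    """Hypothetical load module that keeps its state between invocations."""

    def __init__(self, cpus, seed=0):
        self.cpus = cpus                  # simulated CPUs at this site
        self.busy = 0                     # state: CPUs currently allocated
        self.rng = random.Random(seed)    # seeded so runs are repeatable

    def step(self):
        """One periodic invocation: update the state, return synthetic metrics."""
        finished = self.rng.randint(0, self.busy)             # jobs that end
        self.busy -= finished
        started = self.rng.randint(0, self.cpus - self.busy)  # jobs that start
        self.busy += started
        return {
            "cpu_total": self.cpus,
            "cpu_busy": self.busy,
            "cpu_free": self.cpus - self.busy,
        }


def run(modules, ticks):
    """Invoke every module once per tick, where a monitor would poll collectors."""
    samples = []
    for tick in range(ticks):
        for module in modules:
            samples.append((tick, module.step()))
    return samples


for tick, metrics in run([CpuLoadModule(cpus=8, seed=1)], ticks=3):
    print(tick, metrics)
```

Because each module owns its state, the same loop could drive disk, network, or file-migration modules without changes to the reporting side.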

Page 10: VO-Ganglia  Grid Simulator

Running Examples

Page 11: VO-Ganglia  Grid Simulator

Talk Overview

➢Part I: The Grid-enabled Monitoring Tool

➢Part II: From Monitoring to Simulation

➢Part III: Features / Extended Model

➢Shortcomings

➢Future Work / Conclusions

Page 12: VO-Ganglia  Grid Simulator

Distributed Simulations

➢Idea: Is it possible to run several simulators on different machines and configure each instance to report to a set of specified neighbors?

➢Advantages:
  ✗Simplicity in connecting several local simulators working on different data
  ✗Support for metric distribution and visualization
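
A minimal sketch of the neighbor-reporting idea, assuming each simulator instance simply pushes its metrics to a configured list of neighbors. The class `SimulatorInstance` and its methods are hypothetical, and direct method calls stand in for the network transport a real deployment would use.

```python
class SimulatorInstance:
    """Hypothetical simulator instance that reports to configured neighbors."""

    def __init__(self, name):
        self.name = name
        self.neighbors = []   # instances this one is configured to report to
        self.view = {}        # metrics known here, keyed by (origin, metric)

    def report(self, metric, value):
        """Record a locally generated metric and push it to every neighbor."""
        self.view[(self.name, metric)] = value
        for neighbor in self.neighbors:
            neighbor.receive(self.name, metric, value)

    def receive(self, origin, metric, value):
        """Store a metric reported by another instance."""
        self.view[(origin, metric)] = value


# Three local simulators; only "a" is configured to report to "b".
a, b, c = SimulatorInstance("a"), SimulatorInstance("b"), SimulatorInstance("c")
a.neighbors = [b]
a.report("cpu_busy", 5)
print(b.view)   # b now sees a's metric
print(c.view)   # c is not a neighbor of a, so its view stays empty
```

Each instance works on its own data, and the neighbor list alone decides where its metrics become visible.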

Page 13: VO-Ganglia  Grid Simulator

Running Examples

[...]

Page 14: VO-Ganglia  Grid Simulator

Commitment Usage Policy

for each Gi with EPi, BPi, BEi do

  # Case 1: fill BPi + BEi
  if (Sum(BAj) == 0) & (BAi < BPi) & (Qi has jobs) then
    schedule a job from some Qi to the least loaded site

  # Case 2: BAi < BPi (resources available)
  else if (Sum(BAk) < TOTAL) & (BAi < BPi) & (Qi has jobs) then
    schedule a job from some Qi to the least loaded site

  # Case 3: fill EPi (resource contention)
  else if (Sum(BAk) == TOTAL) & (BAi < EPi) & (Qi exists) then
    if (j exists such that BAj >= EPj) then
      stop scheduling jobs for VOj

  # Need to fill with extra jobs?
  if (BAi < EPi + BEi) then
    schedule a job from some Qi to the least loaded site

  # ??
  if (EAi < EPi) & (Qi has jobs) then
    schedule additional backfill jobs
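
The decision rules above might be turned into code roughly as follows. This is a hedged Python sketch, not the authors' implementation. The slide does not define its symbols, so the readings of EPi, BPi, BEi and BAi in the comments are assumptions, as are the names `Group` and `decide`, the treatment of "Qi exists" as "Qi has jobs", and the queue check on the extra-fill step. The step marked "??" on the slide is left out.

```python
from dataclasses import dataclass


@dataclass
class Group:
    name: str
    ep: int          # EPi, assumed: long-term (epoch) share, in percent
    bp: int          # BPi, assumed: short-term (burst) share, in percent
    be: int          # BEi, assumed: extra allowance on top of the share
    ba: int = 0      # BAi, assumed: current burst usage, in percent
    queued: int = 0  # number of jobs waiting in Qi


def decide(g, groups, total, site_load):
    """Return the actions the slide's rules select for group g."""
    used = sum(x.ba for x in groups)            # Sum(BAk)
    site = min(site_load, key=site_load.get)    # least loaded site
    has_jobs = g.queued > 0
    actions = []

    if used == 0 and g.ba < g.bp and has_jobs:          # Case 1
        actions.append(("schedule", g.name, site))
    elif used < total and g.ba < g.bp and has_jobs:     # Case 2
        actions.append(("schedule", g.name, site))
    elif used == total and g.ba < g.ep and has_jobs:    # Case 3
        for other in groups:
            if other is not g and other.ba >= other.ep:
                actions.append(("stop", other.name))

    # "Need to fill with extra jobs?" step. Its nesting on the slide is
    # ambiguous; it is treated here as a separate check after the cases,
    # so read literally it can also fire after Case 1 or Case 2.
    if g.ba < g.ep + g.be and has_jobs:
        actions.append(("schedule", g.name, site))
    return actions


# Made-up numbers: the site is fully used and VO2 is above its assumed share.
vo1 = Group("VO1", ep=60, bp=80, be=10, ba=30, queued=2)
vo2 = Group("VO2", ep=40, bp=90, be=10, ba=70, queued=1)
print(decide(vo1, [vo1, vo2], total=100, site_load={"siteA": 5, "siteB": 2}))
# prints [('stop', 'VO2'), ('schedule', 'VO1', 'siteB')]
```

Integer percentages keep the `used == total` comparison exact; with fractional usage a tolerance would be needed.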

Page 15: VO-Ganglia  Grid Simulator

Usage Policy Example

[Figure: usage policy example for VO1 and VO2, with percentage labels 99%, 80%, 20%, 60%, and 90%]

Page 16: VO-Ganglia  Grid Simulator

Commitment Policy in Practice

Page 17: VO-Ganglia  Grid Simulator

Current Issues

➢RRD / Disk Access

➢Perl / Interpreted Language Speed

➢Result Interpretation

➢Result Validation in Real Contexts

Page 18: VO-Ganglia  Grid Simulator

Future Work

➢“What Is Next?”

✗More work on resource usage policy analysis

✗“Export” ideas from VO-Ganglia into real practice

Page 19: VO-Ganglia  Grid Simulator

Conclusions

➢“Why Is VO-Ganglia So ‘Cool’ for Me?”

✗Some creative ideas

✗Easy to use

✗“Possibility to run on my laptop”

✗Provisioning tools for
  ✔Workload generation
  ✔Result formatting

➢“Why Did I Invest More Than a Year in Developing It?”

Page 20: VO-Ganglia  Grid Simulator

Questions / Suggestions?

