grid aware ha-oscar

20
Oscar 05 Symposium May 2005 Grid Aware HA-OSCAR Kshitij Limaye 1 , Box Leangsuksun 1 , Venkata K. Munganuru 1 , Zeno Greenwood 1 , Stephen L. Scott 2 , Richard Libby 3 , and Kasidit Chanchio 4 1.Louisiana Tech University, 2.Oak Ridge National Laboratory, 3.Intel, 4.Thammasat University, Thailand

Upload: trancong

Post on 14-Feb-2017

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Grid Aware HA-OSCAR

Oscar 05 SymposiumMay 2005

Grid Aware HA-OSCAR

Kshitij Limaye1, Box Leangsuksun1, Venkata K. Munganuru1, Zeno Greenwood1, Stephen L. Scott2, Richard Libby3, and Kasidit Chanchio4

1.Louisiana Tech University, 2.Oak Ridge National Laboratory, 3.Intel, 4.Thammasat University, Thailand

Page 2: Grid Aware HA-OSCAR

2Oscar 05 SymposiumMay 2005

OutlineIntroductionTraditional & Dual head Architectures.Proposed FrameworkSmart Failover frameworkExperimentPlanned & unplanned downtimeConclusionFuture work

Page 3: Grid Aware HA-OSCAR

3Oscar 05 SymposiumMay 2005

IntroductionScientists across the world have employed Grid Computing to overcome various resource level hurdles.

Clusters are favored job sites in grids.

Rendering High availability becomes increasingly important as critical applications shift to grid systems.

Though Grid is distributed , inevitable errors can make a site unusable leading to reduced overall resources and slowing down the speed of computation.

Page 4: Grid Aware HA-OSCAR

4Oscar 05 SymposiumMay 2005

Introduction – continued…Efforts need to concentrate on making critical systems highly available and eliminate single point of failures in grids and clusters.

HA-OSCAR removes single point of failure of cluster based job site (Beowulf) by component redundancy and self-healing capabilities.

Smart Failover feature tries to make failover mechanism graceful in terms of job management.

Page 5: Grid Aware HA-OSCAR

5Oscar 05 SymposiumMay 2005

Traditional Intra site cluster configuration

Site-Manager is (cluster head node having Globus Services) the node acting as the gateway between the cluster and the grid.Site-manager is critical from point of site being used to its full potential.Failure of Site-Manager causes whole site to go unused till it becomes healthy.Outages are non-periodical and unpredictable and hence measures should be taken to guarantee high availability of services. Hence the proposed architecture.

Page 6: Grid Aware HA-OSCAR

6Oscar 05 SymposiumMay 2005

Critical Service Monitoring & Failover-Failback capability for site-manager

Client

Client submits MPI job

Site-Manager

HAOSCAR failover if

critical services

(Gatekeeper, gridFTP, PBS) die

Compute nodes

Stand-By

Page 7: Grid Aware HA-OSCAR

7Oscar 05 SymposiumMay 2005

Proposed Framework

Most of the current efforts have focused on task-level fault tolerance as in retrying the job on an alternate site.

There is dearth of solutions for fault detection and recovery at the site level.

We monitor Gatekeeper & gridFTP services in the Service monitoring sublayerand failover & failback in irreparable situations.

Operating System Applications

Cluster Software

Grid Layer

HA-OSCAR Service Monitoring

HA-OSCAR policy-based

recoverymechanism

Page 8: Grid Aware HA-OSCAR

8Oscar 05 SymposiumMay 2005

Grid Enabled HA service

The HA-OSCAR monitors the gatekeeper and gridFTP services every 3 seconds.When a service fails, to start after 3 attempts, failover happens.Standby also monitors Primary every 3 seconds to check whether it is alive.

Page 9: Grid Aware HA-OSCAR

9Oscar 05 SymposiumMay 2005

Smart Failover FrameworkEvent monitor triggers Job Queue monitor on events such as JOB_ADD, JOB_COMPLETE and system eventsOn sensing change in job queue, job queue monitor triggers backup updater to update backup.

Page 10: Grid Aware HA-OSCAR

10Oscar 05 SymposiumMay 2005

HA-OSCAR in a cluster based Grid environmentProduction-quality Open

source Linux-cluster project

HA and HPC clustering techniques to enable critical HPC infrastructure Self-configuration Multi-head Beowulf system

HA-enabled HPC Services:Active/Hot Standby

Self-healing with 3-5 sec automatic failover time

The first known field-grade open source HA Beowulf cluster release

Page 11: Grid Aware HA-OSCAR

11Oscar 05 SymposiumMay 2005

ExperimentGlobus Toolkit 3.2Oscar 3.0 HA-OSCAR beta 1.0

Page 12: Grid Aware HA-OSCAR

12Oscar 05 SymposiumMay 2005

Observations

Average Failover time was 19 seconds and average failback time was 20 seconds.

Services were restarted in between 1-3 seconds depending on when last monitoring was done.

Group Service Type Time Alert

1 Service_mon

Gatekeeper

Alert Sun Nov 21

09:10:302004

Xinetd.alert

2 Service-mon

Gatekeeper

Upalert

Sun Nov21

09:10:332004

Mail.alert

Group Service Type Time Alert

1 Primary_server

Ping Alert Sun Nov21

09:30:202004

Server-down Alert

2 Primary_server

Ping Upalert

Sun Nov2109:35:392004

Server-up.alert

Page 13: Grid Aware HA-OSCAR

13Oscar 05 SymposiumMay 2005

Time needed for jobs to complete with/without “Smart Failover”

Assuming jobs start running after reboot on clusters.TLR = Time to complete last running jobs.

MTTR (seconds) Total Time needed without Smart Failover feature

Total time needed with smart Failover feature

120 (2 min) 120 + run time of predecessors– TLR (running jobs lost)

20 + run time of predecessors + TLR

600 (10 min) 600 + run time of predecessors – TLR (running jobs lost)

20 + run time of predecessors + TLR

3600 (60 min) 3600 + run time of predecessors

– TLR (running jobs lost)

20 + run time of predecessors + TLR

7200 (2 hours) 7200 + run time of predecessors

– TLR (running jobs lost)

20 + run time of predecessors + TLR

Page 14: Grid Aware HA-OSCAR

14Oscar 05 SymposiumMay 2005

Planned Downtime

Time to taken to setup and configure software adds to the planned downtime.We have developed a easy Globus Toolkit configuration helper package.Also helps installation of side packages, such as schedulers, MPI(s), etc.This will help reducing planned downtime by automating the process.

Page 15: Grid Aware HA-OSCAR

15Oscar 05 SymposiumMay 2005

Unplanned DowntimeAssumptions:

Package used: SPNPAvailability for grid having traditional cluster as intra site solution : 0.968 i.e. 11.68 days downtime per year.Availability for grid having HA-OSCAR enabled cluster as intra site solution:0.99992 i.e. 2 minutes downtime per year

Hence the obvious availability gain.

HAOSCAR enabled Grid Vs Traditional Grid

70.00%72.00%74.00%76.00%78.00%80.00%82.00%84.00%86.00%88.00%90.00%92.00%94.00%96.00%98.00%

100.00%

1000 2000 3000 5000 6000

Mean Time To Failure(MTTF) in Hours

Avai

labi

lity/

year

Single Head 4 clusterGridHAOSCAR enabled 4cluster gridSingle Head 10Cluster GridHAOSCAR enabled10 Cluster Grid

HA-OSCAR enabled Grid Vs Traditional Grid

Page 16: Grid Aware HA-OSCAR

16Oscar 05 SymposiumMay 2005

Polling Overhead Measurement20 sec failover time0.9% CPU usage at each monitoring interval

0

50

100

150

200

250

300

1 2 5 10 15 20 30 60

HA-OSCAR Mon polling interval (s)

HA-O

SC

AR N

etw

ork

load

in

Pack

ets/

Min

m

easu

red b

y TC

Ptra

ce

Comparison of network usages for HA-OSCAR different polling sizes

Page 17: Grid Aware HA-OSCAR

17Oscar 05 SymposiumMay 2005

SummaryInstitutions have significant investment in resources and that needs to be guaranteed.

“Smart Failover” HA-OSCAR makes failover graceful in terms of job management.

“Smart Failover” HA-OSCAR with Failover Aware solution for site-manager provides better availability, self healing and fault tolerance.

HA-OSCAR ensures service and job level resilience for clusters and grids.

Page 18: Grid Aware HA-OSCAR

18Oscar 05 SymposiumMay 2005

Current statusSmart failover feature tested with Oscar 3.0, OpenPBS as the scheduler.

Failover Aware client written to achieve resilience for jobs submitted through grid.

Lab grade automated Globus installation package ready.

Page 19: Grid Aware HA-OSCAR

19Oscar 05 SymposiumMay 2005

Future WorkDevelop the wrapper around scheduler for per job add/complete events.

Testing of Smart failover feature with the event monitoring system.

Integration of “Smart Failover” in next release of HA-OSCAR

Research into lazy failback mechanism.

Page 20: Grid Aware HA-OSCAR

20Oscar 05 SymposiumMay 2005

Thank You