Page 1: SUM like functionality with WLCG-MON Ivan  Dzhunov

IT-SDC : Support for Distributed Computing

SUM like functionality with WLCG-MON

Ivan Dzhunov

Page 2: SUM like functionality with WLCG-MON Ivan  Dzhunov

What do we mean by SUM like?

Historical plots
  sites availability/reliability
  services availability/reliability
  metrics history per hostname
  metric’s detailed output

Latest results table

“SUM uses the screen very economically”

drill-down navigation between the plots

filters for sites, groups of sites, services, metrics, hostnames

6 Dec 2013

Page 3: SUM like functionality with WLCG-MON Ivan  Dzhunov

Site reliability historical plot

New type of historical metric plot implemented in SSB (WLCG-MON)

Reliability calculated by the formula good/(good+bad)

good = {‘warning’, ‘ok’}
bad = {‘critical’, ‘unknown’}

Configurable per metric

The idea is to use the same plot for service reliability
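
The reliability formula above can be sketched in Python. Only the formula good/(good+bad) and the default good/bad status sets come from the slide. The function name, the choice to count samples rather than weight them by time, and the sample status list are assumptions made for illustration.

```python
from collections import Counter

# Default classification from the slide; the slide says it is configurable per metric.
GOOD = frozenset({"ok", "warning"})
BAD = frozenset({"critical", "unknown"})


def reliability(statuses, good=GOOD, bad=BAD):
    """Return good / (good + bad), or None when no status counts as good or bad.

    Assumption: each status sample has equal weight. Statuses that are in
    neither set are ignored.
    """
    counts = Counter(statuses)
    n_good = sum(counts[s] for s in good)
    n_bad = sum(counts[s] for s in bad)
    total = n_good + n_bad
    return n_good / total if total else None


# Hypothetical sample: statuses of one metric over a time window.
sample = ["ok", "ok", "warning", "critical", "unknown", "ok", "maintenance"]
print(reliability(sample))
```

Passing different `good` and `bad` sets per call is one simple way to model the per-metric configuration.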

Page 4: SUM like functionality with WLCG-MON Ivan  Dzhunov

Service reliability historical plot?

We need a plot showing reliability
  for a set of service flavors (CREAM-CE, SRMv2)
  for a set of sites (T2_ES_CIEMAT, T2_FR_CCIN2P3)
  per profile

In SSB, the status of each single service flavor (per profile) is a different metric

Need for a cross-metric, cross-instance historical SSB plot (MxN). SSB is not flexible enough: it has historical plots only per metric (1xN) or per instance (Mx1)

Here we thought a bit: if we want to provide SUM like functionality, why not use SUM?
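
As a rough sketch of what an MxN plot needs as input, the Python below builds a reliability table over M metrics and N instances from flat records. The record layout (metric, instance, status), the use of flavor names as metric names, and the function name are hypothetical; SSB's actual data model is not described in these slides.

```python
from collections import defaultdict

GOOD = {"ok", "warning"}
BAD = {"critical", "unknown"}


def reliability_matrix(records, metrics, instances):
    """Map each selected (metric, instance) pair to good / (good + bad).

    `records` is an iterable of (metric, instance, status) tuples.
    Pairs with no good or bad samples map to None.
    """
    tally = defaultdict(lambda: [0, 0])  # (metric, instance) -> [good, bad]
    for metric, instance, status in records:
        if metric in metrics and instance in instances:
            if status in GOOD:
                tally[(metric, instance)][0] += 1
            elif status in BAD:
                tally[(metric, instance)][1] += 1
    matrix = {}
    for m in metrics:
        for i in instances:
            good, bad = tally[(m, i)]
            matrix[(m, i)] = good / (good + bad) if good + bad else None
    return matrix


# Hypothetical records: one metric per service flavor, one instance per site.
records = [
    ("CREAM-CE", "T2_ES_CIEMAT", "ok"),
    ("CREAM-CE", "T2_ES_CIEMAT", "critical"),
    ("CREAM-CE", "T2_FR_CCIN2P3", "ok"),
    ("SRMv2", "T2_ES_CIEMAT", "warning"),
]
matrix = reliability_matrix(
    records, ["CREAM-CE", "SRMv2"], ["T2_ES_CIEMAT", "T2_FR_CCIN2P3"]
)
```

A 1xN or Mx1 plot is the special case where only one metric or only one instance is selected.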

Page 5: SUM like functionality with WLCG-MON Ivan  Dzhunov

Current usage of SUM

SUM
  User interface for displaying SAM data
  No DB behind it, works with API calls

SSB interacting with SUM

SSB data not in sync with re-computed historical SAM data

[Diagram labels: SUM UI, API calls, SAM data, SSB UI, SSB data, get current data]

Page 6: SUM like functionality with WLCG-MON Ivan  Dzhunov

Future usage of SUM

SUM on top of WLCG-MON

Single data store

Re-computed data will be seen by both applications

[Diagram labels: SUM UI, WLCG-MON, API calls, topology, profile definition, metric results, SSB UI]

Page 7: SUM like functionality with WLCG-MON Ivan  Dzhunov

SUM on top of WLCG-MON

http://wlcg-sam-cms.cern.ch

Page 8: SUM like functionality with WLCG-MON Ivan  Dzhunov

Advantages of using SUM

Migration to WLCG-MON will be transparent for the end user

SUM will serve the same plots and data as it used to

No UI has to be re-written

Validation of the new system will be much easier
  Validate data out of two equivalent systems: “old” and “new” SUM
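
One way to picture the validation step is a comparison of the same values taken from the two systems. The Python below is only an illustration: the dictionary layout, the (site, day) key structure, the sample values, and the tolerance are assumptions, and how the values would be fetched from either SUM is left out.

```python
import math


def compare_results(old, new, tolerance=1e-6):
    """Return the keys whose values differ between the old and new systems.

    `old` and `new` map a key such as (site, day) to an availability value.
    A key missing on one side counts as a difference.
    """
    differences = {}
    for key in sorted(set(old) | set(new)):
        a, b = old.get(key), new.get(key)
        if a is None or b is None or not math.isclose(a, b, abs_tol=tolerance):
            differences[key] = (a, b)
    return differences


# Hypothetical values from the "old" and "new" SUM.
old = {("T2_ES_CIEMAT", "2013-12-01"): 0.98, ("T2_FR_CCIN2P3", "2013-12-01"): 1.0}
new = {("T2_ES_CIEMAT", "2013-12-01"): 0.98, ("T2_FR_CCIN2P3", "2013-12-01"): 0.95}
diffs = compare_results(old, new)
```

An empty result would mean the two systems agree on every compared key within the chosen tolerance.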

Page 9: SUM like functionality with WLCG-MON Ivan  Dzhunov

Disadvantages of using SUM

Maintain 2 different user interfaces

Page 10: SUM like functionality with WLCG-MON Ivan  Dzhunov

Which way do we go?

Page 11: SUM like functionality with WLCG-MON Ivan  Dzhunov

WLCG-MON TODO list

…assuming we decide to adopt SUM

SUM side
  metric result details and latest results part
  Availability plots when site/service downtimes are introduced to WLCG-MON

Profiles definition: currently in a static JSON file -> move to DB
  Changes in aggregation, SUM actions

Service type <-> flavor mapping
  Changes in aggregation?, SUM actions

Site/service downtime handling

Re-computations on aggregated data with SSB “modify metric data” functionality

Validate data

Monthly reports
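
For the profile definition item in the list above, moving from a static JSON file to a database could look roughly like the sketch below. The JSON layout, the profile and metric names, the table and column names, and the use of SQLite are all assumptions chosen for illustration; the actual WLCG-MON profile format and database are not described in these slides.

```python
import json
import sqlite3

# Hypothetical profile definition, standing in for the static JSON file.
PROFILE_JSON = """
{
  "name": "EXAMPLE_PROFILE",
  "flavors": {
    "CREAM-CE": ["metric_a", "metric_b"],
    "SRMv2": ["metric_c"]
  }
}
"""


def load_profile(conn, profile):
    """Store one profile as rows of (profile, flavor, metric)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS profile_metric ("
        "profile TEXT, flavor TEXT, metric TEXT, "
        "PRIMARY KEY (profile, flavor, metric))"
    )
    rows = [
        (profile["name"], flavor, metric)
        for flavor, metrics in profile["flavors"].items()
        for metric in metrics
    ]
    conn.executemany("INSERT OR REPLACE INTO profile_metric VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load_profile(conn, json.loads(PROFILE_JSON))
stored = conn.execute(
    "SELECT flavor, metric FROM profile_metric ORDER BY flavor, metric"
).fetchall()
```

Keeping one row per (profile, flavor, metric) would let aggregation code query the profile instead of re-reading a file, which is one possible reading of "move to DB".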
