why we do monitoring wrong #osmc edition

74
Wrong Why we do

Upload: michael-medin

Post on 12-Jul-2015

573 views

Category:

Software


3 download

TRANSCRIPT

Page 1: Why we do monitoring Wrong #osmc edition

WrongWhy we do

Page 2: Why we do monitoring Wrong #osmc edition

…frustration…

devnot ops

Page 3: Why we do monitoring Wrong #osmc edition
Page 4: Why we do monitoring Wrong #osmc edition
Page 5: Why we do monitoring Wrong #osmc edition
Page 6: Why we do monitoring Wrong #osmc edition
Page 7: Why we do monitoring Wrong #osmc edition
Page 8: Why we do monitoring Wrong #osmc edition
Page 9: Why we do monitoring Wrong #osmc edition
Page 10: Why we do monitoring Wrong #osmc edition
Page 11: Why we do monitoring Wrong #osmc edition

Please don’t be angry!

Some times I am busy

Page 12: Why we do monitoring Wrong #osmc edition
Page 13: Why we do monitoring Wrong #osmc edition
Page 14: Why we do monitoring Wrong #osmc edition
Page 15: Why we do monitoring Wrong #osmc edition

TAKE:1

Page 16: Why we do monitoring Wrong #osmc edition

check_disk-w 80 –c 90

Page 17: Why we do monitoring Wrong #osmc edition

Slack

-w 80 –c 901gb1tb1pb

0.2g219g225 179g

Page 18: Why we do monitoring Wrong #osmc edition

Better?

-w $ARG1$1gb1tb1pb

0.2g22g2 251g

80%98%99,8%

Magic?

Page 19: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

3000

Value Warning Critical

The problem

The first alert

On call staff alerted

Lost time

Things went bad!

Page 20: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

3000

Value Warning Critical

The problem

The first alert

On call staff alerted

Lost time

Page 21: Why we do monitoring Wrong #osmc edition

No Slack

-w trend-line1gb1tb1pb

0g0g0g

Page 22: Why we do monitoring Wrong #osmc edition

Works With Everything!

Magic?

Page 23: Why we do monitoring Wrong #osmc edition

TAKE:2

Page 24: Why we do monitoring Wrong #osmc edition

planningWhat aboutCapacity

Bounds?

Page 25: Why we do monitoring Wrong #osmc edition

Alarm clock

Page 26: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

Warning Critical HDD 1 HDD 2

Full

How long?

> 80%

> 90%

Page 27: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

Warning Critical HDD 1 HDD 2

Full

warn=full in less than x weeks

Page 28: Why we do monitoring Wrong #osmc edition

Photo Credit Howard Dickins

Alarm clock

2 hours before work

Page 29: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

3000

Value Warning Critical

The first alert

On call staff alerted

Page 30: Why we do monitoring Wrong #osmc edition

No basic math!

Magic?

Page 31: Why we do monitoring Wrong #osmc edition
Page 32: Why we do monitoring Wrong #osmc edition

check_disk-w 80 –c 90

Page 33: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

Value Warning Critical

Backup

check_disk check_disk_backup

Page 34: Why we do monitoring Wrong #osmc edition

0

500

1000

1500

2000

2500

Value Warning Critical

check_disk warn=usage>80% and not_backup

Backup

Page 35: Why we do monitoring Wrong #osmc edition

No it is tags

Magic?

Page 36: Why we do monitoring Wrong #osmc edition

Other

TAKE:1

Page 37: Why we do monitoring Wrong #osmc edition

check_load-w 1 –c 2

Page 38: Why we do monitoring Wrong #osmc edition

Bad CPU load?80%

90%100%

0%

Page 39: Why we do monitoring Wrong #osmc edition

0

10

20

30

40

50

60

70

80

90

100

Value Yesterday Last Week

Page 40: Why we do monitoring Wrong #osmc edition

No, still math

Magic?

Page 41: Why we do monitoring Wrong #osmc edition
Page 42: Why we do monitoring Wrong #osmc edition

check_load-w 1 –c 2

Page 43: Why we do monitoring Wrong #osmc edition

High Load???GOOD BAD

DO WE CARE?

Page 44: Why we do monitoring Wrong #osmc edition
Page 45: Why we do monitoring Wrong #osmc edition

No, still math

Magic?

Page 46: Why we do monitoring Wrong #osmc edition

TAKE:2

Page 47: Why we do monitoring Wrong #osmc edition

check_mem-w 80 –c 90

Page 48: Why we do monitoring Wrong #osmc edition

Bad Memory?80%

90%100%

0%

Page 49: Why we do monitoring Wrong #osmc edition

Managed…Java

JVM.net

CLR

Page 50: Why we do monitoring Wrong #osmc edition

check_mem

check_jmxcheck_counter

check_wmi

Page 51: Why we do monitoring Wrong #osmc edition

check_disk-w 80 –c 90

Page 52: Why we do monitoring Wrong #osmc edition

FULL DISK???GOOD BAD

DO WE CARE?

Page 53: Why we do monitoring Wrong #osmc edition

Because we can?Why do we monitor?

Because we do?Because…

Page 54: Why we do monitoring Wrong #osmc edition

Business!Technology

NOT

Page 55: Why we do monitoring Wrong #osmc edition

IT

BUSINESS

Page 56: Why we do monitoring Wrong #osmc edition

No, common sense

Magic?

Page 57: Why we do monitoring Wrong #osmc edition

TAKE:1

Page 58: Why we do monitoring Wrong #osmc edition

Nagios™ is Old

EasySimple

What we always do

Page 59: Why we do monitoring Wrong #osmc edition

bischeckAddons

Other solutions“the new stuff”

forks

Page 60: Why we do monitoring Wrong #osmc edition

Why a tool?

fast forward 15 yearsNagios™Naemon™could do this!

Why an addon?

Page 61: Why we do monitoring Wrong #osmc edition

cron*/5 * * * * wrap.sh mycheck

#!/bin/bash$*if [ $? == 1 ];then

send-email.shfi;

Page 62: Why we do monitoring Wrong #osmc edition
Page 63: Why we do monitoring Wrong #osmc edition
Page 64: Why we do monitoring Wrong #osmc edition

TAKE:2

Page 65: Why we do monitoring Wrong #osmc edition
Page 66: Why we do monitoring Wrong #osmc edition
Page 67: Why we do monitoring Wrong #osmc edition
Page 68: Why we do monitoring Wrong #osmc edition
Page 69: Why we do monitoring Wrong #osmc edition

TAKE:1

Page 70: Why we do monitoring Wrong #osmc edition
Page 71: Why we do monitoring Wrong #osmc edition
Page 72: Why we do monitoring Wrong #osmc edition

TAKE:2

Page 73: Why we do monitoring Wrong #osmc edition

Photo by Olga Berrios

Page 74: Why we do monitoring Wrong #osmc edition

Information about NSClient++http://nsclient.org

facebook.com/nsclient

Slides, and exampleshttp://nsclient.org/nscp/conferances

My Bloghttp://blog.medin.name