Logging
Instrumentation
Dashboards
Alerts
- for developers
WONDERING WHO
IS THAT GUY?
HENRY BEEN
Independent DevOps & Azure Architect
E: [email protected]
T: @henry_been
L: linkedin.com/in/henrybeen
W: henrybeen.nl
Very boring part
Pretty boring part
Maybe boring part
Amazing part
Really amazing part
Mindblowing part
LOGGING, INSTRUMENTATION, DASHBOARDS & ALERTS
A bit of monitoring theory
What to monitor
Demos!
LOGGING, INSTRUMENTATION, DASHBOARDS, ALERTS
MONITORING
-or-
KNOWING WHAT BE HAPPENING
Instrumentation
Most often forgotten
Concerned with metrics
• (Timestamp, value) tuples
• Events & values
• Dimensions
• Used for Dashboards & Alerts
Rolled up over time
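A minimal sketch of the idea on this slide: a metric is a (timestamp, value) tuple with dimensions attached, and older data is rolled up into coarser time buckets. The names `MetricPoint` and `roll_up`, the metric name, and the sample values are all hypothetical and only serve the illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class MetricPoint:
    timestamp: int          # seconds since some epoch
    name: str               # metric name, e.g. a response time
    value: float
    dimensions: tuple = ()  # (key, value) pairs, kept hashable on purpose


def roll_up(points, bucket_seconds):
    """Group points per metric, dimension set and time bucket, then summarize."""
    buckets = defaultdict(list)
    for p in points:
        bucket_start = p.timestamp - (p.timestamp % bucket_seconds)
        buckets[(p.name, p.dimensions, bucket_start)].append(p.value)
    return {
        key: {
            "count": len(values),
            "min": min(values),
            "max": max(values),
            "avg": sum(values) / len(values),
        }
        for key, values in buckets.items()
    }


# Invented sample data: three measurements of one operation.
view_item = (("operation", "ViewItem"),)
points = [
    MetricPoint(0, "response_time_ms", 100.0, view_item),
    MetricPoint(20, "response_time_ms", 300.0, view_item),
    MetricPoint(70, "response_time_ms", 50.0, view_item),
]
rolled = roll_up(points, bucket_seconds=60)
```

Keeping the minimum and maximum next to the average in each bucket means the spread survives the roll-up instead of being averaged away.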
Logging
Often combined with tracing
Concerned with state changes
• When
• By who
• What changed
• Can be a source of metrics
Retained for longer or indefinitely
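As a rough illustration of logging in this sense, the sketch below writes one state change as a structured record with the three fields from the slide: when, by who, and what changed. The function name `state_change_record`, the field layout, and the sample values are assumptions made for the example, not a prescribed format.

```python
import json
from datetime import datetime, timezone


def state_change_record(who, entity, field_name, old, new, when=None):
    """Serialize a single state change as one JSON line."""
    when = when or datetime.now(timezone.utc)
    return json.dumps(
        {
            "when": when.isoformat(),
            "who": who,
            "what": {
                "entity": entity,
                "field": field_name,
                "old": old,
                "new": new,
            },
        },
        sort_keys=True,
    )


# Hypothetical change: a user repoints an alias to a new target.
record = state_change_record(
    who="alice",
    entity="alias/42",
    field_name="target_url",
    old="http://a.example",
    new="http://b.example",
    when=datetime(2020, 1, 1, tzinfo=timezone.utc),
)
```

Because every record has the same shape, counting records per entity or per user later on is one way such a log can become a source of metrics.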
Tracing
Used for troubleshooting
What we developers call logging
• When
• Code location
• Very, very verbose message
• Can be a source of metrics
Retained for a limited time
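Tracing as described here maps closely onto what Python's standard `logging` module does at debug level: a timestamp, the code location, and a verbose message. The sketch below assumes a hypothetical `resolve_alias` function and logger name; only the `logging` calls themselves are standard library.

```python
import io
import logging


def build_tracer(stream, name="demo.trace"):
    """Create a debug-level logger that records when and where each message came from."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.DEBUG)
    logger.propagate = False
    handler = logging.StreamHandler(stream)
    handler.setFormatter(
        logging.Formatter(
            "%(asctime)s %(levelname)s %(module)s.%(funcName)s:%(lineno)d %(message)s"
        )
    )
    logger.handlers = [handler]
    return logger


def resolve_alias(tracer, alias, table):
    """Hypothetical operation that is traced in a very verbose way."""
    tracer.debug("looking up alias %r in table of %d entries", alias, len(table))
    target = table.get(alias)
    tracer.debug("alias %r resolved to %r", alias, target)
    return target


buffer = io.StringIO()
tracer = build_tracer(buffer)
result = resolve_alias(tracer, "docs", {"docs": "https://example.com/docs"})
trace_output = buffer.getvalue()
```

Output this verbose is useful while troubleshooting and expensive to keep, which fits the limited retention noted on the slide.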
MONITORING
VM / container
Timeseries Database
Application Instance
Agent
SDK
Logging Database
Https endpoint
Visualization Engine
Your Dashboards
Query Engine
Your Alerts
Streaming Data Engine
BUT HEY THAT STUFF IS OPERATIONS…
WHY WOULD I CARE?
DEVOPS POWERRRR
“DevOps is the union of people, process, and products to enable continuous delivery of value to your end users.”
DEVOPS
OKAY, OKAY I CARE… NOW, WHAT TO MONITOR?
OKAY, OKAY I CARE… NOW, WHAT NOT TO MONITOR?
WHAT NOT TO DO
FOCUS ON AVERAGES
Averages are lying cheats and hide real issues!
• Use dimensions to find outliers
• Use maximums or percentiles to identify spread
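To make the point about averages concrete, here is a small sketch with invented numbers: 95 fast requests and 5 very slow ones. The `percentile` helper uses the nearest-rank method, which is one of several common definitions and is chosen here only for simplicity.

```python
import math
import statistics


def percentile(values, pct):
    """Nearest-rank percentile: the smallest value with at least pct% of samples at or below it."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


# Invented sample: 95 requests take 100 ms, 5 requests take 2000 ms.
response_times_ms = [100] * 95 + [2000] * 5

average = statistics.mean(response_times_ms)
p50 = percentile(response_times_ms, 50)
p99 = percentile(response_times_ms, 99)
maximum = max(response_times_ms)

print(f"avg={average} p50={p50} p99={p99} max={maximum}")
# prints: avg=195 p50=100 p99=2000 max=2000
```

The average of 195 ms matches no request that actually happened: the typical user waits 100 ms, and one in twenty waits two seconds. The 99th percentile and the maximum expose the slow group that the average hides.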
FOCUS ON YOUR SYSTEMS
Do you think your users care? At all?
• Is the amount of disk IO really interesting?
• Does page file usage predict conversion?
To add some nuance:
- Some system metrics are handy; just don’t start there
- Same goes for averages
[Chart: Database load (%)]
DO WE HAVE AN ISSUE?
Proposed alert
ARE MY USERS HAPPY?
[Chart: Average response time]
[Chart: Average response time and maximum response time]
ARE MY USERS HAPPY?
[Chart: Average response time and response time per operation (View item, AddToBasket)]
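The per-operation chart amounts to slicing one metric by a dimension. A minimal sketch, assuming invented response times for the two operations named in the chart; the helper `average_by_dimension` is hypothetical.

```python
from collections import defaultdict


def average_by_dimension(samples, dimension):
    """Average the values per distinct value of one dimension."""
    groups = defaultdict(list)
    for dims, value in samples:
        groups[dims[dimension]].append(value)
    return {key: sum(values) / len(values) for key, values in groups.items()}


# Invented data: nine fast "View item" calls and one slow "AddToBasket" call.
samples = [({"operation": "View item"}, 100.0)] * 9
samples += [({"operation": "AddToBasket"}, 900.0)]

overall_average = sum(value for _, value in samples) / len(samples)
per_operation = average_by_dimension(samples, "operation")

print(overall_average)  # prints: 180.0
print(per_operation)    # prints: {'View item': 100.0, 'AddToBasket': 900.0}
```

The overall average of 180 ms looks harmless, while the breakdown shows that the operation closest to a purchase is the slow one.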
ARE MY USERS HAPPY?
PERCENTILES!
MEASURE THINGS THE USER CARES ABOUT
AND THINK IT THROUGH!
Let’s do a Quick Demo!
Demo
A VERY GOOD BOOK
ALSO QUITE THIN
FOR THOSE USING AZURE
HOW WOULD YOU DO IT?
DEMO TIME!
A Todo Application…
ALMOST
POST UrlShortner AddAliasProcessor
SQL Database
GET UrlShortner
Two more things
“YOU CAN USE THE DEMOS!”
“AND YOU CAN USE THE DEMOS!”
“EVERYBODY CAN USE THE DEMOS!”
DASHBOARDS AND ALERTS
AS CODE
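One way to read "as code" is that alert definitions live in source control as plain data and are deployed from there, instead of being clicked together in a portal. The sketch below is tool-agnostic: `AlertRule`, its fields, and the sample rule are hypothetical and do not mirror the schema of Azure or any other product.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AlertRule:
    name: str
    metric: str
    aggregation: str     # e.g. "p95" or "max"; an average would hide outliers
    threshold: float
    window_minutes: int

    def fires(self, aggregated_value):
        """Return True when the aggregated metric value exceeds the threshold."""
        return aggregated_value > self.threshold


# Hypothetical rule set, reviewed and versioned like any other source file.
RULES = [
    AlertRule(
        name="slow-add-to-basket",
        metric="response_time_ms",
        aggregation="p95",
        threshold=500.0,
        window_minutes=5,
    ),
]


def export_rules(rules):
    """Render the rules as JSON that a deployment step could push to a monitoring tool."""
    return json.dumps([asdict(rule) for rule in rules], indent=2, sort_keys=True)


exported = export_rules(RULES)
```

With definitions in this form, a change to a threshold goes through the same review and release process as a change to the application itself.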
Questions?
Now is the time!
DO TRY THIS AT HOME!
HENRY BEEN
Independent DevOps & Azure Architect
E: [email protected]
T: @henry_been
L: linkedin.com/in/henrybeen
W: henrybeen.nl