© Copyright 2011 | First Data Corporation
Instrumentation Strategies For The Cloud
01 April 2011
What you need to tell your execs
David Halbig
Agenda
• What is it and what’s the big deal?
• Where are we today?
• Why does performance need special attention?
• What are you gonna do about it?
• Q&A
What Is It and What’s the Big Deal?
Adams, Scott; “Cloud Consultant”, Licensed from www.thedilbertstore.com on 30Mar2011
What Is it?
• On-demand self-service
• Broad network access
• Resource pooling
• Rapid elasticity
• Measured Service
Source: National Institute of Standards and Technology (NIST), http://csrc.nist.gov/publications/drafts/800-145/Draft-SP-800-145_cloud-definition.pdf, accessed on 30Mar2011
What’s the Big Deal?
KAL, “Blame Game”, accessed at: http://www.economist.com/printedition/displayCover.cfm?url=/images/images-magazine/2011/01/15/CN/20110115_CNA400.jpg on 15Jan2011
Figure 1: Before the Cloud: Outsider’s view of IT Negotiations
What’s the Big Deal?
Figure 2: “Power of the Cloud”, BusinessWeek, downloaded from www.businessweek.com on 07Mar2011
What’s the Big Deal?
• Potentially much lower TCO
• Potentially easy access to the ‘ilities’ (scalability, manageability, shareability)
• Absolutely much lower hurdle to prototyping
• Potentially much faster time to market
Where Are We Today?
The Gartner Group, The Gartner Hype Cycle, accessed from http://en.wikipedia.org/wiki/File:Gartner_Hype_Cycle.svg on 30Mar2011
Figure 3: The Gartner Hype Cycle
Why Does Performance Need Special Attention?
• On-demand self-service
  • Interference effects
  • Geographic dispersion
• Broad network access
• Resource pooling
  • Interference effects
  • Geographic dispersion
  • Heterogeneous machine architectures
• Rapid elasticity
• Measured Service
Geographic Dispersion
186,000 Miles Per Second
It’s not just a good idea
It’s the law
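As a rough illustration of why the speed of light matters, the sketch below computes a lower bound on network round-trip time for a given distance. It uses only the 186,000 miles per second figure from the slide; the function name and the sample distances are assumptions made for the example, and real links are slower because of fiber, routing, and queuing.

```python
SPEED_OF_LIGHT_MILES_PER_SEC = 186_000  # figure quoted on the slide (vacuum)


def min_round_trip_ms(distance_miles: float) -> float:
    """Lower bound on round-trip time in milliseconds.

    Ignores routing, queuing, and the slower propagation speed in fiber,
    so real-world latency will be higher than this value.
    """
    one_way_seconds = distance_miles / SPEED_OF_LIGHT_MILES_PER_SEC
    return 2 * one_way_seconds * 1000


if __name__ == "__main__":
    # Hypothetical distances between an app server and a database server.
    for miles in (10, 1_000, 3_000):
        print(f"{miles:>5} miles: at least {min_round_trip_ms(miles):.2f} ms per round trip")
```

No amount of tuning removes this floor; only moving the tiers closer together does.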
Geographic Dispersion
[Diagram: timeline of SQL calls between the App Server and the DB Server]
Same data center – 2 ms latency per SQL call: 10 calls * 2 ms = 20 ms
Geographic Dispersion
[Diagram: timeline of SQL calls between the App Server and the DB Server]
Remote data center – 40 ms latency per SQL call: 10 calls * 40 ms = 400 ms
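The arithmetic on these two slides can be written as a small sketch. The call count (10) and the per-call latencies (2 ms and 40 ms) come from the slides; the function and variable names are made up for illustration, and the model assumes the SQL calls run one after another rather than in parallel.

```python
def total_latency_ms(sql_calls: int, latency_per_call_ms: float) -> float:
    """Latency added to one transaction by sequential SQL round trips."""
    return sql_calls * latency_per_call_ms


SQL_CALLS = 10  # calls per transaction, as on the slides

same_dc_ms = total_latency_ms(SQL_CALLS, 2)     # app and DB in the same data center
remote_dc_ms = total_latency_ms(SQL_CALLS, 40)  # DB in a remote data center

print(f"Same data center:   {same_dc_ms} ms")
print(f"Remote data center: {remote_dc_ms} ms")
print(f"Slowdown factor:    {remote_dc_ms / same_dc_ms:.0f}x")
```

The application code is identical in both cases; only the placement of the tiers changed.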
Interference Effects – I/O
[Diagram: a SAN and server hardware beneath a virtualization layer that hosts seven guests. Each of the seven guests reports “I’m OK”, alongside a single “HELP!!!” callout.]
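One possible reading of this picture is that every guest looks healthy by its own measurements while the shared storage underneath is saturated. The sketch below illustrates that idea only; the thresholds, the IOPS figures, and all names are hypothetical and are not taken from the presentation.

```python
from dataclasses import dataclass
from typing import List

GUEST_ALERT_IOPS = 2_000    # hypothetical per-guest alert threshold
SAN_CAPACITY_IOPS = 10_000  # hypothetical capacity of the shared SAN


@dataclass
class Guest:
    name: str
    iops: int  # I/O operations per second issued by this guest


def guest_status(guest: Guest) -> str:
    """Status as seen from inside a single guest."""
    return "HELP!!!" if guest.iops > GUEST_ALERT_IOPS else "I'm OK"


def san_status(guests: List[Guest]) -> str:
    """Status of the shared SAN, which sees the sum of all guest I/O."""
    total_iops = sum(g.iops for g in guests)
    return "HELP!!!" if total_iops > SAN_CAPACITY_IOPS else "I'm OK"


# Seven guests, each comfortably below its own threshold.
guests = [Guest(f"guest{i}", 1_600) for i in range(1, 8)]

for g in guests:
    print(f"{g.name}: {guest_status(g)}")
print(f"SAN: {san_status(guests)}")
```

Monitoring only inside the guests would miss this condition, which is why the shared layers need their own instrumentation.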
Interference Effects – CPU
Source: VMware ESX 3: Ready Time Observations - Feb 2004
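Ready time is the time a guest's virtual CPU was ready to run but had to wait for a physical CPU. A minimal sketch of a commonly used conversion from a ready-time total in milliseconds to a percentage of the sampling interval follows. The formula, the 20-second interval, and the per-vCPU normalization are assumptions added here for illustration; they do not come from the slide.

```python
def cpu_ready_percent(ready_ms: float, interval_s: float, vcpus: int = 1) -> float:
    """Share of the sampling interval a guest spent waiting for a physical CPU.

    ready_ms   -- total ready time reported for the interval, in milliseconds
    interval_s -- length of the sampling interval, in seconds
    vcpus      -- number of virtual CPUs, used to average the total per vCPU
    """
    if interval_s <= 0 or vcpus < 1:
        raise ValueError("interval_s must be positive and vcpus at least 1")
    return ready_ms * 100.0 / (interval_s * 1000.0 * vcpus)


# Hypothetical sample: 2,000 ms of ready time in a 20-second interval.
print(f"{cpu_ready_percent(2_000, 20):.1f}% ready")
```

A guest can show modest CPU utilization and still run slowly if this percentage is high, because the waiting happens below the guest operating system.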
The Cloud Does Not Fix STUPID
• BUILDs promoted with test databases still embedded
• Undocumented off-campus transactions/special cases
• Un-benchmarked / un-measured BUILDs, particularly in high-volume environments
What Are You Gonna DO About It?
[Diagram: three monitoring layers spanning four tiers]
Monitoring layers:
• System, Host & Process Level Monitoring
• Platform “Deep Dive” Diagnostic Monitoring
• Business Transaction Management
Tiers:
• Desktop Tier
• Web Server Tier – Apache, IHS
• App Server Tier – WAS, Weblogic, Tomcat, JBoss
• DBase Server Tier – Oracle, DB2, SQL Server
Platforms: Linux/VMware, Solaris/Zones, AIX/LPAR, Windows/VMware
What Are You Gonna DO About It?
Performance Monitoring Products
[Diagram: monitoring products mapped onto the tiers and monitoring layers]
Tiers: Desktop Tier, Web Server Tier, App Server Tier, DBase Server Tier
Monitoring layers:
• System, Host & Process Level Monitoring
• Platform “Deep Dive” Diagnostic Monitoring
• Business Transaction Management
Products shown: CA-Introscope, AWR, Spotlight, Grid, Perfmon, UpTime, DTRACE, perfPMR, VIC, ESXPlot, BTM
System, Host, and Process Level Monitoring
• Continuous Monitors
  • Windows
  • Unix (and dialects) – sar & iostat
• Specialized Use Monitors
  • Network
    • Windows – Wireshark
    • Solaris – snoop
    • Linux – Wireshark
    • AIX – iptrace
  • OS
    • Windows – Perfmon/SysInternals
    • Solaris – DTrace
    • AIX – PerfPMR
    • VMware – esxtop/ESXplot
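To make the idea of a continuous monitor concrete, here is a minimal sketch of a sampler that keeps a bounded history of readings and can be queried after the fact. It is not how sar or any product above works internally. The class name, the history size, and the scripted readings are assumptions; `os.getloadavg()` is a real standard-library call but is available on Unix-like systems only.

```python
import os
import time
from collections import deque
from typing import Callable, Deque, List, Optional, Tuple

Sample = Tuple[float, float]  # (timestamp, value)


def sample_load() -> float:
    """1-minute load average (Unix-like systems only)."""
    return os.getloadavg()[0]


class ContinuousMonitor:
    """Keeps the most recent samples so history exists before a problem is reported."""

    def __init__(self, sampler: Callable[[], float], history: int = 720) -> None:
        self.sampler = sampler
        self.samples: Deque[Sample] = deque(maxlen=history)  # oldest samples drop off

    def poll(self, now: Optional[float] = None) -> None:
        """Take one reading; call this on a fixed schedule."""
        timestamp = time.time() if now is None else now
        self.samples.append((timestamp, self.sampler()))

    def breaches(self, threshold: float) -> List[Sample]:
        """Return the retained samples whose value exceeded the threshold."""
        return [(ts, value) for ts, value in self.samples if value > threshold]


# Demonstration with scripted readings instead of a live host.
scripted = iter([0.5, 3.2, 0.7])
monitor = ContinuousMonitor(lambda: next(scripted), history=2)
for tick in (1.0, 2.0, 3.0):
    monitor.poll(now=tick)

print(list(monitor.samples))  # only the two most recent samples are retained
print(monitor.breaches(1.0))
```

On a real Unix host, `ContinuousMonitor(sample_load)` polled from a scheduler would play the same role. The specialized-use tools in the list are the opposite case: they are switched on only while a problem is being investigated.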
Continuous Monitoring
Wikipedia: Perpetuum Mobile of Villard de Honnecourt (about 1230)
Intermittent Monitoring
In Case of Fire
Break Glass
Middleware Monitoring
• Continuous Monitors
  • Database
    • SQL Server – Perfmon (maybe)
    • Oracle – Automatic Workload Repository (AWR)
  • JVMs (Tomcat/JBoss)
    • CA-Wily
    • Compuware – Agent Vantage
    • DynaTrace
There Are No Standards for BTM Capabilities
Figure 4: BTM-capable Ham Sandwich
BTM High-Level Architecture (Typical multi-tier distributed environment)
[Diagram: a user request flows through Web Server, App Server (.Net/J2EE), MSG BUS, App Server, and DB Server. The diagram marks an observed tier and shows four Agent/DTE pairs. Transaction profiles are collected by the Management Server & Web GUI, which presents a transaction topology map.]
DTE = Dynamic Tier Extension
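As a sketch of how collected transaction profiles could be turned into a topology map, the example below counts tier-to-tier hops reported for each transaction. The record shape, the transaction IDs, and the function name are hypothetical; no specific BTM product's data format is implied.

```python
from collections import Counter
from typing import Iterable, Tuple

# Hypothetical hop record: (transaction_id, from_tier, to_tier)
Hop = Tuple[str, str, str]


def build_topology(hops: Iterable[Hop]) -> Counter:
    """Count observed hops between tiers; each key is one edge of the topology map."""
    return Counter((src, dst) for _txn_id, src, dst in hops)


observed_hops = [
    ("txn-1", "Web Server", "App Server"),
    ("txn-1", "App Server", "DB Server"),
    ("txn-2", "Web Server", "App Server"),
    ("txn-2", "App Server", "MSG BUS"),
]

topology_map = build_topology(observed_hops)
for (src, dst), count in sorted(topology_map.items()):
    print(f"{src} -> {dst}: {count} transaction hop(s)")
```

Because the map is derived from observed traffic rather than drawn by hand, it also surfaces paths that nobody documented.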
Dave’s Sure-Fire List of BTM Capabilities
• Horizontal view of aggregate and single transactions across all tiers of interest
• Resource consumption information from each monitored tier
• Auto-discovery of transaction path
• Capture path contribution of non-monitored tiers
• Continuous operation at volume
• Low transaction path overhead
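The fourth capability, capturing the contribution of non-monitored tiers, can be approximated by subtraction, as in the sketch below. The timings and names are invented for illustration, and the calculation assumes the per-tier times are exclusive (each tier reports only its own time, not time spent waiting on tiers downstream).

```python
from typing import Dict


def unmonitored_ms(end_to_end_ms: float, monitored_tier_ms: Dict[str, float]) -> float:
    """Time not accounted for by any monitored tier.

    Assumes each tier's figure is exclusive of downstream tiers; the result is
    clamped at zero in case of measurement noise.
    """
    monitored_total = sum(monitored_tier_ms.values())
    return max(0.0, end_to_end_ms - monitored_total)


# Hypothetical profile of a single transaction.
tier_times_ms = {"Web Server": 30.0, "App Server": 120.0, "DB Server": 250.0}
gap_ms = unmonitored_ms(480.0, tier_times_ms)

print(f"Unaccounted time: {gap_ms} ms")
```

A large unaccounted share points at network hops or tiers without agents, which is where the next round of instrumentation should go.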
Progress!!
KAL, “Blame Game”, accessed at: http://www.economist.com/printedition/displayCover.cfm?url=/images/images-magazine/2011/01/15/CN/20110115_CNA400.jpg on 15Jan2011
Figure 1: With the Cloud: Outsider’s Reality of IT Negotiations