
Storage Performance and IO Load Basics

Leah Schoeb, Vice Chair, SNIA Technical Council

SNIA Emerald™ Training

SNIA Emerald Power Efficiency Measurement Specification, for use in EPA ENERGY STAR®

June 24-27, 2013


SNIA Emerald™ Training ~ June 24-27, 2013 www.sniaemerald.com

Topics

- Today's Impact on Storage Performance
- Storage Performance Planning
- Troubleshooting Methodology and Basic Metrics


IO Performance Needs Monitoring at Every Level

Stack, top to bottom: Application, Guest OS, Hypervisor, Physical Server, Connectivity, Storage

- Application Level: app-specific performance tools/stats
- Guest OS: CPU utilization, memory utilization, I/O latency
- Virtualization Level: performance metrics/charts; limits, shares, virtualization contention
- Physical Server Level: CPU and memory saturation, power saving
- Connectivity Level: network/FC switches and data paths; packet loss, bandwidth utilization
- Storage Level: SAN or NAS devices; utilization, latency, throughput


Storage Performance Planning


Planning for Performance

- Storage Planning
- Workload Behavior
- Storage Optimization


Storage Planning

- Understand the workload
- Sharing or consolidation
- Storage protocol options
  - File, block, or object
- Data reduction options
  - Thin provisioning
- Data protection
- Other storage technology trade-offs

[Diagram: Server Infrastructure with HBA1-HBA4, an FC Switch, SP1 and SP2, and elements numbered 1-4]


Rotating Media Selection

Drive Type       | Speed (RPM) | MB/sec | IOPS | Latency | LC Manage
FC 4Gb           | 15k         | 150    | 200  | 5.5 ms  | High Perf. Trans
FC 4Gb           | 10k         | 75     | 165  | 6.8 ms  | High Perf. Trans
SAS (6Gb, 12Gb)  | 10k         | 150    | 185  | 12.7 ms | Streaming
SATA (6Gb, 12Gb) | 7200        | 140    | 38   | 12.7 ms | Streaming/Nearline
SATA             | 7200        | 68     | 38   | 12.7 ms | Nearline

[Diagram: RAID options (RAID 0, RAID 5, RAID 6, RAID 10) with relative cost markers ranging from $ to $$$$]


Solid State Storage

- Not all SSDs are designed the same
  - NAND-based flash memory
  - DRAM-based (random access memory)
  - Enterprise flash drives (EFDs)
  - Hybrid drives
- Performance varies widely
  - Capacity
  - Compression
  - Wear leveling
  - Error correction and bad block mapping
  - Metadata management
  - Garbage collection
  - Encryption


Solid State Storage

NAND Flash

Metric                 | SLC                 | MLC
Latency (microseconds) | 100                 | 200-300
Persistence            | 10x more persistent | Less reliable*
Cost                   | 30% more expensive  | More cost effective
Sequential read/writes | 3x faster           | Slower

*This can be overcome, or even reversed, by an internal design that uses higher over-provisioning, interleaving, and changes to the write algorithms.


Virtualize to consolidate

[Diagram: Physical vs. Virtual. Three Win2k3 SQL servers, each with 5 disks, are consolidated onto ESXi servers under vSphere, each with its own VMDK.]


Over Provisioning

- With thick provisioning it is easy to over-provision.
- You may want to consider thin provisioning.
- Most vendors offer thin provisioning.

[Diagram: a 500GB volume, a 300GB VMFS, and a 150GB VMDK; legend: unused by users, unused by admin, actual space used]
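The three sizes on this slide can be turned into simple allocation ratios. The sketch below is only an illustration: it assumes the sizes are nested (the volume holds the VMFS, which holds the VMDK), and the function name `layer_ratios` is hypothetical.

```python
def layer_ratios(sizes_gb):
    """For each nested layer after the first, return the fraction of its parent it occupies."""
    return [inner / outer for outer, inner in zip(sizes_gb, sizes_gb[1:])]

# Sizes from the slide, outermost first (the nesting itself is an assumption).
sizes_gb = [500, 300, 150]  # volume, VMFS, VMDK

ratios = layer_ratios(sizes_gb)
end_to_end = sizes_gb[-1] / sizes_gb[0]

print("fraction of each parent layer passed down:", ratios)
print("fraction of the volume that reaches the VMDK:", end_to_end)
```

Each thick-provisioned layer strands part of the layer above it, which is the waste thin provisioning is meant to reduce.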


Planning for Performance

- Storage Planning
- Workload Behavior
- Storage Optimization


I/O Workload Activity vs. Response Time: Supply and Demand

As I/O workloads increase, so does the response time.

[Chart: response time vs. I/O workload activity, with an acceptable response time threshold separating "Meeting SLA" from "Not Meeting SLA"]
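One common way to illustrate this relationship is the single-server queueing approximation R = S / (1 - U), where S is the service time and U is utilization. That model is brought in here as an assumption for illustration; it does not come from the slides, and the 5 ms service time and 20 ms threshold below are made-up values.

```python
def response_time_ms(service_ms, utilization):
    """Approximate response time for a single-server queue: R = S / (1 - U)."""
    if not 0 <= utilization < 1:
        raise ValueError("utilization must be in [0, 1)")
    return service_ms / (1.0 - utilization)

def meets_sla(service_ms, utilization, threshold_ms):
    """True while the modeled response time stays at or under the threshold."""
    return response_time_ms(service_ms, utilization) <= threshold_ms

# Hypothetical device: 5 ms service time, 20 ms acceptable response time.
for u in (0.10, 0.50, 0.80, 0.90, 0.95):
    r = response_time_ms(5.0, u)
    print(f"utilization={u:.2f} response={r:.1f} ms meets_sla={meets_sla(5.0, u, 20.0)}")
```

Under this model the response time rises slowly at low load and steeply as utilization approaches 100%, which is why the threshold is crossed well before the device is fully busy.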


Seasonal/Periodic Performance Surges

[Chart: workload by month (Jan-Dec) showing normal workload activity and seasonal workload surges, with regions marked "Meeting SLA" and "Not Meeting SLA"]


Single vs. Multi-threaded Applications

[Diagram: I/Os of 1 ms each. Single threaded: the I/Os complete one after another, total = 8 ms. Multi threaded: the I/Os complete in parallel, total = 1 ms.]
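The 8 ms vs. 1 ms comparison can be reproduced with a small idealized model. This sketch assumes eight I/Os of 1 ms each and assumes the storage serves concurrent I/Os with no slowdown; `total_time_ms` is a hypothetical helper name.

```python
import math

def total_time_ms(num_ios, service_ms, concurrency):
    """Elapsed time when `concurrency` I/Os are in flight at once and each takes service_ms."""
    if concurrency < 1:
        raise ValueError("concurrency must be at least 1")
    waves = math.ceil(num_ios / concurrency)  # batches of I/Os issued back to back
    return waves * service_ms

single_threaded = total_time_ms(num_ios=8, service_ms=1, concurrency=1)
multi_threaded = total_time_ms(num_ios=8, service_ms=1, concurrency=8)

print("single threaded:", single_threaded, "ms")
print("multi threaded:", multi_threaded, "ms")
```

Real devices rarely scale this cleanly, so the multi-threaded figure is a best case rather than a prediction.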


I/O Queue Depth

- The number of I/O requests waiting to be completed
  - Also known as outstanding I/Os
- Limiting host I/O demands
  - Certain applications, under extreme load, can gain performance by increasing the I/O queue depth
- Accepting requests from the application
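Queue depth, IOPS, and response time are tied together by Little's Law: average outstanding I/Os = IOPS x response time. The sketch below applies that relationship; the function names and the sample numbers are illustrative assumptions, not values from the slides.

```python
def outstanding_ios(iops, response_time_ms):
    """Little's Law: average number of I/Os in flight."""
    return iops * (response_time_ms / 1000.0)

def max_iops(queue_depth, response_time_ms):
    """Upper bound on IOPS for a fixed queue depth and per-I/O response time."""
    return queue_depth / (response_time_ms / 1000.0)

# Hypothetical workload: 2000 IOPS at 4 ms per I/O.
print("outstanding I/Os:", outstanding_ios(2000, 4))

# Hypothetical limit: queue depth 32 at 8 ms per I/O.
print("IOPS ceiling:", max_iops(32, 8))
```

This is why raising the queue depth can help an application under heavy load: with response time held roughly constant, a deeper queue allows more I/Os per second.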


Skew

- Asymmetry of a distribution about its mean, or the non-uniform distribution of data or I/O activity across storage devices.
- New storage technologies are handling this automatically
- Disk skew
  - An area of the disk has higher amounts of activity
  - Referred to as a "hot spot"
  - Data is accessed more frequently
- Controller skew
  - A controller has a higher amount of activity compared to the rest of the controllers in a storage system.
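Controller or disk skew can be quantified from per-device I/O counts. This is a minimal sketch with made-up counts; `skew_report` and the two indicators it returns (share of total I/O and peak-to-mean ratio) are one possible choice, not a standard definition.

```python
def skew_report(io_counts):
    """Summarize how unevenly I/O is spread across devices or controllers."""
    total = sum(io_counts.values())
    mean = total / len(io_counts)
    busiest = max(io_counts, key=io_counts.get)
    return {
        "busiest": busiest,
        "share": io_counts[busiest] / total,        # fraction of all I/O on the busiest unit
        "peak_to_mean": io_counts[busiest] / mean,  # 1.0 means perfectly even
    }

# Hypothetical I/O counts for two controllers over the same interval.
controllers = {"SP1": 9000, "SP2": 1000}
print(skew_report(controllers))
```

A peak-to-mean ratio close to 1.0 indicates balanced activity; larger values point at a hot spot.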


Misalignment

[Diagram: three stacked layers, Cluster (NTFS, inside the VMDK), Block (VMFS), and Chunk (SAN), shown before and after partition alignment. Before alignment, cluster boundaries do not line up with block and chunk boundaries; after alignment they do.]
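The cost of misalignment can be shown by counting how many array chunks a single I/O touches. The sketch below assumes a 64 KiB chunk and compares a partition starting at sector 63 (a common legacy default, used here as an assumption) with one starting on a chunk boundary; `chunks_touched` is a hypothetical helper.

```python
def chunks_touched(partition_start, io_offset, io_size, chunk_size):
    """Count the array chunks spanned by one I/O issued at io_offset within the partition."""
    first = (partition_start + io_offset) // chunk_size
    last = (partition_start + io_offset + io_size - 1) // chunk_size
    return last - first + 1

CHUNK = 64 * 1024            # assumed array chunk size in bytes
MISALIGNED_START = 63 * 512  # partition begins at sector 63
ALIGNED_START = 64 * 1024    # partition begins on a chunk boundary

# One chunk-sized I/O at the start of the partition.
print("misaligned:", chunks_touched(MISALIGNED_START, 0, CHUNK, CHUNK), "chunks")
print("aligned:", chunks_touched(ALIGNED_START, 0, CHUNK, CHUNK), "chunks")
```

In the misaligned case every chunk-sized request straddles two chunks, so the array does extra back-end work for the same host I/O.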


Workload Consolidation

- Group similar workloads together (random with random, and sequential with sequential).
- Mixing sequential with random can hurt sequential workload throughput.
- Too many sequential threads on a LUN will appear as a random workload to the storage.

[Diagram: groups of Random and Sequential workloads; the mixed grouping and the grouping with many sequential threads are each labeled "Negative Impact on Sequential Perf."]
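The point about many sequential threads looking random can be demonstrated by interleaving several sequential streams and checking how many requests still follow directly from the previous one. The round-robin arrival order, the region size, and both function names below are assumptions made for this sketch.

```python
def interleave_streams(num_streams, ios_per_stream, io_size, region_size):
    """Round-robin interleave of sequential streams, each in its own region of the LUN."""
    offsets = []
    for i in range(ios_per_stream):
        for s in range(num_streams):
            offsets.append(s * region_size + i * io_size)
    return offsets

def sequential_fraction(offsets, io_size):
    """Fraction of requests that start exactly where the previous request ended."""
    if len(offsets) < 2:
        return 1.0
    hits = sum(1 for prev, cur in zip(offsets, offsets[1:]) if cur == prev + io_size)
    return hits / (len(offsets) - 1)

IO_SIZE = 64 * 1024
REGION = 1024 ** 3  # each stream works in its own 1 GiB region (assumed)

for streams in (1, 2, 4):
    mixed = interleave_streams(streams, 8, IO_SIZE, REGION)
    print(streams, "stream(s): sequential fraction =", sequential_fraction(mixed, IO_SIZE))
```

Each stream is perfectly sequential on its own, yet the combined request order seen by the LUN has no contiguous pairs once more than one stream is active.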


Mixed Workloads

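Workload            | Access pattern
OLTP                | R (random)
Video Streaming     | S (sequential)
Email               | R (random)
Email               | R (random)
Satellite Streaming | S (sequential)
Log Streaming       | S (sequential)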


Planning and Best Practices

- Storage Planning
- Workload Behavior
- Storage Optimization


Optimizing Storage

- Over 80% of storage-related performance problems stem from misconfigured storage hardware
  - Consult SAN configuration best practice guides
  - Ensure disks are correctly distributed
  - Ensure the appropriate controller cache is enabled
  - Count the cost in choosing a level of protection

[Diagram: Server Infrastructure with HBA1-HBA4, an FC Switch, and an IBM storage array with SP1 and SP2 and elements numbered 1-4]


Optimizing Storage

- Avoid negatively impacting high-volume sequential performance
- Choose the storage protocol that best fits requirements and needs
- Use the hypervisor filesystem (VMFS, ZFS, SMB3, etc.)
  - No overhead compared to RDM (physical or virtual)
- Thin provisioning
  - Use when possible to help prevent over-provisioning
  - No performance impact compared to thick
- Are other departments sharing a RAID set?


101 BASICS

Troubleshooting Methodology Storage Performance


Performance Methodology

- Modeling: get to know your workload
- Measure & Monitor: use tools
- Performance Analysis
- Improve Performance
- Validate Success


Understanding Your Workload

- Workload Indicators
  - Demand for resources vs. resources currently used
  - Result is a percentage of workload
    - Low latency number is good: object has the resources it needs
    - Can go above 100%: object is "starving"
  - Workload summarized across critical resources
- Workload Details View
  - Detailed understanding of the lacking resource and associated metrics
  - View the state of the peer and parent objects and troubleshoot
    - Am I a victim or a villain?
    - Is this a population problem?
    - Should we move the VM?
    - A configuration issue?
    - Lack of resources?
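The workload percentage described above can be sketched as demand divided by the resources available to the object. The denominator, the choice to summarize by the worst resource, and all names and numbers below are assumptions for illustration, not the method of any particular monitoring product.

```python
def workload_pct(demand, available):
    """Workload as a percentage: demand relative to the resources available."""
    return 100.0 * demand / available

def assess(resources):
    """resources maps a resource name to (demand, available) in the same unit."""
    report = {name: workload_pct(d, a) for name, (d, a) in resources.items()}
    worst = max(report, key=report.get)  # summarize by the most constrained resource
    status = "starving" if report[worst] > 100.0 else "has the resources it needs"
    return worst, report[worst], status

# Hypothetical VM: demand vs. available for three critical resources.
vm = {"cpu_mhz": (1200, 2000), "memory_mb": (3000, 4096), "disk_iops": (900, 600)}
print(assess(vm))
```

A result above 100% for any resource means demand exceeds what the object can get, which is the "starving" case on the slide.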


Understanding Your Workload

Analyze all resource dimensions. Model activity.

[Chart: Server Hourly Utilization. X-axis: Hour of Day (0-23). Y-axis: % Utilization (0-100). Series: CPU, Disk I/O, Network I/O, Memory.]


Performance Methodology

- Modeling: get to know your workload
- Measure & Monitor: use tools
- Performance Analysis
- Improve Performance
- Validate Success


Approach to Real-Time Performance Management

3rd Generation: Holistic, Real-Time Analytics

- Flexible INTEGRATION to many data sources
- Enterprise SCALABILITY
- Patented performance ANALYTICS
- Powerful information DASHBOARDS

"I can put all my monitoring tools to good use and get better performance analytics."


Infrastructure vs. Operations: Impacts on storage performance & efficiency

Physical Datacenters
- Capacity: dedicated resources, pre-committed
- Performance: app = host, limited movement
- Configuration: static, pre-configured, 1-1 mapping

Virtual Datacenters
- Capacity: resource pooling; minimize waste, prevent stress
- Performance: resource abstraction, mobility
- Configuration: on-demand, self-service; dependencies

[Diagram labels: INFRASTRUCTURE, OPERATIONS, Cloud OS (Setup, HA, vMotion, DRS, Update, Distr. S/W, I/O Control, Inventory)]


Performance Methodology

- Modeling: get to know your workload
- Measure & Monitor: use tools
- Performance Analysis
- Improve Performance
- Validate Success


Basic Metrics

- Performance (data at work): I/Os per second (IOPS)
- Throughput (data on the move): megabytes or gigabytes per second (MB/sec, GB/sec)
  - Network throughput: megabits or gigabits per second (Mbps, Gbps)
- Idle (data at rest)
- Response time
  - HDDs: milliseconds (ms)
  - SSS: microseconds
  - Overall response times: milliseconds (ms)
- Retries
- Queue depth
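IOPS and throughput are linked by the I/O size, and storage and network figures differ by a factor of eight (bytes vs. bits). The sketch below shows both conversions using decimal megabytes; the function names and sample values are illustrative assumptions.

```python
def throughput_mb_s(iops, io_size_bytes):
    """Throughput in MB/sec (decimal megabytes) for a given IOPS rate and I/O size."""
    return iops * io_size_bytes / 1_000_000

def mb_s_to_mbps(mb_per_s):
    """Convert storage throughput (megabytes/sec) to network throughput (megabits/sec)."""
    return mb_per_s * 8

# Hypothetical workload: 200 IOPS of 8 KiB requests.
mb_s = throughput_mb_s(200, 8 * 1024)
print("throughput:", mb_s, "MB/sec")
print("on the wire:", mb_s_to_mbps(mb_s), "Mbps")
```

The same device can look fast in IOPS and slow in MB/sec, or the reverse, depending on the request size, so both metrics should be reported together.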


Basic Metrics

- Power performance: I/Os per watt
- Write coalescing
  - Combining several or many small blocks into one large block, then writing that single large block to disk
- Hard Disk Drive Service Time:
  - Seek: the initial operation a disk performs to place the read/write head on the right track of a disk drive.
  - Latency (Rotational Latency): the secondary operation that occurs after the "seek", which is the time it takes for the data to reach the read/write head of a disk drive.
  - Transfer Time: the time it takes for data to be read from or written to the host after seek and latency.
  - Service Time = Seek + Latency + Transfer Time
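The service time formula above can be worked through numerically. In this sketch the average rotational latency is taken as half a revolution, and the seek time, I/O size, and media rate are made-up inputs; only the formula itself comes from the slide.

```python
def avg_rotational_latency_ms(rpm):
    """Average rotational latency: the time for half of one revolution."""
    return 0.5 * 60_000.0 / rpm

def transfer_time_ms(io_size_bytes, media_rate_mb_s):
    """Time to move the data at the given media rate (decimal MB/sec)."""
    return io_size_bytes / (media_rate_mb_s * 1_000_000.0) * 1000.0

def service_time_ms(seek_ms, rpm, io_size_bytes, media_rate_mb_s):
    """Service Time = Seek + Latency + Transfer Time."""
    return (seek_ms
            + avg_rotational_latency_ms(rpm)
            + transfer_time_ms(io_size_bytes, media_rate_mb_s))

# Hypothetical 15k RPM drive: 3.5 ms average seek, 8 KiB I/O, 150 MB/sec media rate.
svc = service_time_ms(3.5, 15_000, 8 * 1024, 150)
print("service time:", round(svc, 3), "ms")
print("approx. IOPS at queue depth 1:", round(1000.0 / svc))
```

For small random I/O the seek and rotational terms dominate, which is why faster spindles raise IOPS more than a faster interface does.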


Identifying Unhealthy Storage

Metric                 | Described                                                    | Threshold
Average device latency | Latencies from the storage system                            | 10-15 ms
Average kernel latency | Latencies from the kernel's I/O subsystem                    | 1-2 ms
Aborts and retries     | Can't keep up with demand and times out, or something broke  | 1
Response time          | Overall application or OS response time                      | Many I/Os above 10 ms
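The thresholds in this table can be applied to collected samples with a small check. The sketch below makes several assumptions that are not in the slides: it uses the upper end of each range as the cutoff, and it interprets "many" as a configurable fraction (`slow_fraction`, a hypothetical parameter) of I/Os slower than 10 ms.

```python
def find_problems(device_latency_ms, kernel_latency_ms, aborts,
                  response_times_ms, slow_fraction=0.5):
    """Return the names of metrics that cross the thresholds in the table."""
    problems = []
    if device_latency_ms > 15.0:   # upper end of the 10-15 ms range (assumed cutoff)
        problems.append("average device latency")
    if kernel_latency_ms > 2.0:    # upper end of the 1-2 ms range (assumed cutoff)
        problems.append("average kernel latency")
    if aborts >= 1:
        problems.append("aborts and retries")
    slow = sum(1 for t in response_times_ms if t > 10.0)
    if response_times_ms and slow / len(response_times_ms) >= slow_fraction:
        problems.append("response time")
    return problems

# Hypothetical sample: slow device, healthy kernel path, no aborts.
print(find_problems(20.0, 0.5, 0, [2.0, 3.0, 12.0]))
```

High device latency with low kernel latency points at the array or the fabric, while the reverse points at queuing inside the host.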


Performance Methodology

- Modeling: get to know your workload
- Measure & Monitor: use tools
- Performance Analysis
- Improve Performance
- Validate Success


Monitor and Validate Success

- Does your application continue to meet its SLA?
- Do known activities perform the same or better?
- Check and monitor key performance counters
- Are business and application owners satisfied?


101 BASICS

I/O Generator Tools


I/O Generators - IOmeter

- I/O disk testing tool
  - Uniform distributions (speeds and feeds) ONLY
  - Built originally to measure server-side disk storage
- IOmeter was formerly known as "Intel's Galileo".
- IOmeter does for a computer's I/O subsystem what a dynamometer does for an engine (block only)
  - It measures performance under a controlled load.
- Measures
  - Performance and throughput of disk and network controllers.
  - Bandwidth and latency capabilities of buses.
  - Shared bus performance.
  - System-level hard drive and network performance.


I/O Generators - IOmeter

- An access pattern mainly contains the following parameters:
  - Transfer Request Size: the minimal data unit to which the test can apply.
  - Percent Random/Sequential Distribution: the percentage of random requests.
  - Percent Read/Write Distribution: the percentage of requests for reading (read/write ratio).
  - # of Outstanding I/Os: defines the number of simultaneous I/O requests for the given worker and, correspondingly, the disk load.
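The four parameters above can be modeled as a small data structure that drives a request generator. This is a hedged sketch of the idea only; it is not IOmeter's implementation or file format, and `AccessPattern` and `generate_requests` are hypothetical names.

```python
import random
from dataclasses import dataclass

@dataclass
class AccessPattern:
    transfer_size: int    # bytes per request
    pct_random: int       # 0-100, share of requests at a random offset
    pct_read: int         # 0-100, share of requests that are reads
    outstanding_ios: int  # simultaneous requests per worker (recorded, not simulated here)

def generate_requests(pattern, count, target_size, seed=0):
    """Produce (operation, byte_offset) pairs that follow the access pattern."""
    rng = random.Random(seed)
    slots = target_size // pattern.transfer_size
    next_slot = 0
    requests = []
    for _ in range(count):
        if rng.random() * 100 < pattern.pct_random:
            slot = rng.randrange(slots)   # random request
        else:
            slot = next_slot              # sequential: continue after the previous request
        next_slot = (slot + 1) % slots
        op = "read" if rng.random() * 100 < pattern.pct_read else "write"
        requests.append((op, slot * pattern.transfer_size))
    return requests

# Hypothetical pattern: 4 KiB, fully sequential, 100% read, one outstanding I/O.
pattern = AccessPattern(transfer_size=4096, pct_random=0, pct_read=100, outstanding_ios=1)
print(generate_requests(pattern, 5, target_size=1024 * 1024))
```

Changing `pct_random` and `pct_read` independently makes it clear that the random/sequential mix and the read/write mix are separate knobs.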


I/O Generators - Vdbench

- I/O workload generator
  - Both uniform and non-uniform distributions
  - Built to measure storage systems
- Generates and measures storage performance (block or file)
- Collects and replays real-world enterprise application workloads with the addition of SWAT
- Swiss army knife of I/O generators
- Java-based; ported to most major operating systems
  - Unix, Linux, Windows, etc.


I/O Generators - Summary

- Many I/O generators
- Uniform vs. non-uniform distributions
- Skew
- Replay real-world workloads
- Measuring a disk vs. a storage system
- Measuring block vs. file


Thank You

Leah Schoeb [email protected]

Twitter: @vLeahSchoeb