storage for 2020 · storage for 2020 technologies shaping storage infrastructure...
TRANSCRIPT
Storage for 2020 Technologies Shaping Storage Infrastructure [email protected] Technical Director, CTO Office
27 May, 2015
© 2015 NetApp, Inc. All rights reserved..
Exponential Growth of Computing
http://www.singularity.com/charts/page70.html
Fact, or Fiction?
© 2015 NetApp, Inc. All rights reserved.
© 2015 NetApp, Inc. All rights reserved. 3
© 2015 NetApp, Inc. All rights reserved.
Networked Storage
© 2015 NetApp, Inc. All rights reserved. 5
Object Store
NAS/ SAN
Ext-DAS
PCIe Flash
Archival Drives use Shingling
http://www.slideshare.net/hgststorage/navigating-storage-in-a-cloudy-environment
Write-head wider than Read-head
© 2015 NetApp, Inc. All rights reserved. 6
Archival Drives
§ Summary § ~16TB capacity through end of the decade § 30% reduction in RPM (as compared to SATA) § ½ the cost of SATA § Cannot be in continuous operation – ~30% duty cycle § 200TB Workload limit as compared to 3,000TB for NL SATA
§ Good § Very low cost, high capacities § Duty cycle will force the data center to run cooler
§ Challenges § Traditional RAID will not work; traditional scrubs and resiliency techniques will not work § Impact on storage efficiency § Typically a whole track needs to be written (~256MB); much like an extreme FTL § Random overwrites do not work – much like a Tape
© 2015 NetApp, Inc. All rights reserved 7
© 2015 NetApp, Inc. All rights reserved.
SCM
§ Summary § 30x better resilience (than the best Flash) § 10x the cost (of cheapest Flash) § 1/10th the media latency (of the fastest Flash)
§ Good § Opportunity to reduce the server footprint due to increased transaction rates § Run the data center cooler (no refresh)
§ Challenges § Applications/ libraries need to coordinate the data movement across volatile and
protected regions of the CPU complex. Transaction complexity? § Software overheads of IO stack are very high § New fail-safe data structure variants need to evolve § Much higher troubleshooting complexity § Will be till end of the decade before the tool-chains and ecosystem matures
© 2015 NetApp, Inc. All rights reserved. 9
© 2015 NetApp, Inc. All rights reserved
Persistence Moves to Host
§ Non-MPHA hardware
§ Distributed shared-nothing architecture is mandated
§ RDMA starts to be really needed … finally! § Also leveraged by NVMe over Fabrics
© 2015 NetApp, Inc. All rights reserved. 11
© 2015 NetApp, Inc. All rights reserved.
Where are we today?
§ Performance: NVMe devices on the PCIe bus § 600K sustained Read IOPs § Soon at 10us access latency § Will incumbent applications be able to leverage this?
§ Capacity Flash: cMLC § ~2K Program-Erase cycles § Need to dramatically reduce the write-amplification
© 2015 NetApp, Inc. All rights reserved. 13
Virtualization 2.0?
§ Application-granular data management
§ Actually, “application container” driven
§ Different approaches – much higher application density § VVOL (VMware) like approach § Docker/ CoreOS (Linux Namespace and Cgroups)
§ Evolution of Virtio based IO § The control point is compelling for low latency media operations
© 2015 NetApp, Inc. All rights reserved. 14
Scale as a “Data Management” Attribute
§ On the scale that Moore’s Law is linear, the data growth is on an exponential trajectory § Only way to offset it is to scale-out
§ Leverage Server Supply Chain to lower cost § Even then so, there would be “Performance Blades” and “Capacity Blades”
§ Too many moving parts § Increases management complexity § Troubleshooting § Unless …
© 2015 NetApp, Inc. All rights reserved. 15
Pack in the Intelligence of a Human
§ Autonomic Data Management @Scale § Capacity Balancing § Hot-spot elimination § Auto Recovery § Service-level Restoration § De-clustered “RAID” § Topology aware data placement
§ Simplified Namespaces § No storage-admin visible containers § Single large data pools
§ Also enables migration into the Public Infrastructure (aka Cloud)
© 2015 NetApp, Inc. All rights reserved 16
Before Security comes Provenance!
§ Tracking data and lineage – tagging volumes, VMs, LUNs, Objects
§ Allows one to enforce data migration compliance across the data fabric § Restricting data movement across Geo boundaries § Restricting certain kinds of data to be moved outside the Data Center § That will also help to determine what is being stored
§ This tagging provides explicit information about data objects to storage § Only economics can drive this behavior § Implicit information has been available through system metadata and access patterns
© 2015 NetApp, Inc. All rights reserved. 17
In Closing …
§ Best of breed storage will continue to evolve – one size does not fit all
§ Latency sensitive IOPs will be served closer to the host
§ Virtualization will evolve into making protocols irrelevant
§ Autonomic operations help simplify storage, reduce cost
§ Cloud economics will drive explicit intelligence about data
© 2015 NetApp, Inc. All rights reserved. 18