toward an economic model of long-term storage - usenix · 2019. 2. 25. · toward an economic model...

1
Toward an Economic Model of Long-Term Storage Daniel C. Rosenthal UC Santa Cruz David S. H. Rosenthal Stanford University Libraries Ethan L. Miller UC Santa Cruz Ian F. Adams UC Santa Cruz Mark W. Storer NetApp Erez Zadok Stony Brook University Motivation A significant obstacle facing archival storage is how to finance the storage of digital data over the long term. Little is understood about the economic implications of various trade-offs involved in designing, implementing, and managing a digital archive. Study the impact of disruptive technologies on archival storage Study trade-offs between endowment size, protection level, and survivability Compare various storage media (disk, flash, cloud, etc.) for suitability in archival storage Experiment with various data and media capacity growth rates Examine the impact of financial events on archive survivability • Test various forecasting strategies • Explore ``what if?’’ scenarios Ongoing and Future Work Effect of HDD Service Lifetime on TCO Graph shows TCO breakdown as a function of disk service lifetime Devices have a constant failure probability during their service lifetime. ``Leave in-service until failure’’ is a suboptimal device replacement policy Modeling Real-World Events: Thailand Floods Floods modeled by a temporary spike in storage media cost • Graph shows endowment required for 95% chance of not running out of money for 100 years 0 5 10 15 20 25 30 35 40 Cost drop % 0 1 2 3 4 5 6 7 Years until spike 4 6 8 10 12 14 16 18 20 22 Endowment/Initial Cost Effect of Storage Density Growth Rate • Storage density growth rate has a non-linear influence on data storage costs • Graph shows endowment-to-initial-cost ratio required for 98% chance of not running out of money for 100 years 4 6 8 10 12 14 16 0 5 10 15 20 25 30 35 40 Endowment/Initial Cost Kryder rate %/yr Results Simulator Parameters General Simulation Length 10 years Time Step 30 days Data Initial Data 100 TB Growth Rate 57% annual growth Infrastructure Space Cost $800 / m 3 Power Capacity Cost $20,000 / kW Electricity Cost 12.78 cents / kWh Labor Cost $50 / hr Device Storage Medium Hard Drive Capacity Growth Rate Kryder’s Law* Power Draw 5 W Service Lifetime 7 years Failure Probability 0.05 Management Time Cost 1 hr / year Purchase Price $150 * Kryder’s Law: Annual doubling in device capacity Simulations use an endowment model of paying for storage • Infrastructure, labor, and electricity costs are modeled as perfectly scalable For models which consider time value of money, interest rates are modeled using the past 20 years Economic Model Overview Monte Carlo simulations used to model storage system evolution over time Simulations account for initial and operating storage system costs Simulations model single-replica storage with no redundancy (assumes failed data can be recovered from off-site) Tunable simulator parameters for: Various components of initial costs (storage devices, infrastructure) Various components of operating costs (electricity, labor) Data size and storage density growth rates Device power draw, service lifetime, and failure probability • Storage system utilization Goals Understand how various trade-offs affect storage system longevity Design a cost-accurate model of an archive Implement a simulator which reflects this model Estimate the probability of not running out of money with a given cash flow Sponsors

Upload: others

Post on 24-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Toward an Economic Model of Long-Term Storage - USENIX · 2019. 2. 25. · Toward an Economic Model of Long-Term Storage Daniel C. Rosenthal UC Santa Cruz David S. H. Rosenthal Stanford

Toward an Economic Model of Long-Term StorageDaniel C. Rosenthal

UC Santa CruzDavid S. H. Rosenthal

Stanford University LibrariesEthan L. MillerUC Santa Cruz

Ian F. AdamsUC Santa Cruz

Mark W. StorerNetApp

Erez ZadokStony Brook University

Motivation A signi�cant obstacle facing archival storage is how to �nance the

storage of digital data over the long term. Little is understood about the economic implications of various trade-o�s involved in designing, implementing, and managing a digital archive.

• Study the impact of disruptive technologies on archival storage• Study trade-offs between endowment size, protection level, and

survivability• Compare various storage media (disk, flash, cloud, etc.) for suitability

in archival storage

• Experiment with various data and media capacity growth rates• Examine the impact of financial events on archive survivability• Test various forecasting strategies• Explore ``what if?’’ scenarios

Ongoing and Future Work

E�ect of HDD Service Lifetime on TCO• Graph shows TCO breakdown as

a function of disk service lifetime

• Devices have a constant failure probability during their service lifetime.

• ``Leave in-service until failure’’ is a suboptimal device replacement policy

Modeling Real-World Events:Thailand Floods

• Floods modeled by a temporary spike in storage media cost

• Graph shows endowment required for 95% chance of not running out of money for 100 years 0 5 10 15 20 25 30 35 40

Cost drop % 0 1

2 3

4 5

6 7

Years until spike

4 6 8

10 12 14 16 18 20 22

Endo

wm

ent/

Initi

al C

ost

Effect of Storage Density Growth Rate• Storage density growth

rate has a non-linear influence on data storage costs

• Graph shows endowment-to-initial-cost ratio required for 98% chance of not running out of money for 100 years

4

6

8

10

12

14

16

0 5 10 15 20 25 30 35 40En

dow

men

t/In

itial

Cos

tKryder rate %/yr

Results

Simulator ParametersGeneral

Simulation Length 10 yearsTime Step 30 days

DataInitial Data 100 TBGrowth Rate 57% annual growth

InfrastructureSpace Cost $800 / m3

Power Capacity Cost $20,000 / kWElectricity Cost 12.78 cents / kWhLabor Cost $50 / hr

DeviceStorage Medium Hard DriveCapacity Growth Rate Kryder’s Law*Power Draw 5 WService Lifetime 7 yearsFailure Probability 0.05Management Time Cost 1 hr / yearPurchase Price $150

* Kryder’s Law: Annual doubling in device capacity

• Simulations use an endowment model of paying for storage

• Infrastructure, labor, and electricity costs are modeled as perfectly scalable

• For models which consider time value of money, interest rates are modeled using the past 20 years

Economic Model Overview• Monte Carlo simulations used to model storage system evolution over

time

• Simulations account for initial and operating storage system costs

• Simulations model single-replica storage with no redundancy (assumes failed data can be recovered from off-site)

• Tunable simulator parameters for: • Various components of initial costs (storage devices, infrastructure) • Various components of operating costs (electricity, labor) • Data size and storage density growth rates • Device power draw, service lifetime, and failure probability • Storage system utilization

Goals• Understand how various trade-offs affect storage system longevity• Design a cost-accurate model of an archive• Implement a simulator which reflects this model• Estimate the probability of not running out of money with a given

cash flow

Sponsors