network coding for distributed storage systems ieee transactions on information theory, september...

Network Coding for Distributed Storage Systems

IEEE TRANSACTIONS ON INFORMATION THEORY, SEPTEMBER 2010

Alexandros G. Dimakis

Brighten Godfrey

Yunnan Wu

Martin J. Wainwright

Kannan Ramchandran

Outline

ه Introductionه Backgroundه Analysisه Evaluationه Conclusion

Introduction

ه Distributed storage systems provide reliable access to data through redundancy spread over individually unreliable nodes.

ه Storing data in distributed storage systemsه the encoded data are spread across nodes.ه require less redundancy than replication.ه replace stored data periodically.

Introduction

ه Key issue in distributed storage systems.ه repair bandwidthه storage space

ه How to generate encoded data in a distributed way as little data as possible ?

MDS Codes

ه A common practice to repair from a single node failure for an erasure coded system.1. a new node to reconstruct the whole encoded data object.

2. then, generate just one encoded block.

ه Maximum Distance Separable (MDS) code.ه (n, k)-MDS propertyه recover original file by any k set of encoded data.

MDS Codes

File divide

encodestore at n nodes

MDS encode

Introduction

ه Redundancy must be continually refreshed as nodes fail in distributed storage systems.ه large data transfers across the network.

Introduction

ه The erasure codes can be repaired without communicating the whole data object.

ه (4, 2)-MSR example when node is fail.ه generate smaller parity packets of their data.ه forward them to the newcomer.ه the newcomer mix packets to generate two new packets.

0.50.50.50.5

Introduction

ه This paper identifies that there is a optimal tradeoff curve between storage and repair bandwidth.ه smaller storage space => less redundancy => more repair

bandwidth

ه This paper calls codes that lie on this optimal tradeoff curve regenerating codes.

Introduction

ه Minimum-Storage Regenerating (MSR) codes.ه can be efficiently repaired.

ه Minimum-Bandwidth Regenerating (MBR) codes.ه storage node stores slightly more than M/k .ه the repair bandwidth can be reduced.

Outline

Erasure Codes

ه Classical coding theory focuses on the tradeoff between redundancy and error tolerance.

ه In terms of the redundancy-reliability tradeoff, the Maximum Distance Separable (MDS) codes are optimal.ه the most well-known is Reed-Solomon codes.

Network Coding

ه Network coding allows ه the intermediate nodes to generate output data by encoding

previously received input data.ه information to be “mixed” at intermediate nodes.

ه This paper investigates the application of network coding for the repair problem in distributed storage.ه tradeoff between storage and repair network bandwidth

Distributed Storage Systems

ه Erasure codes could reduce bandwidth use by an order of magnitude compared with replication.

ه Hybrid strategy: ه one special storage node maintains one full replica.ه multiple erasure encoded data.ه transfer only M / k bytes for a new encoded data by replica node.ه there is the problem when replica data lost.

Outline

Information Flow Graph

Storage-Bandwidth Tradeoff

ه The normal redundancy we want to maintain requires active storage nodesه each storing α bitsه β bits each from any d surviving nodesه total repair bandwidth is γ = d β

ه For each set of parameters (n, k, d, α, γ), there is a family of information flow graphs, each of which corresponds to a particular evolution of node failures / repairs.

ه Denote this family of directed acyclic graphs by

ه (4, 2, 3, 1 Mb, 1.5 Mb) is feasible.

ه Theorem 1 : For any α ≥ α*(n, k, d, γ), the points are feasible.

Theorem Proof (1/4)

Theorem Proof (2/4)

Theorem Proof (3/4)

Theorem Proof (4/4)

ه Code repair can be achieved if and only if the underlying information flow graph has sufficiently large min-cuts.

ه Optimal tradeoff curve between storage α and repair bandwidth γه (γ = 1, α = 0.2) (γ = 1, α = 0.1)

Special Cases (1/2)

ه Minimum-Storage Regenerating (MSR) Codes

Special Cases (2/2)

ه Minimum-Bandwidth Regenerating (MBR) Codes

Outline

ه Introductionه Backgroundه Analysisه Evaluation

ه Node Dynamics and Objectivesه Modelه Quantitative Results

ه Conclusion

Node Dynamics and Objectives (1/2)

ه A permanent failureه the permanent departure of a node from the systemه a disk failure resulting in loss of the data stored on the node

ه A transient failureه node rebootه temporary network disconnection

Node Dynamics and Objectives (2/2)

ه A file is availableه it can be reconstructed from the data stored on currently available

nodes.

ه A file is durabilityه after permanent node failures, it may be available at some point in

the future.

Model (1/5)

ه The model has two key parameters, f and a.ه a fraction f of the nodes storing file data fail permanently per unit

time.ه at any given time, the node storing data is available with some

probability a.

ه The expected availability and maintenance bandwidth of various redundancy schemes can be computed to maintain a file of M bytes.

Model (2/5)

ه Replicationه redundancy R replicasه store total R M bytesه replace f R M bytes per unit timeه the file is unavailable if no replica is available

ى probability

ه Ideal Erasure Codesه n = k R, redundancy R n / kه transfer just M / k bytes each packetه replace f R M bytes per unit timeه unavailability probability

Model (3/5)

ه Hybridه n = k (R− 1)ه store total R M bytesه transfer f R M bytes per unit timeه The file is unavailable if the replica is unavailable and fewer than

k erasure-coded packets are availableى probability

Model (4/5)

ه Minimum-Storage Regenerating Codesه store total R M bytesه redundancy R n / kه replace f R M bytes per unit timeه extra amount of informationه unavailability

Model (5/5)

ه Minimum-Bandwidth Regenerating Codesه store total M n bytesه redundancy R n / kه replace f M n bytes per unit timeه extra amount of informationه unavailability

Estimating f and a

Quantitative Results (1/2)

Quantitative Results (2/2)

Quantitative Comparison

ه Comparison With Hybridه Disadvantage : asymmetric design

ه MBR codesه Disadvantage :

ى reconstruct the entire file, requires communication with n1 nodesى if the reading frequency of a file is sufficiently high and k is sufficiently small,

this inefficiency could become unacceptable.

Outline

Conclusion

ه This paper presented a general theoretic framework that can determine the information.ه communicate to repair failures in encoded systems.ه identify a tradeoff between storage and repair bandwidth.

ه One potential application area for the proposed regenerating codes is distributed archival storage or backup.ه regenerating codes potentially can offer desirable tradeoffs in

terms of redundancy, reliability, and repair bandwidth.

network coding for distributed storage systems ieee transactions on information theory, september...

replica data

new encoded data

little data

output data

input data

stored data

encoded data object

repair bandwidth storage

Documents

repairable fountain codes megasthenis asteris, alexandros g....

chinese yunnan hills

architecture portfolio lazaridis alexandros

alexandros a. taﬂanidis1 robust stochastic design of...

alexandros moraitides 3

yunnan & tibet - flowers in the...

the yunnan lijiang 2.4m telescope and its...

distributed storage allocation problems derek leong,...

distributed storage allocations for optimal delay derek...

arash saber tehrani alex dimakis mike neely

architecture yunnan china

marketing strategy yunnan

nikolaos dimakis, lazaros polymenakos and john soldatos...

dr alexandros stampolidis - geophysics.geo.auth.gr

dr. alexandros glykas

china's yunnan province

scheduling for maximizing throughput eecs, uc berkeley...

streaming weak submodularity: interpreting neural networks...

simple regenerating codes: network coding for cloud storage...

xce yunnan 2011