jayaram mudigonda, hp labs praveen yalagandula, hp labs

71
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 © Copyright 2010 Hewlett-Packard Development Company, L.P. Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul, HP Labs SPAIN: High BW Data-Center Ethernet with Unmodified Switches

Upload: olisa

Post on 23-Feb-2016

33 views

Category:

Documents


0 download

DESCRIPTION

SPAIN: High BW Data-Center Ethernet with Unmodified Switches. Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul, HP Labs. Traditional Datacenter. Internet. Datacenter Fabric. Internet-facing applications: - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    1© Copyright 2010 Hewlett-Packard Development Company, L.P.   

Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP LabsMohammad Al-Fares, UCSD Jeff Mogul, HP Labs

SPAIN:High BW Data-Center Ethernet

with Unmodified Switches

Page 2: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    2

Datacenter Fabric

Traditional DatacenterInternet

Internet-facing applications: E-Mail, Web Servers, etc.

Page 3: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    3

DC TrendsInformatio

n Explosion

Application Consolidatio

nVirtualization

HPC Applications

Page 4: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    4

Datacenter Fabric

DC TrendsInternet

M R

M

R

RR

RR

R

R

M

M

M

M

M

M

Shuffle phase of Map – Reduce

RM

Page 5: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    5

Datacenter Fabric

DC TrendsInternet

M R

M

R

RR

RR

R

R

M

M

M

M

M

M

Shuffle phase of Map – Reduce

RM

High bisection

bandwidth

Page 6: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    6

Datacenter Fabric

DC TrendsInternet

Flat Network

Page 7: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    7

DC Fabric Goals

High bisection BWFlat network

Low-cost

Page 8: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    8

Ethernet: a good choiceCommodity Inexpensive

Speeds: 10G is here 40G/100G soonFlat-addressingSelf-configuring

Page 9: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    9

But wait…

Page 10: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    10

Spanning Tree Protocol (STP)

makes Ethernet hard

to scale!

Page 11: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    11

Spanning Tree Protocol (STP) Roo

tBandwidth bottleneck

Unused links

Page 12: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    12

Proposal 1: High-port core switch

A common current approach

Page 13: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    13

Expensive Core

SwitchHigh BW or

Multiple Links

Page 14: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    14

Proposal 2: L3

IP SubnettingVL2 [SIGCOMM’09]

Page 15: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    15

L3 routers

ExpensiveNo non-IP protocols

(FCoE)

Page 16: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    16

Proposal 3: Modify switches (HW/SW)

TRILL [IETF]

SEATTLE [SIGCOMM’08]PortLand [SIGCOMM’09]

Not deployable today!

Page 17: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    17

SPAINUnmodified L2 switchesMulti-pathingArbitrary topologies

Page 18: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    18

SPAIN Approach

Multi-pathing via VLANs

+ End-host driver to

spread load

Page 19: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    19

A C B D

Multi-pathing via VLANsDefault

VLAN

Page 20: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    20

A C B D

Multi-pathing via VLANsDefault

VLAN

Page 21: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    21

SPAINUnmodified L2 switchesMulti-pathing via VLANsArbitrary topologiesMinor End-host modifs

Low-costHigh-BWDC FabricToday!

Page 22: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    22

OutlineIntroductionSPAIN Components Offline computation End-host driverEvaluationSummary

Page 23: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    23

OutlineIntroductionSPAIN Components Offline computation End-host driverEvaluationSummary

Page 24: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    24

Offline Computation Steps: 1. Discover topology 2. Compute paths 3. Layout paths as VLANs

Page 25: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    25

Discover topologySNMP Queries

SPAIN

Page 26: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    26

Compute pathsGoal: leverage redundancy; improve reliabilityChallenges: large graphs; more pathsmore resources

Page 27: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    27

Compute pathsOnly consider paths between edge-switchesModified Dijkstra’s; Prefer edge-disjoint

paths

Page 28: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    28

VLAN LayoutSimple scheme: Each Path as

VLAN

Page 29: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    29

But…

VLAN ID = 12 bits 4096 VLANs!

IEEE 802.1Q:

Page 30: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    30

Simple scheme: Each Path as VLANScales to only few switches

VLAN Layout

Page 31: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    31

Our approach: 1 VLAN for a set of paths

VLAN Layout

Page 32: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    32

Challenge: Minimize VLANs

NP-Hard for arbitrary topologies

Page 33: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    33

Heuristics: 1. Greedy path packing 2. Parallel graph-coloring

VLAN Layout

Page 34: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    34

# VLANs = 4VLAN Layout

Page 35: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    35

OutlineIntroductionSPAIN Components Offline computation End-host driverEvaluationSummary

Page 36: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    36

SPAIN End-host Driver

A B

SPAIN

Page 37: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    37

SPAIN

SPAIN End-host Driver

A BTopology & VLANs

Page 38: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    38

21

SPAIN End-host Driver

A B

Flow TableAB, 1 : REDAB, 2 : BLUE

Flow Table2

1

Page 39: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    39

ChallengesLink & switch failuresPathological flooding

InteroperabilityHost mobilityLoad-balanceEnd-host state

Page 40: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    40

Failures

A B

Flow TableAB : RED

Flow Table

Page 41: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    41

Pathological Flooding

A B

Flow TableAB : RED

Flow TableBA : GREEN

Does not know the

location of B

Page 42: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    42

Solution:

Chirping

Page 43: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    43

Chirping

A B

Flow TableAB : RED

Flow TableBA : GREEN

Does not know the

location of B

C

Knows the location of B

Page 44: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    44

Chirping

A B

Flow TableAB : RED BLUE

Flow TableBA : GREEN

Page 45: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    45

OutlineIntroductionSPAIN Components Offline computation End-host driverEvaluationSummary

Page 46: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    46

Evaluation

SimulationsReal testbed

Page 47: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    47

SimulationsTopologies:CiscoD

C Core switches

Aggregation modulesm = 2

Access switches per modulea = 2

Page 48: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    48

SimulationsTopologies:CiscoD

C Fat-Tree [Al-fares et al. SIGCOMM’08]

#ports/switchp = 4

Page 49: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    49

Simulations

2D HyperX

k=4

Topologies:CiscoD

C Fat-Tree [Al-fares et al. SIGCOMM’08]

HyperX [Ahn et al. SC’09]

Page 50: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    50

SimulationsTopologies:CiscoD

C Fat-Tree [Al-fares et al. SIGCOMM’08]

HyperX [Ahn et al. SC’09]

B-Cube [Guo et al. SIGCOMM’09]

#ports/switch (p) = 2

Levels (l) = 2

Page 51: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    51

SimulationsTopologies:CiscoD

C Fat-Tree [Al-fares et al. SIGCOMM’08]

HyperX [Ahn et al. SC’09]

B-Cube [Guo et al. SIGCOMM’09]

Metrics: #VLANs Link-Coverage Reliability Throughput

Page 52: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    52

SimulationsTopologies:CiscoD

C Fat-Tree [Al-fares et al. SIGCOMM’08]

HyperX [Ahn et al. SC’09]

B-Cube [Guo et al. SIGCOMM’09]

Metrics: #VLANs Link-Coverage Reliability Throughput

Page 53: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    53

Num. of VLANs

CiscoDC (8,8) Fat-Tree (48)HyperX (16)B-Cube (48,2)

146

2880

256

2048

#switches38

576

971

2048

#VLANs

Page 54: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    54

ThroughputCiscoDC Fat-TreeHyperXB-Cube

2x24x10.5x 1.6x

Improvement over STP

Page 55: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    55

OpenCirrus Experiments

Page 56: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    56

1G

10GRACK SWITCH (RS)

CORE SWITCH (CS)

80 blades

OpenCirrus Testbed

Page 57: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    57

1G

10GRACK SWITCH (RS)

CORE SWITCH (CS)

80 blades

CS

S1

S2

S3

OpenCirrus Testbed

Page 58: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    58

CS

S1

S2

S3

OpenCirrus Testbed

10G links that we added

Page 59: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    59

CS

S1

S2

S3

OpenCirrus Testbed

4 VLANs

Page 60: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    60

Shuffle-like experimentEvery server to all other

servers500MB data transfer

Page 61: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    61

CS

S1

S2

S3

Spanning Tree Protocol(STP)

Page 62: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    62

Link utilization in each direction

100%

100%

0%

Time

Page 63: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    63

CS

S1

S2

S3

Spanning Tree Protocol(STP)

Over loade

d

Unused

Page 64: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    64

CS

S1

S2

S3

No bottle-necks

SPAIN

Page 65: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    65

Completion times

STP SPAIN

832 s

431 s

~50% reduction

Page 66: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    66

Aggregate Goodput (Gbps)

STP SPAIN

35.6

66.7

87% improvement

Page 67: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    67

Aggregate Goodput (Gbps)

0% 20% 50% 70% 100%

35.6 37.0 44.756.0

66.7

% SPAIN hosts

Incremental Deployability

Page 68: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    68

CS

S1

S2

S3

Single Shortest Path(SSP) SEATTLE/TRILL

All flows

on RED

All flows on

GREEN

All flows

on GRAY

SEATTLE/TRILL on unmodified switches with

SPAIN

Page 69: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    69

Comparison with SSP

SSP SPAIN

62.3 66.7Goodput (Gbps)

SSP SPAIN

513 431

Completion Time(s)

16% better 7% better

Page 70: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    70

SPAIN Take-awayUnmodified L2 switchesMulti-pathing via VLANsArbitrary topologiesMinor End-host modifs

Low-costHigh-BWDC FabricToday!

Page 71: Jayaram Mudigonda, HP Labs      Praveen Yalagandula, HP Labs

© Copyright 2010 Hewlett-Packard Development Company, L.P.    71 © Copyright 2010 Hewlett-Packard Development Company, L.P.    71

Q&A