Datacenter Computing Trends and Problems: A Survey
Partha Kundu, Sr. Distinguished Engineer
Corporate CTO Office
Special Session, May 3, NOCS 2011
Pittsburgh, PA, USA
Data center computing is a new paradigm!
Outline of talk
Power & Energy in Data Centers
Network architecture
Protocol interactions
Conclusions
Power & Energy in the Data Center
Figures: data center energy breakdown (Source: ASHRAE); server peak power usage profile (Source: Google 2007)
• Power delivery and cooling overheads are quantified in the PUE metric
• Cooling is the most significant source of energy inefficiency
CPU power contribution is less than 1/3 of server power
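To make the two observations above concrete, here is a minimal Python sketch of the PUE arithmetic and of how small the CPU's slice of facility power ends up being. All wattages are illustrative assumptions, not figures from the talk; only "PUE" and "CPU < 1/3 of server power" come from the slide.

# Illustrative PUE arithmetic (assumed numbers).
it_power_kw = 1000                       # power delivered to IT equipment
cooling_kw = 700                         # chillers, CRAC units
power_dist_kw = 150                      # UPS/PDU and distribution losses
total_facility_kw = it_power_kw + cooling_kw + power_dist_kw

pue = total_facility_kw / it_power_kw
print(f"PUE = {pue:.2f}")                # ~1.85: ~0.85 W of overhead per watt of IT load

# Within a server, CPUs draw less than a third of the power, so a
# CPU-only optimization touches only a minority of the facility's draw.
cpu_fraction_of_server = 0.30            # assumption consistent with "< 1/3"
print(f"CPU share of facility power ~ {cpu_fraction_of_server / pue:.0%}")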
Energy Efficiency
Most of the time, server load is around 30%.
But servers are least energy efficient in their most common operating region!
Source: Barroso & Hölzle, The Datacenter as a Computer, Morgan & Claypool, 2009
Servers are never completely idle
Dynamic Power Range
The CPU's contribution to server power (both peak and idle) has declined over the years
Dynamic power range (a small sketch follows below):
• CPU power range is ~3x for servers
• DRAM range is ~2x
• Disk and networking are < 1.2x
Disks and network switches need to learn from the CPU's power-proportionality gains.
Source: Barroso & Hölzle, The Datacenter as a Computer, Morgan & Claypool, 2009
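A minimal sketch of what "dynamic power range" means here: the ratio of peak to idle power per component. The idle/peak wattages below are assumed for illustration; only the rough ratios (3x, 2x, <1.2x) come from the slide.

# Dynamic range = peak power / idle power, per component (assumed watts).
components = {
    "CPU":     (50, 150),    # ~3x range
    "DRAM":    (25, 50),     # ~2x range
    "Disk":    (9, 10),      # ~1.1x range
    "Network": (18, 20),     # ~1.1x range
}
for name, (idle, peak) in components.items():
    print(f"{name:8s} dynamic range = {peak / idle:.1f}x")
# Components with a small range burn nearly peak power even when idle,
# which is why disks and switches lag the CPU in power proportionality.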
Energy Proportionality
Goal: achieve the best energy efficiency (~80%) in the common operating region (20-30% load); a sketch follows the list below.
Challenges to proportionality:
• Most proportionality tricks used in embedded/mobile devices are not usable in the data center due to huge activation penalties
• The distributed structure of data and applications does not allow powering down during low use
• Disk drives spin >50% of the time even when there is no activity; [Sankar et al., ISCA '08] explore smaller rotational speeds and multiple heads
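To see why the 20-30% operating region is so costly today, here is a hedged sketch comparing a typical non-proportional server against an ideal energy-proportional one at 30% load. The linear power model and the idle/peak watts are assumptions, not numbers from the talk.

# Efficiency proxy: useful work (proportional to utilization) per watt,
# normalized so that a server at 100% load scores 100%.
def efficiency(util, p_idle, p_peak):
    power = p_idle + (p_peak - p_idle) * util    # simple linear power model
    return (util / power) / (1.0 / p_peak)

P_PEAK = 300.0
print("typical server     :", f"{efficiency(0.30, 0.5 * P_PEAK, P_PEAK):.0%}")
print("proportional server:", f"{efficiency(0.30, 0.0, P_PEAK):.0%}")
# With ~50% idle power, a server at 30% load reaches only ~46% of its peak
# efficiency; an energy-proportional design would stay near 100%, which is
# the gap the ~80% goal above is trying to close.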
Source: Kozyrakis et al., IEEE Micro 2010
Application Behavior in Data Centers
• Cosmos is similar to a data-mining workload
• Bing preloads the web index in memory
• But peak disk bandwidth can be high
Significant variation in disk, memory, and network capacity and bandwidth usage across applications
Dynamic Resource Requirements in the Data Center
Figures: intra-server variation, server memory allocation per TPC-H query (Q1-Q12) spanning 0.1 MB to 100 GB on a log scale; inter-server variation, memory allocation over time across a rendering farm.
Huge variations even within a single application running on a large cluster
Motivating Disaggregated Memory*
Figure: conventional blade systems; each blade pairs its CPUs with locally attached DIMMs, and blades communicate only over the backplane.
*Lim et al., Disaggregated Memory for Expansion and Sharing in Blade Servers, ISCA 2009
Disaggregated Memory*
Break CPU-memory co-location
Leverage fast, shared communication fabrics
Blade systems with disaggregated memory
Figure: compute blades keep their CPUs and a small amount of local DIMMs, and share a separate memory blade full of DIMMs reached over the backplane.
*Lim et al., Disaggregated Memory for Expansion and Sharing in Blade Servers, ISCA 2009
Disaggregated Memory*
Authors claim:
• 8x improvement in memory-constrained environments
• 80+% improvement in performance per dollar
• 3x consolidation
*Lim et al., Disaggregated Memory for Expansion and Sharing in Blade Servers, ISCA 2009
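A rough sketch of the trade-off behind those claims: memory-blade capacity is slower per access than local DRAM, but for memory-constrained workloads it replaces a far slower miss path (paging to disk). Every latency and hit rate below is an assumption for illustration, not a number from Lim et al.

# Average access time with a local DRAM tier, a backplane-attached
# memory-blade tier, and disk paging as the miss path (assumed latencies).
def avg_access_ns(local_hit, remote_hit, local_ns=100, remote_ns=2_000,
                  disk_ns=5_000_000):
    miss = 1.0 - local_hit - remote_hit
    return local_hit * local_ns + remote_hit * remote_ns + miss * disk_ns

# Memory-constrained app: without disaggregation, the overflow pages to disk.
print("local DRAM only  :", f"{avg_access_ns(0.90, 0.00):,.0f} ns")
# With a memory blade, most of that overflow is served over the backplane.
print("with memory blade:", f"{avg_access_ns(0.90, 0.099):,.0f} ns")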
Disaggregated Server
High-density, low-power SM10000 servers*
• Designed to replace 40 1-RU servers in a single 10-RU system
• 512 1.66 GHz 64-bit x86 Intel Atom cores in 10 RU; 2,048 CPUs per rack
• 1.28 Terabit interconnect fabric
• Up to 64 1 Gbps or 16 10 Gbps uplinks
• 0-64 SATA SSDs/hard disks
• Integrated load balancing, Ethernet switching, and server management
• Uses less than 2.5 kW of power
SeaMicro SM10000 server*
Claim: achieves 4x space & power consolidation (see the arithmetic below)
*Source: SeaMicro, http://www.seamicro.com/?q=node/102
Figure: servers with consolidated DRAM, disk drives, power supply, and fabric connectivity.
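The consolidation claim is easy to sanity-check with the numbers on the slide; only the per-server power of the conventional 1-RU baseline is an assumption here.

# Space: one 10-RU SM10000 replaces 40 conventional 1-RU servers.
conventional_ru = 40 * 1
sm10000_ru = 10
print("space consolidation:", conventional_ru / sm10000_ru, "x")   # 4x

# Power: the SM10000 draws < 2.5 kW; assume ~250 W per conventional server.
conventional_kw = 40 * 0.250
sm10000_kw = 2.5
print("power consolidation:", conventional_kw / sm10000_kw, "x")   # 4x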
Network Architecture
Requirements of a Cloud-enabled Data Center
Economic & technical motivations:
• Capacity re-allocation
• Economies of scale
• Use commodity hardware & components
• Dynamically distribute compute resources
Status Quo: Conventional DC Network
Ref: “Data Center: Load balancing Data Center Services”, Cisco 2004
Figure: conventional DC network; the Internet feeds an L3 layer of core routers (CR) and access routers (AR), below which an L2 domain of Ethernet switches (S) connects racks of application servers (A).
Key:
• CR = Core Router (L3)
• AR = Access Router (L3)
• S = Ethernet Switch (L2)
• A = Rack of app servers
~1,000 servers per pod; each pod is one IP subnet
Conventional DC Network Problems
• Cost of network equipment is prohibitive
• Limited server-to-server capacity
Figure: the same core/access/switch hierarchy annotated with oversubscription ratios of roughly 5:1, 40:1, and 200:1 at successive layers from the racks toward the core.
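A hedged sketch of what those ratios mean for server-to-server capacity. The NIC speed and the mapping of each ratio to a named layer are assumptions; only the ratios themselves come from the figure.

# Worst-case per-server bandwidth when traffic must cross a given layer,
# given that layer's oversubscription ratio (1 Gbps NICs assumed).
nic_gbps = 1.0
for layer, ratio in [("edge", 5), ("aggregation", 40), ("core", 200)]:
    print(f"{layer:12s} ~{ratio}:1 -> ~{nic_gbps / ratio * 1000:.0f} Mbps per server")
# At 200:1, two servers in different pods may see only ~5 Mbps of the
# 1 Gbps their NICs could deliver: the limited server-to-server capacity above.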
And More Problems …
Figure: two IP subnets (VLANs #1 and #2) hang off different access routers, with ~200:1 oversubscription between them.
• Resource fragmentation, significantly lowering cloud utilization (and cost-efficiency)
And More Problems …
Figure: the same two IP subnets (VLANs #1 and #2), again separated by ~200:1 oversubscription.
• Server IP address assignments are topological
• Moving an IP address out of its containing VLAN is hard: it requires complicated manual L2/L3 re-configuration
What We Need Is ...
1. L2 semantics
2. Uniform High capacity
3. Performance isolation
Achieve Uniform High Capacity: Clos Network Topology*
Figure: Clos topology; each ToR switch connects 20 servers and attaches to K aggregation (Aggr) switches with D ports each, which in turn connect to intermediate (Int) switches, supporting 20*(DK/4) servers in total.
• Large bisection bandwidth
• Multiple paths at modest cost
• Tolerates fabric failure
*Ref: Al-Fares et al., A Scalable, Commodity Data Center Network Architecture, SIGCOMM 2008
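A small calculator for the scaling formula on the slide. The parameter values (D=48, K=16) are illustrative assumptions; the formula 20*(DK/4) is the one shown above.

# Scale of the Clos fabric described above (illustrative parameters).
def clos_servers(D, K, servers_per_tor=20):
    tors = D * K // 4                 # ToR switches the fabric can support
    return tors * servers_per_tor

print(clos_servers(D=48, K=16))       # 48*16/4 = 192 ToRs -> 3,840 servers
# Each ToR has many equal-cost paths through the aggregation and
# intermediate layers, which is what provides the large bisection
# bandwidth and the tolerance to individual switch or link failures.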
Addressing and Routing: Name-Location Separation
• Servers use flat names
• Switches run link-state routing and maintain only switch-level topology
Figure: a directory service maps each flat name to its current ToR (x -> ToR2, y -> ToR3, z -> ToR4); the sender looks up the destination's ToR (lookup & response) and the packet is carried across the fabric addressed to that ToR, with the flat destination name inside.
*VL2: A Scalable and Flexible Data Center Network, Greenberg et al, SIGCOMM 2009
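A toy sketch (not VL2's actual implementation) of name-location separation: application-visible names stay flat, and only a directory maps them to the current ToR. The dictionary API and field names below are assumptions for illustration.

# Toy directory service: flat server names -> current ToR location.
directory = {"x": "ToR2", "y": "ToR3", "z": "ToR4"}

def send(src, dst, payload):
    tor = directory[dst]                        # lookup & response
    return {"outer_dst": tor, "inner_dst": dst, "payload": payload}

print(send("x", "z", b"hello"))                 # carried toward ToR4

# If z migrates to a different rack, only the directory entry changes;
# z's flat name, and every application talking to z, stay untouched.
directory["z"] = "ToR3"
print(send("x", "z", b"hello"))                 # now carried toward ToR3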
Addressing and Routing: Name-Location Separation (continued)
Figure: the same diagram one build later; the directory entry for z has been updated from ToR4 to ToR3, so when a server moves only its directory mapping changes while its flat name stays the same.
*VL2: A Scalable and Flexible Data Center Network, Greenberg et al, SIGCOMM 2009
VL2 Fabric: Objectives and Solutions
Objective | Approach | Solution
1. Layer-2 semantics | Employ flat addressing | Name-location separation & resolution service
2. Uniform high capacity between servers | Guarantee bandwidth for hose-model traffic | Clos-based network, Valiant LB flow routing
3. Performance isolation | Enforce hose model using existing mechanisms only | TCP
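A hedged sketch of the Valiant load balancing idea in the middle row of the table: each flow is bounced off a randomly chosen intermediate switch, which spreads any traffic matrix roughly uniformly across the Clos fabric. The switch names and path shape are placeholders, not VL2's actual forwarding code.

import random

# Valiant load balancing over a Clos fabric: per-flow random intermediate.
intermediate_switches = [f"Int{i}" for i in range(10)]

def route(flow_id, src_tor, dst_tor):
    rng = random.Random(flow_id)                # keep one flow on one path
    via = rng.choice(intermediate_switches)     # random "bounce" point
    return [src_tor, "Aggr", via, "Aggr", dst_tor]

print(route(flow_id=42, src_tor="ToR1", dst_tor="ToR7"))
# Randomizing the intermediate hop per flow is what lets the fabric offer
# uniform (hose-model) capacity without per-application traffic engineering.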
Protocol Interactions
TCP Incast Collapse: Problem
Affects key datacenter applications with barrier-synchronization boundaries, e.g. DFS, web search, MapReduce
Source: Nagle et al., The Panasas ActiveScale Storage Cluster: Delivering Scalable High Bandwidth Storage, SC2004
New Cluster Based Storage System
Incast: Application Overfills Buffers
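A back-of-the-envelope view of why barrier-synchronized responses overfill a shallow ToR buffer. All sizes below are assumptions chosen for illustration.

# N storage servers answer one barrier-synchronized request at once.
senders = 40
response_kb = 256               # per-server block of the striped read
switch_buffer_kb = 4 * 1024     # shared packet buffer on a commodity ToR

in_flight_kb = senders * response_kb
print(f"arriving burst: {in_flight_kb} KB vs shared buffer: {switch_buffer_kb} KB")
# 40 * 256 KB = 10 MB slams into a ~4 MB buffer; the tail of the burst is
# dropped, the affected flows stall in retransmission timeout, and goodput
# collapses until the barrier finally completes.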
Solution: TCP with ms-RTO*
*Safe and Effective Fine-grained TCP Retransmissions for Datacenter Communication, Vasudevan et al., SIGCOMM 2009
• Little adverse effect on WAN traffic
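A minimal sketch of why lowering the RTO floor helps at datacenter RTTs. The formula roughly mirrors TCP's standard RTO computation (SRTT + 4*RTTVAR with a minimum); the RTT values are assumptions.

# TCP retransmission timeout, clamped to a configurable minimum.
def rto(srtt_s, rttvar_s, rto_min_s):
    return max(srtt_s + 4 * rttvar_s, rto_min_s)

srtt, rttvar = 200e-6, 50e-6      # ~200 us RTT inside the datacenter
print("default min RTO  :", rto(srtt, rttvar, rto_min_s=200e-3), "s")  # 0.2 s
print("fine-grained RTO :", rto(srtt, rttvar, rto_min_s=1e-3), "s")    # 0.001 s
# With a 200 ms floor, one incast drop idles the bottleneck for ~1000 RTTs;
# a ~1 ms floor lets the sender recover quickly and restores goodput, while
# WAN flows (whose RTTs already exceed the floor) are largely unaffected.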
Incast Collapse: an Unsolved Problem at Scale*
*Understanding TCP Incast Throughput Collapse in Datacenter Networks, Griffith et al., WREN 2009
The solution space is complex:
• Network conditions can impact RTT
• Switch buffer management strategies matter
• Goodput can be unstable with load and number of senders
Conclusions
Data Center Computing
• Opportunities to realize energy efficiency, particularly in I/O sub-systems
• Data center fabrics need to be re-architected for application scalability and cost
• WAN artifacts can create bottlenecks
NOCs in the Data Center
• Energy efficiency: local (distributed) energy management decisions & coordination by the NOC
• Fabric communication: the NOC can reduce intra-chip/socket communication latencies between VMs
• Congestion management: the NOC can assist in traffic orchestration across VMs
Thank you!