advanced rdma-based admission control for modern data...

25
Advanced RDMA-based Admission Control for Modern Data-Centers Ping Lai Sundeep Narravula Karthikeyan Vaidyanathan Dhabaleswar. K. Panda Computer Science & Engineering Department Ohio State University

Upload: others

Post on 27-Jun-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

Advanced RDMA-based Admission Control for Modern Data-Centers

Ping Lai Sundeep Narravula Karthikeyan VaidyanathanDhabaleswar. K. Panda

Computer Science & Engineering DepartmentOhio State University

Page 2: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

2

Outline

• Introduction & Motivation

• Proposed Design

• Experimental Results

• Conclusions & Future Work

Page 3: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

3

Introduction

• Internet grows– Number of users, various type of services, huge amount of data

– Typical apps: e-commerce, bio-informatics, online banking etc.

• Web-based multi-tier data-centers– Huge bursts of requests �server overloaded

– Clients pay for service �guaranteed QoS}

Efficient admission control needed!

Multi-tier Data-center

Tier 0 Tier 1 Tier 2

Proxy

Servers

Web Servers(Apache)

Application

Servers (PHP)

DatabaseServers (MySQL)

WANWANClients

Page 4: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

4

General Admission Control

• What is admission control? – determine whether to accept/drop the incoming requests while

guaranteeing the performance (or QoS requirements) of some already existing connections in the overloaded situation

• Typical approaches– Internal approach: on the

overloaded servers

– External approach: on the front-

tier nodes. Main advantages are:

• Make global decisions

• More transparent to the

overloaded servers

• Easily applicable to any tier

AdmissionControl

AdmissionControl

AdmissionControl

Front-tierservers

Overloadedservers

Back-endservers

AdmissionControl

AdmissionControl

AdmissionControl

AdmissionControl

Page 5: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

5

Motivation

• External approach– Front-tier proxy servers need to get load information from

back-end servers

• Problems with the existing designs– Use TCP/IP – coarse-grained and high overhead;

responsiveness depends on load

– Workload is divergent and unpredictable – require fine-grained and low overhead

Page 6: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

6

Opportunity & Objective

• Opportunity: modern high-speed interconnects– iWARP/10-Gigabit Ethernet, InfiniBand, Quadrics etc.

– High performance: low latency & high bandiwidth

– Novel features: atomic operation, protocol offloading, RDMA operations etc.

– RDMA: low latency & no communication overhead on the remote node

• Objective– Leverage the advanced features (RDMA operation) to

design more efficient, lower overhead and better QoSguaranteed admission control

Page 7: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

7

Outline

• Introduction & Motivation

• Proposed Design

• Experimental Results

• Conclusions & Future Work

Page 8: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

8

System Architecture

• Load gathering daemon running on overloaded web servers

• Load monitoring daemon running on front-tier proxy servers

• Admission control module running on front-tier proxy servers

WANWAN

Admission

Control Module

Apache

IPC

Load Monitoring

Daemon

Load Gathering

Daemon

Server 1

Load Gathering

Daemon

Server N

RDMA read

……

Client

Client

Client

Proxy Servers Web/App Servers (overloaded)

Back-tier Servers

Applicable to any tier

Page 9: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

9

Load Gathering and Monitoring Daemon

• Load gathering daemon– Running on each of the overloaded servers in background – low

overhead

– Gather instantaneous load information

• Load monitoring daemon– Running on each of the front-tier proxy servers

– Retrieve load information from all the load gathering daemons

• Communication is important!– TCP/IP is not good, so?

Page 10: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

10

Gathering and Monitoring Daemon Cont.

• Use RDMA read– Monitoring daemon issues RDMA read to gathering daemon

• Buffer must be registered and pinned down before the operation

• Monitoring daemon has to know the memory address of the remote buffer

– Retrieve load information at high granularity under overload –better decisions

– No CPU involvement on the loaded servers – low overhead

Overloaded ServerFront-tier Server

servicethreads

servicethreads

monitoringthread

gatheringthread

Memory Memory

registeredbuffer

registered buffer

RDMA read

Page 11: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

11

Admission Control Module

• Use shared memory to communicate with load monitoring daemon • Attach to Apache: dynamically loadable; trap into Apache request

processing

• New processing procedure

– Apache main thread call the

admission control module after

TCP connection is established

– Admission control module uses

weighted score to make

decisions

– If all of the back-end servers are

overloaded, call back to Apache

thread to close the new

connections; otherwise, call

back to resume the processing

request Connectionrequest

Process/forwardrequest

to web server

Connectionrequest

Process/forwardrequest

Can the system afford more requests?

Weighted score

NO

call backclose connection

call backresume processing

YES

QoSControl

request

load info. LoadMonitor

QoSStatus

Admission Control Module

Page 12: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

12

Outline

• Introduction & Motivation

• Proposed Design

• Experimental Results

• Conclusions & Future Work

Page 13: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

13

Experimental Platforms

• 32 Compute nodes– Dual Intel 64-bit Xeon 3.6 GHz CPU, 2 GB memory

– Mellanox MT25208 InfiniBand HCA, OFED 1.2 driver

– Linux 2.6

• Two-tier data-center including proxy servers and web

servers; web servers are potentially overloaded

• Apache 2.2.4 for proxy servers and web servers

Page 14: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

14

Experiment Results Outline

• Micro-benchmarks: basic IBA performance

• Data-center level evaluation– Single file trace

• Average response time and aggregate TPS

• Instant performance analysis

• QoS analysis

– Worldcup trace and Zipf trace

• Worldcup trace: real data from world cup 1998

• Zipf trace: workloads follow Zipf-like distribution (probability of i’th most

popular file )1 iα

Page 15: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

15

Performance of RDMA read and IPoIB(TCP/IP over IBA)

With varying message size

0

5

10

15

20

25

30

1 4 16 64 256 1K 4K

Message Size (Byte)

Late

ncy (

usec)

RDMA read IPoIB

• 1 Byte message

– RDMA read: 5.2 us

– IPoIB: 18.9 us

• Improvement using RDMA

increases when message

size increases

• IPoIB significantly degrades

• RDMA read keeps constant

latency

Performance of IPoIB depends

on load, while RDMA NOT!

With background computation

0

100

200

300

400

500

1 2 4 8 16 32 64

Number of Background Threads

La

ten

cy

(u

se

c)

64 Byte RDMA read 64 Byte IPoIB

Page 16: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

16

Data-Center level Evaluation

• Configuration– 4 nodes used as proxy servers

– 1 node used as web server

– Remaining nodes are clients

• Load information updated every 1 ms

• Measured average client-perceived response time (for successful request) and aggregate system TPS

• Comparing performance of three systems– AC-RDMA: system with admission control based on RDMA read (the

proposed approach)

– AC-TCP/IP: system with admission control based on TCP/IP

– NoAC: system without any admission control

Page 17: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

17

Performance with Single File Trace (16 KB)

Average Response Time

0

20

40

60

80

100

120

140

160

180

200

80 160 200 240 280 320 360 400 440 480 520Number of Clients

Av

era

ge

Re

sp

on

se

Tim

e (

ms

ec

)

AC-RDMA AC-TCP/IP NoAC

• With 520 clients

– NoAC: 192.31ms

– AC-TCP/IP: 142.29ms -26% improvement

– AC-RDMA: 105.03ms - 26% improvement over AC-TCP/IP (45% improvement over NoAC)

• AC-RDMA and AC-TCP/IP are

comparable

• System with admission control has

higher TPS than the original system

Aggregate TPS

0

200

400

600

800

1000

1200

1400

1600

1800

80 160 200 240 280 320 360 400 440 480 520Number of Clients

Ag

gre

ga

te T

PS

AC-RDMA AC-TCP/IP NoAC

Page 18: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

18

Instant Performance

• Workload: 400 clients

• Instant response time

– NoAC: many requests

served with very long time

– AC-RDMA: almost no such

requests

– AC-TCP/IP: some requests

with long response time

Instant performance is consistent with the trend of average response time

Page 19: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

19

Instant Performance Cont.

• Instant drop rate

– AC-RDMA: closely reflects the

instantaneous changing load

on web servers

– AC-TCP/IP: longer streak of

continuous drops or

acceptance

– NoAC: a lot of acceptance;

some drops because of

timeout

AC-RDMA gets the load information timely, while AC-TCP/IP sometimes reads the stale information due to the slow response from overloaded servers in TCP/IP communication

Page 20: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

20

QoS AnalysisAverage QoS Unsatisfaction

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

360 400 440 480 520Number of Client

Qo

S U

ns

ati

sfa

cti

on

Ra

te (

%)

AC-RDMA AC-TCP/IP NoAC

Instant QoS status

• Instant QoS status

– AC-RDMA has much better

capability of satisfying the QoS

requirement

With the same requirement of QoS (e.g., response time), AC-

RDMA can serve more clients than the other two systems

• Average QoS unsatisfaction

– AC-RDMA is the best

Page 21: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

21

Performance with Worldcup and Zipf Trace

• AC-RDMA is better– Compared to AC-TCP/IP: 17%

– Compared to NoAC: 36%

World Cup Trace

0

20

40

60

80

100

120

140

160

180

200

80 160 200 240 280 320 360 400 440 480 520

Number of Clients

Av

era

ge

Re

sp

on

se

Tim

e (

ms

ec

)

AC-RDMA AC-TCP/IP NoAC

• AC-RDMA is better– Compared to AC-TCP/IP: 23%

– Compared to NoAC: 42%

Zipf Trace

0

20

40

60

80

100

120

140

160

180

200

80 160 200 240 280 320 360 400 440 480 520

Number of Clients

Av

era

ge

Re

sp

on

se

Tim

e (

ms

ec

)

AC-RDMA AC-TCP/IP NoAC

Page 22: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

22

Outline

• Introduction & Motivation

• Proposed Design

• Experimental Results

• Conclusions & Future Work

Page 23: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

23

Conclusions & Future Work

• Leveraged RDMA read in designing efficient admission control mechanism used in multi-tier data-centers

• Implemented the design in a two-tier data-center over InfiniBand

• Evaluated with single file, worldcup trace and Zipf trace– AC-RDMA outperforms AC-TCP/IP up to 28%, outperforms NoAC up to

51%

– AC-RDMA can provide better QoS satisfaction

– Main reasons

• Update load information timely

• No extra overhead on the already overloaded servers

• Future work: study the scalability performance, incorporate other earlier work for integrated resource management service etc.

Page 24: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

24

Overall Datacenter Framework

ReconfigurationResource

Monitoring

Soft Shared State Lock ManagerGlobal Memory

Aggregator

Distributed Data/Resource Sharing Substrate

High-Performance Networks (InfiniBand, iWARP 10GigE)

Existing Datacenter Components

Multicast

Advanced

System

Services

ActiveCaching

CooperativeCaching

Dynamic Content Caching

ActiveCaching

CooperativeCaching

ReconfigurationResource

Monitoring

Active Resource Adaptation

ReconfigurationResource

Monitoring

RDMA Atomics

High-speed

Networks

Advanced Communication

Protocols and SubsystemsSockets Direct Protocol

QoS&

AdmissionControl

Advanced

Service

Primitives

Distributed Data/Resource Sharing Substrate

Soft Shared State Lock ManagerGlobal Memory

Aggregator

Existing Datacenter Components

Page 25: Advanced RDMA-based Admission Control for Modern Data …nowlab.cse.ohio-state.edu/static/media/publications/slide/lai-ccgrid08.pdfAdvanced RDMA-based Admission Control for Modern

25

Thank you

{laipi, narravul, vaidyana, panda}@cse.ohio-state.edu

Network-Based Computing Laboratory

http://nowlab.cse.ohio-state.edu/Data-Center Web Page

http://nowlab.cse.ohio-state.edu/projects/data-centers/index.html

NBC-LAB