

C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection

Marco Canini (UCL)

with Lalith Suresh, Stefan Schmid, Anja Feldmann (TU Berlin)

2

Latency matters

Latency costs $$$$

• +500 ms led to a revenue decrease of 1.2% [Eric Schurman, Bing]
• Every 100 ms in latency cost them 1% in sales [Greg Linden, Amazon]

3

4

One user request

Tens to hundreds of servers involved

5

Variability of response times at system components inflates end-to-end latencies and leads to high tail latencies

[Figure: CDF of latency]

Size and system complexity

At Bing, 30% of services have 95th percentile > 3x median latency [Jalaparti et al., SIGCOMM'13]

6

Performance fluctuations are the norm

• Queueing delays
• Skewed access patterns
• Resource contention
• Background activities

7

How can we enable datacenter applications* to achieve more predictable performance?

* in this work: low-latency data stores

8

Replica selection

[Diagram: a client holding a request must choose among several candidate replicas]

9

Replica Selection Challenges

• Service-time variations

• Herd behavior

10

Service-time variations

[Diagram: the same request takes 4 ms, 5 ms, or 30 ms depending on which replica serves it]

11

Herd Behavior

[Diagram: several clients face the same replica choice and their requests can converge on one server]

12

Load Pathology in Practice

• Cassandra on EC2: 15-node cluster, skewed access pattern, workload by 120 YCSB generators

• Observe load on most heavily utilized node

13

Load Pathology in Practice

99.9th percentile ~ 10x median latency

14

Load Conditioning in Our Approach

15

C3: Adaptive replica selection mechanism that is robust to service-time heterogeneity

16

C3
• Replica Ranking
• Distributed Rate Control

17

C3
• Replica Ranking
• Distributed Rate Control

18

[Diagram: several clients and two servers, one with service time µ⁻¹ = 2 ms and one with µ⁻¹ = 6 ms]

19

Balance the product of queue size and service time

[Diagram: the same clients and servers, with µ⁻¹ = 2 ms and µ⁻¹ = 6 ms]
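To make the idea on this slide concrete, here is a minimal Java sketch; the Replica class and its fields are assumptions for illustration, not C3's actual data structures. The client picks the replica with the smallest queue-size × service-time product, so a longer queue on a fast server can still beat a shorter queue on a slow one.

```java
// Sketch only: choose the replica that minimizes (queue size) x (service time).
// The Replica class and its fields are illustrative, not from the C3 code base.
import java.util.Comparator;
import java.util.List;

class Replica {
    final String id;
    final double queueSize;      // estimated number of queued requests
    final double serviceTimeMs;  // estimated service time 1/mu, in milliseconds

    Replica(String id, double queueSize, double serviceTimeMs) {
        this.id = id;
        this.queueSize = queueSize;
        this.serviceTimeMs = serviceTimeMs;
    }
}

class ProductSelector {
    static Replica select(List<Replica> replicas) {
        // Expected time to drain a replica's queue ~ queueSize * serviceTime.
        return replicas.stream()
                .min(Comparator.comparingDouble((Replica r) -> r.queueSize * r.serviceTimeMs))
                .orElseThrow();
    }

    public static void main(String[] args) {
        // A fast server (1/mu = 2 ms) with 4 queued requests still wins over a
        // slow server (1/mu = 6 ms) with 2 queued requests: 8 ms < 12 ms.
        List<Replica> replicas = List.of(
                new Replica("fast", 4, 2),
                new Replica("slow", 2, 6));
        System.out.println(select(replicas).id); // prints "fast"
    }
}
```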

20

Server-side Feedback

Servers piggyback {q_s} and {1/µ_s} in every response

[Diagram: server returns { q_s , 1/µ_s } with each response to the client]

Clients maintain EWMAs of these metrics
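A rough sketch of the client-side bookkeeping this slide describes, assuming illustrative names and an arbitrary smoothing factor: each response carries the server's queue size q_s and service time 1/µ_s, and the client folds them into per-server EWMAs.

```java
// Sketch: per-server EWMAs of the feedback piggybacked on responses.
// Class/field names and the smoothing factor ALPHA are illustrative, not from the C3 code.
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

class FeedbackTracker {
    private static final double ALPHA = 0.9; // weight given to the previous estimate (assumed)

    static final class Estimate {
        volatile double queueSize;     // EWMA of piggybacked q_s
        volatile double serviceTimeMs; // EWMA of piggybacked 1/mu_s
    }

    private final Map<String, Estimate> estimates = new ConcurrentHashMap<>();

    // Called for every response; qs and serviceTimeMs come from the server's feedback.
    void onResponse(String server, double qs, double serviceTimeMs) {
        Estimate e = estimates.computeIfAbsent(server, k -> new Estimate());
        e.queueSize = ALPHA * e.queueSize + (1 - ALPHA) * qs;
        e.serviceTimeMs = ALPHA * e.serviceTimeMs + (1 - ALPHA) * serviceTimeMs;
    }

    Estimate get(String server) {
        return estimates.getOrDefault(server, new Estimate());
    }
}
```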

21

Server-side Feedback

• Concurrency compensation

Servers piggyback {q_s} and {1/µ_s} in every response

22

Server-side Feedback

• Concurrency compensation

Servers piggyback {q_s} and {1/µ_s} in every response

Clients compensate using their outstanding requests to each server
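The slides do not spell out the compensation formula, so the sketch below is only one plausible shape for it: the client inflates the server-reported queue estimate by its own in-flight requests, scaled by an assumed concurrencyWeight factor standing in for however many clients contend for the server.

```java
// Sketch: fold the client's own in-flight requests into the queue estimate.
// The additive form and the 'concurrencyWeight' factor are assumptions made
// for illustration; they are not taken verbatim from the C3 paper.
class QueueEstimator {
    // ewmaQueueSize:     smoothed q_s reported by the server
    // outstanding:       this client's requests to the server still awaiting replies
    // concurrencyWeight: stand-in for the number of clients contending for the server
    static double estimate(double ewmaQueueSize, int outstanding, double concurrencyWeight) {
        return 1 + outstanding * concurrencyWeight + ewmaQueueSize;
    }
}
```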

23

Select server with min q̂_s · (1/µ̂_s)?

24

Select server with min q̂_s · (1/µ̂_s)?

• Potentially long queue sizes

• Hampers reactivity

25

Penalizing Long Queues

26

Scoring Function

Clients rank servers according to:

Ψ_s = R_s − 1/µ_s + (q̂_s)³ · 1/µ_s
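Written as code, the scoring function might look like the following sketch; the variable names are assumptions, only the formula itself comes from the slide.

```java
// Sketch of the ranking: Psi_s = R_s - 1/mu_s + (q_hat_s)^3 * 1/mu_s.
// Names are illustrative; clients send to the replica with the lowest score.
class ReplicaRanker {
    static double score(double responseTimeMs,    // EWMA of response times, R_s
                        double serviceTimeMs,     // EWMA of service times, 1/mu_s
                        double estimatedQueue) {  // queue-size estimate, q_hat_s
        return responseTimeMs - serviceTimeMs
                + Math.pow(estimatedQueue, 3) * serviceTimeMs;
    }
}
```

The cubic exponent is what penalizes long queues: doubling q̂_s multiplies its contribution by eight, which keeps clients from all converging on one momentarily attractive server.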

27

C3
• Replica Ranking
• Distributed Rate Control

28

Need for distributed rate control

Replica ranking alone is insufficient:

• How do we avoid saturating individual servers?

• What about performance fluctuations from sources external to the system?

29

Cubic Rate Control

• Clients adjust sending rates according to a cubic function

• If the receive rate isn't increasing further, decrease multiplicatively
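The slide names only the shape of the control law, so the following is a sketch of a CUBIC-style controller with illustrative constants (borrowed from TCP CUBIC, not from the C3 paper): the rate grows along a cubic curve anchored at the rate in force when the client last backed off, and is cut multiplicatively when the receive rate stops improving.

```java
// Sketch of a CUBIC-style sending-rate controller. Constants and field names
// are illustrative, not the C3 paper's: cubic growth toward and past the last
// known good rate, multiplicative decrease when throughput stops improving.
class CubicRateLimiter {
    private static final double GROWTH = 0.4; // cubic scaling constant (assumed, as in TCP CUBIC)
    private static final double BETA = 0.2;   // multiplicative-decrease factor (assumed)

    private double rate = 10;               // current sending rate (requests per interval)
    private double rateAtDecrease = 10;     // rate in force when we last backed off
    private long decreaseTimeMs = System.currentTimeMillis();
    private double lastReceiveRate = 0;

    // Called periodically with the rate of responses observed over the last interval.
    void onInterval(double receiveRate) {
        if (receiveRate <= lastReceiveRate) {
            // No improvement: back off multiplicatively and restart the cubic curve here.
            rateAtDecrease = rate;
            rate = rate * (1 - BETA);
            decreaseTimeMs = System.currentTimeMillis();
        } else {
            // Cubic growth: slow near the previous plateau, faster once past it.
            double t = (System.currentTimeMillis() - decreaseTimeMs) / 1000.0;
            double k = Math.cbrt(rateAtDecrease * BETA / GROWTH);
            rate = GROWTH * Math.pow(t - k, 3) + rateAtDecrease;
        }
        rate = Math.max(rate, 1); // keep a small positive floor so sending never stops entirely
        lastReceiveRate = receiveRate;
    }

    double currentRate() { return rate; }
}
```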

30

Putting everything together

On receiving a request:
• Client sorts replica servers by the ranking function
• Select the first replica server that is within its rate limit
• If all replicas have exceeded their rate limits, retain the request in a backlog queue until a replica server is available
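Put together, the per-request flow on this slide could look like the sketch below; every type and method name here is hypothetical rather than the actual C3/Cassandra API.

```java
// Sketch of the request path: rank replicas, honor per-replica rate limits,
// and fall back to a backlog queue when every replica is rate-limited.
import java.util.ArrayDeque;
import java.util.Comparator;
import java.util.List;
import java.util.Optional;
import java.util.Queue;

class C3Scheduler {
    interface Replica {
        double score();        // Psi(s) from the ranking function
        boolean tryAcquire();  // true if this replica's rate limit still permits a send
        void send(Object request);
    }

    private final Queue<Object> backlog = new ArrayDeque<>();

    void submit(Object request, List<Replica> replicas) {
        Optional<Replica> target = replicas.stream()
                .sorted(Comparator.comparingDouble(Replica::score)) // best score first
                .filter(Replica::tryAcquire)                        // first replica within its rate limit
                .findFirst();
        if (target.isPresent()) {
            target.get().send(request);
        } else {
            backlog.add(request); // retried once some replica has rate budget again
        }
    }
}
```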

31

C3 implementation in Cassandra

32

33

[Diagram: node A receives Read("key"); the key is replicated on R1, R2, R3]

34

[Diagram: node A must decide which of R1, R2, R3 should serve Read("key")]

35

[Diagram: node A asks the Snitch "Who is the best replica?" before routing Read("key")]

Dynamic snitching

36

C3 Implementation

• Replacement for Dynamic Snitching

• Scheduler and backlog queue per replica group

• ~400 lines of code

37

Evaluation

Amazon EC2

Controlled Testbed

Simulations

38

Evaluation

Amazon EC2

• 15-node Cassandra cluster

• Workloads generated using YCSB

• Read-heavy, update-heavy, read-only

• 500M 1KB records (larger than memory)

• Compare against Cassandra's Dynamic Snitching

39

2x – 3x improved 99.9th percentile latencies

40

Up to 43% improved throughput

41

Improved load conditioning

42

Improved load conditioning

43

How does C3 react to dynamic workload changes?

• Begin with 80 read-heavy workload generators

• 40 update-heavy generators join the system after 640s

• Observe latency profile with and without C3

44

Latency profile degrades gracefully with C3

45

Summary of other results

Under higher system load, with skewed record sizes, and on SSDs instead of HDDs:

• > 3x better 99th percentile latency

• > 50% higher throughput

46

Ongoing work

• Stability analysis of C3

• Deployment in production settings at Spotify and SoundCloud

• Study of networking effects

47

Summary

C3 combines careful replica ranking and distributed rate control to:

• Reduce tail, mean, and median latencies

• Improve load conditioning and throughput

48

Thank you!

[Diagram: server piggybacks { q_s , 1/µ_s } on each response to the client]
