Extending the Yahoo Streaming Benchmark + MapR Benchmarks
TRANSCRIPT
Who am I?
• Director of Applications Engineering at data Artisans
• Previously working on streaming computation at Twitter, Gnip, and Boulder Imaging
• Involved in various kinds of stream processing for about a decade
• High-speed video, social media streaming, general frameworks for stream processing
Overview
• Yahoo! performed a benchmark comparing Apache Flink, Storm, and Spark
• The benchmark never actually pushed Flink to its throughput limits but stopped at Storm's limits
• I knew Flink was capable of much more, so I repeated the benchmarks myself
• I wrote a follow-up blog post explaining my findings and will summarize them here
Yahoo! Benchmark
• Count ad impressions grouped by campaign
• Compute aggregates over a 10 second window
• Emit the current value of the window aggregates to Redis every second for querying
• Map ads to campaigns using Redis as well
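The actual benchmark jobs are written against the Storm and Flink APIs; as a language-neutral sketch of the per-event logic (the field names and the ad-to-campaign table here are hypothetical stand-ins, and the real join hits Redis rather than an in-memory dict):

```python
from collections import defaultdict

# Hypothetical ad -> campaign mapping (the real benchmark looks this up in Redis).
AD_TO_CAMPAIGN = {"ad1": "campA", "ad2": "campA", "ad3": "campB"}

def window_counts(events, window_size_ms=10_000):
    """Count 'view' events per (campaign, 10-second window), mirroring the
    filter -> project -> join -> window steps of the benchmark pipeline."""
    counts = defaultdict(int)
    for event in events:
        if event["event_type"] != "view":               # filter: keep only views
            continue
        campaign = AD_TO_CAMPAIGN[event["ad_id"]]       # join: ad -> campaign
        window = event["event_time"] // window_size_ms  # assign tumbling window
        counts[(campaign, window)] += 1
    return dict(counts)

events = [
    {"ad_id": "ad1", "event_type": "view",  "event_time": 1_000},
    {"ad_id": "ad2", "event_type": "click", "event_time": 2_000},   # filtered out
    {"ad_id": "ad2", "event_type": "view",  "event_time": 9_000},
    {"ad_id": "ad3", "event_type": "view",  "event_time": 12_000},  # next window
]
print(window_counts(events))
# {('campA', 0): 2, ('campB', 1): 1}
```

In the real jobs the window results are additionally flushed to Redis every second so they can be queried while the window is still open.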
Any questions so far?
Storm Code
Flink Code
Hardware Specs
• 10 Kafka brokers with 2 partitions each
• 10 compute nodes (Flink / Storm)
• Each machine has 1 Xeon CPU (4 cores, 8 vCores with hyperthreading)
• 32 GB RAM (only 8 GB allocated to JVMs)
• 10 GigE Ethernet between compute nodes
• 1 GigE Ethernet between the Kafka cluster and the compute nodes
Logical Deployment
[diagram] Data Generator → Kafka → Stream Processor (Source → Filter → Project → Join → Window → Sink) → Redis. The Join stage looks up the ad-to-campaign mapping in Redis; the Sink writes the window aggregates to Redis.
Apache Storm Deployment
[diagram] Data Generator (a Flink job) → Kafka brokers (1 GigE links) → Apache Storm: Source → Filter → Project → Join → shuffle → Window → Sink → Redis, with 10 GigE links between the compute nodes.
Apache Flink Deployment
[diagram sequence] The same pipeline on Flink: Data Generator (itself a Flink job) → Kafka (1 GigE links) → Flink → Redis, with 10 GigE between the compute nodes. Across the slides, Flink chains adjacent operators into single tasks step by step: first Source / Filter, then Source / Filter / Project, then Source / Filter / Project / Join, until the job runs as just two chained tasks, [Source / Filter / Project / Join] and [Window / Sink], connected only by the keyed shuffle.
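The point of chaining is that fused operators become plain function calls executed in one thread per record, with no serialization or network hop between them. A toy sketch (not Flink's actual chaining code; the hardcoded campaign lookup is a hypothetical stand-in for the Redis join):

```python
# Unchained: each operator is a separate task, and every arrow between them
# implies serialization plus a (possibly networked) queue.
# Chained: the operators collapse into one composed function per record.

def source(record):   return record                      # read from Kafka
def filter_op(r):     return r if r["event_type"] == "view" else None
def project(r):       return {"ad_id": r["ad_id"]}       # drop unused fields
def join(r):          return {**r, "campaign": "campA"}  # hypothetical lookup

def chained(record):
    """All four operators executed as one call chain in a single thread."""
    r = filter_op(source(record))
    if r is None:
        return None          # record dropped by the filter, nothing emitted
    return join(project(r))

print(chained({"ad_id": "ad1", "event_type": "view"}))
# {'ad_id': 'ad1', 'campaign': 'campA'}
```

Only the keyed shuffle between the chained [Source / Filter / Project / Join] task and the [Window / Sink] task still crosses task (and possibly machine) boundaries.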
Processing Guarantees
Apples and Oranges:
• Apache Storm: at-least-once semantics; double counting after failures; state lost after failures
• Apache Flink: exactly-once semantics; no double counting; no state loss
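The difference is easy to see with a replay-after-failure simulation. This is a toy model, not either engine's actual recovery protocol: at-least-once redelivers all in-flight records, so a naive counter double counts, while exactly-once rolls state back to a checkpoint before replaying.

```python
def run(events, checkpoint_at, exactly_once):
    """Process events, 'crash' two events after the checkpoint, then replay
    everything that came after the last checkpoint."""
    count = 0
    checkpointed = 0
    for i, _ in enumerate(events):
        count += 1
        if i == checkpoint_at:
            checkpointed = count          # durable snapshot of operator state
        if i == checkpoint_at + 2:        # simulated failure mid-stream
            break
    # Recovery: the source replays events after the checkpoint.
    if exactly_once:
        count = checkpointed              # roll state back to the snapshot
    for _ in events[checkpoint_at + 1:]:
        count += 1
    return count

events = list(range(10))
print(run(events, checkpoint_at=4, exactly_once=False))  # 12: two events double counted
print(run(events, checkpoint_at=4, exactly_once=True))   # 10: the exact count
```

With at-least-once, the two events processed between the checkpoint and the crash are counted twice; with exactly-once, the rolled-back state plus the replay yields the correct total.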
Benchmark: Baseline
[bar chart] Throughput (msgs/sec):
• Storm (Kafka, 1 GigE): well under 1M (bar labeled 0M)
• Flink (Kafka, 1 GigE): 3M
Bottleneck Analysis: Apache Storm
[diagram] The same Storm deployment, with the bottleneck highlighted: the CPU on the Storm compute nodes.
Bottleneck Analysis: Apache Flink
[diagram] The same Flink deployment, with the bottleneck highlighted: the network, specifically the 1 GigE links between the Kafka cluster and the compute nodes.
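A back-of-envelope check makes the network ceiling plausible. The per-event wire size is not stated in the slides, so the 300 bytes below is an assumption, as is giving each compute node its own 1 GigE path to Kafka:

```python
NODES = 10          # compute nodes, each assumed to have its own 1 GigE link to Kafka
LINK_GBPS = 1.0     # 1 Gigabit Ethernet per node
EVENT_BYTES = 300   # assumed wire size per JSON event (not given in the slides)

bytes_per_sec_per_node = LINK_GBPS * 1e9 / 8
msgs_per_sec_per_node = bytes_per_sec_per_node / EVENT_BYTES
cluster_msgs_per_sec = msgs_per_sec_per_node * NODES
print(f"{cluster_msgs_per_sec / 1e6:.1f}M msgs/sec")
```

Under these assumptions the 1 GigE links top out in the low single-digit millions of messages per second, the same order of magnitude as the 3M msgs/sec ceiling observed for Flink.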
Eliminate the Bottleneck
[diagram sequence] The 1 GigE links to Kafka are the limiting resource, so the Kafka brokers are removed from the data path entirely: the Data Generator moves inside the Flink job. The pipeline becomes Data Generator → [Source / Filter / Project / Join] → shuffle → [Window / Sink] → Redis, running end to end on the 10 GigE links between the compute nodes.
Apache Flink Deployment
Benchmark: Round 2
[bar chart] Throughput (msgs/sec):
• Storm (Kafka, 1 GigE): well under 1M (bar labeled 0M)
• Flink (Kafka, 1 GigE): 3M
• Flink (DataGen, 10 GigE): 15M, with 10 GigE end-to-end
Results
• Apache Flink achieved 15 million messages/sec on the Yahoo! benchmark
• With much stronger processing guarantees: exactly-once
• 80x higher than what was reported in the original Yahoo! benchmark on similar hardware
Questions?
Apache Flink and MapR Streams
[diagram] Data Generator → MapR Streams (replacing Kafka as the broker) → MapR cluster running Flink: [Source / Filter / Project / Join] → shuffle → [Window / Sink] → Redis.
MapR Benchmark: Hardware Specs
• 10 MapR nodes, 3x data replication
• Each node has 1 Xeon E5-2660-v3 @ 2.60 GHz CPU
• 10 cores, 20 vCores (hyperthreading)
• 16 vCores used for Flink on each node
• 256 GB RAM (only 8 GB allocated to Flink)
• 40 GigE Ethernet between compute nodes
Benchmarking on MapR HPC Cluster
[bar chart] Throughput (msgs/sec), 40 GigE end-to-end:
• Flink (MapR Streams): 10 million msgs/sec (with 3x replication)
• Flink (w/ Data Generator): 72 million msgs/sec
Benchmarking Summary
[bar chart] Throughput (msgs/sec):
• Storm (Kafka, 1 GigE): well under 1M (bar labeled 0M)
• Flink (Kafka, 1 GigE): 3M
• Flink (MapR, 40 GigE): 10M
• Flink (DataGen, 10 GigE): 15M
• Flink (DataGen, 40 GigE): 72M
What’s missing?
[chart] Two configurations remain unmeasured:
• Flink (Kafka, 10 GigE): ???
• Flink (Kafka, 40 GigE): ???
Results
• Apache Flink achieved 10 million messages/sec on the Yahoo! benchmark when paired with MapR Streams and a high-performance 10-node cluster
• On the same cluster hardware, Apache Flink achieved 72 million messages/sec when using direct data generation
Storm Compatibility
• Lots of companies already have applications written using the Storm API
• Flink provides a Storm compatibility layer
• Run your Storm jobs on Flink with a one-line code change
• Flink also allows you to reuse your existing Storm spout and bolt code from a Flink job
• Give it a try!
Thanks to MapR!
Special thanks to:
Terry He
Ted Dunning