datacenter management with apache mesos

Benjamin Hindman – @benh

Datacenter Management with Apache Mesosmesos.apache.org@ApacheMesos

I’ve got tons of data ...

… more everyday!

That must be why they call it a datacenter.

I’d love to answer some questions with the help

of my data!

I think I’ll try Hadoop.

your datacenter

+ Hadoop

happy?

Not exactly …

… Hadoop is a big hammer, but not

everything is a nail!

I’ve got some iterative algorithms, I want to try

Spark!

datacenter management

static partitioning

Oh noes! Spark wants to read and write data to

Hadoop …

(map/reduce)

(distributed file system)

Could we just give Spark it’s own HDFS cluster

HDFS tee incoming data(2 copies)

periodic copy/sync

That sounds annoying … let’s not do that. Can we do any better though?

happy now?

No! We’ve decided to start doing real time

computation with Storm …

happy now!?

Not really … during the day I’d rather give more machines to Spark but at

night I’d rather give more machines to

Hadoop!

And failures require more datacenter management!

I don’t want to deal with this!

the datacenter …rather than think about the datacenter like this …

… is a computerthink about it like this …

datacenter computer

applications

resources

filesystem

mesosapplications

resources

filesystem

kernel

Okay, so how does it work?

Step 1: HDFS

Step 2: Mesosrun a “master” (or multiple for high availability)

Step 2: Mesosrun “slaves” on the rest of the machines

Step 3: Frameworks

tep 4: Profit$

tep 4: Profit (utilize)$

just one big pool of resources,utilize single machines more fully!

tep 4: Profit (utilize)$

tep 4: Profit(statistical multiplexing)$

reduces CapEx and OpEx!

reduces latency!

tep 4: Profit (failures)$

This sounds pretty good!

Other than Hadoop, Spark, and Storm, what

else can I run on Mesos?

frameworks• Hadoop (github.com/mesos/hadoop)• Spark (github.com/mesos/spark)• DPark (github.com/douban/dpark)• Storm (github.com/nathanmarz/storm)• Chronos (github.com/airbnb/chronos)• MPICH2 (in mesos git repository)• Aurora (proposed for Apache incubator)

What about XYZ?

port an existing frameworkstrategy: write a “wrapper” which launches existing components on mesos~100 lines of code to write a wrapper (the more lines, the more you can take advantage of elasticity or other mesos features)see src/examples/ in mesos repository

write a new framework!as a “kernel”, mesos provides a lot of primitives that make writing a new framework relatively easyprimitives: extracted commonality across existing distributed systems/frameworks (launching tasks, doing failure detection, etc) … why re-implement them each time!?

case study: chronosdistributed cron with dependencies

developed at airbnb~3k lines of Scala!distributed, highly available, and fault tolerant without any network programming!http://github.com/airbnb/chronos

Hmm … if Mesos gives me a datacenter

computer … can I run stuff other than

analytics?

case study: aurorarun N instances of my server, somewhere, forever

(where server == arbitrary command line)developed at Twitterruns hundreds of production services, including ads!recently proposed for Apache Incubator!

aurora

But what about resource isolation!? I don’t want

my end users to have to wait for our website to

load because of resource contention!

resource isolationLinux control groups (cgroups)

CPU (upper and lower bounds)memorynetwork I/O (traffic controller)filesystem (lvm, in progress)

conclusionsdatacenter management is a pain

conclusionsmesos makes running frameworks on your datacenter easier as well as increasing utilization and performance while reducing CapEx and OpEx!

conclusionsrather than build your next distributed system from scratch, consider using mesos

conclusionsyou can share your datacenter between analytics and online services!

Questions?mesos.apache.org@ApacheMesos

framework commonalityrun processes simultaneously (distributed)handle process failures (fault-tolerance)optimize execution (elasticity, scheduling)

primitivesscheduler – distributed system “master” or “coordinator”(executor – lower-level control of task execution, optional)requests/offers – resource allocationstasks – “threads” of the distributed system…

scheduler

ApacheHadoop

Chronos

scheduler(1) brokers for resources(2) launches tasks(3) handles task termination

brokering for resources(1) make resource requests 2 CPUs 1 GB RAM slave *(2) respond to resource offers 4 CPUs 4 GB RAM slave foo.bar.com

offers: non-blocking resource allocationexist to answer the question:“what should mesos do if it can’t satisfy a request?”

(1) wait until it can(2) offer the best allocation it can immediately

offers: non-blocking resource allocationexist to answer the question:“what should mesos do if it can’t satisfy a request?”

(1) wait until it can(2) offer the best allocation it can immediately

resource allocation

ApacheHadoop

Chronos

request

resource allocation

ApacheHadoop

Chronos

request

allocatordominant resource fairnessresource reservations

resource allocation

ApacheHadoop

Chronos

request

optimisticpessimistic

resource allocation

ApacheHadoop

Chronos

request

optimisticpessimisticno overlapping offers all overlapping offers

resource allocation

ApacheHadoop

Chronos

“two-level scheduling”mesos: controls resource allocations to framework schedulersschedulers: make decisions about what to run given allocated resources

end-to-end principle“application-specific functions ought to reside in the end hosts of a network rather than intermediary nodes”

taskseither a concrete command line or an opaque description (which requires a framework executor to execute)

a consumer of resources

task operationslaunching/killinghealth monitoring/reporting (failure detection)resource usage monitoring (statistics)

resource isolationcgroup per executor or task (if no executor)

resource controls adjusted dynamically as tasks come and go!

case study: chronosdistributed cron with dependencies

built at airbnb by @flo

before chronos

single point of failure (and AWS was unreliable)resource starved (not scalable)

chronos requirementsfault tolerancedistributed (elastically take advantage of resources)retries (make sure a command eventually finishes)dependencies

chronosleverages the primitives of mesos

~3k lines of scalahighly available (uses Mesos state)distributed / elasticno actual network programming!

after chronos

after chronos + hadoop

case study: aurora“run 200 of these, somewhere, forever”

built at Twitter

before aurorastatic partitioning of machines to serviceshardware outages caused site outagespuppet + monitops couldn’t scale as fast as engineers

aurorahighly available (uses mesos replicated log)uses a python DSL to describe servicesleverages service discovery and proxying (see Twitter commons)

after aurorapower loss to 19 racks, no lost services!more than 400 engineers running serviceslargest cluster has >2500 machines

MesosNode Node Nod

e Node

Hadoop

Node Node Node Node

Node Node

MPI Storm

Chronos

MesosNode Node Nod

e Node

Hadoop

Node Node Node Node

Node Node

MesosNode Node Nod

e Node

Hadoop

Node Node Node Node

Node Node

MPI Storm

MesosNode Node Nod

e Node

Hadoop

Node Node Node Node

Node Node

MPI Storm

Chronos …

datacenter management with apache mesos

frameworks tep

frameworks step

hdfs step

machines step

datacenter hadoop happy

hdfs hdfs hdfs happy

profitstatistical multiplexing

benhdatacenter management

Documents

introduction to mesos and the datacenter operating system

the datacenter needs an operating...

apache flink® meets apache mesos® and dc/os

making apache kafka elastic with apache mesos

building high scalable distributed framework on apache mesos

apache mesos overview and integration

cracking the container scale problem with apache mesos

containers, devops, apache mesos and cloud - reshaping how

mesos: the operating system for your datacenter

re:platforming// the/datacenter// with/apache/mesos

minimesos. apache mesos made easy

introduction to apache mesos

a whirlwind tour of apache mesos - yow! conferences ·...

apache kafka, hdfs, accumulo and more on mesos

chug: datacenter computing with apache mesos

scalable eventing over apache mesos

jupyter and spark on mesos: best practices ·...

multi-tenant machine learning apache aurora & apache...

scaling like twitter with apache mesos

datacenter computing with apache mesos - bigdata dc