mortar: hadoop-as-a-service + open source framework | aws re: invent public …

Post on 28-May-2015

1.093 Views

Category:

Technology

1 Downloads

Preview:

Click to see full reader

DESCRIPTION

Presentation from AWS re: Invent, where Mortar announced it's public availability on stage. Mortar facilitates Hadoop data development in two parts: an open source data development framework (like Rails for Hadoop) and a PaaS (like Heroku for Hadoop). The open source framework allows for sharing/repeating/maintaining code history, automated testing, and one-button deploy. The PaaS provides high availability execution, full job history, and helps users identify problems.

TRANSCRIPT

Open-source softwaredevelopment frameworks

Open-source software dev frameworks

(and many more...)

• Avoid repeat work• Remain 100% customizable• Escape vendor lock-in

Every platform needsopen-source, code-based dev frameworks

But for data?Nothing.

Data work today• Proprietary / GUI / No framework• Not reusable• Little collaboration• Too many errors, too slow to fix

an open source dev framework for data

“Mortar takes something complex and makes it simple and intuitive.”

—Jon Coveney, Twitter

What big data system should I use? Hadoop, HPCC, Disco, Storm… What Hadoop distro should I use?How much will this cost? Should I do it some other way?How long is this going to take to learn? Should I do it some other way?How many machines should I buy?Can I run in the cloud?How should I interact with it? Pig, Hive, Cascading, Scalding, CascalogCan I use libraries I need?How do I install everything?How can my team share code?How can I reuse code?Is my workload typical?How can I safely deploy?How do I know if what I've written is correct?Are there any libraries I could start with?Can I connect to my key-value store?Can I do machine learning in Hadoop?Is it secure?What if I need help?

Weeks

More Weeks

Even More Weeks

“Our focus in designing Pig has always been to make Hadoop easy...

Mortar's approach is right on—they extend our quick start and ease of use focuses with pre-built Hadoop clusters, clear examples, code organization templates, and github for social sharing of the code.”

OPEN SOURCED PIG—Alan Gates, Hortonworks co-founder

Mortar is for analyzing lots of data in AWS.

Can Mortar help you?

Mortar serves companies of all sizes from any industry.

Who is Mortar for?

“...Mortar fits right in with our vision of the future... With this exciting launch, MongoDB users can now also seamlessly use Mortar.”

—Dwight MerrimanFOUNDER OF 10GEN (MAKER OF MONGODB), DOUBLECLICK (ACQ. GOOGLE), SHOPWIKI, BUSINESS INSIDER, GILT GROUPE

By and for engineers and data scientists

> gem install mortar

> mortar new my_project

> git clone your_project

> mortar run your_project

Pig is easy to learn(and we’ve made it easier)

Illustrate is awesome• Find your mistakes• Understand code before collaborating• Automated tests: a way to test every condition

Hadoop & Python are powerful data science tools

...but they haven’t worked together before.

Now you can use Hadoop & real Python on Mortar

• Installed Mortar• Made a new project• Cloned a project• Ran the project• Illustrated project• Use Python and other libraries on Hadoop

What you just saw

- Web Projects: zero install, in the browser

2 options for using Mortar:

- Git Projects: modularity, testability, code sharing, local dev, and revision control.

• Use your browser• Minutes to connect data• Productive in one hour

One-hour challenge

As a good citizen, Mortar has a rich API

How does Mortar fit with other

Full speed, directly on Hadoop

How about speed?

• Easy start• Keeps you productive• Collaborate with data• No lock-in• Easy to budget

Mortar revolutionizes your data pipeline.

• Free | Service use unlimited | 10 node-hours• Pay as you Go | $0.89/node-hour | support• Enterprise | $3,000/month | $0.69/node-hour | live

support

Tiers

mortardata.com / @mortardata

top related