Scala: the Unpredicted Lingua Franca for Data Science

Post on 09-Jan-2017

Category:

Data & Analytics

TRANSCRIPT

Scala: the Unpredicted lingua franca for Data Science

Dean Wampler (@deanwampler), Lightbend

Andy Petrella (@noootsab), Data Fellas

Distributed Data Science

Distributed Data Science is the “new” interpretation of “big data”

Big Data

Why did Distributed Computing become Big Data?

Big Data was the visible part of the iceberg

Business

Thanks @Google (for All the fish)

Enterprise-ready Open Source Implementation

Hadoop (JVM -- Enterprise)

Big Data made easy → it becomes popular

Spark (Scala -- Functional)

After the How, the What

Distributed Data Science

WhyScala.snb

https://github.com/data-fellas/scala-for-data-science

Scala features for data science

Tooling, port models AND invent new models!

What’s missing in Scala/JVM?

Why Spark Notebook.snb

https://github.com/data-fellas/scala-for-data-science

Tooling for data science
