Scala: the Unpredicted lingua franca for Data Science
Posted on 09-Jan-2017
Scala: the Unpredicted lingua franca for Data Science
Dean Wampler @deanwampler
Lightbend
Andy Petrella @noootsab
Data Fellas
Distributed Data Science
Distributed Data Science is the “new” interpretation of “big data”
Big Data
Why did Distributed Computing become Big Data?
Big Data was the visible part of the Iceberg
Business
Thanks @Google (for All the fish)
Enterprise ready Open Source Implementation
Hadoop (JVM -- Enterprise)
Big Data made easy → it becomes popular
Spark (Scala -- Functional)
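To make the "functional" label concrete, here is a minimal sketch of a word count written as a chain of transformations. It assumes nothing beyond the Scala standard library: it runs on local collections, not on Spark, and the names `WordCount` and `count` are made up for this illustration. A Spark job is typically written as a similar pipeline of transformations, but against Spark's own distributed data types.

```scala
// Plain Scala collections only; no Spark dependency is assumed here.
// WordCount and count are hypothetical names chosen for this sketch.
object WordCount {
  // Split lines into lowercase words, then count occurrences per word.
  def count(lines: Seq[String]): Map[String, Int] =
    lines
      .flatMap(_.toLowerCase.split("\\s+")) // one line becomes many words
      .filter(_.nonEmpty)                   // drop empty tokens from blank input
      .groupBy(identity)                    // word -> all of its occurrences
      .map { case (word, occurrences) => word -> occurrences.size }
}
```

Calling `WordCount.count(Seq("big data", "Big Data made easy"))` yields a map from each lowercase word to its count. Each step is a pure function over an immutable collection, which is the property that makes this style easy to distribute.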
After the How, the What
Distributed Data Science
WhyScala.snb
https://github.com/data-fellas/scala-for-data-science
Scala features for data science
Tooling, port models AND invent new models!
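The slides do not list the features themselves (they live in WhyScala.snb), so the following is only an assumed illustration of the kind of language features usually meant: sealed hierarchies, case classes, pattern matching, and higher-order functions. All names here (`Features`, `Numeric`, `Categorical`, `encode`, `encodeRow`) are hypothetical and not taken from the notebook.

```scala
// Hypothetical sketch: modelling and encoding features with core Scala only.
object Features {
  // A sealed hierarchy lets the compiler check that every case is handled.
  sealed trait Feature
  final case class Numeric(value: Double) extends Feature
  final case class Categorical(label: String, levels: Seq[String]) extends Feature

  // Encode one feature as numbers: numeric values pass through,
  // categorical values become a one-hot vector over their levels.
  def encode(feature: Feature): Seq[Double] = feature match {
    case Numeric(v)                 => Seq(v)
    case Categorical(label, levels) => levels.map(l => if (l == label) 1.0 else 0.0)
  }

  // Encode a whole record by concatenating the encodings of its features.
  def encodeRow(row: Seq[Feature]): Seq[Double] = row.flatMap(encode)
}
```

For example, `Features.encodeRow(Seq(Features.Numeric(2.5), Features.Categorical("b", Seq("a", "b", "c"))))` produces the numeric value followed by a one-hot vector for `"b"`. Because `Feature` is sealed, the compiler can warn about unhandled cases when a new kind of feature is added, which helps when porting or inventing models.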
What’s missing in Scala/JVM?
Why Spark Notebook.snb
https://github.com/data-fellas/scala-for-data-science
Tooling for data science
No more one-liner...
● MLlib (and other AMPLab stuff: MLPipeline, MLBase)
● Deeplearning4J
● OptiML
● Streaming Clustering: G-Stream, Mean-Shift-LSH, SOM-MR
● Figaro
Universities are now teaching for data science
● LIPN
● Radboud Universiteit (http://rubigdata.github.io/course/)
Education: Data Science Inc. (12 weeks!)
Of course, check http://spark-packages.org
Models and education (soon snb)