are we reaching a data science singularity? how cognitive computing is emerging from machine...

41

Upload: big-data-spain

Post on 16-Apr-2017

146 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa
Page 2: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

1 Natalino Busa - @natbusa

Natalino BusaHead of Data Science Teradata

Page 3: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

2 Natalino Busa - @natbusa

Page 4: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

3 Natalino Busa - @natbusa

Page 5: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

4 Natalino Busa - @natbusa

Page 6: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

5 Natalino Busa - @natbusa

Page 7: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

6 Natalino Busa - @natbusa

What about (data) science?

- technologies and tools are driving innovation in data analytics -

Page 8: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

7 Natalino Busa - @natbusa

Man - Machineas cognitive systems

Page 9: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

8 Natalino Busa - @natbusa

Learning: The Scientific Method

Ørsted's "First Introduction to General Physics" (1811) https://en.m.wikipedia.org/wiki/History_of_scientific_method

observation hypothesis deduction synthesis

Hans Christian Ørsted

experiment

Icons made by Gregor Cresnar from www.flaticon.com is licensed by CC 3.0 BY

Page 10: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

9 Natalino Busa - @natbusa

Innovation in Data Analytics

Cloud Community AI & ML

Page 11: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

10 Natalino Busa - @natbusa

Cloud

Page 12: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

11 Natalino Busa - @natbusa

“we live in an age of open source datacenters, so we can stack all these things together and we have open source from the ground to ceiling.”

Sam Ramji, CEO of Cloud Foundry

https://www.youtube.com/watch?v=7oCSFcUW-Qk

Page 13: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

12 Natalino Busa - @natbusa

Analytics in the cloud

Bare Metal: Physical Machines

IAAS: Virtual Resources

CAAS: Containers,

dPAAS: Datastores, Data Engines iPAAS: Tools Integration, Flows & Processes

DAAAS: Data Analytics as a Service

Page 14: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

13 Natalino Busa - @natbusa

DAAAS: AI and ML API’s

Cloud Computing for Deep Neural Networks > Models, Compute (Train, Score), and Data

AI and ML models for:

● Speech (audio)● Language (text)● Vision (images/video)

● Data (classification, regression, clustering, anomaly detection)

Page 15: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

14 Natalino Busa - @natbusa

Ephemeral Computing Clusters on a Cloud

data

create load compute storetimeline

destroy

Page 16: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

15 Natalino Busa - @natbusa

dPaaS: Analytical clusters

Ephemeral

Short-Lived

Data Exploration

Isolated, Personal

Simple Access Management

Permanent

Long Lived

Production / Operations

Co-Ordinated

Complex Access Management

vs

Page 17: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

16 Natalino Busa - @natbusa

GPU’s and Distributed ComputingGPU support is coming in Kubernetes, Mesos, Spark

https://www.oreilly.com/learning/accelerating-spark-workloads-using-gpushttp://www.slideshare.net/databricks/tensorframes-google-tensorflow-on-apache-spark

out

up

CPUR,Python

SparkTensorFrames

Page 18: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

17 Natalino Busa - @natbusa

Community

Page 19: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

18 Natalino Busa - @natbusa

Community

Develop - Use - Share

Page 20: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

19 Natalino Busa - @natbusa

Sharing is caring … speed

github.com + Jupyter notebooks, share ideas, code, and data

arxiv.orgshare innovation and scientific results

Page 21: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

20 Natalino Busa - @natbusa

Artificial Intelligence Machine Learning

Page 22: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

21 Natalino Busa - @natbusa

Google: open-sources NLP parserscoring 95% in grammar accuracy

https://github.com/tensorflow/models/tree/master/syntaxnet

Page 23: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

22 Natalino Busa - @natbusa

Deep Learning in Language Parsing

https://github.com/tensorflow/models/blob/master/syntaxnet/ff_nn_schematic.png

Page 24: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

23 Natalino Busa - @natbusa

Semantic Search: TDA + NNs Word2Vec, Par2Vec, Doc2Vechttps://arxiv.org/pdf/1405.4053v2.pdfhttps://arxiv.org/pdf/1301.3781v3.pdf

Page 25: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

24 Natalino Busa - @natbusa

Lip reading

LipNet achieves 93.4% accuracy,on GRID corpus.

https://arxiv.org/pdf/1611.01599v1.pdf

Page 27: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

26 Natalino Busa - @natbusa

Ask me Anything

http://www.socher.org/index.php/DeepLearningTutorial/DeepLearningTutorial

Dynamic Memory Networks for Natural Language Processinghttps://arxiv.org/pdf/1603.01417v1.pdf

http://www.socher.org/Local context

Wider context

NLP, Attention Masks

Semantic Embeddings from Text, Images

Page 28: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

27 Natalino Busa - @natbusa

Network Traffic Patterns Classification

Page 29: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

28 Natalino Busa - @natbusa

Network Intrusion Detection

http://billsdata.net/?p=105

It contains 130 million flow records involving 12,027 distinct computers over 36 days (not the full 58 days claimed for the entire data release).

Each record consists of: time (to nearest second), duration, source and destination computer ids, source and destination ports, protocol, number of packets and number of bytes

Techniques: TDA, Dimensionality Reductionhttps://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction

Page 30: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

29 Natalino Busa - @natbusa

Approaching (Almost) Any Machine Learning Problem- Abhishek Thakur, Kaggle Grandmaster -

data labels

raw data: tables, files Useful dataData munging Feature Engineering

Tabular Data ready for ML

http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/

Page 31: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

30 Natalino Busa - @natbusa

AutoML challenge

- based on scikit-learn- 15 classifiers, - 14 feature preprocessing methods- 4 data preprocessing methods- 110 hyperparameters

- Supervised classification challenge:100 different datasets

Natalino Busa - @natbusa

Page 32: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

31 Natalino Busa - @natbusa

Artificial + Human Intelligence

Page 33: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

32 Natalino Busa - @natbusa

Human cognitive biases :

Too much information

Not enough meaning

What should we remember?

Need to act fast

https://en.wikipedia.org/wiki/List_of_cognitive_biases

Page 34: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

33 Natalino Busa - @natbusa

Man vs Machine cognitive limits

Model generation

Explanation

Unsupervised

Planning

Too much information

Not enough meaning

Need to act quickly

Memory limits

Page 35: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

34 Natalino Busa - @natbusa

Theorems often tell us complex truths about the simple things, but only rarely tell us simple truths about the complex ones

Marvin MinskyK-Linesː A Theory of Memory (1980)

Page 36: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

35 Natalino Busa - @natbusa

Data Science: wear the AI/ML LensesWe are entering a new era of intelligent machines

Boost our understanding of data

Focus on higher level analyses

Page 37: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

36 Natalino Busa - @natbusa

Intelligent Data Systems:Long live the “database”

Wikipedia:A database is an organized collection of data.

DATA

New-SQL

ML

AI

SQL

Python - Scala - R

NLP

UX

Speech

COG

Page 38: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

37 Natalino Busa - @natbusa

The Database.is never going to be the same.

Page 39: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

38 Natalino Busa - @natbusa

Thank you.@natbusa

Page 40: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

39 Natalino Busa - @natbusa

credits

Page 41: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa

40 Natalino Busa - @natbusa

bonus slides