![Page 1: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/1.jpg)
![Page 7: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/7.jpg)
6 Natalino Busa - @natbusa
What about (data) science?
- technologies and tools are driving innovation in data analytics -
![Page 9: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/9.jpg)
8 Natalino Busa - @natbusa
Learning: The Scientific Method
Ørsted's "First Introduction to General Physics" (1811) https://en.m.wikipedia.org/wiki/History_of_scientific_method
observation hypothesis deduction synthesis
Hans Christian Ørsted
experiment
Icons made by Gregor Cresnar from www.flaticon.com is licensed by CC 3.0 BY
![Page 10: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/10.jpg)
9 Natalino Busa - @natbusa
Innovation in Data Analytics
Cloud Community AI & ML
![Page 12: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/12.jpg)
11 Natalino Busa - @natbusa
“we live in an age of open source datacenters, so we can stack all these things together and we have open source from the ground to ceiling.”
Sam Ramji, CEO of Cloud Foundry
https://www.youtube.com/watch?v=7oCSFcUW-Qk
![Page 13: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/13.jpg)
12 Natalino Busa - @natbusa
Analytics in the cloud
Bare Metal: Physical Machines
IAAS: Virtual Resources
CAAS: Containers,
dPAAS: Datastores, Data Engines iPAAS: Tools Integration, Flows & Processes
DAAAS: Data Analytics as a Service
![Page 14: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/14.jpg)
13 Natalino Busa - @natbusa
DAAAS: AI and ML API’s
Cloud Computing for Deep Neural Networks > Models, Compute (Train, Score), and Data
AI and ML models for:
● Speech (audio)● Language (text)● Vision (images/video)
● Data (classification, regression, clustering, anomaly detection)
![Page 15: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/15.jpg)
14 Natalino Busa - @natbusa
Ephemeral Computing Clusters on a Cloud
data
create load compute storetimeline
destroy
![Page 16: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/16.jpg)
15 Natalino Busa - @natbusa
dPaaS: Analytical clusters
Ephemeral
Short-Lived
Data Exploration
Isolated, Personal
Simple Access Management
Permanent
Long Lived
Production / Operations
Co-Ordinated
Complex Access Management
vs
![Page 17: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/17.jpg)
16 Natalino Busa - @natbusa
GPU’s and Distributed ComputingGPU support is coming in Kubernetes, Mesos, Spark
https://www.oreilly.com/learning/accelerating-spark-workloads-using-gpushttp://www.slideshare.net/databricks/tensorframes-google-tensorflow-on-apache-spark
out
up
CPUR,Python
SparkTensorFrames
![Page 20: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/20.jpg)
19 Natalino Busa - @natbusa
Sharing is caring … speed
github.com + Jupyter notebooks, share ideas, code, and data
arxiv.orgshare innovation and scientific results
![Page 22: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/22.jpg)
21 Natalino Busa - @natbusa
Google: open-sources NLP parserscoring 95% in grammar accuracy
https://github.com/tensorflow/models/tree/master/syntaxnet
![Page 23: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/23.jpg)
22 Natalino Busa - @natbusa
Deep Learning in Language Parsing
https://github.com/tensorflow/models/blob/master/syntaxnet/ff_nn_schematic.png
![Page 24: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/24.jpg)
23 Natalino Busa - @natbusa
Semantic Search: TDA + NNs Word2Vec, Par2Vec, Doc2Vechttps://arxiv.org/pdf/1405.4053v2.pdfhttps://arxiv.org/pdf/1301.3781v3.pdf
![Page 25: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/25.jpg)
24 Natalino Busa - @natbusa
Lip reading
LipNet achieves 93.4% accuracy,on GRID corpus.
https://arxiv.org/pdf/1611.01599v1.pdf
![Page 26: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/26.jpg)
25 Natalino Busa - @natbusa
Ask me Anything
Dynamic Memory Networks
for Natural Language
Processinghttps://arxiv.org/pdf/1603.01417v1.pdf
https://youtu.be/oGk1v1jQITw
Caiming Xiong, Stephen Merity, Richard Socher
![Page 27: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/27.jpg)
26 Natalino Busa - @natbusa
Ask me Anything
http://www.socher.org/index.php/DeepLearningTutorial/DeepLearningTutorial
Dynamic Memory Networks for Natural Language Processinghttps://arxiv.org/pdf/1603.01417v1.pdf
http://www.socher.org/Local context
Wider context
NLP, Attention Masks
Semantic Embeddings from Text, Images
![Page 29: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/29.jpg)
28 Natalino Busa - @natbusa
Network Intrusion Detection
http://billsdata.net/?p=105
It contains 130 million flow records involving 12,027 distinct computers over 36 days (not the full 58 days claimed for the entire data release).
Each record consists of: time (to nearest second), duration, source and destination computer ids, source and destination ports, protocol, number of packets and number of bytes
Techniques: TDA, Dimensionality Reductionhttps://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction
![Page 30: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/30.jpg)
29 Natalino Busa - @natbusa
Approaching (Almost) Any Machine Learning Problem- Abhishek Thakur, Kaggle Grandmaster -
data labels
raw data: tables, files Useful dataData munging Feature Engineering
Tabular Data ready for ML
http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/
![Page 31: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/31.jpg)
30 Natalino Busa - @natbusa
AutoML challenge
- based on scikit-learn- 15 classifiers, - 14 feature preprocessing methods- 4 data preprocessing methods- 110 hyperparameters
- Supervised classification challenge:100 different datasets
Natalino Busa - @natbusa
![Page 33: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/33.jpg)
32 Natalino Busa - @natbusa
Human cognitive biases :
Too much information
Not enough meaning
What should we remember?
Need to act fast
https://en.wikipedia.org/wiki/List_of_cognitive_biases
![Page 34: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/34.jpg)
33 Natalino Busa - @natbusa
Man vs Machine cognitive limits
Model generation
Explanation
Unsupervised
Planning
Too much information
Not enough meaning
Need to act quickly
Memory limits
![Page 35: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/35.jpg)
34 Natalino Busa - @natbusa
Theorems often tell us complex truths about the simple things, but only rarely tell us simple truths about the complex ones
Marvin MinskyK-Linesː A Theory of Memory (1980)
![Page 36: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/36.jpg)
35 Natalino Busa - @natbusa
Data Science: wear the AI/ML LensesWe are entering a new era of intelligent machines
Boost our understanding of data
Focus on higher level analyses
![Page 37: Are we reaching a Data Science Singularity? How Cognitive Computing is emerging from Machine Learning Algorithms, Big Data Tools, and Cloud Services by Natalino Busa](https://reader031.vdocuments.us/reader031/viewer/2022030317/586fa13d1a28abcc238b6ad3/html5/thumbnails/37.jpg)
36 Natalino Busa - @natbusa
Intelligent Data Systems:Long live the “database”
Wikipedia:A database is an organized collection of data.
DATA
New-SQL
ML
AI
SQL
Python - Scala - R
NLP
UX
Speech
COG