building the ideal stack for machine learning
TRANSCRIPT
2
Agenda
oMachine Learning Overview
o The Need for Real-Time
o Real-Time Machine Learning Use Cases
oDemo
MISSION
Make every company a real-time enterprise
PRODUCT
Database Platform: Real-Time | Scalable | ProvenRanked #1 Operational Data Warehouse
ABOUT
Founders are former Facebook, SQL Server database engineersYC alumni with $85m in funding from top tier investors
MemSQL at a Glance
MACHINE LEARNINGRunning a mathematical model on a data set, where the model is trained from data using some optimization technique
DATA SCIENCEBroader field of training, running, testing, and productizing data models
5
Example Types of Machine Learning
SUPERVISED LEARNINGRegressionClassification
UNSUPERVISED LEARNINGCluster Analysis
8
Example Machine Learning Pipeline
(Mathematical function)
(Arbitrary computation)Feature
Extraction
ModelScoring
NEW DATA
FEATURES
PREDICTION
9
Example Machine Learning Pipeline
Model Accuracy
Feature Extraction
ModelScoring
NEW DATA
FEATURES
PREDICTION
LABELED HISTORICAL DATA
FEATURES
PREDICTION
LABELS
Validation
10
Example Machine Learning Pipeline: Fraud Detection
Model Accuracy
Feature Extraction
ModelScoring
NEW DATA
FEATURES
PREDICTION
LABELED HISTORICAL DATA
FEATURES
PREDICTION
LABELS
Validation
Previously labeled credit card
transactions
X% of predictions are accurately predicted to
be fraud
New credit card transactions
Yes/No fraud prediction
BI DASHBOARD
11
Building your Machine Learning Stack for Experimentation
Model Accuracy
LABELED HISTORICAL DATA
FEATURES
PREDICTION
LABELS
PROGRAMMING STACKPython/NumPy StackROthers: Matlab, Julia
VISUALIZATION TOOLInteractiveCustomizableSharable
12
Building your Machine Learning Stack for Productization
NEW DATA
FEATURES
PREDICTION
o Performance
o Message Delivery Semantics
o Message Queues
o Distributed Computation
o Persistent Database
o Business Intelligence Tool
14
Real-Time Machine Learning
Model Accuracy
Feature Extraction
ModelScoring
NEW DATA
FEATURES
PREDICTION
LABELED HISTORICAL DATA
FEATURES
PREDICTION
LABELS
Validation
15
Real-Time Machine Learning
Model Accuracy
Feature Extraction
ModelScoring
NEW DATA
FEATURES
PREDICTION
LABELED HISTORICAL DATA
FEATURES
PREDICTION
LABELS
Validation
Real-timewebsite
traffic data
Spark MLlib Predictive Model
Advertising Optimization in AdTech Industry
Determine the best ads to serve to users based on previous behavior and market segmentation
Machine learning model implemented using Spark MLlib
BUSINESSLOGIC
18
Real-TimeAnomaly Alerts
...
Sensor Activity
Anomaly Detection in Energy Industry
Proactively detect non-optimally functioning oil drills based on sensor activity, and correct issues before damage is done
Machine learning model is SAS model exported in PMML format run inside Spark
PMML Predictive Model
Facial imagecontent
Machine Learning
Model
Image recognitiondashboard
Image Detection in Social Media Industry
Find matches for faces found in millions of image files, and determine if the face matches a set of known “friends”
Machine learning model is a vector dot product calculation between image feature vectors, all done in MemSQL
MNISTdata
MemSQLPipeline SQL
Leverage the TensorFlow machine learning library to predict images in the MNIST training data set, and store predictions in MemSQL
MNIST Data Set Demo
Thank You!Try MemSQL: memsql.com/downloadContact: [email protected]: @memsql