demo: predictive modeling with bigml - by david gerster - papis connect

10
1 David Gerster VP Data Science [email protected]

Upload: papisio

Post on 06-Aug-2015

133 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

David GersterVP Data Science

[email protected]

Page 2: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

2

Demo: Predictive Modeling

• Train a predictive model using 699 biopsies• The “label” of benign or malignant is known for each one• Since we have labels, this is supervised learning

Page 3: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

3

What if we don’t have labels?

• Can we get insight into our data if we don’t know the labels?• Enter anomaly detection• Since we don’t have labels, this is unsupervised learning

Page 4: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

10 lines are neededto isolate this data point(not anomalous)

Page 5: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

Only 4 lines are neededto isolate this data point(highly anomalous)

Page 6: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

6

Demo: Anomaly Detection

• Remove the labels of benign or malignant• Train an anomaly detector on this unlabeled data• Create a new dataset with the anomaly scores as “labels”• Use these “labels” to train a predictive model!

Page 7: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

Who Needs Labels?

Page 8: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

Who Needs Labels?

Page 9: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

9

Minority Report

• Anomaly detection works great on large unlabeled datasets, especially if you expect to find an (adversarial) minority class• Millions of credit card transactions, billions of network events …

• Doesn’t require you to know what you’re looking for!

Page 10: Demo: Predictive Modeling with BigML - by David Gerster - PAPIs Connect

10

Thanks!

David GersterVP Data Science, BigML

[email protected]