webinar - analyzing video

16
Dato Confidential 1 Analyzing Video with GraphLab Create June 16, 2016 Guy Rapaport, Data Scientist, Dato EMEA [email protected]

Upload: turi-inc

Post on 16-Apr-2017

325 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Webinar - Analyzing Video

Dato Confidential1

Analyzing Video with GraphLab CreateJune 16, 2016

Guy Rapaport, Data Scientist, Dato [email protected]

Page 2: Webinar - Analyzing Video

Dato Confidential2

Dato: We Intelligent Applications

Page 3: Webinar - Analyzing Video

Dato Confidential

Some of our Customers

3

Page 4: Webinar - Analyzing Video

Dato Confidential4

Businessmust be

intelligent

Machine learning applications

• Recommenders • Fraud detection• Ad targeting• Financial models• Personalized

medicine • Churn prediction• Smart UX

(video & text)• Personal assistants• IoT• Socials networks• Log analysisLast decade:

Data managementNow:

Intelligent apps

?Last 5 years:

Traditional analytics

Page 5: Webinar - Analyzing Video

Dato Confidential

Page 6: Webinar - Analyzing Video

Dato Confidential

Creating a model pipeline

exploration

data

modeling

- Images- Text- Graphs- Tabular Data

Page 7: Webinar - Analyzing Video

Dato Confidential

Creating a model pipeline

Ingest Transform

Model DeployUnstructured Data

Page 8: Webinar - Analyzing Video

Dato Confidential

Creating a model pipeline using Dato products

Ingest Transform

Model DeployUnstructured Data

SFrame Engine(FREE, open

source)

GraphLab Create (Scalable Machine Learning Python

Library, 4K/machine/year)

Predictive Services(Serving + Load Balancing + AB

Testing,10K/machine/year)

Page 9: Webinar - Analyzing Video

Dato Confidential9

$ pip install –U graphlab-create

Page 10: Webinar - Analyzing Video

Dato Confidential10

What will we cover today?1. Match a movie’s screenplay with its subtitles.

- Now we know who says what and when.

2. Extract frames, then actors’ faces, from the movie.- We’ll use opencv for video manipulation and face

detection.

3. Train a face recognition model over the faces.- What’s the smallest portion of the movie we can get good

results from?

10

Page 11: Webinar - Analyzing Video

Dato Confidential11

Python vs. Anaconda• You can download Python for free from python.org .

- Python with its standard library.

• Or, you could download the Anaconda distribution.- Python + tons of installed packages + package managers.

• It’s the same Python, but Anaconda includes both pip and also with it’s own package manager, conda.

11

Page 12: Webinar - Analyzing Video

Dato Confidential12

pip vs. conda vs. virtualenvpip – install Python packages.

conda – install Python packages + any OS packages required for your package to work (libraries etc).

$ conda install -c menpo opencv3=3.1.0

virtualenv – separate environment (by manipulating the $PYTHONPATH etc.) so packages won’t break.

You can have multiple Python versions on the same machine, and use a Python version in different environments.

12

Page 13: Webinar - Analyzing Video

Dato Confidential13

Look Deeper!1) Building a Face Recognition System with OpenCV in the blink of an Eye• https://github.com/rragundez/PyData• Live video from webcam, online analytics

2) Using mxnet for deep feature extraction• https://

github.com/dmlc/mxnet/blob/master/example/notebooks/predict-with-pretrained-model.ipynb

• mxnet is now integrated into GraphLab!

3) mxnet-face• https://github.com/tornadomeet/mxnet-face

Page 14: Webinar - Analyzing Video

Dato ConfidentialConfidential – Dato internal use only. ©2015 Dato, Inc.

Questions?

“For the purpose of learning the Answer to theUltimate Question of Life, The Universe, and Everything,

the supercomputer Deep Thought was specially built.It takes Deep Thought 7½ million years to compute and check the answer, which turns out to be 42. Deep Thought points out that

the answer seems meaningless becausethe beings who instructed it

never actually knew what the Question was.”- Douglas Adams, “The Hitchhiker’s Guide to the Galaxy”

Page 15: Webinar - Analyzing Video

Dato Confidential15

Our Machine Learning Specializationin Coursera

https://www.coursera.org/learn/ml-foundations

Page 16: Webinar - Analyzing Video

Dato ConfidentialConfidential – Dato internal use only. ©2015 Dato, Inc.

Thanks!Install using pip: $ pip install -U graphlab-create

Dato Launcher Download:https://dato.com/download/

The benchmarks on GitHub:https://github.com/guy4261/glc_pagerank_benchmark

Coursera Course:https://www.coursera.org/learn/ml-foundations

Reach out: [email protected]