ai for media and entertainment

36
August 2018 AI FOR MEDIA AND ENTERTAINMENT

Upload: others

Post on 30-Dec-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: AI FOR MEDIA AND ENTERTAINMENT

August 2018

AI FOR MEDIA AND ENTERTAINMENT

Page 2: AI FOR MEDIA AND ENTERTAINMENT

Rick Grandy & Gary Burnett

Solutions Architects – ProViz Media & Entertainment

DEEP LEARNING – AN OVERVIEW

Page 3: AI FOR MEDIA AND ENTERTAINMENT

3

Artificial Intelligence (AI): general coverall for machines doing interesting things

Machine Learning (ML):computers complete tasks without explicit programming

Neural Networks (NN):one technique to achieve ML

Deep Learning (DL):adds “hidden layers” to Neural Networks to solve complex problems

Great explanation: https://goo.gl/hkayWG

1950’s 1960’s 1970’s 1980’s 1990’s 2000’s 2010’s

EVOLUTION OF ARTIFICIAL INTELLIGENCE

Page 4: AI FOR MEDIA AND ENTERTAINMENT

4

A NEW COMPUTING MODEL

TRADITIONAL APPROACH

Requires domain experts

Time-consuming experimentation

Custom algorithms

Not scalable to new problems

Algorithms that learn from examples

MACHINE LEARNING

Car Vehicle

Coupe

Page 5: AI FOR MEDIA AND ENTERTAINMENT

5

A NEW COMPUTING MODEL

TRADITIONAL APPROACH

Requires domain experts

Time-consuming experimentation

Custom algorithms

Not scalable to new problems

Algorithms that learn from examples

DEEP NEURAL NETWORKS

Learn from data

Easily to extend

Accelerated with GPUs

MACHINE LEARNING

Car Vehicle

Coupe

CarVehicle

Coupe

DEEP LEARNING

Page 6: AI FOR MEDIA AND ENTERTAINMENT

6

DEEP LEARNING 101

Page 7: AI FOR MEDIA AND ENTERTAINMENT

7

DEEP LEARNING 101its all about the data

Page 8: AI FOR MEDIA AND ENTERTAINMENT

8

QUESTION AI/DL TASK

Is “it” present

or not?Detection

What type of thing

is “it”?Classification

To what extent

is “it” present?Segmentation

What is the likely

outcome? Prediction

What will likely

satisfy the objective?Recommendation

What would be a

new variant?Generation

INPUTSEXAMPLE

OUTPUTS

Text Data Images

AudioVideo

Object Identification(Labeling)

Object Detection

Feature Tracking

Denoised Pixel Values

Animation Pose Selection

Texture Creation

WHAT PROBLEM ARE YOU SOLVING?Defining the AI/DL Task

Page 9: AI FOR MEDIA AND ENTERTAINMENT

9

CLASSIFICATION OBJECT DETECTION SEGMENTATION

Probabilities for each class Corners of a bounding box Pixels that belong to the cat

0.3 0.7PDOG PCAT

(10, 100)(X1, Y1)

(5, 120)(X1, Y1)

Page 10: AI FOR MEDIA AND ENTERTAINMENT

10

PREDICTION RECOMMENDATION GENERATION

Predict how quilted cat looks Recommend other cats Generate cats from nothing

Page 11: AI FOR MEDIA AND ENTERTAINMENT

11

DEEP LEARNING 101Application Development

Page 12: AI FOR MEDIA AND ENTERTAINMENT

12

DEEP LEARNING APPLICATION DEVELOPMENT

TRAININGLearning a new capability

from existing data

INFERENCEApplying this capability

to new data

Page 13: AI FOR MEDIA AND ENTERTAINMENT

13

DEEP LEARNING APPLICATION DEVELOPMENT

TRAININGLearning a new capability

from existing data

INFERENCEApplying this capability

to new data

Page 14: AI FOR MEDIA AND ENTERTAINMENT

14

DEEP LEARNING APPLICATION DEVELOPMENT

UntrainedNeural Network

Model

TRAININGLearning a new capability

from existing data

INFERENCEApplying this capability

to new data

Page 15: AI FOR MEDIA AND ENTERTAINMENT

15

DEEP LEARNING APPLICATION DEVELOPMENT

UntrainedNeural Network

Model

Deep Learning

Framework

TRAININGLearning a new capability

from existing data

INFERENCEApplying this capability

to new data

Page 16: AI FOR MEDIA AND ENTERTAINMENT

16

DEEP LEARNING APPLICATION DEVELOPMENT

UntrainedNeural Network

Model

Deep Learning

Framework

TRAININGLearning a new capability

from existing data

Trained ModelNew Capability

INFERENCEApplying this capability

to new data

Page 17: AI FOR MEDIA AND ENTERTAINMENT

17

DEEP LEARNING APPLICATION DEVELOPMENT

UntrainedNeural Network

Model

Deep Learning

Framework

TRAININGLearning a new capability

from existing data

Trained ModelNew Capability

Trained ModelOptimized for Performance

INFERENCEApplying this capability

to new data

Page 18: AI FOR MEDIA AND ENTERTAINMENT

18

DEEP LEARNING APPLICATION DEVELOPMENT

UntrainedNeural Network

Model

Deep Learning

Framework

TRAININGLearning a new capability

from existing data

Trained ModelNew Capability

App or ServiceFeaturing Capability

INFERENCEApplying this capability

to new data

Trained ModelOptimized for Performance

Page 19: AI FOR MEDIA AND ENTERTAINMENT

AI FOR MEDIA AND

ENTERTAINMENTDigital Domain

NVIDIA2018

Page 20: AI FOR MEDIA AND ENTERTAINMENT

DEEP LEARNING CHANGES

EVERYTHING

Page 21: AI FOR MEDIA AND ENTERTAINMENT

VFX + MACHINE = LEARNING➤ Image Processing

➤ (Nearly) Automatic Rotoscoping

➤ Noise Removal

➤ Character Animation

➤ Facial Animation

➤ Muscle Simulation

➤ Hair Simulation

➤ Effects

➤ Fluid Simulation

➤ FEM Simulation

Page 22: AI FOR MEDIA AND ENTERTAINMENT

CAN YOU INTERPOLATE AN

EFFECT?

Page 23: AI FOR MEDIA AND ENTERTAINMENT

EXAMPLE: FACIAL

ANIMATION

Page 24: AI FOR MEDIA AND ENTERTAINMENT

TRADITIONAL APPROACHES

➤ Bones and skin deformation

➤ Blendshapes and FACS poses

Page 25: AI FOR MEDIA AND ENTERTAINMENT

GET A MACHINE TO LEARN THE

FACE

Page 26: AI FOR MEDIA AND ENTERTAINMENT

Input

Output

Convolutional

Layer

Convolutional

Layer

Pooling

Layer

Great Big Non-Linear InterpolatorOr

A Great Big Mapping from one probability distribution to another

Page 27: AI FOR MEDIA AND ENTERTAINMENT

““Torture the data, and it will confess to anything.”

-Ronald Coase

Page 28: AI FOR MEDIA AND ENTERTAINMENT

““My face is tired.”

-Doug Roble

Page 29: AI FOR MEDIA AND ENTERTAINMENT

TEMPORALLY CONSISTENT HIGH-

RESOLUTION MOVING MESHES

Page 30: AI FOR MEDIA AND ENTERTAINMENT

WE HAVE DATA.

NOW WHAT?

Page 31: AI FOR MEDIA AND ENTERTAINMENT

POINTS TO HIGH-RESOLUTION MESH➤ Built on work by Bermano et al (2014) and Bickel et al (2008)

➤ Lucio Moser created Masquerade (2017)

➤ A data-driven method to take tracked points to a high-resolution mesh.

Page 32: AI FOR MEDIA AND ENTERTAINMENT

IMAGE TO HIGH RESOLUTION MESH➤ Images (no markers) as input

➤ High resolution mesh as output

➤ Supervised Learning: Images correspond to meshes.

➤ Using the RIGHT data is important.

➤ Convolutional Neural Network

➤ Training takes a long time.

➤ Inference runs at 60 fps.

Page 33: AI FOR MEDIA AND ENTERTAINMENT

FULL PERFORMANCE IN REAL-TIME MOCAP

SUIT

Page 34: AI FOR MEDIA AND ENTERTAINMENT

LAYERED MACHINE LEARNING➤ Use machine learning to train a

machine!

Page 35: AI FOR MEDIA AND ENTERTAINMENT

SUPERVISED LEARNINGVS

UNSUPERVISED LEARNING

Page 36: AI FOR MEDIA AND ENTERTAINMENT