an (very brief) introduction - eso€¦ · prof. dr. laura leal-taixé an (very brief)...

50
Prof. Dr. Laura Leal-Taixé www.dvl.in.tum.de An (very brief) introduction to Deep Learning for Computer Vision

Upload: others

Post on 08-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Prof. Dr. Laura Leal-Taixé

www.dvl.in.tum.de

An (very brief) introduction to Deep Learning

for Computer

Vision

Page 2: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Computer Vision

Give eyes to a computer

Page 3: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Computer Vision

Understand every pixel of an image

Page 4: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Computer Vision

road

tree

Semantic segmentation

personcar

Understand every pixel of an image

Page 5: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Computer Vision

person 1

tree

Semantic segmentation

person 2

person 3

Instance-based

segmentation

road

car

Understand every pixel of an image

Page 6: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Computer Vision

Understand every pixel of a video

Semantic segmentation

Instance-based

segmentation

Multiple object

tracking

Page 7: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Dynamic Scene Understanding

Understand every pixel of a video

Semantic segmentation

Instance-based

segmentation

Multiple object

tracking

Page 8: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Artificial Intelligence

Data, examples

Computational models

Perform a given task on new unseen data

LEARNING, TRAINING

Page 9: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Artificial Intelligence

Data, examples

Computational models

Perform a given task on new unseen data

Expert eye

Page 10: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Self-driving cars

AI nowadays

Page 11: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

What is Deep Learning, really?

CAT

Page 12: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

What is Deep Learning, really?

Each node is a small classifier

Page 13: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

What is Deep Learning, really?

Each node is a small classifier

Page 14: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Each classifier makes tiny decision

Furry

Has two eyes

Has a tail

Has paws

Has two ears These are learned

from data!

Page 15: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Convolutions on Images

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6

Ou

tpu

t 3

x3

5 ⋅ 3 + −1 ⋅ 3 + −1 ⋅ 2 + −1 ⋅ 0 + −1 ⋅ 4 =15 − 9 = 6

Page 16: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1

Ou

tpu

t 3

x3

5 ⋅ 2 + −1 ⋅ 2 + −1 ⋅ 1 + −1 ⋅ 3 + −1 ⋅ 3 =10 − 9 = 1

Page 17: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8

Ou

tpu

t 3

x3

5 ⋅ 1 + −1 ⋅ (−5) + −1 ⋅ (−3) + −1 ⋅ 3 + −1 ⋅ 2 =5 + 3 = 1

Page 18: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8-7

Ou

tpu

t 3

x3

5 ⋅ 0 + −1 ⋅ 3 + −1 ⋅ 0 + −1 ⋅ 1 + −1 ⋅ 3 =0 − 7 = −7

Page 19: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8-7 9

Ou

tpu

t 3

x3

5 ⋅ 3 + −1 ⋅ 2 + −1 ⋅ 3 + −1 ⋅ 1 + −1 ⋅ 0 =15 − 6 = 9

Page 20: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8-7 9 2

Ou

tpu

t 3

x3

5 ⋅ 3 + −1 ⋅ 1 + −1 ⋅ 5 + −1 ⋅ 4 + −1 ⋅ 3 =15 − 13 = 2

Page 21: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8-7 9 2-5

Ou

tpu

t 3

x3

5 ⋅ 0 + −1 ⋅ 0 + −1 ⋅ 1 + −1 ⋅ 6+ −1 ⋅ (−2) = −5

Page 22: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8-7 9 2-5 -9

Ou

tpu

t 3

x3

5 ⋅ 1 + −1 ⋅ 3 + −1 ⋅ 4 + −1 ⋅ 7 + −1 ⋅ 0 =5 − 14 = −9

Page 23: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

-5 3 2 -5 34 3 2 1 -31 0 3 3 5-2 0 1 4 45 6 7 9 -1

Convolutions on Images

0 -1 0-1 5 -10 -1 0

Ima

ge

5x5

Ke

rne

l 3

x3

6 1 8-7 9 2-5 -9 3

Ou

tpu

t 3

x3

5 ⋅ 4 + −1 ⋅ 3 + −1 ⋅ 4 + −1 ⋅ 9 + −1 ⋅ 1 =20 − 17 = 3

Page 24: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Image filters

• Each kernel gives us a different image filter

Prof. Leal-Taixé and Prof. Niessner24

Input

Edge detection

−1 −1 −1−1 8 −1−1 −1 −1

Sharpen

0 −1 0−1 5 −10 −1 0

Box mean

191 1 11 1 11 1 1

Gaussian blur

116

1 2 12 4 21 2 1

LET’S L

EARN T

HESE FIL

TERS!

Page 25: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Convolutions on RGB Images

32

323

3 5

5

32×32×3 image (pixels %)

5×5×3 filter (weights &)

128

28

activation map(also feature map)

Convolve

Page 26: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Convolution Layer

32

323

3 5

5

32×32×3 image

5×5×3 filter

128

28

activation maps

Convolve

1

Let’s apply a different filterwith different weights!

Page 27: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Convolution Layer

32

323

32×32×3 image

528

28

activation maps

Convolve

Let’s apply **five** filters,each with different weights!

Convolution “Layer”

Page 28: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Visualizing a CNN

Low-level

features

Mid-level

features

Object

parts

Page 29: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Big Data

Models know where

to learn from

Hardware

Models are

trainable

Deep

Models are

complex

What made Deep Learning possible?

Page 30: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Big data

ImageNet: Goal 10.000 images per 100.000 words

Page 31: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Deep Learning: what is it good at?

Input A Response B

English sentence

Picture

Audio clip

French sentence

Photo tagging

Transcript

Machine translation

Face recognition

Speech recognition

Supervised learning

Page 32: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

How to obtain the model?

Data points

Model parameters

Labels (ground truth)

Estimation

Loss function

Optimization

Page 33: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Deep Learning for Image Classification

Classification

Classification

+

Localization

Object

detection

Instance

segmentation

Page 34: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Instance segmentation with Mask-RCNN

K. He, G. Gkioxari, P. Dollar, R. Girshick. Mask R-CNN. ICCV 2017.

Page 35: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Instance segmentation with Mask-RCNN

K. He, G. Gkioxari, P. Dollar, R. Girshick. Mask R-CNN. ICCV 2017.

Page 36: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Manipulating images

Jun-Yan Zhu*, Taesung Park*, Phillip Isola, and Alexei A. Efros. "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks", ICCV, 2017.

Page 37: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Manipulating images

Jun-Yan Zhu*, Taesung Park*, Phillip Isola, and Alexei A. Efros. "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks", ICCV, 2017.

Page 38: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

The world is dynamic!

• Deep Learning pushed single-image analysis to a point where results are usable in real-world scenarios

• The world is not static

What we still need in ML: good memory models

Page 39: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Multiple object tracking

Goal: detect and track all objects in a

scene

Page 40: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Video object segmentation

Page 41: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

TecoGANLowRes

M. Chu, Y. Xie, L. Leal-Taixé and N. Thuerey. „Temporally Coherent GANs for Video Super-Resolution (TecoGAN)“

Video super resolution

Page 42: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

TecoGANLowRes

M. Chu, Y. Xie, L. Leal-Taixé and N. Thuerey. „Temporally Coherent GANs for Video Super-Resolution (TecoGAN)“

Video super resolution

Page 43: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

M. Chu, Y. Xie, L. Leal-Taixé and N. Thuerey. „Temporally Coherent GANs for Video Super-Resolution (TecoGAN)“

TecoGAN results

Page 44: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Photo

MapObtain the camera

pose with respect to

the map

Visual localization

Page 45: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Visual Localization: applications

• Robot navigation

• Augmented reality

[Middelberg, Sattler, Untzelmann, Kobbelt, Scalable 6-DOF Localization on Mobile Devices. ECCV 2014]

Page 46: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

The challenge: Big data

• We need data, lots of data!

• We might not have the luxury of data for some applications such as medical diagnosis

• Data is biased

Big data

Page 47: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Data bias• Increase diversity in

the data

• Increase diversity in the AI community which is building the algorithms

Page 48: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Data bias

Page 49: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

The generalization problem

• Neural networks are GREAT at finding patterns in data they have seen, but not so great at generalizing to new scenarios

True intelligence is still far away

Page 50: An (very brief) introduction - ESO€¦ · Prof. Dr. Laura Leal-Taixé  An (very brief) introduction to Deep Learning f n

Prof. Dr. Laura Leal-Taixé

www.dvl.in.tum.de

Thank you

https://dvl.in.tum.deIf you have images, contact us!