high level computer vision deep learning for computer ... · pdf filedeep learning for...

80
High Level Computer Vision Deep Learning for Computer Vision Part 2 Bernt Schiele - [email protected] Mario Fritz - [email protected] https://www.mpi-inf.mpg.de/hlcv

Upload: trinhkhanh

Post on 31-Mar-2018

236 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

High Level Computer Vision

Deep Learning for Computer Vision Part 2

Bernt Schiele - [email protected] Mario Fritz - [email protected]

https://www.mpi-inf.mpg.de/hlcv

Page 2: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

High Level Computer Vision - June 16, 2o16

Overview Today

• Feature Generalization ‣ “pre-training” on large dataset,

“fine-tuning” on target dataset

• Object Detection ‣ from image classification to object detection

• R-CNN - Regions with CNN features ‣ Region-based Convolutional Networks for Accurate Object Detection and Semantic

Segmentation, R. Girshick, J. Donahue, T. Darrell, J. Malik (CVPR’14, accepted in May’15 for PAMI)

‣ Region Proposal Method: Selective Search for Object Recognition, J.R.R. Uijlings, K.E.A. van de Sande, T. Gevers, A. W. M. SmeuldersIn IJCV’13.

• VGG-network - alternative to AlexNet ‣ Very Deep Convolutional Networks for Large-Scale Image Recognition,

K. Simonyan, A. Zisserman, ICLR’15

2

Page 3: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Feature Generalization

slides from: Rob Fergus, NIPS’13 tutorial

Page 4: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Training Features on Other Datasets

• Train model on ImageNet 2012 training set

• Re-train classifier on new dataset– Just the softmax layer

• Classify test set of new dataset

Page 5: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

6 training examples

Caltech 256Zeiler & Fergus, Visualizing and Understanding Convolutional Networks, arXiv 1311.2901, 2013

Page 6: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Caltech 256

[3] L. Bo, X. Ren, and D. Fox. Multipath sparse coding using hierarchical matching pursuit. In CVPR, 2013.[16] K. Sohn, D. Jung, H. Lee, and A. Hero III. Efficient learning of sparse, distributed, convolutional feature representations for object recognition. In ICCV, 2011.

Zeiler & Fergus, Visualizing and Understanding Convolutional Networks, arXiv 1311.2901, 2013

Page 7: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Object Detection

slides from: Rob Fergus, NIPS’13 tutorial

Page 8: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Detection with ConvNets

• So far, all aboutclassification

• What aboutlocalizing objectswithin the scene?

Page 9: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Sliding Window with ConvNetConv Conv Conv Conv Conv Full Full

Page 10: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Sliding Window with ConvNet

Input Window

224

224

66

256C

classes

Conv Conv Conv Conv Conv Full Full

Feature Extractor Classifier

Page 11: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Sliding Window with ConvNet

Input Window

224 6256

Conv Conv Conv Conv Conv Full Full

Feature Extractor167

240

1

No need to compute two separate windowsJust one big input window, computed in a single pass

C classes

Page 12: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

C=1000

Class Maps

C=1000

C=1000

Classifier

C=1000

Page 13: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

Class Maps

Classifier

Boat

Boat

Boat

Boat

Page 14: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

Class Maps

Classifier

Boat

Boat

Boat

Boat

Page 15: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

ConvNets for Detection

Feature Extractor

256

256

256

256

FeatureMaps

256

RegressionNetwork

PredictedBounding

Box

GroundTruth

Output: [X,Y,W,H]

Page 16: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Bounding Box prediction example[Sermanet et al. CVPR’14]

Page 17: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Detection Results[Sermanet et al. CVPR’14]

Page 18: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

Detection Results[Sermanet et al. CVPR’14]

Page 19: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

slides from: Ross Girschick - CVPR’14 talk

Page 20: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

20

Page 21: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

21

Page 22: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

22

Page 23: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

23

Page 24: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

24

Page 25: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

25

Page 26: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

26

Page 27: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

27

Page 28: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

28

Page 29: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

29

Page 30: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

30

Page 31: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

31

Page 32: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

32

Page 33: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

33

Page 34: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

34

Page 35: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

35

Page 36: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

High Level Computer Vision - June 16, 2o16

Region Proposal Step

36

Page 37: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

37

Page 38: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

38

Page 39: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

39

Page 40: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

40

Page 41: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

High Level Computer Vision - June 16, 2o16

Method

• compute similarity measure between all adjacent region pairs a and b (e.g.) as:

‣ withencourages small regions to merge early

‣ and are color histograms, encouraging “similar” regions to merge

‣ for slightly more elaborated similarities see their IJCV-paper

41

Ssize(a, b) = 1� size(a) + size(b)

size(image)

ak, bk

Scolor

(a, b) =nX

k=1

min(ak, bk)

S(a, b) = ↵Szize

(a, b) + �Scolor

(a, b)

Page 42: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

42

Page 43: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

43

Page 44: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

44

Page 45: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

45

Page 46: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

46

Page 47: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

High Level Computer Vision - June 16, 2o16

An Evaluation of Region Proposal Methods

• Hosang, Benenson, Dollar, Schiele @ Pami’15 • Recall (of ground truth bounding boxes) as a function of

‣ proposal method

‣ IoU (intersection over union)

‣ number of proposals per image

47

Page 48: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

48

Page 49: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

49

Page 50: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

50

Page 51: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

51

Page 52: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

52

Page 53: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

53

Page 54: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

54

Page 55: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

55

Page 56: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

56

Page 57: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

57

Page 58: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

58

Page 59: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

59

Page 60: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

60

Page 61: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

61

Page 62: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

62

Page 63: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

63

Page 64: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

64

Page 65: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

High Level Computer Vision - June 16, 2o16

Overview Today

• Feature Generalization ‣ “pre-training” on large dataset,

“fine-tuning” on target dataset

• Object Detection ‣ from image classification to object detection

• R-CNN - Regions with CNN features ‣ Region-based Convolutional Networks for Accurate Object Detection and Semantic

Segmentation, R. Girshick, J. Donahue, T. Darrell, J. Malik, PAMI’15 (accepted May 15)

‣ Region Proposal Method: Selective Search for Object Recognition, J.R.R. Uijlings, K.E.A. van de Sande, T. Gevers, A. W. M. SmeuldersIn IJCV’13.

• VGG-network - alternative to AlexNet ‣ Very Deep Convolutional Networks for Large-Scale Image Recognition,

K. Simonyan, A. Zisserman, ICLR’15

65

Page 66: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

66

Page 67: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

67

Page 68: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

68

Page 69: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

69

Page 70: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

70

Page 71: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

71

Page 72: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

72

Page 73: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

73

Page 74: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

74

Page 75: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

75

Page 76: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

76

Page 77: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

77

Page 78: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

78

Page 79: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

79

Page 80: High Level Computer Vision Deep Learning for Computer ... · PDF fileDeep Learning for Computer Vision ... • Object Detection ‣ from image classification to object detection

80