saliency prediction using deep learning techniques

75
Visual Saliency Prediction using Deep Learning Techniques Junting Pan Xavier Giró-i-Nieto AUTHOR ADVISOR 20/07/2014

Upload: xavier-giro

Post on 12-Aug-2015

94 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Saliency prediction using deep learning techniques

Visual Saliency Prediction using Deep Learning Techniques

Junting Pan Xavier Giró-i-Nieto

AUTHOR ADVISOR

20/07/2014

Page 2: Saliency prediction using deep learning techniques

2

OUTLINE

1. Motivation2. Related works3. Methodology4. Results5. Conclusions

Page 3: Saliency prediction using deep learning techniques

3

Let’s play a game!

Page 4: Saliency prediction using deep learning techniques

4

SALIENCY PREDICTION

Page 5: Saliency prediction using deep learning techniques

5

SALIENCY PREDICTION

What have you seen?

Page 6: Saliency prediction using deep learning techniques

6

Tower

SALIENCY PREDICTION

Page 7: Saliency prediction using deep learning techniques

7

Tower

SALIENCY PREDICTION

House

Page 8: Saliency prediction using deep learning techniques

8

SALIENCY PREDICTION

Tower House

Rocks

Page 9: Saliency prediction using deep learning techniques

9

SALIENCY PREDICTION

Page 10: Saliency prediction using deep learning techniques

10

SALIENCY PREDICTION

Eye Tracker Mouse Click

Page 11: Saliency prediction using deep learning techniques

11

LSUN SALIENCY CHALLENGE

Page 12: Saliency prediction using deep learning techniques

12

LSUN SALIENCY CHALLENGE

Page 13: Saliency prediction using deep learning techniques

13

LSUN SALIENCY CHALLENGE

Page 14: Saliency prediction using deep learning techniques

14

OUTLINE

1. Motivation2. Related Works3. Methodology4. Results5. Conclusions

Page 15: Saliency prediction using deep learning techniques

15

RELATED WORK: Deep Learning

@jponttuset

Page 16: Saliency prediction using deep learning techniques

16

RELATED WORK: Deep Learning

Deep Learning

http://insights.venturescanner.com/category/artificial-intelligence-2/

Page 23: Saliency prediction using deep learning techniques

23

RELATED WORK: Conventional Saliency

Jianming Zhang, Stan Sclaroff. Saliency detection: a boolean map approach [ICCV 2013]

Page 24: Saliency prediction using deep learning techniques

24

RELATED WORK: Deep Saliency

Kümmerer, Matthias, Lucas Theis, and Matthias Bethge. "Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet." arXiv preprint arXiv:1411.1045 (2014).

Page 25: Saliency prediction using deep learning techniques

25

RELATED WORK: Deep Saliency

Vig, Eleonora, Michael Dorr, and David Cox. "Large-scale optimization of hierarchical features for saliency prediction in natural images." Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on. IEEE, 2014.

Page 26: Saliency prediction using deep learning techniques

26

RELATED WORK: End-to-end Architecture

Long, Jonathan, Evan Shelhamer, and Trevor Darrell. "Fully convolutional networks for semantic segmentation." Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on. IEEE, 2015.

Page 27: Saliency prediction using deep learning techniques

27

OUTLINE

1. Motivation2. Related Works3. Methodology4. Results5. Conclusions

Page 28: Saliency prediction using deep learning techniques

28

SALIENCY PREDICTION: JuntingNet

Page 32: Saliency prediction using deep learning techniques

32

SALIENCY PREDICTION: Architecture

Upsample + filter

2D map

96x96 2340=48x48

IMAGE INPUT(RGB)

Page 33: Saliency prediction using deep learning techniques

33

SALIENCY PREDICTION: Architecture

Upsample + filter

2D map

96x96 2340=48x48

3 CONV LAYERS

Page 34: Saliency prediction using deep learning techniques

34

SALIENCY PREDICTION: Architecture

Upsample + filter

2D map

96x96 2340=48x48

2 DENSE LAYERS

Page 35: Saliency prediction using deep learning techniques

35

SALIENCY PREDICTION: Architecture

Upsample + filter

2D map

96x96 2340=48x48

Page 37: Saliency prediction using deep learning techniques

37

SALIENCY PREDICTION: Overfitting

Overfitting: More than 20 Milions of parameters

10.000 images for training

Page 38: Saliency prediction using deep learning techniques

38

SALIENCY PREDICTION: Training

Data augmentation with horizontal mirroring.

Page 39: Saliency prediction using deep learning techniques

39

SALIENCY PREDICTION: TrainingWe split the total training data in TWO parts:

80% Training

20% Validation (simultaneous testing)

Page 40: Saliency prediction using deep learning techniques

40

SALIENCY PREDICTION: Training

Training curve of iSUN Database

Page 41: Saliency prediction using deep learning techniques

Training curve of iSUN Database

41

SALIENCY PREDICTION: TrainingLower is better !!

Page 42: Saliency prediction using deep learning techniques

42

SALIENCY PREDICTION: Training

Training curve of iSUN Database

Number of iterations (Training time)

Page 43: Saliency prediction using deep learning techniques

43

SALIENCY PREDICTION: Training

Number of iterations (Training time)

Longer is better?

Training curve of iSUN Database

Page 44: Saliency prediction using deep learning techniques

44

SALIENCY PREDICTION: Training

Number of iterations (Training time)

If the validation loss stops decreasing...

Training curve of iSUN Database

Page 45: Saliency prediction using deep learning techniques

45

SALIENCY PREDICTION: Training

Number of iterations (Training time)

If the validation loss stops decreasing...

DANGER OF OVERFITTING!The model is learning from the data, NOT the problem itself

Training curve of iSUN Database

Page 46: Saliency prediction using deep learning techniques

46

SALIENCY PREDICTION: Training

Training curve of SALICON Database

Page 47: Saliency prediction using deep learning techniques

47

SALIENCY PREDICTION: Training

A: I have just show you our best model.

B: Why is this the best model?

Page 48: Saliency prediction using deep learning techniques

48

SALIENCY PREDICTION: Trial and ErrorWe tried many architectures, too many to be listed here..

Page 49: Saliency prediction using deep learning techniques

49

SALIENCY PREDICTION: Trial and ErrorWe tried many architectures, too many to be listed here..

Page 50: Saliency prediction using deep learning techniques

We tried many architectures, too many to be listed here..

50

SALIENCY PREDICTION: Trial and Error

Page 51: Saliency prediction using deep learning techniques

We tried many architectures, too many to be listed here..

51

SALIENCY PREDICTION: Trial and Error

Page 52: Saliency prediction using deep learning techniques

52

SALIENCY PREDICTION: Training

Loss function Mean Square Error (MSE)

Weight initialization Gaussian distribution

Learning rate 0.03 to 0.0001

Mini batch size 128

Training time 7h (SALICON) / 4h (iSUN)

Acceleration SGD+ nesterov momentum (0.9)

Regularisation Maxout norm

GPU NVidia GTX 980

Page 53: Saliency prediction using deep learning techniques

53

OUTLINE

1. Motivation2. Related Works3. Methodology4. Results5. Conclusions

Page 54: Saliency prediction using deep learning techniques

54

RESULTS: Qualitative (iSUN)

JuntingNetGround TruthPixels

Page 55: Saliency prediction using deep learning techniques

55

RESULTS: Qualitative (iSUN)

JuntingNetGround TruthPixels

Page 56: Saliency prediction using deep learning techniques

56

RESULTS: Qualitative (iSUN)

JuntingNetGround TruthPixels

Page 57: Saliency prediction using deep learning techniques

57

RESULTS: Qualitative (iSUN)

JuntingNetGround TruthPixels

Page 58: Saliency prediction using deep learning techniques

58

RESULTS: Quantitative (iSUN)Results from CVPR LSUN Challenge 2015

Page 59: Saliency prediction using deep learning techniques

59

RESULTS: Qualitative (SALICON)

JuntingNetGround TruthPixels

Page 60: Saliency prediction using deep learning techniques

60

RESULTS: Qualitative (SALICON)

JuntingNetGround TruthPixels

Page 61: Saliency prediction using deep learning techniques

61

RESULTS: Qualitative (SALICON)

JuntingNetGround TruthPixels

Page 62: Saliency prediction using deep learning techniques

62

RESULTS: Qualitative (SALICON)

JuntingNetGround TruthPixels

Page 63: Saliency prediction using deep learning techniques

63

RESULTS: Quantitative (SALICON)Results from CVPR LSUN Challenge 2015

Page 64: Saliency prediction using deep learning techniques

64

RESULTS: First Position at LSUN Challenge

Page 65: Saliency prediction using deep learning techniques

65

RESULTS: MIT Saliency Benchmark

Method SImilarity CC AUC_shuffled AUC_Borji AUC_Judd

Baseline: infinite human

1 1 0.80 0.87 0.91

Deep Gaze 0.39 0.48 0.66 0.85 0.84

eDN 0.41 0.45 0.62 0.81 0.82

Our work 0.4708 0.4285 0.5075 0.7416 0.7720

Torralba, Antonio, and Alexei Efros. "Unbiased look at dataset bias." Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE, 2011

Page 66: Saliency prediction using deep learning techniques

66

Future Work

Method SImilarity CC AUC_shuffled AUC_Borji AUC_Judd

Baseline: infinite human

1 1 0.80 0.87 0.91

Deep Gaze 0.39 0.48 0.66 0.85 0.84

SalNet 0.52 0.58 0.69 0.82 0.83

eDN 0.41 0.45 0.62 0.81 0.82

Our work 0.4708 0.4285 0.5075 0.7416 0.7720

Torralba, Antonio, and Alexei Efros. "Unbiased look at dataset bias." Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE, 2011

K. McGuinness

Page 67: Saliency prediction using deep learning techniques

67

RESULTS: Dissemination

http://bit.ly/juntingnet

Preprint Open Source Software & Models

http://arxiv.org/abs/1507.01422

Page 68: Saliency prediction using deep learning techniques

68

RESULTS: Dissemination

Article highlighted at www.upc.edu

on 17 July 2015

Page 69: Saliency prediction using deep learning techniques

69

OUTLINE

1. Motivation2. Related Works3. Methodology4. Results5. Conclusions

Page 70: Saliency prediction using deep learning techniques

70

LSUN SALIENCY CHALLENGE: A Déjà vu ?

John Markoff, “Scientists see promise in deep learning Programs”, The New York Times (Nov2012).

Photo: Keith Penner

Page 71: Saliency prediction using deep learning techniques

71

ACKNOWLEDGMENTSXavier Giró NietoCarlos SeguraCarles FernándezAlbert GilVictor CamposEnric MonteElisa SayrolEdu FontdevilaMíriam BellverAmaia SalvadorMarc CarnéJavier HernandoJavier VeraAll my family members and friends

Page 72: Saliency prediction using deep learning techniques

72

Thank you!

Page 73: Saliency prediction using deep learning techniques

73

Thank you!

Page 74: Saliency prediction using deep learning techniques

74

Thank you!

Page 75: Saliency prediction using deep learning techniques

75

Thank you! : )

Thank you!