advances in deep learning - wojciech zaremba in deep learning.pdf · 2017-05-21 · house number...

22
Advances in Deep Learning by Wojciech Zaremba Grad student at Intern at Google Brain Ex-Intern at

Upload: others

Post on 11-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Advances in Deep Learningby Wojciech Zaremba

Grad student at Intern at Google BrainEx-Intern at

Page 2: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Outline

● Success stories● Neural networks ● Convolutional neural networks● Recurrent neural networks● Flaws

Page 3: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

House Number Identification in Street View*

*“Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks“ by Ian J. Goodfellow, Yaroslav Bulatov, Julian Ibarz, Sacha Arnoud, Vinay Shet

Page 4: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Image Classification (query animal)

Page 5: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

ImageNet classification results1M training images, 1K categories, top-5 error

Best deep-learning models ~9%

Non-deep learning modelsISI, JapanOxford, EnglandINRIA, FranceUniversity of Amsterdam, etc.

~26%

Page 6: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Speech recognition

Page 7: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Text understanding*

*“Efficient estimation of word representations in vector space” by Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean

Page 8: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Translation*

*”Exploiting similarities among languages for machine translation” by Tomas Mikolov, Quoc V Le, Ilya Sutskever

Page 9: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Neural networks

● Depth is essential for goodperformance

● Large amount of data showstheir power

● Success due to fast GPUs

Page 10: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Convolutional neural networks

Assumptions:● on the very small scale every piece of image

should be processed the same way

Page 11: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Convolutional networks - setting

● 60 million parameters● Training takes ~5 days on GTX Titan● ~90% time takes convolutions, and ~10%

time fully connected● ~5% of parameters are in convolutions and

~95% of parameters in fully connected layers

Page 12: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Recurrent neural networks - schema

My

name

is

name

is

Wojciech

Page 13: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Long-short-term memory (LSTM)

● Variant of RNN● Differentiable memory

unit● Equivalent to RNN but

easier to train

Page 14: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Language modeling (Penn tree bank)

Model Perplexity (lower is better)

Previous state of the art 73

Our result 68

Page 15: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Recurrent networks - applications

● Text understanding● Video modeling thought consecutive frame

prediction● Translation● Speech recognition● Autocompletion

Page 16: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Recurrent neural networks

the tradition of the ancient humanreproduction: it isless favorable to thegood boy for when toremove her bigger.

The meaning of life is

Page 17: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Flaws*

Correctlypredictedobject

Predictsostrich

*”Intriguing properties of neural networks” joint work with Christian Szegedy, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus

Page 18: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

How to?

● Negative examples generated with Backpropagation

● Constrained to be in feasible set (proper color range)

Page 19: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Cross model transferTraining Error

Model A 0%

Model B 0%

Negative examples for Model A

Negative examples for Model B

Gaussian noise std = 0.1

Model A 100% 6.6% 0%

Model B 20.3% 100% 0%

Different fully connected networks trained on MNIST dataset. Average distortions by ~6%.

Page 20: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Cross training data transferTraining P1 Training P2

Model A 0% 2.4%

Model B 2.5% 0%

Different fully connected networks trained on MNIST dataset. Distortions by ~6%.

Test distortion for A Test distortion for B

Model A 100% 6.25%

Model B 26.2% 100%

Page 21: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Flaws - conclusions

● Different networks share properties, which are dependent on statistics of training sets (not only particular samples).

● Can negative examples be used to improve generalization ?

Page 22: Advances in Deep Learning - Wojciech Zaremba in deep learning.pdf · 2017-05-21 · House Number Identification in Street View* *“Multi-digit Number Recognition from Street View

Q&A

● Success stories● Neural networks ● Convolutional neural networks● Recurrent neural networks● Flaws

I am happy to take any questions.