Convolutional Neural Networks & Deep Learning

Post on 21-May-2020

Page 1: Convolutional Neural Networks & Deep Learning16385/s17/Slides/9.6 CNNs... · Convolutional Neural Networks & Deep Learning. Pre deep learning era Cons: • Hand crafted features are

Convolutional Neural Networks & Deep Learning

Pre deep learning era

Cons:

• Hand-crafted features are difficult to engineer!

• Time consuming process.

• Which set of features maximizes accuracy?

• Tends to overfit.

What is Deep Learning?

Composition of non-linear transformations of data.

Why “deep”?

Find complex patterns by learning hierarchical features.

But deep learning is simple!

• Deep Learning builds an end-to-end recognition system.

• Non-linear transformation of raw pixels directly to labels.

• Build a complex non-linear system by combining 4 simple building blocks.

Convolutions

Softmax

Pooling

Activation functions

Convolutions

Convolutions

Convolutions – In deep learning

Figure from S-17 16-824 CMU

We need to learn these filters.

Figure from S-17 16-824 CMU

Convolutions – In deep learning

Figure from S-17 16-824 CMU

Convolutions – In deep learning

Figure from S-17 16-824 CMU

Convolutions – In deep learning

Convolutions – In deep learning

Convolution – Spatial Dimensions

Figure from S-17 16-824 CMU

Convolution – Spatial Dimensions

Figure from S-17 16-824 CMU

Convolution: Example
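
The slide's figure is not in this transcript, so here is a minimal sketch of what a single learned filter computes. It assumes one input channel, a square filter, no padding, and the standard output-size rule (n + 2*pad - f) / stride + 1. The function names and the averaging filter are hypothetical, chosen only for illustration.

```python
import numpy as np

def conv_output_size(n, f, stride=1, pad=0):
    """Spatial output size for input size n and filter size f."""
    return (n + 2 * pad - f) // stride + 1

def conv2d_single(image, kernel, stride=1):
    """Naive 'valid' 2D filtering of one channel with one square filter.

    Like most deep learning libraries, this slides the filter without
    flipping it (cross-correlation).
    """
    h, w = image.shape
    f = kernel.shape[0]
    out_h = conv_output_size(h, f, stride)
    out_w = conv_output_size(w, f, stride)
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            top, left = i * stride, j * stride
            patch = image[top:top + f, left:left + f]
            out[i, j] = np.sum(patch * kernel)  # dot product of patch and filter
    return out

image = np.arange(25, dtype=float).reshape(5, 5)
kernel = np.ones((3, 3)) / 9.0  # hypothetical 3x3 averaging filter
result = conv2d_single(image, kernel)
print(result.shape)  # (3, 3)
```

In a CNN the entries of `kernel` would be learned by backpropagation rather than fixed by hand.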

Why not use FCs for learning image features?

- Huge number of parameters in a fully connected (FC) network.

- Full connectivity is wasteful. Leads to overfitting.

- (200x200x3) x 5 neurons = 120,000 x 5 = 600,000 parameters in FC!

- No spatial relation in FCs.

- CNNs just learn several filters (the weights).

- 5x5x100 = 2500 parameters for learning 100 filters of size 5x5 in CNNs (one input channel, biases not counted).
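
The parameter counts above can be checked with a few lines of arithmetic. This is a sketch that ignores bias terms; the helper names are hypothetical, and the `in_channels` argument is an added assumption to show how the count grows for a colour input.

```python
def fc_params(height, width, channels, neurons):
    """Weights in a fully connected layer: every pixel connects to every neuron."""
    return height * width * channels * neurons

def conv_params(filter_size, num_filters, in_channels=1):
    """Weights in a conv layer: each filter is shared across all image positions."""
    return filter_size * filter_size * in_channels * num_filters

fc = fc_params(200, 200, 3, 5)
cnn = conv_params(5, 100)
cnn_rgb = conv_params(5, 100, in_channels=3)
print(fc, cnn, cnn_rgb)  # 600000 2500 7500
```

Even with three input channels, 100 filters need far fewer weights than 5 fully connected neurons.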

Max-pooling

Figure from Fei-Fei Li & Andrej Karpathy & Justin Johnson (CS231N)

• Non-linear down sampling.

• Input is partitioned into non-overlapping patches and the maximum value in each partition is chosen.
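
A minimal NumPy sketch of this partition-and-take-the-maximum step, assuming a 2x2 window with stride 2 and an input of shape (height, width, depth) whose height and width are even. The function name is hypothetical.

```python
import numpy as np

def max_pool_2x2(x):
    """2x2 max-pooling with stride 2 on an array of shape (H, W, C)."""
    h, w, c = x.shape
    # Split rows and columns into non-overlapping 2x2 blocks,
    # then take the maximum inside each block, per channel.
    blocks = x.reshape(h // 2, 2, w // 2, 2, c)
    return blocks.max(axis=(1, 3))

x = np.arange(16, dtype=float).reshape(4, 4, 1)
pooled = max_pool_2x2(x)
print(pooled[:, :, 0])
# [[ 5.  7.]
#  [13. 15.]]

volume = np.arange(48, dtype=float).reshape(4, 4, 3)
pooled_volume = max_pool_2x2(volume)
print(pooled_volume.shape)  # (2, 2, 3)
```

The spatial size halves in each direction while the last axis (depth) is untouched, so only one in four activations is kept.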

Max-pooling

Figure from Fei-Fei Li & Andrej Karpathy & Justin Johnson (CS231N)

Depth doesn’t change!

Why Max-pool?

• Reduce spatial size of representation.

• Reduce the number of parameters drastically.

• 2x2 filter with stride = 2 discards 75% of the activations!

• Control overfitting.

• Provides translation invariance.

Linear Activations

Slide from CMU 16-720

Why non-linear activation functions?

We need a non-linear transformation of data such that the output is a complex, non-linear transformation of the input.
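
One way to see why: without a non-linearity, stacked linear layers collapse into a single linear layer. The sketch below uses hypothetical random weight matrices `W1` and `W2` with no bias terms.

```python
import numpy as np

rng = np.random.default_rng(0)
W1 = rng.standard_normal((4, 3))  # hypothetical first-layer weights
W2 = rng.standard_normal((2, 4))  # hypothetical second-layer weights
x = rng.standard_normal(3)

# Two linear layers in a row...
two_linear_layers = W2 @ (W1 @ x)
# ...equal one linear layer whose weight matrix is W2 @ W1.
one_linear_layer = (W2 @ W1) @ x
collapses = np.allclose(two_linear_layers, one_linear_layer)
print(collapses)  # True

# With a non-linearity in between, no single matrix reproduces the
# mapping for every input.
def relu(z):
    return np.maximum(z, 0.0)

with_relu = W2 @ relu(W1 @ x)
```

So depth only adds expressive power when a non-linear activation sits between the layers.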

History of Activation Functions

Sigmoid

Slide from CMU 16-720

Sigmoid

• Squashes numbers to range [0,1] – can kill gradients. (Vanishing gradient)

• Best for learning “logical” functions – i.e. functions on binary inputs.

• Not as good for image networks (replaced by ReLU).
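
A short sketch of the vanishing-gradient point: the sigmoid's derivative is s(x)(1 - s(x)), which peaks at 0.25 and becomes tiny once the input is far from 0. The sample inputs are arbitrary.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    """Derivative of the sigmoid: s(x) * (1 - s(x))."""
    s = sigmoid(x)
    return s * (1.0 - s)

xs = np.array([-10.0, 0.0, 10.0])
grads = sigmoid_grad(xs)
print(grads)  # roughly 4.5e-05 at x = -10 and x = 10, exactly 0.25 at x = 0
```

Because backpropagation multiplies these factors layer by layer, saturated units pass almost no gradient back.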

Rectified Linear Unit

Slide from CMU 16-720

Why ReLU?

• Inexpensive computations. (Almost 6x faster than sigmoid!)

• No vanishing gradient for positive inputs!

• Leaky ReLUs are used to prevent “dying” neurons.

• Sparse gradients. (Skip computations where input < 0)
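
The points above can be sketched in NumPy. The leaky slope of 0.01 is a common default but is an assumption here, not a value from the slides, and the gradient at exactly 0 is taken to be the negative-side value by convention.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def relu_grad(x):
    """Gradient is 1 for positive inputs and 0 otherwise (sparse)."""
    return (x > 0).astype(float)

def leaky_relu(x, slope=0.01):
    """Small non-zero slope for x <= 0 keeps some gradient flowing."""
    return np.where(x > 0, x, slope * x)

def leaky_relu_grad(x, slope=0.01):
    return np.where(x > 0, 1.0, slope)

x = np.array([-2.0, 0.0, 3.0])
print(relu(x), relu_grad(x))              # [0. 0. 3.] [0. 0. 1.]
print(leaky_relu(x), leaky_relu_grad(x))  # negative side scaled by 0.01
```

A ReLU unit whose input stays negative gets zero gradient and stops updating ("dies"); the leaky variant avoids that at the cost of less sparsity.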

Softmax Function

• All positive values which sum to 1.

• Final layer after output layer.

• Neat probabilistic interpretation – gives probabilities of each class.
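
A minimal softmax sketch for a single vector of class scores. Subtracting the maximum score first is a standard numerical-stability trick and does not change the result; the example scores are made up.

```python
import numpy as np

def softmax(scores):
    """Map raw class scores to positive values that sum to 1."""
    shifted = scores - np.max(scores)  # avoids overflow in exp
    exps = np.exp(shifted)
    return exps / np.sum(exps)

probs = softmax(np.array([2.0, 1.0, 0.1]))  # hypothetical scores for 3 classes
print(probs, probs.sum())
```

The largest score gets the largest probability, and the ordering of the classes is preserved.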

Deep Learning is just a combination of Convolutions + Pooling + ReLU

Network Initialization

How do you initialize all the weights in the network?

We do not know the final values of the weights.

All weights = 0?

• No learning.

• All outputs are 0.

• Errors are not backpropagated.

• No updates.
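
These four points can be checked on a toy network. The sketch assumes a two-layer network with a ReLU hidden layer, no bias terms, and a squared-error loss; the shapes, input, and target are hypothetical.

```python
import numpy as np

def forward_backward(W1, W2, x, target):
    """Forward pass and manual backprop for y = W2 @ relu(W1 @ x)."""
    z = W1 @ x
    h = np.maximum(z, 0.0)
    y = W2 @ h
    err = y - target              # gradient of 0.5 * ||y - target||^2 w.r.t. y
    dW2 = np.outer(err, h)        # zero when the hidden activations are zero
    dh = W2.T @ err               # zero when W2 is zero: error is not passed back
    dz = dh * (z > 0)
    dW1 = np.outer(dz, x)
    return y, dW1, dW2

x = np.array([1.0, -2.0, 0.5])
target = np.array([1.0])
W1 = np.zeros((4, 3))
W2 = np.zeros((1, 4))

y, dW1, dW2 = forward_backward(W1, W2, x, target)
print(y, np.abs(dW1).max(), np.abs(dW2).max())  # [0.] 0.0 0.0
```

The output is 0 and both weight gradients are exactly 0, so gradient descent never moves the weights.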

Initialized to small random values

• We want the weights close to 0, but not exactly 0.

• Initialize to small random values to break symmetry.

• Recommended: Sample from Uniform(-r, r)

$r = 4\sqrt{\dfrac{6}{in + out}}$
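
A sketch of this initialization, assuming the slide's formula reads r = 4 * sqrt(6 / (in + out)), where `in` and `out` are the numbers of input and output units of the layer. The factor 4 is an assumption read from the garbled slide text; the widely used Glorot uniform scheme corresponds to `scale=1.0`. The function name and the layer sizes are hypothetical.

```python
import numpy as np

def init_uniform(fan_in, fan_out, scale=4.0, seed=0):
    """Sample a (fan_out, fan_in) weight matrix from Uniform(-r, r)."""
    r = scale * np.sqrt(6.0 / (fan_in + fan_out))
    rng = np.random.default_rng(seed)
    weights = rng.uniform(-r, r, size=(fan_out, fan_in))
    return weights, r

W, r = init_uniform(300, 100)
print(W.shape, r)  # (100, 300) and r close to 0.49
```

The weights are small and centred on 0, but each one is different, which is what breaks the symmetry between units.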

Top deep learning libraries

Terminologies

• Iteration: 1 forward (and backward) pass on one batch

• Epoch: 1 full training cycle on the data set

• Batch size: Number of samples trained per iteration

• Learning rate: Update = Learning Rate x Gradient

• Max epochs: Usually 20. (Depends on the data set)
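
How these terms relate can be sketched with a little arithmetic and a plain gradient-descent step. The data set size, batch size, and learning rate are hypothetical values; only the 20 epochs come from the slide. The update is subtracted from the weights, which is the usual convention when minimizing a loss.

```python
import math
import numpy as np

num_samples = 1000    # hypothetical data set size
batch_size = 50       # samples processed per iteration
max_epochs = 20       # full passes over the data set
learning_rate = 0.1   # hypothetical value

# One epoch visits every sample once, one batch per iteration.
iterations_per_epoch = math.ceil(num_samples / batch_size)
total_iterations = iterations_per_epoch * max_epochs
print(iterations_per_epoch, total_iterations)  # 20 400

def sgd_step(weights, gradient, learning_rate):
    """Update = learning rate x gradient, subtracted from the weights."""
    update = learning_rate * gradient
    return weights - update

new_weights = sgd_step(np.array([1.0, 2.0]), np.array([0.5, -1.0]), learning_rate)
print(new_weights)  # [0.95 2.1 ]
```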