

An Introduction to Deep Learning

By Rahil Mahdian – December 12-13, 2017


Outline

• Machine Learning

• Learning Strategies

• Neural Network Learning

• Deep Learning

• Feed Forward Network

• Problems of Deep Learning

• AutoEncoders

• Restricted Boltzmann Machines

• Convolutional Neural Networks

• Recurrent Neural Networks (RNNs, LSTM)

• Deep Learning Applications


Scope of Machine Learning


Nando de Freitas, Oxford

When to Apply Machine Learning


Nando de Freitas, Oxford

Machine Learning Pioneering (C. Shannon, 1961)


Learning Types


• Supervised Learning

• Unsupervised Learning

• Semi-Supervised Learning

Machine Learning vs Deep Learning


Perceptron – Single-Neuron Element (Rosenblatt, 1958)


e.g., a sigmoid activation function, $\sigma(z) = 1 / (1 + e^{-z})$
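
A minimal sketch of such a neuron in Python (the names are illustrative, not from the slides): a weighted sum of the inputs plus a bias, squashed by the sigmoid.

```python
import numpy as np

def sigmoid(z):
    # Sigmoid squashes any real input into (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def neuron(x, w, b):
    # Weighted sum of inputs plus bias, passed through the activation.
    return sigmoid(np.dot(w, x) + b)

x = np.array([0.5, -1.0, 2.0])   # example input
w = np.array([0.1, 0.4, -0.2])   # weights
b = 0.05                         # bias
print(neuron(x, w, b))
```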

Neural Networks (MLP)


Training Schemes (SGD, Batch, Mini-Batch)


• SGD: update the weights after every single sample

• Batch: update once per pass over the full training set

• Mini-Batch: update after each small chunk of samples
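
The three schemes differ only in how much data feeds each weight update. A hedged sketch on a toy least-squares problem (all names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))          # 100 samples, 3 features
y = X @ np.array([1.0, -2.0, 0.5])     # linear targets

def grad(w, Xb, yb):
    # Gradient of mean squared error on the batch (Xb, yb).
    return 2.0 * Xb.T @ (Xb @ w - yb) / len(yb)

w = np.zeros(3); eta = 0.1
for epoch in range(50):
    # SGD: one sample per update.
    #   for i in rng.permutation(len(X)): w -= eta * grad(w, X[i:i+1], y[i:i+1])
    # Batch: the whole dataset per update.
    #   w -= eta * grad(w, X, y)
    # Mini-batch: a small chunk per update (used here).
    for i in range(0, len(X), 20):
        w -= eta * grad(w, X[i:i+20], y[i:i+20])
print(w)  # approaches [1.0, -2.0, 0.5]
```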

MLP - Function Approximation


Deep Motivation – Different Layers of Abstraction


Deep Neural Networks


Feed Forward Neural Networks


Training DNNs - Backpropagation

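A compact illustration of backpropagation on a two-layer network with sigmoid units and squared-error loss (an assumed setup for the sketch, not the slides' exact network):

```python
import numpy as np

def sigmoid(z): return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(1)
W1, W2 = rng.normal(size=(4, 3)), rng.normal(size=(1, 4))
x, t = rng.normal(size=3), np.array([1.0])   # input and target

for step in range(1000):
    # Forward pass.
    h = sigmoid(W1 @ x)        # hidden activations
    yhat = sigmoid(W2 @ h)     # output
    # Backward pass: propagate the error layer by layer.
    d2 = (yhat - t) * yhat * (1 - yhat)    # output delta
    d1 = (W2.T @ d2) * h * (1 - h)         # hidden delta
    W2 -= 0.5 * np.outer(d2, h)
    W1 -= 0.5 * np.outer(d1, x)
print(float(yhat))  # approaches the target 1.0
```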

Deep NN – Training Problems


Back-propagation runs into three main difficulties when training deep neural networks:

• Vanishing gradient – the output error fades before it reaches the nodes farther back

• Overfitting

• Computational load

Vanishing Gradient Solutions

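One standard remedy (offered here as an example; the slide's own list is not in the transcript) is to swap the sigmoid for ReLU: the sigmoid's derivative never exceeds 0.25, so stacking layers multiplies the error signal by small factors, while ReLU passes a gradient of exactly 1 through active units. A small numeric illustration:

```python
import numpy as np

def dsigmoid(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1 - s)            # never exceeds 0.25

def drelu(z):
    return (z > 0).astype(float)  # exactly 1 for active units

z = 0.5
# Error signal surviving 10 layers of each activation:
print(dsigmoid(z) ** 10)  # ~5e-7: vanishes
print(drelu(z) ** 10)     # 1.0: preserved
```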

Overfitting – Generalization Problem


Bishop

Model Complexity


Data Matters


Occam's razor (William of Ockham, c. 1287–1347): among competing hypotheses, the one with the fewest assumptions should be selected. In the related concept of overfitting, excessively complex models are affected by statistical noise (a problem also known as the bias-variance trade-off), whereas simpler models may capture the underlying structure better and may thus have better predictive performance.

Hoeffding's inequality relates the empirical error rate $\hat{E}$ measured on $N$ training samples to the true error rate $E$. For a class of $M$ candidate hypotheses,

$$P\big(|E - \hat{E}| > \epsilon\big) \;\le\; 2M e^{-2\epsilon^2 N}$$

The right-hand side is the tolerated failure rate $\delta$; solving $2M e^{-2\epsilon^2 N} \le \delta$ for $N$ yields the number of sufficient samples. The number of model parameters enters through $M$: on a 32-bit floating-point computer, a model with $d$ parameters can represent at most $2^{32d}$ distinct hypotheses.
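
Solving the bound for $N$ gives a quick sample-size calculator (a sketch; the function name is illustrative):

```python
import math

def sufficient_samples(eps, delta, M):
    # N >= ln(2M / delta) / (2 * eps**2), from Hoeffding's bound.
    return math.ceil(math.log(2 * M / delta) / (2 * eps ** 2))

# e.g., tolerate an error gap of eps = 0.05 with failure rate
# delta = 0.05 over M = 1000 candidate hypotheses:
print(sufficient_samples(0.05, 0.05, 1000))  # 2120 samples
```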

Overfitting Solutions – Dropout & Regularization


Drop out: randomly disable a fraction of the units during training.
Rule of thumb: drop 50% of the units in hidden layers and 25% in the input layer.

Regularization: add a norm of the weights to the cost function (l1-norm, l2-norm).

Data augmentation is also a way to avoid overfitting; e.g., adding noise, translating data, etc.
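
A hedged sketch of both tricks in NumPy, using the rates from the rule of thumb (all names illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(a, keep_prob):
    # Inverted dropout: zero out units, rescale survivors so the
    # expected activation is unchanged at test time.
    mask = rng.random(a.shape) < keep_prob
    return a * mask / keep_prob

h = rng.normal(size=8)
h = dropout(h, keep_prob=0.5)    # 50% dropout for a hidden layer

def l2_cost(data_loss, weights, lam=1e-3):
    # Regularized cost: data term plus lambda * ||W||^2 (l2-norm).
    return data_loss + lam * sum(np.sum(W ** 2) for W in weights)
```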

AutoEncoder – Nonlinear Dimensionality Reduction

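A minimal autoencoder sketch: encode a 6-d input to a 2-d bottleneck code, decode it back, and train both maps to minimize reconstruction error (sizes and names are assumptions for illustration):

```python
import numpy as np

def sigmoid(z): return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
We = rng.normal(size=(2, 6)) * 0.5   # encoder weights
Wd = rng.normal(size=(6, 2)) * 0.5   # decoder weights

X = rng.random((200, 6))             # toy data in [0, 1]
for step in range(2000):
    x = X[rng.integers(len(X))]
    code = sigmoid(We @ x)           # 6-d input -> 2-d bottleneck
    recon = sigmoid(Wd @ code)       # 2-d code -> 6-d reconstruction
    err = recon - x                  # reconstruction error
    dd = err * recon * (1 - recon)
    de = (Wd.T @ dd) * code * (1 - code)
    Wd -= 0.1 * np.outer(dd, code)
    We -= 0.1 * np.outer(de, x)
```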

Restricted Boltzmann Machines (RBM)


Hugo Larochelle
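
An RBM pairs a visible layer with a hidden layer through symmetric weights, with no connections inside a layer, and is typically trained with contrastive divergence. A CD-1 sketch under those standard assumptions (biases omitted for brevity; names illustrative):

```python
import numpy as np

def sigmoid(z): return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(6, 3))   # 6 visible x 3 hidden units

def cd1_step(v0, W, lr=0.05):
    # Up: sample hidden units given the data ("positive" phase).
    ph0 = sigmoid(v0 @ W)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # Down-up: one Gibbs step for the "negative" phase.
    pv1 = sigmoid(h0 @ W.T)
    ph1 = sigmoid(pv1 @ W)
    # CD-1 update: positive minus negative correlations.
    return W + lr * (np.outer(v0, ph0) - np.outer(pv1, ph1))

v = (rng.random(6) < 0.5).astype(float)  # a toy binary sample
W = cd1_step(v, W)
```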

Unsupervised Pre-training – another solution


Convolutional Neural Network (LeCun et al. 1993, LeNet)


Convolutional NN – Architecture


Photo: Phil Kim

ConvNet – How It Works


Vincent Vanhoucke, Google
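
At its core, a ConvNet slides small kernels over the input, applies a nonlinearity, and pools. A plain-NumPy sketch of one valid 2-D convolution, ReLU, and 2x2 max-pooling (illustrative, not the slides' code):

```python
import numpy as np

def conv2d(img, k):
    # Valid convolution: slide kernel k over img, dot at each position.
    H, W = img.shape; kh, kw = k.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(img[i:i+kh, j:j+kw] * k)
    return out

def maxpool2x2(x):
    # Keep the strongest response in each 2x2 block.
    H, W = x.shape
    return x[:H//2*2, :W//2*2].reshape(H//2, 2, W//2, 2).max(axis=(1, 3))

img = np.random.default_rng(0).random((8, 8))
edge = np.array([[1., 0., -1.]] * 3)        # simple vertical-edge kernel
fmap = np.maximum(conv2d(img, edge), 0.0)   # ReLU feature map
print(maxpool2x2(fmap).shape)               # (3, 3)
```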

Convolutional NN – (LeCun, Fukushima)


AlexNet (2012)

ConvNet for Speech


CNN Structures


LSTM and RNNs – sequential data

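The LSTM cell carries a separate memory state through time, updated by forget, input, and output gates. A single-step sketch with random placeholder weights (an illustrative layout, not the slides' code):

```python
import numpy as np

def sigmoid(z): return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, b):
    # One time step: forget gate f, input gate i, output gate o,
    # and candidate memory g, all computed from [x, h].
    z = W @ np.concatenate([x, h]) + b
    f, i, o, g = np.split(z, 4)
    f, i, o, g = sigmoid(f), sigmoid(i), sigmoid(o), np.tanh(g)
    c = f * c + i * g          # cell state: gated memory
    h = o * np.tanh(c)         # hidden state: gated read-out
    return h, c

n_in, n_hid = 3, 4
rng = np.random.default_rng(0)
W = rng.normal(size=(4 * n_hid, n_in + n_hid))
b = np.zeros(4 * n_hid)
h = c = np.zeros(n_hid)
for x in rng.normal(size=(5, n_in)):   # a length-5 input sequence
    h, c = lstm_step(x, h, c, W, b)
```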

LSTM Training


RNNs & Multi-Hypothesis Tracking – Beam Search


Vincent Vanhoucke, Google
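
Beam search keeps the k best partial hypotheses at every step instead of committing to one greedy choice. A generic sketch over per-step token log-probabilities (for simplicity the distribution here ignores the prefix, unlike a real RNN decoder):

```python
import numpy as np

def beam_search(step_logprobs, k=3):
    # step_logprobs: (T, V) array of per-step token log-probabilities.
    # Keeps the k highest-scoring partial sequences at each step.
    beams = [((), 0.0)]
    for logp in step_logprobs:
        candidates = [(seq + (tok,), score + logp[tok])
                      for seq, score in beams
                      for tok in range(len(logp))]
        candidates.sort(key=lambda sc: sc[1], reverse=True)
        beams = candidates[:k]
    return beams

logits = np.random.default_rng(0).normal(size=(4, 5))
logp = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
print(beam_search(logp, k=3)[0])  # best hypothesis and its score
```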

Captioning & Translation - RNNs


Vincent Vanhoucke, Google

Deep Learning Applications: Computer Vision


DNN Application - Caption Generation


Wrap Up


• Machine Learning influence

• Learning Neural Networks

• Deep Learning motivation, problems, solutions

• Unsupervised Neural Networks: AutoEncoders, Restricted Boltzmann Machines

• Deep Learning training solutions

• Feed Forward Neural Networks as MLPs

• Convolutional Neural Networks

• Recurrent Neural Networks for sequential data

• LSTMs as an extension of RNNs

• Applications of DNNs


Thanks for attending the talk.

Questions?
