
CSC411 Tutorial #5 Neural Networks

Oct 2017, Shengyang Sun

ssy@cs.toronto.edu

*Based on the lectures given by Professor Sanja Fidler and the previous tutorials by Boris Ivanovic and Yujia Li.

High-Level Overview

• A Neural Network is a function!

• It is (generally) composed of:
  – Neurons, which pass input values through functions and output the result
  – Weights, which carry values between neurons

• We group neurons into layers. There are 3 main types of layers:
  – Input Layer
  – Hidden Layer(s)
  – Output Layer

Neuron Breakdown

[Figure: a single neuron with inputs x1, x2, x3, weights w1, w2, w3, and a bias b; an activation function is applied to the weighted sum to produce the output.]
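Concretely, the neuron in the figure computes a weighted sum of its inputs plus the bias, then applies an activation function. A minimal sketch in NumPy, where the sigmoid activation is an illustrative assumption (the slide does not fix a particular choice):

    import numpy as np

    def neuron(x, w, b):
        # Weighted sum of inputs plus bias: w1*x1 + w2*x2 + w3*x3 + b
        z = np.dot(w, x) + b
        # Sigmoid activation (illustrative choice)
        return 1.0 / (1.0 + np.exp(-z))

    # Example with three inputs, as in the figure
    x = np.array([0.5, -1.0, 2.0])
    w = np.array([0.1, 0.4, -0.3])
    print(neuron(x, w, b=0.2))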

Activation Functions

Common choices include the sigmoid, tanh, and ReLU; the ReLU has been the most popular recently for deep learning.
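As a reference, minimal NumPy definitions of these three activations (a sketch; the exact set shown on the slide is an assumption):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))   # squashes values into (0, 1)

    def tanh(z):
        return np.tanh(z)                 # squashes values into (-1, 1)

    def relu(z):
        return np.maximum(0.0, z)         # max(0, z); cheap and works well in deep nets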

Representation Power

What does this mean?

• Neural Networks are POWERFUL; this expressive power, combined with recent advances in computing, is exactly why interest in them was renewed.

BUT • “With great power comes great overfitting.” – Boris Ivanovic, 2016

• The “20 hidden neurons” example on the previous slide illustrates this.

How to mitigate this?

• Stay Tuned!

• First, how do we even use or train neural networks?

Training Neural Networks (Key Idea)

• Key idea: define a loss between the network's outputs and the targets, then adjust the weights to minimize it.
  – Regression: squared-error loss
  – Classification: cross-entropy loss
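A minimal sketch of the two losses, assuming outputs y and targets t as NumPy arrays (binary targets for the cross-entropy case):

    import numpy as np

    def squared_error(y, t):
        # Regression loss: half the sum of squared differences
        return 0.5 * np.sum((y - t) ** 2)

    def cross_entropy(y, t):
        # Binary classification loss; y are predicted probabilities in (0, 1)
        return -np.sum(t * np.log(y) + (1 - t) * np.log(1 - y))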

Training Compared to Other Models

• Training Neural Networks is a NON-CONVEX OPTIMIZATION PROBLEM.

• This means we can run into many local optima during training.

[Figure: the loss plotted as a function of a weight, showing several local optima.]

Training Neural Networks (Implementation)

• We need to first perform a forward pass

• Then, we update weights with a backward pass

Forward Pass (AKA “Inference”)
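A minimal sketch of a forward pass through a network with one hidden layer, assuming sigmoid activations throughout (the shapes in the comments are illustrative):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def forward(x, W1, b1, W2, b2):
        # Example shapes: x (3,), W1 (4, 3), b1 (4,), W2 (2, 4), b2 (2,)
        h = sigmoid(W1 @ x + b1)   # hidden-layer activations
        y = sigmoid(W2 @ h + b2)   # network output
        return h, y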

Backward Pass (AKA “Backprop.”)

Learning Weights during Backprop

• Do exactly what we’ve been doing!

• Take the derivative of the error/cost/loss function w.r.t. the weights and minimize via gradient descent!
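Continuing the forward-pass sketch above, a backward pass for the same one-hidden-layer network under a squared-error loss, derived via the chain rule (names and shapes are illustrative assumptions):

    import numpy as np

    def backward(x, t, h, y, W2):
        # Gradients of E = 0.5 * ||y - t||^2 w.r.t. all weights and biases
        dy = (y - t) * y * (1 - y)        # output delta (includes sigmoid derivative)
        dW2 = np.outer(dy, h)
        db2 = dy
        dh = (W2.T @ dy) * h * (1 - h)    # delta propagated back to the hidden layer
        dW1 = np.outer(dh, x)
        db1 = dh
        return dW1, db1, dW2, db2

    # One gradient-descent step with learning rate lr:
    #   W1 -= lr * dW1;  b1 -= lr * db1;  W2 -= lr * dW2;  b2 -= lr * db2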

Useful Derivatives
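For the activations above, the derivatives needed during backprop are standard; a short sketch reusing the earlier NumPy definitions (sigmoid, np):

    def dsigmoid(z):
        s = sigmoid(z)
        return s * (1 - s)             # sigma'(z) = sigma(z) * (1 - sigma(z))

    def dtanh(z):
        return 1 - np.tanh(z) ** 2     # tanh'(z) = 1 - tanh(z)^2

    def drelu(z):
        return (z > 0).astype(float)   # ReLU'(z) = 1 if z > 0 else 0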

Preventing Overfitting

• Weight decay

• Early stopping

Limiting the Size of the Weights

The Effects of Weight-Decay
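As a sketch of the mechanism: L2 weight decay adds a penalty (lambda/2) * ||w||^2 to the loss, which shows up as an extra lambda * w term in each gradient step, pulling every weight toward zero. Continuing the earlier sketch (lam and lr are illustrative hyperparameters):

    lam, lr = 1e-4, 0.1

    # Gradient-descent step with L2 weight decay
    W1 -= lr * (dW1 + lam * W1)
    W2 -= lr * (dW2 + lam * W2)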

Early Stopping

Why Early Stopping Works
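A minimal sketch of early stopping: train while monitoring a held-out validation loss, and stop once it stops improving. Here model, train_epoch, and validation_loss are hypothetical names, not part of the tutorial:

    max_epochs, patience = 100, 5
    best_val, bad_epochs = float("inf"), 0

    for epoch in range(max_epochs):
        train_epoch(model)              # hypothetical: one pass over the training data
        val = validation_loss(model)    # hypothetical: loss on held-out data
        if val < best_val:
            best_val, bad_epochs = val, 0   # validation improved; keep training
        else:
            bad_epochs += 1
            if bad_epochs >= patience:      # validation stopped improving; stop early
                break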

Neural Network Visualizations

• http://playground.tensorflow.org

• http://scs.ryerson.ca/~aharley/vis/conv/

Questions
