review of hw2

Post on 04-Oct-2021

4 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Review of HW2Liyu Chen

When , to compute , we consider the following three cases:

1.

2.

3.

=

When , the derivative is simply 0.

Combining both cases, we get

By , we can store instead of

Make a prediction:

Update weights:

Computation: Loss:

Parameters:

Chain rule:

Chain rule:

Chain rule:

Note: should be

Can also write it as a -dimensional vector

top related