DSE 2020-21 2nd Sem DL Problem Solving 2.0
η_x,opt = 1/6
η_y,opt = 1/4
η_z,opt = 1/8
Minimization of Quadratic Error Function
Weight Updates – Ordinary Gradient Descent
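As a rough check on the plain gradient-descent step, the sketch below uses the gradient expressions, the current weights (1.5, 2.0) and the learning rate 0.3 that appear in the other worked lines of this section. The function names grad and gd_step are illustrative assumptions, not part of the original slides.

```python
def grad(w1, w2):
    """Gradient of the quadratic error, as written in the worked lines."""
    dE_dw1 = 0.5 * (w1 - 3) - (w2 - 4) / 6
    dE_dw2 = (2 / 9) * (w2 - 4) - (w1 - 3) / 6
    return dE_dw1, dE_dw2


def gd_step(w1, w2, lr=0.3):
    """One ordinary gradient-descent update: w <- w - lr * dE/dw."""
    g1, g2 = grad(w1, w2)
    return w1 - lr * g1, w2 - lr * g2


# Current weights w(t) = (1.5, 2.0), taken from the worked lines.
w1_next, w2_next = gd_step(1.5, 2.0)
print(round(w1_next, 3), round(w2_next, 3))
```

The w2 result, about 2.058, is the same value the momentum calculation starts from.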
Weight Updates – Momentum Method
w2(t+1) = 2.058 + 0.9*(2.0 - 1.0) = 2.958
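The momentum update can be sketched as below, assuming the form w(t+1) = w(t) - η dE/dw + α (w(t) - w(t-1)) with η = 0.3 and α = 0.9, and previous weights w(t-1) = (1.0, 1.0) as implied by the worked numbers. The names grad and momentum_step are illustrative assumptions.

```python
def grad(w1, w2):
    """Gradient of the quadratic error, as written in the worked lines."""
    dE_dw1 = 0.5 * (w1 - 3) - (w2 - 4) / 6
    dE_dw2 = (2 / 9) * (w2 - 4) - (w1 - 3) / 6
    return dE_dw1, dE_dw2


def momentum_step(w, w_prev, lr=0.3, alpha=0.9):
    """Gradient step at w(t) plus alpha times the previous weight change."""
    g = grad(*w)
    return tuple(
        wi - lr * gi + alpha * (wi - wpi)
        for wi, gi, wpi in zip(w, g, w_prev)
    )


# w(t) = (1.5, 2.0), w(t-1) = (1.0, 1.0)
w1_next, w2_next = momentum_step((1.5, 2.0), (1.0, 1.0))
print(round(w1_next, 3), round(w2_next, 3))
```

The w2 component reproduces the 2.958 shown above; the w1 component follows from the same rule.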
Nesterov
w1_int = 1.5 + 0.9*0.5 = 1.95    w2_int = 2.0 + 0.9*1.0 = 2.9
dE/dw1(w1_int, w2_int) = 0.5*(1.95 - 3) - (2.9 - 4)/6 = -0.342
dE/dw2(w1_int, w2_int) = (2/9)*(2.9 - 4) - (1.95 - 3)/6 = -0.0694
w1(t+1) = 1.95 + 0.3*0.342 = 2.0526    w2(t+1) = 2.9 + 0.3*0.0694 = 2.92
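The Nesterov steps above can be sketched as follows: first move to the look-ahead point using the previous weight change, then take the gradient step from there. The values η = 0.3, α = 0.9, w(t) = (1.5, 2.0) and w(t-1) = (1.0, 1.0) are read off the worked lines; the function names are illustrative assumptions.

```python
def grad(w1, w2):
    """Gradient of the quadratic error, as written in the worked lines."""
    dE_dw1 = 0.5 * (w1 - 3) - (w2 - 4) / 6
    dE_dw2 = (2 / 9) * (w2 - 4) - (w1 - 3) / 6
    return dE_dw1, dE_dw2


def nesterov_step(w, w_prev, lr=0.3, alpha=0.9):
    """Return the look-ahead point and the updated weights."""
    # Look-ahead: apply the momentum term before evaluating the gradient.
    w_int = tuple(wi + alpha * (wi - wpi) for wi, wpi in zip(w, w_prev))
    g = grad(*w_int)
    # Gradient step taken from the look-ahead point.
    w_next = tuple(wi - lr * gi for wi, gi in zip(w_int, g))
    return w_int, w_next


(w1_int, w2_int), (w1_next, w2_next) = nesterov_step((1.5, 2.0), (1.0, 1.0))
print(round(w1_int, 2), round(w2_int, 2))
print(round(w1_next, 4), round(w2_next, 4))
```

Because the code carries unrounded gradients, the last digit of w1(t+1) can differ slightly from the hand calculation, which rounds the gradient to three decimals first.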
Weight Updates – RProp
At time t-1,
dE/dw1 = 0.5*(1 - 3) - (1 - 4)/6 = -0.5
dE/dw2 = (2/9)*(1 - 4) - (1 - 3)/6 = -0.333
At time t,
dE/dw1 = 0.5*(1.5 - 3) - (2.0 - 4)/6 = -0.4167
dE/dw2 = (2/9)*(2 - 4) - (1.5 - 3)/6 = -0.194
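A minimal sketch of how these two gradient evaluations would feed an RProp update: the step size for each weight grows when the gradient keeps its sign and shrinks when the sign flips, and the weight then moves against the sign of the current gradient. The factors 1.2 and 0.5 are the commonly used defaults, and the previous step sizes (0.5, 1.0) are assumed equal to the last weight changes. Neither is stated in this section, so treat them as hypothetical. Step-size bounds and backtracking are omitted.

```python
def grad(w1, w2):
    """Gradient of the quadratic error, as written in the worked lines."""
    dE_dw1 = 0.5 * (w1 - 3) - (w2 - 4) / 6
    dE_dw2 = (2 / 9) * (w2 - 4) - (w1 - 3) / 6
    return dE_dw1, dE_dw2


def sign(x):
    """Return -1, 0 or +1 according to the sign of x."""
    return (x > 0) - (x < 0)


def rprop_step_size(g_prev, g_curr, step_prev, eta_plus=1.2, eta_minus=0.5):
    """Grow the step if the gradient sign is unchanged, shrink it on a flip."""
    product = g_prev * g_curr
    if product > 0:
        return step_prev * eta_plus
    if product < 0:
        return step_prev * eta_minus
    return step_prev


w_prev, w_curr = (1.0, 1.0), (1.5, 2.0)
g_prev, g_curr = grad(*w_prev), grad(*w_curr)

# Assumed previous step sizes (hypothetical): the last weight changes.
steps_prev = (0.5, 1.0)
steps = tuple(
    rprop_step_size(gp, gc, sp) for gp, gc, sp in zip(g_prev, g_curr, steps_prev)
)
# Move each weight against the sign of its current gradient.
w_next = tuple(w - sign(g) * s for w, g, s in zip(w_curr, g_curr, steps))

print([round(g, 4) for g in g_prev], [round(g, 4) for g in g_curr])
print([round(s, 3) for s in steps], [round(w, 3) for w in w_next])
```

Only the gradient signs matter to RProp, so the magnitudes computed above affect neither the step size nor the direction of the move.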