
lesson2

March 16, 2024

#Chapter 4: Neural Network Training


##Sum of Squared Errors
[ ]: import numpy as np
y = [1.0,0.05,0.6,0.0,0.05,0.1,0.0,0.1,0.0,0.0]
#y = [1.0,0.05,0.1,0.0,0.05,0.1,0.0,0.6,0.0,0.0]
t = [0,0,1,0,0,0,0,0,0,0]

def sum_squared_error(y, t):
    return 0.5 * np.sum((y - t)**2)

sqe = sum_squared_error(np.array(y), np.array(t))

print(sqe)

0.5925
##Cross-Entropy Error
[ ]: import numpy as np

def cross_entropy_error(y, t):
    delta = 1e-7  # small constant to avoid log(0)
    return -np.sum(t * np.log(y + delta))

y = [1.0,0.05,0.6,0.0,0.05,0.1,0.0,0.1,0.0,0.0]
t = [0,0,1,0,0,0,0,0,0,0]

cee = cross_entropy_error(np.array(y), np.array(t))


print(cee)

0.510825457099338
• In the example, the output for the correct label is 0.6 and the cross-entropy error is about 0.51.
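As a quick check (a minimal sketch reusing the cell above; y2 is the commented-out alternative output from the sum-of-squared-errors cell), lowering the output for the correct class increases the cross-entropy error:

[ ]: # The probability for the correct class (index 2) drops from 0.6 to 0.1,
# so the error grows from about 0.51 to about 2.30.
y2 = [1.0,0.05,0.1,0.0,0.05,0.1,0.0,0.6,0.0,0.0]
print(cross_entropy_error(np.array(y2), np.array(t)))  # -log(0.1) = about 2.30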
##Mini-Batch Learning
In neural network training, a small subset of the training data is selected at random, and training is carried out on each such group of data, which is called a mini-batch.
[ ]: from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive

[9]: cd /content/drive/MyDrive/GG Colab/Deep Learning/dataset

/content/drive/MyDrive/GG Colab/Deep Learning/dataset

[10]: from my_mnist import load_mnist

[ ]: import sys, os
sys.path.append(os.pardir)
import numpy as np

(x_train, t_train), (x_test, t_test) = load_mnist(normalize=True, one_hot_label=False)

print(x_train.shape)
print(t_train.shape)

(60000, 784)
(60000,)
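A minimal sketch of drawing a random mini-batch from the loaded MNIST data with np.random.choice (the batch size of 10 is an arbitrary choice here, just for illustration):

[ ]: train_size = x_train.shape[0]                           # 60000
batch_size = 10                                          # arbitrary size for illustration
batch_mask = np.random.choice(train_size, batch_size)    # 10 random indices
x_batch = x_train[batch_mask]                            # (10, 784)
t_batch = t_train[batch_mask]                            # (10,)
print(x_batch.shape, t_batch.shape)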
##Numerical Differentiation
[ ]: import numpy as np
import matplotlib.pyplot as plt

def numerical_diff(f, x):
    h = 1e-4
    return (f(x+h) - f(x-h)) / (2*h)

def function_1(x):
    return 0.01*x**2 + 0.1*x

x = np.arange(0.0,20.0, 0.1)
y = function_1(x)
plt.xlabel("x")
plt.ylabel("f(x)")
plt.plot(x,y)
plt.show()
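As a quick sanity check of numerical_diff (the analytic derivative of function_1 is 0.02x + 0.1, so the expected values at x = 5 and x = 10 are 0.2 and 0.3):

[ ]: print(numerical_diff(function_1, 5))   # about 0.2 (analytic: 0.02*5 + 0.1)
print(numerical_diff(function_1, 10))  # about 0.3 (analytic: 0.02*10 + 0.1)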

##Partial Derivative
[ ]: def function_2(x):
    return x[0]**2 + x[1]**2
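A minimal sketch of computing the partial derivative of function_2 with respect to x0 at (x0, x1) = (3, 4): fix x1 at 4.0 and reuse numerical_diff from above (function_tmp1 is a helper introduced here only for illustration; the analytic value is 2 * 3 = 6):

[ ]: def function_tmp1(x0):
    return x0**2 + 4.0**2   # function_2 with x1 fixed at 4.0

print(numerical_diff(function_tmp1, 3.0))  # about 6.0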

##Gradient
[ ]: import numpy as np

def function_2(x):
    return x[0]**2 + x[1]**2

def numericial_gradient(f, x):
    h = 1e-4
    grad = np.zeros_like(x)  # array of zeros with the same shape as x

    for idx in range(x.size):
        tmp_val = x[idx]
        x[idx] = tmp_val + h
        fxh1 = f(x)

        x[idx] = tmp_val - h
        fxh2 = f(x)

        grad[idx] = (fxh1 - fxh2) / (2*h)
        x[idx] = tmp_val

    return grad

numericial_gradient(function_2, np.array([3.0, 4.0]))

[ ]: array([6., 8.])
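The same gradient function can be evaluated at other points (a small added check; the analytic gradient of function_2 is (2*x0, 2*x1)):

[ ]: print(numericial_gradient(function_2, np.array([0.0, 2.0])))  # about [0., 4.]
print(numericial_gradient(function_2, np.array([3.0, 0.0])))  # about [6., 0.]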

##Gradient descent
[ ]: import numpy as np

init_x = np.array([-3.0, 4.0])

def numericial_gradient(f, x):
    h = 1e-4
    grad = np.zeros_like(x)  # array of zeros with the same shape as x

    for idx in range(x.size):
        tmp_val = x[idx]
        x[idx] = tmp_val + h
        fxh1 = f(x)

        x[idx] = tmp_val - h
        fxh2 = f(x)

        grad[idx] = (fxh1 - fxh2) / (2*h)
        x[idx] = tmp_val

    return grad

def function_2(x):
    return x[0]**2 + x[1]**2

def gradient_descent(f, init_x, lr=0.01, step_num=100):
    x = init_x

    for i in range(step_num):
        grad = numericial_gradient(f, x)
        x -= lr * grad  # step against the gradient

    return x

gradient_descent(function_2, init_x=init_x, lr=0.1, step_num=100)

[ ]: array([-6.11110793e-10, 8.14814391e-10])
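A minimal sketch illustrating how sensitive gradient descent is to the learning rate (the values 10.0 and 1e-10 are picked here only for illustration): too large a rate makes the values blow up, too small a rate barely moves them from the starting point.

[ ]: # Too large: the updates overshoot and x diverges to huge values.
print(gradient_descent(function_2, init_x=np.array([-3.0, 4.0]), lr=10.0, step_num=100))

# Too small: after 100 steps x is still essentially the initial point.
print(gradient_descent(function_2, init_x=np.array([-3.0, 4.0]), lr=1e-10, step_num=100))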

##Gradients for a Neural Network


[3]: from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive

[4]: cd /content/drive/MyDrive/GG Colab/Deep Learning/common

/content/drive/MyDrive/GG Colab/Deep Learning/common

[5]: from my_functions import softmax, cross_entropy_error

[6]: from my_gradient import numerical_gradient

[ ]: import sys, os
sys.path.append(os.pardir)
import numpy as np

class simpleNet:
    def __init__(self):
        self.W = np.random.randn(2, 3)

    def predict(self, x):
        return np.dot(x, self.W)

    def loss(self, x, t):
        z = self.predict(x)
        y = softmax(z)
        loss = cross_entropy_error(y, t)

        return loss

net = simpleNet()
print(net.W)

x = np.array([0.6, 0.9])
p = net.predict(x)
print(p)

t = np.array([0,0,1])
net.loss(x,t)

def f(W):
    return net.loss(x, t)

dW = numerical_gradient(f, net.W)

print(dW)

[[-1.02003529  0.65014502  0.34236522]
 [-0.24540338 -0.74331997  1.15400741]]
[-0.83288421 -0.27890096  1.2440258 ]
[[ 0.05597043  0.09739811 -0.15336855]
 [ 0.08395565  0.14609717 -0.23005282]]
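Each entry of dW approximates how the loss changes when the corresponding weight increases slightly; here the column for the correct class (index 2) has negative entries, so a gradient-descent update would increase those weights. The predicted class itself can be read off with np.argmax (a small added check, not in the original cell):

[ ]: print(np.argmax(p))                  # predicted class = index of the largest score
print(np.argmax(p) == np.argmax(t))  # does it match the label?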
##A Two-layer Neural Network as a Class
[17]: import sys, os
sys.path.append(os.pardir)
from my_functions import *
from my_gradient import numerical_gradient

class TwoLayerNet:
    def __init__(self, input_size, hidden_size, output_size, weight_init_std=0.01):
        self.params = {}
        self.params['W1'] = weight_init_std * np.random.randn(input_size, hidden_size)
        self.params['b1'] = np.zeros(hidden_size)
        self.params['W2'] = weight_init_std * np.random.randn(hidden_size, output_size)
        self.params['b2'] = np.zeros(output_size)

    def predict(self, x):
        W1, W2 = self.params['W1'], self.params['W2']
        b1, b2 = self.params['b1'], self.params['b2']

        a1 = np.dot(x, W1) + b1
        z1 = sigmoid(a1)
        a2 = np.dot(z1, W2) + b2
        y = softmax(a2)

        return y

    def loss(self, x, t):
        y = self.predict(x)

        return cross_entropy_error(y, t)

    def accuracy(self, x, t):
        y = self.predict(x)
        y = np.argmax(y, axis=1)
        t = np.argmax(t, axis=1)

        accuracy = np.sum(y == t) / float(x.shape[0])
        return accuracy

    def numerical_gradient(self, x, t):
        loss_W = lambda W: self.loss(x, t)
        grads = {}
        grads['W1'] = numerical_gradient(loss_W, self.params['W1'])
        grads['b1'] = numerical_gradient(loss_W, self.params['b1'])
        grads['W2'] = numerical_gradient(loss_W, self.params['W2'])
        grads['b2'] = numerical_gradient(loss_W, self.params['b2'])

        return grads

net = TwoLayerNet(input_size=784, hidden_size=100, output_size=10)

net.params['W1'].shape
net.params['b1'].shape
net.params['W2'].shape
net.params['b2'].shape

[17]: (10,)
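Only the value of the last expression is displayed above; a small added check (not in the original cell) prints all four parameter shapes and runs a forward pass on a dummy batch, assuming the sigmoid and softmax imported from my_functions accept batched input:

[ ]: for key in ('W1', 'b1', 'W2', 'b2'):
    print(key, net.params[key].shape)   # (784, 100) (100,) (100, 10) (10,)

x = np.random.rand(100, 784)   # dummy mini-batch of 100 "images"
y = net.predict(x)
print(y.shape)                 # (100, 10)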

##Implementing Mini-Batch Training


[ ]: import numpy as np
from my_mnist import load_mnist

(x_train, t_train), (x_test, t_test) = load_mnist(normalize=True, one_hot_label=True)

train_loss_list = []

iters_num = 10000
train_size = x_train.shape[0]
batch_size = 100
learning_rate = 0.1

network = TwoLayerNet(input_size=784, hidden_size=50, output_size=10)

for i in range(iters_num):
    # Pick a random mini-batch
    batch_mask = np.random.choice(train_size, batch_size)
    x_batch = x_train[batch_mask]
    t_batch = t_train[batch_mask]

    # Compute the gradients numerically and update each parameter
    grad = network.numerical_gradient(x_batch, t_batch)

    for key in ('W1', 'b1', 'W2', 'b2'):
        network.params[key] -= learning_rate * grad[key]

    # Record the loss on the current mini-batch
    loss = network.loss(x_batch, t_batch)
    train_loss_list.append(loss)
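A minimal sketch (not part of the original cell) of plotting the recorded mini-batch losses, assuming matplotlib is available as in the earlier cells; the curve should trend downward as training proceeds:

[ ]: import matplotlib.pyplot as plt

plt.plot(np.arange(len(train_loss_list)), train_loss_list)
plt.xlabel("iteration")
plt.ylabel("loss")
plt.show()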

##Using Test Data for Evaluation


[ ]: import numpy as np
from my_mnist import load_mnist

(x_train, t_train), (x_test, t_test) = load_mnist(normalize=True, one_hot_label=True)

train_loss_list = []
train_acc_list = []
test_acc_list = []

iters_num = 10000
train_size = x_train.shape[0]
batch_size = 100
learning_rate = 0.1
iter_per_epoch = max(train_size / batch_size, 1)  # iterations per epoch

network = TwoLayerNet(input_size=784, hidden_size=50, output_size=10)

for i in range(iters_num):
    batch_mask = np.random.choice(train_size, batch_size)
    x_batch = x_train[batch_mask]
    t_batch = t_train[batch_mask]

    grad = network.numerical_gradient(x_batch, t_batch)

    for key in ('W1', 'b1', 'W2', 'b2'):
        network.params[key] -= learning_rate * grad[key]

    loss = network.loss(x_batch, t_batch)
    train_loss_list.append(loss)

    # Once per epoch, evaluate accuracy on the full training and test sets
    if i % iter_per_epoch == 0:
        train_acc = network.accuracy(x_train, t_train)
        test_acc = network.accuracy(x_test, t_test)
        train_acc_list.append(train_acc)
        test_acc_list.append(test_acc)
        print("train acc, test acc | " + str(train_acc) + ", " + str(test_acc))
