
Report on the Training Process of the Neural Network Model

1. Introduction

This report explains the training process of a neural network implemented in the provided code. The model consists of a single hidden layer and uses the following key components:

- Sigmoid activation function in the hidden layer.
- Softmax function in the output layer for multi-class classification.
- Cross-entropy loss as the objective function.

We will also delve into how the derivatives of the loss function are computed with respect to weights and biases, which form the basis of the backpropagation
process.

2. Model Architecture

- Input Layer: Accepts data with ( M ) features.
- Hidden Layer: Consists of ( D ) hidden units.
- Output Layer: Produces ( K ) outputs corresponding to ( K ) classes using the softmax function.
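
As a concrete illustration of these shapes, the parameters might be laid out as in the following minimal numpy sketch. The values of M, D, K and the 0.01 scale are assumed examples, not taken from the provided code; the extra column in each matrix holds the bias weights.

```python
import numpy as np

M, D, K = 4, 16, 3  # example feature, hidden-unit, and class counts (assumed values)

# alpha: hidden-layer weights, one row per hidden unit, one column per input feature plus bias
alpha = np.random.randn(D, M + 1) * 0.01
# beta: output-layer weights, one row per class, one column per hidden unit plus bias
beta = np.random.randn(K, D + 1) * 0.01
```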

3. Forward Pass

The forward pass computes the predictions for each data sample using the following steps (a code sketch follows the list):

1. Input Augmentation: Adds a bias term to the input features.
2. Linear Transformation: ( a = X_{\text{bias}} \cdot \alpha^T ), where ( \alpha ) is the weight matrix for the hidden layer.
3. Activation: The sigmoid function ( z = \sigma(a) ) is applied to compute the activations of the hidden layer.
4. Augmented Hidden Outputs: Bias is added to ( z ) for input into the output layer.
5. Output Transformation: ( b = z_{\text{bias}} \cdot \beta^T ), where ( \beta ) is the weight matrix for the output layer.
6. Softmax Activation: The softmax function ( \hat{y} = \text{softmax}(b) ) computes the class probabilities.
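
A minimal forward pass for a single sample, consistent with the six steps above, might look like the following sketch (numpy is assumed; `x` is a length-( M ) feature vector and `alpha`, `beta` are the weight matrices from Section 2):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def softmax(b):
    # subtract the max for numerical stability before exponentiating
    e = np.exp(b - np.max(b))
    return e / e.sum()

def forward(x, alpha, beta):
    x_bias = np.concatenate(([1.0], x))   # 1. input augmentation
    a = alpha @ x_bias                    # 2. linear transformation
    z = sigmoid(a)                        # 3. hidden activations
    z_bias = np.concatenate(([1.0], z))   # 4. augmented hidden outputs
    b = beta @ z_bias                     # 5. output transformation
    y_hat = softmax(b)                    # 6. class probabilities
    return x_bias, z_bias, y_hat
```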

4. Loss Function

The cross-entropy loss for a dataset of size ( N ) is: [ \text{Loss} = -\frac{1}{N} \sum_{i=1}^N \log(\hat{y}_{i, y_i}) ] where ( \hat{y}_{i, y_i} ) is the predicted probability of the true class ( y_i ).
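
For a whole dataset this loss can be computed in a few lines (a sketch, assuming `y_hat` is an ( N \times K ) array of softmax outputs and `y` is the integer vector of true class labels):

```python
import numpy as np

def cross_entropy(y_hat, y):
    # probability the model assigned to each sample's true class
    n = y_hat.shape[0]
    true_class_probs = y_hat[np.arange(n), y]
    return -np.mean(np.log(true_class_probs))
```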

5. Backward Pass (Gradient Computation)

The backward pass calculates gradients of the loss with respect to the weights and biases so they can be updated during training. The following steps are performed (see the sketch after this list):

1. Output Layer Gradients:
   - Error at Output Layer: [ \delta_{\text{output}} = \hat{y} - y_{\text{one-hot}} ] where ( y_{\text{one-hot}} ) is the one-hot encoding of the true labels.
   - Gradient of Output Weights: [ \nabla_{\beta} = \frac{1}{N} \delta_{\text{output}}^T \cdot z_{\text{bias}} ]

2. Hidden Layer Gradients:
   - Error at Hidden Layer: [ \delta_{\text{hidden}} = (\delta_{\text{output}} \cdot \beta_{\text{no-bias}}) \odot \sigma'(a) ] where ( \beta_{\text{no-bias}} ) excludes the bias weights and ( \sigma'(a) = z \odot (1 - z) ) is the derivative of the sigmoid expressed in terms of the hidden activations ( z ).
   - Gradient of Hidden Weights: [ \nabla_{\alpha} = \frac{1}{N} \delta_{\text{hidden}}^T \cdot X_{\text{bias}} ]
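
Combining these formulas for a single sample (so the ( \frac{1}{N} ) averaging is applied over the batch elsewhere), the gradient computation might look like this sketch, reusing the variables returned by the forward pass above:

```python
import numpy as np

def backward(x_bias, z_bias, y_hat, y_one_hot, beta):
    # error at the output layer: predicted probabilities minus one-hot targets
    delta_output = y_hat - y_one_hot                               # shape (K,)
    # gradient of the output weights: outer product with the augmented hidden outputs
    grad_beta = np.outer(delta_output, z_bias)                     # shape (K, D+1)
    # back-propagate through beta, dropping its bias column, then apply the sigmoid derivative
    z = z_bias[1:]
    delta_hidden = (beta[:, 1:].T @ delta_output) * z * (1.0 - z)  # shape (D,)
    # gradient of the hidden weights
    grad_alpha = np.outer(delta_hidden, x_bias)                    # shape (D, M+1)
    return grad_alpha, grad_beta
```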

6. Parameter Updates

Using stochastic gradient descent (SGD), the weights are updated as: [ \alpha \leftarrow \alpha - \eta \nabla_{\alpha}, \quad \beta \leftarrow \beta - \eta
\nabla_{\beta} ] where ( \eta ) is the learning rate.
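
In code, the update is a single step per parameter matrix (sketch; `lr` plays the role of ( \eta )):

```python
def sgd_step(alpha, beta, grad_alpha, grad_beta, lr=0.1):
    # move each parameter matrix against its gradient
    alpha = alpha - lr * grad_alpha
    beta = beta - lr * grad_beta
    return alpha, beta
```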

7. Training Process

The train_and_test function performs the following steps (a sketch of the loop follows the list):

1. Weight Initialization: Random or zero initialization.
2. Epoch Loop: Repeats for the specified number of epochs:
   - Performs a forward pass for each sample.
   - Computes the gradients via the backward pass.
   - Updates the weights and biases.
   - Computes and stores the loss on the training and validation datasets.
3. Evaluation: After training, the model computes the error rate and makes predictions on both training and validation datasets.
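
Putting the pieces together, the loop described above could be sketched as follows. The name train_and_test comes from the report, but the exact signature is an assumption, and forward, backward, sgd_step, and cross_entropy refer to the illustrative sketches from the earlier sections:

```python
import numpy as np

def predict(X, alpha, beta):
    # most probable class for each sample
    return np.array([np.argmax(forward(x, alpha, beta)[2]) for x in X])

def dataset_loss(X, y, alpha, beta):
    probs = np.array([forward(x, alpha, beta)[2] for x in X])
    return cross_entropy(probs, y)

def train_and_test(X_train, y_train, X_val, y_val, D, K, epochs=50, lr=0.1, seed=0):
    rng = np.random.default_rng(seed)
    M = X_train.shape[1]
    alpha = rng.normal(scale=0.01, size=(D, M + 1))   # 1. weight initialization
    beta = rng.normal(scale=0.01, size=(K, D + 1))

    train_losses, val_losses = [], []
    for _ in range(epochs):                           # 2. epoch loop
        for x, y in zip(X_train, y_train):
            x_bias, z_bias, y_hat = forward(x, alpha, beta)
            y_one_hot = np.eye(K)[y]
            g_alpha, g_beta = backward(x_bias, z_bias, y_hat, y_one_hot, beta)
            alpha, beta = sgd_step(alpha, beta, g_alpha, g_beta, lr)
        train_losses.append(dataset_loss(X_train, y_train, alpha, beta))
        val_losses.append(dataset_loss(X_val, y_val, alpha, beta))

    # 3. evaluation: error rates and predicted labels
    train_pred, val_pred = predict(X_train, alpha, beta), predict(X_val, alpha, beta)
    train_err = np.mean(train_pred != y_train)
    val_err = np.mean(val_pred != y_val)
    return train_losses, val_losses, train_err, val_err, train_pred, val_pred
```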
8. Results

The function outputs:

- Training and Validation Loss per epoch.
- Final Training and Validation Errors.
- Predicted Labels for the training and validation datasets.

9. Conclusion

The training process effectively leverages backpropagation to optimize the weights and biases by minimizing the cross-entropy loss. Because the gradients of the loss are computed with respect to every weight and bias, each parameter is adjusted in the direction that reduces the classification error over time.
