
Designing a linear neural network with a single hidden layer for classifying handwritten digits

(such as those in the popular MNIST dataset) involves several steps. Here's an outline of the
network design and the process for minimizing classification error:

1. Network Architecture

For classifying handwritten digits (0–9), we need a neural network that takes the input image,
processes it through one hidden layer, and outputs class probabilities for each digit.

Input Layer: Each digit image is 28x28 pixels, so the input layer will have 784 neurons (one for
each pixel). These neurons receive the pixel values, typically normalized to the range [0, 1].
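As an illustration of this preprocessing step, the short NumPy sketch below flattens a 28x28 image into a 784-dimensional vector and scales the pixel values to [0, 1]; the random image is only a stand-in for a real MNIST sample.

import numpy as np

# Stand-in for one MNIST image: 28x28 grayscale pixels in the range 0-255.
image = np.random.randint(0, 256, size=(28, 28), dtype=np.uint8)

# Flatten to a 784-dimensional column vector and normalize to [0, 1].
x = image.reshape(784, 1).astype(np.float32) / 255.0

print(x.shape)          # (784, 1)
print(x.min(), x.max()) # values now lie in [0, 1]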

Hidden Layer: This layer introduces non-linearity to help the network learn more complex
patterns. Let's assume the hidden layer has 128 neurons. The size of the hidden layer can be
adjusted based on experimentation, but 128 is a reasonable starting point for MNIST.

Output Layer: The output layer has 10 neurons, one for each digit class (0–9). The activation
function for this layer is softmax, which converts the raw output values (logits) into class
probabilities.

2. Network Structure

A linear neural network with one hidden layer looks like this:

Input: 784-dimensional vector (28x28 flattened image).

Hidden layer: 128 neurons, with a non-linear activation function such as ReLU (Rectified Linear
Unit).

Output layer: 10 neurons, with softmax activation to provide probabilities for each class.

Mathematically:

Let:

X be the input vector (shape: [784, 1])

W_1 be the weight matrix between the input layer and hidden layer (shape: [128, 784])

b_1 be the bias vector for the hidden layer (shape: [128, 1])

W_2 be the weight matrix between the hidden layer and output layer (shape: [10, 128])

b_2 be the bias vector for the output layer (shape: [10, 1])

The network equations are:

1. Hidden layer pre-activation: Z_1 = W_1 X + b_1

2. Hidden layer activation (ReLU): A_1 = \max(0, Z_1)

3. Output layer pre-activation: Z_2 = W_2 A_1 + b_2

4. Output layer activation (softmax): \hat{Y} = \text{softmax}(Z_2), where \hat{Y}_j = \frac{e^{Z_{2,j}}}{\sum_{k=1}^{10} e^{Z_{2,k}}}
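A minimal NumPy sketch of this forward pass, using the shapes defined above; the random weights are only placeholders for trained parameters.

import numpy as np

def forward(X, W1, b1, W2, b2):
    """Forward pass for the 784-128-10 network described above."""
    Z1 = W1 @ X + b1                 # hidden layer pre-activation, shape [128, 1]
    A1 = np.maximum(0, Z1)           # ReLU activation
    Z2 = W2 @ A1 + b2                # output layer pre-activation (logits), shape [10, 1]
    # Softmax: subtract the max logit for numerical stability.
    exp = np.exp(Z2 - Z2.max(axis=0, keepdims=True))
    Y_hat = exp / exp.sum(axis=0, keepdims=True)
    return Z1, A1, Z2, Y_hat

# Example with random parameters and a random input vector.
rng = np.random.default_rng(0)
X  = rng.random((784, 1))
W1 = rng.standard_normal((128, 784)) * 0.01
b1 = np.zeros((128, 1))
W2 = rng.standard_normal((10, 128)) * 0.01
b2 = np.zeros((10, 1))

_, _, _, Y_hat = forward(X, W1, b1, W2, b2)
print(Y_hat.sum())   # probabilities sum to 1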

3. Loss Function

To minimize classification error, we use a loss function that measures the difference between
the predicted class probabilities (\hat{Y}) and the true class labels (Y).

Cross-Entropy Loss is commonly used for classification tasks. It is defined as:

\mathcal{L} = -\frac{1}{N} \sum_{i=1}^N \sum_{j=1}^{10} Y_{ij} \log(\hat{Y}_{ij})

where N is the number of training examples and Y_{ij} is 1 if example i belongs to class j and 0 otherwise (one-hot encoding).
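A NumPy sketch of this loss for a batch of predictions, assuming Y is one-hot encoded with shape [N, 10]:

import numpy as np

def cross_entropy_loss(Y_hat, Y, eps=1e-12):
    """Mean cross-entropy over a batch.
    Y_hat: predicted probabilities, shape [N, 10]
    Y:     one-hot true labels,     shape [N, 10]
    """
    N = Y.shape[0]
    # Clip predictions to avoid log(0).
    return -np.sum(Y * np.log(np.clip(Y_hat, eps, 1.0))) / N

# Example: 3 samples with uniform predicted probabilities.
Y = np.eye(10)[[3, 1, 7]]              # one-hot labels
Y_hat = np.full((3, 10), 0.1)          # uniform predictions
print(cross_entropy_loss(Y_hat, Y))    # ~log(10) = 2.30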

4. Optimization Process

To minimize the classification error, we use an optimization algorithm to adjust the network's
weights and biases during training. The most common optimization algorithm is Stochastic
Gradient Descent (SGD) or its variants, like Adam.

Forward Propagation:

Pass the input X through the network to compute the output \hat{Y}.


Compute the Loss:

Use the cross-entropy loss to calculate the difference between the predicted and true labels.

Backward Propagation:

Compute the gradient of the loss with respect to the network's parameters (weights and biases)
using the chain rule.

Weight Updates:

Update the weights and biases using the gradients and a learning rate \eta. For example, in SGD:

W_1 = W_1 - \eta \frac{\partial \mathcal{L}}{\partial W_1}

W_2 = W_2 - \eta \frac{\partial \mathcal{L}}{\partial W_2}
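For this particular network the gradients have a simple closed form: with one-hot labels Y and softmax output \hat{Y}, the gradient at the output pre-activation is \hat{Y} - Y, and the chain rule propagates it back through W_2, the ReLU, and W_1. A minimal NumPy sketch of one backward pass and SGD step for a single example (the forward quantities are recomputed inline with random parameters purely for illustration):

import numpy as np

def backward_and_update(X, Y, Z1, A1, Y_hat, W1, b1, W2, b2, lr=0.1):
    """One SGD step for the 784-128-10 network (single example, column vectors)."""
    dZ2 = Y_hat - Y                  # gradient of softmax + cross-entropy, [10, 1]
    dW2 = dZ2 @ A1.T                 # [10, 128]
    db2 = dZ2                        # [10, 1]
    dA1 = W2.T @ dZ2                 # back-propagate through W2, [128, 1]
    dZ1 = dA1 * (Z1 > 0)             # ReLU derivative
    dW1 = dZ1 @ X.T                  # [128, 784]
    db1 = dZ1                        # [128, 1]
    # SGD updates: W <- W - eta * dL/dW
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
    return W1, b1, W2, b2

# Example usage with random parameters and a dummy one-hot label.
rng = np.random.default_rng(0)
X  = rng.random((784, 1))
Y  = np.zeros((10, 1)); Y[3] = 1.0
W1 = rng.standard_normal((128, 784)) * np.sqrt(2.0 / 784); b1 = np.zeros((128, 1))
W2 = rng.standard_normal((10, 128)) * np.sqrt(2.0 / 128);  b2 = np.zeros((10, 1))
Z1 = W1 @ X + b1
A1 = np.maximum(0, Z1)
Z2 = W2 @ A1 + b2
Y_hat = np.exp(Z2 - Z2.max()) / np.exp(Z2 - Z2.max()).sum()
W1, b1, W2, b2 = backward_and_update(X, Y, Z1, A1, Y_hat, W1, b1, W2, b2)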

5. Training Process

Initialize Weights:

Initialize the weights (e.g., using He initialization for ReLU activations).
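A minimal NumPy sketch of He initialization for the two weight matrices (standard-normal draws scaled by sqrt(2 / fan_in), biases set to zero):

import numpy as np

def he_init(fan_out, fan_in, rng):
    """He (Kaiming) initialization, suited to ReLU activations."""
    return rng.standard_normal((fan_out, fan_in)) * np.sqrt(2.0 / fan_in)

rng = np.random.default_rng(42)
W1 = he_init(128, 784, rng)   # input -> hidden
b1 = np.zeros((128, 1))
W2 = he_init(10, 128, rng)    # hidden -> output
b2 = np.zeros((10, 1))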

Training Loop:

For each epoch (a complete pass through the training dataset):

Shuffle the data so that mini-batches are drawn in a different order each epoch, which helps stochastic gradient descent converge.

For each mini-batch of data:

1. Perform forward propagation.

2. Compute the loss.


3. Perform backpropagation to compute gradients.

4. Update weights using an optimizer like Adam.
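The compact sketch below ties these four steps together on synthetic stand-in data (random arrays in place of real MNIST images). It switches to a batch-first convention (inputs of shape [N, 784], weights of shape [784, 128]), which is more convenient for mini-batches, and uses plain SGD in place of Adam to keep the code short.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for MNIST: 1,000 random "images" and labels.
X_train = rng.random((1000, 784)).astype(np.float32)
y_train = rng.integers(0, 10, size=1000)
Y_train = np.eye(10)[y_train]                      # one-hot labels, shape [1000, 10]

# He-initialized parameters (batch-first convention).
W1 = rng.standard_normal((784, 128)) * np.sqrt(2.0 / 784); b1 = np.zeros(128)
W2 = rng.standard_normal((128, 10)) * np.sqrt(2.0 / 128);  b2 = np.zeros(10)

lr, batch_size, epochs = 0.1, 64, 3                # plain SGD stands in for Adam here

for epoch in range(epochs):
    order = rng.permutation(len(X_train))          # shuffle each epoch
    for start in range(0, len(X_train), batch_size):
        idx = order[start:start + batch_size]
        X, Y = X_train[idx], Y_train[idx]
        N = len(idx)

        # 1. Forward propagation.
        Z1 = X @ W1 + b1
        A1 = np.maximum(0, Z1)
        Z2 = A1 @ W2 + b2
        exp = np.exp(Z2 - Z2.max(axis=1, keepdims=True))
        Y_hat = exp / exp.sum(axis=1, keepdims=True)

        # 2. Cross-entropy loss.
        loss = -np.sum(Y * np.log(Y_hat + 1e-12)) / N

        # 3. Backpropagation.
        dZ2 = (Y_hat - Y) / N
        dW2 = A1.T @ dZ2
        db2 = dZ2.sum(axis=0)
        dZ1 = (dZ2 @ W2.T) * (Z1 > 0)
        dW1 = X.T @ dZ1
        db1 = dZ1.sum(axis=0)

        # 4. Parameter update (SGD step).
        W1 -= lr * dW1; b1 -= lr * db1
        W2 -= lr * dW2; b2 -= lr * db2

    print(f"epoch {epoch + 1}: last mini-batch loss = {loss:.4f}")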

Evaluation:

After training, evaluate the model on a validation set to check how well it generalizes to unseen
data.

6. Hyperparameter Tuning

To improve the model's performance, you might need to tune the following hyperparameters:

Learning rate

Number of neurons in the hidden layer

Number of epochs

Batch size
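One common way to explore these settings is a simple grid search, keeping the combination with the best validation accuracy. The sketch below only illustrates the bookkeeping; train_and_evaluate is a hypothetical placeholder that would be replaced by a real training run returning validation accuracy.

import itertools
import random

# Hypothetical stand-in for a real training run; replace with actual training code.
def train_and_evaluate(learning_rate, hidden_size, epochs, batch_size):
    return random.random()   # pretend validation accuracy

grid = {
    "learning_rate": [0.01, 0.05, 0.1],
    "hidden_size":   [64, 128, 256],
    "epochs":        [5, 10],
    "batch_size":    [32, 64, 128],
}

best_acc, best_config = -1.0, None
for values in itertools.product(*grid.values()):
    config = dict(zip(grid.keys(), values))
    acc = train_and_evaluate(**config)
    if acc > best_acc:
        best_acc, best_config = acc, config

print("best config:", best_config, "validation accuracy:", round(best_acc, 3))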

7. Avoiding Overfitting

To prevent overfitting, which can increase classification error on unseen data, use techniques
like:

Dropout: Randomly drop neurons during training to prevent over-reliance on specific paths in
the network.

Early Stopping: Stop training when performance on a validation set stops improving.

Regularization: Apply L2 regularization (weight decay) to penalize large weights.
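A minimal NumPy sketch of two of these techniques as they would appear inside the training code: inverted dropout applied to the hidden activations (training only), and an L2 penalty added to the loss and its gradient. The keep probability and regularization strength are illustrative values, not tuned settings.

import numpy as np

rng = np.random.default_rng(0)

keep_prob = 0.8          # dropout keep probability for the hidden layer
lam = 1e-4               # L2 regularization strength

# --- Inverted dropout on the hidden activations (applied only during training) ---
A1 = np.maximum(0, rng.standard_normal((64, 128)))       # example hidden activations
mask = (rng.random(A1.shape) < keep_prob) / keep_prob    # drop mask, rescaled by 1/keep_prob
A1_dropped = A1 * mask                                   # dropped-and-rescaled activations

# --- L2 regularization added to the loss and gradients ---
W1 = rng.standard_normal((784, 128)) * 0.01
W2 = rng.standard_normal((128, 10)) * 0.01
data_loss = 2.3                                          # placeholder cross-entropy value
loss = data_loss + lam * (np.sum(W1 ** 2) + np.sum(W2 ** 2))

# The corresponding gradient contribution is 2 * lam * W for each weight matrix.
dW1_reg = 2 * lam * W1
dW2_reg = 2 * lam * W2
print(round(loss, 4))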

Conclusion
In summary, this linear neural network uses one hidden layer and softmax output to classify
handwritten digits. To minimize classification error, we use the cross-entropy loss function and
optimize weights via gradient descent or Adam, while incorporating techniques to prevent
overfitting. This approach can achieve high accuracy for digit classification tasks like MNIST.
