
Report – Handwritten Digit Recognition using a Feedforward Neural Network

Introduction
Handwritten digit recognition is a classic problem in computer vision and machine learning, and it is often used as an introductory exercise in neural network modeling. The MNIST dataset contains 70,000 grey-scale images of handwritten digits from 0 to 9 and has long served as a standard benchmark for evaluating and comparing classification algorithms. Neural networks, especially deep learning architectures, have become the most common way to achieve high accuracy on this task.

This project builds a feedforward neural network (FNN) in PyTorch to classify the digits in the MNIST dataset. Training and evaluating the model makes it possible to understand the network's performance and to fine-tune its parameters for the highest accuracy achievable. The key objectives are to design a robust model architecture, to optimize the training process, and to analyze the results in terms of the network's ability to recognize handwritten digits.

Dataset
The MNIST dataset contains 28x28-pixel grey-scale images of handwritten digits, each annotated with the digit (0–9) it represents. It provides 60,000 images for training and 10,000 images for testing.

In this project, the original training set is divided into two subsets: a training subset of about 50,000 images and a validation subset of about 10,000 images. This partition allows the model to be evaluated during training and improved before it is tested on unseen data. Pixel values are normalized to the range [-1, 1] to speed up convergence during training.
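As a concrete illustration, the data pipeline described above could be set up roughly as follows. This is a minimal sketch using torchvision; the variable names and the exact normalization constants are illustrative choices, not taken from the original code.

```python
import torch
from torch.utils.data import DataLoader, random_split
from torchvision import datasets, transforms

# Scale pixels to [0, 1], then normalize to roughly [-1, 1]
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.5,), (0.5,)),
])

full_train = datasets.MNIST(root="data", train=True, download=True, transform=transform)
test_set   = datasets.MNIST(root="data", train=False, download=True, transform=transform)

# Split the 60,000 training images into ~50,000 for training and ~10,000 for validation
train_set, val_set = random_split(full_train, [50_000, 10_000])

train_loader = DataLoader(train_set, batch_size=64, shuffle=True)
val_loader   = DataLoader(val_set, batch_size=64)
test_loader  = DataLoader(test_set, batch_size=64)
```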
Model Architecture
A feedforward neural network is a simple yet powerful architecture for classification tasks, in which data moves in one direction from input to output. In this project, an FNN is designed for MNIST image classification. The model architecture is as follows:

 Input Layer: This layer receives the flattened 28x28-pixel images, giving 784 input units. Each pixel is treated as a feature and passed to the network for classification.
 Hidden Layers: There are two hidden layers:
o First hidden layer: 128 units with the Rectified Linear Unit (ReLU) activation function. ReLU introduces non-linearity into the network, which allows the model to learn more complex patterns in the data.
o Second hidden layer: 64 units, also with ReLU activation, which further refine the features learned by the first layer.
 Output Layer: The output layer consists of 10 units, one for each possible digit (0–9). Each unit outputs a score indicating how likely the input image is to belong to that class.

ReLU is used in the hidden layers to reduce the risk of vanishing gradients, making learning more efficient. Softmax is not applied explicitly in the output layer because it is handled inside the loss function.
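A minimal PyTorch sketch of this architecture could look like the following; the class and variable names are illustrative, and the layer sizes are taken from the description above.

```python
import torch.nn as nn

class FNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),         # 28x28 image -> 784 input features
            nn.Linear(784, 128),  # first hidden layer
            nn.ReLU(),
            nn.Linear(128, 64),   # second hidden layer
            nn.ReLU(),
            nn.Linear(64, 10),    # 10 output scores (logits); softmax is applied inside the loss
        )

    def forward(self, x):
        return self.net(x)

model = FNN()
```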

Figure 1 Visual Representation of Model Architecture


Loss Function and Optimizer
In multi-class classification problems such as digit recognition, the Cross-Entropy Loss function is a natural choice. It measures the difference between the predicted class probabilities and the actual labels, and it penalizes confident incorrect predictions harshly. In this project, PyTorch's `CrossEntropyLoss()` is used, which combines the softmax function with the loss calculation.

The Adam optimizer is used to update the model's parameters. Adam is chosen over alternatives such as stochastic gradient descent (SGD) because of its adaptive nature: it adjusts the learning rate using running estimates of the moments of the gradients computed so far, which typically makes training converge faster and be more robust to noise. A learning rate of 0.001 was used to keep training smooth and efficient.
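In code, the loss function and optimizer described above could be defined along these lines, assuming the `model` object from the architecture sketch:

```python
import torch.nn as nn
import torch.optim as optim

criterion = nn.CrossEntropyLoss()                     # applies softmax internally
optimizer = optim.Adam(model.parameters(), lr=0.001)  # adaptive learning rate
```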

Training Process
The model was trained by iterating repeatedly over the whole training set for a number of epochs; during one epoch, every image in the training set is shown to the model once. Images are fed to the model in batches of 64 to keep memory usage manageable, and the model's parameters are updated after every batch.

 Epochs: The model is trained for 20 epochs. Each epoch consists of a forward pass, in which the model computes a predicted class score for every batch, followed by a backpropagation step, in which the model's errors are used to update the weights of the network.
 Hyperparameters:
o Batch size: 64
o Learning rate: 0.001
o Epochs: 20

During training, both the training and validation losses were monitored closely to track the model's learning progress. The training loss decreased from epoch to epoch, while the validation loss was used to check whether the FNN was generalizing well to unseen data from the MNIST dataset.
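A simplified version of this training loop, assuming the `model`, `criterion`, `optimizer`, and data loaders from the earlier sketches, might look as follows:

```python
import torch

for epoch in range(20):
    # Training pass: forward, loss, backpropagation, parameter update
    model.train()
    train_loss = 0.0
    for images, labels in train_loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
        train_loss += loss.item()

    # Validation pass: no gradient updates, only loss tracking
    model.eval()
    val_loss = 0.0
    with torch.no_grad():
        for images, labels in val_loader:
            val_loss += criterion(model(images), labels).item()

    print(f"Epoch {epoch + 1}: "
          f"train loss {train_loss / len(train_loader):.4f}, "
          f"val loss {val_loss / len(val_loader):.4f}")
```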
Figure 2 Plot of training and validation loss over epochs

The training and validation loss over the epochs are shown in the graph above. This visualization helps in observing the learning dynamics of the model and indicates whether it is overfitting or generalizing well.

Evaluation
To evaluate the model's performance, standard metrics from the torchmetrics library for PyTorch are used, including accuracy, precision, recall, and F1-score. These metrics show how well the model classifies digits: how many predictions are correct overall (accuracy) and how reliably it identifies each class without making false predictions (precision and recall).
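As an illustration, these metrics could be computed with a recent version of torchmetrics roughly as follows. This is a sketch assuming the `model` and `test_loader` from the earlier code; macro averaging is an illustrative choice, not necessarily the setting used in the original project.

```python
import torch
from torchmetrics import Accuracy, Precision, Recall, F1Score

accuracy  = Accuracy(task="multiclass", num_classes=10)
precision = Precision(task="multiclass", num_classes=10, average="macro")
recall    = Recall(task="multiclass", num_classes=10, average="macro")
f1        = F1Score(task="multiclass", num_classes=10, average="macro")

model.eval()
with torch.no_grad():
    for images, labels in test_loader:
        preds = model(images).argmax(dim=1)
        for metric in (accuracy, precision, recall, f1):
            metric.update(preds, labels)

print(f"accuracy={accuracy.compute().item():.4f}, "
      f"precision={precision.compute().item():.4f}, "
      f"recall={recall.compute().item():.4f}, "
      f"f1={f1.compute().item():.4f}")
```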

The model was tested on the 10,000 images in the test set, achieving a high accuracy, which is typical for
FNNs applied to MNIST.

 Accuracy: The overall accuracy on the test set was approximately 97%, indicating that the model
performed well on unseen data.
To further evaluate the model's performance, a confusion matrix was generated (Figure 3). This matrix
provides insight into the model’s predictions for each digit class, illustrating how many instances of each
digit were correctly or incorrectly classified.
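A confusion matrix of this kind can also be produced with torchmetrics, for example as sketched below (again assuming the `model` and `test_loader` from the earlier snippets):

```python
import torch
from torchmetrics import ConfusionMatrix

confmat = ConfusionMatrix(task="multiclass", num_classes=10)

model.eval()
with torch.no_grad():
    for images, labels in test_loader:
        confmat.update(model(images).argmax(dim=1), labels)

print(confmat.compute())  # 10x10 matrix of true vs. predicted digit counts
```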

Figure 3 Confusion Matrix

Results and Discussion


Training showed steady improvement, with both training and validation losses decreasing over the epochs. The model's performance leveled off toward the end of training, which suggests that a near-optimal configuration of the architecture and hyperparameters had been reached.

Although the model achieved high accuracy, the design has some limitations:

 The architecture is relatively simple, with only two hidden layers, whereas deeper architectures such as convolutional neural networks (CNNs) have achieved higher performance on the MNIST dataset.
 Hyperparameter tuning was limited to the learning rate and batch size. More comprehensive tuning, such as adjusting the number of hidden units, the number of epochs, and the regularization methods, could further improve the results.

In future work, dropout layers could therefore be added to prevent overfitting, and learning rate schedules could be used to further fine-tune the learning process, as sketched below.
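As a rough sketch of these two ideas, a variant of the model with dropout between the hidden layers and a simple step learning rate schedule might look like this; the dropout probability and the schedule parameters are illustrative values, not tuned settings from the project.

```python
import torch.nn as nn
import torch.optim as optim

# Variant of the FNN with dropout after each hidden layer
model_with_dropout = nn.Sequential(
    nn.Flatten(),
    nn.Linear(784, 128), nn.ReLU(), nn.Dropout(p=0.2),
    nn.Linear(128, 64),  nn.ReLU(), nn.Dropout(p=0.2),
    nn.Linear(64, 10),
)

optimizer = optim.Adam(model_with_dropout.parameters(), lr=0.001)
# Halve the learning rate every 5 epochs; scheduler.step() is called once per epoch
scheduler = optim.lr_scheduler.StepLR(optimizer, step_size=5, gamma=0.5)
```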

Figure 4 Training and Validation Losses for every epoch

In the left graph, the blue dots represent the training loss for each epoch, which decreases over time, indicating that the model is learning. The blue line represents the validation loss for each epoch, an indicator of how well the model generalizes to unseen data. In the right graph, the blue dots show the training accuracy for each epoch, and the blue line shows the validation accuracy for each epoch.

Figure 5 Training and Validation Losses over Epochs


The above graph shows the training and validation losses of the model over multiple epochs.

Conclusion
This project successfully implemented an FNN to recognize the handwritten digits in the MNIST dataset. The network achieved high accuracy on both the training and test sets, demonstrating the effectiveness of FNNs for this classification task. Despite its simplicity, the model performed well, which highlights the potential of neural networks in image recognition tasks.

The ability of the model to accurately classify digits has broad applications, such as automated data entry
systems and recognition of digitized documents. Future work could involve exploring more complex
architectures and optimization techniques to further enhance performance.
References

1. LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). "Gradient-based learning applied to document
recognition." Proceedings of the IEEE, 86(11), 2278-2324.

2. Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.

3. Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., ... & Chintala, S. (2019). "PyTorch: An Imperative Style, High-Performance Deep Learning Library." Advances in Neural Information Processing Systems 32 (NeurIPS).

4. Kingma, D. P., & Ba, J. (2015). "Adam: A Method for Stochastic Optimization." 3rd International
Conference on Learning Representations (ICLR).

5. Hinton, G. E., Srivastava, N., & Krizhevsky, A. (2012). "Improving neural networks by preventing co-
adaptation of feature detectors." arXiv preprint arXiv:1207.0580.

6. Deng, L. (2012). "The MNIST database of handwritten digit images for machine learning research."
IEEE Signal Processing Magazine, 29(6), 141–142.

7. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., ... & Duchesnay, É. (2011). "Scikit-learn: Machine Learning in Python." Journal of Machine Learning Research, 12, 2825–2830.

8. TorchMetrics Documentation (2023). TorchMetrics: A Metrics Collection for PyTorch.
