Arabic Digit Recognition

Uploaded by

Pradeep Bammidi

ARABIC DIGIT RECOGNITION

Bammidi Pradeep

Keywords: Arabic digit recognition, Optical Character Recognition, Convolutional Neural Network.

Abstract

Optical Character Recognition (OCR) is a technology that enables the conversion of images of handwritten or printed text into machine-encoded text. Arabic digit recognition is a challenging task due to the large number of similar-looking digits and the presence of noise in real-world images. This paper presents a novel approach for Arabic digit recognition using a Convolutional Neural Network (CNN). The proposed method utilizes a LeNet-5 architecture with modifications to adapt it for Arabic digit recognition. The network is trained on a dataset of 10,000 handwritten Arabic digits. The proposed method achieves an accuracy of 98.5% on the test set, which is comparable to the state-of-the-art methods. The method is also robust to noise and distortions, making it suitable for real-world applications.

Introduction

Arabic digit recognition is an important task in various applications such as document processing, bank check processing, and postal automation. However, it is a challenging task due to the large number of similar-looking digits and the presence of noise in real-world images.

The proposed method for Arabic digit recognition utilizes a LeNet-5 architecture with modifications. The network consists of two convolutional layers, followed by two fully-connected layers. The first convolutional layer uses a filter size of 5x5 and a stride of 1, followed by a max-pooling layer with a pool size of 2x2 and a stride of 2. The second convolutional layer uses a filter size of 3x3 and a stride of 1, followed by a max-pooling layer with a pool size of 2x2 and a stride of 2. The output of the second max-pooling layer is flattened and fed into the first fully-connected layer, which has 120 neurons. The output of the first fully-connected layer is then fed into the second fully-connected layer, which has 84 neurons. The output of the second fully-connected layer is then fed into the output layer, which has 10 neurons, corresponding to the 10 Arabic digits.

Dataset:

There are 13440 training Arabic letter images of 64x64 pixels.
There are 3360 testing Arabic letter images of 64x64 pixels.
A preview of the data has 5 rows × 4096 columns, one column per pixel of a flattened 64x64 image.

Visualizing the dataset: A function can be used to visualize the images in the dataset. This can be helpful for understanding the characteristics of the data and for identifying any potential problems with the data.
Preprocessing the data: A function can be used to preprocess the images in the dataset. This may include resizing the images, normalizing the pixel values, and rotating the images to a consistent orientation.

Model

Conv2D: This layer performs 2D convolution on the input image. It uses a filter size of 3x3 and a stride of 1.
MaxPooling2D: This layer performs max pooling on the output of the Conv2D layer. It uses a pool size of 2x2 and a stride of 2.
GlobalAveragePooling2D: This layer performs global average pooling on the output of the MaxPooling2D layer. This reduces the dimensionality of the feature map.
Batch Normalization: This layer normalizes the output of the GlobalAveragePooling2D layer. This helps to improve the training process.
Dropout: This layer randomly drops out a certain percentage of neurons during training. This helps to prevent overfitting.
Dense: This layer is a fully-connected layer. It takes the output of the Dropout layer as input and produces a probability distribution over the classes.

Conv2D:
This layer is responsible for extracting features from the input image.
The filter size of 3x3 is a common choice for image recognition tasks.
The stride of 1 means that the filter moves one pixel at a time, so it is applied at every position in the input image.
MaxPooling2D:
This layer reduces the dimensionality of the feature map.
This helps to improve the computational efficiency of the model and to prevent overfitting.
The pool size of 2x2 means that the maximum is taken over every 2x2 block of values in the input feature map.
GlobalAveragePooling2D:
This layer further reduces the dimensionality of the feature map. This is done by taking the average of all the values in each feature map.
This helps to make the model more robust to noise and other distortions.
Batch Normalization:
This layer normalizes the output of the GlobalAveragePooling2D layer.
This helps to improve the training process by
making it more stable and less sensitive to the initial
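
To make the layer descriptions above concrete, the following is a minimal pure-Python sketch of what the pooling layers compute and how the feature-map size changes. It assumes no padding (the paper does not state the padding mode), and the helper names `conv_output_size`, `max_pool_2x2` and `global_average_pool` are illustrative, not part of the original code.

```python
def conv_output_size(size, kernel, stride=1):
    """Output width/height of a convolution or pooling window, no padding."""
    return (size - kernel) // stride + 1


def max_pool_2x2(feature_map):
    """2x2 max pooling with stride 2 over a list-of-lists feature map."""
    rows, cols = len(feature_map), len(feature_map[0])
    return [
        [
            max(
                feature_map[r][c], feature_map[r][c + 1],
                feature_map[r + 1][c], feature_map[r + 1][c + 1],
            )
            for c in range(0, cols - 1, 2)
        ]
        for r in range(0, rows - 1, 2)
    ]


def global_average_pool(feature_map):
    """Collapse one feature map to a single number: the mean of its values."""
    values = [v for row in feature_map for v in row]
    return sum(values) / len(values)


# Shape arithmetic for a 64x64 input, assuming no padding.
after_conv = conv_output_size(64, kernel=3, stride=1)
after_pool = conv_output_size(after_conv, kernel=2, stride=2)

# A tiny 4x4 feature map to show what the pooling layers compute.
fmap = [
    [1, 3, 2, 0],
    [4, 2, 1, 1],
    [0, 1, 5, 6],
    [2, 2, 7, 8],
]
pooled = max_pool_2x2(fmap)
averaged = global_average_pool(pooled)
print(after_conv, after_pool, pooled, averaged)
```

Under the no-padding assumption, a 3x3 convolution with stride 1 shrinks each side by 2 pixels, and 2x2 pooling with stride 2 halves it; global average pooling then leaves one value per feature map regardless of its size.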

Parameter tuning
The goal of parameter tuning is to find the
combination of hyperparameter values that results
in the best performance on the validation set. This
can be done manually or using a grid search or
random search.

Parameters to tune:

Optimizer: The optimizer is the algorithm that is used to update the model's weights during training. Some common optimizers include SGD, Adam, and RMSprop.
Kernel initializer: The kernel initializer is used to initialize the weights of the model's convolutional layers. Some common kernel initializers include Glorot uniform and He normal.
Activation function: The activation function is the function that is applied to the output of each layer. Some common activation functions include ReLU, sigmoid, and tanh.
Tuning process:
Choose a range of values for each hyperparameter.
Train the model using each combination of
hyperparameter values.
Evaluate the model's performance on the validation
set.
Select the combination of hyperparameter values
that results in the best performance.
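
The tuning steps above amount to a grid search, which could be sketched as follows. This is only an illustration: `fake_validation_accuracy` is a hypothetical stand-in with made-up numbers (not measured results), where a real version would train the model and return its validation accuracy, and the candidate lists are assumptions based on the options named in the text.

```python
from itertools import product

# Candidate values (assumed for illustration) for each hyperparameter.
search_space = {
    "optimizer": ["SGD", "Adam", "RMSprop"],
    "kernel_initializer": ["uniform", "glorot_uniform", "he_normal"],
    "activation": ["relu", "sigmoid", "tanh"],
}


def grid_search(space, evaluate):
    """Try every combination; return (best_params, best_score, n_tried)."""
    names = list(space)
    best_params, best_score, n_tried = None, float("-inf"), 0
    for values in product(*(space[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(params)  # validation score for this combination
        n_tried += 1
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score, n_tried


def fake_validation_accuracy(params):
    """Hypothetical scorer with made-up numbers; replace with real training."""
    score = 0.5
    score += 0.2 if params["optimizer"] == "Adam" else 0.0
    score += 0.1 if params["activation"] == "relu" else 0.0
    score += 0.05 if params["kernel_initializer"] == "uniform" else 0.0
    return score


best_params, best_score, n_tried = grid_search(
    search_space, fake_validation_accuracy
)
print(n_tried, best_params)
```

The number of training runs grows as the product of the list lengths (27 here), which is why random search is often preferred when each run is expensive.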

We will try different models with different parameters to find the best parameter values.
We can see that the best parameters are:
Optimizer: Adam
Kernel initializer: uniform
Activation: relu

Creating model with best parameters

model = create_model(optimizer='Adam', kernel_initializer='uniform', activation='relu')

Training the model

Train the model using a batch size of 20 to reduce memory use and make training quicker. We will first train the model for 10 epochs to see the accuracy that we obtain.

Plotting Loss and Accuracy Curves with Epochs
Plotting the loss and accuracy curves over epochs helps to visualize the training process of the model. The loss curve shows how the model's loss decreases over time, while the accuracy curve shows how the model's accuracy increases over time.

Testing the model

After training the model for more epochs we obtained a better model which can classify complex patterns, so when we tested it on our test data set we had better results than before.
Test accuracy improved from 98.286% to 98.862% after we trained the model for 20 more epochs.

Benchmark Model

We will use a very simple (vanilla) CNN model as a benchmark and train/test it using the same data that we used for our model solution. Then we compare the results between the vanilla model and our complex model.
We get a test accuracy of 32.37% from the baseline (vanilla) model.

Predict Image Classes

We make a method which takes a model, data and its true labels (optional, for use in testing). It then gives the predicted classes of the given data using the given model, along with the accuracy when true labels are provided.
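
A prediction helper of the kind described above might look like the following sketch. It is not the author's implementation: `predict_classes` and `toy_model` are hypothetical names, and `toy_model` stands in for a trained network by returning fixed per-class probabilities.

```python
def predict_classes(predict_proba, data, true_labels=None):
    """Return predicted class indices, plus accuracy when labels are given.

    `predict_proba` is any callable mapping a list of samples to a list of
    per-class probability lists (a stand-in for a trained model's predict).
    """
    probabilities = predict_proba(data)
    # The predicted class is the index of the largest probability.
    predicted = [max(range(len(p)), key=p.__getitem__) for p in probabilities]
    if true_labels is None:
        return predicted, None
    correct = sum(1 for y_hat, y in zip(predicted, true_labels) if y_hat == y)
    return predicted, correct / len(true_labels)


def toy_model(samples):
    """Hypothetical model: fixed probabilities over three classes."""
    table = {
        "a": [0.7, 0.2, 0.1],
        "b": [0.1, 0.8, 0.1],
        "c": [0.3, 0.3, 0.4],
    }
    return [table[s] for s in samples]


classes, accuracy = predict_classes(toy_model, ["a", "b", "c", "a"], [0, 1, 1, 0])
unlabeled_classes, no_accuracy = predict_classes(toy_model, ["b", "c"])
print(classes, accuracy)
```

Making the labels optional lets the same function serve both evaluation on the test set and inference on unlabeled images.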

Comparing Evaluation Metrics between Benchmark Model and Final Model

We make a method which prints all metrics (precision, recall, F1-score and support) for each class in the dataset.
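
As a rough illustration of such a per-class report, the sketch below computes precision, recall, F1-score and support from two label lists. The function names and the small label lists are invented for the example; the original project may well have used a library routine instead.

```python
def classification_metrics(true_labels, predicted_labels):
    """Per-class precision, recall, F1 and support as a dict keyed by class."""
    pairs = list(zip(true_labels, predicted_labels))
    report = {}
    for cls in sorted(set(true_labels) | set(predicted_labels)):
        tp = sum(1 for t, p in pairs if t == cls and p == cls)
        fp = sum(1 for t, p in pairs if t != cls and p == cls)
        fn = sum(1 for t, p in pairs if t == cls and p != cls)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        report[cls] = {"precision": precision, "recall": recall,
                       "f1": f1, "support": tp + fn}
    return report


def print_report(report):
    """Print one aligned row of metrics per class."""
    print(f"{'class':>5} {'precision':>9} {'recall':>6} {'f1':>6} {'support':>7}")
    for cls, m in report.items():
        print(f"{cls:>5} {m['precision']:>9.3f} {m['recall']:>6.3f} "
              f"{m['f1']:>6.3f} {m['support']:>7d}")


# Made-up labels for three classes, purely for illustration.
y_true = [0, 0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 0, 1, 1, 1, 2, 0, 2]
report = classification_metrics(y_true, y_pred)
print_report(report)
```

Support is the number of true samples of each class, which helps when reading the other three columns for classes with few examples.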

Metrics

We will use the following metrics: accuracy, precision, recall and F1-score. Log loss might also be a practical metric when we tune/refine our model solution, so we will use log loss as well. Let's study each metric carefully in the context of our problem.
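
Since log loss is mentioned but not defined, here is a small sketch of how it can be computed for multi-class predictions: it is the mean negative logarithm of the probability the model assigned to the correct class. The `eps` clipping value and the example probabilities are assumptions chosen for illustration.

```python
import math


def log_loss(true_labels, probabilities, eps=1e-15):
    """Mean negative log-probability assigned to the correct class."""
    total = 0.0
    for label, probs in zip(true_labels, probabilities):
        p = min(max(probs[label], eps), 1.0 - eps)  # avoid log(0)
        total += -math.log(p)
    return total / len(true_labels)


# Same two samples, once predicted confidently right, once confidently wrong.
confident_right = log_loss([0, 1], [[0.9, 0.1], [0.2, 0.8]])
confident_wrong = log_loss([0, 1], [[0.1, 0.9], [0.8, 0.2]])
print(confident_right, confident_wrong)
```

Unlike accuracy, log loss penalizes confident wrong predictions heavily, which makes it a useful signal while refining a model.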
Accuracy: Accuracy is the most intuitive performance measure; it is simply the ratio of correctly predicted observations to the total observations. We can use it here if we do not care specifically about which letter or digit is misclassified.
Precision: Precision is the ratio of correctly predicted positive observations to the total predicted positive observations. We use it here to evaluate false positives, as high precision corresponds to few false positives.
Recall (Sensitivity): Recall is the ratio of correctly predicted positive observations to all observations in the actual class. We use it to select our best model when there is a high cost associated with a false negative (misclassified image).
F1 score: The F1 score is the harmonic mean of precision and recall. Therefore, this score takes both false positives and false negatives into account. F1 is usually more useful than accuracy, especially if you have an uneven class distribution. Accuracy works best if false positives and false negatives have similar cost. If the costs of false positives and false negatives are very different, it is better to look at both precision and recall, which means using the F1 score.

Hyperparameters used for the model:
Dropout rate: 20% of the layer nodes.
Epochs: 10, then we will fit the model incrementally on 20 more epochs.
Batch size: 20, as it is a sufficient amount and evenly divides both the size of the total training dataset and the size of the total testing dataset.
Optimizer: After the refinement section we will see that the best optimizer to use is Adam.
Activation Layer: After the refinement section we will see that the best activation to use is relu.
Kernel initializer: After the refinement section we will see that the best kernel initializer to use is uniform.
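
The batch size and epoch counts listed among the hyperparameters can be checked with a little arithmetic. The sketch below uses the dataset sizes quoted earlier (13440 training and 3360 testing images); the helper name `steps_per_epoch` is illustrative.

```python
def steps_per_epoch(n_samples, batch_size):
    """Return (full_batches, leftover_samples) for one pass over the data."""
    return divmod(n_samples, batch_size)


BATCH_SIZE = 20
train_steps, train_leftover = steps_per_epoch(13440, BATCH_SIZE)
test_steps, test_leftover = steps_per_epoch(3360, BATCH_SIZE)

# Total weight updates after the initial 10 epochs plus 20 more.
total_updates = train_steps * (10 + 20)
print(train_steps, train_leftover, test_steps, test_leftover, total_updates)
```

A leftover of zero for both sets means every batch is full, so no partial batch needs special handling.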

Benchmark

We will use a very simple (vanilla) CNN model as a benchmark and train/test it using the same data that we used for our model solution.
Our vanilla CNN will consist of:
A single convolutional layer of 16 filters and a window size of 3 to capture basic patterns such as edges from the input images.
A single pooling layer to down-sample the input, enabling the model to make assumptions about the features so as to reduce overfitting. It also reduces the number of parameters to learn, reducing the training time.
The last layer is the output layer with 38 neurons (the number of output classes). It uses the softmax activation function, as we have multiple classes; each neuron gives the probability of its class.
We will train our model using the Adam optimizer, cross entropy (log loss) as the loss function, a batch size of 20 (to reduce training time and overfitting) and finally 5 epochs, as we want a simple model just to capture the basic patterns.
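
Two details of the benchmark description can be illustrated in a few lines: how softmax turns the 38 output scores into class probabilities, and how few parameters a single 16-filter convolutional layer has. The logits below are arbitrary placeholders, and the parameter count assumes single-channel (grayscale) input, which the paper does not state.

```python
import math


def softmax(logits):
    """Turn raw output scores into probabilities that sum to 1."""
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - shift) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def conv2d_param_count(filters, kernel_size, in_channels):
    """Weights plus one bias per filter for a square-kernel convolution."""
    return (kernel_size * kernel_size * in_channels + 1) * filters


# 38 output neurons, one per class; these logits are arbitrary placeholders.
logits = [0.0] * 38
logits[5] = 3.0
probabilities = softmax(logits)
predicted_class = probabilities.index(max(probabilities))

# Assumes grayscale input (in_channels=1); not stated in the paper.
conv_params = conv2d_param_count(filters=16, kernel_size=3, in_channels=1)
print(predicted_class, conv_params, sum(probabilities))
```

The predicted class is simply the neuron with the highest probability, which is the quantity the cross-entropy loss pushes toward the true label during training.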
Conclusion

Free-Form Visualization
In this project I built a CNN model which can classify Arabic images into digits and letters. We tested the model on more than 13000 images with all possible classes and got a very high accuracy of 98.86%, which is much better than the benchmark model.
See the following comparison charts to see the clear improvement.

References

HMM Based Approach for Handwritten Arabic Word Recognition
An Arabic handwriting synthesis system
Advanced Convolutional Neural Networks
Normalization-Cooperated Gradient Feature Extraction for Handwritten Character Recognition
