Indian Paper Currency Recognition With Audio Output System For Visually Impaired Based On Image Processing Using Transfer Learning
Indian Paper Currency Recognition With Audio Output System For Visually Impaired Based On Image Processing Using Transfer Learning
Indian Paper Currency Recognition With Audio Output System For Visually Impaired Based On Image Processing Using Transfer Learning
https://doi.org/10.22214/ijraset.2022.45699
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
Abstract: We give an outline of camera-based computer vision technology in this paper. It can aid those who are not visually
blessed in instantly identifying paper currency. For those who are blind or visually challenged, an effective paper money
detection algorithm should have the following qualities: 1) Complete precision; and 2) adaptability in a range of circumstances
in various environments and emergence. Many of the currency identification algorithms in use today are constrained to specific
situations. We suggest a deep learning strategy focused on high accuracy in this project. This method works well for gathering
more class-specific data and is better at handling partial closure and viewpoint changes. Evaluation of transfer learning also
demonstrate its efficiency in coping with visual rotation, measures, and illumination changes.
Keywords: Indian Bank Notes Recognition, Visually Impaired, Transmission Reading, CNN
I. INTRODUCTION
Vision assists us with performing day to day undertakings as well as influences an individual's way of behaving. In the mean time,
innovation is developing quickly day by day. By utilizing innovation, individuals fathom their issues over time. Advanced adaptable
frameworks within the genuine world require a monetary acknowledgment framework. There are other conceivable employments in
genuine life such as cash registers, cash checking frameworks, cash trade machines, and a money-monitoring framework to assist
the outwardly disabled.Vision assists us for performing day to day undertakings as well as influences an individual's way of
behaving. Visual deficiency influences a person’s mental conduct; an individual with a visual impedance or an individual with a
typical vision may be less discouraged than a dazzle individual with a need of social intuitive and may moreover have an uneasiness
clutter. One of the greatest issues for individuals who are daze is seeing the genuine thing around them. Man is able to see and
recognize things, and after that meet the require for security and can believe their nature. For outwardly impeded individuals, one of
the greatest issues is seeing notes. The most cause of unmistakable abandons are unpredictable refractive abandons, cataract,
macular- related decrease, glaucoma, diabetic retinopathy, corneal murkiness, and trachoma (Ali & Manzoor). Numerous
individuals have utilized an assortment of cash acknowledgment strategies, such as the composition, estimate, and colour of a
money note. The cutting edge control of computer and the accessibility of a camera make it simple for us to construct an effective
framework that sees Indian cash. There are seven Pakistan money plans, shifting in size and colour. The proposed approach might
see a diverse Indian money [10, 20, 50, 100, 200, 500, and 2000], which changes over the yield into the money of a money into a
daze or outwardly impeded individual.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3037
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
There's a program, which comprises of 2 categories: hardware and software. The Equipment area comprises of three fundamental
parts: the establishment portion, the preparing portion, and the expulsion portion. Initially it will get a picture from the camera.
Within the handle of preparing the banknotes and coins are isolated by a microcontroller. The yield component contains the voice
recorder IC and speaker to supply yield to the microcontroller within the frame of voice and paper money. One paper speaks to the
fast-track handle utilized for the progressed binarization of content extraction from permit plate pictures captured by a portable
camera. Depending on the color conspire the analysts spoken to the innovation of content extraction and fracture of the content
picture captured by the camera. The restrain of this strategy was to get large-size writings with complex lighting changes and the
program was exceptionally complex. Jian Yuan et al. print a paper speaking to the content extricated from the billboards by taking a
picture with the assistance of a portable camera. Sometimes there's a money related emergency and liquidation due to the utilize of
the camera. In another paper, there are two sorts of fake identification methods included. One of them is to induce Ultra Violet (UV)
utilizing lab see and the other is the light division when transmitted by investing cash. The result will be superior in case both comes
about are great. Bank notes are effortlessly recognizable concurring to their particular colors with a gadget created by Mohammed et
al. But the result isn't so precise. The framework was planned that was solid in keeping the light on the paper cash. Fair depend on
banknotes to be tried on a level glass, and the framework identifies whether the paper cash is fake or not, and recognizes the esteem
of notes utilizing an infrared camera. Cash can be gotten and seen utilizing neural innovation. Words in printed content can be
identified by employing a versatile cursor such as a pencil. A little camera is connected to the cursor. When the client drops the
cursor at that point the voice synthesizer will donate the result. For the primary time with the assistance of a pointer, the locale is
analyzed. On the off chance that there's more content over the cursor then the picture is partitioned into squares. Each piece is
classified as ‘character’. At that point the next step is to seize the 'character' pieces. An extra step is picture examination of extremity.
In case extremity may be a diversion at that point the pixels of the mass-produced picture are changed over. Based on the Java stage,
the content- perusing program is planned to assist individuals with visual disabilities. In this case the content is changed over into
discourse. Content from a picture can be gotten employing a little representation, a novel-based approach and a non-guarded
perusing strategy. More approximately al. propose inserted innovation. Where the ARM 11 microcontroller can be utilized for
picture and video information extraction. Siddharth Mody et al. created a versatile "Text-to-Speech Converter". Discourse processor
changes over content to console input into discourse.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3038
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
The delta-E distance among preparing and information testing to recognize the sort of cash and and matching template for detection
of different identification marks, both end with 99 percent precision. In any case, the variety picture histogram and Markov's chain
of surface examination show exceptionally low precision as various banknotes can have similar tones and surfaces, hence lessening
proficiency.
The life expectancy of a bank can't be foreordained yet can have physical damages, for example, dirtinessbrought about by over the
sweaty touch of many people, oil microorganisms, and mud-borne microbes can cause these damages. To keep these ruined notes
from getting older and to keep them from spreading, a data set of new and old paper cash is perused on paper [4]. Picture acquisition
and preprocessing, highlight determination and result, and the development of the segment model were project stages. The Modified
SMOTE Algorithm has been utilized to reinforce old banknotes. The paper cash model was created utilizing the conventional
Support Vector Machine (SVM) calculation. By examining the study, the many benefits are that such a strategy can work on the
popularity accuracy ratio of around 20% and tackle the issue of recycling problem somewhat. However, the primary issue is the
outright and in this way the general worth of the old bank cash tests is far away yet those of the most recent financial balance and
subsequently the dataset is typically not equivalent. In the field of banking, arranging, counting, arranging, arranging and erasing
can be time consuming.
This paper [5] recommended a reaction involving TMS320C6416 DSK as a DSP upgrade stage, SV253A4 as a picture sensor,
XRD98L23 as a sensor processor and can kill fast assortment of bank picture images and use TMS320 organization's + 4/2C
photograph/video library. - TI pre-screening bank picture as a focal channel and securing of Sobel administrator edge to fulfil the
picture quality necessity and assortment speed to finish distinguishing proof highlights. Yet, this approach is significantly more
muddled because of the inclusion of numerous intricate sensors and equipment frameworks. There are gadgets currently accessible
inside the market however they don't work for Malaysian assets and the gadgets are extravagant.
The reason for this task [6] is to make something practical to assist with blinding individuals to isolate Malaysian paper cash. Cash
Note Recognizer (CNR) identifies different highlights of the Banknote and gives the result in grouping as bell burst sounds. The
ATMega328P-PU microcontroller will recognize different notes to the info gave from TCS230 and send the decent sound example
to the bell as a sign. Because of the insurance of the variety sensor utilizing a light safeguard, the result waves were inside an
exceptionally brief distance so the framework was basic. However, assuming that you take a gander at the bigger extension, this
technique doesn't function admirably, on the grounds that the sort of task has specific impediments.
B. Image Pre-process
The main objective of image filtering or preprocessing is to extend image perceivability and progress the affect of data sets. Image
pre-processing is one of the foremost common activities required earlier to key information examination and information extraction.
Image pre-processing, too called Image rebuilding, incorporates twisting alteration, clamor diminishment, and the sound presented
amid the photography handle. Picture alteration can upgrade the exactness of the test. It includes machine learning and deep learning
calculations to distinguish currency types.
C. Eliminate Background
The images are taken in an assortment of situations, depending on the lighting conditions and background while the money within
the photo itself can be harmed. Image segmentation is critical in decreasing information handling and evacuating undesirable
highlights (background) that will include choice- making.
D. Feature Extract
Feature removal is a special type of size reduction. When the input calculation is as well huge to be handled and the input
information isn't required it'll be changed over into a set of diminished values. Changing over the input information to a highlight set
is called a Feature Extraction.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3039
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
On the off chance if the extricated components are carefully chosen it is anticipated that the preset components will extricate the
related information from the input information to perform the desired work utilizing this decreased estimate rather than the full-size
input.
B. Dart
Dart is a client-optimized programming language used to code Flutter apps. Dart prioritize both development and high-quality
production. It combines in time compilation and ahead of time compilation. As it eliminates XML files, no separate declarative
layout like XML is required. The reason to use Dart is its portability and accessibility.
C. Flutter Application
When the app starts it searches for the camera feed. If the camera feed is not available it asks for the permission to turn on the
camera. Allowing the camera helps the person to click image. After the image is clicked the Image Processing starts. It retrieves the
data from the different datasets trained and Displays image and say out loud the current currency number. If we want to take the
picture again, click the retake Button.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3040
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
B. Transfer Learning
Using the knowledge gained from the previous problem of pre-trained model into a new problem is known as the Transfer Learning
in Machine Learning. To increase the prediction of a new task the machine used the knowledge learned from the previous
assignment. It reduces the training time, improve the performance of the neural network and also the absence of a large amount of
data. CNN is a multilayer neural network that was actually motivated by the visual cortex of animals.
We first run a Deep learning algorithm on a given datasets to generate the model. Our Transfer Learning techniques will use the
layers that have been pre-trained on source to solve a task that has been targeted. We download the pre-trained model from the
internet and remove the top portion (the fully-connected layer). It leaves us with only the convolutional and pooling layers. Using
the pre-trained layers, we will extract various visual features from our dataset. We freeze specific layers from the training and use
pre-trained weights by updating them with backpropagation. We first train the model without Fine-tuning. We initialize the best
trained weights and recompile the model allowing for back propagation to update the last two pre-trained layers. We again initialize
our Fully-Connected layer and also its weights for training.
We generate a model that can outperform a custom written CNN by using different Transfer Learning strategies such as Fine-
Tuning. We solved the paper cash recognition using a custom datasets of pre-trained model of Transfer Learning.
C. Fine-Tuning
Fine Tuning is the process of taking weights of a trained neural network and using it as an input or initialization for a new model
being trained on data from the image/videos. It speeds up the training process and also overcome the small datasets size. In this
model we load the pre-trained layers, pass image data through it and fine tune the trainable layers nearby Fully Connected Layer.
VII. RESULTS
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3041
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3042
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 10 Issue VII July 2022- Available at www.ijraset.com
[6] Ingulkar Ashwini Suresh,”Indian Currency Recognition and verification Using Image Processing”.IRJETInternational Research Journal of Engineering and
Technology, Vol.3, Issue-6, 2016
[7] Mohd Bilal Ganai”Implementation of Text to Speech Conversion Technique”-International Journal of Innovative Research in Computer andCommunication
Engineering (AnISO 3297: 2007 Certified Organization) Vol. 3, Issue 9,September 2015
[8] Mriganka Gogoi, et al.,”Automatic Indian Currency Denomination Recognition System based on Artificial Neural Network”, 978-1-4799-5991-4/15©2015
IEEE
[9] Kuldeep Verma, et al.,”Indian Currency Recognition Based On Texture Analysis”, Institute Of Technology, NirmaUniversity, Ahmedabad – 382 481, 08-
10 December, 2011
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 3043