0% found this document useful (0 votes)

2 views

AI_Based_Image_Processing

This document proposes an AI-based application that automates the development cycle for generating React JS scripts from images of charts, significantly reducing development and testing efforts. The system predicts chart attributes, generates validation scores, and facilitates easy implementation of changes, thereby streamlining the UI development process. The approach utilizes various machine learning algorithms for image classification and text detection, ultimately enhancing efficiency and accuracy in UI development tasks.

Uploaded by

sadwumble

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

AI_Based_Image_Processing

Uploaded by

sadwumble

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Title: AI based applica-on to process image and auto generate React JS scripts to minimise

development and tes-ng eﬀorts.

Abstract: As an average one engineer put 2-3 weeks eﬀort to develop and validate one chart,
moreover UX team reviews chart, QA team validate chart and data to ensure developed charts are
aligned with the given wireframe and it is mee-ng business requirement. It requires equal eﬀort when
there is change request/ enhancement suggested by product manager and en-re development life
cycle should be followed. We are proposing a mechanism to -

• Automate overall development cycle using the AI model.

• Predict chart type, chart -tle, sub-tles, legends, legend colour, X-Axis, Y-Axis.
• Generate React JS code
• Generate valida-on score to compare UX with actual result, this will save UX review eﬀort.
• Easy implementa-on of any rework/ enhancement

Problems Solved:
End to end data ﬂow diagram:
A9ributes list 1-Input image details:
Ini-al
Data Source Field
weight
Image Name 100%
Image Path 100%
Image Dataset
Image Resolu-on 100%
Image Format 50%

Derived A9ributes list 2:

Ini-al l
Data Source Field
weight
Image Chart Type 100%
Image Dataset Image Colour 100%
Image Text 100%

Progress Flow diagram:

Data CreaCon:
The first and the foremost part is the data set to be considered for analysis and predic-ons. The system
uses dataset which is formed by gathering the images from all the exis-ng dashboard graphs on the
Tech Pulse Portal from Veneer. The images are saved in a specific naming system. The name consists of
the chart name followed by the chart type further and saved as a valid image format like jpg, png or
jpeg image. The dataset consists of all the images of various graphs collec-vely stored in one single
folder which is used as the dataset for our predic-on model.
Data Cleaning and Processing:
The main challenge lies in handling a highly imbalanced dataset with a low rate of failed data points.
Raw dataset has various issues which need to be pre-processed before training. Hence images need to
be resized using open CV python packages to address these challenges and improve the model
performance. Further different data processing techniques are used to make the predic-on model
efficient for training.

Image DetecCon:
Python split() method is used to split the image name string by specifying the separator in order to
fetch the chart type and store it in the form of labels. A one hot encoding is used which allows the
representa-on of the fetched labels to be easily processed by the predic-on algorithm. This helps maps
the categorical values to integer values and represent as binary vectors.

Colour DetecCon:
By default, open cv converts the image into BGR (Blue Green Red). In order to get the original image,
we convert the image again into RGB format using cvtColor() python method under CV2 package.

Text DetecCon and extracCon:

The predic-on model used for text detec-on works best and gives best accuracy under the ﬁxed image
resolu-on at 720 pixels and 13 FPS. Hence, the resized image is then converted to the required
resolu-on.

Model SelecCon:
There are diﬀerent data mining and deep learning algorithms useful for predic-ons such as Random
Forest, Neural Network, CNN. The type of dataset used to build the model mainly decides the selec-on
of an algorithm. Here three algorithms CNN, OCR, East text detec-on Model are used for modelling
because of their capability to predict and classify the image data set for mul--class classiﬁers according
to the requirement.

Image ClassiﬁcaCon and DetecCon:

Our input is a training dataset that consists of N images, each labelled with one of K different classes.
Then, we use this training set to train a classifier to learn what every one of the classes looks like. In
the end, we evaluate the quality of the classifier by asking it to predict labels for a new set of images
that it has never seen before. We will then compare the true labels of these images to the ones
predicted by the classifier. We have different algorithms used for classifica-on of images.

Random Forest:
The random forests algorithm is a machine learning technique that is increasingly being used for image
classiﬁca-on and crea-on of con-nuous variables such as percent tree cover and forest biomass.
Random forests is an ensemble model which means that it uses the results from many diﬀerent models
to calculate a response. In most cases the result from an ensemble model will be beder than the result
from any one of the individual models. In the case of random forests, several decision trees are created
(grown) and the response is calculated based on the outcome of all of the decision trees.
However, the dataset we have consists of the large number of images and neural network is the next
best op-on for beder accuracy in classifying the images appropriately as it works best for the large
amount of dataset.

ArCﬁcial Neural Network:

Ar-ﬁcial Neural Network is capable of learning any nonlinear func-on. A single perceptron (or neuron)
can be imagined as a Logis-c Regression. Ar-ﬁcial Neural Network, or ANN, is a group of mul-ple
perceptron/ neurons at each layer. ANN is also known as a Feed-Forward Neural network because
inputs are processed only in the forward direc-on. ANNs have the capacity to learn weights that map
any input to the output. ANN consists of 3 layers – Input, Hidden and Output. The input layer accepts
the inputs, the hidden layer processes the inputs, and the output layer produces the result. Essen-ally,
each layer tries to learn certain weights.

The number of parameters in a neural network grows rapidly with the increase in the number of layers.
This can make training for a model computa-onally heavy (and some-mes not feasible). Tuning so
many of parameters can be a very huge task. The -me taken for tuning these parameters is diminished
by CNNs.

CNN:
Convolu-onal Neural Networks (CNNs) is the most popular neural network model being used for image
classiﬁca-on problems. The prac-cal beneﬁt is that having fewer parameters greatly improves the -me
it takes to learn as well as reduces the amount of data required to train the model. Instead of a fully
connected network of weights from each pixel, CNN has just enough weights to look at a small patch
of the image. It’s like reading a book by using a magnifying glass; eventually, you read the whole page,
but you look at only a small patch of the page at any given -me.

The beauty of CNN is that the number of parameters is independent of the size of the original image.
You can run the same CNN on a 300 × 300 image, and the number of parameters won’t change in the
convolu-on layer.

All the layers of a CNN have mul-ple convolu-onal filters working and scanning the complete feature
matrix and carry out the dimensionality reduc-on. This enables CNN to be a very apt and fit network
for image classifica-ons and processing. CNN learns the filters automa-cally without men-oning it
explicitly. These filters help in extrac-ng the right and relevant features from the input data.

Colour Code DetecCon:

K-Means:
Clustering is one of the most common exploratory data analysis techniques used to get an
intui9on about the structure of the data. It can be defined as the task of iden9fying subgroups
in the data such that data points in the same subgroup (cluster) are very similar while data
points in different clusters are very different. In other words, we try to find homogeneous
subgroups within the data such that data points in each cluster are as similar as possible
according to a similarity measure such as Euclidean-based distance or correla9on-based
distance.
K-means algorithm is an itera9ve algorithm that tries to par99on the dataset into K pre-
defined dis9nct non-overlapping subgroups (clusters) where each data point belongs to only
one group. It tries to make the intra-cluster data points as similar as possible while also keeping
the clusters as different (far) as possible.
AHer training the model using K-mean algorithm by specifying the number of clusters, we fetch
cluster centres for each cluster and convert the extracted RGB values to Hex colour code.

Text DetecCon and extracCon:

Text detec-on techniques required to detect the text in the image and create and bounding box around
the por-on of the image having text. OpenCV package uses the EAST model for text detec-on. The
tesseract package is for recognizing text in the bounding box detected for the text.

OCR
Op9cal character recogni9on or OCR refers to a set of computer vision problems that is
required to convert images of digital or hand-wriNen text images to machine readable text in
a form your computer can process, store and edit as a text ﬁle or as a part of a data entry and
manipula9on soHware.

EAST (Eﬃcient accurate scene text detector)

OpenCV’s text detector implementa-on of EAST is quite robust, capable of localizing text even when
it’s blurred, reflec-ve, or par-ally obscured. This is a very robust deep learning method for text
detec-on and only a text detec-on method. It can be used in combina-on with any text recogni-on
method. EAST can detect text both in images and in the video. It runs near real--me at 13FPS on 720p
images with high text detec-on accuracy. We use the Pre-trained EAST model and define the output
layers. Further PyTessaract python package is used for text recogni-on for each block created aler the
text detec-on process. It detects various fields of chart image like Chart name, the legends inside the
chart and X-axis and Y-axis for par-cular chart type.

Training and TesCng:

We split the original training data into 80% training and 20% valida-on to op-mize the classifier. The
test data is separated to finally evaluate the accuracy of the model on the data it has never seen. This
helps to see whether we are over-finng on the training data and whether we should lower the learning
rate and train for more epochs if valida-on accuracy is higher than training accuracy or stop
overtraining if training accuracy shil higher than the valida-on. We train the model for 100 epochs
with a batch size of 50 for CNN predic-on model.

Model Accuracy:
AUC-ROC curve is the model selec-on metric for mul- class classifica-on problem to depict how good
the model is for differen-a-ng the given classes as in terms of the predicted probability, ROC is used
which is nothing but a probability curve for various classes calculated aler training the model and
finding the predic-ons. The x-axis displays the False Posi-ve Rate (FPR) and the y-axis displays the True
Posi-ve Rate (TPR) values of probability. Here the threshold value is nothing but the op-mal cut-off
point in the curve. This point is present where the TPR value is high and the FPR value is low.
Sample Input image and outcome of model:
Image

Colour and Text Detec-on Outputs

React Code generaCon:
This module takes output of the above models like predicted chart type, colour codes, recognized chart
ﬁelds like the name, legends, X-axis and Y-axis as an input. The automated code is generated as per the
chart type. This u-lity generates a react code template and populates the predicted input values for
each chart type. This react code is generated using Python script which is redirected to .js ﬁle and
which can be downloaded by UI/UX Developers.

Advantages:
• With the help of AI, detec-ng and recognizing objects and paderns in images can be used to
fast-track overall UI development.
• The biggest advantage is automa-on in overall life cycle for end-to-end development of user
interface (UI)
• Any change request in exis-ng code can be completed with this solu-on in few minutes which
can save months of eﬀort for development and valida-on.
• Less chances of human errors for repe--ve work.
• Reduc-on in manual eﬀort for understanding images and code development using React JS
• The proposed solu-on will be next genera-on AI-based image processing solu-on, and can be
used in many applica-ons within HP.

Author:
Ravikumar Patel

Senior SoHware Engineer

IRIS SoHware Ltd

Training Manual V23 1006 IFM
100% (1)
Training Manual V23 1006 IFM
139 pages
CV_T3_ Unit-7
No ratings yet
CV_T3_ Unit-7
36 pages
Deep 2
No ratings yet
Deep 2
57 pages
exam AI
No ratings yet
exam AI
5 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
37 pages
Plant Disease Identification
No ratings yet
Plant Disease Identification
17 pages
IMAGE CLASSIFICATION USING CNN PALLAVI
No ratings yet
IMAGE CLASSIFICATION USING CNN PALLAVI
26 pages
DIP Mini Project
100% (1)
DIP Mini Project
12 pages
p8
No ratings yet
p8
7 pages
Image Classification - Building Image Classification Model
No ratings yet
Image Classification - Building Image Classification Model
18 pages
Exp 9 DL
No ratings yet
Exp 9 DL
5 pages
Traffic Sign Classification Slides
No ratings yet
Traffic Sign Classification Slides
29 pages
Traffic Sign Classification: Mezzi Houssem
No ratings yet
Traffic Sign Classification: Mezzi Houssem
36 pages
Machine Learning Lab8 PDF
No ratings yet
Machine Learning Lab8 PDF
14 pages
DL Lab-final
No ratings yet
DL Lab-final
22 pages
Pattern Recognition
No ratings yet
Pattern Recognition
14 pages
Image Classification Using CNN: Page - 1
No ratings yet
Image Classification Using CNN: Page - 1
13 pages
CO2_CNN_3
No ratings yet
CO2_CNN_3
31 pages
Project Report Final 1
No ratings yet
Project Report Final 1
63 pages
CS231n Convolutional Neural Networks For Visual Recognition
No ratings yet
CS231n Convolutional Neural Networks For Visual Recognition
1 page
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
AI Training2024Haile
No ratings yet
AI Training2024Haile
37 pages
Image Classification Using Small Convolutional Neural Network
No ratings yet
Image Classification Using Small Convolutional Neural Network
5 pages
Dissertation
No ratings yet
Dissertation
86 pages
Real Time Object Recognition and Classification
No ratings yet
Real Time Object Recognition and Classification
6 pages
CNN Image Classification - Image Classification Using CNN
No ratings yet
CNN Image Classification - Image Classification Using CNN
9 pages
Assignment-6 STC-DL
No ratings yet
Assignment-6 STC-DL
17 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
Gender Recognition: Meghdut Nandy
No ratings yet
Gender Recognition: Meghdut Nandy
11 pages
Deep Learning for Remote Sensing Images with Open Source Software (Rémi Cresson) (Z-Library)
No ratings yet
Deep Learning for Remote Sensing Images with Open Source Software (Rémi Cresson) (Z-Library)
165 pages
Image Classification using MNIST Dataset
No ratings yet
Image Classification using MNIST Dataset
28 pages
Report23 24
No ratings yet
Report23 24
55 pages
Deep Learning Examples With Pytorch And Fastai A Developers Cookbook Bernhard J Mayr instant download
No ratings yet
Deep Learning Examples With Pytorch And Fastai A Developers Cookbook Bernhard J Mayr instant download
90 pages
03 Convolution Neural Networks and Computer Vision With Tensorflow
No ratings yet
03 Convolution Neural Networks and Computer Vision With Tensorflow
21 pages
Research and Prospect of Image Recognition Based o
No ratings yet
Research and Prospect of Image Recognition Based o
7 pages
CNN with TensorFlow and Keras
No ratings yet
CNN with TensorFlow and Keras
11 pages
POSTER Classification of Fruits and Detection of Disease Using CNN
No ratings yet
POSTER Classification of Fruits and Detection of Disease Using CNN
1 page
PBBML L11
No ratings yet
PBBML L11
44 pages
AD8552-ML-UNIT-V (1)
No ratings yet
AD8552-ML-UNIT-V (1)
78 pages
Bao Cao Btl Python (2)
No ratings yet
Bao Cao Btl Python (2)
28 pages
Ee210-Project Report Pdf-Ilovepdf-Compressed
No ratings yet
Ee210-Project Report Pdf-Ilovepdf-Compressed
59 pages
CNN Eem305
100% (1)
CNN Eem305
7 pages
Object Recog
No ratings yet
Object Recog
102 pages
FULLTEXT02
No ratings yet
FULLTEXT02
87 pages
Deep Learning Project for Computer Vision with Python 2022
No ratings yet
Deep Learning Project for Computer Vision with Python 2022
297 pages
Max78000 Article Series Part 1
No ratings yet
Max78000 Article Series Part 1
4 pages
Jatin Shinde ANN MINIPROJECT
No ratings yet
Jatin Shinde ANN MINIPROJECT
13 pages
Jetson Nano
100% (1)
Jetson Nano
349 pages
Deep Learning With Python
100% (5)
Deep Learning With Python
396 pages
MA AjamMontassar 201704
No ratings yet
MA AjamMontassar 201704
65 pages
unit 3_1_1709014556934
No ratings yet
unit 3_1_1709014556934
49 pages
Image Recognition Using Machine Learning Research Paper
No ratings yet
Image Recognition Using Machine Learning Research Paper
5 pages
Deep Learning for Vision Lab Manual 2024
100% (1)
Deep Learning for Vision Lab Manual 2024
25 pages
DIP Unit-3 Chapter-2 Lecture 4
No ratings yet
DIP Unit-3 Chapter-2 Lecture 4
14 pages
Final Presentation
No ratings yet
Final Presentation
30 pages
Deep Learning notes
No ratings yet
Deep Learning notes
155 pages
Computer Vision Part 2
No ratings yet
Computer Vision Part 2
5 pages
SSRN Id3611339
No ratings yet
SSRN Id3611339
4 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Image Segmentation: Unlocking Insights through Pixel Precision
From Everand
Image Segmentation: Unlocking Insights through Pixel Precision
Fouad Sabry
No ratings yet
Canon_PIXMA_MX4921453040654775
No ratings yet
Canon_PIXMA_MX4921453040654775
1,004 pages
ENG6500 8 DL IntroductionToDeepLearning Part2
No ratings yet
ENG6500 8 DL IntroductionToDeepLearning Part2
65 pages
Artificial Wisdom_ a Potential Limit on AI in Law (and Elsewhere)
No ratings yet
Artificial Wisdom_ a Potential Limit on AI in Law (and Elsewhere)
40 pages
Better Images of AI Guide Feb 23
No ratings yet
Better Images of AI Guide Feb 23
16 pages
125968537
No ratings yet
125968537
9 pages
2024-3-2
No ratings yet
2024-3-2
20 pages
92830B90-FE3B-11EF-A416-FA90EC0D8D58
No ratings yet
92830B90-FE3B-11EF-A416-FA90EC0D8D58
20 pages
2433Ringvoldetalfinal
No ratings yet
2433Ringvoldetalfinal
22 pages
Lazy Nezumi Pro Doc
No ratings yet
Lazy Nezumi Pro Doc
42 pages
BLZ Log
No ratings yet
BLZ Log
29 pages
Presentation JHS
No ratings yet
Presentation JHS
37 pages
LAS ICT 3 Advanced Word Processing Skills
No ratings yet
LAS ICT 3 Advanced Word Processing Skills
15 pages
Sap How To Replace The Old Style Text Editor
No ratings yet
Sap How To Replace The Old Style Text Editor
5 pages
Unified Library Application Unit 5
No ratings yet
Unified Library Application Unit 5
11 pages
Reindeer Graphics™ Focus Extender: Reindeer Graphics, Inc. P. O. Box 2281 Asheville, NC 28802
No ratings yet
Reindeer Graphics™ Focus Extender: Reindeer Graphics, Inc. P. O. Box 2281 Asheville, NC 28802
9 pages
Effects of emerging technologies in minimising variations in construction projec
No ratings yet
Effects of emerging technologies in minimising variations in construction projec
9 pages
AutoCAD Civil 3D 2019_ Fundamentals (Metric Units)_ Autodesk -- Ascent - Center for Technical Knowledge -- 1st Edition, Charlottesville, VA, 2018 -- 9781947456211 -- 1d7cccee9432fed4d2bb72c7214c4c5d -- Anna’s Archive
No ratings yet
AutoCAD Civil 3D 2019_ Fundamentals (Metric Units)_ Autodesk -- Ascent - Center for Technical Knowledge -- 1st Edition, Charlottesville, VA, 2018 -- 9781947456211 -- 1d7cccee9432fed4d2bb72c7214c4c5d -- Anna’s Archive
666 pages
PLX7100A Digital Mobile C-Arm X-Ray Machine: 1. Technical Specification
No ratings yet
PLX7100A Digital Mobile C-Arm X-Ray Machine: 1. Technical Specification
3 pages
Competition Templates
No ratings yet
Competition Templates
4 pages
unit 3 ICT (PDF) 12TH
No ratings yet
unit 3 ICT (PDF) 12TH
13 pages
Computer 1
No ratings yet
Computer 1
7 pages
Word Processing Software Grade 10 New Syllabus
No ratings yet
Word Processing Software Grade 10 New Syllabus
4 pages
Computer Systems Servicing Daily Lesson Log
0% (1)
Computer Systems Servicing Daily Lesson Log
56 pages
MANUALTESTINGNotes
No ratings yet
MANUALTESTINGNotes
90 pages
Rahul Term Paper 1
No ratings yet
Rahul Term Paper 1
46 pages
Shiny - Shinyapps - Io - Getting Started PDF
No ratings yet
Shiny - Shinyapps - Io - Getting Started PDF
12 pages
HiVac Axia SOP Final
No ratings yet
HiVac Axia SOP Final
36 pages
Run Batocera in A Virtual Machine
No ratings yet
Run Batocera in A Virtual Machine
18 pages
Csec It Work Book Answers 01
No ratings yet
Csec It Work Book Answers 01
4 pages
2nd Sem Internship Report Ruthuparna
No ratings yet
2nd Sem Internship Report Ruthuparna
32 pages
Te2000 TC3 Hmi en
No ratings yet
Te2000 TC3 Hmi en
2,166 pages
Project Based Learning
No ratings yet
Project Based Learning
13 pages
User Manual NVMS2.0
No ratings yet
User Manual NVMS2.0
133 pages
JAA Design Sketch Render Drawing Competition Pages 1-50 - Flip PDF Download _ FlipHTML5
No ratings yet
JAA Design Sketch Render Drawing Competition Pages 1-50 - Flip PDF Download _ FlipHTML5
84 pages
DIP End Sem Paper
No ratings yet
DIP End Sem Paper
2 pages
PWP Project
No ratings yet
PWP Project
28 pages
Steganography and Cryptography Approaches Combined Using Medical Digital Images IJERTV4IS060270
No ratings yet
Steganography and Cryptography Approaches Combined Using Medical Digital Images IJERTV4IS060270
4 pages