Document Analyzing Using Deep Learning
Document Analyzing Using Deep Learning
https://doi.org/10.22214/ijraset.2023.51443
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
Abstract: Many businesses and large organizations have a large set of documents that need to be stored in various locations.
cluster. In recent years, this task has become time consuming as the number of documents and articles has increased. Analysis
of Documents are her one of the subjective research techniques analysts use to validate ideas. with some help. A visual
technique that takes full advantage of layout and text formatting in a very clean output format. with the help of the model Most
large and lots of architectural documents can be sorted. With the help of layout models and new interaction strategies Various
layouts of any format within a single.
Keywords: CNN, Deep Learning, Document Analyzer, Pre-Processing.
I. INTRODUCTION
Nowadays, world where there is an enormous amount of text data, digitization of documents is a technology used in different and
so many and different types of fields. A domain with a large archive. Document Analyzer focuses on classifying documents based
on their text. Document images and layout. Documents can usually be classified differently in many contexts. when we try In the
task of analyzing text documents, document classification is an important procedure that must be followed. However, while
recording Classification must address several and various challenges, including: B. High variability and low variability within the
same document or class Between different classes or documents. Previous studies have shown structural similarity between classes
and document.
II.DOCUMENT TYPE
All organizations, including universities, schools, corporations, etc., have data in various forms. Documents such as rating reports
and tc cast certificates are evaluated using deep learning systems. CNN
III.PURPOSE/OBJECTIVE
Identification and classification of Target documents is the purpose of this investigation. A form of qualitative research, known as
document analysis, in which an analyst reviews documents to evaluate the subject of assessment.
A focus group or interview transcript, coding content into categories is a document analysis process.Prepare Your Paper Before
Styling
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 6636
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
V. LITERATURE REVIEW
1) Analysis and Perceptions, ICDAR 2019 Analysis and Perceptions, ICDAR 2019, Sydney, Australia, 20-25. September 2019;
pp. 726–731.
In 2019 edition of ICDAR, the International Conference on Document Analysis and Recognition ICDAR, which began in 1991 at St.
Malo in France, is celebrating its 28th anniversary at this exciting conference, which was organized by me and Prof. Guy Lorette.
ICDAR is now among the most significant international conferences in the pattern field. both artificial intelligence and recognition.
The primary topics covered are document analysis and recognition, handwriting analysis and verification, text detection and
processing, as well as other related subjects.
3) Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval Adam W. Harley, A. U.
The features used in this research papper were learned using deep convolutional neural networks and represent a new state-of-the-
art for document picture classification and retrieval (CNNs). Deep neural networks are capable of learning a hierarchical chain of
abstraction from pixel inputs to succinct and descriptive representations in object and scene analysis. In the context of document
analysis, the current work investigates this capability and finds that this representation method outperforms a number of common
hand-crafted alternatives. Additionally, experiments demonstrate that I CNN features are resilient to compression, (ii) CNNs trained
on non-document images perform well on tasks requiring document analysis, and (iii) with enough training data, it is not necessary
to enforce region-specific feature learning. Also, a new tagged subset of the IIT-CDIP collection with 400,000 documents is made
available through this study.
VI. ADVANTAGES
1) To analyze and classify the documents using CNN .
2) To extract features of the documents using algorithms.
3) Create a working model that classify the document on the basics of feature that are extracted.
4) The model will use image segmentation and CNN to determine the articles.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 6637
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
2) Model Builder uses an automated process called training to train a model to respond to contextual requests. Once trained, the
model can make predictions using completely new inputs. For example, you can estimate the price of a home and predict the
sale price if a new home is on the market. Model Builder uses automatic machine learning (AutoML), so it requires no input or
configuration during training.
3) “How long should I train?” To determine which model performs best, Model Builder uses AutoML to analyze different
models.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 6638
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
The process classifies the document according to their categories and classes, as we already discuss in datasets and training section
classes can be like mark sheets, caste certificate, tc etc. As shown second image above cluster of images are present and it has
classified into classes.
The sentences are used to examine the characters in the aforementioned input and to identify the most frequent pairings of the
different characters in the sentence.
Image Processing - This topic covers the fundamentals of image pre-processing. This aims to perform elementary picture pre-
processing such as image scaling, resizing, and compression, as well as morphological image pre-processing such as erosion and
dilation.
Pre-processed photos with a similar look will be this module's output.
OCR, often known as optical character recognition, is a method for extracting text from images. This module's objective is to
extract image.
The design of the document, the headers and footers, the document's text, and the style of writing all contribute to the identification
process and serve as criteria for determining a document type. Document kind A government certificate with a seal and/or a logo to
assist classify the document is an example of a form of document that shares common characteristics with other sorts of papers.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 6639
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
1) The above image is showing the result of the model that has been train using deep learning algorithm and as we get accurate
prediction of the document the document has analyzed successfully.
2) Our motive solution is a model that will correctly categorise and classify documents and articles. The model was created using
CNN and image feature extraction, and results were improved even further by fine-tuning these features that were taken from
document pictures.
3) The CNN method of representing document images is more effective than hand-made alternatives.
Fig: Document analyser tool home page Fig: document analyser login page
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 6640
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
XI. ACKNOWLEDGEMENT
We would like to thank Mrs. Surbhi Khare, our Professor-in-charge and our , HOD Mrs. Pallavi Choudhari for their support and
guidance in completing our project on the topic Document Analysing Using Deep Learning. It was a great learning experience.
I would like to take this opportunity to express my gratitude to all of my group members . The project would not have been
successful without their cooperation and inputs.
REFERENCES
[1] Emerson, S., Kennedy, R., O'Shea, L., & O'Brien, J. (2019, May). Trends and Applications of Machine Learning in Quantitative Finance. In 8th International
Conference on Economics and Finance Research (ICEFR 2019).
[2] Siami-Namini, S., & Namin, A. S. (2018). Forecasting economics and financial time series: Arima vs. lstm. arXiv preprint arXiv:1803.06386.
[3] Heaton, J. B., Polson, N. G., & Witte, J. H. (2017). Deep learning for finance: deep portfolios. Applied Stochastic Models in Business and Industry, 33(1), 3-
12.
[4] Moritz, B., & Zimmermann, T. (2016). Tree-based conditional portfolio sorts: The relation between past and future stock returns. Available at SSRN 2740751.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 6641