Intelligent Process Automation Ocr Whitepaper PDF
Intelligent Process Automation Ocr Whitepaper PDF
State of automation in modern enterprises p3/Overview of OCR p5/Need for intelligent OCR p7/
OCR complexities faced by RPA developers p8/UiPath 2017 vs UiPath 2018 comparison p10
www.pwc.in
2 PwC
State of automation in modern enterprises
In this era of technology disruption, enterprises are under immense pressure to digitise operations, and they are gearing up for
a future where human work can be augmented by software robots. Digitisation and automation continue to be the key business
drivers across various sectors and industries globally, including government organisations, which have now jumped onto the
automation bandwagon.
Enterprises are looking to build a digital workforce as part of their automation strategy by combining elements of robotic
process automation (RPA), artificial intelligence (AI), optical/intelligent character recognition (OCR/ICR) and analytics to
automate their business processes. While RPA technologies are capable of taking on low-value activities in a quick and efficient
manner, the next phase is leveraging such automation technologies to deliver intelligent process automation (IPA).
Cognitive learning
Robotic process
automation (RPA)
Plug-in
architecture tools
Business process
management
Macro or scripted
automation
Robotic process automation and intelligent character recognition: Smart data capture 3
With software robots becoming more advanced in recent years and undertaking more than just the automation of mundane
rule-based processes, organisations are expanding the scope of process automation end to end to include sections that were
initially deemed non-automatable as inputs were in the form of unstructured data, documents/scanned images, texts/human
judgement and natural language processing (NLP).
RPA + OCR
• Tasks with subjective rules, special
cases, etc.
• Relatively unstructured inputs
• Reading data from scanned images
Data analytics
• Requires significant human
judgement and expertise
• Data analytics provide valuable
insights in analysing and
forecasting data
Source: PwC analysis
1 Le Clair, C. (26 June 2018). The Forrester Wave™: Robotic Process Automation, Q2 2018. Retrieved from
https://www.uipath.com/hubfs/The_Forrester_Wave_RPA_2018_UiPath_RPA_Leader.pdf
(last accessed on 16 July 2018)
4 PwC
What is OCR and how does it work?
OCR is a technology that primarily aims to analyse an image,
detect based on patterns if the image contains text, and
extract that text into a machine readable format. This helps
convert scanned documents into a digitally editable format
while comparing the images of the available characters to the
ones stored on its database for traditional OCR engines. The
newer versions of OCR use machine learning (ML) techniques
to recreate the characters and render the best possible match
to the user.
Based on the image type and type of data that needs to
be extracted, these character recognition engines use the
options below to recognise text.
ICR/OCR:
ICR helps in converting handwritten text characters into a
machine-readable format. The core difference between OCR
and ICR is that in the case of OCR, its capability is restricted
to printed data that looks the same given the standardisation
of multiple fonts. However, in the case of ICR, it is intelligent
enough to decipher data from non-standard documents,
which contain handwritten texts with varied formats.
Robotic process automation and intelligent character recognition: Smart data capture 5
Semi-structured documents: Unstructured documents:
Semi-structured documents do not have a formal structure In this case, documents have no standard structure. The
in place for information. The document is usually the same, data is usually free-flowing and lacks consistency. Due
but design and layout may differ. The information will be to complexities in the way the data is presented in these
tagged in the document, but the placement of the information documents, it becomes challenging to come up with a solution
may vary from document to document. Common examples of for data extraction and companies usually have to appoint
semi-structured documents are invoices and purchase orders. staff who extract key information and feed the same into
the internal business systems. This is a time-consuming and
costly task, and prone to manual errors. Examples include
contracts, agreements and letters.
6 PwC
Where do enterprises need intelligent OCR?
Intelligent OCR is needed to simplify paper-driven processes where inputs are received in varied multiple formats such as PDF,
scanned, fax and handwritten documents. Examples of such processes where OCR can be implemented are:
Robotic process automation and intelligent character recognition: Smart data capture 7
What are the complexities faced by
RPA developers?
8 PwC
Scanning an already scanned image
Many a times, a hard copy document is printed which is a
scanned image. Scanning a printed copy of an already scanned
image would definitely impair the quality of the document,
thereby influencing the accuracy levels of the extracted data.
No labels on tables
Many invoices or purchase orders have tables where the
particulars are mentioned but they do not contain headers
or labels like amount, description and quantity. This
makes it challenging for the OCR engine to search for the
appropriate data.
Robotic process automation and intelligent character recognition: Smart data capture 9
The current RPA workarounds to potential OCR automation roadblocks may not be perfect and a foolproof solution may not
exist. UiPath has been working behind the scenes to tackle the roadblocks related to OCR automations and has integrated the
ABBYY OCR engine with its current OCR toolset to bring about a revolution in OCR automation. The 2018 version of UiPath
has been codenamed Firefly.
PwC was given a preview of Firefly and performed a comparative study of some of the key OCR functions/commands used in
Firefly and UiPath’s previous version, the 2017 enterprise RPA platform codenamed Moonlight. The table below compares and
assess the OCR capabilities of the two versions and scores them on three parameters.
UiPath 2018, Firefly, can help enterprises achieve intelligent OCR automation with ease using RPA. The integration of the
ABBYY OCR engine not only enhances automation for rules-driven processes, but also adds the flavour of NLP and widens the
scope of automation.
10 PwC
Firefly is an intelligent and enhanced version of its predecessor, Moonlight. Firefly brings to the table RPA coupled with cognitive
abilities, which help enterprises overcome the burden of comprehending unstructured data using the cognitive capabilities of
ABBYY FlexiCapture, amongst other ML/AI components.
Illustrative example
Sender 1 3b 9
Sends email with Receives email Receives email of
attached notifying of invalid successful with structured
document image attachment data and document image
Unattended bot 2 7a 8
Reads emails and Reads file with
Is it a semi-structured Sends confirmation8
pre-classifies document? structured data and
response
document image updates target database
FlexiCapture 3a 4 5a 6a
Classifies Extracts each Captures correction Stores all
document image recognised object or and updates structured data in
and selects template raises validation request internal records file per document
Attended bot 5b 5c
Receives validation
Sends back
request show
validated data item
pop-window user PC
Validator 6b
Reviews and
confirms or corrects
recognised result
Robotic process automation and intelligent character recognition: Smart data capture 11
Firefly: A preview
12 PwC
Organisations globally have accepted the reality that striking gold with processes that are high on volume will become harder
to find in the days to come. RPA vendors must get smarter to escape the tag of structured rule-based automation. While AI
will not replace RPA, RPA tools that use AI components will replace those that do not disrupt the modest roots of RPA. UiPath
is headed in this direction by collaborating with partners providing automation essentials such as data analytics, NLP, ML and
intelligent OCR engines.
As other RPA vendors in the market work on future versions of their tools and similar enhancements, we will bring out a
series of thought papers that cover other key players and product enhancements in the RPA world:
Issue 1 – RPA in a virtual environment (May 2018)
Issue 2 – RPA and intelligent optical character recognition (July 2018)
Robotic process automation and intelligent character recognition: Smart data capture 13
Notes
14 PwC
Notes
Robotic process automation and intelligent character recognition: Smart data capture 15
About PwC
At PwC, our purpose is to build trust in society and solve important problems. We’re a network of firms in 158 countries
with more than 2,36,000 people who are committed to delivering quality in assurance, advisory and tax services. Find
out more and tell us what matters to you by visiting us at www.pwc.com
In India, PwC has offices in these cities: Ahmedabad, Bengaluru, Chennai, Delhi NCR, Hyderabad, Kolkata, Mumbai and
Pune. For more information about PwC India’s service offerings, visit www.pwc.com/in
PwC refers to the PwC International network and/or one or more of its member firms, each of which is a separate,
independent and distinct legal entity. Please see www.pwc.com/structure for further details.
© 2018 PwC. All rights reserved
Contacts
Sumit Srivastav
Partner and Intelligent Process Automation Leader
PwC India
[email protected]
Authors
Nitin Kamra
Hariprasad Gajapathy
pwc.in
Data Classification: DC0
This document does not constitute professional advice. The information in this document has been obtained or derived from sources believed
by PricewaterhouseCoopers Private Limited (PwCPL) to be reliable but PwCPL does not represent that this information is accurate or complete.
Any opinions or estimates contained in this document represent the judgment of PwCPL at this time and are subject to change without notice.
Readers of this publication are advised to seek their own professional advice before taking any course of action or decision, for which they are
entirely responsible, based on the contents of this publication. PwCPL neither accepts or assumes any responsibility or liability to any reader of
this publication in respect of the information contained within it or for any decisions readers may take or decide not to or fail to take.
© 2018 PricewaterhouseCoopers Private Limited. All rights reserved. In this document, “PwC” refers to PricewaterhouseCoopers Private
Limited (a limited liability company in India having Corporate Identity Number or CIN : U74140WB1983PTC036093), which is a member firm of
PricewaterhouseCoopers International Limited (PwCIL), each member firm of which is a separate legal entity.
AW/July 2018-13763