An Electronic Document

The document discusses challenges with optical character recognition including small text, numbers mixed with letters in various fonts, unusual fonts, and colored backgrounds and text.

Uploaded by

Himanshi Gupta
Copyright
© All Rights Reserved

An Electronic Document

Summary

Electronic Sample Document

Small Text
The text of this paragraph is set in a very small size, which can cause problems for traditional OCR extraction techniques.

Numbers mixed with letters

Sometimes it is not clear whether the OCR engine is looking at a letter O or a number 0. When letters
are mixed with numbers, as in an invoice number or ID, the engine can sometimes make mistakes.
Here are some examples:

AI01O87 – LLNNLNN
A Number 0 (0)
A letter O (O)
A letter I (I)
A Number 1 (1)
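
The LLNNLNN pattern above (L for letter, N for number) can be used to correct look-alike characters after OCR. The following is a minimal sketch in Python, assuming the expected pattern of the ID is known in advance. The names apply_mask, matches_mask, TO_LETTER and TO_DIGIT are hypothetical and are not part of any OCR engine; only the O/0 and I/1 pairs shown above are handled.

```python
# Hypothetical post-OCR correction; the names below are illustrative only.
# Look-alike pairs taken from the examples above: O/0 and I/1.
TO_LETTER = {"0": "O", "1": "I"}
TO_DIGIT = {"O": "0", "I": "1"}


def apply_mask(text: str, mask: str) -> str:
    """Swap look-alike characters so each position fits the mask.

    The mask uses L for a letter position and N for a number position.
    """
    if len(text) != len(mask):
        raise ValueError("text and mask must have the same length")
    corrected = []
    for char, kind in zip(text, mask):
        if kind == "L":
            char = TO_LETTER.get(char, char)
        elif kind == "N":
            char = TO_DIGIT.get(char, char)
        else:
            raise ValueError(f"unknown mask symbol: {kind!r}")
        corrected.append(char)
    return "".join(corrected)


def matches_mask(text: str, mask: str) -> bool:
    """Return True if every character has the type the mask expects."""
    if len(text) != len(mask):
        return False
    return all(
        char.isalpha() if kind == "L" else char.isdigit()
        for char, kind in zip(text, mask)
    )


# An OCR misread of AI01O87 with I/1 and 0/O confused in three positions.
print(apply_mask("A1OIO87", "LLNNLNN"))  # AI01O87
```

A mask can only fix characters whose position is known to be a letter or a number. It cannot help with free text, where either reading is possible.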

Alternative font - Numbers mixed with letters

And again in a different font face:

AI01O87 – LLNNLNN
A letter O (O)
A letter I (I)
A Number 1 (1)

Unusual Font

The contents of this paragraph are written in an unusual font that can cause problems for
traditional OCR extraction techniques.

Coloured Backgrounds and Text

The text in this section uses multiple colours on top of a shaded background, which can cause
problems for traditional OCR techniques.
