0% found this document useful (0 votes)
20 views7 pages

Task 04 OCR Skill Creation

Uploaded by

deepthiaddepalli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views7 pages

Task 04 OCR Skill Creation

Uploaded by

deepthiaddepalli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

ABBYY VANTAGE 2.

VANTAGE FUNDAMENTALS
Task 04 OCR Skill Creation
Task 04 OCR Skill Creation

During this task, we will use the “OCR Skill” from the Skill Catalog and utilize the
Vantage API to facilitate end-to-end document processing. The most common usage
scenario for document processing in Vantage is calling Vantage Skills from other
software systems. Therefore, we will use the Swagger tool in this task to help you
visualize the workflow and the API code that needs to be created.
By completing this task, you will learn how to:
• Create a new OCR Skill
• Process documents using the Swagger tool

Note: Please be aware that screenshot depictions of the GUI in this task may differ from your
Vantage environment, depending on which version you are using.
Be sure to carefully follow the task instructions when creating and naming folders, Skills, and
files.

Please consider the environment before printing this task.


If you really need to print it—you may not need to print the two first pages (title list and table of contents).

Vantage Fundamentals. Task 04 OCR Skill Creation 2


Task 04 OCR Skill Creation

CREATE AN OCR SKILL


In this task, you will learn how to create an OCR Skill and use this Skill with Swagger (an API tool
allowing visualization and interaction with the API's resources without having any of the
implementation logic in place). This OCR Skill performs optical character recognition on
documents, thus recognizing all underlying text (and making the documents searchable). Barcode
values are also recognized, and document image quality is improved.

1. Log in to Vantage using your credentials. You


can find a detailed explanation on how to access
ABBYY Vantage in the Task 01 Document Skill
Creation document.
2. Select Skill Designer>OCR Skill on the left panel.
3. You will see a window with the Skill settings.
Let’s adjust some of them.
4. Name your new Skill
“Fundamentals_Task4_YourLastName”. Adding
a description is optional.
5. On the Languages tab, adjust the settings so that only the English language is selected.
6. Keep the default properties under the Image Enhancements and Barcode tabs.

Note: You can learn more about these properties on the Help page in Vantage.

7. On the Export tab, select JSON>Preserve document structure, PDF, DOCX and TXT formats.
The result should look like this:

8. Click the Create and Publish button.

Vantage Fundamentals. Task 04 OCR Skill Creation 3


OPEN THE SWAGGER TOOL AND CREATE A TRANSACTION
Swagger is one of the most well-known and widely used tools for implementing the OpenAPI
specification. The Swagger toolset can be used at different stages of the API lifecycle. In this task,
we will use it to see the OCR Skill in action.
9. Click Help, then Documentation. You can find the
Help link at the bottom-left corner of your screen.
10. Once you are on the Help page, select the Vantage
API card.
11. You have opened the Swagger tool. First of all, you need to Authorize. Click the Authorize
button to the right.
12. Check the checkboxes at the bottom-left corner to grant Swagger access to all scopes. Then
click Authorize:

13. Close the window.


14. Scroll down to the Skills section.
15. Click on the GET /api/publicapi/v1/skills to find out the Skill ID:

Vantage Fundamentals. Task 04 OCR Skill Creation 4


16. Start by clicking Try it out to the right, then click Execute.
17. In the Response body, find the name of the Skill: “Fundamentals_Task4_YourLastName”. If you
have a long list of Skills, you can enter CTRL+F, then enter your last name to perform a search
within Swagger to find your newly created Skill faster.

18. Copy and paste the Skill id into Notepad, and then
type the string “Skill ID” in the line above the
pasted value. We will need this value later.

19. Scroll down to the Transactions section.


20. Click on the POST line labelled api/publicapi/v1/transactions/launch to initiate file processing
in a new transaction:

21. Start by clicking Try it out to the right.


22. Paste the Skill id from Notepad into the corresponding field labelled skillid.
23. Find and click Add file item, then Choose file, then upload the Utilities_01.jpg file. If you
haven’t downloaded this file yet, here’s how you can find it:
• Open the ABBYY Vantage Fundamentals e-learning course on the ABBYY University.
• Open the Creating Skills From Scratch lesson, then go to the Practice: OCR Skill Creation
chapter.
• There you will find the required file, right under the PDF-file with this task instruction.
24. Click the blue Execute button.
25. Scroll down to the Responses section, then under Code 201, find the transactionID in the
response body:

Vantage Fundamentals. Task 04 OCR Skill Creation 5


26. Copy the transaction id (without the quotation marks).
Paste the transaction id into Notepad and type the
string “Transaction ID” in the line above the newly
pasted value.

27. The next step is to get transaction info. In Swagger, scroll down, then click on the following GET
line: /api/publicapi/v1/transactions/{transactionId}.
28. Click Try it out.
29. Paste your transaction id to the corresponding field, as shown in the screenshot below:

30. Click the blue Execute button.


31. The status under the Server response section will be
changed to Processed. Check that the status is
Processed (please see image below).
32. You need to copy the PDF format fileId to download
the PDF file. Paste this id into Notepad, and place the
label “File ID” above the value.

Note: In this step, you could alternatively select from any of the export formats we previously
selected: PDF, DOCX, JSON, or TXT. To be aligned with later instructions, we recommend you
copy the PDF format File ID.

Vantage Fundamentals. Task 04 OCR Skill Creation 6


33. Now you can download the searchable PDF result file. Scroll down to the GET line:
/api/publicapi/v1/transactions/{transactionId}/files/{fileId}/download, then click on it.

34. Click Try it out, then enter the transactionId, then the fileId (you stored both in Notepad),
then click the blue Execute button.
35. Click Download file (see below).

36. Open the file to check the results. This originally was a JPG file, which is an image format with
no text layer. Now we have a PDF file that looks the same, but there is an underlying text layer.
Click Ctrl+F and search for the word amount. You will find four instances; this is now a fully
searchable document!
Typically, this PDF file, and the other files in formats you initailly selected for export, are used in
various critical downstream business processes.

Great job! You have successfully tested the OCR Skill to do end-to-end document processing using
the Swagger tool!

Vantage Fundamentals. Task 04 OCR Skill Creation 7

You might also like