Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription

Borkar, Jaydeep; Smith, David A.

arXiv:2407.00250 (cs)

[Submitted on 28 Jun 2024]

Abstract: Historical documents frequently suffer from damage and inconsistencies, including missing or illegible text caused by holes, ink problems, and storage damage. These missing portions, or gaps, are referred to as lacunae. In this study, we employ transformer-based optical character recognition (OCR) models trained in a supervised manner on synthetic data containing lacunae. We demonstrate their effectiveness in detecting and restoring lacunae, achieving a success rate of 65%, compared to a base model lacking knowledge of lacunae, which achieves only 5% restoration. Additionally, we investigate the mechanistic properties of the model, such as the log probability of transcription, which can identify lacunae and other errors (e.g., mistranscriptions due to complex writing or ink issues) in line images without directly inspecting the image. This capability could be valuable for scholars seeking to distinguish images containing lacunae or errors from clean ones. Although we explore the potential of attention mechanisms for flagging lacunae and transcription errors, our findings suggest that attention is not a significant factor. Our work highlights a promising direction in utilizing transformer-based OCR models for restoring or analyzing damaged historical documents.
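
The log-probability signal mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that an OCR model has already produced a per-token probability for each transcribed line, and it uses a length-normalized log probability with an arbitrary cutoff. The names `mean_log_prob` and `flag_suspect_lines`, the threshold of -0.5, and the example probabilities are all hypothetical and chosen only for illustration.

```python
import math
from typing import Dict, List


def mean_log_prob(token_probs: List[float]) -> float:
    """Length-normalized log probability of one transcribed line.

    Each probability must be in (0, 1]; math.log raises ValueError for 0.
    """
    if not token_probs:
        raise ValueError("token_probs must not be empty")
    return sum(math.log(p) for p in token_probs) / len(token_probs)


def flag_suspect_lines(
    lines: Dict[str, List[float]], threshold: float = -0.5
) -> List[str]:
    """Return IDs of lines whose mean log probability falls below threshold.

    The threshold is a hypothetical value; in practice it would need to be
    tuned on held-out data.
    """
    return [
        line_id
        for line_id, probs in lines.items()
        if mean_log_prob(probs) < threshold
    ]


# Made-up token probabilities for two line images (illustration only).
example = {
    "line_clean": [0.99, 0.98, 0.97, 0.99],
    "line_damaged": [0.95, 0.30, 0.20, 0.90],
}

print(flag_suspect_lines(example))  # ['line_damaged']
```

Normalizing by the number of tokens keeps long lines from being flagged merely because they contain more characters. A flagged line may contain a lacuna or an ordinary mistranscription, so the score only selects candidates for closer inspection.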

Comments:	Accepted to ICDAR 2024 Workshop on Computational Paleography
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2407.00250 [cs.CV]
	(or arXiv:2407.00250v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.00250

Submission history

From: Jaydeep Borkar
[v1] Fri, 28 Jun 2024 22:52:39 UTC (543 KB)
