
Authors: Lorella Viola 1 and Jaap Verheul 2

Affiliations: 1 Luxembourg Centre for Contemporary and Digital History (C2DH), University of Luxembourg, Belval Campus, Maison des Sciences Humaines, 11, Porte des Sciences, L-4366 Esch-sur-Alzette, Luxembourg ; 2 Department of History and Art History, Utrecht University, Drift 6, 3512 BS, Utrecht, The Netherlands

Keyword(s): Machine Learning, Sequence Tagging, Spatial Humanities, Geographical Enrichment, Immigrant Newspapers.

Abstract: This paper discusses the added value of applying machine learning (ML) to contextually enrich digital collections. In this study, we employed ML as a method to geographically enrich historical datasets. Specifically, we used a sequence tagging tool (Riedl and Padó 2018) which implements TensorFlow to perform NER on a corpus of historical immigrant newspapers. Afterwards, the entities were extracted and geocoded. The aim was to prepare large quantities of unstructured data for a conceptual historical analysis of geographical references. The intention was to develop a method that would assist researchers working in spatial humanities, a recently emerged interdisciplinary field focused on geographic and conceptual space. Here we describe the ML methodology and the geocoding phase of the project, focussing on the advantages and challenges of this approach, particularly for humanities scholars. We also argue that, by choosing to use largely neglected sources such as immigrant newspapers (also known as ethnic newspapers), this study contributes to the debate about diversity representation and archival biases in digital practices.
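
The two steps named in the abstract, extracting location entities from sequence-tagger output and then geocoding them, can be illustrated with a minimal sketch. It is not the authors' pipeline: the BIO tag names, the sample sentence, the function names `extract_locations` and `geocode`, and the tiny in-memory `GAZETTEER` with approximate coordinates are all assumptions made for illustration. A real project would read the tagger's actual output and query a geocoding service.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy gazetteer with approximate coordinates (lat, lon).
# A real workflow would query an external geocoding service instead.
GAZETTEER: Dict[str, Tuple[float, float]] = {
    "New York": (40.7128, -74.0060),
    "Naples": (40.8518, 14.2681),
}


def extract_locations(tagged: List[Tuple[str, str]]) -> List[str]:
    """Join consecutive B-LOC/I-LOC tokens into location names."""
    spans: List[str] = []
    current: List[str] = []
    for token, tag in tagged:
        if tag == "B-LOC":
            # A new entity starts; close any entity still open.
            if current:
                spans.append(" ".join(current))
            current = [token]
        elif tag == "I-LOC" and current:
            # Continuation of the entity that is currently open.
            current.append(token)
        else:
            # Any other tag ends the open entity, if there is one.
            if current:
                spans.append(" ".join(current))
            current = []
    if current:
        spans.append(" ".join(current))
    return spans


def geocode(name: str) -> Optional[Tuple[float, float]]:
    """Look a place name up in the toy gazetteer; None if unknown."""
    return GAZETTEER.get(name)


# Made-up tagger output as (token, BIO tag) pairs.
tagged_sentence = [
    ("The", "O"), ("ship", "O"), ("sailed", "O"), ("from", "O"),
    ("Naples", "B-LOC"), ("to", "O"),
    ("New", "B-LOC"), ("York", "I-LOC"), (".", "O"),
]

locations = extract_locations(tagged_sentence)
geocoded = {name: geocode(name) for name in locations}
```

Place names missing from the gazetteer map to `None`, so unresolved references stay visible for manual review rather than being silently dropped.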

CC BY-NC-ND 4.0

Paper citation in several formats:
Viola, L. and Verheul, J. (2020). Machine Learning to Geographically Enrich Understudied Sources: A Conceptual Approach. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ARTIDIGH; ISBN 978-989-758-395-7; ISSN 2184-433X, SciTePress, pages 469-475. DOI: 10.5220/0009094204690475

@conference{artidigh20,
author={Lorella Viola and Jaap Verheul},
title={Machine Learning to Geographically Enrich Understudied Sources: A Conceptual Approach},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ARTIDIGH},
year={2020},
pages={469-475},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009094204690475},
isbn={978-989-758-395-7},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 1: ARTIDIGH
TI - Machine Learning to Geographically Enrich Understudied Sources: A Conceptual Approach
SN - 978-989-758-395-7
IS - 2184-433X
AU - Viola, L.
AU - Verheul, J.
PY - 2020
SP - 469
EP - 475
DO - 10.5220/0009094204690475
PB - SciTePress