Use case: Romanian language resources in the LOD paradigm

VB Mititelu, E Irimia, V Păiș, AM Avram… - Proceedings of the 8th …, 2022 - aclanthology.org
Proceedings of the 8th Workshop on Linked Data in Linguistics within …, 2022aclanthology.org
In this paper, we report on (i) the conversion of Romanian language resources to the Linked
Open Data specifications and requirements, on (ii) their publication and (iii) interlinking with
other language resources (for Romanian or for other languages). The pool of converted
resources is made up of the Romanian Wordnet, the morphosyntactic and phonemic lexicon
RoLEX, four treebanks, one for the general language (the Romanian Reference Treebank)
and others for specialised domains (SiMoNERo for medicine, LegalNERo for the legal …
Abstract
In this paper, we report on (i) the conversion of Romanian language resources to the Linked Open Data specifications and requirements, on (ii) their publication and (iii) interlinking with other language resources (for Romanian or for other languages). The pool of converted resources is made up of the Romanian Wordnet, the morphosyntactic and phonemic lexicon RoLEX, four treebanks, one for the general language (the Romanian Reference Treebank) and others for specialised domains (SiMoNERo for medicine, LegalNERo for the legal domain, PARSEME-Ro for verbal multiword expressions), frequency information on lemmas and tokens and word embeddings as extracted from the reference corpus for contemporary Romanian (CoRoLa) and a bi-modal (text and speech) corpus. We also present the limitations coming from the representation of the resources in Linked Data format. The metadata of LOD resources have been published in the LOD Cloud. The resources are available for download on our website and a SPARQL endpoint is also available for querying them.
aclanthology.org
Showing the best result for this search. See all results