CERN Accelerating science

Article
Title Enabling interoperable data and application services in a federated ScienceMesh
Author(s) Arora, Ishank (CERN) ; Sainz, Samuel Alfageme (CERN) ; Ferreira, Pedro (CERN) ; Labrador, Hugo Gonzalez (CERN) ; Moscicki, Jakub (CERN)
Publication 2021
Number of pages 10
In: EPJ Web Conf. 251 (2021) 02041
In: 25th International Conference on Computing in High-Energy and Nuclear Physics (CHEP), Online, Online, 17 - 21 May 2021, pp.02041
DOI 10.1051/epjconf/202125102041
Subject category Computing and Computers
Abstract In recent years, cloud sync & share storage services, provided by academic and research institutions, have become a daily workplace environment for many local user groups in the High Energy Physics (HEP) community. These, however, are primarily disconnected and deployed in isolation from one another, even though new technologies have been developed and integrated to further increase the value of data. The EU-funded CS3MESH4EOSC project is connecting locally and individually provided sync and share services, and scaling them up to the European level and beyond. It aims to deliver the ScienceMesh service, an interoperable platform to easily sync and share data across institutions and extend functionalities by connecting to other research services using streamlined sets of interoperable protocols, APIs and deployment methodologies. This supports multiple distributed application workflows: data science environments, collaborative editing and data transfer services.In this paper, we present the architecture of ScienceMesh and the technical design of its reference implementation, a platform that allows organizations to join the federated service infrastructure easily and to access application services outof-the-box. We discuss the challenges faced during the process, which include diversity of sync & share platforms (Nextcloud, Owncloud, Seafile and others), absence of global user identities and user discovery, lack of interoperable protocols and APIs, and access control and protection of data endpoints. We present the rationale for the design decisions adopted to tackle these challenges and describe our deployment architecture based on Kubernetes, which enabled us to utilize monitoring and tracing functionalities. We conclude by reporting on the early user experience with ScienceMesh.
Copyright/License publication: © The Authors (License: CC-BY-4.0)

Corresponding record in: Inspire
 Journalen skapades 2021-09-07, och modifierades senast 2021-09-07


Fulltext:
Download fulltext
PDF