CERN Accelerating science

Published Articles
Title Evolution of the Hadoop platform and ecosystem for high energy physics
Author(s) Baranowski, Zbigniew (CERN) ; Kleszcz, Emil (CERN) ; Kothuri, Prasanth (CERN) ; Canali, Luca (CERN) ; Castellotti, Riccardo (CERN) ; Martin Marquez, Manuel (CERN) ; Matos de Barros, Nuno Guilherme (CERN) ; Motesnitsalis, Evangelos (CERN) ; Mrowczynski, Piotr (CERN) ; Luna Duran, Jose Carlos (CERN)
Publication 2019
Number of pages 10
In: EPJ Web Conf. 214 (2019) 04058
In: 23rd International Conference on Computing in High Energy and Nuclear Physics, CHEP 2018, Sofia, Bulgaria, 9 - 13 Jul 2018, pp.04058
DOI 10.1051/epjconf/201921404058
Subject category Computing and Computers
Abstract The interest in using scalable data processing solutions based on Apache Hadoop ecosystem is constantly growing in the High Energy Physics (HEP) community. This drives the need for increased reliability and availability of the central Hadoop service and underlying infrastructure provided to the community by the CERN IT department. This paper reports on the overall status of the Hadoop platform and related Hadoop and Spark service at CERN, detailing recent enhancements and features introduced in many areas including the service configuration, availability, alerting, monitoring and data protection, in order to meet the new requirements posed by the users’ community.
Copyright/License publication: © 2019-2024 The Authors (License: CC-BY-4.0)

Corresponding record in: Inspire


 Record created 2019-11-12, last modified 2022-08-10


Fulltext from publisher:
Download fulltext
PDF