CERN Accélérateur de science

ATLAS Note
Report number ATL-SOFT-PROC-2020-007
Title Using Kubernetes as an ATLAS computing site
Author(s)

Barreiro Megino, Fernando Harald (The University of Texas at Arlington) ; Albert, Jeffrey Ryan (University of Victoria) ; Berghaus, Frank (University of Victoria) ; De, Kaushik (The University of Texas at Arlington) ; Lin, Fahui (Academia Sinica, Taipei) ; Mac Donell, Danika Marina (University of Victoria) ; Maeno, Tadashi (Brookhaven National Laboratory (BNL)) ; Brito Da Rocha, Ricardo (CERN) ; Seuster, Rolf (University of Victoria) ; Taylor, Ryan P. (University of Victoria) ; Yang, Ming-jyuan (Academia Sinica, Taipei)

Corporate Author(s) The ATLAS collaboration
Publication 2020
Imprint 11 Feb 2020
Number of pages 7
In: EPJ Web Conf. 245 (2020) 07025
In: 24th International Conference on Computing in High Energy and Nuclear Physics, Adelaide, Australia, 4 - 8 Nov 2019, pp.07025
DOI 10.1051/epjconf/202024507025
Subject category Particle Physics - Experiment
Accelerator/Facility, Experiment CERN LHC ; ATLAS
Free keywords PanDA ; Harvester ; Kubernetes ; Batch ; Workload Management ; Grid ; Distributed Computing
Abstract In recent years containerization has revolutionized cloud environments, providing a secure, lightweight, standardized way to package and execute software. Solutions such as Kubernetes enable orchestration of containers in a cluster, including for the purpose of job scheduling. Kubernetes is becoming a de facto standard, available at all major cloud computing providers, and is gaining increased attention from some WLCG sites. In particular, CERN IT has integrated Kubernetes into their cloud infrastructure by providing an interface to instantly create Kubernetes clusters. Also, the University of Victoria is pursuing an infrastructure-as-code approach to deploying Kubernetes as a flexible and resilient platform for running services and delivering resources. ATLAS has partnered with CERN IT and the University of Victoria to explore and demonstrate the feasibility of running an ATLAS computing site directly on Kubernetes, replacing all grid computing services. We have interfaced ATLAS’ workload submission engine PanDA with Kubernetes, to directly submit and monitor the status of containerized jobs. This paper will describe the integration and deployment details, and focus on the lessons learned from running a wide variety of ATLAS production payloads on Kubernetes using clusters of several thousand cores at CERN and the Tier 2 computing site in Victoria.
Copyright/License © 2020-2024 The Authors (License: CC-BY-4.0)

Corresponding record in: Inspire


 Notice créée le 2020-02-11, modifiée le 2021-02-16