Pàgina inicial > CMS Collection > CMS Preprints > Exploitation of network-segregated CPU resources in CMS |
CMS Note | |
Report number | CMS-CR-2021-017 |
Title | Exploitation of network-segregated CPU resources in CMS |
Author(s) | Acosta-Silva, C (BIST, Barcelona ; PIC, Bellaterra) ; Peris, A Delgado (Madrid, CIEMAT) ; Flix, J (PIC, Bellaterra ; Madrid, CIEMAT) ; Frey, J (Wisconsin U., Madison) ; Hernández, J M (Madrid, CIEMAT) ; Pérez-Calero Yzquierdo, A (PIC, Bellaterra ; Madrid, CIEMAT) ; Tannenbaum, T (Wisconsin U., Madison) |
Collaboration | CMS Collaboration |
Publication | 2021 |
Imprint | 19 Feb 2021 |
Number of pages | 10 |
In: | EPJ Web Conf. 251 (2021) 02020 |
In: | 25th International Conference on Computing in High-Energy and Nuclear Physics (CHEP), Online, Online, 17 - 21 May 2021, pp.02020 |
DOI | 10.1051/epjconf/202125102020 |
Subject category | Detectors and Experimental Techniques ; Computing and Computers |
Accelerator/Facility, Experiment | CERN LHC ; CMS |
Abstract | CMS is tackling the exploitation of CPU resources at HPC centers where compute nodes do not have network connectivity to the Internet. Pilot agents and payload jobs need to interact with external services from the compute nodes access to the application software (cmvfs) and conditions data (Frontier), management of input and output data files (data management services), and job management (HTCondor). Finding an alternative route to these services is challenging. Seamless integration in the CMS production system without causing any operational overhead is a key goal.We describe in this paper the solutions developed within CMS to overcome the restrictions imposed by network-segregated compute nodes. The Barcelona Supercomputing Center (BSC) in Spain has been used as a testbed for the integration in production of this kind of resource. Singularity containers with application software releases are built and pre-placed in the HPC shared file system together with conditions data files. HTCondor has been extended to relay communications between running pilot jobs and HTCondor daemons through the HPC shared file system. This operation mode also allows piping input and output data files through the HPC file system. Results, issues encountered during the integration process, and remaining concerns are discussed in this report. |
Copyright/License | publication: © 2021-2025 The Authors (License: CC-BY-4.0) |