CMS Note
Report number CMS-CR-2023-115
Title Vectorization of CMSSW offline software
Author(s) Gartung, Patrick Elmo (Fermilab)
Publication 2024
Collaboration CMS Collaboration
Imprint 15 Aug 2023
Number of pages 5
Published in: EPJ Web Conf. 295 (2024) 11024
Presented at 26th International Conference on Computing in High Energy & Nuclear Physics, Norfolk, Virginia, US, 8 - 12 May 2023
DOI 10.1051/epjconf/202429511024
Subject category Detectors and Experimental Techniques
Accelerator/Facility, Experiment CERN LHC ; CMS
Keywords General
Abstract Vectorization in CMSSW applications. The CMS experiment has been utilizing vectorization, or SIMD, in parts of its data processing applications for over a decade. On x86 platforms the vectorization level is still SSE3. In the past, attempts to use wider vector instruction sets such as AVX or AVX-512 have, in practice, not improved the overall event processing throughput, because the CPUs scale down their frequency when processing AVX instructions. In addition, a notable part of the global pool of CMS resources has consisted of old systems that either do not support AVX or on which the CPU frequency downscaling impacts all cores of the CPU. CMS has nevertheless continued to vectorize more of its application code. In this work we review profiling methods we have found effective for identifying pieces of code that would benefit from vectorization, and techniques for transforming that code so that the GCC compiler is able to auto-vectorize it. The build system used for CMSSW, Scram, has also been enhanced to build code for multiple CPU microarchitectures, so that the shared libraries of the desired microarchitecture level can be loaded based on the CPU of the system. This multi-microarchitecture setup is invisible to the workflow management system, which makes its deployment straightforward. We describe in detail how this multi-microarchitecture build is set up, and measure the impact of using vector units wider than SSE3 on the event processing throughput of CMS applications such as simulation and reconstruction on recent x86 CPUs.
Other source Inspire

 


 Record created 2023-09-25, last modified 2024-12-02

