1.
|
Boosting RDataFrame performance with transparent bulk event processing
/ Guiraud, Enrico (CERN) ; Blomer, Jakob (CERN) ; Canal, Philippe (Fermilab) ; Naumann, Axel (CERN)
RDataFrame is ROOT’s high-level interface for Python and C++ data analysis. Since it first became available, RDataFrame adoption has grown steadily and it is now poised to be a major component of analysis software pipelines for LHC Run 3 and beyond. [...]
FERMILAB-CONF-24-0684-CSAID.-
2024 - 8 p.
- Published in : EPJ Web Conf. 295 (2024) 06006
Fulltext: 3afc9b57f03c0d303680c14cbbcb8c23 - PDF; document - PDF; External link: Fermilab Library Server
In : 26th International Conference on Computing in High Energy & Nuclear Physics, Norfolk, Virginia, Us, 8 - 12 May 2023, pp.06006
|
|
2.
|
First implementation and results of the Analysis Grand Challenge with a fully Pythonic RDataFrame
/ Padulano, Vincenzo Eduardo (CERN) ; Guiraud, Enrico (CERN ; Princeton U.) ; Falko, Andrii (Taras Shevchenko U.) ; Gazzarrini, Elena (CERN) ; Garcia Garcia, Enrique (CERN) ; Gosein, Domenic (CERN ; Mannheim U.)
The growing amount of data generated by the LHC requires a shift in how HEP analysis tasks are approached. Efforts to address this computational challenge have led to the rise of a middle-man software layer, a mixture of simple, effective APIs and fast execution engines underneath. [...]
2024 - 8 p.
- Published in : EPJ Web Conf. 295 (2024) 06011
Fulltext: PDF;
In : 26th International Conference on Computing in High Energy & Nuclear Physics, Norfolk, Virginia, Us, 8 - 12 May 2023, pp.06011
|
|
3.
|
I/O performance studies of analysis workloads on production and dedicated resources at CERN
/ Sciabà, Andrea (CERN) ; Blomer, Jakob (CERN) ; Canal, Philippe (Fermilab) ; Duellmann, Dirk (CERN) ; Guiraud, Enrico (CERN) ; Naumann, Axel (CERN) ; Padulano, Vincenzo Eduardo (CERN) ; Panzer-Steindel, Bernd (CERN) ; Peters, Andreas (CERN) ; Schulz, Markus (CERN) et al.
The recent evolutions of the analysis frameworks and physics data formats of the LHC experiments provide the opportunity of using central analysis facilities with a strong focus on interactivity and short turnaround times, to complement the more common distributed analysis on the Grid. In order to plan for such facilities, it is essential to know in detail the performance of the combination of a given analysis framework, of a specific analysis and of the installed computing and storage resources. [...]
FERMILAB-CONF-24-0689-CSAID.-
2024 - 9 p.
- Published in : EPJ Web Conf. 295 (2024) 07025
Fulltext: 9b80f56360993741399aef5f272f8af7 - PDF; document - PDF; External link: Fermilab Library Server
In : 26th International Conference on Computing in High Energy & Nuclear Physics, Norfolk, Virginia, Us, 8 - 12 May 2023, pp.07025
|
|
4.
|
|
5.
|
|
6.
|
RDataFrame enhancements for HEP analyses
/ Guiraud, E (CERN) ; Blomer, J (CERN) ; Hageboeck, S (CERN) ; Naumann, A (CERN) ; Padulano, V E (CERN) ; Tejedor, E (CERN) ; Wunsch, S (CERN)
In recent years, RDataFrame, ROOT’s high-level interface for data analysis and processing, has seen widespread adoption on the part of HEP physicists. Much of this success is due to RDataFrame’s ergonomic programming model that enables the implementation of common analysis tasks more easily than previous APIs, without compromising on application performance. [...]
2023 - 6 p.
- Published in : J. Phys. : Conf. Ser. 2438 (2023) 012116
Fulltext: PDF;
In : 20th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2021), Daejeon, Korea, 29 Nov - 3 Dec 2021, pp.012116
|
|
7.
|
Leveraging HPC resources with distributed RDataFrame
/ Padulano, V E (CERN ; Valencia, Polytechnic U.) ; Kabadzhov, I D (CERN ; Freiburg U., Inst. Phys. Chem.) ; Saavedra, E T (CERN) ; Guiraud, E (CERN)
The declarative approach to data analysis provides high-level abstractions for users to operate on their datasets in a much more ergonomic fashion compared to imperative interfaces. ROOT offers such a tool with RDataFrame, which has been tested in production environments and used in real-world analyses with optimal results. [...]
2023 - 5 p.
- Published in : J. Phys. : Conf. Ser. 2438 (2023) 012097
Fulltext: PDF;
In : 20th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2021), Daejeon, Korea, 29 Nov - 3 Dec 2021, pp.012097
|
|
8.
|
Prototyping a ROOT-based distributed analysis workflow for HL-LHC: the CMS use case
/ Tedeschi, Tommaso (INFN, Perugia ; Perugia U.) ; Padulano, Vincenzo Eduardo (CERN) ; Spiga, Daniele (INFN, Perugia) ; Ciangottini, Diego (INFN, Perugia) ; Tracolli, Mirco (INFN, Perugia) ; Tejedor Saavedra, Enric (CERN) ; Guiraud, Enrico (CERN ; Princeton U.) ; Biasotto, Massimo (INFN, Legnaro ; Padua U.)
The challenges expected for the next era of the Large Hadron Collider (LHC), both in terms of storage and computing resources, provide LHC experiments with a strong motivation for evaluating ways of rethinking their computing models at many levels. Great efforts have been put into optimizing the computing resource utilization for the data analysis, which leads both to lower hardware requirements and faster turnaround for physics analyses. [...]
arXiv:2307.12579.-
2023-10-16 - 26 p.
- Published in : Comput. Phys. Commun. 295 (2024) 108965
Fulltext: PDF;
|
|
9.
|
Leveraging State-of-the-Art Engines for Large-Scale Data Analysis in High Energy Physics
/ Padulano, Vincenzo Eduardo (CERN ; Valencia, Polytechnic U.) ; Kabadzhov, Ivan Donchev (CERN ; Freiburg U., Inst. Phys. Chem.) ; Tejedor Saavedra, Enric (CERN) ; Guiraud, Enrico (CERN) ; Alonso-Jordá, Pedro (Valencia, Polytechnic U.)
The Large Hadron Collider (LHC) at CERN has generated a vast amount of information from physics events, reaching peaks of TB of data per day which are then sent to large storage facilities. Traditionally, data processing workflows in the High Energy Physics (HEP) field have leveraged grid computing resources. [...]
2023 - 21 p.
- Published in : J. Grid Comput. 21 (2023) 9
Fulltext: PDF;
|
|
10.
|
|