Author(s)
|
Gorbunov, Sergey (Frankfurt U., FIAS) ; Rohr, David (Kirchhoff Inst. Phys.) ; Aamodt, Kenneth (Oslo U.) ; Alt, Torsten (Frankfurt U., FIAS) ; Appelshauser, Harald (Frankfurt U., Inst. Kernphys.) ; Arend, Andreas (Frankfurt U., Inst. Kernphys.) ; Bach, Matthias (Frankfurt U., FIAS) ; Becker, Bruce (INFN, Cagliari) ; Bottger, Stefan (Kirchhoff Inst. Phys.) ; Breitner, Timo (Kirchhoff Inst. Phys.) ; Busching, Henner (Frankfurt U., Inst. Kernphys.) ; Chattopadhyay, Sukalyan (Saha Inst.) ; Cleymans, Jean (Cape Town U.) ; Cicalo, Corrado (INFN, Cagliari) ; Das, Indranil (Saha Inst.) ; Djuvsland, Oystein (Bergen U.) ; Engel, Heiko (Kirchhoff Inst. Phys.) ; Erdal, Hege Austrheim (Bergen U.) ; Fearick, Roger (Cape Town U.) ; Haaland, Oystein Senneset (Bergen U.) ; Hille, Per Thomas (Yale U.) ; Kalcher, Sebastian (Frankfurt U., FIAS) ; Kanaki, Kalliopi (Bergen U.) ; Kebschull, Udo Wolfgang (Kirchhoff Inst. Phys.) ; Kisel, Ivan (Darmstadt, GSI) ; Kretz, Matthias (Frankfurt U., FIAS) ; Lara, Camillo (Kirchhoff Inst. Phys.) ; Lindal, Sven (Oslo U.) ; Lindenstruth, Volker (Frankfurt U., FIAS) ; Masoodi, Arshad Ahmad (Aligarh Muslim U.) ; Ovrebekk, Gaute (Bergen U.) ; Panse, Ralf (Kirchhoff Inst. Phys.) ; Peschek, Jorg (Kirchhoff Inst. Phys.) ; Ploskon, Mateusz (LBL, Berkeley) ; Pocheptsov, Timur (Oslo U.) ; Ram, Dinesh (Frankfurt U., FIAS) ; Rascanu, Theodor (Frankfurt U., Inst. Kernphys.) ; Richter, Matthias (Oslo U.) ; Rohrich, Dieter (Bergen U.) ; Ronchetti, Federico (Frascati) ; Skaali, Bernhard (Oslo U.) ; Smorholm, Olav (Frankfurt U., FIAS) ; Stokkevag, Camilla (Bergen U.) ; Steinbeck, Timm Morten (Frankfurt U., FIAS) ; Szostak, Artur (Bergen U.) ; Thader, Jochen (Darmstadt, GSI) ; Tveter, Trine (Oslo U.) ; Ullaland, Kjetil (Bergen U.) ; Vilakazi, Zeblon (iThemba LABS) ; Weis, Robert (Kirchhoff Inst. Phys.) ; Yin, Zhong-Bao (Hua-Zhong Normal U.) ; Zelnicek, Pierre (Kirchhoff Inst. Phys.) |
Abstract
| The on-line event reconstruction in ALICE is performed by the High Level Trigger, which should process up to 2000 events per second in proton-proton collisions and up to 300 central events per second in heavy-ion collisions, corresponding to an inp ut data stream of 30 GB/s. In order to fulfill the time requirements, a fast on-line tracker has been developed. The algorithm combines a Cellular Automaton method being used for a fast pattern recognition and the Kalman Filter method for fitting of found trajectories and for the final track selection. The tracker was adapted to run on Graphics Processing Units (GPU) using the NVIDIA Compute Unified Device Architecture (CUDA) framework. The implementation of the algorithm had to be adjusted at many points to allow for an efficient usage of the graphics cards. In particular, achieving a good overall workload for many processor cores, efficient transfer to and from the GPU, as well as optimized utilization of the different memories the GPU offers turned out to be critical. To cope with these problems a dynamic scheduler was introduced, which redistributes the workload among the processor cores. Additionally a pipeline was implemented so that the tracking on the GPU, the initialization and the output process ed by the CPU, as well as the DMA transfer can overlap. The GPU tracking algorithm significantly outperforms the CPU version for large events while it entirely maintains its efficiency. |