Associative Memory computing power and its simulation

Ancu, L S; Giannetti, P; Britzger, D; Schmitt, S; Pandini, C; Annovi, A; Luongo, C; Howarth, J W; Volpi, G

ATLAS Slides
Report number	ATL-DAQ-SLIDE-2014-217
Title	Associative Memory computing power and its simulation
Author(s)	Ancu, L S (Section de Physique, Université de Genève, Geneva, Switzerland) ; Annovi, A (INFN Frascati, Frascati, Roma, Italy) ; Britzger, D (Deutsches Elektronen-Synchrotron (DESY), Hamburg and Zeuthen, Germany) ; Giannetti, P (INFN Pisa, Pisa, Italy) ; Howarth, J W (Deutsches Elektronen-Synchrotron (DESY), Hamburg and Zeuthen, Germany) ; Luongo, C (INFN Pisa, Pisa, Italy) ; Pandini, C (Università di Milano, Milano, Italy) ; Schmitt, S (Deutsches Elektronen-Synchrotron (DESY), Hamburg and Zeuthen, Germany) ; Volpi, G (Università di Pisa, Pisa, Italy)
Corporate author(s)	The ATLAS collaboration
Submitted to	19th IEEE-NPSS Real-Time conference 2014, Nara, Japan, 26 - 30 May 2014
Submitted by	[email protected] on 20 May 2014
Subject category	Particle Physics - Experiment
Accelerator/Facility, Experiment	CERN LHC ; ATLAS
Free keywords	Associative Memory ; Simulation ; FTK ; ATLAS ; Trigger ; Road Finding ; Track Fitter
Abstract	The associative memory (AM) system is a computing device made of hundreds of AM ASICs chips designed to perform “pattern matching” at very high speed. Since each AM chip stores a data base of 130000 pre-calculated patterns and large numbers of chips can be easily assembled together, it is possible to produce huge AM banks. Speed and size of the system are crucial for real-time High Energy Physics applications, such as the ATLAS Fast TracKer (FTK) Processor. Using 80 million channels of the ATLAS tracker, FTK finds tracks within 100 micro seconds. The simulation of such a parallelized system is an extremely complex task if executed in commercial computers based on normal CPUs. The algorithm performance is limited, due to the lack of parallelism, and in addition the memory requirement is very large. In fact the AM chip uses a content addressable memory (CAM) architecture. Any data inquiry is broadcast to all memory elements simultaneously, thus data retrieval time is independent of the database size. The great computing power is also supported by a very powerful I/O. Each incoming hit reaches all the patterns in the AM system within the same clock cycle (10 ns). We report on the organization of the simulation into multiple jobs to satisfy the memory constraints and on the optimization performed to reduce the processing time. Finally, we introduce the idea of a new computing unit based on a small number of AM chips that could be plugged inside commercial PCs as coprocessors. This unit would both satisfy the need for very large memory and significantly reduce the simulation time due to the use of the highly parallelized AM chips.

Volver a la búsqueda

Registro creado el 2014-05-20, última modificación el 2016-06-30

Registros similares

Texto completo:
ATL-DAQ-SLIDE-2014-217 -

PDF
RT2014_talk_CarmelaLuongo -

PDF

Enlace externo:

Original Communication (restricted to ATLAS)

Añadir a la cesta personal
Exportar como BibTeX, MARC, MARCXML, DC, EndNote, NLM, RefWorks

CERN Document Server

Access articles, reports and multimedia content in HEP

Main menu

CERN Accelerating science