
CMS Note
Report number CMS-CR-2012-128
Title High availability through full redundancy of the CMS detector controls system
Author(s) Bauer, Gerry (MIT) ; Behrens, Ulf (DESY) ; Bouffet, Olivier (CERN) ; Bowen, Matthew (CERN) ; Branson, James G (UC, San Diego) ; Bukowiec, Sebastian (CERN) ; Ciganek, Marek (CERN) ; Cittolin, Sergio (UC, San Diego) ; Coarasa, Jose Antonio (CERN) ; Deldicque, Christian (CERN) ; Dobson, Marc (CERN) ; Dupont, Aymeric (CERN) ; Erhan, Samim (UCLA) ; Flossdorf, Alexander (DESY) ; Gigi, Dominique (CERN) ; Glege, Frank (CERN) ; Gomez-Reino, Robert (CERN) ; Hartl, Christian (CERN) ; Hegeman, Jeroen (Princeton U.) ; Holzner, André (UC, San Diego) ; Hwong, Yi Ling (CERN) ; Masetti, Lorenzo (CERN) ; Meijers, Frans (CERN) ; Meschi, Emilio (CERN) ; Mommsen, Remigius K (Fermilab) ; O'Dell, Vivian (Fermilab) ; Orsini, Luciano (CERN) ; Paus, Christoph (MIT) ; Petrucci, Andrea (CERN) ; Pieri, Marco (UC, San Diego) ; Polese, Giovanni (CERN) ; Racz, Attila (CERN) ; Raginel, Olivier (MIT) ; Sakulin, Hannes (CERN) ; Sani, Matteo (UC, San Diego) ; Schwick, Christoph (CERN) ; Shpakov, Dennis (Fermilab) ; Simon, Michal (CERN) ; Spataru, Andrei Cristian (CERN) ; Sumorok, Konstanty (MIT)
Publication 2012
Imprint 06 Jun 2012
Number of pages 9
In: J. Phys.: Conf. Ser. 396 (2012) pp.012041
In: Computing in High Energy and Nuclear Physics 2012, New York, NY, USA, 21 - 25 May 2012, pp.012041
Subject category Detectors and Experimental Techniques
Accelerator/Facility, Experiment CERN LHC ; CMS
Abstract The CMS detector control system (DCS) is responsible for controlling and monitoring the detector status and for the operation of all CMS sub-detectors and infrastructure. This is required to ensure safe and efficient data taking so that high-quality physics data can be recorded. The current system architecture comprises more than 100 servers, which provide the required processing resources. An optimization of the system software and hardware architecture is under development to ensure redundancy of all the controlled sub-systems and to reduce downtime due to hardware or software failures. The new optimized structure is based mainly on powerful and highly reliable blade servers and uses a fully redundant approach to achieve high availability and reliability. The analysis of the requirements, the challenges, the improvements, and the optimized system architecture, together with its specific hardware and software solutions, is presented.
Copyright/License Preprint: (License: CC-BY-4.0)



Record created 2012-06-25, last modified 2018-06-07

