CERN Accelerating science

If you experience any problem watching the video, click the download button below
Download Embed
CMS Note
Report number CMS-CR-2013-369
Title CMS experience of running glideinWMS in High Availability mode
Author(s) Sfiligoi, Igor (UC, San Diego) ; Letts, James (UC, San Diego) ; Belforte, Stefano (INFN, Trieste) ; Mc Crea, Alison Jean (UC, San Diego) ; Larson, Krista Elaine (Fermilab) ; Zvada, Marian (Karlsruhe U., EKP) ; Holzman, Burt (Fermilab) ; P Mhashilkar ; Bradley, Daniel Charles (Wisconsin U., Madison) ; Saiz Santos, Maria Dolores (UC, San Diego) ; Fanzago, Federica (INFN, Padua) ; Gutsche, Oliver (Fermilab) ; Martin, Terrence (UC, San Diego) ; Wuerthwein, Frank Karl (UC, San Diego)
Publication 2013
Imprint 29 Oct 2013
Number of pages 6
Presented at 20th International Conference on Computing in High Energy and Nuclear Physics 2013, Amsterdam, Netherlands, 14 - 18 Oct 2013
Subject category Detectors and Experimental Techniques
Accelerator/Facility, Experiment CERN LHC ; CMS
Keywords General
Abstract The CMS experiment at the Large Hadron Collider is relying on the HTCondor-based glideinWMS batch system to handle most of its distributed computing needs. In order to minimize the risk of disruptions due to software and hardware problems, and also to simplify the maintenance procedures, CMS has set up its glideinWMS instance to use most of the attainable High Availability (HA) features. The setup involves running services distributed over multiple nodes, which in turn are located in several physical locations, including Geneva, Switzerland, Chicago, Illinois and San Diego, California. This paper describes the setup used by CMS, the HA limits of this setup, as well as a description of the actual operational experience spanning many months.
Copyright/License Preprint: (License: CC-BY-4.0)

 


 Record created 2013-10-30, last modified 2018-06-07


Fulltext:
Download fulltext
PDF