https://doi.org/10.1140/epjp/i2011-11009-9
Regular Article
MonALISA-based Grid monitoring and control
1
CERN, European Organization for Nuclear Research, 23, Geneve, Switzerland
2
California Institute of Technology, 91125, Pasadena, CA, USA
3
Politehnica University of Bucharest, Splaiul Independentei nr. 313, 6, Bucuresti, Romania
* e-mail: costin.grigoras@cern.ch
Received:
30
October
2010
Accepted:
4
January
2011
Published online:
21
January
2011
High-Energy Physics experiments like ALICE at LHC require petabytes of storage and thousand of CPU working in parallel to store, reconstruct and analyze the collected data. This computing power is provided by aggregating the resources of hundreds of institutes and research centers and in addition several purpose-built large computing centers. All these resources are transparently available to the users under the umbrella of ALICE Grid. To ensure smooth operation of this complex distributed machinery we have developed a set of tools to monitor and control the various services, based on the MonALISA monitoring framework. By integrating monitoring information in the system we have achieved a high degree of automation and have significantly reduced the burden on the Grid managers. In this article we present how we collect the monitoring information and a few of the tools that make use of it.
© Società Italiana di Fisica and Springer, 2011