Implementation of ATLAS Distributed Computing monitoring dashboards using InfluxDB and Grafana

For the last 10 years, the ATLAS Distributed Computing project has based its monitoring infrastructure on a set of custom designed dashboards provided by CERN. This system functioned very well for LHC Runs 1 and 2, but its maintenance has progressively become more difficult and the conditions for Ru...

Bibliographic Details
Main Authors: Beermann Thomas, Alekseev Aleksandr, Baberis Dario, Crépé-Renaudin Sabine, Elmsheuser Johannes, Glushkov Ivan, Svatos Michal, Vartapetian Armen, Vokac Petr, Wolters Helmut
Format: Article
Language: English
Published: EDP Sciences 2020-01-01
Series: EPJ Web of Conferences
Online Access: https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_03031.pdf
id doaj-98d2cd266ea44d0aaa8c76ac8e95cc5c
record_format Article
spelling doaj-98d2cd266ea44d0aaa8c76ac8e95cc5c; 2021-08-02T22:58:33Z; eng; EDP Sciences; EPJ Web of Conferences; ISSN 2100-014X; 2020-01-01; volume 245; article 03031; DOI 10.1051/epjconf/202024503031; epjconf_chep2020_03031
Beermann Thomas [0]: Bergische Universität Wuppertal
Alekseev Aleksandr [1]: Universidad Andrés Bello
Baberis Dario [2]: Università e INFN Genova
Crépé-Renaudin Sabine [3]: LPSC-Grenoble, CNRS/UGA
Elmsheuser Johannes [4]: Brookhaven National Laboratory
Glushkov Ivan [5]: University of Texas at Arlington
Svatos Michal [6]: Acad. of Sciences of the Czech Rep.
Vartapetian Armen [7]: University of Texas at Arlington
Vokac Petr [8]: Czech Technical University
Wolters Helmut [9]: LIP Coimbra
collection DOAJ
language English
format Article
sources DOAJ
author Beermann Thomas
Alekseev Aleksandr
Baberis Dario
Crépé-Renaudin Sabine
Elmsheuser Johannes
Glushkov Ivan
Svatos Michal
Vartapetian Armen
Vokac Petr
Wolters Helmut
title Implementation of ATLAS Distributed Computing monitoring dashboards using InfluxDB and Grafana
publisher EDP Sciences
series EPJ Web of Conferences
issn 2100-014X
publishDate 2020-01-01
description For the last 10 years, the ATLAS Distributed Computing project has based its monitoring infrastructure on a set of custom designed dashboards provided by CERN. This system functioned very well for LHC Runs 1 and 2, but its maintenance has progressively become more difficult and the conditions for Run 3, starting in 2021, will be even more demanding; hence a more standard code base and more automatic operations are needed. A new infrastructure has been provided by CERN, based on InfluxDB as the data store and Grafana as the display environment. ATLAS has adapted and further developed its monitoring tools to use this infrastructure for data and workflow management monitoring and accounting dashboards, expanding the range of previous possibilities with the aim to achieve a single, simpler, environment for all monitoring applications. This document describes these tools and the data flows for monitoring and accounting.
url https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_03031.pdf