The Belle II Raw Data Management System

The Belle II experiment, a major upgrade of the previous e+e− asymmetric collider experiment Belle, is expected to produce tens of petabytes of data per year due to the luminosity increase from the upgraded SuperKEKB accelerator. The distributed computing system of the Belle II experiment plays a ke...

Full description

Bibliographic Details
Main Authors: Hernández Villanueva Michel, Ueda Ikuo
Format: Article
Language:English
Published: EDP Sciences 2020-01-01
Series:EPJ Web of Conferences
Online Access:https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_04005.pdf
id doaj-753530deb2a74858aabe364891a2c756
record_format Article
spelling doaj-753530deb2a74858aabe364891a2c7562021-08-02T17:58:17ZengEDP SciencesEPJ Web of Conferences2100-014X2020-01-012450400510.1051/epjconf/202024504005epjconf_chep2020_04005The Belle II Raw Data Management SystemHernández Villanueva Michel0Ueda Ikuo1University of MississippiKEK IPNSThe Belle II experiment, a major upgrade of the previous e+e− asymmetric collider experiment Belle, is expected to produce tens of petabytes of data per year due to the luminosity increase from the upgraded SuperKEKB accelerator. The distributed computing system of the Belle II experiment plays a key role, storing and distributing data in a reliable way to be easily accessed and analyzed by more than 1000 collaborators. In particular, the Belle II Raw Data Management system has been developed with an aim to upload output files onto grid storage, register them into the file and metadata catalogs, and make two replicas of the full raw data set using the Belle II Distributed Data Management system. It has been implemented as an extension of DIRAC (Distributed Infrastructure with Remote Agent Control) and consists of a database, services, client and monitoring tools, and several agents that treat the data automatically. The first year of data taken with the Belle II full detector has been managed by the Belle II Raw Data Management system successfully. The design, current status, and performance are presented. Prospects for improvements towards the full luminosity data taking are also reviewed.https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_04005.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Hernández Villanueva Michel
Ueda Ikuo
spellingShingle Hernández Villanueva Michel
Ueda Ikuo
The Belle II Raw Data Management System
EPJ Web of Conferences
author_facet Hernández Villanueva Michel
Ueda Ikuo
author_sort Hernández Villanueva Michel
title The Belle II Raw Data Management System
title_short The Belle II Raw Data Management System
title_full The Belle II Raw Data Management System
title_fullStr The Belle II Raw Data Management System
title_full_unstemmed The Belle II Raw Data Management System
title_sort belle ii raw data management system
publisher EDP Sciences
series EPJ Web of Conferences
issn 2100-014X
publishDate 2020-01-01
description The Belle II experiment, a major upgrade of the previous e+e− asymmetric collider experiment Belle, is expected to produce tens of petabytes of data per year due to the luminosity increase from the upgraded SuperKEKB accelerator. The distributed computing system of the Belle II experiment plays a key role, storing and distributing data in a reliable way to be easily accessed and analyzed by more than 1000 collaborators. In particular, the Belle II Raw Data Management system has been developed with an aim to upload output files onto grid storage, register them into the file and metadata catalogs, and make two replicas of the full raw data set using the Belle II Distributed Data Management system. It has been implemented as an extension of DIRAC (Distributed Infrastructure with Remote Agent Control) and consists of a database, services, client and monitoring tools, and several agents that treat the data automatically. The first year of data taken with the Belle II full detector has been managed by the Belle II Raw Data Management system successfully. The design, current status, and performance are presented. Prospects for improvements towards the full luminosity data taking are also reviewed.
url https://www.epj-conferences.org/articles/epjconf/pdf/2020/21/epjconf_chep2020_04005.pdf
work_keys_str_mv AT hernandezvillanuevamichel thebelleiirawdatamanagementsystem
AT uedaikuo thebelleiirawdatamanagementsystem
AT hernandezvillanuevamichel belleiirawdatamanagementsystem
AT uedaikuo belleiirawdatamanagementsystem
_version_ 1721228661325037568