Enabling interoperable data and application services in a federated ScienceMesh

In recent years, cloud sync & share storage services, provided by academic and research institutions, have become a daily workplace environment for many local user groups in the High Energy Physics (HEP) community. These, however, are primarily disconnected and deployed in isolation from one ano...

Bibliographic Details
Main Authors: Arora Ishank, Alfageme Sainz Samuel, Ferreira Pedro, Gonzalez Labrador Hugo, Moscicki Jakub
Format: Article
Language: English
Published: EDP Sciences 2021-01-01
Series: EPJ Web of Conferences
Online Access: https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02041.pdf
id doaj-5e1e2a264a28469ebf9bca97dd3adc3a
record_format Article
spelling doaj-5e1e2a264a28469ebf9bca97dd3adc3a
Record updated: 2021-08-26T09:27:25Z
Language code: eng
Publisher: EDP Sciences
Journal: EPJ Web of Conferences
ISSN: 2100-014X
Publication date: 2021-01-01
Volume: 251
Article number: 02041
DOI: 10.1051/epjconf/202125102041
Article id: epjconf_chep2021_02041
Title: Enabling interoperable data and application services in a federated ScienceMesh
Authors: Arora Ishank (CERN), Alfageme Sainz Samuel (CERN), Ferreira Pedro (CERN), Gonzalez Labrador Hugo (CERN), Moscicki Jakub (CERN)
URL: https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02041.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Arora Ishank
Alfageme Sainz Samuel
Ferreira Pedro
Gonzalez Labrador Hugo
Moscicki Jakub
title Enabling interoperable data and application services in a federated ScienceMesh
publisher EDP Sciences
series EPJ Web of Conferences
issn 2100-014X
publishDate 2021-01-01
description In recent years, cloud sync & share storage services, provided by academic and research institutions, have become a daily workplace environment for many local user groups in the High Energy Physics (HEP) community. These, however, are primarily disconnected and deployed in isolation from one another, even though new technologies have been developed and integrated to further increase the value of data. The EU-funded CS3MESH4EOSC project is connecting locally and individually provided sync and share services, and scaling them up to the European level and beyond. It aims to deliver the ScienceMesh service, an interoperable platform to easily sync and share data across institutions and extend functionalities by connecting to other research services using streamlined sets of interoperable protocols, APIs and deployment methodologies. This supports multiple distributed application workflows: data science environments, collaborative editing and data transfer services. In this paper, we present the architecture of ScienceMesh and the technical design of its reference implementation, a platform that allows organizations to join the federated service infrastructure easily and to access application services out-of-the-box. We discuss the challenges faced during the process, which include diversity of sync & share platforms (Nextcloud, Owncloud, Seafile and others), absence of global user identities and user discovery, lack of interoperable protocols and APIs, and access control and protection of data endpoints. We present the rationale for the design decisions adopted to tackle these challenges and describe our deployment architecture based on Kubernetes, which enabled us to utilize monitoring and tracing functionalities. We conclude by reporting on the early user experience with ScienceMesh.
url https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02041.pdf