The PBase Scientific Workflow Provenance Repository

<!-- p.abstract-western { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "Times New Roman",serif; font-size: 10pt; }p.abstract-cjk { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "DejaVu Sans","Arial",s...

Full description

Bibliographic Details
Main Authors: Víctor Cuevas-Vicenttín, Parisa Kianmajd, Bertram Ludäscher, Paolo Missier, Fernando Chirigati, Yaxing Wei, David Koop, Saumen Dey
Format: Article
Language:English
Published: University of Edinburgh 2014-10-01
Series:International Journal of Digital Curation
Online Access:http://www.ijdc.net/index.php/ijdc/article/view/332
id doaj-4f4b07242af646e2847525681aa44c43
record_format Article
spelling doaj-4f4b07242af646e2847525681aa44c432020-11-25T01:48:37ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562014-10-0192283810.2218/ijdc.v9i2.332291The PBase Scientific Workflow Provenance RepositoryVíctor Cuevas-VicenttínParisa KianmajdBertram LudäscherPaolo MissierFernando ChirigatiYaxing WeiDavid KoopSaumen Dey<!-- p.abstract-western { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "Times New Roman",serif; font-size: 10pt; }p.abstract-cjk { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "DejaVu Sans","Arial",sans-serif; font-size: 10pt; }p.abstract-ctl { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "Lohit Hindi","Times New Roman"; font-size: 12pt; }p { text-indent: 0.64cm; margin-bottom: 0cm; direction: ltr; color: rgb(0, 0, 0); widows: 2; orphans: 2; }p.western { font-family: "Times New Roman",serif; font-size: 12pt; }p.cjk { font-family: "DejaVu Sans","Arial",sans-serif; font-size: 12pt; }p.ctl { font-family: "Lohit Hindi","Times New Roman"; font-size: 12pt; }a.cjk:visited { }a:link { color: rgb(0, 107, 107); text-decoration: none; }a.western:link { }a.ctl:link { }a.sdfootnotesym-western { font-size: 7pt; }a.sdfootnotesym-cjk { font-size: 7pt; } --> <p class="abstract-western" lang="en-US">Scientific workflows and their supporting systems are becoming increasingly popular for compute-intensive and data-intensive scientific experiments. The advantages scientific workflows offer include rapid and easy workflow design, software and data reuse, scalable execution, sharing and collaboration, and other advantages that altogether facilitate “reproducible science”. In this context, provenance – information about the origin, context, derivation, ownership, or history of some artifact – plays a key role, since scientists are interested in examining and auditing the results of scientific experiments.</p> <p class="abstract-western">However, in order to perform such analyses on scientific results as part of extended research collaborations, an adequate environment and tools are required. Concretely, the need arises for a repository that will facilitate the sharing of scientific workflows and their associated execution traces in an interoperable manner, also enabling querying and visualization. Furthermore, such functionality should be supported while taking performance and scalability into account.</p> <p class="abstract-western" lang="en-US">With this purpose in mind, we introduce PBase: a scientific workflow provenance repository implementing the ProvONE proposed standard, which extends the emerging W3C PROV standard for provenance data with workflow specific concepts. PBase is built on the Neo4j graph database, thus offering capabilities such as declarative and efficient querying. Our experiences demonstrate the power gained by supporting various types of queries for provenance data. In addition, PBase is equipped with a user friendly interface tailored for the visualization of scientific workflow provenance data, making the specification of queries and the interpretation of their results easier and more effective.</p>http://www.ijdc.net/index.php/ijdc/article/view/332
collection DOAJ
language English
format Article
sources DOAJ
author Víctor Cuevas-Vicenttín
Parisa Kianmajd
Bertram Ludäscher
Paolo Missier
Fernando Chirigati
Yaxing Wei
David Koop
Saumen Dey
spellingShingle Víctor Cuevas-Vicenttín
Parisa Kianmajd
Bertram Ludäscher
Paolo Missier
Fernando Chirigati
Yaxing Wei
David Koop
Saumen Dey
The PBase Scientific Workflow Provenance Repository
International Journal of Digital Curation
author_facet Víctor Cuevas-Vicenttín
Parisa Kianmajd
Bertram Ludäscher
Paolo Missier
Fernando Chirigati
Yaxing Wei
David Koop
Saumen Dey
author_sort Víctor Cuevas-Vicenttín
title The PBase Scientific Workflow Provenance Repository
title_short The PBase Scientific Workflow Provenance Repository
title_full The PBase Scientific Workflow Provenance Repository
title_fullStr The PBase Scientific Workflow Provenance Repository
title_full_unstemmed The PBase Scientific Workflow Provenance Repository
title_sort pbase scientific workflow provenance repository
publisher University of Edinburgh
series International Journal of Digital Curation
issn 1746-8256
publishDate 2014-10-01
description <!-- p.abstract-western { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "Times New Roman",serif; font-size: 10pt; }p.abstract-cjk { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "DejaVu Sans","Arial",sans-serif; font-size: 10pt; }p.abstract-ctl { margin-left: 1.27cm; margin-right: 1.27cm; margin-bottom: 0.18cm; font-family: "Lohit Hindi","Times New Roman"; font-size: 12pt; }p { text-indent: 0.64cm; margin-bottom: 0cm; direction: ltr; color: rgb(0, 0, 0); widows: 2; orphans: 2; }p.western { font-family: "Times New Roman",serif; font-size: 12pt; }p.cjk { font-family: "DejaVu Sans","Arial",sans-serif; font-size: 12pt; }p.ctl { font-family: "Lohit Hindi","Times New Roman"; font-size: 12pt; }a.cjk:visited { }a:link { color: rgb(0, 107, 107); text-decoration: none; }a.western:link { }a.ctl:link { }a.sdfootnotesym-western { font-size: 7pt; }a.sdfootnotesym-cjk { font-size: 7pt; } --> <p class="abstract-western" lang="en-US">Scientific workflows and their supporting systems are becoming increasingly popular for compute-intensive and data-intensive scientific experiments. The advantages scientific workflows offer include rapid and easy workflow design, software and data reuse, scalable execution, sharing and collaboration, and other advantages that altogether facilitate “reproducible science”. In this context, provenance – information about the origin, context, derivation, ownership, or history of some artifact – plays a key role, since scientists are interested in examining and auditing the results of scientific experiments.</p> <p class="abstract-western">However, in order to perform such analyses on scientific results as part of extended research collaborations, an adequate environment and tools are required. Concretely, the need arises for a repository that will facilitate the sharing of scientific workflows and their associated execution traces in an interoperable manner, also enabling querying and visualization. Furthermore, such functionality should be supported while taking performance and scalability into account.</p> <p class="abstract-western" lang="en-US">With this purpose in mind, we introduce PBase: a scientific workflow provenance repository implementing the ProvONE proposed standard, which extends the emerging W3C PROV standard for provenance data with workflow specific concepts. PBase is built on the Neo4j graph database, thus offering capabilities such as declarative and efficient querying. Our experiences demonstrate the power gained by supporting various types of queries for provenance data. In addition, PBase is equipped with a user friendly interface tailored for the visualization of scientific workflow provenance data, making the specification of queries and the interpretation of their results easier and more effective.</p>
url http://www.ijdc.net/index.php/ijdc/article/view/332
work_keys_str_mv AT victorcuevasvicenttin thepbasescientificworkflowprovenancerepository
AT parisakianmajd thepbasescientificworkflowprovenancerepository
AT bertramludascher thepbasescientificworkflowprovenancerepository
AT paolomissier thepbasescientificworkflowprovenancerepository
AT fernandochirigati thepbasescientificworkflowprovenancerepository
AT yaxingwei thepbasescientificworkflowprovenancerepository
AT davidkoop thepbasescientificworkflowprovenancerepository
AT saumendey thepbasescientificworkflowprovenancerepository
AT victorcuevasvicenttin pbasescientificworkflowprovenancerepository
AT parisakianmajd pbasescientificworkflowprovenancerepository
AT bertramludascher pbasescientificworkflowprovenancerepository
AT paolomissier pbasescientificworkflowprovenancerepository
AT fernandochirigati pbasescientificworkflowprovenancerepository
AT yaxingwei pbasescientificworkflowprovenancerepository
AT davidkoop pbasescientificworkflowprovenancerepository
AT saumendey pbasescientificworkflowprovenancerepository
_version_ 1725011107587817472