A unified framework for managing provenance information in translational research
<p>Abstract</p> <p>Background</p> <p>A critical aspect of the NIH <it>Translational Research </it>roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the <it>...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2011-11-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/12/461 |
id |
doaj-fa44634d90c94d478f9b4a10ed9919aa |
---|---|
record_format |
Article |
spelling |
doaj-fa44634d90c94d478f9b4a10ed9919aa2020-11-25T01:00:59ZengBMCBMC Bioinformatics1471-21052011-11-0112146110.1186/1471-2105-12-461A unified framework for managing provenance information in translational researchSahoo Satya SNguyen VinhBodenreider OlivierParikh PritiMinning ToddSheth Amit P<p>Abstract</p> <p>Background</p> <p>A critical aspect of the NIH <it>Translational Research </it>roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the <it>provenance </it>metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists.</p> <p>Results</p> <p>We identify a common set of challenges in managing provenance information across the <it>pre-publication </it>and <it>post-publication </it>phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:</p> <p>(a) Provenance <b>collection </b>- during data generation</p> <p>(b) Provenance <b>representation </b>- to support interoperability, reasoning, and incorporate domain semantics</p> <p>(c) Provenance <b>storage </b>and <b>propagation </b>- to allow efficient storage and seamless propagation of provenance as the data is transferred across applications</p> <p>(d) Provenance <b>query </b>- to support queries with increasing complexity over large data size and also support knowledge discovery applications</p> <p>We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for <it>Trypanosoma cruzi </it>(<it>T.cruzi </it>SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.</p> <p>Conclusions</p> <p>The SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.</p> http://www.biomedcentral.com/1471-2105/12/461 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sahoo Satya S Nguyen Vinh Bodenreider Olivier Parikh Priti Minning Todd Sheth Amit P |
spellingShingle |
Sahoo Satya S Nguyen Vinh Bodenreider Olivier Parikh Priti Minning Todd Sheth Amit P A unified framework for managing provenance information in translational research BMC Bioinformatics |
author_facet |
Sahoo Satya S Nguyen Vinh Bodenreider Olivier Parikh Priti Minning Todd Sheth Amit P |
author_sort |
Sahoo Satya S |
title |
A unified framework for managing provenance information in translational research |
title_short |
A unified framework for managing provenance information in translational research |
title_full |
A unified framework for managing provenance information in translational research |
title_fullStr |
A unified framework for managing provenance information in translational research |
title_full_unstemmed |
A unified framework for managing provenance information in translational research |
title_sort |
unified framework for managing provenance information in translational research |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2011-11-01 |
description |
<p>Abstract</p> <p>Background</p> <p>A critical aspect of the NIH <it>Translational Research </it>roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the <it>provenance </it>metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists.</p> <p>Results</p> <p>We identify a common set of challenges in managing provenance information across the <it>pre-publication </it>and <it>post-publication </it>phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:</p> <p>(a) Provenance <b>collection </b>- during data generation</p> <p>(b) Provenance <b>representation </b>- to support interoperability, reasoning, and incorporate domain semantics</p> <p>(c) Provenance <b>storage </b>and <b>propagation </b>- to allow efficient storage and seamless propagation of provenance as the data is transferred across applications</p> <p>(d) Provenance <b>query </b>- to support queries with increasing complexity over large data size and also support knowledge discovery applications</p> <p>We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for <it>Trypanosoma cruzi </it>(<it>T.cruzi </it>SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.</p> <p>Conclusions</p> <p>The SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.</p> |
url |
http://www.biomedcentral.com/1471-2105/12/461 |
work_keys_str_mv |
AT sahoosatyas aunifiedframeworkformanagingprovenanceinformationintranslationalresearch AT nguyenvinh aunifiedframeworkformanagingprovenanceinformationintranslationalresearch AT bodenreiderolivier aunifiedframeworkformanagingprovenanceinformationintranslationalresearch AT parikhpriti aunifiedframeworkformanagingprovenanceinformationintranslationalresearch AT minningtodd aunifiedframeworkformanagingprovenanceinformationintranslationalresearch AT shethamitp aunifiedframeworkformanagingprovenanceinformationintranslationalresearch AT sahoosatyas unifiedframeworkformanagingprovenanceinformationintranslationalresearch AT nguyenvinh unifiedframeworkformanagingprovenanceinformationintranslationalresearch AT bodenreiderolivier unifiedframeworkformanagingprovenanceinformationintranslationalresearch AT parikhpriti unifiedframeworkformanagingprovenanceinformationintranslationalresearch AT minningtodd unifiedframeworkformanagingprovenanceinformationintranslationalresearch AT shethamitp unifiedframeworkformanagingprovenanceinformationintranslationalresearch |
_version_ |
1725211486860607488 |