A unified framework for managing provenance information in translational research

<p>Abstract</p> <p>Background</p> <p>A critical aspect of the NIH <it>Translational Research </it>roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the <it>...

Full description

Bibliographic Details
Main Authors: Sahoo Satya S, Nguyen Vinh, Bodenreider Olivier, Parikh Priti, Minning Todd, Sheth Amit P
Format: Article
Language:English
Published: BMC 2011-11-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/461
id doaj-fa44634d90c94d478f9b4a10ed9919aa
record_format Article
spelling doaj-fa44634d90c94d478f9b4a10ed9919aa2020-11-25T01:00:59ZengBMCBMC Bioinformatics1471-21052011-11-0112146110.1186/1471-2105-12-461A unified framework for managing provenance information in translational researchSahoo Satya SNguyen VinhBodenreider OlivierParikh PritiMinning ToddSheth Amit P<p>Abstract</p> <p>Background</p> <p>A critical aspect of the NIH <it>Translational Research </it>roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the <it>provenance </it>metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists.</p> <p>Results</p> <p>We identify a common set of challenges in managing provenance information across the <it>pre-publication </it>and <it>post-publication </it>phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:</p> <p>(a) Provenance <b>collection </b>- during data generation</p> <p>(b) Provenance <b>representation </b>- to support interoperability, reasoning, and incorporate domain semantics</p> <p>(c) Provenance <b>storage </b>and <b>propagation </b>- to allow efficient storage and seamless propagation of provenance as the data is transferred across applications</p> <p>(d) Provenance <b>query </b>- to support queries with increasing complexity over large data size and also support knowledge discovery applications</p> <p>We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for <it>Trypanosoma cruzi </it>(<it>T.cruzi </it>SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.</p> <p>Conclusions</p> <p>The SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.</p> http://www.biomedcentral.com/1471-2105/12/461
collection DOAJ
language English
format Article
sources DOAJ
author Sahoo Satya S
Nguyen Vinh
Bodenreider Olivier
Parikh Priti
Minning Todd
Sheth Amit P
spellingShingle Sahoo Satya S
Nguyen Vinh
Bodenreider Olivier
Parikh Priti
Minning Todd
Sheth Amit P
A unified framework for managing provenance information in translational research
BMC Bioinformatics
author_facet Sahoo Satya S
Nguyen Vinh
Bodenreider Olivier
Parikh Priti
Minning Todd
Sheth Amit P
author_sort Sahoo Satya S
title A unified framework for managing provenance information in translational research
title_short A unified framework for managing provenance information in translational research
title_full A unified framework for managing provenance information in translational research
title_fullStr A unified framework for managing provenance information in translational research
title_full_unstemmed A unified framework for managing provenance information in translational research
title_sort unified framework for managing provenance information in translational research
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-11-01
description <p>Abstract</p> <p>Background</p> <p>A critical aspect of the NIH <it>Translational Research </it>roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the <it>provenance </it>metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists.</p> <p>Results</p> <p>We identify a common set of challenges in managing provenance information across the <it>pre-publication </it>and <it>post-publication </it>phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:</p> <p>(a) Provenance <b>collection </b>- during data generation</p> <p>(b) Provenance <b>representation </b>- to support interoperability, reasoning, and incorporate domain semantics</p> <p>(c) Provenance <b>storage </b>and <b>propagation </b>- to allow efficient storage and seamless propagation of provenance as the data is transferred across applications</p> <p>(d) Provenance <b>query </b>- to support queries with increasing complexity over large data size and also support knowledge discovery applications</p> <p>We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for <it>Trypanosoma cruzi </it>(<it>T.cruzi </it>SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.</p> <p>Conclusions</p> <p>The SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.</p>
url http://www.biomedcentral.com/1471-2105/12/461
work_keys_str_mv AT sahoosatyas aunifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT nguyenvinh aunifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT bodenreiderolivier aunifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT parikhpriti aunifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT minningtodd aunifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT shethamitp aunifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT sahoosatyas unifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT nguyenvinh unifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT bodenreiderolivier unifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT parikhpriti unifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT minningtodd unifiedframeworkformanagingprovenanceinformationintranslationalresearch
AT shethamitp unifiedframeworkformanagingprovenanceinformationintranslationalresearch
_version_ 1725211486860607488