Design and implementation of a generalized laboratory data model

Abstract Background Investigators in the biological sciences continue to exploit laboratory automation methods and have dramatically increased the rates at which they can generate data. In many environments, the methods themselves also evolve in a rapid...

Full description

Bibliographic Details
Main Authors:	Nhan Mike, Carmichael Lynn, Leong Shin, Hepler Todd, Crouse Kevin, Chinwalla Asif T, Dooling David J, Pohl Craig S, Smith Scott, Wendl Michael C, Oberkfell Benjamin J, Mardis Elaine R, Hillier LaDeana W, Wilson Richard K
Format:	Article
Language:	English
Published:	BMC 2007-09-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/8/362

id	doaj-7568770e0c5d4d3fb26310466bd3e958
record_format	Article
spelling	doaj-7568770e0c5d4d3fb26310466bd3e9582020-11-24T22:22:25ZengBMCBMC Bioinformatics1471-21052007-09-018136210.1186/1471-2105-8-362Design and implementation of a generalized laboratory data modelNhan MikeCarmichael LynnLeong ShinHepler ToddCrouse KevinChinwalla Asif TDooling David JPohl Craig SSmith ScottWendl Michael COberkfell Benjamin JMardis Elaine RHillier LaDeana WWilson Richard K<p>Abstract</p> <p>Background</p> <p>Investigators in the biological sciences continue to exploit laboratory automation methods and have dramatically increased the rates at which they can generate data. In many environments, the methods themselves also evolve in a rapid and fluid manner. These observations point to the importance of robust information management systems in the modern laboratory. Designing and implementing such systems is non-trivial and it appears that in many cases a database project ultimately proves unserviceable.</p> <p>Results</p> <p>We describe a general modeling framework for laboratory data and its implementation as an information management system. The model utilizes several abstraction techniques, focusing especially on the concepts of inheritance and meta-data. Traditional approaches commingle event-oriented data with regular entity data in <it>ad hoc </it>ways. Instead, we define distinct regular entity and event schemas, but fully integrate these via a standardized interface. The design allows straightforward definition of a "processing pipeline" as a sequence of events, obviating the need for separate workflow management systems. A layer above the event-oriented schema integrates events into a workflow by defining "processing directives", which act as automated project managers of items in the system. Directives can be added or modified in an almost trivial fashion, i.e., without the need for schema modification or re-certification of applications. Association between regular entities and events is managed via simple "many-to-many" relationships. We describe the programming interface, as well as techniques for handling input/output, process control, and state transitions.</p> <p>Conclusion</p> <p>The implementation described here has served as the Washington University Genome Sequencing Center's primary information system for several years. It handles all transactions underlying a throughput rate of about 9 million sequencing reactions of various kinds per month and has handily weathered a number of major pipeline reconfigurations. The basic data model can be readily adapted to other high-volume processing environments.</p> http://www.biomedcentral.com/1471-2105/8/362
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Nhan Mike Carmichael Lynn Leong Shin Hepler Todd Crouse Kevin Chinwalla Asif T Dooling David J Pohl Craig S Smith Scott Wendl Michael C Oberkfell Benjamin J Mardis Elaine R Hillier LaDeana W Wilson Richard K
spellingShingle	Nhan Mike Carmichael Lynn Leong Shin Hepler Todd Crouse Kevin Chinwalla Asif T Dooling David J Pohl Craig S Smith Scott Wendl Michael C Oberkfell Benjamin J Mardis Elaine R Hillier LaDeana W Wilson Richard K Design and implementation of a generalized laboratory data model BMC Bioinformatics
author_facet	Nhan Mike Carmichael Lynn Leong Shin Hepler Todd Crouse Kevin Chinwalla Asif T Dooling David J Pohl Craig S Smith Scott Wendl Michael C Oberkfell Benjamin J Mardis Elaine R Hillier LaDeana W Wilson Richard K
author_sort	Nhan Mike
title	Design and implementation of a generalized laboratory data model
title_short	Design and implementation of a generalized laboratory data model
title_full	Design and implementation of a generalized laboratory data model
title_fullStr	Design and implementation of a generalized laboratory data model
title_full_unstemmed	Design and implementation of a generalized laboratory data model
title_sort	design and implementation of a generalized laboratory data model
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2007-09-01
description	<p>Abstract</p> <p>Background</p> <p>Investigators in the biological sciences continue to exploit laboratory automation methods and have dramatically increased the rates at which they can generate data. In many environments, the methods themselves also evolve in a rapid and fluid manner. These observations point to the importance of robust information management systems in the modern laboratory. Designing and implementing such systems is non-trivial and it appears that in many cases a database project ultimately proves unserviceable.</p> <p>Results</p> <p>We describe a general modeling framework for laboratory data and its implementation as an information management system. The model utilizes several abstraction techniques, focusing especially on the concepts of inheritance and meta-data. Traditional approaches commingle event-oriented data with regular entity data in <it>ad hoc </it>ways. Instead, we define distinct regular entity and event schemas, but fully integrate these via a standardized interface. The design allows straightforward definition of a "processing pipeline" as a sequence of events, obviating the need for separate workflow management systems. A layer above the event-oriented schema integrates events into a workflow by defining "processing directives", which act as automated project managers of items in the system. Directives can be added or modified in an almost trivial fashion, i.e., without the need for schema modification or re-certification of applications. Association between regular entities and events is managed via simple "many-to-many" relationships. We describe the programming interface, as well as techniques for handling input/output, process control, and state transitions.</p> <p>Conclusion</p> <p>The implementation described here has served as the Washington University Genome Sequencing Center's primary information system for several years. It handles all transactions underlying a throughput rate of about 9 million sequencing reactions of various kinds per month and has handily weathered a number of major pipeline reconfigurations. The basic data model can be readily adapted to other high-volume processing environments.</p>
url	http://www.biomedcentral.com/1471-2105/8/362
work_keys_str_mv	AT nhanmike designandimplementationofageneralizedlaboratorydatamodel AT carmichaellynn designandimplementationofageneralizedlaboratorydatamodel AT leongshin designandimplementationofageneralizedlaboratorydatamodel AT heplertodd designandimplementationofageneralizedlaboratorydatamodel AT crousekevin designandimplementationofageneralizedlaboratorydatamodel AT chinwallaasift designandimplementationofageneralizedlaboratorydatamodel AT doolingdavidj designandimplementationofageneralizedlaboratorydatamodel AT pohlcraigs designandimplementationofageneralizedlaboratorydatamodel AT smithscott designandimplementationofageneralizedlaboratorydatamodel AT wendlmichaelc designandimplementationofageneralizedlaboratorydatamodel AT oberkfellbenjaminj designandimplementationofageneralizedlaboratorydatamodel AT mardiselainer designandimplementationofageneralizedlaboratorydatamodel AT hillierladeanaw designandimplementationofageneralizedlaboratorydatamodel AT wilsonrichardk designandimplementationofageneralizedlaboratorydatamodel
_version_	1725768362193059840

Design and implementation of a generalized laboratory data model

Similar Items