Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment

Over the past thirty years, clinical research has benefited substantially from the adoption of electronic medical record systems. As deployment has increased, so too has the number of researchers seeking to improve the overall analytical environment by way of tool...

Full description

Bibliographic Details
Main Author: Hylock, Ray Hales
Other Authors: Eichmann, David
Format: Others
Language:English
Published: University of Iowa 2013
Subjects:
Online Access:https://ir.uiowa.edu/etd/2526
https://ir.uiowa.edu/cgi/viewcontent.cgi?article=4655&context=etd
id ndltd-uiowa.edu-oai-ir.uiowa.edu-etd-4655
record_format oai_dc
spelling ndltd-uiowa.edu-oai-ir.uiowa.edu-etd-46552019-10-13T05:02:40Z Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment Hylock, Ray Hales Over the past thirty years, clinical research has benefited substantially from the adoption of electronic medical record systems. As deployment has increased, so too has the number of researchers seeking to improve the overall analytical environment by way of tools and models. Although much work has been done, there are still many uninvestigated areas; two of which are explored in this dissertation. The first pertains to the physical storage of the data itself. There are two generally accepted storage models: relational and entity-attribute-value (EAV). For clinical data, EAV systems are preferred due to their natural way of managing many-to-many relationships, sparse attributes, and dynamic processes along with minimal conversion effort and reduction in federation complexities. However, the relational database management systems on which they are implemented, are not intended to organize and retrieve data in this format; eroding their performance gains. To combat this effect, we present the foundation for an EAV Database Management System (EDBMS). We discuss data conversion methodologies, formulate the requisite metadata and partitioned type-sensing index structures, and provide detailed runtime and experimental analysis with five extant methods. Our results show that the prototype, EAVDB, reduces space and conversion requirements while enhancing overall query performance. The second topic concerns query performance in a federated environment. One method used to decrease query execution time, is to pre-compute and store "beneficial" queries (views). The View Selection Problem (VSP) identifies these views subject to resource constraints. A federated model, however, has yet to be developed. In this dissertation, we submit three advances in view materialization. First, a more robust optimization function, the Minimum-Maintenance View Selection Problem (MMVSP), is derived by combining existing approaches. Second, the Federated View Selection Problem (FVSP), built upon the MMVSP, and federated data cube lattice are formalized. The FVSP allows for multiple querying nodes, partial and full materialization, and data propagation constriction. The latter two are shown to greatly reduce the overall number of valid solutions within the solution space and thus a novel, multi-tiered approach is given. Lastly, EAV materialization, which is introduced in this dissertation, is incorporated into an expanded, multi-modal variant of the FVSP. As models and heuristics for both the federated and EAV VSP, to the best of our knowledge, do not exist, this research defines two new branches of data warehouse optimization. Coupled with our EDBMS design, this dissertation confronts two main challenges associated with clinical data warehousing and federation. 2013-05-01T07:00:00Z dissertation application/pdf https://ir.uiowa.edu/etd/2526 https://ir.uiowa.edu/cgi/viewcontent.cgi?article=4655&context=etd Copyright 2013 Ray Hylock Theses and Dissertations eng University of IowaEichmann, David Clinical data warehousing EAV database management system EAV view selection problem Entity-attribute-value system Federated view selection problem Bioinformatics
collection NDLTD
language English
format Others
sources NDLTD
topic Clinical data warehousing
EAV database management system
EAV view selection problem
Entity-attribute-value system
Federated view selection problem
Bioinformatics
spellingShingle Clinical data warehousing
EAV database management system
EAV view selection problem
Entity-attribute-value system
Federated view selection problem
Bioinformatics
Hylock, Ray Hales
Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
description Over the past thirty years, clinical research has benefited substantially from the adoption of electronic medical record systems. As deployment has increased, so too has the number of researchers seeking to improve the overall analytical environment by way of tools and models. Although much work has been done, there are still many uninvestigated areas; two of which are explored in this dissertation. The first pertains to the physical storage of the data itself. There are two generally accepted storage models: relational and entity-attribute-value (EAV). For clinical data, EAV systems are preferred due to their natural way of managing many-to-many relationships, sparse attributes, and dynamic processes along with minimal conversion effort and reduction in federation complexities. However, the relational database management systems on which they are implemented, are not intended to organize and retrieve data in this format; eroding their performance gains. To combat this effect, we present the foundation for an EAV Database Management System (EDBMS). We discuss data conversion methodologies, formulate the requisite metadata and partitioned type-sensing index structures, and provide detailed runtime and experimental analysis with five extant methods. Our results show that the prototype, EAVDB, reduces space and conversion requirements while enhancing overall query performance. The second topic concerns query performance in a federated environment. One method used to decrease query execution time, is to pre-compute and store "beneficial" queries (views). The View Selection Problem (VSP) identifies these views subject to resource constraints. A federated model, however, has yet to be developed. In this dissertation, we submit three advances in view materialization. First, a more robust optimization function, the Minimum-Maintenance View Selection Problem (MMVSP), is derived by combining existing approaches. Second, the Federated View Selection Problem (FVSP), built upon the MMVSP, and federated data cube lattice are formalized. The FVSP allows for multiple querying nodes, partial and full materialization, and data propagation constriction. The latter two are shown to greatly reduce the overall number of valid solutions within the solution space and thus a novel, multi-tiered approach is given. Lastly, EAV materialization, which is introduced in this dissertation, is incorporated into an expanded, multi-modal variant of the FVSP. As models and heuristics for both the federated and EAV VSP, to the best of our knowledge, do not exist, this research defines two new branches of data warehouse optimization. Coupled with our EDBMS design, this dissertation confronts two main challenges associated with clinical data warehousing and federation.
author2 Eichmann, David
author_facet Eichmann, David
Hylock, Ray Hales
author Hylock, Ray Hales
author_sort Hylock, Ray Hales
title Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
title_short Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
title_full Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
title_fullStr Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
title_full_unstemmed Beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
title_sort beyond relational: a database architecture and federated query optimization in a multi-modal healthcare environment
publisher University of Iowa
publishDate 2013
url https://ir.uiowa.edu/etd/2526
https://ir.uiowa.edu/cgi/viewcontent.cgi?article=4655&context=etd
work_keys_str_mv AT hylockrayhales beyondrelationaladatabasearchitectureandfederatedqueryoptimizationinamultimodalhealthcareenvironment
_version_ 1719265625627426816