A Performance Study of XML Query Optimization Techniques

Bibliographic Details
Main Author: Richardson, Bartley D.
Language:English
Published: University of Cincinnati / OhioLINK 2009
Subjects:
XML
Online Access:http://rave.ohiolink.edu/etdc/view?acc_num=ucin1258475256
id ndltd-OhioLink-oai-etd.ohiolink.edu-ucin1258475256
record_format oai_dc
spelling ndltd-OhioLink-oai-etd.ohiolink.edu-ucin12584752562021-08-03T06:13:37Z A Performance Study of XML Query Optimization Techniques Richardson, Bartley D. Computer Science XML query optimization As computers and technology continue to become more commonplace and essential to everyday life, more data is captured, stored, and analyzed by a variety of institutions in government, education, and the private sector. As this amount of data grows, so does the need for efficient methodologies and tools used to store, retrieve, and transform the data. A common method used to store this schemaless, semi-structured data is through the Extensible Markup Language, XML. In this way, an XML document is viewed as a database. With this sizable amount of data stored in a common format, one problem is how to efficiently query XML documents. While relational database man- agement systems contain built-in query optimizers, no such framework exists for XML databases. A multitude of document shapes, query shapes, index structures, and query techniques exist for XML databases, but the implications of these choices and their effects on query processing have not been investigated in a common framework. This dissertation identifies a set of representative query techniques, document structures, and query styles for XML databases and provides a com- mon framework for classifying the various query techniques, structures, and styles. We identify two broad classifications of query techniques, native XML and non-native XML, and develop a cost-based model for each technique that models query performance from an execution standpoint. We also develop our own query technique, RDBQuery, as an extension and major enhancement to a previously existing non-native XML query technique that leverages a relational database man- agement system to efficiently process XML queries. To evaluate relative query performance, we compare the techniques for various parameters that impact their performance, including query shape and document shape/size, and the results are presented through a series of graphs. These graphs and their underlying cost models are used to present an optimization framework for XML queries, and this provides the essential foundation in development of an integrated cost-based XML query optimizer. 2009 English text University of Cincinnati / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=ucin1258475256 http://rave.ohiolink.edu/etdc/view?acc_num=ucin1258475256 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws.
collection NDLTD
language English
sources NDLTD
topic Computer Science
XML
query
optimization
spellingShingle Computer Science
XML
query
optimization
Richardson, Bartley D.
A Performance Study of XML Query Optimization Techniques
author Richardson, Bartley D.
author_facet Richardson, Bartley D.
author_sort Richardson, Bartley D.
title A Performance Study of XML Query Optimization Techniques
title_short A Performance Study of XML Query Optimization Techniques
title_full A Performance Study of XML Query Optimization Techniques
title_fullStr A Performance Study of XML Query Optimization Techniques
title_full_unstemmed A Performance Study of XML Query Optimization Techniques
title_sort performance study of xml query optimization techniques
publisher University of Cincinnati / OhioLINK
publishDate 2009
url http://rave.ohiolink.edu/etdc/view?acc_num=ucin1258475256
work_keys_str_mv AT richardsonbartleyd aperformancestudyofxmlqueryoptimizationtechniques
AT richardsonbartleyd performancestudyofxmlqueryoptimizationtechniques
_version_ 1719433067452432384