Implementation of Tuned Schema Merging Approach

Schema merging is a process of integrating multiple data sources into a GCS (Global Conceptual Schema). It is pivotal to various application domains, like data ware housing and multi-databases. Schema merging requires the identification of corresponding elements, which is done through schema matchin...

Full description

Bibliographic Details
Main Authors: Nayyer Masood, Gul Jabee
Format: Article
Language:English
Published: Mehran University of Engineering and Technology 2018-10-01
Series:Mehran University Research Journal of Engineering and Technology
Online Access:http://publications.muet.edu.pk/index.php/muetrj/article/view/558
id doaj-e6a42b42429b4a8981213415a32d7a63
record_format Article
spelling doaj-e6a42b42429b4a8981213415a32d7a632020-11-25T02:30:51ZengMehran University of Engineering and TechnologyMehran University Research Journal of Engineering and Technology0254-78212413-72192018-10-0137449750610.22581/muet1982.1804.05558Implementation of Tuned Schema Merging ApproachNayyer Masood0Gul Jabee1Department of Computer Science, Capital University of Science and Technology, Islamabad, Pakistan.Department of Computer Science, Karakoram International University, Gilgit-Baltistan, Pakistan.Schema merging is a process of integrating multiple data sources into a GCS (Global Conceptual Schema). It is pivotal to various application domains, like data ware housing and multi-databases. Schema merging requires the identification of corresponding elements, which is done through schema matching process. In this process, corresponding elements across multiple data sources are identified after the comparison of these data sources with each other. In this way, for a given set of data sources and the correspondence between them, different possibilities for creating GCS can be achieved. In applications like multi-databases and data warehousing, new data sources keep joining in and GCS relations are usually expanded horizontally or vertically. Schema merging approaches usually expand GCS relations horizontally or vertically as new data sources join in. As a result of such expansions, an unbalanced GCS is created which either produces too much NULL values in response to global queries or a result of too many Joins causes poor query processing. In this paper, a novel approach, TuSMe (Tuned Schema Merging) techniqueis introduced to overcome the above mentioned issue via developing a balanced GCS, which will be able to control both vertical and horizontal expansion of GCS relations. The approach employs a weighting mechanism in which the weights are assigned to individual attributes of GCS. These weights reflect the connectedness of GCS attributes in accordance with the attributes of the principle data sources. Moreover, the overall strength of the GCS could be scrutinized by combining these weights. A prototype implementation of TuSMe shows significant improvement against other contemporary state-of-the-art approaches.http://publications.muet.edu.pk/index.php/muetrj/article/view/558
collection DOAJ
language English
format Article
sources DOAJ
author Nayyer Masood
Gul Jabee
spellingShingle Nayyer Masood
Gul Jabee
Implementation of Tuned Schema Merging Approach
Mehran University Research Journal of Engineering and Technology
author_facet Nayyer Masood
Gul Jabee
author_sort Nayyer Masood
title Implementation of Tuned Schema Merging Approach
title_short Implementation of Tuned Schema Merging Approach
title_full Implementation of Tuned Schema Merging Approach
title_fullStr Implementation of Tuned Schema Merging Approach
title_full_unstemmed Implementation of Tuned Schema Merging Approach
title_sort implementation of tuned schema merging approach
publisher Mehran University of Engineering and Technology
series Mehran University Research Journal of Engineering and Technology
issn 0254-7821
2413-7219
publishDate 2018-10-01
description Schema merging is a process of integrating multiple data sources into a GCS (Global Conceptual Schema). It is pivotal to various application domains, like data ware housing and multi-databases. Schema merging requires the identification of corresponding elements, which is done through schema matching process. In this process, corresponding elements across multiple data sources are identified after the comparison of these data sources with each other. In this way, for a given set of data sources and the correspondence between them, different possibilities for creating GCS can be achieved. In applications like multi-databases and data warehousing, new data sources keep joining in and GCS relations are usually expanded horizontally or vertically. Schema merging approaches usually expand GCS relations horizontally or vertically as new data sources join in. As a result of such expansions, an unbalanced GCS is created which either produces too much NULL values in response to global queries or a result of too many Joins causes poor query processing. In this paper, a novel approach, TuSMe (Tuned Schema Merging) techniqueis introduced to overcome the above mentioned issue via developing a balanced GCS, which will be able to control both vertical and horizontal expansion of GCS relations. The approach employs a weighting mechanism in which the weights are assigned to individual attributes of GCS. These weights reflect the connectedness of GCS attributes in accordance with the attributes of the principle data sources. Moreover, the overall strength of the GCS could be scrutinized by combining these weights. A prototype implementation of TuSMe shows significant improvement against other contemporary state-of-the-art approaches.
url http://publications.muet.edu.pk/index.php/muetrj/article/view/558
work_keys_str_mv AT nayyermasood implementationoftunedschemamergingapproach
AT guljabee implementationoftunedschemamergingapproach
_version_ 1724827383910891520