Optimization of Queries with Fragmented Relations and Large Join Queries

碩士 === 國立中興大學 === 資訊科學學系 === 83 === Aiming at reducing the communication cost for distributed query optimization, this thesis investigates two problems, minimizing the communication cost for queries that refer to n fragments (n>=2...

Full description

Bibliographic Details
Main Authors: Chun-Xian Chang, 張純嫻
Other Authors: Kuen-Fang J.Jea
Format: Others
Language:en_US
Published: 1995
Online Access:http://ndltd.ncl.edu.tw/handle/03335774856839337995
Description
Summary:碩士 === 國立中興大學 === 資訊科學學系 === 83 === Aiming at reducing the communication cost for distributed query optimization, this thesis investigates two problems, minimizing the communication cost for queries that refer to n fragments (n>=2) and optimizing the execution orders of large join queries. We define a pair - fragment allocation matrix called PFAM and propose an algorithm to fill the PFAM. The filled PFAM assures that any two fragments are both allocated to at least one of the local sites, and all queries that refer to two fragments can be executed just in the local sites. The communication cost of all queries referring to two fragments is thus reduced to zero and the total communication cost in distributed query processing is also reduced. A large join query consists of a series of join operations. The order in which these joins are executed has a great impact on the communication cost of distributed query processing.We propose a dynamic programming algorithm to optimize large join queries in distributed database systems. By the algorithm, we can obtain the minimal communication cost for some large join query as well as its optimal parenthesization exec- ution orders.We can also determine the sites to perform the query and the data fragments to be transmitted among the sites.To demo- nstrate the research results, the filled PFAM is applied to both the Distributed INGRES optimization algorithm and the dynamic programming algorithm for optimizing large join queries. It has been shown that the communication cost is further reduced as compared with that of the original two algorithms.