An Improved Distance Matrix Computation Algorithm for Multicore Clusters

Distance matrix has diverse usage in different research areas. Its computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The gigantic explosion of biological sequence databases leads to an urgent need for accelerating these computa...

Full description

Bibliographic Details
Main Authors: Mohammed W. Al-Neama, Naglaa M. Reda, Fayed F. M. Ghaleb
Format: Article
Language:English
Published: Hindawi Limited 2014-01-01
Series:BioMed Research International
Online Access:http://dx.doi.org/10.1155/2014/406178
id doaj-4446ea3e9c2b4a8c8e74c16bad2727d3
record_format Article
spelling doaj-4446ea3e9c2b4a8c8e74c16bad2727d32020-11-24T23:17:10ZengHindawi LimitedBioMed Research International2314-61332314-61412014-01-01201410.1155/2014/406178406178An Improved Distance Matrix Computation Algorithm for Multicore ClustersMohammed W. Al-Neama0Naglaa M. Reda1Fayed F. M. Ghaleb2Department of Mathematics, Faculty of Science, Al-Azhar University, Cairo, EgyptDepartment of Mathematics, Faculty of Science, Ain Shams University, Cairo, EgyptDepartment of Mathematics, Faculty of Science, Ain Shams University, Cairo, EgyptDistance matrix has diverse usage in different research areas. Its computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The gigantic explosion of biological sequence databases leads to an urgent need for accelerating these computations. DistVect algorithm was introduced in the paper of Al-Neama et al. (in press) to present a recent approach for vectorizing distance matrix computing. It showed an efficient performance in both sequential and parallel computing. However, the multicore cluster systems, which are available now, with their scalability and performance/cost ratio, meet the need for more powerful and efficient performance. This paper proposes DistVect1 as highly efficient parallel vectorized algorithm with high performance for computing distance matrix, addressed to multicore clusters. It reformulates DistVect1 vectorized algorithm in terms of clusters primitives. It deduces an efficient approach of partitioning and scheduling computations, convenient to this type of architecture. Implementations employ potential of both MPI and OpenMP libraries. Experimental results show that the proposed method performs improvement of around 3-fold speedup upon SSE2. Further it also achieves speedups more than 9 orders of magnitude compared to the publicly available parallel implementation utilized in ClustalW-MPI.http://dx.doi.org/10.1155/2014/406178
collection DOAJ
language English
format Article
sources DOAJ
author Mohammed W. Al-Neama
Naglaa M. Reda
Fayed F. M. Ghaleb
spellingShingle Mohammed W. Al-Neama
Naglaa M. Reda
Fayed F. M. Ghaleb
An Improved Distance Matrix Computation Algorithm for Multicore Clusters
BioMed Research International
author_facet Mohammed W. Al-Neama
Naglaa M. Reda
Fayed F. M. Ghaleb
author_sort Mohammed W. Al-Neama
title An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_short An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_full An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_fullStr An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_full_unstemmed An Improved Distance Matrix Computation Algorithm for Multicore Clusters
title_sort improved distance matrix computation algorithm for multicore clusters
publisher Hindawi Limited
series BioMed Research International
issn 2314-6133
2314-6141
publishDate 2014-01-01
description Distance matrix has diverse usage in different research areas. Its computation is typically an essential task in most bioinformatics applications, especially in multiple sequence alignment. The gigantic explosion of biological sequence databases leads to an urgent need for accelerating these computations. DistVect algorithm was introduced in the paper of Al-Neama et al. (in press) to present a recent approach for vectorizing distance matrix computing. It showed an efficient performance in both sequential and parallel computing. However, the multicore cluster systems, which are available now, with their scalability and performance/cost ratio, meet the need for more powerful and efficient performance. This paper proposes DistVect1 as highly efficient parallel vectorized algorithm with high performance for computing distance matrix, addressed to multicore clusters. It reformulates DistVect1 vectorized algorithm in terms of clusters primitives. It deduces an efficient approach of partitioning and scheduling computations, convenient to this type of architecture. Implementations employ potential of both MPI and OpenMP libraries. Experimental results show that the proposed method performs improvement of around 3-fold speedup upon SSE2. Further it also achieves speedups more than 9 orders of magnitude compared to the publicly available parallel implementation utilized in ClustalW-MPI.
url http://dx.doi.org/10.1155/2014/406178
work_keys_str_mv AT mohammedwalneama animproveddistancematrixcomputationalgorithmformulticoreclusters
AT naglaamreda animproveddistancematrixcomputationalgorithmformulticoreclusters
AT fayedfmghaleb animproveddistancematrixcomputationalgorithmformulticoreclusters
AT mohammedwalneama improveddistancematrixcomputationalgorithmformulticoreclusters
AT naglaamreda improveddistancematrixcomputationalgorithmformulticoreclusters
AT fayedfmghaleb improveddistancematrixcomputationalgorithmformulticoreclusters
_version_ 1725584510736662528