Prefiltering Model for Homology Detection Algorithms on GPU

Homology detection has evolved over the time from heavy algorithms based on dynamic programming approaches to lightweight alternatives based on different heuristic models. However, the main problem with these algorithms is that they use complex statistical models, which makes it difficult to achieve...

Full description

Bibliographic Details
Main Authors: Germán Retamosa, Luis de Pedro, Ivan González, Javier Tamames
Format: Article
Language:English
Published: SAGE Publishing 2016-01-01
Series:Evolutionary Bioinformatics
Online Access:https://doi.org/10.4137/EBO.S40877
id doaj-e008f3951fcd4d488740122428d3152b
record_format Article
spelling doaj-e008f3951fcd4d488740122428d3152b2020-11-25T03:10:45ZengSAGE PublishingEvolutionary Bioinformatics1176-93432016-01-011210.4137/EBO.S40877Prefiltering Model for Homology Detection Algorithms on GPUGermán Retamosa0Luis de Pedro1Ivan González2Javier Tamames3High Performance Computing and Networking Department, Universidad Autonóma de Madrid, Madrid, Spain.High Performance Computing and Networking Department, Universidad Autonóma de Madrid, Madrid, Spain.High Performance Computing and Networking Department, Universidad Autonóma de Madrid, Madrid, Spain.National Center for Biotechnology, CSIC, Madrid, Spain.Homology detection has evolved over the time from heavy algorithms based on dynamic programming approaches to lightweight alternatives based on different heuristic models. However, the main problem with these algorithms is that they use complex statistical models, which makes it difficult to achieve a relevant speedup and find exact matches with the original results. Thus, their acceleration is essential. The aim of this article was to prefilter a sequence database. To make this work, we have implemented a groundbreaking heuristic model based on NVIDIA's graphics processing units (GPUs) and multicore processors. Depending on the sensitivity settings, this makes it possible to quickly reduce the sequence database by factors between 50% and 95%, while rejecting no significant sequences. Furthermore, this prefiltering application can be used together with multiple homology detection algorithms as a part of a next-generation sequencing system. Extensive performance and accuracy tests have been carried out in the Spanish National Centre for Biotechnology (NCB). The results show that GPU hardware can accelerate the execution times of former homology detection applications, such as National Centre for Biotechnology Information (NCBI), Basic Local Alignment Search Tool for Proteins (BLASTP), up to a factor of 4. KEY POINTS • Owing to the increasing size of the current sequence datasets, filtering approach and high-performance computing (HPC) techniques are the best solution to process all these information in acceptable processing times. • Graphics processing unit cards and their corresponding programming models are good options to carry out these processing methods. • Combination of filtration models with HPC techniques is able to offer new levels of performance and accuracy in homology detection algorithms such as National Centre for Biotechnology Information Basic Local Alignment Search Tool.https://doi.org/10.4137/EBO.S40877
collection DOAJ
language English
format Article
sources DOAJ
author Germán Retamosa
Luis de Pedro
Ivan González
Javier Tamames
spellingShingle Germán Retamosa
Luis de Pedro
Ivan González
Javier Tamames
Prefiltering Model for Homology Detection Algorithms on GPU
Evolutionary Bioinformatics
author_facet Germán Retamosa
Luis de Pedro
Ivan González
Javier Tamames
author_sort Germán Retamosa
title Prefiltering Model for Homology Detection Algorithms on GPU
title_short Prefiltering Model for Homology Detection Algorithms on GPU
title_full Prefiltering Model for Homology Detection Algorithms on GPU
title_fullStr Prefiltering Model for Homology Detection Algorithms on GPU
title_full_unstemmed Prefiltering Model for Homology Detection Algorithms on GPU
title_sort prefiltering model for homology detection algorithms on gpu
publisher SAGE Publishing
series Evolutionary Bioinformatics
issn 1176-9343
publishDate 2016-01-01
description Homology detection has evolved over the time from heavy algorithms based on dynamic programming approaches to lightweight alternatives based on different heuristic models. However, the main problem with these algorithms is that they use complex statistical models, which makes it difficult to achieve a relevant speedup and find exact matches with the original results. Thus, their acceleration is essential. The aim of this article was to prefilter a sequence database. To make this work, we have implemented a groundbreaking heuristic model based on NVIDIA's graphics processing units (GPUs) and multicore processors. Depending on the sensitivity settings, this makes it possible to quickly reduce the sequence database by factors between 50% and 95%, while rejecting no significant sequences. Furthermore, this prefiltering application can be used together with multiple homology detection algorithms as a part of a next-generation sequencing system. Extensive performance and accuracy tests have been carried out in the Spanish National Centre for Biotechnology (NCB). The results show that GPU hardware can accelerate the execution times of former homology detection applications, such as National Centre for Biotechnology Information (NCBI), Basic Local Alignment Search Tool for Proteins (BLASTP), up to a factor of 4. KEY POINTS • Owing to the increasing size of the current sequence datasets, filtering approach and high-performance computing (HPC) techniques are the best solution to process all these information in acceptable processing times. • Graphics processing unit cards and their corresponding programming models are good options to carry out these processing methods. • Combination of filtration models with HPC techniques is able to offer new levels of performance and accuracy in homology detection algorithms such as National Centre for Biotechnology Information Basic Local Alignment Search Tool.
url https://doi.org/10.4137/EBO.S40877
work_keys_str_mv AT germanretamosa prefilteringmodelforhomologydetectionalgorithmsongpu
AT luisdepedro prefilteringmodelforhomologydetectionalgorithmsongpu
AT ivangonzalez prefilteringmodelforhomologydetectionalgorithmsongpu
AT javiertamames prefilteringmodelforhomologydetectionalgorithmsongpu
_version_ 1724657549949534208