CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System

In genome-wide association studies (GWAS), regression analysis has been most commonly used to establish an association between a phenotype and genetic variants, such as single nucleotide polymorphism (SNP). However, most applications of regression analysis have been restricted to the investigation o...

Full description

Bibliographic Details
Main Authors: Sungyoung Lee, Min-Seok Kwon, Taesung Park
Format: Article
Language:English
Published: SAGE Publishing 2014-01-01
Series:Cancer Informatics
Online Access:https://doi.org/10.4137/CIN.S16349
id doaj-974482e4d1bf42ada253f0ce1cc8a2a3
record_format Article
spelling doaj-974482e4d1bf42ada253f0ce1cc8a2a32020-11-25T03:45:05ZengSAGE PublishingCancer Informatics1176-93512014-01-0113s710.4137/CIN.S16349CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing SystemSungyoung Lee0Min-Seok Kwon1Taesung Park2Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.Department of Statistics, Seoul National University, Seoul, South Korea.In genome-wide association studies (GWAS), regression analysis has been most commonly used to establish an association between a phenotype and genetic variants, such as single nucleotide polymorphism (SNP). However, most applications of regression analysis have been restricted to the investigation of single marker because of the large computational burden. Thus, there have been limited applications of regression analysis to multiple SNPs, including gene–gene interaction (GGI) in large-scale GWAS data. In order to overcome this limitation, we propose CARAT-GxG, a GPU computing system-oriented toolkit, for performing regression analysis with GGI using CUDA (compute unified device architecture). Compared to other methods, CARAT-GxG achieved almost 700-fold execution speed and delivered highly reliable results through our GPU-specific optimization techniques. In addition, it was possible to achieve almost-linear speed acceleration with the application of a GPU computing system, which is implemented by the TORQUE Resource Manager. We expect that CARAT-GxG will enable large-scale regression analysis with GGI for GWAS data.https://doi.org/10.4137/CIN.S16349
collection DOAJ
language English
format Article
sources DOAJ
author Sungyoung Lee
Min-Seok Kwon
Taesung Park
spellingShingle Sungyoung Lee
Min-Seok Kwon
Taesung Park
CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System
Cancer Informatics
author_facet Sungyoung Lee
Min-Seok Kwon
Taesung Park
author_sort Sungyoung Lee
title CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System
title_short CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System
title_full CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System
title_fullStr CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System
title_full_unstemmed CARAT-GxG: CUDA-Accelerated Regression Analysis Toolkit for Large-Scale Gene–Gene Interaction with GPU Computing System
title_sort carat-gxg: cuda-accelerated regression analysis toolkit for large-scale gene–gene interaction with gpu computing system
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2014-01-01
description In genome-wide association studies (GWAS), regression analysis has been most commonly used to establish an association between a phenotype and genetic variants, such as single nucleotide polymorphism (SNP). However, most applications of regression analysis have been restricted to the investigation of single marker because of the large computational burden. Thus, there have been limited applications of regression analysis to multiple SNPs, including gene–gene interaction (GGI) in large-scale GWAS data. In order to overcome this limitation, we propose CARAT-GxG, a GPU computing system-oriented toolkit, for performing regression analysis with GGI using CUDA (compute unified device architecture). Compared to other methods, CARAT-GxG achieved almost 700-fold execution speed and delivered highly reliable results through our GPU-specific optimization techniques. In addition, it was possible to achieve almost-linear speed acceleration with the application of a GPU computing system, which is implemented by the TORQUE Resource Manager. We expect that CARAT-GxG will enable large-scale regression analysis with GGI for GWAS data.
url https://doi.org/10.4137/CIN.S16349
work_keys_str_mv AT sungyounglee caratgxgcudaacceleratedregressionanalysistoolkitforlargescalegenegeneinteractionwithgpucomputingsystem
AT minseokkwon caratgxgcudaacceleratedregressionanalysistoolkitforlargescalegenegeneinteractionwithgpucomputingsystem
AT taesungpark caratgxgcudaacceleratedregressionanalysistoolkitforlargescalegenegeneinteractionwithgpucomputingsystem
_version_ 1724511534884847616