An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.

We consider the problem of predicting sensitivity of cancer cell lines to new drugs based on supervised learning on genomic profiles. The genetic and epigenetic characterization of a cell line provides observations on various aspects of regulation including DNA copy number variations, gene expressio...

Full description

Bibliographic Details
Main Authors: Qian Wan, Ranadip Pal
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4076307?pdf=render
id doaj-3e5a26ded33e4fcf88c61276935d1f37
record_format Article
spelling doaj-3e5a26ded33e4fcf88c61276935d1f372020-11-25T01:34:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0196e10118310.1371/journal.pone.0101183An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.Qian WanRanadip PalWe consider the problem of predicting sensitivity of cancer cell lines to new drugs based on supervised learning on genomic profiles. The genetic and epigenetic characterization of a cell line provides observations on various aspects of regulation including DNA copy number variations, gene expression, DNA methylation and protein abundance. To extract relevant information from the various data types, we applied a random forest based approach to generate sensitivity predictions from each type of data and combined the predictions in a linear regression model to generate the final drug sensitivity prediction. Our approach when applied to the NCI-DREAM drug sensitivity prediction challenge was a top performer among 47 teams and produced high accuracy predictions. Our results show that the incorporation of multiple genomic characterizations lowered the mean and variance of the estimated bootstrap prediction error. We also applied our approach to the Cancer Cell Line Encyclopedia database for sensitivity prediction and the ability to extract the top targets of an anti-cancer drug. The results illustrate the effectiveness of our approach in predicting drug sensitivity from heterogeneous genomic datasets.http://europepmc.org/articles/PMC4076307?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Qian Wan
Ranadip Pal
spellingShingle Qian Wan
Ranadip Pal
An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
PLoS ONE
author_facet Qian Wan
Ranadip Pal
author_sort Qian Wan
title An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
title_short An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
title_full An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
title_fullStr An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
title_full_unstemmed An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
title_sort ensemble based top performing approach for nci-dream drug sensitivity prediction challenge.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description We consider the problem of predicting sensitivity of cancer cell lines to new drugs based on supervised learning on genomic profiles. The genetic and epigenetic characterization of a cell line provides observations on various aspects of regulation including DNA copy number variations, gene expression, DNA methylation and protein abundance. To extract relevant information from the various data types, we applied a random forest based approach to generate sensitivity predictions from each type of data and combined the predictions in a linear regression model to generate the final drug sensitivity prediction. Our approach when applied to the NCI-DREAM drug sensitivity prediction challenge was a top performer among 47 teams and produced high accuracy predictions. Our results show that the incorporation of multiple genomic characterizations lowered the mean and variance of the estimated bootstrap prediction error. We also applied our approach to the Cancer Cell Line Encyclopedia database for sensitivity prediction and the ability to extract the top targets of an anti-cancer drug. The results illustrate the effectiveness of our approach in predicting drug sensitivity from heterogeneous genomic datasets.
url http://europepmc.org/articles/PMC4076307?pdf=render
work_keys_str_mv AT qianwan anensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge
AT ranadippal anensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge
AT qianwan ensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge
AT ranadippal ensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge
_version_ 1725070985332260864