An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.
We consider the problem of predicting sensitivity of cancer cell lines to new drugs based on supervised learning on genomic profiles. The genetic and epigenetic characterization of a cell line provides observations on various aspects of regulation including DNA copy number variations, gene expressio...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2014-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4076307?pdf=render |
id |
doaj-3e5a26ded33e4fcf88c61276935d1f37 |
---|---|
record_format |
Article |
spelling |
doaj-3e5a26ded33e4fcf88c61276935d1f372020-11-25T01:34:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0196e10118310.1371/journal.pone.0101183An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge.Qian WanRanadip PalWe consider the problem of predicting sensitivity of cancer cell lines to new drugs based on supervised learning on genomic profiles. The genetic and epigenetic characterization of a cell line provides observations on various aspects of regulation including DNA copy number variations, gene expression, DNA methylation and protein abundance. To extract relevant information from the various data types, we applied a random forest based approach to generate sensitivity predictions from each type of data and combined the predictions in a linear regression model to generate the final drug sensitivity prediction. Our approach when applied to the NCI-DREAM drug sensitivity prediction challenge was a top performer among 47 teams and produced high accuracy predictions. Our results show that the incorporation of multiple genomic characterizations lowered the mean and variance of the estimated bootstrap prediction error. We also applied our approach to the Cancer Cell Line Encyclopedia database for sensitivity prediction and the ability to extract the top targets of an anti-cancer drug. The results illustrate the effectiveness of our approach in predicting drug sensitivity from heterogeneous genomic datasets.http://europepmc.org/articles/PMC4076307?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Qian Wan Ranadip Pal |
spellingShingle |
Qian Wan Ranadip Pal An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge. PLoS ONE |
author_facet |
Qian Wan Ranadip Pal |
author_sort |
Qian Wan |
title |
An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge. |
title_short |
An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge. |
title_full |
An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge. |
title_fullStr |
An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge. |
title_full_unstemmed |
An ensemble based top performing approach for NCI-DREAM drug sensitivity prediction challenge. |
title_sort |
ensemble based top performing approach for nci-dream drug sensitivity prediction challenge. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2014-01-01 |
description |
We consider the problem of predicting sensitivity of cancer cell lines to new drugs based on supervised learning on genomic profiles. The genetic and epigenetic characterization of a cell line provides observations on various aspects of regulation including DNA copy number variations, gene expression, DNA methylation and protein abundance. To extract relevant information from the various data types, we applied a random forest based approach to generate sensitivity predictions from each type of data and combined the predictions in a linear regression model to generate the final drug sensitivity prediction. Our approach when applied to the NCI-DREAM drug sensitivity prediction challenge was a top performer among 47 teams and produced high accuracy predictions. Our results show that the incorporation of multiple genomic characterizations lowered the mean and variance of the estimated bootstrap prediction error. We also applied our approach to the Cancer Cell Line Encyclopedia database for sensitivity prediction and the ability to extract the top targets of an anti-cancer drug. The results illustrate the effectiveness of our approach in predicting drug sensitivity from heterogeneous genomic datasets. |
url |
http://europepmc.org/articles/PMC4076307?pdf=render |
work_keys_str_mv |
AT qianwan anensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge AT ranadippal anensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge AT qianwan ensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge AT ranadippal ensemblebasedtopperformingapproachforncidreamdrugsensitivitypredictionchallenge |
_version_ |
1725070985332260864 |