Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of ‘omics data measure different but related biolo...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2017-07-01
|
Series: | Cancer Informatics |
Online Access: | https://doi.org/10.1177/1176935117718517 |
id |
doaj-a9f316d8373a4ee7b452b36b6137f723 |
---|---|
record_format |
Article |
spelling |
doaj-a9f316d8373a4ee7b452b36b6137f7232020-11-25T03:33:45ZengSAGE PublishingCancer Informatics1176-93512017-07-011610.1177/1176935117718517Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient SurvivalAdam KaplanEric F LockPredictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of ‘omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict.https://doi.org/10.1177/1176935117718517 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Adam Kaplan Eric F Lock |
spellingShingle |
Adam Kaplan Eric F Lock Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival Cancer Informatics |
author_facet |
Adam Kaplan Eric F Lock |
author_sort |
Adam Kaplan |
title |
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival |
title_short |
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival |
title_full |
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival |
title_fullStr |
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival |
title_full_unstemmed |
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival |
title_sort |
prediction with dimension reduction of multiple molecular data sources for patient survival |
publisher |
SAGE Publishing |
series |
Cancer Informatics |
issn |
1176-9351 |
publishDate |
2017-07-01 |
description |
Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of ‘omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict. |
url |
https://doi.org/10.1177/1176935117718517 |
work_keys_str_mv |
AT adamkaplan predictionwithdimensionreductionofmultiplemoleculardatasourcesforpatientsurvival AT ericflock predictionwithdimensionreductionofmultiplemoleculardatasourcesforpatientsurvival |
_version_ |
1724561864186134528 |