Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival

Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of ‘omics data measure different but related biolo...

Full description

Bibliographic Details
Main Authors: Adam Kaplan, Eric F Lock
Format: Article
Language:English
Published: SAGE Publishing 2017-07-01
Series:Cancer Informatics
Online Access:https://doi.org/10.1177/1176935117718517
id doaj-a9f316d8373a4ee7b452b36b6137f723
record_format Article
spelling doaj-a9f316d8373a4ee7b452b36b6137f7232020-11-25T03:33:45ZengSAGE PublishingCancer Informatics1176-93512017-07-011610.1177/1176935117718517Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient SurvivalAdam KaplanEric F LockPredictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of ‘omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict.https://doi.org/10.1177/1176935117718517
collection DOAJ
language English
format Article
sources DOAJ
author Adam Kaplan
Eric F Lock
spellingShingle Adam Kaplan
Eric F Lock
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
Cancer Informatics
author_facet Adam Kaplan
Eric F Lock
author_sort Adam Kaplan
title Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
title_short Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
title_full Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
title_fullStr Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
title_full_unstemmed Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival
title_sort prediction with dimension reduction of multiple molecular data sources for patient survival
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2017-07-01
description Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of ‘omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict.
url https://doi.org/10.1177/1176935117718517
work_keys_str_mv AT adamkaplan predictionwithdimensionreductionofmultiplemoleculardatasourcesforpatientsurvival
AT ericflock predictionwithdimensionreductionofmultiplemoleculardatasourcesforpatientsurvival
_version_ 1724561864186134528