Multi-omics integration for neuroblastoma clinical endpoint prediction
Abstract Background High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-04-01
|
Series: | Biology Direct |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13062-018-0207-8 |
id |
doaj-67bf3d51548a49d990032bdb10ed61d5 |
---|---|
record_format |
Article |
spelling |
doaj-67bf3d51548a49d990032bdb10ed61d52020-11-25T01:02:59ZengBMCBiology Direct1745-61502018-04-0113111210.1186/s13062-018-0207-8Multi-omics integration for neuroblastoma clinical endpoint predictionMargherita Francescatto0Marco Chierici1Setareh Rezvan Dezfooli2Alessandro Zandonà3Giuseppe Jurman4Cesare Furlanello5Fondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerAbstract Background High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology, helping researchers and clinicians to develop personalized therapies. Results In the context of CAMDA 2017 Neuroblastoma Data Integration challenge, we explore the use of Integrative Network Fusion (INF), a bioinformatics framework combining a similarity network fusion with machine learning for the integration of multiple omics data. We apply the INF framework for the prediction of neuroblastoma patient outcome, integrating RNA-Seq, microarray and array comparative genomic hybridization data. We additionally explore the use of autoencoders as a method to integrate microarray expression and copy number data. Conclusions The INF method is effective for the integration of multiple data sources providing compact feature signatures for patient classification with performances comparable to other methods. Latent space representation of the integrated data provided by the autoencoder approach gives promising results, both by improving classification on survival endpoints and by providing means to discover two groups of patients characterized by distinct overall survival (OS) curves. Reviewers This article was reviewed by Djork-Arné Clevert and Tieliu Shi.http://link.springer.com/article/10.1186/s13062-018-0207-8NeuroblastomaIntegrationPredictionClassificationAutoencoder |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Margherita Francescatto Marco Chierici Setareh Rezvan Dezfooli Alessandro Zandonà Giuseppe Jurman Cesare Furlanello |
spellingShingle |
Margherita Francescatto Marco Chierici Setareh Rezvan Dezfooli Alessandro Zandonà Giuseppe Jurman Cesare Furlanello Multi-omics integration for neuroblastoma clinical endpoint prediction Biology Direct Neuroblastoma Integration Prediction Classification Autoencoder |
author_facet |
Margherita Francescatto Marco Chierici Setareh Rezvan Dezfooli Alessandro Zandonà Giuseppe Jurman Cesare Furlanello |
author_sort |
Margherita Francescatto |
title |
Multi-omics integration for neuroblastoma clinical endpoint prediction |
title_short |
Multi-omics integration for neuroblastoma clinical endpoint prediction |
title_full |
Multi-omics integration for neuroblastoma clinical endpoint prediction |
title_fullStr |
Multi-omics integration for neuroblastoma clinical endpoint prediction |
title_full_unstemmed |
Multi-omics integration for neuroblastoma clinical endpoint prediction |
title_sort |
multi-omics integration for neuroblastoma clinical endpoint prediction |
publisher |
BMC |
series |
Biology Direct |
issn |
1745-6150 |
publishDate |
2018-04-01 |
description |
Abstract Background High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology, helping researchers and clinicians to develop personalized therapies. Results In the context of CAMDA 2017 Neuroblastoma Data Integration challenge, we explore the use of Integrative Network Fusion (INF), a bioinformatics framework combining a similarity network fusion with machine learning for the integration of multiple omics data. We apply the INF framework for the prediction of neuroblastoma patient outcome, integrating RNA-Seq, microarray and array comparative genomic hybridization data. We additionally explore the use of autoencoders as a method to integrate microarray expression and copy number data. Conclusions The INF method is effective for the integration of multiple data sources providing compact feature signatures for patient classification with performances comparable to other methods. Latent space representation of the integrated data provided by the autoencoder approach gives promising results, both by improving classification on survival endpoints and by providing means to discover two groups of patients characterized by distinct overall survival (OS) curves. Reviewers This article was reviewed by Djork-Arné Clevert and Tieliu Shi. |
topic |
Neuroblastoma Integration Prediction Classification Autoencoder |
url |
http://link.springer.com/article/10.1186/s13062-018-0207-8 |
work_keys_str_mv |
AT margheritafrancescatto multiomicsintegrationforneuroblastomaclinicalendpointprediction AT marcochierici multiomicsintegrationforneuroblastomaclinicalendpointprediction AT setarehrezvandezfooli multiomicsintegrationforneuroblastomaclinicalendpointprediction AT alessandrozandona multiomicsintegrationforneuroblastomaclinicalendpointprediction AT giuseppejurman multiomicsintegrationforneuroblastomaclinicalendpointprediction AT cesarefurlanello multiomicsintegrationforneuroblastomaclinicalendpointprediction |
_version_ |
1725202924057919488 |