Multi-omics integration for neuroblastoma clinical endpoint prediction

Abstract Background High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology...

Full description

Bibliographic Details
Main Authors: Margherita Francescatto, Marco Chierici, Setareh Rezvan Dezfooli, Alessandro Zandonà, Giuseppe Jurman, Cesare Furlanello
Format: Article
Language:English
Published: BMC 2018-04-01
Series:Biology Direct
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13062-018-0207-8
id doaj-67bf3d51548a49d990032bdb10ed61d5
record_format Article
spelling doaj-67bf3d51548a49d990032bdb10ed61d52020-11-25T01:02:59ZengBMCBiology Direct1745-61502018-04-0113111210.1186/s13062-018-0207-8Multi-omics integration for neuroblastoma clinical endpoint predictionMargherita Francescatto0Marco Chierici1Setareh Rezvan Dezfooli2Alessandro Zandonà3Giuseppe Jurman4Cesare Furlanello5Fondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerFondazione Bruno KesslerAbstract Background High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology, helping researchers and clinicians to develop personalized therapies. Results In the context of CAMDA 2017 Neuroblastoma Data Integration challenge, we explore the use of Integrative Network Fusion (INF), a bioinformatics framework combining a similarity network fusion with machine learning for the integration of multiple omics data. We apply the INF framework for the prediction of neuroblastoma patient outcome, integrating RNA-Seq, microarray and array comparative genomic hybridization data. We additionally explore the use of autoencoders as a method to integrate microarray expression and copy number data. Conclusions The INF method is effective for the integration of multiple data sources providing compact feature signatures for patient classification with performances comparable to other methods. Latent space representation of the integrated data provided by the autoencoder approach gives promising results, both by improving classification on survival endpoints and by providing means to discover two groups of patients characterized by distinct overall survival (OS) curves. Reviewers This article was reviewed by Djork-Arné Clevert and Tieliu Shi.http://link.springer.com/article/10.1186/s13062-018-0207-8NeuroblastomaIntegrationPredictionClassificationAutoencoder
collection DOAJ
language English
format Article
sources DOAJ
author Margherita Francescatto
Marco Chierici
Setareh Rezvan Dezfooli
Alessandro Zandonà
Giuseppe Jurman
Cesare Furlanello
spellingShingle Margherita Francescatto
Marco Chierici
Setareh Rezvan Dezfooli
Alessandro Zandonà
Giuseppe Jurman
Cesare Furlanello
Multi-omics integration for neuroblastoma clinical endpoint prediction
Biology Direct
Neuroblastoma
Integration
Prediction
Classification
Autoencoder
author_facet Margherita Francescatto
Marco Chierici
Setareh Rezvan Dezfooli
Alessandro Zandonà
Giuseppe Jurman
Cesare Furlanello
author_sort Margherita Francescatto
title Multi-omics integration for neuroblastoma clinical endpoint prediction
title_short Multi-omics integration for neuroblastoma clinical endpoint prediction
title_full Multi-omics integration for neuroblastoma clinical endpoint prediction
title_fullStr Multi-omics integration for neuroblastoma clinical endpoint prediction
title_full_unstemmed Multi-omics integration for neuroblastoma clinical endpoint prediction
title_sort multi-omics integration for neuroblastoma clinical endpoint prediction
publisher BMC
series Biology Direct
issn 1745-6150
publishDate 2018-04-01
description Abstract Background High-throughput methodologies such as microarrays and next-generation sequencing are routinely used in cancer research, generating complex data at different omics layers. The effective integration of omics data could provide a broader insight into the mechanisms of cancer biology, helping researchers and clinicians to develop personalized therapies. Results In the context of CAMDA 2017 Neuroblastoma Data Integration challenge, we explore the use of Integrative Network Fusion (INF), a bioinformatics framework combining a similarity network fusion with machine learning for the integration of multiple omics data. We apply the INF framework for the prediction of neuroblastoma patient outcome, integrating RNA-Seq, microarray and array comparative genomic hybridization data. We additionally explore the use of autoencoders as a method to integrate microarray expression and copy number data. Conclusions The INF method is effective for the integration of multiple data sources providing compact feature signatures for patient classification with performances comparable to other methods. Latent space representation of the integrated data provided by the autoencoder approach gives promising results, both by improving classification on survival endpoints and by providing means to discover two groups of patients characterized by distinct overall survival (OS) curves. Reviewers This article was reviewed by Djork-Arné Clevert and Tieliu Shi.
topic Neuroblastoma
Integration
Prediction
Classification
Autoencoder
url http://link.springer.com/article/10.1186/s13062-018-0207-8
work_keys_str_mv AT margheritafrancescatto multiomicsintegrationforneuroblastomaclinicalendpointprediction
AT marcochierici multiomicsintegrationforneuroblastomaclinicalendpointprediction
AT setarehrezvandezfooli multiomicsintegrationforneuroblastomaclinicalendpointprediction
AT alessandrozandona multiomicsintegrationforneuroblastomaclinicalendpointprediction
AT giuseppejurman multiomicsintegrationforneuroblastomaclinicalendpointprediction
AT cesarefurlanello multiomicsintegrationforneuroblastomaclinicalendpointprediction
_version_ 1725202924057919488