Predicting the HMA-LMA Status in Marine Sponges by Machine Learning

The dichotomy between high microbial abundance (HMA) and low microbial abundance (LMA) sponges has been observed in sponge-microbe symbiosis, although the extent of this pattern remains poorly unknown. We characterized the differences between the microbiomes of HMA (n = 19) and LMA (n = 17) sponges...

Full description

Bibliographic Details
Main Authors: Lucas Moitinho-Silva, Georg Steinert, Shaun Nielsen, Cristiane C. P. Hardoim, Yu-Chen Wu, Grace P. McCormack, Susanna López-Legentil, Roman Marchant, Nicole Webster, Torsten Thomas, Ute Hentschel
Format: Article
Language:English
Published: Frontiers Media S.A. 2017-05-01
Series:Frontiers in Microbiology
Subjects:
Online Access:http://journal.frontiersin.org/article/10.3389/fmicb.2017.00752/full
id doaj-9ba3ebf8fc5d46a084b840cdb450951c
record_format Article
spelling doaj-9ba3ebf8fc5d46a084b840cdb450951c2020-11-24T22:39:22ZengFrontiers Media S.A.Frontiers in Microbiology1664-302X2017-05-01810.3389/fmicb.2017.00752258275Predicting the HMA-LMA Status in Marine Sponges by Machine LearningLucas Moitinho-Silva0Lucas Moitinho-Silva1Georg Steinert2Shaun Nielsen3Shaun Nielsen4Cristiane C. P. Hardoim5Yu-Chen Wu6Grace P. McCormack7Susanna López-Legentil8Roman Marchant9Nicole Webster10Nicole Webster11Torsten Thomas12Torsten Thomas13Ute Hentschel14Centre for Marine Bio-Innovation, University of New South WalesSydney, NSW, AustraliaSchool of Biological, Earth and Environmental Sciences, University of New South WalesSydney, NSW, AustraliaLaboratory of Microbiology, Wageningen UniversityWageningen, NetherlandsCentre for Marine Bio-Innovation, University of New South WalesSydney, NSW, AustraliaSchool of Biological, Earth and Environmental Sciences, University of New South WalesSydney, NSW, AustraliaDepartamento de Invertebrados, Museu Nacional, Universidade Federal do Rio de JaneiroRio de Janeiro, BrazilRD3 Marine Microbiology, GEOMAR Helmholtz Centre for Ocean Research Kiel and Christian-Albrechts UniversityKiel, GermanyZoology, Ryan Institute, School of Natural Sciences, National University of Ireland GalwayGalway, IrelandDepartment of Biology and Marine Biology, and Center for Marine Science, University of North CarolinaWilmington, NC, USACentre for Translational Data Science, School of Information Technologies, University of SydneySydney, NSW, AustraliaAustralian Institute of Marine ScienceTownsville, QLD, Australia0Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of QueenslandSt. Lucia, QLD, AustraliaCentre for Marine Bio-Innovation, University of New South WalesSydney, NSW, AustraliaSchool of Biological, Earth and Environmental Sciences, University of New South WalesSydney, NSW, AustraliaRD3 Marine Microbiology, GEOMAR Helmholtz Centre for Ocean Research Kiel and Christian-Albrechts UniversityKiel, GermanyThe dichotomy between high microbial abundance (HMA) and low microbial abundance (LMA) sponges has been observed in sponge-microbe symbiosis, although the extent of this pattern remains poorly unknown. We characterized the differences between the microbiomes of HMA (n = 19) and LMA (n = 17) sponges (575 specimens) present in the Sponge Microbiome Project. HMA sponges were associated with richer and more diverse microbiomes than LMA sponges, as indicated by the comparison of alpha diversity metrics. Microbial community structures differed between HMA and LMA sponges considering Operational Taxonomic Units (OTU) abundances and across microbial taxonomic levels, from phylum to species. The largest proportion of microbiome variation was explained by the host identity. Several phyla, classes, and OTUs were found differentially abundant in either group, which were considered “HMA indicators” and “LMA indicators.” Machine learning algorithms (classifiers) were trained to predict the HMA-LMA status of sponges. Among nine different classifiers, higher performances were achieved by Random Forest trained with phylum and class abundances. Random Forest with optimized parameters predicted the HMA-LMA status of additional 135 sponge species (1,232 specimens) without a priori knowledge. These sponges were grouped in four clusters, from which the largest two were composed of species consistently predicted as HMA (n = 44) and LMA (n = 74). In summary, our analyses shown distinct features of the microbial communities associated with HMA and LMA sponges. The prediction of the HMA-LMA status based on the microbiome profiles of sponges demonstrates the application of machine learning to explore patterns of host-associated microbial communities.http://journal.frontiersin.org/article/10.3389/fmicb.2017.00752/fullmarine spongesmicrobiome16S rRNA genemicrobial diversitysymbiosisrandom forest
collection DOAJ
language English
format Article
sources DOAJ
author Lucas Moitinho-Silva
Lucas Moitinho-Silva
Georg Steinert
Shaun Nielsen
Shaun Nielsen
Cristiane C. P. Hardoim
Yu-Chen Wu
Grace P. McCormack
Susanna López-Legentil
Roman Marchant
Nicole Webster
Nicole Webster
Torsten Thomas
Torsten Thomas
Ute Hentschel
spellingShingle Lucas Moitinho-Silva
Lucas Moitinho-Silva
Georg Steinert
Shaun Nielsen
Shaun Nielsen
Cristiane C. P. Hardoim
Yu-Chen Wu
Grace P. McCormack
Susanna López-Legentil
Roman Marchant
Nicole Webster
Nicole Webster
Torsten Thomas
Torsten Thomas
Ute Hentschel
Predicting the HMA-LMA Status in Marine Sponges by Machine Learning
Frontiers in Microbiology
marine sponges
microbiome
16S rRNA gene
microbial diversity
symbiosis
random forest
author_facet Lucas Moitinho-Silva
Lucas Moitinho-Silva
Georg Steinert
Shaun Nielsen
Shaun Nielsen
Cristiane C. P. Hardoim
Yu-Chen Wu
Grace P. McCormack
Susanna López-Legentil
Roman Marchant
Nicole Webster
Nicole Webster
Torsten Thomas
Torsten Thomas
Ute Hentschel
author_sort Lucas Moitinho-Silva
title Predicting the HMA-LMA Status in Marine Sponges by Machine Learning
title_short Predicting the HMA-LMA Status in Marine Sponges by Machine Learning
title_full Predicting the HMA-LMA Status in Marine Sponges by Machine Learning
title_fullStr Predicting the HMA-LMA Status in Marine Sponges by Machine Learning
title_full_unstemmed Predicting the HMA-LMA Status in Marine Sponges by Machine Learning
title_sort predicting the hma-lma status in marine sponges by machine learning
publisher Frontiers Media S.A.
series Frontiers in Microbiology
issn 1664-302X
publishDate 2017-05-01
description The dichotomy between high microbial abundance (HMA) and low microbial abundance (LMA) sponges has been observed in sponge-microbe symbiosis, although the extent of this pattern remains poorly unknown. We characterized the differences between the microbiomes of HMA (n = 19) and LMA (n = 17) sponges (575 specimens) present in the Sponge Microbiome Project. HMA sponges were associated with richer and more diverse microbiomes than LMA sponges, as indicated by the comparison of alpha diversity metrics. Microbial community structures differed between HMA and LMA sponges considering Operational Taxonomic Units (OTU) abundances and across microbial taxonomic levels, from phylum to species. The largest proportion of microbiome variation was explained by the host identity. Several phyla, classes, and OTUs were found differentially abundant in either group, which were considered “HMA indicators” and “LMA indicators.” Machine learning algorithms (classifiers) were trained to predict the HMA-LMA status of sponges. Among nine different classifiers, higher performances were achieved by Random Forest trained with phylum and class abundances. Random Forest with optimized parameters predicted the HMA-LMA status of additional 135 sponge species (1,232 specimens) without a priori knowledge. These sponges were grouped in four clusters, from which the largest two were composed of species consistently predicted as HMA (n = 44) and LMA (n = 74). In summary, our analyses shown distinct features of the microbial communities associated with HMA and LMA sponges. The prediction of the HMA-LMA status based on the microbiome profiles of sponges demonstrates the application of machine learning to explore patterns of host-associated microbial communities.
topic marine sponges
microbiome
16S rRNA gene
microbial diversity
symbiosis
random forest
url http://journal.frontiersin.org/article/10.3389/fmicb.2017.00752/full
work_keys_str_mv AT lucasmoitinhosilva predictingthehmalmastatusinmarinespongesbymachinelearning
AT lucasmoitinhosilva predictingthehmalmastatusinmarinespongesbymachinelearning
AT georgsteinert predictingthehmalmastatusinmarinespongesbymachinelearning
AT shaunnielsen predictingthehmalmastatusinmarinespongesbymachinelearning
AT shaunnielsen predictingthehmalmastatusinmarinespongesbymachinelearning
AT cristianecphardoim predictingthehmalmastatusinmarinespongesbymachinelearning
AT yuchenwu predictingthehmalmastatusinmarinespongesbymachinelearning
AT gracepmccormack predictingthehmalmastatusinmarinespongesbymachinelearning
AT susannalopezlegentil predictingthehmalmastatusinmarinespongesbymachinelearning
AT romanmarchant predictingthehmalmastatusinmarinespongesbymachinelearning
AT nicolewebster predictingthehmalmastatusinmarinespongesbymachinelearning
AT nicolewebster predictingthehmalmastatusinmarinespongesbymachinelearning
AT torstenthomas predictingthehmalmastatusinmarinespongesbymachinelearning
AT torstenthomas predictingthehmalmastatusinmarinespongesbymachinelearning
AT utehentschel predictingthehmalmastatusinmarinespongesbymachinelearning
_version_ 1725709255802093568