Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.

Determination of metagenome composition is still one of the most interesting problems of bioinformatics. It involves a wide range of mathematical methods, from probabilistic models of combinatorics to cluster analysis and pattern recognition techniques. The successful advance of rapid sequencing met...

Full description

Bibliographic Details
Main Authors: Valery Kirzhner, Dvora Toledano-Kitai, Zeev Volkovich
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0237205
id doaj-d4576561e2ec4bf683a16e50fc9da594
record_format Article
spelling doaj-d4576561e2ec4bf683a16e50fc9da5942021-03-04T12:26:41ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011511e023720510.1371/journal.pone.0237205Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.Valery KirzhnerDvora Toledano-KitaiZeev VolkovichDetermination of metagenome composition is still one of the most interesting problems of bioinformatics. It involves a wide range of mathematical methods, from probabilistic models of combinatorics to cluster analysis and pattern recognition techniques. The successful advance of rapid sequencing methods and fast and precise metagenome analysis will increase the diagnostic value of healthy or pathological human metagenomes. The article presents the theoretical foundations of the algorithm for calculating the number of different genomes in the medium under study. The approach is based on analysis of the compositional spectra of subsequently sequenced samples of the medium. Its essential feature is using random fluctuations in the bacteria number in different samples of the same metagenome. The possibility of effective implementation of the algorithm in the presence of data errors is also discussed. In the work, the algorithm of a metagenome evaluation is described, including the estimation of the genome number and the identification of the genomes with known compositional spectra. It should be emphasized that evaluating the genome number in a metagenome can be always helpful, regardless of the metagenome separation techniques, such as clustering the sequencing results or marker analysis.https://doi.org/10.1371/journal.pone.0237205
collection DOAJ
language English
format Article
sources DOAJ
author Valery Kirzhner
Dvora Toledano-Kitai
Zeev Volkovich
spellingShingle Valery Kirzhner
Dvora Toledano-Kitai
Zeev Volkovich
Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
PLoS ONE
author_facet Valery Kirzhner
Dvora Toledano-Kitai
Zeev Volkovich
author_sort Valery Kirzhner
title Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
title_short Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
title_full Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
title_fullStr Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
title_full_unstemmed Evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
title_sort evaluating the number of different genomes in a metagenome by means of the compositional spectra approach.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description Determination of metagenome composition is still one of the most interesting problems of bioinformatics. It involves a wide range of mathematical methods, from probabilistic models of combinatorics to cluster analysis and pattern recognition techniques. The successful advance of rapid sequencing methods and fast and precise metagenome analysis will increase the diagnostic value of healthy or pathological human metagenomes. The article presents the theoretical foundations of the algorithm for calculating the number of different genomes in the medium under study. The approach is based on analysis of the compositional spectra of subsequently sequenced samples of the medium. Its essential feature is using random fluctuations in the bacteria number in different samples of the same metagenome. The possibility of effective implementation of the algorithm in the presence of data errors is also discussed. In the work, the algorithm of a metagenome evaluation is described, including the estimation of the genome number and the identification of the genomes with known compositional spectra. It should be emphasized that evaluating the genome number in a metagenome can be always helpful, regardless of the metagenome separation techniques, such as clustering the sequencing results or marker analysis.
url https://doi.org/10.1371/journal.pone.0237205
work_keys_str_mv AT valerykirzhner evaluatingthenumberofdifferentgenomesinametagenomebymeansofthecompositionalspectraapproach
AT dvoratoledanokitai evaluatingthenumberofdifferentgenomesinametagenomebymeansofthecompositionalspectraapproach
AT zeevvolkovich evaluatingthenumberofdifferentgenomesinametagenomebymeansofthecompositionalspectraapproach
_version_ 1714802789558779904