P-value based visualization of codon usage data

<p>Abstract</p> <p>Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particula...

Full description

Bibliographic Details
Main Authors: Fricke Wolfgang, Brodag Thomas, Meinicke Peter, Waack Stephan
Format: Article
Language:English
Published: BMC 2006-06-01
Series:Algorithms for Molecular Biology
Online Access:http://www.almob.org/content/1/1/10
id doaj-0b848014e32349daaf39da359d32fb2d
record_format Article
spelling doaj-0b848014e32349daaf39da359d32fb2d2020-11-24T23:52:32ZengBMCAlgorithms for Molecular Biology1748-71882006-06-01111010.1186/1748-7188-1-10P-value based visualization of codon usage dataFricke WolfgangBrodag ThomasMeinicke PeterWaack Stephan<p>Abstract</p> <p>Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases.</p> http://www.almob.org/content/1/1/10
collection DOAJ
language English
format Article
sources DOAJ
author Fricke Wolfgang
Brodag Thomas
Meinicke Peter
Waack Stephan
spellingShingle Fricke Wolfgang
Brodag Thomas
Meinicke Peter
Waack Stephan
P-value based visualization of codon usage data
Algorithms for Molecular Biology
author_facet Fricke Wolfgang
Brodag Thomas
Meinicke Peter
Waack Stephan
author_sort Fricke Wolfgang
title P-value based visualization of codon usage data
title_short P-value based visualization of codon usage data
title_full P-value based visualization of codon usage data
title_fullStr P-value based visualization of codon usage data
title_full_unstemmed P-value based visualization of codon usage data
title_sort p-value based visualization of codon usage data
publisher BMC
series Algorithms for Molecular Biology
issn 1748-7188
publishDate 2006-06-01
description <p>Abstract</p> <p>Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases.</p>
url http://www.almob.org/content/1/1/10
work_keys_str_mv AT frickewolfgang pvaluebasedvisualizationofcodonusagedata
AT brodagthomas pvaluebasedvisualizationofcodonusagedata
AT meinickepeter pvaluebasedvisualizationofcodonusagedata
AT waackstephan pvaluebasedvisualizationofcodonusagedata
_version_ 1725473225679306752