<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments

Abstract Background A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-va...

Full description

Bibliographic Details
Main Authors:	Zwick Matthias, Merkl Rainer
Format:	Article
Language:	English
Published:	BMC 2008-03-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/9/151

id	doaj-7eb245349d7244a8babcc2cd25d1f443
record_format	Article
spelling	doaj-7eb245349d7244a8babcc2cd25d1f4432020-11-24T21:37:56ZengBMCBMC Bioinformatics1471-21052008-03-019115110.1186/1471-2105-9-151<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignmentsZwick MatthiasMerkl Rainer<p>Abstract</p> <p>Background</p> <p>A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues.</p> <p>Results</p> <p>We introduce <it>conn(k)</it>, a novel parameter for the characterisation of individual residues. For each residue <it>k</it>, <it>conn(k) </it>is the number of most extreme signals of co-evolution. These signals were deduced from a normalised mutual information (<it>MI</it>) value <it>U</it>(<it>k</it>, <it>l</it>) computed for all pairs of residues <it>k</it>, <it>l</it>. We demonstrate that <it>conn(k) </it>is a more robust indicator than an individual <it>MI</it>-value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server, which computes <it>conn(k)</it>-values is available at <url>http://www-bioinf.uni-regensburg.de</url>.</p> <p>Conclusion</p> <p>The algorithms <it>H2r</it>, which analyses MSAs and computes <it>conn(k)</it>-values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by <it>conn(k)</it>.</p> http://www.biomedcentral.com/1471-2105/9/151
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zwick Matthias Merkl Rainer
spellingShingle	Zwick Matthias Merkl Rainer <it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments BMC Bioinformatics
author_facet	Zwick Matthias Merkl Rainer
author_sort	Zwick Matthias
title	<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
title_short	<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
title_full	<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
title_fullStr	<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
title_full_unstemmed	<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
title_sort	<it>h2r</it>: identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2008-03-01
description	<p>Abstract</p> <p>Background</p> <p>A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues.</p> <p>Results</p> <p>We introduce <it>conn(k)</it>, a novel parameter for the characterisation of individual residues. For each residue <it>k</it>, <it>conn(k) </it>is the number of most extreme signals of co-evolution. These signals were deduced from a normalised mutual information (<it>MI</it>) value <it>U</it>(<it>k</it>, <it>l</it>) computed for all pairs of residues <it>k</it>, <it>l</it>. We demonstrate that <it>conn(k) </it>is a more robust indicator than an individual <it>MI</it>-value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server, which computes <it>conn(k)</it>-values is available at <url>http://www-bioinf.uni-regensburg.de</url>.</p> <p>Conclusion</p> <p>The algorithms <it>H2r</it>, which analyses MSAs and computes <it>conn(k)</it>-values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by <it>conn(k)</it>.</p>
url	http://www.biomedcentral.com/1471-2105/9/151
work_keys_str_mv	AT zwickmatthias ith2ritidentificationofevolutionaryimportantresiduesbymeansofanentropybasedanalysisofmultiplesequencealignments AT merklrainer ith2ritidentificationofevolutionaryimportantresiduesbymeansofanentropybasedanalysisofmultiplesequencealignments
_version_	1725936325530484736

<it>H2r</it>: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments

Similar Items