Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.

Prediction of key features of protein structures, such as secondary structure, solvent accessibility and number of contacts between residues, provides useful structural constraints for comparative modeling, fold recognition, ab-initio fold prediction and detection of remote relationships. In this st...

Full description

Bibliographic Details
Main Authors: Guillaume Fourty, Isabelle Callebaut, Jean-Paul Mornon
Format: Article
Language:English
Published: SAGE Publishing 2008-01-01
Series:Bioinformatics and Biology Insights
Online Access:https://doi.org/10.4137/BBI.S426
id doaj-125bfb9288a849f4ba590010fde8df66
record_format Article
spelling doaj-125bfb9288a849f4ba590010fde8df662020-11-25T03:40:30ZengSAGE PublishingBioinformatics and Biology Insights1177-93222008-01-01210.4137/BBI.S426Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.Guillaume Fourty0Isabelle Callebaut1Jean-Paul Mornon2Département de Biologie Structurale, Institut de Minéralogie et de Physique des Milieux Condensés (IMPMC), CNRS UMR 7590 — Universités Paris 6/Paris 7, France.Département de Biologie Structurale, Institut de Minéralogie et de Physique des Milieux Condensés (IMPMC), CNRS UMR 7590 — Universités Paris 6/Paris 7, France.Département de Biologie Structurale, Institut de Minéralogie et de Physique des Milieux Condensés (IMPMC), CNRS UMR 7590 — Universités Paris 6/Paris 7, France.Prediction of key features of protein structures, such as secondary structure, solvent accessibility and number of contacts between residues, provides useful structural constraints for comparative modeling, fold recognition, ab-initio fold prediction and detection of remote relationships. In this study, we aim at characterizing the number of non-trivial close neighbors, or long-range contacts of a residue, as a function of its “topohydrophobic” index deduced from multiple sequence alignments and of the secondary structure in which it is embedded. The “topohydrophobic” index is calculated using a two-class distribution of amino acids, based on their mean atom depths. From a large set of structural alignments processed from the FSSP database, we selected 1485 structural sub-families including at least 8 members, with accurate alignments and limited redundancy. We show that residues within helices, even when deeply buried, have few non-trivial neighbors (0–2), whereas β-strand residues clearly exhibit a multimodal behavior, dominated by the local geometry of the tetrahedron (3 non-trivial close neighbors associated with one tetrahedron; 6 with two tetrahedra). This observed behavior allows the distinction, from sequence profiles, between edge and central β-strands within β-sheets. Useful topological constraints on the immediate neighborhood of an amino acid, but also on its correlated solvent accessibility, can thus be derived using this approach, from the simple knowledge of multiple sequence alignments.https://doi.org/10.4137/BBI.S426
collection DOAJ
language English
format Article
sources DOAJ
author Guillaume Fourty
Isabelle Callebaut
Jean-Paul Mornon
spellingShingle Guillaume Fourty
Isabelle Callebaut
Jean-Paul Mornon
Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
Bioinformatics and Biology Insights
author_facet Guillaume Fourty
Isabelle Callebaut
Jean-Paul Mornon
author_sort Guillaume Fourty
title Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
title_short Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
title_full Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
title_fullStr Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
title_full_unstemmed Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
title_sort characterization of non-trivial neighborhood fold constraints from protein sequences using generalized topohydrophobicity.
publisher SAGE Publishing
series Bioinformatics and Biology Insights
issn 1177-9322
publishDate 2008-01-01
description Prediction of key features of protein structures, such as secondary structure, solvent accessibility and number of contacts between residues, provides useful structural constraints for comparative modeling, fold recognition, ab-initio fold prediction and detection of remote relationships. In this study, we aim at characterizing the number of non-trivial close neighbors, or long-range contacts of a residue, as a function of its “topohydrophobic” index deduced from multiple sequence alignments and of the secondary structure in which it is embedded. The “topohydrophobic” index is calculated using a two-class distribution of amino acids, based on their mean atom depths. From a large set of structural alignments processed from the FSSP database, we selected 1485 structural sub-families including at least 8 members, with accurate alignments and limited redundancy. We show that residues within helices, even when deeply buried, have few non-trivial neighbors (0–2), whereas β-strand residues clearly exhibit a multimodal behavior, dominated by the local geometry of the tetrahedron (3 non-trivial close neighbors associated with one tetrahedron; 6 with two tetrahedra). This observed behavior allows the distinction, from sequence profiles, between edge and central β-strands within β-sheets. Useful topological constraints on the immediate neighborhood of an amino acid, but also on its correlated solvent accessibility, can thus be derived using this approach, from the simple knowledge of multiple sequence alignments.
url https://doi.org/10.4137/BBI.S426
work_keys_str_mv AT guillaumefourty characterizationofnontrivialneighborhoodfoldconstraintsfromproteinsequencesusinggeneralizedtopohydrophobicity
AT isabellecallebaut characterizationofnontrivialneighborhoodfoldconstraintsfromproteinsequencesusinggeneralizedtopohydrophobicity
AT jeanpaulmornon characterizationofnontrivialneighborhoodfoldconstraintsfromproteinsequencesusinggeneralizedtopohydrophobicity
_version_ 1724534436790272000