A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations

Over the past decades, the understanding of peptides and proteins biological functions has been an active research topic. Latest research works in this field have suggested that protein conformations may be a key feature for gaining insights into protein biological functions. However, analyzing smal...

Full description

Bibliographic Details
Main Authors: Azzam Alwan, Remi Cogranne, Pierre Beauseroy, Edith Grall-Maes, Nicolas Belloy, Laurent Debelle, Stephanie Baud, Manuel Dauchez, Sebastien Almagro
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9462920/
id doaj-67b6802432f44a2b8774289c3bc36cd3
record_format Article
spelling doaj-67b6802432f44a2b8774289c3bc36cd32021-07-01T23:00:43ZengIEEEIEEE Access2169-35362021-01-019921439215610.1109/ACCESS.2021.30919399462920A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D ConformationsAzzam Alwan0https://orcid.org/0000-0003-2531-2943Remi Cogranne1https://orcid.org/0000-0002-4205-4694Pierre Beauseroy2https://orcid.org/0000-0002-2883-1303Edith Grall-Maes3Nicolas Belloy4https://orcid.org/0000-0002-6003-3184Laurent Debelle5Stephanie Baud6https://orcid.org/0000-0002-4436-0652Manuel Dauchez7Sebastien Almagro8https://orcid.org/0000-0002-6974-1187Research Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceResearch Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceResearch Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceResearch Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceOver the past decades, the understanding of peptides and proteins biological functions has been an active research topic. Latest research works in this field have suggested that protein conformations may be a key feature for gaining insights into protein biological functions. However, analyzing small and highly flexible protein chunks, namely oligopeptides made of a handful of amino acids, remains challenging because of their dynamics and wide range of conformations. In this paper, a statistical methodology based on unsupervised statistical learning is proposed for analyzing 3D conformations small and highly flexible elastin-derived peptides. The goal of this study is twofold: first, is it aimed at identifying the most frequent conformations of each peptide and to study their stability. Second, and most important, it is aimed at comparing main conformations of different elastin-derived peptides to identify the “signature” than can be linked to a biological activity. The main strength of the present work is to propose a method for confirmation recognition that is not affected by peptide rotations or translations and, hence, avoids the use of the complex superposition methods. In addition, the proposed approach uses Kernel PCA to eliminate atypical peptide conformations. Due to the instability of those peptides, removing outliers is crucial since they may dramatically impact clustering results. To extract the most frequent conformations, we propose to use a hierarchical clustering method. Eventually, a peptide activity detector is defined based on comparison of main conformation found in different peptides. The main interests of the proposed method are twofold: first, it is fully automatic method, second, it does not require any additional information or expertise and, third, it can identify conformations accurately that make peptides enabling a given biological activity. Experimental results on a large dataset of peptides conformations highlight the relevance and efficiency of the proposed method.https://ieeexplore.ieee.org/document/9462920/Automatic conformation identificationhierarchical classificationflexible peptide conformationstructure classification of proteinoutliers detection
collection DOAJ
language English
format Article
sources DOAJ
author Azzam Alwan
Remi Cogranne
Pierre Beauseroy
Edith Grall-Maes
Nicolas Belloy
Laurent Debelle
Stephanie Baud
Manuel Dauchez
Sebastien Almagro
spellingShingle Azzam Alwan
Remi Cogranne
Pierre Beauseroy
Edith Grall-Maes
Nicolas Belloy
Laurent Debelle
Stephanie Baud
Manuel Dauchez
Sebastien Almagro
A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
IEEE Access
Automatic conformation identification
hierarchical classification
flexible peptide conformation
structure classification of protein
outliers detection
author_facet Azzam Alwan
Remi Cogranne
Pierre Beauseroy
Edith Grall-Maes
Nicolas Belloy
Laurent Debelle
Stephanie Baud
Manuel Dauchez
Sebastien Almagro
author_sort Azzam Alwan
title A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
title_short A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
title_full A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
title_fullStr A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
title_full_unstemmed A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
title_sort fully automatic and efficient methodology for peptide activity identification using their 3d conformations
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Over the past decades, the understanding of peptides and proteins biological functions has been an active research topic. Latest research works in this field have suggested that protein conformations may be a key feature for gaining insights into protein biological functions. However, analyzing small and highly flexible protein chunks, namely oligopeptides made of a handful of amino acids, remains challenging because of their dynamics and wide range of conformations. In this paper, a statistical methodology based on unsupervised statistical learning is proposed for analyzing 3D conformations small and highly flexible elastin-derived peptides. The goal of this study is twofold: first, is it aimed at identifying the most frequent conformations of each peptide and to study their stability. Second, and most important, it is aimed at comparing main conformations of different elastin-derived peptides to identify the “signature” than can be linked to a biological activity. The main strength of the present work is to propose a method for confirmation recognition that is not affected by peptide rotations or translations and, hence, avoids the use of the complex superposition methods. In addition, the proposed approach uses Kernel PCA to eliminate atypical peptide conformations. Due to the instability of those peptides, removing outliers is crucial since they may dramatically impact clustering results. To extract the most frequent conformations, we propose to use a hierarchical clustering method. Eventually, a peptide activity detector is defined based on comparison of main conformation found in different peptides. The main interests of the proposed method are twofold: first, it is fully automatic method, second, it does not require any additional information or expertise and, third, it can identify conformations accurately that make peptides enabling a given biological activity. Experimental results on a large dataset of peptides conformations highlight the relevance and efficiency of the proposed method.
topic Automatic conformation identification
hierarchical classification
flexible peptide conformation
structure classification of protein
outliers detection
url https://ieeexplore.ieee.org/document/9462920/
work_keys_str_mv AT azzamalwan afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT remicogranne afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT pierrebeauseroy afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT edithgrallmaes afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT nicolasbelloy afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT laurentdebelle afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT stephaniebaud afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT manueldauchez afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT sebastienalmagro afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT azzamalwan fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT remicogranne fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT pierrebeauseroy fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT edithgrallmaes fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT nicolasbelloy fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT laurentdebelle fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT stephaniebaud fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT manueldauchez fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
AT sebastienalmagro fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations
_version_ 1721345653052801024