A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations
Over the past decades, the understanding of peptides and proteins biological functions has been an active research topic. Latest research works in this field have suggested that protein conformations may be a key feature for gaining insights into protein biological functions. However, analyzing smal...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9462920/ |
id |
doaj-67b6802432f44a2b8774289c3bc36cd3 |
---|---|
record_format |
Article |
spelling |
doaj-67b6802432f44a2b8774289c3bc36cd32021-07-01T23:00:43ZengIEEEIEEE Access2169-35362021-01-019921439215610.1109/ACCESS.2021.30919399462920A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D ConformationsAzzam Alwan0https://orcid.org/0000-0003-2531-2943Remi Cogranne1https://orcid.org/0000-0002-4205-4694Pierre Beauseroy2https://orcid.org/0000-0002-2883-1303Edith Grall-Maes3Nicolas Belloy4https://orcid.org/0000-0002-6003-3184Laurent Debelle5Stephanie Baud6https://orcid.org/0000-0002-4436-0652Manuel Dauchez7Sebastien Almagro8https://orcid.org/0000-0002-6974-1187Research Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceResearch Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceResearch Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceResearch Unit LIST3N, M2S laboratory, Troyes University of Technology, Troyes, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceUMR CNRS 7369 MEDyC, Université of Reims Champagne-Ardenne, Reims, FranceOver the past decades, the understanding of peptides and proteins biological functions has been an active research topic. Latest research works in this field have suggested that protein conformations may be a key feature for gaining insights into protein biological functions. However, analyzing small and highly flexible protein chunks, namely oligopeptides made of a handful of amino acids, remains challenging because of their dynamics and wide range of conformations. In this paper, a statistical methodology based on unsupervised statistical learning is proposed for analyzing 3D conformations small and highly flexible elastin-derived peptides. The goal of this study is twofold: first, is it aimed at identifying the most frequent conformations of each peptide and to study their stability. Second, and most important, it is aimed at comparing main conformations of different elastin-derived peptides to identify the “signature” than can be linked to a biological activity. The main strength of the present work is to propose a method for confirmation recognition that is not affected by peptide rotations or translations and, hence, avoids the use of the complex superposition methods. In addition, the proposed approach uses Kernel PCA to eliminate atypical peptide conformations. Due to the instability of those peptides, removing outliers is crucial since they may dramatically impact clustering results. To extract the most frequent conformations, we propose to use a hierarchical clustering method. Eventually, a peptide activity detector is defined based on comparison of main conformation found in different peptides. The main interests of the proposed method are twofold: first, it is fully automatic method, second, it does not require any additional information or expertise and, third, it can identify conformations accurately that make peptides enabling a given biological activity. Experimental results on a large dataset of peptides conformations highlight the relevance and efficiency of the proposed method.https://ieeexplore.ieee.org/document/9462920/Automatic conformation identificationhierarchical classificationflexible peptide conformationstructure classification of proteinoutliers detection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Azzam Alwan Remi Cogranne Pierre Beauseroy Edith Grall-Maes Nicolas Belloy Laurent Debelle Stephanie Baud Manuel Dauchez Sebastien Almagro |
spellingShingle |
Azzam Alwan Remi Cogranne Pierre Beauseroy Edith Grall-Maes Nicolas Belloy Laurent Debelle Stephanie Baud Manuel Dauchez Sebastien Almagro A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations IEEE Access Automatic conformation identification hierarchical classification flexible peptide conformation structure classification of protein outliers detection |
author_facet |
Azzam Alwan Remi Cogranne Pierre Beauseroy Edith Grall-Maes Nicolas Belloy Laurent Debelle Stephanie Baud Manuel Dauchez Sebastien Almagro |
author_sort |
Azzam Alwan |
title |
A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations |
title_short |
A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations |
title_full |
A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations |
title_fullStr |
A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations |
title_full_unstemmed |
A Fully Automatic and Efficient Methodology for Peptide Activity Identification Using Their 3D Conformations |
title_sort |
fully automatic and efficient methodology for peptide activity identification using their 3d conformations |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Over the past decades, the understanding of peptides and proteins biological functions has been an active research topic. Latest research works in this field have suggested that protein conformations may be a key feature for gaining insights into protein biological functions. However, analyzing small and highly flexible protein chunks, namely oligopeptides made of a handful of amino acids, remains challenging because of their dynamics and wide range of conformations. In this paper, a statistical methodology based on unsupervised statistical learning is proposed for analyzing 3D conformations small and highly flexible elastin-derived peptides. The goal of this study is twofold: first, is it aimed at identifying the most frequent conformations of each peptide and to study their stability. Second, and most important, it is aimed at comparing main conformations of different elastin-derived peptides to identify the “signature” than can be linked to a biological activity. The main strength of the present work is to propose a method for confirmation recognition that is not affected by peptide rotations or translations and, hence, avoids the use of the complex superposition methods. In addition, the proposed approach uses Kernel PCA to eliminate atypical peptide conformations. Due to the instability of those peptides, removing outliers is crucial since they may dramatically impact clustering results. To extract the most frequent conformations, we propose to use a hierarchical clustering method. Eventually, a peptide activity detector is defined based on comparison of main conformation found in different peptides. The main interests of the proposed method are twofold: first, it is fully automatic method, second, it does not require any additional information or expertise and, third, it can identify conformations accurately that make peptides enabling a given biological activity. Experimental results on a large dataset of peptides conformations highlight the relevance and efficiency of the proposed method. |
topic |
Automatic conformation identification hierarchical classification flexible peptide conformation structure classification of protein outliers detection |
url |
https://ieeexplore.ieee.org/document/9462920/ |
work_keys_str_mv |
AT azzamalwan afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT remicogranne afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT pierrebeauseroy afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT edithgrallmaes afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT nicolasbelloy afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT laurentdebelle afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT stephaniebaud afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT manueldauchez afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT sebastienalmagro afullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT azzamalwan fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT remicogranne fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT pierrebeauseroy fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT edithgrallmaes fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT nicolasbelloy fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT laurentdebelle fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT stephaniebaud fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT manueldauchez fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations AT sebastienalmagro fullyautomaticandefficientmethodologyforpeptideactivityidentificationusingtheir3dconformations |
_version_ |
1721345653052801024 |