Functional and genomic analyses of alpha-solenoid proteins.

Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we ex...

Full description

Bibliographic Details
Main Authors: David Fournier, Gareth A Palidwor, Sergey Shcherbinin, Angelika Szengel, Martin H Schaefer, Carol Perez-Iratxeta, Miguel A Andrade-Navarro
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24278209/pdf/?tool=EBI
id doaj-88a72ad9995a458a8a90390d1b8e954e
record_format Article
spelling doaj-88a72ad9995a458a8a90390d1b8e954e2021-03-04T10:14:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-01811e7989410.1371/journal.pone.0079894Functional and genomic analyses of alpha-solenoid proteins.David FournierGareth A PalidworSergey ShcherbininAngelika SzengelMartin H SchaeferCarol Perez-IratxetaMiguel A Andrade-NavarroAlpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24278209/pdf/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author David Fournier
Gareth A Palidwor
Sergey Shcherbinin
Angelika Szengel
Martin H Schaefer
Carol Perez-Iratxeta
Miguel A Andrade-Navarro
spellingShingle David Fournier
Gareth A Palidwor
Sergey Shcherbinin
Angelika Szengel
Martin H Schaefer
Carol Perez-Iratxeta
Miguel A Andrade-Navarro
Functional and genomic analyses of alpha-solenoid proteins.
PLoS ONE
author_facet David Fournier
Gareth A Palidwor
Sergey Shcherbinin
Angelika Szengel
Martin H Schaefer
Carol Perez-Iratxeta
Miguel A Andrade-Navarro
author_sort David Fournier
title Functional and genomic analyses of alpha-solenoid proteins.
title_short Functional and genomic analyses of alpha-solenoid proteins.
title_full Functional and genomic analyses of alpha-solenoid proteins.
title_fullStr Functional and genomic analyses of alpha-solenoid proteins.
title_full_unstemmed Functional and genomic analyses of alpha-solenoid proteins.
title_sort functional and genomic analyses of alpha-solenoid proteins.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Alpha-solenoids are flexible protein structural domains formed by ensembles of alpha-helical repeats (Armadillo and HEAT repeats among others). While homology can be used to detect many of these repeats, some alpha-solenoids have very little sequence homology to proteins of known structure and we expect that many remain undetected. We previously developed a method for detection of alpha-helical repeats based on a neural network trained on a dataset of protein structures. Here we improved the detection algorithm and updated the training dataset using recently solved structures of alpha-solenoids. Unexpectedly, we identified occurrences of alpha-solenoids in solved protein structures that escaped attention, for example within the core of the catalytic subunit of PI3KC. Our results expand the current set of known alpha-solenoids. Application of our tool to the protein universe allowed us to detect their significant enrichment in proteins interacting with many proteins, confirming that alpha-solenoids are generally involved in protein-protein interactions. We then studied the taxonomic distribution of alpha-solenoids to discuss an evolutionary scenario for the emergence of this type of domain, speculating that alpha-solenoids have emerged in multiple taxa in independent events by convergent evolution. We observe a higher rate of alpha-solenoids in eukaryotic genomes and in some prokaryotic families, such as Cyanobacteria and Planctomycetes, which could be associated to increased cellular complexity. The method is available at http://cbdm.mdc-berlin.de/~ard2/.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24278209/pdf/?tool=EBI
work_keys_str_mv AT davidfournier functionalandgenomicanalysesofalphasolenoidproteins
AT garethapalidwor functionalandgenomicanalysesofalphasolenoidproteins
AT sergeyshcherbinin functionalandgenomicanalysesofalphasolenoidproteins
AT angelikaszengel functionalandgenomicanalysesofalphasolenoidproteins
AT martinhschaefer functionalandgenomicanalysesofalphasolenoidproteins
AT carolpereziratxeta functionalandgenomicanalysesofalphasolenoidproteins
AT miguelaandradenavarro functionalandgenomicanalysesofalphasolenoidproteins
_version_ 1714806469545689088