Mining folded proteomes in the era of accurate structure prediction

Bibliographic Details
Main Authors: Bayly-Jones, C. (Author), Whisstock, J.C. (Author)
Format: Article
Language: English
Published: Public Library of Science 2022
Description
Summary: Protein structure fundamentally underpins the function and processes of numerous biological systems. Fold recognition algorithms offer a sensitive and robust tool to detect structural, and thereby functional, similarities between distantly related homologs. In the era of accurate structure prediction, owing to advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become a rich source of biological information. Here, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database to identify novel protein family members, infer function, and group predicted protein structures. As an example of the utility of this approach, we identify previously unknown members of various pore-forming protein families, including MACPFs, GSDMs, and aerolysin-like proteins. © 2022 Bayly-Jones, Whisstock. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
ISSN: 1553-734X
DOI: 10.1371/journal.pcbi.1009930