Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.

High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identific...

Full description

Bibliographic Details
Main Authors: Daniel R Ripoll, Sidhartha Chaudhury, Anders Wallqvist
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-03-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1008864
id doaj-67a41a9e057444e395ba458eda83019f
record_format Article
spelling doaj-67a41a9e057444e395ba458eda83019f2021-08-01T04:30:54ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582021-03-01173e100886410.1371/journal.pcbi.1008864Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.Daniel R RipollSidhartha ChaudhuryAnders WallqvistHigh-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a "fingerprint" to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71-96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification.https://doi.org/10.1371/journal.pcbi.1008864
collection DOAJ
language English
format Article
sources DOAJ
author Daniel R Ripoll
Sidhartha Chaudhury
Anders Wallqvist
spellingShingle Daniel R Ripoll
Sidhartha Chaudhury
Anders Wallqvist
Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
PLoS Computational Biology
author_facet Daniel R Ripoll
Sidhartha Chaudhury
Anders Wallqvist
author_sort Daniel R Ripoll
title Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
title_short Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
title_full Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
title_fullStr Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
title_full_unstemmed Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
title_sort using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2021-03-01
description High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a "fingerprint" to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71-96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification.
url https://doi.org/10.1371/journal.pcbi.1008864
work_keys_str_mv AT danielrripoll usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification
AT sidharthachaudhury usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification
AT anderswallqvist usingtheantibodyantigenbindinginterfacetotrainimagebaseddeepneuralnetworksforantibodyepitopeclassification
_version_ 1721246405512658944