Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs

Abstract The global population is at present suffering from a pandemic of Coronavirus disease 2019 (COVID-19), caused by the novel coronavirus Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). The goal of this study was to use artificial intelligence (AI) to predict blueprints for design...

Full description

Bibliographic Details
Main Authors: Brandon Malone, Boris Simovski, Clément Moliné, Jun Cheng, Marius Gheorghe, Hugues Fontenelle, Ioannis Vardaxis, Simen Tennøe, Jenny-Ann Malmberg, Richard Stratford, Trevor Clancy
Format: Article
Language:English
Published: Nature Publishing Group 2020-12-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-78758-5
id doaj-b6083f2efc1b428fbf2eba0c45005e9d
record_format Article
spelling doaj-b6083f2efc1b428fbf2eba0c45005e9d2020-12-27T12:18:00ZengNature Publishing GroupScientific Reports2045-23222020-12-0110111410.1038/s41598-020-78758-5Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designsBrandon Malone0Boris Simovski1Clément Moliné2Jun Cheng3Marius Gheorghe4Hugues Fontenelle5Ioannis Vardaxis6Simen Tennøe7Jenny-Ann Malmberg8Richard Stratford9Trevor Clancy10NEC Laboratories Europe GmbHNEC OncoImmunity ASNEC OncoImmunity ASNEC Laboratories Europe GmbHNEC OncoImmunity ASNEC OncoImmunity ASNEC OncoImmunity ASNEC OncoImmunity ASNEC OncoImmunity ASNEC OncoImmunity ASNEC OncoImmunity ASAbstract The global population is at present suffering from a pandemic of Coronavirus disease 2019 (COVID-19), caused by the novel coronavirus Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). The goal of this study was to use artificial intelligence (AI) to predict blueprints for designing universal vaccines against SARS-CoV-2, that contain a sufficiently broad repertoire of T-cell epitopes capable of providing coverage and protection across the global population. To help achieve these aims, we profiled the entire SARS-CoV-2 proteome across the most frequent 100 HLA-A, HLA-B and HLA-DR alleles in the human population, using host-infected cell surface antigen presentation and immunogenicity predictors from the NEC Immune Profiler suite of tools, and generated comprehensive epitope maps. We then used these epitope maps as input for a Monte Carlo simulation designed to identify statistically significant “epitope hotspot” regions in the virus that are most likely to be immunogenic across a broad spectrum of HLA types. We then removed epitope hotspots that shared significant homology with proteins in the human proteome to reduce the chance of inducing off-target autoimmune responses. We also analyzed the antigen presentation and immunogenic landscape of all the nonsynonymous mutations across 3,400 different sequences of the virus, to identify a trend whereby SARS-COV-2 mutations are predicted to have reduced potential to be presented by host-infected cells, and consequently detected by the host immune system. A sequence conservation analysis then removed epitope hotspots that occurred in less-conserved regions of the viral proteome. Finally, we used a database of the HLA haplotypes of approximately 22,000 individuals to develop a “digital twin” type simulation to model how effective different combinations of hotspots would work in a diverse human population; the approach identified an optimal constellation of epitope hotspots that could provide maximum coverage in the global population. By combining the antigen presentation to the infected-host cell surface and immunogenicity predictions of the NEC Immune Profiler with a robust Monte Carlo and digital twin simulation, we have profiled the entire SARS-CoV-2 proteome and identified a subset of epitope hotspots that could be harnessed in a vaccine formulation to provide a broad coverage across the global population.https://doi.org/10.1038/s41598-020-78758-5
collection DOAJ
language English
format Article
sources DOAJ
author Brandon Malone
Boris Simovski
Clément Moliné
Jun Cheng
Marius Gheorghe
Hugues Fontenelle
Ioannis Vardaxis
Simen Tennøe
Jenny-Ann Malmberg
Richard Stratford
Trevor Clancy
spellingShingle Brandon Malone
Boris Simovski
Clément Moliné
Jun Cheng
Marius Gheorghe
Hugues Fontenelle
Ioannis Vardaxis
Simen Tennøe
Jenny-Ann Malmberg
Richard Stratford
Trevor Clancy
Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs
Scientific Reports
author_facet Brandon Malone
Boris Simovski
Clément Moliné
Jun Cheng
Marius Gheorghe
Hugues Fontenelle
Ioannis Vardaxis
Simen Tennøe
Jenny-Ann Malmberg
Richard Stratford
Trevor Clancy
author_sort Brandon Malone
title Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs
title_short Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs
title_full Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs
title_fullStr Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs
title_full_unstemmed Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs
title_sort artificial intelligence predicts the immunogenic landscape of sars-cov-2 leading to universal blueprints for vaccine designs
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2020-12-01
description Abstract The global population is at present suffering from a pandemic of Coronavirus disease 2019 (COVID-19), caused by the novel coronavirus Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). The goal of this study was to use artificial intelligence (AI) to predict blueprints for designing universal vaccines against SARS-CoV-2, that contain a sufficiently broad repertoire of T-cell epitopes capable of providing coverage and protection across the global population. To help achieve these aims, we profiled the entire SARS-CoV-2 proteome across the most frequent 100 HLA-A, HLA-B and HLA-DR alleles in the human population, using host-infected cell surface antigen presentation and immunogenicity predictors from the NEC Immune Profiler suite of tools, and generated comprehensive epitope maps. We then used these epitope maps as input for a Monte Carlo simulation designed to identify statistically significant “epitope hotspot” regions in the virus that are most likely to be immunogenic across a broad spectrum of HLA types. We then removed epitope hotspots that shared significant homology with proteins in the human proteome to reduce the chance of inducing off-target autoimmune responses. We also analyzed the antigen presentation and immunogenic landscape of all the nonsynonymous mutations across 3,400 different sequences of the virus, to identify a trend whereby SARS-COV-2 mutations are predicted to have reduced potential to be presented by host-infected cells, and consequently detected by the host immune system. A sequence conservation analysis then removed epitope hotspots that occurred in less-conserved regions of the viral proteome. Finally, we used a database of the HLA haplotypes of approximately 22,000 individuals to develop a “digital twin” type simulation to model how effective different combinations of hotspots would work in a diverse human population; the approach identified an optimal constellation of epitope hotspots that could provide maximum coverage in the global population. By combining the antigen presentation to the infected-host cell surface and immunogenicity predictions of the NEC Immune Profiler with a robust Monte Carlo and digital twin simulation, we have profiled the entire SARS-CoV-2 proteome and identified a subset of epitope hotspots that could be harnessed in a vaccine formulation to provide a broad coverage across the global population.
url https://doi.org/10.1038/s41598-020-78758-5
work_keys_str_mv AT brandonmalone artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT borissimovski artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT clementmoline artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT juncheng artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT mariusgheorghe artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT huguesfontenelle artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT ioannisvardaxis artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT simentennøe artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT jennyannmalmberg artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT richardstratford artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
AT trevorclancy artificialintelligencepredictstheimmunogeniclandscapeofsarscov2leadingtouniversalblueprintsforvaccinedesigns
_version_ 1724369201099964416