An accurate machine learning approach to predict immunogenic peptides in human

Cancer immunotherapy provides durable response to a small subset of treated patients. A variety of approaches are being developed to increase the long term benefit of checkpoint blockade. These include radiation and cytotoxic therapies and use of cancer vaccines among others. Preclinical and clinica...

Full description

Bibliographic Details
Main Authors: Priyanka Shah, Anand Kumar Maurya, Rohit Gupta, Amit Chaudhuri, Ravi Gupta
Format: Article
Language:English
Published: Science Planet Inc. 2017-12-01
Series:Canadian Journal of Biotechnology
Online Access:https://www.canadianjbiotech.com/CAN_J_BIOTECH/Archives/v1/Special Issue-Supplement/cjb.2017-a209.pdf
id doaj-7e4d6e9ab7314929904d2281f21a38bc
record_format Article
spelling doaj-7e4d6e9ab7314929904d2281f21a38bc2020-11-25T00:12:37ZengScience Planet Inc.Canadian Journal of Biotechnology2560-83042017-12-011Special Issue-Supplement22422410.24870/cjb.2017-a209An accurate machine learning approach to predict immunogenic peptides in humanPriyanka Shah0Anand Kumar Maurya1Rohit Gupta2Amit Chaudhuri3Ravi Gupta4MedGenome Labs Pvt. Ltd., Bangalore, INDIAMedGenome Labs Pvt. Ltd., Bangalore, INDIAMedGenome Labs Pvt. Ltd., Bangalore, INDIAMedGenome Inc., Foster City, USAMedGenome Labs Pvt. Ltd., Bangalore, INDIACancer immunotherapy provides durable response to a small subset of treated patients. A variety of approaches are being developed to increase the long term benefit of checkpoint blockade. These include radiation and cytotoxic therapies and use of cancer vaccines among others. Preclinical and clinical studies have demonstrated that cancer vaccines evoke strong anti-tumor immune response by mobilizing CD8+ T-cells. A challenge in the field of cancer vaccines is identifying mutations that are T-cell activating (neoepitopes). Advances in next generation sequencing permit accurate detection of cancer mutations, even when present at a low frequency. However, neoepitope prediction involves a large number of steps many of which cannot be accurately modeled. In humans, class-I peptides, 9-11–mer in length are presented by HLA – A, B and C alleles and activate CD8+ T-cells. Class-II peptides are 14-17-mer, presented by DPA, DPB, DQA, DQB, DRA and DRB alleles and activate CD4+ T-cells.The current method to identify epitopes (peptides) depends primarily on HLA binding prediction algorithm. Our analysis of 9mer peptides from IEDB database showed that there is no difference in binding affinity of peptide that can activate (immunogenic) and that cannot activate (non-immunogenic) the T-cells. The specificity to identify immunogenic peptide using HLA binding based method is only 27.59%. In this study, we present a novel approach using machine learning technique that can predict whether the peptide will be immunogenic or not. We have generated the model using features generated from amino-acid composition, HLA-binding, structural features, peptide processing and peptide transport. Our model achieved an overall accuracy of 77.30% with a specificity of 92.24% on unseen dataset.https://www.canadianjbiotech.com/CAN_J_BIOTECH/Archives/v1/Special Issue-Supplement/cjb.2017-a209.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Priyanka Shah
Anand Kumar Maurya
Rohit Gupta
Amit Chaudhuri
Ravi Gupta
spellingShingle Priyanka Shah
Anand Kumar Maurya
Rohit Gupta
Amit Chaudhuri
Ravi Gupta
An accurate machine learning approach to predict immunogenic peptides in human
Canadian Journal of Biotechnology
author_facet Priyanka Shah
Anand Kumar Maurya
Rohit Gupta
Amit Chaudhuri
Ravi Gupta
author_sort Priyanka Shah
title An accurate machine learning approach to predict immunogenic peptides in human
title_short An accurate machine learning approach to predict immunogenic peptides in human
title_full An accurate machine learning approach to predict immunogenic peptides in human
title_fullStr An accurate machine learning approach to predict immunogenic peptides in human
title_full_unstemmed An accurate machine learning approach to predict immunogenic peptides in human
title_sort accurate machine learning approach to predict immunogenic peptides in human
publisher Science Planet Inc.
series Canadian Journal of Biotechnology
issn 2560-8304
publishDate 2017-12-01
description Cancer immunotherapy provides durable response to a small subset of treated patients. A variety of approaches are being developed to increase the long term benefit of checkpoint blockade. These include radiation and cytotoxic therapies and use of cancer vaccines among others. Preclinical and clinical studies have demonstrated that cancer vaccines evoke strong anti-tumor immune response by mobilizing CD8+ T-cells. A challenge in the field of cancer vaccines is identifying mutations that are T-cell activating (neoepitopes). Advances in next generation sequencing permit accurate detection of cancer mutations, even when present at a low frequency. However, neoepitope prediction involves a large number of steps many of which cannot be accurately modeled. In humans, class-I peptides, 9-11–mer in length are presented by HLA – A, B and C alleles and activate CD8+ T-cells. Class-II peptides are 14-17-mer, presented by DPA, DPB, DQA, DQB, DRA and DRB alleles and activate CD4+ T-cells.The current method to identify epitopes (peptides) depends primarily on HLA binding prediction algorithm. Our analysis of 9mer peptides from IEDB database showed that there is no difference in binding affinity of peptide that can activate (immunogenic) and that cannot activate (non-immunogenic) the T-cells. The specificity to identify immunogenic peptide using HLA binding based method is only 27.59%. In this study, we present a novel approach using machine learning technique that can predict whether the peptide will be immunogenic or not. We have generated the model using features generated from amino-acid composition, HLA-binding, structural features, peptide processing and peptide transport. Our model achieved an overall accuracy of 77.30% with a specificity of 92.24% on unseen dataset.
url https://www.canadianjbiotech.com/CAN_J_BIOTECH/Archives/v1/Special Issue-Supplement/cjb.2017-a209.pdf
work_keys_str_mv AT priyankashah anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT anandkumarmaurya anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT rohitgupta anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT amitchaudhuri anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT ravigupta anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT priyankashah accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT anandkumarmaurya accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT rohitgupta accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT amitchaudhuri accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
AT ravigupta accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman
_version_ 1725398648224743424