An accurate machine learning approach to predict immunogenic peptides in human
Cancer immunotherapy provides durable response to a small subset of treated patients. A variety of approaches are being developed to increase the long term benefit of checkpoint blockade. These include radiation and cytotoxic therapies and use of cancer vaccines among others. Preclinical and clinica...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Science Planet Inc.
2017-12-01
|
Series: | Canadian Journal of Biotechnology |
Online Access: | https://www.canadianjbiotech.com/CAN_J_BIOTECH/Archives/v1/Special Issue-Supplement/cjb.2017-a209.pdf |
id |
doaj-7e4d6e9ab7314929904d2281f21a38bc |
---|---|
record_format |
Article |
spelling |
doaj-7e4d6e9ab7314929904d2281f21a38bc2020-11-25T00:12:37ZengScience Planet Inc.Canadian Journal of Biotechnology2560-83042017-12-011Special Issue-Supplement22422410.24870/cjb.2017-a209An accurate machine learning approach to predict immunogenic peptides in humanPriyanka Shah0Anand Kumar Maurya1Rohit Gupta2Amit Chaudhuri3Ravi Gupta4MedGenome Labs Pvt. Ltd., Bangalore, INDIAMedGenome Labs Pvt. Ltd., Bangalore, INDIAMedGenome Labs Pvt. Ltd., Bangalore, INDIAMedGenome Inc., Foster City, USAMedGenome Labs Pvt. Ltd., Bangalore, INDIACancer immunotherapy provides durable response to a small subset of treated patients. A variety of approaches are being developed to increase the long term benefit of checkpoint blockade. These include radiation and cytotoxic therapies and use of cancer vaccines among others. Preclinical and clinical studies have demonstrated that cancer vaccines evoke strong anti-tumor immune response by mobilizing CD8+ T-cells. A challenge in the field of cancer vaccines is identifying mutations that are T-cell activating (neoepitopes). Advances in next generation sequencing permit accurate detection of cancer mutations, even when present at a low frequency. However, neoepitope prediction involves a large number of steps many of which cannot be accurately modeled. In humans, class-I peptides, 9-11–mer in length are presented by HLA – A, B and C alleles and activate CD8+ T-cells. Class-II peptides are 14-17-mer, presented by DPA, DPB, DQA, DQB, DRA and DRB alleles and activate CD4+ T-cells.The current method to identify epitopes (peptides) depends primarily on HLA binding prediction algorithm. Our analysis of 9mer peptides from IEDB database showed that there is no difference in binding affinity of peptide that can activate (immunogenic) and that cannot activate (non-immunogenic) the T-cells. The specificity to identify immunogenic peptide using HLA binding based method is only 27.59%. In this study, we present a novel approach using machine learning technique that can predict whether the peptide will be immunogenic or not. We have generated the model using features generated from amino-acid composition, HLA-binding, structural features, peptide processing and peptide transport. Our model achieved an overall accuracy of 77.30% with a specificity of 92.24% on unseen dataset.https://www.canadianjbiotech.com/CAN_J_BIOTECH/Archives/v1/Special Issue-Supplement/cjb.2017-a209.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Priyanka Shah Anand Kumar Maurya Rohit Gupta Amit Chaudhuri Ravi Gupta |
spellingShingle |
Priyanka Shah Anand Kumar Maurya Rohit Gupta Amit Chaudhuri Ravi Gupta An accurate machine learning approach to predict immunogenic peptides in human Canadian Journal of Biotechnology |
author_facet |
Priyanka Shah Anand Kumar Maurya Rohit Gupta Amit Chaudhuri Ravi Gupta |
author_sort |
Priyanka Shah |
title |
An accurate machine learning approach to predict immunogenic peptides in human |
title_short |
An accurate machine learning approach to predict immunogenic peptides in human |
title_full |
An accurate machine learning approach to predict immunogenic peptides in human |
title_fullStr |
An accurate machine learning approach to predict immunogenic peptides in human |
title_full_unstemmed |
An accurate machine learning approach to predict immunogenic peptides in human |
title_sort |
accurate machine learning approach to predict immunogenic peptides in human |
publisher |
Science Planet Inc. |
series |
Canadian Journal of Biotechnology |
issn |
2560-8304 |
publishDate |
2017-12-01 |
description |
Cancer immunotherapy provides durable response to a small subset of treated patients. A variety of approaches are being developed to increase the long term benefit of checkpoint blockade. These include radiation and cytotoxic therapies and use of cancer vaccines among others. Preclinical and clinical studies have demonstrated that cancer vaccines evoke strong anti-tumor immune response by mobilizing CD8+ T-cells. A challenge in the field of cancer vaccines is identifying mutations that are T-cell activating (neoepitopes). Advances in next generation sequencing permit accurate detection of cancer mutations, even when present at a low frequency. However, neoepitope prediction involves a large number of steps many of which cannot be accurately modeled. In humans, class-I peptides, 9-11–mer in length are presented by HLA – A, B and C alleles and activate CD8+ T-cells. Class-II peptides are 14-17-mer, presented by DPA, DPB, DQA, DQB, DRA and DRB alleles and activate CD4+ T-cells.The current method to identify epitopes (peptides) depends primarily on HLA binding prediction algorithm. Our analysis of 9mer peptides from IEDB database showed that there is no difference in binding affinity of peptide that can activate (immunogenic) and that cannot activate (non-immunogenic) the T-cells. The specificity to identify immunogenic peptide using HLA binding based method is only 27.59%. In this study, we present a novel approach using machine learning technique that can predict whether the peptide will be immunogenic or not. We have generated the model using features generated from amino-acid composition, HLA-binding, structural features, peptide processing and peptide transport. Our model achieved an overall accuracy of 77.30% with a specificity of 92.24% on unseen dataset. |
url |
https://www.canadianjbiotech.com/CAN_J_BIOTECH/Archives/v1/Special Issue-Supplement/cjb.2017-a209.pdf |
work_keys_str_mv |
AT priyankashah anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT anandkumarmaurya anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT rohitgupta anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT amitchaudhuri anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT ravigupta anaccuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT priyankashah accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT anandkumarmaurya accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT rohitgupta accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT amitchaudhuri accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman AT ravigupta accuratemachinelearningapproachtopredictimmunogenicpeptidesinhuman |
_version_ |
1725398648224743424 |