Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method

Abstract As anticancer peptides (ACPs) have attracted great interest for cancer treatment, several approaches based on machine learning have been proposed for ACP identification. Although existing methods have afforded high prediction accuracies, however such models are using a large number of descr...

Full description

Bibliographic Details
Main Authors: Phasit Charoenkwan, Wararat Chiangjong, Vannajan Sanghiran Lee, Chanin Nantasenamat, Md. Mehedi Hasan, Watshara Shoombuatong
Format: Article
Language:English
Published: Nature Publishing Group 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-82513-9
id doaj-4da6028b3ec44c70afc3d6b6a8e19110
record_format Article
spelling doaj-4da6028b3ec44c70afc3d6b6a8e191102021-02-07T12:37:00ZengNature Publishing GroupScientific Reports2045-23222021-02-0111111310.1038/s41598-021-82513-9Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card methodPhasit Charoenkwan0Wararat Chiangjong1Vannajan Sanghiran Lee2Chanin Nantasenamat3Md. Mehedi Hasan4Watshara Shoombuatong5Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai UniversityPediatric Translational Research Unit, Department of Pediatrics, Faculty of Medicine, Ramathibodi Hospital, Mahidol UniversityDepartment of Chemistry, Centre of Theoretical and Computational Physics, Faculty of Science, University of MalayaCenter of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol UniversityDepartment of Bioscience and Bioinformatics, Kyushu Institute of TechnologyCenter of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol UniversityAbstract As anticancer peptides (ACPs) have attracted great interest for cancer treatment, several approaches based on machine learning have been proposed for ACP identification. Although existing methods have afforded high prediction accuracies, however such models are using a large number of descriptors together with complex ensemble approaches that consequently leads to low interpretability and thus poses a challenge for biologists and biochemists. Therefore, it is desirable to develop a simple, interpretable and efficient predictor for accurate ACP identification as well as providing the means for the rational design of new anticancer peptides with promising potential for clinical application. Herein, we propose a novel flexible scoring card method (FSCM) making use of propensity scores of local and global sequential information for the development of a sequence-based ACP predictor (named iACP-FSCM) for improving the prediction accuracy and model interpretability. To the best of our knowledge, iACP-FSCM represents the first sequence-based ACP predictor for rationalizing an in-depth understanding into the molecular basis for the enhancement of anticancer activities of peptides via the use of FSCM-derived propensity scores. The independent testing results showed that the iACP-FSCM provided accuracies of 0.825 and 0.910 as evaluated on the main and alternative datasets, respectively. Results from comparative benchmarking demonstrated that iACP-FSCM could outperform seven other existing ACP predictors with marked improvements of 7% and 17% for accuracy and MCC, respectively, on the main dataset. Furthermore, the iACP-FSCM (0.910) achieved very comparable results to that of the state-of-the-art ensemble model AntiCP2.0 (0.920) as evaluated on the alternative dataset. Comparative results demonstrated that iACP-FSCM was the most suitable choice for ACP identification and characterization considering its simplicity, interpretability and generalizability. It is highly anticipated that the iACP-FSCM may be a robust tool for the rapid screening and identification of promising ACPs for clinical use.https://doi.org/10.1038/s41598-021-82513-9
collection DOAJ
language English
format Article
sources DOAJ
author Phasit Charoenkwan
Wararat Chiangjong
Vannajan Sanghiran Lee
Chanin Nantasenamat
Md. Mehedi Hasan
Watshara Shoombuatong
spellingShingle Phasit Charoenkwan
Wararat Chiangjong
Vannajan Sanghiran Lee
Chanin Nantasenamat
Md. Mehedi Hasan
Watshara Shoombuatong
Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
Scientific Reports
author_facet Phasit Charoenkwan
Wararat Chiangjong
Vannajan Sanghiran Lee
Chanin Nantasenamat
Md. Mehedi Hasan
Watshara Shoombuatong
author_sort Phasit Charoenkwan
title Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_short Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_full Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_fullStr Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_full_unstemmed Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
title_sort improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-02-01
description Abstract As anticancer peptides (ACPs) have attracted great interest for cancer treatment, several approaches based on machine learning have been proposed for ACP identification. Although existing methods have afforded high prediction accuracies, however such models are using a large number of descriptors together with complex ensemble approaches that consequently leads to low interpretability and thus poses a challenge for biologists and biochemists. Therefore, it is desirable to develop a simple, interpretable and efficient predictor for accurate ACP identification as well as providing the means for the rational design of new anticancer peptides with promising potential for clinical application. Herein, we propose a novel flexible scoring card method (FSCM) making use of propensity scores of local and global sequential information for the development of a sequence-based ACP predictor (named iACP-FSCM) for improving the prediction accuracy and model interpretability. To the best of our knowledge, iACP-FSCM represents the first sequence-based ACP predictor for rationalizing an in-depth understanding into the molecular basis for the enhancement of anticancer activities of peptides via the use of FSCM-derived propensity scores. The independent testing results showed that the iACP-FSCM provided accuracies of 0.825 and 0.910 as evaluated on the main and alternative datasets, respectively. Results from comparative benchmarking demonstrated that iACP-FSCM could outperform seven other existing ACP predictors with marked improvements of 7% and 17% for accuracy and MCC, respectively, on the main dataset. Furthermore, the iACP-FSCM (0.910) achieved very comparable results to that of the state-of-the-art ensemble model AntiCP2.0 (0.920) as evaluated on the alternative dataset. Comparative results demonstrated that iACP-FSCM was the most suitable choice for ACP identification and characterization considering its simplicity, interpretability and generalizability. It is highly anticipated that the iACP-FSCM may be a robust tool for the rapid screening and identification of promising ACPs for clinical use.
url https://doi.org/10.1038/s41598-021-82513-9
work_keys_str_mv AT phasitcharoenkwan improvedpredictionandcharacterizationofanticanceractivitiesofpeptidesusinganovelflexiblescoringcardmethod
AT wararatchiangjong improvedpredictionandcharacterizationofanticanceractivitiesofpeptidesusinganovelflexiblescoringcardmethod
AT vannajansanghiranlee improvedpredictionandcharacterizationofanticanceractivitiesofpeptidesusinganovelflexiblescoringcardmethod
AT chaninnantasenamat improvedpredictionandcharacterizationofanticanceractivitiesofpeptidesusinganovelflexiblescoringcardmethod
AT mdmehedihasan improvedpredictionandcharacterizationofanticanceractivitiesofpeptidesusinganovelflexiblescoringcardmethod
AT watsharashoombuatong improvedpredictionandcharacterizationofanticanceractivitiesofpeptidesusinganovelflexiblescoringcardmethod
_version_ 1724280858151485440