AptaNet as a deep learning approach for aptamer–protein interaction prediction

Abstract Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we...

Full description

Bibliographic Details
Main Authors: Neda Emami, Reza Ferdousi
Format: Article
Language:English
Published: Nature Publishing Group 2021-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-85629-0
id doaj-f9b26603b44f4b3da984cc4e94310d8c
record_format Article
spelling doaj-f9b26603b44f4b3da984cc4e94310d8c2021-03-21T12:36:18ZengNature Publishing GroupScientific Reports2045-23222021-03-0111111910.1038/s41598-021-85629-0AptaNet as a deep learning approach for aptamer–protein interaction predictionNeda Emami0Reza Ferdousi1Department of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical SciencesDepartment of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical SciencesAbstract Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we present AptaNet—a new deep neural network—to predict the aptamer–protein interaction pairs by integrating features derived from both aptamers and the target proteins. Aptamers were encoded by using two different strategies, including k-mer and reverse complement k-mer frequency. Amino acid composition (AAC) and pseudo amino acid composition (PseAAC) were applied to represent target information using 24 physicochemical and conformational properties of the proteins. To handle the imbalance problem in the data, we applied a neighborhood cleaning algorithm. The predictor was constructed based on a deep neural network, and optimal features were selected using the random forest algorithm. As a result, 99.79% accuracy was achieved for the training dataset, and 91.38% accuracy was obtained for the testing dataset. AptaNet achieved high performance on our constructed aptamer-protein benchmark dataset. The results indicate that AptaNet can help identify novel aptamer–protein interacting pairs and build more-efficient insights into the relationship between aptamers and proteins. Our benchmark dataset and the source codes for AptaNet are available in: https://github.com/nedaemami/AptaNet .https://doi.org/10.1038/s41598-021-85629-0
collection DOAJ
language English
format Article
sources DOAJ
author Neda Emami
Reza Ferdousi
spellingShingle Neda Emami
Reza Ferdousi
AptaNet as a deep learning approach for aptamer–protein interaction prediction
Scientific Reports
author_facet Neda Emami
Reza Ferdousi
author_sort Neda Emami
title AptaNet as a deep learning approach for aptamer–protein interaction prediction
title_short AptaNet as a deep learning approach for aptamer–protein interaction prediction
title_full AptaNet as a deep learning approach for aptamer–protein interaction prediction
title_fullStr AptaNet as a deep learning approach for aptamer–protein interaction prediction
title_full_unstemmed AptaNet as a deep learning approach for aptamer–protein interaction prediction
title_sort aptanet as a deep learning approach for aptamer–protein interaction prediction
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-03-01
description Abstract Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we present AptaNet—a new deep neural network—to predict the aptamer–protein interaction pairs by integrating features derived from both aptamers and the target proteins. Aptamers were encoded by using two different strategies, including k-mer and reverse complement k-mer frequency. Amino acid composition (AAC) and pseudo amino acid composition (PseAAC) were applied to represent target information using 24 physicochemical and conformational properties of the proteins. To handle the imbalance problem in the data, we applied a neighborhood cleaning algorithm. The predictor was constructed based on a deep neural network, and optimal features were selected using the random forest algorithm. As a result, 99.79% accuracy was achieved for the training dataset, and 91.38% accuracy was obtained for the testing dataset. AptaNet achieved high performance on our constructed aptamer-protein benchmark dataset. The results indicate that AptaNet can help identify novel aptamer–protein interacting pairs and build more-efficient insights into the relationship between aptamers and proteins. Our benchmark dataset and the source codes for AptaNet are available in: https://github.com/nedaemami/AptaNet .
url https://doi.org/10.1038/s41598-021-85629-0
work_keys_str_mv AT nedaemami aptanetasadeeplearningapproachforaptamerproteininteractionprediction
AT rezaferdousi aptanetasadeeplearningapproachforaptamerproteininteractionprediction
_version_ 1724210324637220864