AptaNet as a deep learning approach for aptamer–protein interaction prediction
Abstract Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-03-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-85629-0 |
id |
doaj-f9b26603b44f4b3da984cc4e94310d8c |
---|---|
record_format |
Article |
spelling |
doaj-f9b26603b44f4b3da984cc4e94310d8c2021-03-21T12:36:18ZengNature Publishing GroupScientific Reports2045-23222021-03-0111111910.1038/s41598-021-85629-0AptaNet as a deep learning approach for aptamer–protein interaction predictionNeda Emami0Reza Ferdousi1Department of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical SciencesDepartment of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical SciencesAbstract Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we present AptaNet—a new deep neural network—to predict the aptamer–protein interaction pairs by integrating features derived from both aptamers and the target proteins. Aptamers were encoded by using two different strategies, including k-mer and reverse complement k-mer frequency. Amino acid composition (AAC) and pseudo amino acid composition (PseAAC) were applied to represent target information using 24 physicochemical and conformational properties of the proteins. To handle the imbalance problem in the data, we applied a neighborhood cleaning algorithm. The predictor was constructed based on a deep neural network, and optimal features were selected using the random forest algorithm. As a result, 99.79% accuracy was achieved for the training dataset, and 91.38% accuracy was obtained for the testing dataset. AptaNet achieved high performance on our constructed aptamer-protein benchmark dataset. The results indicate that AptaNet can help identify novel aptamer–protein interacting pairs and build more-efficient insights into the relationship between aptamers and proteins. Our benchmark dataset and the source codes for AptaNet are available in: https://github.com/nedaemami/AptaNet .https://doi.org/10.1038/s41598-021-85629-0 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Neda Emami Reza Ferdousi |
spellingShingle |
Neda Emami Reza Ferdousi AptaNet as a deep learning approach for aptamer–protein interaction prediction Scientific Reports |
author_facet |
Neda Emami Reza Ferdousi |
author_sort |
Neda Emami |
title |
AptaNet as a deep learning approach for aptamer–protein interaction prediction |
title_short |
AptaNet as a deep learning approach for aptamer–protein interaction prediction |
title_full |
AptaNet as a deep learning approach for aptamer–protein interaction prediction |
title_fullStr |
AptaNet as a deep learning approach for aptamer–protein interaction prediction |
title_full_unstemmed |
AptaNet as a deep learning approach for aptamer–protein interaction prediction |
title_sort |
aptanet as a deep learning approach for aptamer–protein interaction prediction |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-03-01 |
description |
Abstract Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we present AptaNet—a new deep neural network—to predict the aptamer–protein interaction pairs by integrating features derived from both aptamers and the target proteins. Aptamers were encoded by using two different strategies, including k-mer and reverse complement k-mer frequency. Amino acid composition (AAC) and pseudo amino acid composition (PseAAC) were applied to represent target information using 24 physicochemical and conformational properties of the proteins. To handle the imbalance problem in the data, we applied a neighborhood cleaning algorithm. The predictor was constructed based on a deep neural network, and optimal features were selected using the random forest algorithm. As a result, 99.79% accuracy was achieved for the training dataset, and 91.38% accuracy was obtained for the testing dataset. AptaNet achieved high performance on our constructed aptamer-protein benchmark dataset. The results indicate that AptaNet can help identify novel aptamer–protein interacting pairs and build more-efficient insights into the relationship between aptamers and proteins. Our benchmark dataset and the source codes for AptaNet are available in: https://github.com/nedaemami/AptaNet . |
url |
https://doi.org/10.1038/s41598-021-85629-0 |
work_keys_str_mv |
AT nedaemami aptanetasadeeplearningapproachforaptamerproteininteractionprediction AT rezaferdousi aptanetasadeeplearningapproachforaptamerproteininteractionprediction |
_version_ |
1724210324637220864 |