4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network

The epigenetic modification, DNA N4 - methylcytosine(4mC) plays an important role in DNA expression, repair, and replication. It simply plays a crucial role in restriction-modification systems. The better and accurate prediction of 4mC sites in DNA is much-needed work to understand their functional...

Full description

Bibliographic Details
Main Authors: Jhabindra Khanal, Iman Nazari, Hilal Tayara, Kil To Chong
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8846679/
id doaj-c31c0b06615a434380e5572d899db6ec
record_format Article
spelling doaj-c31c0b06615a434380e5572d899db6ec2021-04-05T17:33:58ZengIEEEIEEE Access2169-35362019-01-01714545514546110.1109/ACCESS.2019.294316988466794mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural NetworkJhabindra Khanal0https://orcid.org/0000-0001-6470-1365Iman Nazari1https://orcid.org/0000-0002-2584-0909Hilal Tayara2https://orcid.org/0000-0001-5678-3479Kil To Chong3Department of Electronics and Information Engineering, Chonbuk National University, Jeonju, South KoreaDepartment of Electronics and Information Engineering, Chonbuk National University, Jeonju, South KoreaDepartment of Electronics and Information Engineering, Chonbuk National University, Jeonju, South KoreaAdvanced Electronics and Information Research Center, Chonbuk National University, Jeonju, South KoreaThe epigenetic modification, DNA N4 - methylcytosine(4mC) plays an important role in DNA expression, repair, and replication. It simply plays a crucial role in restriction-modification systems. The better and accurate prediction of 4mC sites in DNA is much-needed work to understand their functional behavior that leads to help in both drug discovery and biomedical research. Therefore, an accurate computational model is required. In this work, we present an efficient one-dimensional convolutional neural network (CNN) model, called 4mCCNN, for 4mc sites identifications in Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Escherichia coli, Geoalkalibacter subterraneus, and Geobacter pickeringii. Existing methods were developed by machine learning algorithms for identifying the 4mc sites using handcrafted features, while the proposed model extracts the features of the 4mC sites from DNA sequence automatically using the CNN model. The performance of the proposed model has been evaluated on benchmark datasets and achieved generally better outcomes in identifying 4mc sites as compared to the state-of-the-art predictors. The developed 4mCNN model was constructed in a web server at https://home.jbnu.ac.kr/NSCL/4mCCNN.htm.https://ieeexplore.ieee.org/document/8846679/Convolutional neural networkDNA methylationDNA N4 – methylcytosine(4mC)sequence analysis
collection DOAJ
language English
format Article
sources DOAJ
author Jhabindra Khanal
Iman Nazari
Hilal Tayara
Kil To Chong
spellingShingle Jhabindra Khanal
Iman Nazari
Hilal Tayara
Kil To Chong
4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network
IEEE Access
Convolutional neural network
DNA methylation
DNA N4 – methylcytosine(4mC)
sequence analysis
author_facet Jhabindra Khanal
Iman Nazari
Hilal Tayara
Kil To Chong
author_sort Jhabindra Khanal
title 4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network
title_short 4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network
title_full 4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network
title_fullStr 4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network
title_full_unstemmed 4mCCNN: Identification of N4-Methylcytosine Sites in Prokaryotes Using Convolutional Neural Network
title_sort 4mccnn: identification of n4-methylcytosine sites in prokaryotes using convolutional neural network
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The epigenetic modification, DNA N4 - methylcytosine(4mC) plays an important role in DNA expression, repair, and replication. It simply plays a crucial role in restriction-modification systems. The better and accurate prediction of 4mC sites in DNA is much-needed work to understand their functional behavior that leads to help in both drug discovery and biomedical research. Therefore, an accurate computational model is required. In this work, we present an efficient one-dimensional convolutional neural network (CNN) model, called 4mCCNN, for 4mc sites identifications in Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Escherichia coli, Geoalkalibacter subterraneus, and Geobacter pickeringii. Existing methods were developed by machine learning algorithms for identifying the 4mc sites using handcrafted features, while the proposed model extracts the features of the 4mC sites from DNA sequence automatically using the CNN model. The performance of the proposed model has been evaluated on benchmark datasets and achieved generally better outcomes in identifying 4mc sites as compared to the state-of-the-art predictors. The developed 4mCNN model was constructed in a web server at https://home.jbnu.ac.kr/NSCL/4mCCNN.htm.
topic Convolutional neural network
DNA methylation
DNA N4 – methylcytosine(4mC)
sequence analysis
url https://ieeexplore.ieee.org/document/8846679/
work_keys_str_mv AT jhabindrakhanal 4mccnnidentificationofn4methylcytosinesitesinprokaryotesusingconvolutionalneuralnetwork
AT imannazari 4mccnnidentificationofn4methylcytosinesitesinprokaryotesusingconvolutionalneuralnetwork
AT hilaltayara 4mccnnidentificationofn4methylcytosinesitesinprokaryotesusingconvolutionalneuralnetwork
AT kiltochong 4mccnnidentificationofn4methylcytosinesitesinprokaryotesusingconvolutionalneuralnetwork
_version_ 1721539344682975232