Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method

Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolution...

Full description

Bibliographic Details
Main Authors: Faegheh Golabi, Elnaz Mehdizadeh Aghdam, Mousa Shamsi, Mohammad Hossein Sedaaghi, Abolfazl Barzegar, Mohammad Saeid Hejazi
Format: Article
Language:English
Published: Tabriz University of Medical Sciences 2021-03-01
Series:BioImpacts
Subjects:
Online Access:https://bi.tbzmed.ac.ir/PDF/bi-11-101.pdf
id doaj-bf992d3a8ae14583b458bf6ae235765f
record_format Article
spelling doaj-bf992d3a8ae14583b458bf6ae235765f2021-06-21T10:49:13ZengTabriz University of Medical SciencesBioImpacts2228-56602228-56522021-03-0111210110910.34172/bi.2021.17bi-21958Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) methodFaegheh Golabi0Elnaz Mehdizadeh Aghdam1Mousa Shamsi2Mohammad Hossein Sedaaghi3Abolfazl Barzegar4Mohammad Saeid Hejazi5Genomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, IranMolecular Medicine Research Center, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, IranGenomic Signal Processing Laboratory, Faculty of Biomedical Engineering, Sahand University of Technology, Tabriz, IranFaculty of Electrical Engineering, Sahand University of Technology, Tabriz, IranResearch Institute of Bioscience and Biotechnology, University of Tabriz, Tabriz, IranMolecular Medicine Research Center, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, IranIntroduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. Methods: The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. Results: The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.https://bi.tbzmed.ac.ir/PDF/bi-11-101.pdfriboswitchesfeature extractionblock-finding algorithmblbfeclassification
collection DOAJ
language English
format Article
sources DOAJ
author Faegheh Golabi
Elnaz Mehdizadeh Aghdam
Mousa Shamsi
Mohammad Hossein Sedaaghi
Abolfazl Barzegar
Mohammad Saeid Hejazi
spellingShingle Faegheh Golabi
Elnaz Mehdizadeh Aghdam
Mousa Shamsi
Mohammad Hossein Sedaaghi
Abolfazl Barzegar
Mohammad Saeid Hejazi
Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
BioImpacts
riboswitches
feature extraction
block-finding algorithm
blbfe
classification
author_facet Faegheh Golabi
Elnaz Mehdizadeh Aghdam
Mousa Shamsi
Mohammad Hossein Sedaaghi
Abolfazl Barzegar
Mohammad Saeid Hejazi
author_sort Faegheh Golabi
title Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
title_short Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
title_full Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
title_fullStr Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
title_full_unstemmed Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
title_sort classification of seed members of five riboswitch families as short sequences based on the features extracted by block location-based feature extraction (blbfe) method
publisher Tabriz University of Medical Sciences
series BioImpacts
issn 2228-5660
2228-5652
publishDate 2021-03-01
description Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. Methods: The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. Results: The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.
topic riboswitches
feature extraction
block-finding algorithm
blbfe
classification
url https://bi.tbzmed.ac.ir/PDF/bi-11-101.pdf
work_keys_str_mv AT faeghehgolabi classificationofseedmembersoffiveriboswitchfamiliesasshortsequencesbasedonthefeaturesextractedbyblocklocationbasedfeatureextractionblbfemethod
AT elnazmehdizadehaghdam classificationofseedmembersoffiveriboswitchfamiliesasshortsequencesbasedonthefeaturesextractedbyblocklocationbasedfeatureextractionblbfemethod
AT mousashamsi classificationofseedmembersoffiveriboswitchfamiliesasshortsequencesbasedonthefeaturesextractedbyblocklocationbasedfeatureextractionblbfemethod
AT mohammadhosseinsedaaghi classificationofseedmembersoffiveriboswitchfamiliesasshortsequencesbasedonthefeaturesextractedbyblocklocationbasedfeatureextractionblbfemethod
AT abolfazlbarzegar classificationofseedmembersoffiveriboswitchfamiliesasshortsequencesbasedonthefeaturesextractedbyblocklocationbasedfeatureextractionblbfemethod
AT mohammadsaeidhejazi classificationofseedmembersoffiveriboswitchfamiliesasshortsequencesbasedonthefeaturesextractedbyblocklocationbasedfeatureextractionblbfemethod
_version_ 1721368564454129664