Short k-mer abundance profiles yield robust machine learning features and accurate classifiers for RNA viruses

High-throughput sequencing technologies have greatly enabled the study of genomics, transcriptomics and metagenomics. Automated annotation and classification of the vast amounts of generated sequence data has become paramount for facilitating biological sciences. Genomes of viruses can be radically...

Full description

Bibliographic Details
Main Authors: Md. Nafis Ul Alam, Umar Faruq Chowdhury, Ruslan Kalendar
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7500682/?tool=EBI