Nanopore ReCappable sequencing maps SARS-CoV-2 5′ capping sites and provides new insights into the structure of sgRNAs

The SARS-CoV-2 virus has a complex transcriptome characterised by multiple, nested subgenomic RNAsused to express structural and accessory proteins. Long-read sequencing technologies such as nanopore direct RNA sequencing can recover full-length transcripts, greatly simplifying the assembly of struc...

Full description

Bibliographic Details
Main Authors: Akeson, M. (Author), Almuqrin, A. (Author), Birney, E. (Author), Castelli, M. (Author), Clementi, M. (Author), Clementi, N. (Author), Corrêa, I.R., Jr (Author), Criscuolo, E. (Author), Davidson, A.D (Author), Ettwiller, L. (Author), Frigè, G. (Author), Giambruno, R. (Author), Jain, M. (Author), Leger, A. (Author), Leonardi, T. (Author), Mancini, N. (Author), Matthews, D.A (Author), Mulroney, L. (Author), Nicassio, F. (Author), Olsen, H. (Author), Schildkraut, I. (Author), Tzertzinis, G. (Author), Ugolini, C. (Author), Williamson, M.K (Author), Wulf, M.G (Author)
Format: Article
Language:English
Published: Oxford University Press 2022
Online Access:View Fulltext in Publisher
Description
Summary:The SARS-CoV-2 virus has a complex transcriptome characterised by multiple, nested subgenomic RNAsused to express structural and accessory proteins. Long-read sequencing technologies such as nanopore direct RNA sequencing can recover full-length transcripts, greatly simplifying the assembly of structurally complex RNAs. However, these techniques do not detect the 5′ cap, thus preventing reliable identification and quantification of full-length, coding transcript models. Here we used Nanopore ReCappable Sequencing (NRCeq), a new technique that can identify capped full-length RNAs, to assemble a complete annotation of SARS-CoV-2 sgRNAs and annotate the location of capping sites across the viral genome. We obtained robust estimates of sgRNA expression across cell lines and viral isolates and identified novel canonical and non-canonical sgRNAs, including one that uses a previously un-annotated leader-to-body junction site. The data generated in this work constitute a useful resource for the scientific community and provide important insights into the mechanisms that regulate the transcription of SARS-CoV-2 sgRNAs. © 2022 The Author(s).
Physical Description:15
ISBN:03051048 (ISSN)
DOI:10.1093/nar/gkac144