Seqpac: a framework for sRNA-seq analysis in R using sequence-based counts
Motivation: Feature-based counting is commonly used in RNA-sequencing (RNA-seq) analyses. Here, sequences must align to target features (like genes or non-coding RNAs) and related sequences with different compositions are counted into the same feature. Consequently, sequence integrity is lost, makin...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Oxford University Press
2023
|
Online Access: | View Fulltext in Publisher View in Scopus |
LEADER | 02368nam a2200193Ia 4500 | ||
---|---|---|---|
001 | 10.1093-bioinformatics-btad144 | ||
008 | 230529s2023 CNT 000 0 und d | ||
020 | |a 13674803 (ISSN) | ||
245 | 1 | 0 | |a Seqpac: a framework for sRNA-seq analysis in R using sequence-based counts |
260 | 0 | |b Oxford University Press |c 2023 | |
856 | |z View Fulltext in Publisher |u https://doi.org/10.1093/bioinformatics/btad144 | ||
856 | |z View in Scopus |u https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159118738&doi=10.1093%2fbioinformatics%2fbtad144&partnerID=40&md5=c6976b0181d4c758b0985a088c3e45bf | ||
520 | 3 | |a Motivation: Feature-based counting is commonly used in RNA-sequencing (RNA-seq) analyses. Here, sequences must align to target features (like genes or non-coding RNAs) and related sequences with different compositions are counted into the same feature. Consequently, sequence integrity is lost, making results less traceable against raw data. Small RNA (sRNA) often maps to multiple features and shows an incredible diversity in form and function. Therefore, applying feature-based strategies may increase the risk of misinterpretation. We present a strategy for sRNA-seq analysis that preserves the integrity of the raw sequence making the data lineage fully traceable. We have consolidated this strategy into Seqpac: An R package that makes a complete sRNA analysis available on multiple platforms. Using published biological data, we show that Seqpac reveals hidden bias and adds new insights to studies that were previously analyzed using feature-based counting. We have identified limitations in the concurrent analysis of RNA-seq data. We call it the traceability dilemma in alignment-based sequencing strategies. By building a flexible framework that preserves the integrity of the read sequence throughout the analysis, we demonstrate better interpretability in sRNA-seq experiments, which are particularly vulnerable to this problem. Applying similar strategies to other transcriptomic workflows may aid in resolving the replication crisis experienced by many fields that depend on transcriptome analyses. © The Author(s) 2023. Published by Oxford University Press. | |
700 | 1 | 0 | |a Kugelberg, U. |e author |
700 | 1 | 0 | |a Nätt, D. |e author |
700 | 1 | 0 | |a Örkenby, L. |e author |
700 | 1 | 0 | |a Öst, A. |e author |
700 | 1 | 0 | |a Skog, S. |e author |
773 | |t Bioinformatics |