RNA-seq: impact of RNA degradation on transcript quantification

Background The use of low quality RNA samples in whole-genome gene expression profiling remains controversial. It is unclear if transcript degradation in low quality RNA samples occurs uniformly, in which case the effects of degradation can be corrected via data normalization, or whether different t...

Full description

Bibliographic Details
Main Authors: Gallego Romero, Irene (Author), Tung, Jenny (Author), Gilad, Yoav (Author), Pai, Athma A. (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Biology (Contributor)
Format: Article
Language:English
Published: BioMed Central, 2015-06-29T16:37:10Z.
Subjects:
Online Access:Get fulltext
LEADER 02376 am a22002173u 4500
001 97557
042 |a dc 
100 1 0 |a Gallego Romero, Irene  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Biology  |e contributor 
100 1 0 |a Pai, Athma A.  |e contributor 
700 1 0 |a Tung, Jenny  |e author 
700 1 0 |a Gilad, Yoav  |e author 
700 1 0 |a Pai, Athma A.  |e author 
245 0 0 |a RNA-seq: impact of RNA degradation on transcript quantification 
260 |b BioMed Central,   |c 2015-06-29T16:37:10Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/97557 
520 |a Background The use of low quality RNA samples in whole-genome gene expression profiling remains controversial. It is unclear if transcript degradation in low quality RNA samples occurs uniformly, in which case the effects of degradation can be corrected via data normalization, or whether different transcripts are degraded at different rates, potentially biasing measurements of expression levels. This concern has rendered the use of low quality RNA samples in whole-genome expression profiling problematic. Yet, low quality samples (for example, samples collected in the course of fieldwork) are at times the sole means of addressing specific questions. Results We sought to quantify the impact of variation in RNA quality on estimates of gene expression levels based on RNA-seq data. To do so, we collected expression data from tissue samples that were allowed to decay for varying amounts of time prior to RNA extraction. The RNA samples we collected spanned the entire range of RNA Integrity Number (RIN) values (a metric commonly used to assess RNA quality). We observed widespread effects of RNA quality on measurements of gene expression levels, as well as a slight but significant loss of library complexity in more degraded samples. Conclusions While standard normalizations failed to account for the effects of degradation, we found that by explicitly controlling for the effects of RIN using a linear model framework we can correct for the majority of these effects. We conclude that in instances in which RIN and the effect of interest are not associated, this approach can help recover biologically meaningful signals in data from degraded RNA samples. 
520 |a American Heart Association (Predoctoral Fellowship) 
546 |a en 
655 7 |a Article 
773 |t BMC Biology