Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing

<p>Abstract</p> <p>Background</p> <p>RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiple...

Full description

Bibliographic Details
Main Authors: Robles José A, Qureshi Sumaira E, Stephen Stuart J, Wilson Susan R, Burden Conrad J, Taylor Jennifer M
Format: Article
Language:English
Published: BMC 2012-09-01
Series:BMC Genomics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2164/13/484
id doaj-57c18bdf84bf4dea9bb6a6ad2342b7ed
record_format Article
spelling doaj-57c18bdf84bf4dea9bb6a6ad2342b7ed2020-11-24T23:07:51ZengBMCBMC Genomics1471-21642012-09-0113148410.1186/1471-2164-13-484Efficient experimental design and analysis strategies for the detection of differential expression using RNA-SequencingRobles José AQureshi Sumaira EStephen Stuart JWilson Susan RBurden Conrad JTaylor Jennifer M<p>Abstract</p> <p>Background</p> <p>RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods.</p> <p>Results</p> <p>Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices.</p> <p>Conclusions</p> <p>This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates.</p> http://www.biomedcentral.com/1471-2164/13/484RNA-SeqDifferential expression analysisSequencing depthReplicationExperimental designMultiplex
collection DOAJ
language English
format Article
sources DOAJ
author Robles José A
Qureshi Sumaira E
Stephen Stuart J
Wilson Susan R
Burden Conrad J
Taylor Jennifer M
spellingShingle Robles José A
Qureshi Sumaira E
Stephen Stuart J
Wilson Susan R
Burden Conrad J
Taylor Jennifer M
Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
BMC Genomics
RNA-Seq
Differential expression analysis
Sequencing depth
Replication
Experimental design
Multiplex
author_facet Robles José A
Qureshi Sumaira E
Stephen Stuart J
Wilson Susan R
Burden Conrad J
Taylor Jennifer M
author_sort Robles José A
title Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
title_short Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
title_full Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
title_fullStr Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
title_full_unstemmed Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
title_sort efficient experimental design and analysis strategies for the detection of differential expression using rna-sequencing
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2012-09-01
description <p>Abstract</p> <p>Background</p> <p>RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods.</p> <p>Results</p> <p>Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices.</p> <p>Conclusions</p> <p>This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates.</p>
topic RNA-Seq
Differential expression analysis
Sequencing depth
Replication
Experimental design
Multiplex
url http://www.biomedcentral.com/1471-2164/13/484
work_keys_str_mv AT roblesjosea efficientexperimentaldesignandanalysisstrategiesforthedetectionofdifferentialexpressionusingrnasequencing
AT qureshisumairae efficientexperimentaldesignandanalysisstrategiesforthedetectionofdifferentialexpressionusingrnasequencing
AT stephenstuartj efficientexperimentaldesignandanalysisstrategiesforthedetectionofdifferentialexpressionusingrnasequencing
AT wilsonsusanr efficientexperimentaldesignandanalysisstrategiesforthedetectionofdifferentialexpressionusingrnasequencing
AT burdenconradj efficientexperimentaldesignandanalysisstrategiesforthedetectionofdifferentialexpressionusingrnasequencing
AT taylorjenniferm efficientexperimentaldesignandanalysisstrategiesforthedetectionofdifferentialexpressionusingrnasequencing
_version_ 1725616688606478336