QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model
Abstract Background As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcripto...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-08-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-017-1808-4 |
id |
doaj-d7ffbdf40f6449fea1a647ecedf93928 |
---|---|
record_format |
Article |
spelling |
doaj-d7ffbdf40f6449fea1a647ecedf939282020-11-25T00:26:36ZengBMCBMC Bioinformatics1471-21052017-08-0118111210.1186/s12859-017-1808-4QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial modelLian Liu0Shao-Wu Zhang1Yufei Huang2Jia Meng3Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical UniversityKey Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical UniversityDepartment of Electrical and Computation Engineering, University of Texas at San AntonioDepartment of Biological Sciences, HRINU, SUERI, Xi’an Jiaotong-Liverpool UniversityAbstract Background As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcriptome-wide RNA methylation profile is now available in the form of count-based data, with which it is often of interests to study the dynamics at epitranscriptomic layer. However, the sample size of RNA methylation experiment is usually very small due to its costs; and additionally, there usually exist a large number of genes whose methylation level cannot be accurately estimated due to their low expression level, making differential RNA methylation analysis a difficult task. Results We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as DRME model based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions with their variances and means linked by local regressions, and in the way, the input control samples are also properly taken care of. In addition, different from DRME approach, which relies only the input control sample only for estimating the background, QNB uses a more robust estimator for gene expression by combining information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. Conclusion QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. And the QNB model is also applicable to other datasets related RNA modifications, including but not limited to RNA bisulfite sequencing, m1A-Seq, Par-CLIP, RIP-Seq, etc.http://link.springer.com/article/10.1186/s12859-017-1808-4Differential methylation analysism6ANegative binomial distributionRNA methylationSmall-sample size |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lian Liu Shao-Wu Zhang Yufei Huang Jia Meng |
spellingShingle |
Lian Liu Shao-Wu Zhang Yufei Huang Jia Meng QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model BMC Bioinformatics Differential methylation analysis m6A Negative binomial distribution RNA methylation Small-sample size |
author_facet |
Lian Liu Shao-Wu Zhang Yufei Huang Jia Meng |
author_sort |
Lian Liu |
title |
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model |
title_short |
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model |
title_full |
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model |
title_fullStr |
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model |
title_full_unstemmed |
QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model |
title_sort |
qnb: differential rna methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2017-08-01 |
description |
Abstract Background As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high throughput sequencing techniques, such as, MeRIP-Seq, transcriptome-wide RNA methylation profile is now available in the form of count-based data, with which it is often of interests to study the dynamics at epitranscriptomic layer. However, the sample size of RNA methylation experiment is usually very small due to its costs; and additionally, there usually exist a large number of genes whose methylation level cannot be accurately estimated due to their low expression level, making differential RNA methylation analysis a difficult task. Results We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as DRME model based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions with their variances and means linked by local regressions, and in the way, the input control samples are also properly taken care of. In addition, different from DRME approach, which relies only the input control sample only for estimating the background, QNB uses a more robust estimator for gene expression by combining information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. Conclusion QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. And the QNB model is also applicable to other datasets related RNA modifications, including but not limited to RNA bisulfite sequencing, m1A-Seq, Par-CLIP, RIP-Seq, etc. |
topic |
Differential methylation analysis m6A Negative binomial distribution RNA methylation Small-sample size |
url |
http://link.springer.com/article/10.1186/s12859-017-1808-4 |
work_keys_str_mv |
AT lianliu qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel AT shaowuzhang qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel AT yufeihuang qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel AT jiameng qnbdifferentialrnamethylationanalysisforcountbasedsmallsamplesequencingdatawithaquadnegativebinomialmodel |
_version_ |
1725343710862901248 |