A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
Abstract Motivation Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the sa...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-02-01
|
Series: | BioData Mining |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13040-018-0163-y |
id |
doaj-1bcb5fc6952944609a53830f60f8a3bc |
---|---|
record_format |
Article |
spelling |
doaj-1bcb5fc6952944609a53830f60f8a3bc2020-11-24T22:08:07ZengBMCBioData Mining1756-03812018-02-0111111710.1186/s13040-018-0163-yA novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysisWenyi Qin0Hui Lu1Department of Bioengineering, University of Illinois at ChicagoDepartment of Bioengineering, University of Illinois at ChicagoAbstract Motivation Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited.http://link.springer.com/article/10.1186/s13040-018-0163-yPublic data integrationCross disease transcriptomeGene expressionDifferentially expressed |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wenyi Qin Hui Lu |
spellingShingle |
Wenyi Qin Hui Lu A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis BioData Mining Public data integration Cross disease transcriptome Gene expression Differentially expressed |
author_facet |
Wenyi Qin Hui Lu |
author_sort |
Wenyi Qin |
title |
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_short |
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_full |
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_fullStr |
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_full_unstemmed |
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_sort |
novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
publisher |
BMC |
series |
BioData Mining |
issn |
1756-0381 |
publishDate |
2018-02-01 |
description |
Abstract Motivation Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited. |
topic |
Public data integration Cross disease transcriptome Gene expression Differentially expressed |
url |
http://link.springer.com/article/10.1186/s13040-018-0163-y |
work_keys_str_mv |
AT wenyiqin anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis AT huilu anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis AT wenyiqin noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis AT huilu noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis |
_version_ |
1725817694486265856 |