A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis

Abstract Motivation Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the sa...

Full description

Bibliographic Details
Main Authors: Wenyi Qin, Hui Lu
Format: Article
Language:English
Published: BMC 2018-02-01
Series:BioData Mining
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13040-018-0163-y
id doaj-1bcb5fc6952944609a53830f60f8a3bc
record_format Article
spelling doaj-1bcb5fc6952944609a53830f60f8a3bc2020-11-24T22:08:07ZengBMCBioData Mining1756-03812018-02-0111111710.1186/s13040-018-0163-yA novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysisWenyi Qin0Hui Lu1Department of Bioengineering, University of Illinois at ChicagoDepartment of Bioengineering, University of Illinois at ChicagoAbstract Motivation Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited.http://link.springer.com/article/10.1186/s13040-018-0163-yPublic data integrationCross disease transcriptomeGene expressionDifferentially expressed
collection DOAJ
language English
format Article
sources DOAJ
author Wenyi Qin
Hui Lu
spellingShingle Wenyi Qin
Hui Lu
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
BioData Mining
Public data integration
Cross disease transcriptome
Gene expression
Differentially expressed
author_facet Wenyi Qin
Hui Lu
author_sort Wenyi Qin
title A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_short A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_full A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_fullStr A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_full_unstemmed A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_sort novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
publisher BMC
series BioData Mining
issn 1756-0381
publishDate 2018-02-01
description Abstract Motivation Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited.
topic Public data integration
Cross disease transcriptome
Gene expression
Differentially expressed
url http://link.springer.com/article/10.1186/s13040-018-0163-y
work_keys_str_mv AT wenyiqin anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
AT huilu anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
AT wenyiqin noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
AT huilu noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
_version_ 1725817694486265856