Identifying significantly impacted pathways: a comprehensive review and assessment
Abstract Background Many high-throughput experiments compare two phenotypes such as disease vs. healthy, with the goal of understanding the underlying biological phenomena characterizing the given phenotype. Because of the importance of this type of analysis, more than 70 pathway analysis methods ha...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2019-10-01
|
Series: | Genome Biology |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13059-019-1790-4 |
id |
doaj-08c2ecdaf3a14e308e54ab218a2e1932 |
---|---|
record_format |
Article |
spelling |
doaj-08c2ecdaf3a14e308e54ab218a2e19322020-11-25T03:10:37ZengBMCGenome Biology1474-760X2019-10-0120111510.1186/s13059-019-1790-4Identifying significantly impacted pathways: a comprehensive review and assessmentTuan-Minh Nguyen0Adib Shafi1Tin Nguyen2Sorin Draghici3Department of Computer Science, Wayne State UniversityDepartment of Computer Science, Wayne State UniversityDepartment of Computer Science and Engineering, University of NevadaDepartment of Computer Science, Wayne State UniversityAbstract Background Many high-throughput experiments compare two phenotypes such as disease vs. healthy, with the goal of understanding the underlying biological phenomena characterizing the given phenotype. Because of the importance of this type of analysis, more than 70 pathway analysis methods have been proposed so far. These can be categorized into two main categories: non-topology-based (non-TB) and topology-based (TB). Although some review papers discuss this topic from different aspects, there is no systematic, large-scale assessment of such methods. Furthermore, the majority of the pathway analysis approaches rely on the assumption of uniformity of p values under the null hypothesis, which is often not true. Results This article presents the most comprehensive comparative study on pathway analysis methods available to date. We compare the actual performance of 13 widely used pathway analysis methods in over 1085 analyses. These comparisons were performed using 2601 samples from 75 human disease data sets and 121 samples from 11 knockout mouse data sets. In addition, we investigate the extent to which each method is biased under the null hypothesis. Together, these data and results constitute a reliable benchmark against which future pathway analysis methods could and should be tested. Conclusion Overall, the result shows that no method is perfect. In general, TB methods appear to perform better than non-TB methods. This is somewhat expected since the TB methods take into consideration the structure of the pathway which is meant to describe the underlying phenomena. We also discover that most, if not all, listed approaches are biased and can produce skewed results under the null.http://link.springer.com/article/10.1186/s13059-019-1790-4Pathway analysisSignaling pathwaysNetwork topologyMetabolic pathwaysStatistical significanceBias |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tuan-Minh Nguyen Adib Shafi Tin Nguyen Sorin Draghici |
spellingShingle |
Tuan-Minh Nguyen Adib Shafi Tin Nguyen Sorin Draghici Identifying significantly impacted pathways: a comprehensive review and assessment Genome Biology Pathway analysis Signaling pathways Network topology Metabolic pathways Statistical significance Bias |
author_facet |
Tuan-Minh Nguyen Adib Shafi Tin Nguyen Sorin Draghici |
author_sort |
Tuan-Minh Nguyen |
title |
Identifying significantly impacted pathways: a comprehensive review and assessment |
title_short |
Identifying significantly impacted pathways: a comprehensive review and assessment |
title_full |
Identifying significantly impacted pathways: a comprehensive review and assessment |
title_fullStr |
Identifying significantly impacted pathways: a comprehensive review and assessment |
title_full_unstemmed |
Identifying significantly impacted pathways: a comprehensive review and assessment |
title_sort |
identifying significantly impacted pathways: a comprehensive review and assessment |
publisher |
BMC |
series |
Genome Biology |
issn |
1474-760X |
publishDate |
2019-10-01 |
description |
Abstract Background Many high-throughput experiments compare two phenotypes such as disease vs. healthy, with the goal of understanding the underlying biological phenomena characterizing the given phenotype. Because of the importance of this type of analysis, more than 70 pathway analysis methods have been proposed so far. These can be categorized into two main categories: non-topology-based (non-TB) and topology-based (TB). Although some review papers discuss this topic from different aspects, there is no systematic, large-scale assessment of such methods. Furthermore, the majority of the pathway analysis approaches rely on the assumption of uniformity of p values under the null hypothesis, which is often not true. Results This article presents the most comprehensive comparative study on pathway analysis methods available to date. We compare the actual performance of 13 widely used pathway analysis methods in over 1085 analyses. These comparisons were performed using 2601 samples from 75 human disease data sets and 121 samples from 11 knockout mouse data sets. In addition, we investigate the extent to which each method is biased under the null hypothesis. Together, these data and results constitute a reliable benchmark against which future pathway analysis methods could and should be tested. Conclusion Overall, the result shows that no method is perfect. In general, TB methods appear to perform better than non-TB methods. This is somewhat expected since the TB methods take into consideration the structure of the pathway which is meant to describe the underlying phenomena. We also discover that most, if not all, listed approaches are biased and can produce skewed results under the null. |
topic |
Pathway analysis Signaling pathways Network topology Metabolic pathways Statistical significance Bias |
url |
http://link.springer.com/article/10.1186/s13059-019-1790-4 |
work_keys_str_mv |
AT tuanminhnguyen identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment AT adibshafi identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment AT tinnguyen identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment AT sorindraghici identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment |
_version_ |
1724658351138144256 |