Identifying significantly impacted pathways: a comprehensive review and assessment

Abstract Background Many high-throughput experiments compare two phenotypes such as disease vs. healthy, with the goal of understanding the underlying biological phenomena characterizing the given phenotype. Because of the importance of this type of analysis, more than 70 pathway analysis methods ha...

Full description

Bibliographic Details
Main Authors: Tuan-Minh Nguyen, Adib Shafi, Tin Nguyen, Sorin Draghici
Format: Article
Language:English
Published: BMC 2019-10-01
Series:Genome Biology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13059-019-1790-4
id doaj-08c2ecdaf3a14e308e54ab218a2e1932
record_format Article
spelling doaj-08c2ecdaf3a14e308e54ab218a2e19322020-11-25T03:10:37ZengBMCGenome Biology1474-760X2019-10-0120111510.1186/s13059-019-1790-4Identifying significantly impacted pathways: a comprehensive review and assessmentTuan-Minh Nguyen0Adib Shafi1Tin Nguyen2Sorin Draghici3Department of Computer Science, Wayne State UniversityDepartment of Computer Science, Wayne State UniversityDepartment of Computer Science and Engineering, University of NevadaDepartment of Computer Science, Wayne State UniversityAbstract Background Many high-throughput experiments compare two phenotypes such as disease vs. healthy, with the goal of understanding the underlying biological phenomena characterizing the given phenotype. Because of the importance of this type of analysis, more than 70 pathway analysis methods have been proposed so far. These can be categorized into two main categories: non-topology-based (non-TB) and topology-based (TB). Although some review papers discuss this topic from different aspects, there is no systematic, large-scale assessment of such methods. Furthermore, the majority of the pathway analysis approaches rely on the assumption of uniformity of p values under the null hypothesis, which is often not true. Results This article presents the most comprehensive comparative study on pathway analysis methods available to date. We compare the actual performance of 13 widely used pathway analysis methods in over 1085 analyses. These comparisons were performed using 2601 samples from 75 human disease data sets and 121 samples from 11 knockout mouse data sets. In addition, we investigate the extent to which each method is biased under the null hypothesis. Together, these data and results constitute a reliable benchmark against which future pathway analysis methods could and should be tested. Conclusion Overall, the result shows that no method is perfect. In general, TB methods appear to perform better than non-TB methods. This is somewhat expected since the TB methods take into consideration the structure of the pathway which is meant to describe the underlying phenomena. We also discover that most, if not all, listed approaches are biased and can produce skewed results under the null.http://link.springer.com/article/10.1186/s13059-019-1790-4Pathway analysisSignaling pathwaysNetwork topologyMetabolic pathwaysStatistical significanceBias
collection DOAJ
language English
format Article
sources DOAJ
author Tuan-Minh Nguyen
Adib Shafi
Tin Nguyen
Sorin Draghici
spellingShingle Tuan-Minh Nguyen
Adib Shafi
Tin Nguyen
Sorin Draghici
Identifying significantly impacted pathways: a comprehensive review and assessment
Genome Biology
Pathway analysis
Signaling pathways
Network topology
Metabolic pathways
Statistical significance
Bias
author_facet Tuan-Minh Nguyen
Adib Shafi
Tin Nguyen
Sorin Draghici
author_sort Tuan-Minh Nguyen
title Identifying significantly impacted pathways: a comprehensive review and assessment
title_short Identifying significantly impacted pathways: a comprehensive review and assessment
title_full Identifying significantly impacted pathways: a comprehensive review and assessment
title_fullStr Identifying significantly impacted pathways: a comprehensive review and assessment
title_full_unstemmed Identifying significantly impacted pathways: a comprehensive review and assessment
title_sort identifying significantly impacted pathways: a comprehensive review and assessment
publisher BMC
series Genome Biology
issn 1474-760X
publishDate 2019-10-01
description Abstract Background Many high-throughput experiments compare two phenotypes such as disease vs. healthy, with the goal of understanding the underlying biological phenomena characterizing the given phenotype. Because of the importance of this type of analysis, more than 70 pathway analysis methods have been proposed so far. These can be categorized into two main categories: non-topology-based (non-TB) and topology-based (TB). Although some review papers discuss this topic from different aspects, there is no systematic, large-scale assessment of such methods. Furthermore, the majority of the pathway analysis approaches rely on the assumption of uniformity of p values under the null hypothesis, which is often not true. Results This article presents the most comprehensive comparative study on pathway analysis methods available to date. We compare the actual performance of 13 widely used pathway analysis methods in over 1085 analyses. These comparisons were performed using 2601 samples from 75 human disease data sets and 121 samples from 11 knockout mouse data sets. In addition, we investigate the extent to which each method is biased under the null hypothesis. Together, these data and results constitute a reliable benchmark against which future pathway analysis methods could and should be tested. Conclusion Overall, the result shows that no method is perfect. In general, TB methods appear to perform better than non-TB methods. This is somewhat expected since the TB methods take into consideration the structure of the pathway which is meant to describe the underlying phenomena. We also discover that most, if not all, listed approaches are biased and can produce skewed results under the null.
topic Pathway analysis
Signaling pathways
Network topology
Metabolic pathways
Statistical significance
Bias
url http://link.springer.com/article/10.1186/s13059-019-1790-4
work_keys_str_mv AT tuanminhnguyen identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment
AT adibshafi identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment
AT tinnguyen identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment
AT sorindraghici identifyingsignificantlyimpactedpathwaysacomprehensivereviewandassessment
_version_ 1724658351138144256