PyGNA: a unified framework for geneset network analysis

Abstract Background Gene and protein interaction experiments provide unique opportunities to study the molecular wiring of a cell. Integrating high-throughput functional genomics data with this information can help identifying networks associated with complex diseases and phenotypes. Results Here we...

Full description

Bibliographic Details
Main Authors: Viola Fanfani, Fabio Cassano, Giovanni Stracquadanio
Format: Article
Language:English
Published: BMC 2020-10-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-020-03801-1
id doaj-fc05c284c9d44b4bbbbe3e197d2758dd
record_format Article
spelling doaj-fc05c284c9d44b4bbbbe3e197d2758dd2020-11-25T04:00:18ZengBMCBMC Bioinformatics1471-21052020-10-0121112210.1186/s12859-020-03801-1PyGNA: a unified framework for geneset network analysisViola Fanfani0Fabio Cassano1Giovanni Stracquadanio2School of Biological Science, The University of EdinburghSchool of Biological Science, The University of EdinburghSchool of Biological Science, The University of EdinburghAbstract Background Gene and protein interaction experiments provide unique opportunities to study the molecular wiring of a cell. Integrating high-throughput functional genomics data with this information can help identifying networks associated with complex diseases and phenotypes. Results Here we introduce an integrated statistical framework to test network properties of single and multiple genesets under different interaction models. We implemented this framework as an open-source software, called Python Geneset Network Analysis (PyGNA). Our software is designed for easy integration into existing analysis pipelines and to generate high quality figures and reports. We also developed PyGNA to take advantage of multi-core systems to generate calibrated null distributions on large datasets. We then present the results of extensive benchmarking of the tests implemented in PyGNA and a use case inspired by RNA sequencing data analysis, showing how PyGNA can be easily integrated to study biological networks. PyGNA is available at http://github.com/stracquadaniolab/pygna and can be easily installed using the PyPi or Anaconda package managers, and Docker. Conclusions We present a tool for network-aware geneset analysis. PyGNA can either be readily used and easily integrated into existing high-performance data analysis pipelines or as a Python package to implement new tests and analyses. With the increasing availability of population-scale omic data, PyGNA provides a viable approach for large scale geneset network analysis.http://link.springer.com/article/10.1186/s12859-020-03801-1Geneset Network AnalysisBiological NetworksNetwork analysis workflow
collection DOAJ
language English
format Article
sources DOAJ
author Viola Fanfani
Fabio Cassano
Giovanni Stracquadanio
spellingShingle Viola Fanfani
Fabio Cassano
Giovanni Stracquadanio
PyGNA: a unified framework for geneset network analysis
BMC Bioinformatics
Geneset Network Analysis
Biological Networks
Network analysis workflow
author_facet Viola Fanfani
Fabio Cassano
Giovanni Stracquadanio
author_sort Viola Fanfani
title PyGNA: a unified framework for geneset network analysis
title_short PyGNA: a unified framework for geneset network analysis
title_full PyGNA: a unified framework for geneset network analysis
title_fullStr PyGNA: a unified framework for geneset network analysis
title_full_unstemmed PyGNA: a unified framework for geneset network analysis
title_sort pygna: a unified framework for geneset network analysis
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2020-10-01
description Abstract Background Gene and protein interaction experiments provide unique opportunities to study the molecular wiring of a cell. Integrating high-throughput functional genomics data with this information can help identifying networks associated with complex diseases and phenotypes. Results Here we introduce an integrated statistical framework to test network properties of single and multiple genesets under different interaction models. We implemented this framework as an open-source software, called Python Geneset Network Analysis (PyGNA). Our software is designed for easy integration into existing analysis pipelines and to generate high quality figures and reports. We also developed PyGNA to take advantage of multi-core systems to generate calibrated null distributions on large datasets. We then present the results of extensive benchmarking of the tests implemented in PyGNA and a use case inspired by RNA sequencing data analysis, showing how PyGNA can be easily integrated to study biological networks. PyGNA is available at http://github.com/stracquadaniolab/pygna and can be easily installed using the PyPi or Anaconda package managers, and Docker. Conclusions We present a tool for network-aware geneset analysis. PyGNA can either be readily used and easily integrated into existing high-performance data analysis pipelines or as a Python package to implement new tests and analyses. With the increasing availability of population-scale omic data, PyGNA provides a viable approach for large scale geneset network analysis.
topic Geneset Network Analysis
Biological Networks
Network analysis workflow
url http://link.springer.com/article/10.1186/s12859-020-03801-1
work_keys_str_mv AT violafanfani pygnaaunifiedframeworkforgenesetnetworkanalysis
AT fabiocassano pygnaaunifiedframeworkforgenesetnetworkanalysis
AT giovannistracquadanio pygnaaunifiedframeworkforgenesetnetworkanalysis
_version_ 1724451442509479936