Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.

The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval a...

Full description

Bibliographic Details
Main Authors: Anthony Deeter, Mark Dalman, Joseph Haddad, Zhong-Hui Duan
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5648141?pdf=render
id doaj-d514c6e1674d4925be943df04f4cdc71
record_format Article
spelling doaj-d514c6e1674d4925be943df04f4cdc712020-11-25T02:48:54ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-011210e018600410.1371/journal.pone.0186004Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.Anthony DeeterMark DalmanJoseph HaddadZhong-Hui DuanThe PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.http://europepmc.org/articles/PMC5648141?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Anthony Deeter
Mark Dalman
Joseph Haddad
Zhong-Hui Duan
spellingShingle Anthony Deeter
Mark Dalman
Joseph Haddad
Zhong-Hui Duan
Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.
PLoS ONE
author_facet Anthony Deeter
Mark Dalman
Joseph Haddad
Zhong-Hui Duan
author_sort Anthony Deeter
title Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.
title_short Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.
title_full Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.
title_fullStr Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.
title_full_unstemmed Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.
title_sort inferring gene and protein interactions using pubmed citations and consensus bayesian networks.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.
url http://europepmc.org/articles/PMC5648141?pdf=render
work_keys_str_mv AT anthonydeeter inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
AT markdalman inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
AT josephhaddad inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
AT zhonghuiduan inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
_version_ 1724745986164654080