Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study

Gene Regulatory Networks (GRNs) allow the study of regulation of gene expression of whole genomes. Among the most relevant advantages of using networks to depict this key process, there is the visual representation of large amounts of information and the application of graph theory to generate new k...

Full description

Bibliographic Details
Main Authors: Yesid Cuesta-Astroz, Guilherme Gischkow Rucatti, Leandro Murgas, Carol D. SanMartín, Mario Sanhueza, Alberto J. M. Martin
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-07-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2021.649764/full
id doaj-4ec44de3a1544e3c88b2a2f82c1c1ba1
record_format Article
spelling doaj-4ec44de3a1544e3c88b2a2f82c1c1ba12021-07-28T09:38:52ZengFrontiers Media S.A.Frontiers in Genetics1664-80212021-07-011210.3389/fgene.2021.649764649764Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case StudyYesid Cuesta-Astroz0Guilherme Gischkow Rucatti1Leandro Murgas2Leandro Murgas3Carol D. SanMartín4Carol D. SanMartín5Mario Sanhueza6Mario Sanhueza7Alberto J. M. Martin8Alberto J. M. Martin9Colombian Institute of Tropical Medicine, CES University, Medellin, ColombiaCentro de Biología Integrativa, Facultad de Ciencias, Universidad Mayor, Santiago, ChileLaboratorio de Biologia de Redes, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, ChilePrograma de Doctorado en Genómica Integrativa, Vicerrectoría de Investigación, Universidad Mayor, Santiago, ChileDepartamento de Neurología y Neurocirugía, Hospital Clínico Universidad de Chile, Santiago, ChileCentro de Investigacíon Clínica Avanzada (CICA), Hospital Clínico Universidad de Chile, Santiago, ChileCentro de Biología Integrativa, Facultad de Ciencias, Universidad Mayor, Santiago, ChileEscuela de Biotecnología, Facultad de Ciencias, Universidad Mayor, Santiago, ChileLaboratorio de Biologia de Redes, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, ChileEscuela de Biotecnología, Facultad de Ciencias, Universidad Mayor, Santiago, ChileGene Regulatory Networks (GRNs) allow the study of regulation of gene expression of whole genomes. Among the most relevant advantages of using networks to depict this key process, there is the visual representation of large amounts of information and the application of graph theory to generate new knowledge. Nonetheless, despite the many uses of GRNs, it is still difficult and expensive to assign Transcription Factors (TFs) to the regulation of specific genes. ChIP-Seq allows the determination of TF Binding Sites (TFBSs) over whole genomes, but it is still an expensive technique that can only be applied one TF at a time and requires replicates to reduce its noise. Once TFBSs are determined, the assignment of each TF and its binding sites to the regulation of specific genes is not trivial, and it is often performed by carrying out site-specific experiments that are unfeasible to perform in all possible binding sites. Here, we addressed these relevant issues with a two-step methodology using Drosophila melanogaster as a case study. First, our protocol starts by gathering all transcription factor binding sites (TFBSs) determined with ChIP-Seq experiments available at ENCODE and FlyBase. Then each TFBS is used to assign TFs to the regulation of likely target genes based on the TFBS proximity to the transcription start site of all genes. In the final step, to try to select the most likely regulatory TF from those previously assigned to each gene, we employ GENIE3, a random forest-based method, and more than 9,000 RNA-seq experiments from D. melanogaster. Following, we employed known TF protein-protein interactions to estimate the feasibility of regulatory events in our filtered networks. Finally, we show how known interactions between co-regulatory TFs of each gene increase after the second step of our approach, and thus, the consistency of the TF-gene assignment. Also, we employed our methodology to create a network centered on the Drosophila melanogaster gene Hr96 to demonstrate the role of this transcription factor on mitochondrial gene regulation.https://www.frontiersin.org/articles/10.3389/fgene.2021.649764/fullgene regulatory networktranscriptional regulationtranscription factor targetsDrosophila melanogasterHR96
collection DOAJ
language English
format Article
sources DOAJ
author Yesid Cuesta-Astroz
Guilherme Gischkow Rucatti
Leandro Murgas
Leandro Murgas
Carol D. SanMartín
Carol D. SanMartín
Mario Sanhueza
Mario Sanhueza
Alberto J. M. Martin
Alberto J. M. Martin
spellingShingle Yesid Cuesta-Astroz
Guilherme Gischkow Rucatti
Leandro Murgas
Leandro Murgas
Carol D. SanMartín
Carol D. SanMartín
Mario Sanhueza
Mario Sanhueza
Alberto J. M. Martin
Alberto J. M. Martin
Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study
Frontiers in Genetics
gene regulatory network
transcriptional regulation
transcription factor targets
Drosophila melanogaster
HR96
author_facet Yesid Cuesta-Astroz
Guilherme Gischkow Rucatti
Leandro Murgas
Leandro Murgas
Carol D. SanMartín
Carol D. SanMartín
Mario Sanhueza
Mario Sanhueza
Alberto J. M. Martin
Alberto J. M. Martin
author_sort Yesid Cuesta-Astroz
title Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study
title_short Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study
title_full Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study
title_fullStr Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study
title_full_unstemmed Filtering of Data-Driven Gene Regulatory Networks Using Drosophila melanogaster as a Case Study
title_sort filtering of data-driven gene regulatory networks using drosophila melanogaster as a case study
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2021-07-01
description Gene Regulatory Networks (GRNs) allow the study of regulation of gene expression of whole genomes. Among the most relevant advantages of using networks to depict this key process, there is the visual representation of large amounts of information and the application of graph theory to generate new knowledge. Nonetheless, despite the many uses of GRNs, it is still difficult and expensive to assign Transcription Factors (TFs) to the regulation of specific genes. ChIP-Seq allows the determination of TF Binding Sites (TFBSs) over whole genomes, but it is still an expensive technique that can only be applied one TF at a time and requires replicates to reduce its noise. Once TFBSs are determined, the assignment of each TF and its binding sites to the regulation of specific genes is not trivial, and it is often performed by carrying out site-specific experiments that are unfeasible to perform in all possible binding sites. Here, we addressed these relevant issues with a two-step methodology using Drosophila melanogaster as a case study. First, our protocol starts by gathering all transcription factor binding sites (TFBSs) determined with ChIP-Seq experiments available at ENCODE and FlyBase. Then each TFBS is used to assign TFs to the regulation of likely target genes based on the TFBS proximity to the transcription start site of all genes. In the final step, to try to select the most likely regulatory TF from those previously assigned to each gene, we employ GENIE3, a random forest-based method, and more than 9,000 RNA-seq experiments from D. melanogaster. Following, we employed known TF protein-protein interactions to estimate the feasibility of regulatory events in our filtered networks. Finally, we show how known interactions between co-regulatory TFs of each gene increase after the second step of our approach, and thus, the consistency of the TF-gene assignment. Also, we employed our methodology to create a network centered on the Drosophila melanogaster gene Hr96 to demonstrate the role of this transcription factor on mitochondrial gene regulation.
topic gene regulatory network
transcriptional regulation
transcription factor targets
Drosophila melanogaster
HR96
url https://www.frontiersin.org/articles/10.3389/fgene.2021.649764/full
work_keys_str_mv AT yesidcuestaastroz filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT guilhermegischkowrucatti filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT leandromurgas filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT leandromurgas filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT caroldsanmartin filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT caroldsanmartin filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT mariosanhueza filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT mariosanhueza filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT albertojmmartin filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
AT albertojmmartin filteringofdatadrivengeneregulatorynetworksusingdrosophilamelanogasterasacasestudy
_version_ 1721278904420794368