Probing instructions for expression regulation in gene nucleotide compositions.

Gene expression is orchestrated by distinct regulatory regions to ensure a wide variety of cell types and functions. A challenge is to identify which regulatory regions are active, what are their associated features and how they work together in each cell type. Several approaches have tackled this p...

Full description

Bibliographic Details
Main Authors: Chloé Bessière, May Taha, Florent Petitprez, Jimmy Vandel, Jean-Michel Marin, Laurent Bréhélin, Sophie Lèbre, Charles-Henri Lecellier
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1005921
id doaj-4364bca7da2c4e7485c066e824dc79f0
record_format Article
spelling doaj-4364bca7da2c4e7485c066e824dc79f02021-04-21T15:10:01ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582018-01-01141e100592110.1371/journal.pcbi.1005921Probing instructions for expression regulation in gene nucleotide compositions.Chloé BessièreMay TahaFlorent PetitprezJimmy VandelJean-Michel MarinLaurent BréhélinSophie LèbreCharles-Henri LecellierGene expression is orchestrated by distinct regulatory regions to ensure a wide variety of cell types and functions. A challenge is to identify which regulatory regions are active, what are their associated features and how they work together in each cell type. Several approaches have tackled this problem by modeling gene expression based on epigenetic marks, with the ultimate goal of identifying driving regions and associated genomic variations that are clinically relevant in particular in precision medicine. However, these models rely on experimental data, which are limited to specific samples (even often to cell lines) and cannot be generated for all regulators and all patients. In addition, we show here that, although these approaches are accurate in predicting gene expression, inference of TF combinations from this type of models is not straightforward. Furthermore these methods are not designed to capture regulation instructions present at the sequence level, before the binding of regulators or the opening of the chromatin. Here, we probe sequence-level instructions for gene expression and develop a method to explain mRNA levels based solely on nucleotide features. Our method positions nucleotide composition as a critical component of gene expression. Moreover, our approach, able to rank regulatory regions according to their contribution, unveils a strong influence of the gene body sequence, in particular introns. We further provide evidence that the contribution of nucleotide content can be linked to co-regulations associated with genome 3D architecture and to associations of genes within topologically associated domains.https://doi.org/10.1371/journal.pcbi.1005921
collection DOAJ
language English
format Article
sources DOAJ
author Chloé Bessière
May Taha
Florent Petitprez
Jimmy Vandel
Jean-Michel Marin
Laurent Bréhélin
Sophie Lèbre
Charles-Henri Lecellier
spellingShingle Chloé Bessière
May Taha
Florent Petitprez
Jimmy Vandel
Jean-Michel Marin
Laurent Bréhélin
Sophie Lèbre
Charles-Henri Lecellier
Probing instructions for expression regulation in gene nucleotide compositions.
PLoS Computational Biology
author_facet Chloé Bessière
May Taha
Florent Petitprez
Jimmy Vandel
Jean-Michel Marin
Laurent Bréhélin
Sophie Lèbre
Charles-Henri Lecellier
author_sort Chloé Bessière
title Probing instructions for expression regulation in gene nucleotide compositions.
title_short Probing instructions for expression regulation in gene nucleotide compositions.
title_full Probing instructions for expression regulation in gene nucleotide compositions.
title_fullStr Probing instructions for expression regulation in gene nucleotide compositions.
title_full_unstemmed Probing instructions for expression regulation in gene nucleotide compositions.
title_sort probing instructions for expression regulation in gene nucleotide compositions.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2018-01-01
description Gene expression is orchestrated by distinct regulatory regions to ensure a wide variety of cell types and functions. A challenge is to identify which regulatory regions are active, what are their associated features and how they work together in each cell type. Several approaches have tackled this problem by modeling gene expression based on epigenetic marks, with the ultimate goal of identifying driving regions and associated genomic variations that are clinically relevant in particular in precision medicine. However, these models rely on experimental data, which are limited to specific samples (even often to cell lines) and cannot be generated for all regulators and all patients. In addition, we show here that, although these approaches are accurate in predicting gene expression, inference of TF combinations from this type of models is not straightforward. Furthermore these methods are not designed to capture regulation instructions present at the sequence level, before the binding of regulators or the opening of the chromatin. Here, we probe sequence-level instructions for gene expression and develop a method to explain mRNA levels based solely on nucleotide features. Our method positions nucleotide composition as a critical component of gene expression. Moreover, our approach, able to rank regulatory regions according to their contribution, unveils a strong influence of the gene body sequence, in particular introns. We further provide evidence that the contribution of nucleotide content can be linked to co-regulations associated with genome 3D architecture and to associations of genes within topologically associated domains.
url https://doi.org/10.1371/journal.pcbi.1005921
work_keys_str_mv AT chloebessiere probinginstructionsforexpressionregulationingenenucleotidecompositions
AT maytaha probinginstructionsforexpressionregulationingenenucleotidecompositions
AT florentpetitprez probinginstructionsforexpressionregulationingenenucleotidecompositions
AT jimmyvandel probinginstructionsforexpressionregulationingenenucleotidecompositions
AT jeanmichelmarin probinginstructionsforexpressionregulationingenenucleotidecompositions
AT laurentbrehelin probinginstructionsforexpressionregulationingenenucleotidecompositions
AT sophielebre probinginstructionsforexpressionregulationingenenucleotidecompositions
AT charleshenrilecellier probinginstructionsforexpressionregulationingenenucleotidecompositions
_version_ 1714667891709706240