K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells

Maps of Hi-C contacts between promoters and enhancers can be analyzed as networks, with cis-regulatory regions as nodes and their interactions as edges. We checked if in the published promoter–enhancer network of mouse embryonic stem (ES) cells the differences in the node type (promoter or enhancer)...

Full description

Bibliographic Details
Main Authors: Kinga Szyman, Bartek Wilczyński, Michał Dąbrowski
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:International Journal of Molecular Sciences
Subjects:
CpG
Online Access:https://www.mdpi.com/1422-0067/22/15/8067
id doaj-ed8de4650888414682fa94d7e32d49c7
record_format Article
spelling doaj-ed8de4650888414682fa94d7e32d49c72021-08-06T15:25:30ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672021-07-01228067806710.3390/ijms22158067K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES CellsKinga Szyman0Bartek Wilczyński1Michał Dąbrowski2Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, 02-093 Warsaw, PolandFaculty of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warsaw, PolandLaboratory of Bioinformatics, Nencki Institute of Experimental Biology, 02-093 Warsaw, PolandMaps of Hi-C contacts between promoters and enhancers can be analyzed as networks, with cis-regulatory regions as nodes and their interactions as edges. We checked if in the published promoter–enhancer network of mouse embryonic stem (ES) cells the differences in the node type (promoter or enhancer) and the node degree (number of regions interacting with a given promoter or enhancer) are reflected by sequence composition or sequence similarity of the interacting nodes. We used counts of all k-mers (k = 4) to analyze the sequence composition and the Euclidean distance between the k-mer count vectors (k-mer distance) as the measure of sequence (dis)similarity. The results we obtained with 4-mers are interpretable in terms of dinucleotides. Promoters are GC-rich as compared to enhancers, which is known. Enhancers are enriched in scaffold/matrix attachment regions (S/MARs) patterns and depleted of CpGs. Furthermore, we show that promoters are more similar to their interacting enhancers than vice-versa. Most notably, in both promoters and enhancers, the GC content and the CpG count increase with the node degree. As a consequence, enhancers of higher node degree become more similar to promoters, whereas higher degree promoters become less similar to enhancers. We confirmed the key results also for human keratinocytes.https://www.mdpi.com/1422-0067/22/15/8067Hi-C4-merdinucleotideCpGS/MARembryonic stem cell
collection DOAJ
language English
format Article
sources DOAJ
author Kinga Szyman
Bartek Wilczyński
Michał Dąbrowski
spellingShingle Kinga Szyman
Bartek Wilczyński
Michał Dąbrowski
K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
International Journal of Molecular Sciences
Hi-C
4-mer
dinucleotide
CpG
S/MAR
embryonic stem cell
author_facet Kinga Szyman
Bartek Wilczyński
Michał Dąbrowski
author_sort Kinga Szyman
title K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
title_short K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
title_full K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
title_fullStr K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
title_full_unstemmed K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
title_sort k-mer content changes with node degree in promoter–enhancer network of mouse es cells
publisher MDPI AG
series International Journal of Molecular Sciences
issn 1661-6596
1422-0067
publishDate 2021-07-01
description Maps of Hi-C contacts between promoters and enhancers can be analyzed as networks, with cis-regulatory regions as nodes and their interactions as edges. We checked if in the published promoter–enhancer network of mouse embryonic stem (ES) cells the differences in the node type (promoter or enhancer) and the node degree (number of regions interacting with a given promoter or enhancer) are reflected by sequence composition or sequence similarity of the interacting nodes. We used counts of all k-mers (k = 4) to analyze the sequence composition and the Euclidean distance between the k-mer count vectors (k-mer distance) as the measure of sequence (dis)similarity. The results we obtained with 4-mers are interpretable in terms of dinucleotides. Promoters are GC-rich as compared to enhancers, which is known. Enhancers are enriched in scaffold/matrix attachment regions (S/MARs) patterns and depleted of CpGs. Furthermore, we show that promoters are more similar to their interacting enhancers than vice-versa. Most notably, in both promoters and enhancers, the GC content and the CpG count increase with the node degree. As a consequence, enhancers of higher node degree become more similar to promoters, whereas higher degree promoters become less similar to enhancers. We confirmed the key results also for human keratinocytes.
topic Hi-C
4-mer
dinucleotide
CpG
S/MAR
embryonic stem cell
url https://www.mdpi.com/1422-0067/22/15/8067
work_keys_str_mv AT kingaszyman kmercontentchangeswithnodedegreeinpromoterenhancernetworkofmouseescells
AT bartekwilczynski kmercontentchangeswithnodedegreeinpromoterenhancernetworkofmouseescells
AT michałdabrowski kmercontentchangeswithnodedegreeinpromoterenhancernetworkofmouseescells
_version_ 1721218331075149824