K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells
Maps of Hi-C contacts between promoters and enhancers can be analyzed as networks, with cis-regulatory regions as nodes and their interactions as edges. We checked if in the published promoter–enhancer network of mouse embryonic stem (ES) cells the differences in the node type (promoter or enhancer)...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-07-01
|
Series: | International Journal of Molecular Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/1422-0067/22/15/8067 |
id |
doaj-ed8de4650888414682fa94d7e32d49c7 |
---|---|
record_format |
Article |
spelling |
doaj-ed8de4650888414682fa94d7e32d49c72021-08-06T15:25:30ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672021-07-01228067806710.3390/ijms22158067K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES CellsKinga Szyman0Bartek Wilczyński1Michał Dąbrowski2Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, 02-093 Warsaw, PolandFaculty of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warsaw, PolandLaboratory of Bioinformatics, Nencki Institute of Experimental Biology, 02-093 Warsaw, PolandMaps of Hi-C contacts between promoters and enhancers can be analyzed as networks, with cis-regulatory regions as nodes and their interactions as edges. We checked if in the published promoter–enhancer network of mouse embryonic stem (ES) cells the differences in the node type (promoter or enhancer) and the node degree (number of regions interacting with a given promoter or enhancer) are reflected by sequence composition or sequence similarity of the interacting nodes. We used counts of all k-mers (k = 4) to analyze the sequence composition and the Euclidean distance between the k-mer count vectors (k-mer distance) as the measure of sequence (dis)similarity. The results we obtained with 4-mers are interpretable in terms of dinucleotides. Promoters are GC-rich as compared to enhancers, which is known. Enhancers are enriched in scaffold/matrix attachment regions (S/MARs) patterns and depleted of CpGs. Furthermore, we show that promoters are more similar to their interacting enhancers than vice-versa. Most notably, in both promoters and enhancers, the GC content and the CpG count increase with the node degree. As a consequence, enhancers of higher node degree become more similar to promoters, whereas higher degree promoters become less similar to enhancers. We confirmed the key results also for human keratinocytes.https://www.mdpi.com/1422-0067/22/15/8067Hi-C4-merdinucleotideCpGS/MARembryonic stem cell |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kinga Szyman Bartek Wilczyński Michał Dąbrowski |
spellingShingle |
Kinga Szyman Bartek Wilczyński Michał Dąbrowski K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells International Journal of Molecular Sciences Hi-C 4-mer dinucleotide CpG S/MAR embryonic stem cell |
author_facet |
Kinga Szyman Bartek Wilczyński Michał Dąbrowski |
author_sort |
Kinga Szyman |
title |
K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells |
title_short |
K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells |
title_full |
K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells |
title_fullStr |
K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells |
title_full_unstemmed |
K-mer Content Changes with Node Degree in Promoter–Enhancer Network of Mouse ES Cells |
title_sort |
k-mer content changes with node degree in promoter–enhancer network of mouse es cells |
publisher |
MDPI AG |
series |
International Journal of Molecular Sciences |
issn |
1661-6596 1422-0067 |
publishDate |
2021-07-01 |
description |
Maps of Hi-C contacts between promoters and enhancers can be analyzed as networks, with cis-regulatory regions as nodes and their interactions as edges. We checked if in the published promoter–enhancer network of mouse embryonic stem (ES) cells the differences in the node type (promoter or enhancer) and the node degree (number of regions interacting with a given promoter or enhancer) are reflected by sequence composition or sequence similarity of the interacting nodes. We used counts of all k-mers (k = 4) to analyze the sequence composition and the Euclidean distance between the k-mer count vectors (k-mer distance) as the measure of sequence (dis)similarity. The results we obtained with 4-mers are interpretable in terms of dinucleotides. Promoters are GC-rich as compared to enhancers, which is known. Enhancers are enriched in scaffold/matrix attachment regions (S/MARs) patterns and depleted of CpGs. Furthermore, we show that promoters are more similar to their interacting enhancers than vice-versa. Most notably, in both promoters and enhancers, the GC content and the CpG count increase with the node degree. As a consequence, enhancers of higher node degree become more similar to promoters, whereas higher degree promoters become less similar to enhancers. We confirmed the key results also for human keratinocytes. |
topic |
Hi-C 4-mer dinucleotide CpG S/MAR embryonic stem cell |
url |
https://www.mdpi.com/1422-0067/22/15/8067 |
work_keys_str_mv |
AT kingaszyman kmercontentchangeswithnodedegreeinpromoterenhancernetworkofmouseescells AT bartekwilczynski kmercontentchangeswithnodedegreeinpromoterenhancernetworkofmouseescells AT michałdabrowski kmercontentchangeswithnodedegreeinpromoterenhancernetworkofmouseescells |
_version_ |
1721218331075149824 |