McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
Abstract Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-sup...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-10-01
|
Series: | Genome Biology |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13059-017-1316-x |
id |
doaj-1c5d3f52b70a4f9ea6413e9095061cb1 |
---|---|
record_format |
Article |
spelling |
doaj-1c5d3f52b70a4f9ea6413e9095061cb12020-11-24T21:47:11ZengBMCGenome Biology1474-760X2017-10-0118112110.1186/s13059-017-1316-xMcEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genesDina Hafez0Aslihan Karabacak1Sabrina Krueger2Yih-Chii Hwang3Li-San Wang4Robert P. Zinzen5Uwe Ohler6Department of Computer Science, Duke UniversityBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular MedicineBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular MedicineGenomics and Computational Biology Graduate Program, University of PennsylvaniaGenomics and Computational Biology Graduate Program, University of PennsylvaniaBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular MedicineDepartment of Computer Science, Duke UniversityAbstract Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-supervised learning algorithm that predicts gene expression patterns based on enriched sequence features. Predicted expression patterns were 73–98% accurate, predicted assignments showed strong Hi-C interaction enrichment, enhancer-associated histone modifications were evident, and known functional motifs were recovered. Our model provides a general framework to link globally identified enhancers to targets and contributes to deciphering the regulatory genome.http://link.springer.com/article/10.1186/s13059-017-1316-xInterpolated Markov modelEnhancer to target gene assignmentGene expressionDrosophila melanogasterGene regulationMachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Dina Hafez Aslihan Karabacak Sabrina Krueger Yih-Chii Hwang Li-San Wang Robert P. Zinzen Uwe Ohler |
spellingShingle |
Dina Hafez Aslihan Karabacak Sabrina Krueger Yih-Chii Hwang Li-San Wang Robert P. Zinzen Uwe Ohler McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes Genome Biology Interpolated Markov model Enhancer to target gene assignment Gene expression Drosophila melanogaster Gene regulation Machine learning |
author_facet |
Dina Hafez Aslihan Karabacak Sabrina Krueger Yih-Chii Hwang Li-San Wang Robert P. Zinzen Uwe Ohler |
author_sort |
Dina Hafez |
title |
McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes |
title_short |
McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes |
title_full |
McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes |
title_fullStr |
McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes |
title_full_unstemmed |
McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes |
title_sort |
mcenhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes |
publisher |
BMC |
series |
Genome Biology |
issn |
1474-760X |
publishDate |
2017-10-01 |
description |
Abstract Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-supervised learning algorithm that predicts gene expression patterns based on enriched sequence features. Predicted expression patterns were 73–98% accurate, predicted assignments showed strong Hi-C interaction enrichment, enhancer-associated histone modifications were evident, and known functional motifs were recovered. Our model provides a general framework to link globally identified enhancers to targets and contributes to deciphering the regulatory genome. |
topic |
Interpolated Markov model Enhancer to target gene assignment Gene expression Drosophila melanogaster Gene regulation Machine learning |
url |
http://link.springer.com/article/10.1186/s13059-017-1316-x |
work_keys_str_mv |
AT dinahafez mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes AT aslihankarabacak mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes AT sabrinakrueger mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes AT yihchiihwang mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes AT lisanwang mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes AT robertpzinzen mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes AT uweohler mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes |
_version_ |
1725898868474773504 |