McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes

Abstract Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-sup...

Full description

Bibliographic Details
Main Authors: Dina Hafez, Aslihan Karabacak, Sabrina Krueger, Yih-Chii Hwang, Li-San Wang, Robert P. Zinzen, Uwe Ohler
Format: Article
Language:English
Published: BMC 2017-10-01
Series:Genome Biology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13059-017-1316-x
id doaj-1c5d3f52b70a4f9ea6413e9095061cb1
record_format Article
spelling doaj-1c5d3f52b70a4f9ea6413e9095061cb12020-11-24T21:47:11ZengBMCGenome Biology1474-760X2017-10-0118112110.1186/s13059-017-1316-xMcEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genesDina Hafez0Aslihan Karabacak1Sabrina Krueger2Yih-Chii Hwang3Li-San Wang4Robert P. Zinzen5Uwe Ohler6Department of Computer Science, Duke UniversityBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular MedicineBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular MedicineGenomics and Computational Biology Graduate Program, University of PennsylvaniaGenomics and Computational Biology Graduate Program, University of PennsylvaniaBerlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular MedicineDepartment of Computer Science, Duke UniversityAbstract Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-supervised learning algorithm that predicts gene expression patterns based on enriched sequence features. Predicted expression patterns were 73–98% accurate, predicted assignments showed strong Hi-C interaction enrichment, enhancer-associated histone modifications were evident, and known functional motifs were recovered. Our model provides a general framework to link globally identified enhancers to targets and contributes to deciphering the regulatory genome.http://link.springer.com/article/10.1186/s13059-017-1316-xInterpolated Markov modelEnhancer to target gene assignmentGene expressionDrosophila melanogasterGene regulationMachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Dina Hafez
Aslihan Karabacak
Sabrina Krueger
Yih-Chii Hwang
Li-San Wang
Robert P. Zinzen
Uwe Ohler
spellingShingle Dina Hafez
Aslihan Karabacak
Sabrina Krueger
Yih-Chii Hwang
Li-San Wang
Robert P. Zinzen
Uwe Ohler
McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
Genome Biology
Interpolated Markov model
Enhancer to target gene assignment
Gene expression
Drosophila melanogaster
Gene regulation
Machine learning
author_facet Dina Hafez
Aslihan Karabacak
Sabrina Krueger
Yih-Chii Hwang
Li-San Wang
Robert P. Zinzen
Uwe Ohler
author_sort Dina Hafez
title McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
title_short McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
title_full McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
title_fullStr McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
title_full_unstemmed McEnhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
title_sort mcenhancer: predicting gene expression via semi-supervised assignment of enhancers to target genes
publisher BMC
series Genome Biology
issn 1474-760X
publishDate 2017-10-01
description Abstract Transcriptional enhancers regulate spatio-temporal gene expression. While genomic assays can identify putative enhancers en masse, assigning target genes is a complex challenge. We devised a machine learning approach, McEnhancer, which links target genes to putative enhancers via a semi-supervised learning algorithm that predicts gene expression patterns based on enriched sequence features. Predicted expression patterns were 73–98% accurate, predicted assignments showed strong Hi-C interaction enrichment, enhancer-associated histone modifications were evident, and known functional motifs were recovered. Our model provides a general framework to link globally identified enhancers to targets and contributes to deciphering the regulatory genome.
topic Interpolated Markov model
Enhancer to target gene assignment
Gene expression
Drosophila melanogaster
Gene regulation
Machine learning
url http://link.springer.com/article/10.1186/s13059-017-1316-x
work_keys_str_mv AT dinahafez mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
AT aslihankarabacak mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
AT sabrinakrueger mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
AT yihchiihwang mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
AT lisanwang mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
AT robertpzinzen mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
AT uweohler mcenhancerpredictinggeneexpressionviasemisupervisedassignmentofenhancerstotargetgenes
_version_ 1725898868474773504