A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation

<p>Abstract</p> <p>Background</p> <p><it>Drosophila </it>gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way for study...

Full description

Bibliographic Details
Main Authors: Kumar Sudhir, Zhou Zhi-Hua, Li Ying-Xin, Ji Shuiwang, Ye Jieping
Format: Article
Language:English
Published: BMC 2009-04-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/10/119
id doaj-6dfcbf77330d4b7fa3a737aa39b7ec66
record_format Article
spelling doaj-6dfcbf77330d4b7fa3a737aa39b7ec662020-11-25T00:19:18ZengBMCBMC Bioinformatics1471-21052009-04-0110111910.1186/1471-2105-10-119A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotationKumar SudhirZhou Zhi-HuaLi Ying-XinJi ShuiwangYe Jieping<p>Abstract</p> <p>Background</p> <p><it>Drosophila </it>gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way for studying the regulatory networks governing development. To facilitate pattern comparison and searching, groups of images in the Berkeley <it>Drosophila </it>Genome Project (BDGP) high-throughput study were annotated with a variable number of anatomical terms manually using a controlled vocabulary. Considering that the number of available images is rapidly increasing, it is imperative to design computational methods to automate this task.</p> <p>Results</p> <p>We present a computational method to annotate gene expression pattern images automatically. The proposed method uses the bag-of-words scheme to utilize the existing information on pattern annotation and annotates images using a model that exploits correlations among terms. The proposed method can annotate images individually or in groups (e.g., according to the developmental stage). In addition, the proposed method can integrate information from different two-dimensional views of embryos. Results on embryonic patterns from BDGP data demonstrate that our method significantly outperforms other methods.</p> <p>Conclusion</p> <p>The proposed bag-of-words scheme is effective in representing a set of annotations assigned to a group of images, and the model employed to annotate images successfully captures the correlations among different controlled vocabulary terms. The integration of existing annotation information from multiple embryonic views improves annotation performance.</p> http://www.biomedcentral.com/1471-2105/10/119
collection DOAJ
language English
format Article
sources DOAJ
author Kumar Sudhir
Zhou Zhi-Hua
Li Ying-Xin
Ji Shuiwang
Ye Jieping
spellingShingle Kumar Sudhir
Zhou Zhi-Hua
Li Ying-Xin
Ji Shuiwang
Ye Jieping
A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation
BMC Bioinformatics
author_facet Kumar Sudhir
Zhou Zhi-Hua
Li Ying-Xin
Ji Shuiwang
Ye Jieping
author_sort Kumar Sudhir
title A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation
title_short A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation
title_full A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation
title_fullStr A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation
title_full_unstemmed A bag-of-words approach for <it>Drosophila </it>gene expression pattern annotation
title_sort bag-of-words approach for <it>drosophila </it>gene expression pattern annotation
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2009-04-01
description <p>Abstract</p> <p>Background</p> <p><it>Drosophila </it>gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way for studying the regulatory networks governing development. To facilitate pattern comparison and searching, groups of images in the Berkeley <it>Drosophila </it>Genome Project (BDGP) high-throughput study were annotated with a variable number of anatomical terms manually using a controlled vocabulary. Considering that the number of available images is rapidly increasing, it is imperative to design computational methods to automate this task.</p> <p>Results</p> <p>We present a computational method to annotate gene expression pattern images automatically. The proposed method uses the bag-of-words scheme to utilize the existing information on pattern annotation and annotates images using a model that exploits correlations among terms. The proposed method can annotate images individually or in groups (e.g., according to the developmental stage). In addition, the proposed method can integrate information from different two-dimensional views of embryos. Results on embryonic patterns from BDGP data demonstrate that our method significantly outperforms other methods.</p> <p>Conclusion</p> <p>The proposed bag-of-words scheme is effective in representing a set of annotations assigned to a group of images, and the model employed to annotate images successfully captures the correlations among different controlled vocabulary terms. The integration of existing annotation information from multiple embryonic views improves annotation performance.</p>
url http://www.biomedcentral.com/1471-2105/10/119
work_keys_str_mv AT kumarsudhir abagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT zhouzhihua abagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT liyingxin abagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT jishuiwang abagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT yejieping abagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT kumarsudhir bagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT zhouzhihua bagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT liyingxin bagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT jishuiwang bagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
AT yejieping bagofwordsapproachforitdrosophilaitgeneexpressionpatternannotation
_version_ 1725372130837659648