Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data

<p>Abstract</p> <p>Background</p> <p>Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expr...

Full description

Bibliographic Details
Main Authors: Wilson Tyler J, Lai Liming, Ban Yuguang, Ge Steven X
Format: Article
Language:English
Published: BMC 2012-06-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/13/237
id doaj-4f79d4ed3a9a4e66bb360ec6b87ae15b
record_format Article
spelling doaj-4f79d4ed3a9a4e66bb360ec6b87ae15b2020-11-24T21:13:57ZengBMCBMC Genomics1471-21642012-06-0113123710.1186/1471-2164-13-237Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression DataWilson Tyler JLai LimingBan YuguangGe Steven X<p>Abstract</p> <p>Background</p> <p>Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes.</p> <p>Results</p> <p>We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant <it>Arabidopsis thaliana</it>. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps.</p> <p>Conclusions</p> <p>This study identifies a network of correlated metagenes composed of <it>Arabidopsis</it> genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.</p> http://www.biomedcentral.com/1471-2164/13/237
collection DOAJ
language English
format Article
sources DOAJ
author Wilson Tyler J
Lai Liming
Ban Yuguang
Ge Steven X
spellingShingle Wilson Tyler J
Lai Liming
Ban Yuguang
Ge Steven X
Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
BMC Genomics
author_facet Wilson Tyler J
Lai Liming
Ban Yuguang
Ge Steven X
author_sort Wilson Tyler J
title Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
title_short Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
title_full Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
title_fullStr Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
title_full_unstemmed Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
title_sort identification of metagenes and their interactions through large-scale analysis of <it>arabidopsis</it> gene expression data
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2012-06-01
description <p>Abstract</p> <p>Background</p> <p>Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes.</p> <p>Results</p> <p>We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant <it>Arabidopsis thaliana</it>. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps.</p> <p>Conclusions</p> <p>This study identifies a network of correlated metagenes composed of <it>Arabidopsis</it> genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.</p>
url http://www.biomedcentral.com/1471-2164/13/237
work_keys_str_mv AT wilsontylerj identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata
AT lailiming identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata
AT banyuguang identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata
AT gestevenx identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata
_version_ 1716747773206855680