Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data
<p>Abstract</p> <p>Background</p> <p>Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expr...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2012-06-01
|
Series: | BMC Genomics |
Online Access: | http://www.biomedcentral.com/1471-2164/13/237 |
id |
doaj-4f79d4ed3a9a4e66bb360ec6b87ae15b |
---|---|
record_format |
Article |
spelling |
doaj-4f79d4ed3a9a4e66bb360ec6b87ae15b2020-11-24T21:13:57ZengBMCBMC Genomics1471-21642012-06-0113123710.1186/1471-2164-13-237Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression DataWilson Tyler JLai LimingBan YuguangGe Steven X<p>Abstract</p> <p>Background</p> <p>Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes.</p> <p>Results</p> <p>We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant <it>Arabidopsis thaliana</it>. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps.</p> <p>Conclusions</p> <p>This study identifies a network of correlated metagenes composed of <it>Arabidopsis</it> genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.</p> http://www.biomedcentral.com/1471-2164/13/237 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wilson Tyler J Lai Liming Ban Yuguang Ge Steven X |
spellingShingle |
Wilson Tyler J Lai Liming Ban Yuguang Ge Steven X Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data BMC Genomics |
author_facet |
Wilson Tyler J Lai Liming Ban Yuguang Ge Steven X |
author_sort |
Wilson Tyler J |
title |
Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data |
title_short |
Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data |
title_full |
Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data |
title_fullStr |
Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data |
title_full_unstemmed |
Identification of metagenes and their Interactions through Large-scale Analysis of <it>Arabidopsis</it> Gene Expression Data |
title_sort |
identification of metagenes and their interactions through large-scale analysis of <it>arabidopsis</it> gene expression data |
publisher |
BMC |
series |
BMC Genomics |
issn |
1471-2164 |
publishDate |
2012-06-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes.</p> <p>Results</p> <p>We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant <it>Arabidopsis thaliana</it>. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps.</p> <p>Conclusions</p> <p>This study identifies a network of correlated metagenes composed of <it>Arabidopsis</it> genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.</p> |
url |
http://www.biomedcentral.com/1471-2164/13/237 |
work_keys_str_mv |
AT wilsontylerj identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata AT lailiming identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata AT banyuguang identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata AT gestevenx identificationofmetagenesandtheirinteractionsthroughlargescaleanalysisofitarabidopsisitgeneexpressiondata |
_version_ |
1716747773206855680 |