Genome-Wide Constitutively Expressed Gene Analysis and New Reference Gene Selection Based on Transcriptome Data: A Case Study from Poplar/Canker Disease Interaction

Transcriptome datasets generated to identify differentially expressed (DE) genes are widely used to understand organismal biology, but they also contain untapped information that can be used to develop more precise analytical tools. With the use of transcriptome data generated from popl...

Bibliographic Details
Main Authors: Jiaping Zhao, Fan Yang, Jinxia Feng, Yanli Wang, Barbara Lachenbruch, Jiange Wang, Xianchong Wan
Format: Article
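Language: English
Published: Frontiers Media S.A. 2017-10-01
Series: Frontiers in Plant Science
Subjects: high-throughput sequencing, expression stability, house-keeping gene, internal control, differential expression, integrate analysis
Online Access: http://journal.frontiersin.org/article/10.3389/fpls.2017.01876/full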
id doaj-4b850238b2ee4bedbc151a5b2c79a040
record_format Article
spelling doaj-4b850238b2ee4bedbc151a5b2c79a040
2020-11-24T22:58:21Z
eng
Frontiers Media S.A.
Frontiers in Plant Science
1664-462X
2017-10-01
8
10.3389/fpls.2017.01876
275010
Genome-Wide Constitutively Expressed Gene Analysis and New Reference Gene Selection Based on Transcriptome Data: A Case Study from Poplar/Canker Disease Interaction
Jiaping Zhao (State Key Laboratory of Tree Genetics and Breeding, Institute of New Forestry Technology, Chinese Academy of Forestry, Beijing, China)
Fan Yang (State Key Laboratory of Tree Genetics and Breeding, Institute of New Forestry Technology, Chinese Academy of Forestry, Beijing, China; Department of Forestry, College of Forestry, Jiangxi Agricultural University, Nanchang, China)
Jinxia Feng (State Key Laboratory of Tree Genetics and Breeding, Institute of New Forestry Technology, Chinese Academy of Forestry, Beijing, China)
Yanli Wang (Department of Horticulture, School of Horticulture Landscape Architecture, Henan Institute of Science and Technology, Xinxiang, China)
Barbara Lachenbruch (Department of Forest Ecosystems and Society, Oregon State University, Corvallis, OR, United States)
Jiange Wang (Department of Forestry, College of Forestry, Jiangxi Agricultural University, Nanchang, China)
Xianchong Wan (State Key Laboratory of Tree Genetics and Breeding, Institute of New Forestry Technology, Chinese Academy of Forestry, Beijing, China)
collection DOAJ
language English
format Article
sources DOAJ
author Jiaping Zhao
Fan Yang
Jinxia Feng
Yanli Wang
Barbara Lachenbruch
Jiange Wang
Xianchong Wan
author_sort Jiaping Zhao
title Genome-Wide Constitutively Expressed Gene Analysis and New Reference Gene Selection Based on Transcriptome Data: A Case Study from Poplar/Canker Disease Interaction
publisher Frontiers Media S.A.
series Frontiers in Plant Science
issn 1664-462X
publishDate 2017-10-01
description Transcriptome datasets generated to identify differentially expressed (DE) genes are widely used to understand organismal biology, but they also contain untapped information that can be used to develop more precise analytical tools. Using transcriptome data generated from the poplar/canker disease interaction system, we describe a methodology to identify candidate reference genes from high-throughput sequencing data. This methodology will improve the accuracy of RT-qPCR and will lead to better standards for the normalization of expression data. Expression stability analysis from xylem and phloem of Populus beijingensis inoculated with the fungal canker pathogen Botryosphaeria dothidea revealed that 729 poplar transcripts (1.11%) were stably expressed, at thresholds of coefficient of variation (CV) of FPKM < 20% and maximum fold change (MFC) of FPKM < 2.0. Expression stability and bioinformatics analysis suggested that commonly used house-keeping (HK) genes were not the most appropriate internal controls: 70 of the 72 commonly used HK genes were not stably expressed, 45 of the 72 produced multiple isoform transcripts, and some of their reported primers produced nonspecific amplicons in PCR amplification. RT-qPCR analysis comparing the expression stability of 10 commonly used poplar HK genes and 20 of the 729 newly identified stably expressed transcripts showed that some of the newly identified genes (such as SSU_S8e, LSU_L5e, and 20S_PSU) ranked higher in stability than most of the commonly used HK genes. Based on these results, we recommend a pipeline for deriving reference genes from transcriptome data. An appropriate candidate gene should have a unique transcript, constitutive expression, a CV of expression < 20% (or possibly 30%), an MFC of expression < 2, and an expression level of 50–1,000 units. Lastly, when four of the newly identified HK genes were used to normalize expression data for 20 differentially expressed genes, the resulting expression values were similar to the Cufflinks output. The methods described here provide an alternative pathway for the normalization of transcriptome data, a process that is essential for integrating analyses of transcriptome data across environments, laboratories, sequencing platforms, and species.
topic high-throughput sequencing
expression stability
house-keeping gene
internal control
differential expression
integrate analysis
url http://journal.frontiersin.org/article/10.3389/fpls.2017.01876/full
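
The candidate criteria stated in the description (unique transcript, CV of FPKM < 20%, MFC of FPKM < 2, expression level of 50–1,000 units) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. Several details are assumptions made only for illustration: the function names `stability_metrics` and `select_candidates`, the transcript names and FPKM values, the use of the sample standard deviation for CV, MFC computed as the highest FPKM divided by the lowest, the "expression level" taken as the mean FPKM, and "unique transcript" read as a gene with exactly one isoform.

```python
from statistics import mean, stdev


def stability_metrics(fpkm_values):
    """Return (CV in percent, MFC) for one transcript, or None if undefined.

    Assumptions: CV uses the sample standard deviation; MFC is the
    highest FPKM divided by the lowest FPKM across samples.
    """
    if len(fpkm_values) < 2 or min(fpkm_values) <= 0:
        return None  # too few samples, or a zero value makes MFC undefined
    cv = 100.0 * stdev(fpkm_values) / mean(fpkm_values)
    mfc = max(fpkm_values) / min(fpkm_values)
    return cv, mfc


def select_candidates(fpkm_by_transcript, isoform_counts,
                      max_cv=20.0, max_mfc=2.0,
                      min_level=50.0, max_level=1000.0):
    """Return the sorted names of transcripts that pass every threshold."""
    selected = []
    for name, values in fpkm_by_transcript.items():
        if isoform_counts.get(name) != 1:
            continue  # assumed reading of "unique transcript"
        metrics = stability_metrics(values)
        if metrics is None:
            continue
        cv, mfc = metrics
        level = mean(values)  # assumed reading of "expression level"
        if cv < max_cv and mfc < max_mfc and min_level <= level <= max_level:
            selected.append(name)
    return sorted(selected)


# Hypothetical FPKM values across four samples (not data from the article).
fpkm = {
    "tx_stable": [100.0, 110.0, 95.0, 105.0],
    "tx_variable": [100.0, 300.0, 50.0, 150.0],
    "tx_low": [5.0, 5.5, 4.8, 5.2],
    "tx_multi": [200.0, 210.0, 190.0, 205.0],
}
isoforms = {"tx_stable": 1, "tx_variable": 1, "tx_low": 1, "tx_multi": 3}

print(select_candidates(fpkm, isoforms))
```

With these made-up values only `tx_stable` passes: `tx_variable` exceeds the MFC threshold, `tx_low` falls below the expression range, and `tx_multi` has more than one isoform. Passing `max_cv=30.0` would correspond to the looser CV threshold the description mentions as a possibility.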