Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species

Abstract Cauliflower is an important variety of Brassica oleracea and is planted worldwide. Here, the high-quality genome sequence of cauliflower was reported. The assembled cauliflower genome was 584.60 Mb in size, with a contig N50 of 2.11 Mb, and contained 47,772 genes; 56.65% of the genome was c...

Full description

Bibliographic Details
Main Authors: Deling Sun, Chunguo Wang, Xiaoli Zhang, Wenlin Zhang, Hanmin Jiang, Xingwei Yao, Lili Liu, Zhenghua Wen, Guobao Niu, Xiaozheng Shan
Format: Article
Language:English
Published: Nature Publishing Group 2019-07-01
Series:Horticulture Research
Online Access:https://doi.org/10.1038/s41438-019-0164-0
id doaj-80200ce2412e4e82b8d9bcad85e71abb
record_format Article
spelling doaj-80200ce2412e4e82b8d9bcad85e71abb2020-12-07T23:36:20ZengNature Publishing GroupHorticulture Research2052-72762019-07-016111110.1038/s41438-019-0164-0Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica speciesDeling Sun0Chunguo Wang1Xiaoli Zhang2Wenlin Zhang3Hanmin Jiang4Xingwei Yao5Lili Liu6Zhenghua Wen7Guobao Niu8Xiaozheng Shan9Tianjin Academy of Agricultural SciencesCollege of Life Sciences, Nankai UniversityTianjin Kernel Vegetable Research InstituteNovogene Bioinformatics InstituteTianjin Kernel Vegetable Research InstituteTianjin Kernel Vegetable Research InstituteTianjin Kernel Vegetable Research InstituteTianjin Kernel Vegetable Research InstituteTianjin Kernel Vegetable Research InstituteTianjin Kernel Vegetable Research InstituteAbstract Cauliflower is an important variety of Brassica oleracea and is planted worldwide. Here, the high-quality genome sequence of cauliflower was reported. The assembled cauliflower genome was 584.60 Mb in size, with a contig N50 of 2.11 Mb, and contained 47,772 genes; 56.65% of the genome was composed of repetitive sequences. Among these sequences, long terminal repeats (LTRs) were the most abundant (32.71% of the genome), followed by transposable elements (TEs) (12.62%). Comparative genomic analysis confirmed that after an ancient paleohexaploidy (γ) event, cauliflower underwent two whole-genome duplication (WGD) events shared with Arabidopsis and an additional whole-genome triplication (WGT) event shared with other Brassica species. The present cultivated cauliflower diverged from the ancestral B. oleracea species ~3.0 million years ago (Mya). The speciation of cauliflower (~2.0 Mya) was later than that of B. oleracea L. var. capitata (approximately 2.6 Mya) and other Brassica species (over 2.0 Mya). Chromosome no. 03 of cauliflower shared the most syntenic blocks with the A, B, and C genomes of Brassica species and its eight other chromosomes, implying that chromosome no. 03 might be the most ancient one in the cauliflower genome, which was consistent with the chromosome being inherited from the common ancestor of Brassica species. In addition, 2,718 specific genes, 228 expanded genes, 2 contracted genes, and 1,065 positively selected genes in cauliflower were identified and functionally annotated. These findings provide new insights into the genomic diversity of Brassica species and serve as a valuable reference for molecular breeding of cauliflower.https://doi.org/10.1038/s41438-019-0164-0
collection DOAJ
language English
format Article
sources DOAJ
author Deling Sun
Chunguo Wang
Xiaoli Zhang
Wenlin Zhang
Hanmin Jiang
Xingwei Yao
Lili Liu
Zhenghua Wen
Guobao Niu
Xiaozheng Shan
spellingShingle Deling Sun
Chunguo Wang
Xiaoli Zhang
Wenlin Zhang
Hanmin Jiang
Xingwei Yao
Lili Liu
Zhenghua Wen
Guobao Niu
Xiaozheng Shan
Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species
Horticulture Research
author_facet Deling Sun
Chunguo Wang
Xiaoli Zhang
Wenlin Zhang
Hanmin Jiang
Xingwei Yao
Lili Liu
Zhenghua Wen
Guobao Niu
Xiaozheng Shan
author_sort Deling Sun
title Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species
title_short Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species
title_full Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species
title_fullStr Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species
title_full_unstemmed Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species
title_sort draft genome sequence of cauliflower (brassica oleracea l. var. botrytis) provides new insights into the c genome in brassica species
publisher Nature Publishing Group
series Horticulture Research
issn 2052-7276
publishDate 2019-07-01
description Abstract Cauliflower is an important variety of Brassica oleracea and is planted worldwide. Here, the high-quality genome sequence of cauliflower was reported. The assembled cauliflower genome was 584.60 Mb in size, with a contig N50 of 2.11 Mb, and contained 47,772 genes; 56.65% of the genome was composed of repetitive sequences. Among these sequences, long terminal repeats (LTRs) were the most abundant (32.71% of the genome), followed by transposable elements (TEs) (12.62%). Comparative genomic analysis confirmed that after an ancient paleohexaploidy (γ) event, cauliflower underwent two whole-genome duplication (WGD) events shared with Arabidopsis and an additional whole-genome triplication (WGT) event shared with other Brassica species. The present cultivated cauliflower diverged from the ancestral B. oleracea species ~3.0 million years ago (Mya). The speciation of cauliflower (~2.0 Mya) was later than that of B. oleracea L. var. capitata (approximately 2.6 Mya) and other Brassica species (over 2.0 Mya). Chromosome no. 03 of cauliflower shared the most syntenic blocks with the A, B, and C genomes of Brassica species and its eight other chromosomes, implying that chromosome no. 03 might be the most ancient one in the cauliflower genome, which was consistent with the chromosome being inherited from the common ancestor of Brassica species. In addition, 2,718 specific genes, 228 expanded genes, 2 contracted genes, and 1,065 positively selected genes in cauliflower were identified and functionally annotated. These findings provide new insights into the genomic diversity of Brassica species and serve as a valuable reference for molecular breeding of cauliflower.
url https://doi.org/10.1038/s41438-019-0164-0
work_keys_str_mv AT delingsun draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT chunguowang draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT xiaolizhang draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT wenlinzhang draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT hanminjiang draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT xingweiyao draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT lililiu draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT zhenghuawen draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT guobaoniu draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
AT xiaozhengshan draftgenomesequenceofcauliflowerbrassicaoleracealvarbotrytisprovidesnewinsightsintothecgenomeinbrassicaspecies
_version_ 1724397164130467840