Summary: | Rehmannia glutinosa is an important medicinal plant in traditional Chinese medicine. Its root is rich in bioactive molecules, mainly iridoids, with important pharmaceutical properties. However, the molecular biology and evolution of R. glutinosa remain largely unexplored. Here, we report a reference genome of R. glutinosa generated using Nanopore, Illumina and Hi-C sequencing. The assembled genome is 2.49 Gb long, with a scaffold N50 of 70 Mb and high heterozygosity (2%). Because R. glutinosa is an autotetraploid (4n = 56), its chromosome sets differ very little and are difficult to distinguish using Hi-C. Hence, only one chromosome set was anchored at the chromosome level. Scaffolds covering 52.61% of the assembled genome were anchored on 14 pseudochromosomes. Over 67% of the genome consists of repetitive sequences, dominated by Copia long terminal repeats, and 48,475 protein-coding genes were predicted. Phylogenetic analysis corroborates the placement of R. glutinosa in the Orobanchaceae family. Our results indicate an independent and very recent whole-genome duplication event that occurred 3.64 million years ago in the R. glutinosa lineage. Comparative genomic analysis demonstrated expansion of the UDP-dependent glycosyltransferase and terpene synthase gene families, which are known to be involved in terpenoid biosynthesis and diversification. Furthermore, this work clarifies the molecular biosynthetic pathway of iridoids. Collectively, the reference genome of R. glutinosa will facilitate the discovery and development of important pharmacological compounds.
|