Short-read Chromosome Level Genome Assembly of Digitaria exilis

Genomics has become an important tool in agriculture. Many modern crop breeding approaches such as genomic selection and genome editing require detailed information of the genomic composition of a crop species. However, the assembly of high-quality genome sequences is prone to technical artifacts th...

Full description

Bibliographic Details
Main Author: Gapa, Liubov
Other Authors: Krattinger, Simon G.
Language:en
Published: 2019
Subjects:
Online Access:Gapa, L. (2019). Short-read Chromosome Level Genome Assembly of Digitaria exilis. KAUST Research Repository. https://doi.org/10.25781/KAUST-0NS7F
http://hdl.handle.net/10754/660202
id ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-660202
record_format oai_dc
spelling ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-6602022021-02-20T05:10:56Z Short-read Chromosome Level Genome Assembly of Digitaria exilis Gapa, Liubov Krattinger, Simon G. Biological and Environmental Sciences and Engineering (BESE) Division Aranda, Manuel Zuccolo, Andrea Genome assembly Plant-genomics Open-source Genomics has become an important tool in agriculture. Many modern crop breeding approaches such as genomic selection and genome editing require detailed information of the genomic composition of a crop species. However, the assembly of high-quality genome sequences is prone to technical artifacts that arise from inaccuracies in the sequencing technology and assembly algorithms. This is particularly true for the genomes of cereal crops, which are often very large, repeat-rich, and polyploid. Until recently, the highly continuous assembly of such cereal crop genomes from short-read data was mainly possible with proprietary assembly tools. In this work, we combined data generated with several short-read sequencing protocols and genomics technologies, including paired-end and mate-pair reads with multiple insert sizes, 10X linked reads, Hi-C contacts, and optical maps to assemble a chromosome level reference genome of Digitaria exilis (fonio millet) with open-source tools. Fonio millet is a semi-domesticated cereal orphan crop native to West Africa that has a high potential for desert agriculture. We implemented the TRITEX pipeline - a recently developed open-source pipeline for the assembly of large Triticeae genomes. We modified the pipeline to include 10X and Hi-C reads into the assembly process independently. We then compared the TRITEX assembly to the fonio reference genome, which had previously been assembled from the same input data but using proprietary algorithms. We found the two assemblies highly similar in content with high concordance in the local order (0.91 Pearson coefficient for alignments). However, we detected many small putative discrepancies between the two assemblies. While the TRITEX assembly was able to produce a highly continuous genome assembly, further work is needed to characterize the putative discrepancies in more detail. 2019-11-24T11:59:08Z 2019-11-24T11:59:08Z 2019-11 Thesis Gapa, L. (2019). Short-read Chromosome Level Genome Assembly of Digitaria exilis. KAUST Research Repository. https://doi.org/10.25781/KAUST-0NS7F 10.25781/KAUST-0NS7F http://hdl.handle.net/10754/660202 en 2020-11-24 At the time of archiving, the student author of this thesis opted to temporarily restrict access to it. The full text of this thesis will become available to the public after the expiration of the embargo on 2020-11-24.
collection NDLTD
language en
sources NDLTD
topic Genome assembly
Plant-genomics
Open-source
spellingShingle Genome assembly
Plant-genomics
Open-source
Gapa, Liubov
Short-read Chromosome Level Genome Assembly of Digitaria exilis
description Genomics has become an important tool in agriculture. Many modern crop breeding approaches such as genomic selection and genome editing require detailed information of the genomic composition of a crop species. However, the assembly of high-quality genome sequences is prone to technical artifacts that arise from inaccuracies in the sequencing technology and assembly algorithms. This is particularly true for the genomes of cereal crops, which are often very large, repeat-rich, and polyploid. Until recently, the highly continuous assembly of such cereal crop genomes from short-read data was mainly possible with proprietary assembly tools. In this work, we combined data generated with several short-read sequencing protocols and genomics technologies, including paired-end and mate-pair reads with multiple insert sizes, 10X linked reads, Hi-C contacts, and optical maps to assemble a chromosome level reference genome of Digitaria exilis (fonio millet) with open-source tools. Fonio millet is a semi-domesticated cereal orphan crop native to West Africa that has a high potential for desert agriculture. We implemented the TRITEX pipeline - a recently developed open-source pipeline for the assembly of large Triticeae genomes. We modified the pipeline to include 10X and Hi-C reads into the assembly process independently. We then compared the TRITEX assembly to the fonio reference genome, which had previously been assembled from the same input data but using proprietary algorithms. We found the two assemblies highly similar in content with high concordance in the local order (0.91 Pearson coefficient for alignments). However, we detected many small putative discrepancies between the two assemblies. While the TRITEX assembly was able to produce a highly continuous genome assembly, further work is needed to characterize the putative discrepancies in more detail.
author2 Krattinger, Simon G.
author_facet Krattinger, Simon G.
Gapa, Liubov
author Gapa, Liubov
author_sort Gapa, Liubov
title Short-read Chromosome Level Genome Assembly of Digitaria exilis
title_short Short-read Chromosome Level Genome Assembly of Digitaria exilis
title_full Short-read Chromosome Level Genome Assembly of Digitaria exilis
title_fullStr Short-read Chromosome Level Genome Assembly of Digitaria exilis
title_full_unstemmed Short-read Chromosome Level Genome Assembly of Digitaria exilis
title_sort short-read chromosome level genome assembly of digitaria exilis
publishDate 2019
url Gapa, L. (2019). Short-read Chromosome Level Genome Assembly of Digitaria exilis. KAUST Research Repository. https://doi.org/10.25781/KAUST-0NS7F
http://hdl.handle.net/10754/660202
work_keys_str_mv AT gapaliubov shortreadchromosomelevelgenomeassemblyofdigitariaexilis
_version_ 1719378116076371968