Comparative Genomics Using the Colored de Bruijn Graph

Comparing genomes in a computationally efficient manner is a difficult problem. Methods that provide the highest resolution are too inefficient and methods that are efficient are too low resolution. In this thesis, we show that the Colored de Bruijn Graph (CdBG) is a suitable method for comparing ge...

Full description

Bibliographic Details
Main Author: Lyman, Cole Andrew
Format: Others
Published: BYU ScholarsArchive 2020
Subjects:
Online Access:https://scholarsarchive.byu.edu/etd/8441
https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=9441&context=etd
Description
Summary:Comparing genomes in a computationally efficient manner is a difficult problem. Methods that provide the highest resolution are too inefficient and methods that are efficient are too low resolution. In this thesis, we show that the Colored de Bruijn Graph (CdBG) is a suitable method for comparing genomes because it is efficient while maintaining a useful amount of resolution. To illustrate the usefulness of the CdBG, the phylogenetic tree for 12 species in the Drosophila genus is reconstructed using pseudo-homologous regions of the genome contained in the CdBG.