Phylogenetic comparative assembly

<p>Abstract</p> <p>Background</p> <p>Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several con...

Full description

Bibliographic Details
Main Authors: Husemann Peter, Stoye Jens
Format: Article
Language:English
Published: BMC 2010-01-01
Series:Algorithms for Molecular Biology
Online Access:http://www.almob.org/content/5/1/3
id doaj-195628aac3594d8ca262050874888649
record_format Article
spelling doaj-195628aac3594d8ca2620508748886492020-11-25T00:18:34ZengBMCAlgorithms for Molecular Biology1748-71882010-01-0151310.1186/1748-7188-5-3Phylogenetic comparative assemblyHusemann PeterStoye Jens<p>Abstract</p> <p>Background</p> <p>Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several contigs remain which cannot be assembled any further. It is still costly and time consuming to close all the gaps in order to acquire the whole genomic sequence.</p> <p>Results</p> <p>Here we propose an algorithm that takes several related genomes and their phylogenetic relationships into account to create a graph that contains the likelihood for each pair of contigs to be adjacent.</p> <p>Subsequently, this graph can be used to compute a layout graph that shows the most promising contig adjacencies in order to aid biologists in finishing the complete genomic sequence. The layout graph shows unique contig orderings where possible, and the best alternatives where necessary.</p> <p>Conclusions</p> <p>Our new algorithm for contig ordering uses sequence similarity as well as phylogenetic information to estimate adjacencies of contigs. An evaluation of our implementation shows that it performs better than recent approaches while being much faster at the same time.</p> http://www.almob.org/content/5/1/3
collection DOAJ
language English
format Article
sources DOAJ
author Husemann Peter
Stoye Jens
spellingShingle Husemann Peter
Stoye Jens
Phylogenetic comparative assembly
Algorithms for Molecular Biology
author_facet Husemann Peter
Stoye Jens
author_sort Husemann Peter
title Phylogenetic comparative assembly
title_short Phylogenetic comparative assembly
title_full Phylogenetic comparative assembly
title_fullStr Phylogenetic comparative assembly
title_full_unstemmed Phylogenetic comparative assembly
title_sort phylogenetic comparative assembly
publisher BMC
series Algorithms for Molecular Biology
issn 1748-7188
publishDate 2010-01-01
description <p>Abstract</p> <p>Background</p> <p>Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several contigs remain which cannot be assembled any further. It is still costly and time consuming to close all the gaps in order to acquire the whole genomic sequence.</p> <p>Results</p> <p>Here we propose an algorithm that takes several related genomes and their phylogenetic relationships into account to create a graph that contains the likelihood for each pair of contigs to be adjacent.</p> <p>Subsequently, this graph can be used to compute a layout graph that shows the most promising contig adjacencies in order to aid biologists in finishing the complete genomic sequence. The layout graph shows unique contig orderings where possible, and the best alternatives where necessary.</p> <p>Conclusions</p> <p>Our new algorithm for contig ordering uses sequence similarity as well as phylogenetic information to estimate adjacencies of contigs. An evaluation of our implementation shows that it performs better than recent approaches while being much faster at the same time.</p>
url http://www.almob.org/content/5/1/3
work_keys_str_mv AT husemannpeter phylogeneticcomparativeassembly
AT stoyejens phylogeneticcomparativeassembly
_version_ 1725375720337702912