Modern technologies and algorithms for scaffolding assembled genomes.

The computational reconstruction of genome sequences from shotgun sequencing data has been greatly simplified by the advent of sequencing technologies that generate long reads. In the case of relatively small genomes (e.g., bacterial or viral), complete genome sequences can frequently be reconstruct...

Full description

Bibliographic Details
Main Authors: Jay Ghurye, Mihai Pop
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-06-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1006994
id doaj-595c1ebf6da544959505c8a5fadd177b
record_format Article
spelling doaj-595c1ebf6da544959505c8a5fadd177b2021-04-21T15:38:27ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582019-06-01156e100699410.1371/journal.pcbi.1006994Modern technologies and algorithms for scaffolding assembled genomes.Jay GhuryeMihai PopThe computational reconstruction of genome sequences from shotgun sequencing data has been greatly simplified by the advent of sequencing technologies that generate long reads. In the case of relatively small genomes (e.g., bacterial or viral), complete genome sequences can frequently be reconstructed computationally without the need for further experiments. However, large and complex genomes, such as those of most animals and plants, continue to pose significant challenges. In such genomes, assembly software produces incomplete and fragmented reconstructions that require additional experimentally derived information and manual intervention in order to reconstruct individual chromosome arms. Recent technologies originally designed to capture chromatin structure have been shown to effectively complement sequencing data, leading to much more contiguous reconstructions of genomes than previously possible. Here, we survey these technologies and the algorithms used to assemble and analyze large eukaryotic genomes, placed within the historical context of genome scaffolding technologies that have been in existence since the dawn of the genomic era.https://doi.org/10.1371/journal.pcbi.1006994
collection DOAJ
language English
format Article
sources DOAJ
author Jay Ghurye
Mihai Pop
spellingShingle Jay Ghurye
Mihai Pop
Modern technologies and algorithms for scaffolding assembled genomes.
PLoS Computational Biology
author_facet Jay Ghurye
Mihai Pop
author_sort Jay Ghurye
title Modern technologies and algorithms for scaffolding assembled genomes.
title_short Modern technologies and algorithms for scaffolding assembled genomes.
title_full Modern technologies and algorithms for scaffolding assembled genomes.
title_fullStr Modern technologies and algorithms for scaffolding assembled genomes.
title_full_unstemmed Modern technologies and algorithms for scaffolding assembled genomes.
title_sort modern technologies and algorithms for scaffolding assembled genomes.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2019-06-01
description The computational reconstruction of genome sequences from shotgun sequencing data has been greatly simplified by the advent of sequencing technologies that generate long reads. In the case of relatively small genomes (e.g., bacterial or viral), complete genome sequences can frequently be reconstructed computationally without the need for further experiments. However, large and complex genomes, such as those of most animals and plants, continue to pose significant challenges. In such genomes, assembly software produces incomplete and fragmented reconstructions that require additional experimentally derived information and manual intervention in order to reconstruct individual chromosome arms. Recent technologies originally designed to capture chromatin structure have been shown to effectively complement sequencing data, leading to much more contiguous reconstructions of genomes than previously possible. Here, we survey these technologies and the algorithms used to assemble and analyze large eukaryotic genomes, placed within the historical context of genome scaffolding technologies that have been in existence since the dawn of the genomic era.
url https://doi.org/10.1371/journal.pcbi.1006994
work_keys_str_mv AT jayghurye moderntechnologiesandalgorithmsforscaffoldingassembledgenomes
AT mihaipop moderntechnologiesandalgorithmsforscaffoldingassembledgenomes
_version_ 1714667136971964416