Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.

A good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM,...

Full description

Bibliographic Details
Main Authors: Yonglong Pan, Xiaoming Wang, Lin Liu, Hao Wang, Meizhong Luo
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5017645?pdf=render
id doaj-61971589f4104979b1ce3683756dbb8b
record_format Article
spelling doaj-61971589f4104979b1ce3683756dbb8b2020-11-25T01:55:53ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01119e016158310.1371/journal.pone.0161583Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.Yonglong PanXiaoming WangLin LiuHao WangMeizhong LuoA good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion. Method model was created and parameters were inspected by simulations using the Arabidopsis genome sequence. In the simulations, when ~4.8X genome BAC library including 4,096 clones was used to sequence the whole genome, ~90% of clones were successfully connected to physical contigs, and 91.58% of genome sequences were mapped and connected to chromosomes. This method was experimentally verified using the existing physical map and genome sequence of rice. Of 4,064 clones covering 115 Mb sequence selected from ~3 tiles of 3 chromosomes of a rice draft physical map, 3,364 clones were reconstructed into physical contigs and 98 Mb sequences were integrated into the 3 chromosomes. The physical map-integrated draft genome sequences can provide permanent frameworks for eventually obtaining high-quality reference sequences by targeted sequencing, gap filling and combining other sequences.http://europepmc.org/articles/PMC5017645?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Yonglong Pan
Xiaoming Wang
Lin Liu
Hao Wang
Meizhong Luo
spellingShingle Yonglong Pan
Xiaoming Wang
Lin Liu
Hao Wang
Meizhong Luo
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
PLoS ONE
author_facet Yonglong Pan
Xiaoming Wang
Lin Liu
Hao Wang
Meizhong Luo
author_sort Yonglong Pan
title Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
title_short Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
title_full Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
title_fullStr Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
title_full_unstemmed Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
title_sort whole genome mapping with feature sets from high-throughput sequencing data.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2016-01-01
description A good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion. Method model was created and parameters were inspected by simulations using the Arabidopsis genome sequence. In the simulations, when ~4.8X genome BAC library including 4,096 clones was used to sequence the whole genome, ~90% of clones were successfully connected to physical contigs, and 91.58% of genome sequences were mapped and connected to chromosomes. This method was experimentally verified using the existing physical map and genome sequence of rice. Of 4,064 clones covering 115 Mb sequence selected from ~3 tiles of 3 chromosomes of a rice draft physical map, 3,364 clones were reconstructed into physical contigs and 98 Mb sequences were integrated into the 3 chromosomes. The physical map-integrated draft genome sequences can provide permanent frameworks for eventually obtaining high-quality reference sequences by targeted sequencing, gap filling and combining other sequences.
url http://europepmc.org/articles/PMC5017645?pdf=render
work_keys_str_mv AT yonglongpan wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata
AT xiaomingwang wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata
AT linliu wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata
AT haowang wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata
AT meizhongluo wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata
_version_ 1724982762814832640