Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.
A good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM,...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2016-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5017645?pdf=render |
id |
doaj-61971589f4104979b1ce3683756dbb8b |
---|---|
record_format |
Article |
spelling |
doaj-61971589f4104979b1ce3683756dbb8b2020-11-25T01:55:53ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01119e016158310.1371/journal.pone.0161583Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.Yonglong PanXiaoming WangLin LiuHao WangMeizhong LuoA good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion. Method model was created and parameters were inspected by simulations using the Arabidopsis genome sequence. In the simulations, when ~4.8X genome BAC library including 4,096 clones was used to sequence the whole genome, ~90% of clones were successfully connected to physical contigs, and 91.58% of genome sequences were mapped and connected to chromosomes. This method was experimentally verified using the existing physical map and genome sequence of rice. Of 4,064 clones covering 115 Mb sequence selected from ~3 tiles of 3 chromosomes of a rice draft physical map, 3,364 clones were reconstructed into physical contigs and 98 Mb sequences were integrated into the 3 chromosomes. The physical map-integrated draft genome sequences can provide permanent frameworks for eventually obtaining high-quality reference sequences by targeted sequencing, gap filling and combining other sequences.http://europepmc.org/articles/PMC5017645?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yonglong Pan Xiaoming Wang Lin Liu Hao Wang Meizhong Luo |
spellingShingle |
Yonglong Pan Xiaoming Wang Lin Liu Hao Wang Meizhong Luo Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. PLoS ONE |
author_facet |
Yonglong Pan Xiaoming Wang Lin Liu Hao Wang Meizhong Luo |
author_sort |
Yonglong Pan |
title |
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. |
title_short |
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. |
title_full |
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. |
title_fullStr |
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. |
title_full_unstemmed |
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data. |
title_sort |
whole genome mapping with feature sets from high-throughput sequencing data. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2016-01-01 |
description |
A good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion. Method model was created and parameters were inspected by simulations using the Arabidopsis genome sequence. In the simulations, when ~4.8X genome BAC library including 4,096 clones was used to sequence the whole genome, ~90% of clones were successfully connected to physical contigs, and 91.58% of genome sequences were mapped and connected to chromosomes. This method was experimentally verified using the existing physical map and genome sequence of rice. Of 4,064 clones covering 115 Mb sequence selected from ~3 tiles of 3 chromosomes of a rice draft physical map, 3,364 clones were reconstructed into physical contigs and 98 Mb sequences were integrated into the 3 chromosomes. The physical map-integrated draft genome sequences can provide permanent frameworks for eventually obtaining high-quality reference sequences by targeted sequencing, gap filling and combining other sequences. |
url |
http://europepmc.org/articles/PMC5017645?pdf=render |
work_keys_str_mv |
AT yonglongpan wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata AT xiaomingwang wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata AT linliu wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata AT haowang wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata AT meizhongluo wholegenomemappingwithfeaturesetsfromhighthroughputsequencingdata |
_version_ |
1724982762814832640 |