Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing
<p>Abstract</p> <p>Background</p> <p>Soybean, <it>Glycine max </it>(L.) Merr., is a well documented paleopolyploid. What remains relatively under characterized is the level of sequence identity in retained homeologous regions of the genome. Recently, the Dep...
Main Authors: | , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2007-09-01
|
Series: | BMC Genomics |
Online Access: | http://www.biomedcentral.com/1471-2164/8/330 |
id |
doaj-e5327cfe666c43a38a08ff62ba4da5c6 |
---|---|
record_format |
Article |
spelling |
doaj-e5327cfe666c43a38a08ff62ba4da5c62020-11-25T00:51:32ZengBMCBMC Genomics1471-21642007-09-018133010.1186/1471-2164-8-330Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencingNelson Rex TRoe Bruce AO'Bleness MajestaYi JingDeshpande ShwetaVasylenko-Sanders Iryna FSchlueter Shannon DLin Jer-YoungSchlueter Jessica AScheffler Brian EJackson Scott AShoemaker Randy C<p>Abstract</p> <p>Background</p> <p>Soybean, <it>Glycine max </it>(L.) Merr., is a well documented paleopolyploid. What remains relatively under characterized is the level of sequence identity in retained homeologous regions of the genome. Recently, the Department of Energy Joint Genome Institute and United States Department of Agriculture jointly announced the sequencing of the soybean genome. One of the initial concerns is to what extent sequence identity in homeologous regions would have on whole genome shotgun sequence assembly.</p> <p>Results</p> <p>Seventeen BACs representing ~2.03 Mb were sequenced as representative potential homeologous regions from the soybean genome. Genetic mapping of each BAC shows that 11 of the 20 chromosomes are represented. Sequence comparisons between homeologous BACs shows that the soybean genome is a mosaic of retained paleopolyploid regions. Some regions appear to be highly conserved while other regions have diverged significantly. Large-scale "batch" reassembly of all 17 BACs combined showed that even the most homeologous BACs with upwards of 95% sequence identity resolve into their respective homeologous sequences. Potential assembly errors were generated by tandemly duplicated pentatricopeptide repeat containing genes and long simple sequence repeats. Analysis of a whole-genome shotgun assembly of 80,000 randomly chosen JGI-DOE sequence traces reveals some new soybean-specific repeat sequences.</p> <p>Conclusion</p> <p>This analysis investigated both the structure of the paleopolyploid soybean genome and the potential effects retained homeology will have on assembling the whole genome shotgun sequence. Based upon these results, homeologous regions similar to those characterized here will not cause major assembly issues.</p> http://www.biomedcentral.com/1471-2164/8/330 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nelson Rex T Roe Bruce A O'Bleness Majesta Yi Jing Deshpande Shweta Vasylenko-Sanders Iryna F Schlueter Shannon D Lin Jer-Young Schlueter Jessica A Scheffler Brian E Jackson Scott A Shoemaker Randy C |
spellingShingle |
Nelson Rex T Roe Bruce A O'Bleness Majesta Yi Jing Deshpande Shweta Vasylenko-Sanders Iryna F Schlueter Shannon D Lin Jer-Young Schlueter Jessica A Scheffler Brian E Jackson Scott A Shoemaker Randy C Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing BMC Genomics |
author_facet |
Nelson Rex T Roe Bruce A O'Bleness Majesta Yi Jing Deshpande Shweta Vasylenko-Sanders Iryna F Schlueter Shannon D Lin Jer-Young Schlueter Jessica A Scheffler Brian E Jackson Scott A Shoemaker Randy C |
author_sort |
Nelson Rex T |
title |
Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing |
title_short |
Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing |
title_full |
Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing |
title_fullStr |
Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing |
title_full_unstemmed |
Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing |
title_sort |
gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing |
publisher |
BMC |
series |
BMC Genomics |
issn |
1471-2164 |
publishDate |
2007-09-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Soybean, <it>Glycine max </it>(L.) Merr., is a well documented paleopolyploid. What remains relatively under characterized is the level of sequence identity in retained homeologous regions of the genome. Recently, the Department of Energy Joint Genome Institute and United States Department of Agriculture jointly announced the sequencing of the soybean genome. One of the initial concerns is to what extent sequence identity in homeologous regions would have on whole genome shotgun sequence assembly.</p> <p>Results</p> <p>Seventeen BACs representing ~2.03 Mb were sequenced as representative potential homeologous regions from the soybean genome. Genetic mapping of each BAC shows that 11 of the 20 chromosomes are represented. Sequence comparisons between homeologous BACs shows that the soybean genome is a mosaic of retained paleopolyploid regions. Some regions appear to be highly conserved while other regions have diverged significantly. Large-scale "batch" reassembly of all 17 BACs combined showed that even the most homeologous BACs with upwards of 95% sequence identity resolve into their respective homeologous sequences. Potential assembly errors were generated by tandemly duplicated pentatricopeptide repeat containing genes and long simple sequence repeats. Analysis of a whole-genome shotgun assembly of 80,000 randomly chosen JGI-DOE sequence traces reveals some new soybean-specific repeat sequences.</p> <p>Conclusion</p> <p>This analysis investigated both the structure of the paleopolyploid soybean genome and the potential effects retained homeology will have on assembling the whole genome shotgun sequence. Based upon these results, homeologous regions similar to those characterized here will not cause major assembly issues.</p> |
url |
http://www.biomedcentral.com/1471-2164/8/330 |
work_keys_str_mv |
AT nelsonrext geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT roebrucea geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT oblenessmajesta geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT yijing geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT deshpandeshweta geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT vasylenkosandersirynaf geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT schluetershannond geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT linjeryoung geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT schlueterjessicaa geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT schefflerbriane geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT jacksonscotta geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing AT shoemakerrandyc geneduplicationandpaleopolyploidyinsoybeanandtheimplicationsforwholegenomesequencing |
_version_ |
1725245305291538432 |