An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we presen...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
The Royal Society
2018-01-01
|
Series: | Royal Society Open Science |
Subjects: | |
Online Access: | https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171589 |
id |
doaj-f033b580999c454ab53bcb0f78de6740 |
---|---|
record_format |
Article |
spelling |
doaj-f033b580999c454ab53bcb0f78de67402020-11-25T04:06:38ZengThe Royal SocietyRoyal Society Open Science2054-57032018-01-015210.1098/rsos.171589171589An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individualsYu-Long LiDong-Xiu XueBai-Dong ZhangJin-Xian LiuRestriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171589overlapping paired-end sequencingrad-seqoptimized assemblypipeline softwareoptimal clustering |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yu-Long Li Dong-Xiu Xue Bai-Dong Zhang Jin-Xian Liu |
spellingShingle |
Yu-Long Li Dong-Xiu Xue Bai-Dong Zhang Jin-Xian Liu An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals Royal Society Open Science overlapping paired-end sequencing rad-seq optimized assembly pipeline software optimal clustering |
author_facet |
Yu-Long Li Dong-Xiu Xue Bai-Dong Zhang Jin-Xian Liu |
author_sort |
Yu-Long Li |
title |
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals |
title_short |
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals |
title_full |
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals |
title_fullStr |
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals |
title_full_unstemmed |
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals |
title_sort |
optimized approach for local de novo assembly of overlapping paired-end rad reads from multiple individuals |
publisher |
The Royal Society |
series |
Royal Society Open Science |
issn |
2054-5703 |
publishDate |
2018-01-01 |
description |
Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts. |
topic |
overlapping paired-end sequencing rad-seq optimized assembly pipeline software optimal clustering |
url |
https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171589 |
work_keys_str_mv |
AT yulongli anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT dongxiuxue anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT baidongzhang anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT jinxianliu anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT yulongli optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT dongxiuxue optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT baidongzhang optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals AT jinxianliu optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals |
_version_ |
1724431290315309056 |