An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals

Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we presen...

Full description

Bibliographic Details
Main Authors: Yu-Long Li, Dong-Xiu Xue, Bai-Dong Zhang, Jin-Xian Liu
Format: Article
Language:English
Published: The Royal Society 2018-01-01
Series:Royal Society Open Science
Subjects:
Online Access:https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171589
id doaj-f033b580999c454ab53bcb0f78de6740
record_format Article
spelling doaj-f033b580999c454ab53bcb0f78de67402020-11-25T04:06:38ZengThe Royal SocietyRoyal Society Open Science2054-57032018-01-015210.1098/rsos.171589171589An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individualsYu-Long LiDong-Xiu XueBai-Dong ZhangJin-Xian LiuRestriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171589overlapping paired-end sequencingrad-seqoptimized assemblypipeline softwareoptimal clustering
collection DOAJ
language English
format Article
sources DOAJ
author Yu-Long Li
Dong-Xiu Xue
Bai-Dong Zhang
Jin-Xian Liu
spellingShingle Yu-Long Li
Dong-Xiu Xue
Bai-Dong Zhang
Jin-Xian Liu
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
Royal Society Open Science
overlapping paired-end sequencing
rad-seq
optimized assembly
pipeline software
optimal clustering
author_facet Yu-Long Li
Dong-Xiu Xue
Bai-Dong Zhang
Jin-Xian Liu
author_sort Yu-Long Li
title An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_short An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_full An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_fullStr An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_full_unstemmed An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_sort optimized approach for local de novo assembly of overlapping paired-end rad reads from multiple individuals
publisher The Royal Society
series Royal Society Open Science
issn 2054-5703
publishDate 2018-01-01
description Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.
topic overlapping paired-end sequencing
rad-seq
optimized assembly
pipeline software
optimal clustering
url https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171589
work_keys_str_mv AT yulongli anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT dongxiuxue anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT baidongzhang anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT jinxianliu anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT yulongli optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT dongxiuxue optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT baidongzhang optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT jinxianliu optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
_version_ 1724431290315309056