Genome Reassembler for Low-coverage Sequencing Regions

碩士 === 國立中正大學 === 資訊工程研究所 === 105 === Third-generation sequencing (TGS) can produce much longer reads within shorter turnaround time, which is becoming the preferred choice for de novo genome assembly. Unfortunately, most large-genome sequencing projects are not affordable of sufficient sequencing d...

Full description

Bibliographic Details
Main Authors: Lin, Jyun-Hong, 林駿宏
Other Authors: Huang, Yao-Ting
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/eg2d86
id ndltd-TW-105CCU00392077
record_format oai_dc
spelling ndltd-TW-105CCU003920772019-05-15T23:32:16Z http://ndltd.ncl.edu.tw/handle/eg2d86 Genome Reassembler for Low-coverage Sequencing Regions 針對低覆蓋率定序區域之基因體重組軟體 Lin, Jyun-Hong 林駿宏 碩士 國立中正大學 資訊工程研究所 105 Third-generation sequencing (TGS) can produce much longer reads within shorter turnaround time, which is becoming the preferred choice for de novo genome assembly. Unfortunately, most large-genome sequencing projects are not affordable of sufficient sequencing depth required by existing TGS assemblers. Even the desired sequencing depth is achieved, the coverage across the entire genome is still uneven. This thesis designs and implements a genome re-assembler which aims to improve existing TGS assemblers by recovering missing overlap and performing re-assembly in low-coverage regions. A novel dimensional-reduction technique is developed for efficient overlap computation tailored for low-coverage regions. Experimental results indicated that our method improves assembly of Canu, Falcon, and Miniasm under low-sequencing depth ($\le$ 30x). For large genomes, the results of low- and high-sequencing depth are both improved probably owing to larger coverage variance. Huang, Yao-Ting 黃耀廷 2017 學位論文 ; thesis 35 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊工程研究所 === 105 === Third-generation sequencing (TGS) can produce much longer reads within shorter turnaround time, which is becoming the preferred choice for de novo genome assembly. Unfortunately, most large-genome sequencing projects are not affordable of sufficient sequencing depth required by existing TGS assemblers. Even the desired sequencing depth is achieved, the coverage across the entire genome is still uneven. This thesis designs and implements a genome re-assembler which aims to improve existing TGS assemblers by recovering missing overlap and performing re-assembly in low-coverage regions. A novel dimensional-reduction technique is developed for efficient overlap computation tailored for low-coverage regions. Experimental results indicated that our method improves assembly of Canu, Falcon, and Miniasm under low-sequencing depth ($\le$ 30x). For large genomes, the results of low- and high-sequencing depth are both improved probably owing to larger coverage variance.
author2 Huang, Yao-Ting
author_facet Huang, Yao-Ting
Lin, Jyun-Hong
林駿宏
author Lin, Jyun-Hong
林駿宏
spellingShingle Lin, Jyun-Hong
林駿宏
Genome Reassembler for Low-coverage Sequencing Regions
author_sort Lin, Jyun-Hong
title Genome Reassembler for Low-coverage Sequencing Regions
title_short Genome Reassembler for Low-coverage Sequencing Regions
title_full Genome Reassembler for Low-coverage Sequencing Regions
title_fullStr Genome Reassembler for Low-coverage Sequencing Regions
title_full_unstemmed Genome Reassembler for Low-coverage Sequencing Regions
title_sort genome reassembler for low-coverage sequencing regions
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/eg2d86
work_keys_str_mv AT linjyunhong genomereassemblerforlowcoveragesequencingregions
AT línjùnhóng genomereassemblerforlowcoveragesequencingregions
AT linjyunhong zhēnduìdīfùgàilǜdìngxùqūyùzhījīyīntǐzhòngzǔruǎntǐ
AT línjùnhóng zhēnduìdīfùgàilǜdìngxùqūyùzhījīyīntǐzhòngzǔruǎntǐ
_version_ 1719148561701011456