Genome Polishing of Nanopore-Only Assembly Using Coding Sequences

碩士 === 國立中正大學 === 資訊工程研究所 === 107 === Third-generation sequencing (TGS) produces longer reads, unbiased coverage, and faster sequencing speed in comparison with next-generation sequencing (NGS). Genome assembly based on TGS can usually assemble complete genome but its accuracy is lower than NGS asse...

Full description

Bibliographic Details
Main Authors: HUNG, YU-KAI, 洪裕凱
Other Authors: Huang, Yao-Ting
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/9r9vu4
id ndltd-TW-107CCU00392075
record_format oai_dc
spelling ndltd-TW-107CCU003920752019-09-03T03:43:16Z http://ndltd.ncl.edu.tw/handle/9r9vu4 Genome Polishing of Nanopore-Only Assembly Using Coding Sequences 以編碼區序列對牛津奈米孔定序序列進行錯誤校正 HUNG, YU-KAI 洪裕凱 碩士 國立中正大學 資訊工程研究所 107 Third-generation sequencing (TGS) produces longer reads, unbiased coverage, and faster sequencing speed in comparison with next-generation sequencing (NGS). Genome assembly based on TGS can usually assemble complete genome but its accuracy is lower than NGS assembly due to high error rate. Although hybrid NGS and TGS can generate complete and high-quality genomes, the sequencing cost is too high to be practical. This thesis aims to polish Oxford Nanopore (ONT)-assembled genomes using TGS reads only. By using the conservation of coding sequence (CDS) across bacterial species, we develop a suite of polishing methods for correcting indel errors leading to frameshift. The polished genome can achieve 97-100\% completeness, which significantly outperform previous polishing methods (65-90\%). The remaining uncorrected errors are mainly due to incompleteness of CDS in the current database, which can be improved as more sequencing data are collected in the future. Huang, Yao-Ting 黃耀廷 2019 學位論文 ; thesis 34 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊工程研究所 === 107 === Third-generation sequencing (TGS) produces longer reads, unbiased coverage, and faster sequencing speed in comparison with next-generation sequencing (NGS). Genome assembly based on TGS can usually assemble complete genome but its accuracy is lower than NGS assembly due to high error rate. Although hybrid NGS and TGS can generate complete and high-quality genomes, the sequencing cost is too high to be practical. This thesis aims to polish Oxford Nanopore (ONT)-assembled genomes using TGS reads only. By using the conservation of coding sequence (CDS) across bacterial species, we develop a suite of polishing methods for correcting indel errors leading to frameshift. The polished genome can achieve 97-100\% completeness, which significantly outperform previous polishing methods (65-90\%). The remaining uncorrected errors are mainly due to incompleteness of CDS in the current database, which can be improved as more sequencing data are collected in the future.
author2 Huang, Yao-Ting
author_facet Huang, Yao-Ting
HUNG, YU-KAI
洪裕凱
author HUNG, YU-KAI
洪裕凱
spellingShingle HUNG, YU-KAI
洪裕凱
Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
author_sort HUNG, YU-KAI
title Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
title_short Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
title_full Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
title_fullStr Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
title_full_unstemmed Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
title_sort genome polishing of nanopore-only assembly using coding sequences
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/9r9vu4
work_keys_str_mv AT hungyukai genomepolishingofnanoporeonlyassemblyusingcodingsequences
AT hóngyùkǎi genomepolishingofnanoporeonlyassemblyusingcodingsequences
AT hungyukai yǐbiānmǎqūxùlièduìniújīnnàimǐkǒngdìngxùxùlièjìnxíngcuòwùxiàozhèng
AT hóngyùkǎi yǐbiānmǎqūxùlièduìniújīnnàimǐkǒngdìngxùxùlièjìnxíngcuòwùxiàozhèng
_version_ 1719242509734903808