Genome Polishing of Nanopore-Only Assembly Using Coding Sequences
碩士 === 國立中正大學 === 資訊工程研究所 === 107 === Third-generation sequencing (TGS) produces longer reads, unbiased coverage, and faster sequencing speed in comparison with next-generation sequencing (NGS). Genome assembly based on TGS can usually assemble complete genome but its accuracy is lower than NGS asse...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/9r9vu4 |
id |
ndltd-TW-107CCU00392075 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107CCU003920752019-09-03T03:43:16Z http://ndltd.ncl.edu.tw/handle/9r9vu4 Genome Polishing of Nanopore-Only Assembly Using Coding Sequences 以編碼區序列對牛津奈米孔定序序列進行錯誤校正 HUNG, YU-KAI 洪裕凱 碩士 國立中正大學 資訊工程研究所 107 Third-generation sequencing (TGS) produces longer reads, unbiased coverage, and faster sequencing speed in comparison with next-generation sequencing (NGS). Genome assembly based on TGS can usually assemble complete genome but its accuracy is lower than NGS assembly due to high error rate. Although hybrid NGS and TGS can generate complete and high-quality genomes, the sequencing cost is too high to be practical. This thesis aims to polish Oxford Nanopore (ONT)-assembled genomes using TGS reads only. By using the conservation of coding sequence (CDS) across bacterial species, we develop a suite of polishing methods for correcting indel errors leading to frameshift. The polished genome can achieve 97-100\% completeness, which significantly outperform previous polishing methods (65-90\%). The remaining uncorrected errors are mainly due to incompleteness of CDS in the current database, which can be improved as more sequencing data are collected in the future. Huang, Yao-Ting 黃耀廷 2019 學位論文 ; thesis 34 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中正大學 === 資訊工程研究所 === 107 === Third-generation sequencing (TGS) produces longer reads, unbiased coverage, and faster sequencing speed in comparison with next-generation sequencing (NGS). Genome assembly based on TGS can usually assemble complete genome but its accuracy is lower than NGS assembly due to high error rate. Although hybrid NGS and TGS can generate complete and high-quality genomes, the sequencing cost is too high to be practical. This thesis aims to polish Oxford Nanopore (ONT)-assembled genomes using TGS reads only. By using the conservation of coding sequence (CDS) across bacterial species, we develop a suite of polishing methods for correcting indel errors leading to frameshift. The polished genome can achieve 97-100\% completeness, which significantly outperform previous polishing methods (65-90\%). The remaining uncorrected errors are mainly due to incompleteness of CDS in the current database, which can be improved as more sequencing data are collected in the future.
|
author2 |
Huang, Yao-Ting |
author_facet |
Huang, Yao-Ting HUNG, YU-KAI 洪裕凱 |
author |
HUNG, YU-KAI 洪裕凱 |
spellingShingle |
HUNG, YU-KAI 洪裕凱 Genome Polishing of Nanopore-Only Assembly Using Coding Sequences |
author_sort |
HUNG, YU-KAI |
title |
Genome Polishing of Nanopore-Only Assembly Using Coding Sequences |
title_short |
Genome Polishing of Nanopore-Only Assembly Using Coding Sequences |
title_full |
Genome Polishing of Nanopore-Only Assembly Using Coding Sequences |
title_fullStr |
Genome Polishing of Nanopore-Only Assembly Using Coding Sequences |
title_full_unstemmed |
Genome Polishing of Nanopore-Only Assembly Using Coding Sequences |
title_sort |
genome polishing of nanopore-only assembly using coding sequences |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/9r9vu4 |
work_keys_str_mv |
AT hungyukai genomepolishingofnanoporeonlyassemblyusingcodingsequences AT hóngyùkǎi genomepolishingofnanoporeonlyassemblyusingcodingsequences AT hungyukai yǐbiānmǎqūxùlièduìniújīnnàimǐkǒngdìngxùxùlièjìnxíngcuòwùxiàozhèng AT hóngyùkǎi yǐbiānmǎqūxùlièduìniújīnnàimǐkǒngdìngxùxùlièjìnxíngcuòwùxiàozhèng |
_version_ |
1719242509734903808 |