Quality of Base Calling for NextGeneration Sequence

碩士 === 國立成功大學 === 統計學系 === 103 === There are four different bases, adenine (A), guanine (G), cytosine (C), and thymine (T), making up DNA. Next Generation Sequencing is a new technique allowing to sequence DNA much more quickly and cheaply than the previously used Sanger sequencing. However, issue o...

Full description

Bibliographic Details
Main Authors: Yi-ChunLin, 林怡君
Other Authors: Shih-Huang Chan
Format: Others
Language:en_US
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/62028518932776264013
id ndltd-TW-103NCKU5337010
record_format oai_dc
spelling ndltd-TW-103NCKU53370102016-05-22T04:40:56Z http://ndltd.ncl.edu.tw/handle/62028518932776264013 Quality of Base Calling for NextGeneration Sequence 次世代基因定序之品質 Yi-ChunLin 林怡君 碩士 國立成功大學 統計學系 103 There are four different bases, adenine (A), guanine (G), cytosine (C), and thymine (T), making up DNA. Next Generation Sequencing is a new technique allowing to sequence DNA much more quickly and cheaply than the previously used Sanger sequencing. However, issue on quality of sequencing, although plays an important role in coding the DNA, does not receive much attention academically and practically. Illumina, one of the famous companies, claims that the positions of bases follow the same sequels vertically after the fourth cycle. However, several works have been conducted and proved that the positions of base calling are not fixed. See Li (2012), Shao (2013) and Lin (2014). The above authors basically used a specific region to prove that there does exist a shift in base position. In the thesis, we apply machine learning technique called eigenface recognition and principal component scores to represent the overall behavior of cycles, and find the relationship between shift and cycles using the coefficients of eigenfaces. Shih-Huang Chan 詹世煌 2015 學位論文 ; thesis 27 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 統計學系 === 103 === There are four different bases, adenine (A), guanine (G), cytosine (C), and thymine (T), making up DNA. Next Generation Sequencing is a new technique allowing to sequence DNA much more quickly and cheaply than the previously used Sanger sequencing. However, issue on quality of sequencing, although plays an important role in coding the DNA, does not receive much attention academically and practically. Illumina, one of the famous companies, claims that the positions of bases follow the same sequels vertically after the fourth cycle. However, several works have been conducted and proved that the positions of base calling are not fixed. See Li (2012), Shao (2013) and Lin (2014). The above authors basically used a specific region to prove that there does exist a shift in base position. In the thesis, we apply machine learning technique called eigenface recognition and principal component scores to represent the overall behavior of cycles, and find the relationship between shift and cycles using the coefficients of eigenfaces.
author2 Shih-Huang Chan
author_facet Shih-Huang Chan
Yi-ChunLin
林怡君
author Yi-ChunLin
林怡君
spellingShingle Yi-ChunLin
林怡君
Quality of Base Calling for NextGeneration Sequence
author_sort Yi-ChunLin
title Quality of Base Calling for NextGeneration Sequence
title_short Quality of Base Calling for NextGeneration Sequence
title_full Quality of Base Calling for NextGeneration Sequence
title_fullStr Quality of Base Calling for NextGeneration Sequence
title_full_unstemmed Quality of Base Calling for NextGeneration Sequence
title_sort quality of base calling for nextgeneration sequence
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/62028518932776264013
work_keys_str_mv AT yichunlin qualityofbasecallingfornextgenerationsequence
AT línyíjūn qualityofbasecallingfornextgenerationsequence
AT yichunlin cìshìdàijīyīndìngxùzhīpǐnzhì
AT línyíjūn cìshìdàijīyīndìngxùzhīpǐnzhì
_version_ 1718277114992525312