Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis

碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 99 === With the coming of post genomic era, numerous scientists around the world are studying the disease gene and genotype for each individual. The development of next generation sequencing technology not only has allowed researchers to get very high-throughput genot...

Full description

Bibliographic Details
Main Authors: Jui-Tse Hsu, 徐瑞澤
Other Authors: Ueng-Cheng Yang
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/30179719060150810849
id ndltd-TW-099YM005114026
record_format oai_dc
spelling ndltd-TW-099YM0051140262015-10-13T20:37:08Z http://ndltd.ncl.edu.tw/handle/30179719060150810849 Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis 新世代定序結果分析:發現與疾病相關之單鹼基變異及融合基因的高通量分析方法 Jui-Tse Hsu 徐瑞澤 碩士 國立陽明大學 生物醫學資訊研究所 99 With the coming of post genomic era, numerous scientists around the world are studying the disease gene and genotype for each individual. The development of next generation sequencing technology not only has allowed researchers to get very high-throughput genotype data in relatively short time, it also helps scientists reduce the time needed in conducting a research project. Although the data output rate is constantly improving, the integration of these genotype data with known experimental data and the use of these high-throughput sequencing data to find interesting biological phenomena rely on the development of adequate computational algorithms and analytical pipelines. In order to use these high-throughput sequencing data effectively, I have assembled two pipelines to process the next generation sequencing data, one for the DNA sequencing data, and the other for the RNA sequencing data. For the DNA sequencing data, this pipeline can link personal genomic sequence with public domain genotype data. It integrates Ensembl single nucleotide variation data, dbSNP data, reference genotype data into analysis. It also links OMIM (Online Mendelian Inheritance in Man) known disease data to known gene variations for each individual. For the RNA sequencing data, I have developed a fusion gene detector pipeline to discover abnormal chromosome and gene aberrations from RNA sequencing data. It can find gene rearrangement events such as chromosome insertion, deletion, inversion, duplication, translocation, intra-chromosome gene fusion events, and inter-chromosome fusion events. I use this pipeline to discover the relationships between gene fusion events and amino acid conformation change, such as the change of promoter of house keeping gene, the loss of apoptosis genes, the changes of reading frame. By applying some high-throuput sequencing data into our analytical pipeline, we have found tens of thousands of single nucleotide variations between each individual. It may explain the difference between each individual, and affect the susceptibility to certain diseases. We have use this pipeline to discover the complex gene variations in cancer genome, including chromosome rearrangements, single nucleotide variation, abnormal gene splicing and connecting patterns. Ueng-Cheng Yang 楊永正 2011 學位論文 ; thesis 140 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 99 === With the coming of post genomic era, numerous scientists around the world are studying the disease gene and genotype for each individual. The development of next generation sequencing technology not only has allowed researchers to get very high-throughput genotype data in relatively short time, it also helps scientists reduce the time needed in conducting a research project. Although the data output rate is constantly improving, the integration of these genotype data with known experimental data and the use of these high-throughput sequencing data to find interesting biological phenomena rely on the development of adequate computational algorithms and analytical pipelines. In order to use these high-throughput sequencing data effectively, I have assembled two pipelines to process the next generation sequencing data, one for the DNA sequencing data, and the other for the RNA sequencing data. For the DNA sequencing data, this pipeline can link personal genomic sequence with public domain genotype data. It integrates Ensembl single nucleotide variation data, dbSNP data, reference genotype data into analysis. It also links OMIM (Online Mendelian Inheritance in Man) known disease data to known gene variations for each individual. For the RNA sequencing data, I have developed a fusion gene detector pipeline to discover abnormal chromosome and gene aberrations from RNA sequencing data. It can find gene rearrangement events such as chromosome insertion, deletion, inversion, duplication, translocation, intra-chromosome gene fusion events, and inter-chromosome fusion events. I use this pipeline to discover the relationships between gene fusion events and amino acid conformation change, such as the change of promoter of house keeping gene, the loss of apoptosis genes, the changes of reading frame. By applying some high-throuput sequencing data into our analytical pipeline, we have found tens of thousands of single nucleotide variations between each individual. It may explain the difference between each individual, and affect the susceptibility to certain diseases. We have use this pipeline to discover the complex gene variations in cancer genome, including chromosome rearrangements, single nucleotide variation, abnormal gene splicing and connecting patterns.
author2 Ueng-Cheng Yang
author_facet Ueng-Cheng Yang
Jui-Tse Hsu
徐瑞澤
author Jui-Tse Hsu
徐瑞澤
spellingShingle Jui-Tse Hsu
徐瑞澤
Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
author_sort Jui-Tse Hsu
title Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
title_short Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
title_full Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
title_fullStr Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
title_full_unstemmed Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
title_sort analysis of the next generation sequencing results: discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/30179719060150810849
work_keys_str_mv AT juitsehsu analysisofthenextgenerationsequencingresultsdiscoveryofdiseaseassociatedsinglenucleotidevariantsandfusiongenesbyhighthroughputanalysis
AT xúruìzé analysisofthenextgenerationsequencingresultsdiscoveryofdiseaseassociatedsinglenucleotidevariantsandfusiongenesbyhighthroughputanalysis
AT juitsehsu xīnshìdàidìngxùjiéguǒfēnxīfāxiànyǔjíbìngxiāngguānzhīdānjiǎnjībiànyìjírónghéjīyīndegāotōngliàngfēnxīfāngfǎ
AT xúruìzé xīnshìdàidìngxùjiéguǒfēnxīfāxiànyǔjíbìngxiāngguānzhīdānjiǎnjībiànyìjírónghéjīyīndegāotōngliàngfēnxīfāngfǎ
_version_ 1718049098625449984