Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis
碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 99 === With the coming of post genomic era, numerous scientists around the world are studying the disease gene and genotype for each individual. The development of next generation sequencing technology not only has allowed researchers to get very high-throughput genot...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/30179719060150810849 |
id |
ndltd-TW-099YM005114026 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099YM0051140262015-10-13T20:37:08Z http://ndltd.ncl.edu.tw/handle/30179719060150810849 Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis 新世代定序結果分析:發現與疾病相關之單鹼基變異及融合基因的高通量分析方法 Jui-Tse Hsu 徐瑞澤 碩士 國立陽明大學 生物醫學資訊研究所 99 With the coming of post genomic era, numerous scientists around the world are studying the disease gene and genotype for each individual. The development of next generation sequencing technology not only has allowed researchers to get very high-throughput genotype data in relatively short time, it also helps scientists reduce the time needed in conducting a research project. Although the data output rate is constantly improving, the integration of these genotype data with known experimental data and the use of these high-throughput sequencing data to find interesting biological phenomena rely on the development of adequate computational algorithms and analytical pipelines. In order to use these high-throughput sequencing data effectively, I have assembled two pipelines to process the next generation sequencing data, one for the DNA sequencing data, and the other for the RNA sequencing data. For the DNA sequencing data, this pipeline can link personal genomic sequence with public domain genotype data. It integrates Ensembl single nucleotide variation data, dbSNP data, reference genotype data into analysis. It also links OMIM (Online Mendelian Inheritance in Man) known disease data to known gene variations for each individual. For the RNA sequencing data, I have developed a fusion gene detector pipeline to discover abnormal chromosome and gene aberrations from RNA sequencing data. It can find gene rearrangement events such as chromosome insertion, deletion, inversion, duplication, translocation, intra-chromosome gene fusion events, and inter-chromosome fusion events. I use this pipeline to discover the relationships between gene fusion events and amino acid conformation change, such as the change of promoter of house keeping gene, the loss of apoptosis genes, the changes of reading frame. By applying some high-throuput sequencing data into our analytical pipeline, we have found tens of thousands of single nucleotide variations between each individual. It may explain the difference between each individual, and affect the susceptibility to certain diseases. We have use this pipeline to discover the complex gene variations in cancer genome, including chromosome rearrangements, single nucleotide variation, abnormal gene splicing and connecting patterns. Ueng-Cheng Yang 楊永正 2011 學位論文 ; thesis 140 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立陽明大學 === 生物醫學資訊研究所 === 99 === With the coming of post genomic era, numerous scientists around the world are studying the disease gene and genotype for each individual. The development of next generation sequencing technology not only has allowed researchers to get very high-throughput genotype data in relatively short time, it also helps scientists reduce the time needed in conducting a research project.
Although the data output rate is constantly improving, the integration of these genotype data with known experimental data and the use of these high-throughput sequencing data to find interesting biological phenomena rely on the development of adequate computational algorithms and analytical pipelines.
In order to use these high-throughput sequencing data effectively, I have assembled two pipelines to process the next generation sequencing data, one for the DNA sequencing data, and the other for the RNA sequencing data.
For the DNA sequencing data, this pipeline can link personal genomic sequence with public domain genotype data. It integrates Ensembl single nucleotide variation data, dbSNP data, reference genotype data into analysis. It also links OMIM (Online Mendelian Inheritance in Man) known disease data to known gene variations for each individual.
For the RNA sequencing data, I have developed a fusion gene detector pipeline to discover abnormal chromosome and gene aberrations from RNA sequencing data. It can find gene rearrangement events such as chromosome insertion, deletion, inversion, duplication, translocation, intra-chromosome gene fusion events, and inter-chromosome fusion events. I use this pipeline to discover the relationships between gene fusion events and amino acid conformation change, such as the change of promoter of house keeping gene, the loss of apoptosis genes, the changes of reading frame.
By applying some high-throuput sequencing data into our analytical pipeline, we have found tens of thousands of single nucleotide variations between each individual. It may explain the difference between each individual, and affect the susceptibility to certain diseases. We have use this pipeline to discover the complex gene variations in cancer genome, including chromosome rearrangements, single nucleotide variation, abnormal gene splicing and connecting patterns.
|
author2 |
Ueng-Cheng Yang |
author_facet |
Ueng-Cheng Yang Jui-Tse Hsu 徐瑞澤 |
author |
Jui-Tse Hsu 徐瑞澤 |
spellingShingle |
Jui-Tse Hsu 徐瑞澤 Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
author_sort |
Jui-Tse Hsu |
title |
Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
title_short |
Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
title_full |
Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
title_fullStr |
Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
title_full_unstemmed |
Analysis of the next generation sequencing results: Discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
title_sort |
analysis of the next generation sequencing results: discovery of disease-associated single nucleotide variants and fusion genes by high throughput analysis |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/30179719060150810849 |
work_keys_str_mv |
AT juitsehsu analysisofthenextgenerationsequencingresultsdiscoveryofdiseaseassociatedsinglenucleotidevariantsandfusiongenesbyhighthroughputanalysis AT xúruìzé analysisofthenextgenerationsequencingresultsdiscoveryofdiseaseassociatedsinglenucleotidevariantsandfusiongenesbyhighthroughputanalysis AT juitsehsu xīnshìdàidìngxùjiéguǒfēnxīfāxiànyǔjíbìngxiāngguānzhīdānjiǎnjībiànyìjírónghéjīyīndegāotōngliàngfēnxīfāngfǎ AT xúruìzé xīnshìdàidìngxùjiéguǒfēnxīfāxiànyǔjíbìngxiāngguānzhīdānjiǎnjībiànyìjírónghéjīyīndegāotōngliàngfēnxīfāngfǎ |
_version_ |
1718049098625449984 |