Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data

碩士 === 國立臺灣大學 === 生醫電子與資訊學研究所 === 101 === Exposure of ionizing radiation (IR) can cause DNA damages in cells due to DNA double-strand break. It is well known that exposure to IR could lead to the increased risks of cancer and other acute radiation sickness, which may eventually cause rapid death...

Full description

Bibliographic Details
Main Authors: Yi-Yao Hsu, 許逸堯
Other Authors: Eric Y. Chuang
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/70722013808234904592
id ndltd-TW-101NTU05114017
record_format oai_dc
spelling ndltd-TW-101NTU051140172015-10-13T23:05:30Z http://ndltd.ncl.edu.tw/handle/70722013808234904592 Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data 利用跨平台基因晶片資料建立預測輻射線劑量之基因圖譜 Yi-Yao Hsu 許逸堯 碩士 國立臺灣大學 生醫電子與資訊學研究所 101 Exposure of ionizing radiation (IR) can cause DNA damages in cells due to DNA double-strand break. It is well known that exposure to IR could lead to the increased risks of cancer and other acute radiation sickness, which may eventually cause rapid death in high exposure dose. In a radiological emergency, such as Fukushima Daiichi nuclear disaster in 2011, exactly evaluating the exposure dose would be highly required for better health care in the early treatment. In recent years, several studies had suggested some gene expression signatures associated with radiation doses, which can serve as effective predictors for the exposed radiation doses by using high-throughput microarray technique. However, most of them only tested the performances in their original studies and no external datasets were further used for validation. These observations clearly revealed that results based on one microarray dataset was prone to suffer from systematic biases, and thus was limited in biological interpretation. On the other hand, most of the studies that only focused on a single cell line typically had high false positive rates since the transcriptional responses to IR were cell line-dependent. To overcome these issues, we proposed a comprehensive approach by performing a meta-analysis of publicly available microarray datasets. First, differentially expressed genes in responding to different radiation doses were identified in the respective dataset by using statistical models. Next, the identified genes that selected from differential expression analyses were further filtering by the frequency in which they are present among different datasets. A machine learning algorithm, support vector machine, was utilized to develop prediction models distinguishing high-dose and low-dose exposure. Lastly, the performances of the meta-signature was evaluated by using cross-validation in internal datasets and independently validated in external studies. In this study, a 29-gene meta-signature was identified that can distinguish cells with either high-dose exposure (> 8 Gy) or low-dose exposure (< 2 Gy). The gene functions related to the apoptosis regulating pathway and the p53 signaling pathway were significantly enriched in the identified meta-signature. Compared to the previous studies, our findings had superior performance in predicting IR exposure levels with a total accuracy of 85% (6–14% higher than previous studies) in the internal cross-validation. Even in an external validation, this meta-signature correctly predicted the samples with an overall accuracy of 84% (81/94). In conclusion, we provide a comprehensive approach to dissect the radiation responses among different doses in the independent studies. The findings may facilitate exploration in biological functions regulated by IR. Above all, the results shown improvement in the robustness of prediction and may be applied into practical usage. Eric Y. Chuang 莊曜宇 2013 學位論文 ; thesis 65 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 生醫電子與資訊學研究所 === 101 === Exposure of ionizing radiation (IR) can cause DNA damages in cells due to DNA double-strand break. It is well known that exposure to IR could lead to the increased risks of cancer and other acute radiation sickness, which may eventually cause rapid death in high exposure dose. In a radiological emergency, such as Fukushima Daiichi nuclear disaster in 2011, exactly evaluating the exposure dose would be highly required for better health care in the early treatment. In recent years, several studies had suggested some gene expression signatures associated with radiation doses, which can serve as effective predictors for the exposed radiation doses by using high-throughput microarray technique. However, most of them only tested the performances in their original studies and no external datasets were further used for validation. These observations clearly revealed that results based on one microarray dataset was prone to suffer from systematic biases, and thus was limited in biological interpretation. On the other hand, most of the studies that only focused on a single cell line typically had high false positive rates since the transcriptional responses to IR were cell line-dependent. To overcome these issues, we proposed a comprehensive approach by performing a meta-analysis of publicly available microarray datasets. First, differentially expressed genes in responding to different radiation doses were identified in the respective dataset by using statistical models. Next, the identified genes that selected from differential expression analyses were further filtering by the frequency in which they are present among different datasets. A machine learning algorithm, support vector machine, was utilized to develop prediction models distinguishing high-dose and low-dose exposure. Lastly, the performances of the meta-signature was evaluated by using cross-validation in internal datasets and independently validated in external studies. In this study, a 29-gene meta-signature was identified that can distinguish cells with either high-dose exposure (> 8 Gy) or low-dose exposure (< 2 Gy). The gene functions related to the apoptosis regulating pathway and the p53 signaling pathway were significantly enriched in the identified meta-signature. Compared to the previous studies, our findings had superior performance in predicting IR exposure levels with a total accuracy of 85% (6–14% higher than previous studies) in the internal cross-validation. Even in an external validation, this meta-signature correctly predicted the samples with an overall accuracy of 84% (81/94). In conclusion, we provide a comprehensive approach to dissect the radiation responses among different doses in the independent studies. The findings may facilitate exploration in biological functions regulated by IR. Above all, the results shown improvement in the robustness of prediction and may be applied into practical usage.
author2 Eric Y. Chuang
author_facet Eric Y. Chuang
Yi-Yao Hsu
許逸堯
author Yi-Yao Hsu
許逸堯
spellingShingle Yi-Yao Hsu
許逸堯
Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data
author_sort Yi-Yao Hsu
title Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data
title_short Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data
title_full Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data
title_fullStr Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data
title_full_unstemmed Development of Gene Expression Signatures for Radiation Exposure Prediction using Cross-Platform Microarray Data
title_sort development of gene expression signatures for radiation exposure prediction using cross-platform microarray data
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/70722013808234904592
work_keys_str_mv AT yiyaohsu developmentofgeneexpressionsignaturesforradiationexposurepredictionusingcrossplatformmicroarraydata
AT xǔyìyáo developmentofgeneexpressionsignaturesforradiationexposurepredictionusingcrossplatformmicroarraydata
AT yiyaohsu lìyòngkuàpíngtáijīyīnjīngpiànzīliàojiànlìyùcèfúshèxiànjìliàngzhījīyīntúpǔ
AT xǔyìyáo lìyòngkuàpíngtáijīyīnjīngpiànzīliàojiànlìyùcèfúshèxiànjìliàngzhījīyīntúpǔ
_version_ 1718083582463836160