CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling

碩士 === 國立交通大學 === 生物資訊及系統生物研究所 === 103 === World Health Organization (WHO) mentioned that early detection of cancer greatly increases the chances for successful treatment. Genetic testing in clinical diagnosis plays an integral role. In breast cancer, mammography, ultrasound, and magnetic resonance...

Full description

Bibliographic Details
Main Authors: Chen, Yi-Chun, 陳怡君
Other Authors: Yang, Jinn-Moon
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/g3967t
id ndltd-TW-103NCTU5112142
record_format oai_dc
spelling ndltd-TW-103NCTU51121422019-05-15T22:34:02Z http://ndltd.ncl.edu.tw/handle/g3967t CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling CoMI:以共識互信息鑑定生物標記及分析全基因組表達譜 Chen, Yi-Chun 陳怡君 碩士 國立交通大學 生物資訊及系統生物研究所 103 World Health Organization (WHO) mentioned that early detection of cancer greatly increases the chances for successful treatment. Genetic testing in clinical diagnosis plays an integral role. In breast cancer, mammography, ultrasound, and magnetic resonance imaging (MRI) are considered as an effective strategy for early detection. Once judged suffering breast cancer, biopsy could determine whether the tumor is benign or malignant. Furthermore, Genetic testing as a prognostic molecular way determines breast cancer subtype that provides a chance to treatment. In general, a biomarker should have the following properties: (1) Readily quantifiable in accessible biological samples, (2) Expression is consistent in the general population, and (3) Expression is significantly increased especially in the disease condition. Recently, genome-wide expression profiling (e.g. microarray and NGS) holds tremendous promise for revealing the patterns of coordinately regulated genes for early detection of diseases. Currently, T-test is a commonly used method for identifying biomarkers from genome-wide expression gene profiling. T-test often identified numerous significant genes without biomarker properties. Here, we propose a new method, called (Consensus Mutual Information, CoMI), for analyzing genome-wide expression profiling and discovering biomarkers fitting the biomarker properties. First, we keep high expression genes by filtering low expression genes for readily quantifiable. Based on the second and third biomarker properties, we have developed a new scoring function SCoMI to identify consistent expression genes and significant differential expressed genes between normal and disease state. The scoring function SCoMI consist of mutual information (SMI), entropy (Scon), and differential of group rank (Sdist). The mutual information and entropy are used to measure the differential and consistent expression genes, respectively. Sdist can evaluate the differential expression between normal and disease state. We have evaluated our method and scoring function SCoMI to analyze genome-wide expression profiling and discover the biomarkers on two microarray data sets, breast cancer and Alzheimer's disease. For discovery of the selected genes using our method, we applied the enrichments of gene ontology terms (i.e., biological process and cellular component,) and gene clustering to check the biological meanings and biomarker properties of these genes. Experimental results indicate that these selected genes are highly correlated with breast cancer and Alzheimer's disease. In addition, we integrated T-test and our method on these two sets. In breast cancer dataset, we not only identified 115 genes which incriminate normal and tumor samples and but also identify basal-like type patients. Interestingly, we got the similar results from these 115 selected genes that were applied to an independent TCGA data set. Our method and scoring function SCoMI provide a useful method to discover a set of biomarkers for early detection of diseases. Yang, Jinn-Moon 楊進木 2015 學位論文 ; thesis 55 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 生物資訊及系統生物研究所 === 103 === World Health Organization (WHO) mentioned that early detection of cancer greatly increases the chances for successful treatment. Genetic testing in clinical diagnosis plays an integral role. In breast cancer, mammography, ultrasound, and magnetic resonance imaging (MRI) are considered as an effective strategy for early detection. Once judged suffering breast cancer, biopsy could determine whether the tumor is benign or malignant. Furthermore, Genetic testing as a prognostic molecular way determines breast cancer subtype that provides a chance to treatment. In general, a biomarker should have the following properties: (1) Readily quantifiable in accessible biological samples, (2) Expression is consistent in the general population, and (3) Expression is significantly increased especially in the disease condition. Recently, genome-wide expression profiling (e.g. microarray and NGS) holds tremendous promise for revealing the patterns of coordinately regulated genes for early detection of diseases. Currently, T-test is a commonly used method for identifying biomarkers from genome-wide expression gene profiling. T-test often identified numerous significant genes without biomarker properties. Here, we propose a new method, called (Consensus Mutual Information, CoMI), for analyzing genome-wide expression profiling and discovering biomarkers fitting the biomarker properties. First, we keep high expression genes by filtering low expression genes for readily quantifiable. Based on the second and third biomarker properties, we have developed a new scoring function SCoMI to identify consistent expression genes and significant differential expressed genes between normal and disease state. The scoring function SCoMI consist of mutual information (SMI), entropy (Scon), and differential of group rank (Sdist). The mutual information and entropy are used to measure the differential and consistent expression genes, respectively. Sdist can evaluate the differential expression between normal and disease state. We have evaluated our method and scoring function SCoMI to analyze genome-wide expression profiling and discover the biomarkers on two microarray data sets, breast cancer and Alzheimer's disease. For discovery of the selected genes using our method, we applied the enrichments of gene ontology terms (i.e., biological process and cellular component,) and gene clustering to check the biological meanings and biomarker properties of these genes. Experimental results indicate that these selected genes are highly correlated with breast cancer and Alzheimer's disease. In addition, we integrated T-test and our method on these two sets. In breast cancer dataset, we not only identified 115 genes which incriminate normal and tumor samples and but also identify basal-like type patients. Interestingly, we got the similar results from these 115 selected genes that were applied to an independent TCGA data set. Our method and scoring function SCoMI provide a useful method to discover a set of biomarkers for early detection of diseases.
author2 Yang, Jinn-Moon
author_facet Yang, Jinn-Moon
Chen, Yi-Chun
陳怡君
author Chen, Yi-Chun
陳怡君
spellingShingle Chen, Yi-Chun
陳怡君
CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling
author_sort Chen, Yi-Chun
title CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling
title_short CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling
title_full CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling
title_fullStr CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling
title_full_unstemmed CoMI: Consensus Mutual Information for discovering biomarkers and analyzing genome-wide expression profiling
title_sort comi: consensus mutual information for discovering biomarkers and analyzing genome-wide expression profiling
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/g3967t
work_keys_str_mv AT chenyichun comiconsensusmutualinformationfordiscoveringbiomarkersandanalyzinggenomewideexpressionprofiling
AT chényíjūn comiconsensusmutualinformationfordiscoveringbiomarkersandanalyzinggenomewideexpressionprofiling
AT chenyichun comiyǐgòngshíhùxìnxījiàndìngshēngwùbiāojìjífēnxīquánjīyīnzǔbiǎodápǔ
AT chényíjūn comiyǐgòngshíhùxìnxījiàndìngshēngwùbiāojìjífēnxīquánjīyīnzǔbiǎodápǔ
_version_ 1719130818309259264