Inferring genetic network based on various types of biological data using machine learning algorithms
博士 === 臺灣大學 === 醫學工程學研究所 === 98 === In the post-genome era, the analysis of high-throughput data has become a critical requirement in many laboratories. Many computational approaches have been developed to identify genetic or transcriptional interactions that may be used to prevent or disable unwant...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/96486950789473109001 |
id |
ndltd-TW-098NTU05530010 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NTU055300102015-10-13T13:43:18Z http://ndltd.ncl.edu.tw/handle/96486950789473109001 Inferring genetic network based on various types of biological data using machine learning algorithms 以多種生物訊息與智慧型演算法進行基因網路預測 Cheng-Long Chuang 莊欽龍 博士 臺灣大學 醫學工程學研究所 98 In the post-genome era, the analysis of high-throughput data has become a critical requirement in many laboratories. Many computational approaches have been developed to identify genetic or transcriptional interactions that may be used to prevent or disable unwanted state, such as those associated with oncogenesis or a disease. Therefore, inferring genetic interactions and transcriptional interactions through inspection of high-throughput data are essential issues in post-genomic research. In this study, we developed three computational models to extract the nonlinear relationship between genes, and also construct transcription regulatory networks and genetic regulatory networks with higher accuracy and larger biological significance. The first method is a pattern recognition (called PARE) approach that infers time-lagged genetic interactions from time-course microarray data. A non-linear score extracts some characteristics, the first and second derivatives and the enclosed area, of paired gene-expression curves to approximate the non-linear association and dynamics between the curves. Such non-linear score is then used to identify subclasses of gene pairs with different time lags. Finally, PARE integrates both MGED and existing knowledge via machine learning, and subsequently predicts the other genetic interactions in the subclass. The second method consists of two components, a robust correlation estimator and a nonlinear recurrent model. The method was used to simulate the underlying nonlinear regulatory mechanisms in biological organisms without any prior knowledge. The proposed algorithm was applied to infer the regulatory mechanisms of the general network in Saccharomyces cerevisiae and the pulmonary disease pathways in Homo sapiens with interesting outcomes. The third method is a fuzzy-logic approach, called AdaFuzzy, which integrates DNA sequence, microarray and ChIP-chip data to infer TIs. A robust position weight matrix and a feature vector are proposed in AdaFuzzy to search for consensus sequence motifs. AdaFuzzy was also able to classify all predicted TIs into one or more of the four promoter architectures. The validated success in the prediction results implies that AdaFuzzy can be applied to uncover TIs in yeast. Chung-Ming Chen 陳中明 2010 學位論文 ; thesis 159 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 臺灣大學 === 醫學工程學研究所 === 98 === In the post-genome era, the analysis of high-throughput data has become a critical requirement in many laboratories. Many computational approaches have been developed to identify genetic or transcriptional interactions that may be used to prevent or disable unwanted state, such as those associated with oncogenesis or a disease. Therefore, inferring genetic interactions and transcriptional interactions through inspection of high-throughput data are essential issues in post-genomic research. In this study, we developed three computational models to extract the nonlinear relationship between genes, and also construct transcription regulatory networks and genetic regulatory networks with higher accuracy and larger biological significance.
The first method is a pattern recognition (called PARE) approach that infers time-lagged genetic interactions from time-course microarray data. A non-linear score extracts some characteristics, the first and second derivatives and the enclosed area, of paired gene-expression curves to approximate the non-linear association and dynamics between the curves. Such non-linear score is then used to identify subclasses of gene pairs with different time lags. Finally, PARE integrates both MGED and existing knowledge via machine learning, and subsequently predicts the other genetic interactions in the subclass. The second method consists of two components, a robust correlation estimator and a nonlinear recurrent model. The method was used to simulate the underlying nonlinear regulatory mechanisms in biological organisms without any prior knowledge. The proposed algorithm was applied to infer the regulatory mechanisms of the general network in Saccharomyces cerevisiae and the pulmonary disease pathways in Homo sapiens with interesting outcomes. The third method is a fuzzy-logic approach, called AdaFuzzy, which integrates DNA sequence, microarray and ChIP-chip data to infer TIs. A robust position weight matrix and a feature vector are proposed in AdaFuzzy to search for consensus sequence motifs. AdaFuzzy was also able to classify all predicted TIs into one or more of the four promoter architectures. The validated success in the prediction results implies that AdaFuzzy can be applied to uncover TIs in yeast.
|
author2 |
Chung-Ming Chen |
author_facet |
Chung-Ming Chen Cheng-Long Chuang 莊欽龍 |
author |
Cheng-Long Chuang 莊欽龍 |
spellingShingle |
Cheng-Long Chuang 莊欽龍 Inferring genetic network based on various types of biological data using machine learning algorithms |
author_sort |
Cheng-Long Chuang |
title |
Inferring genetic network based on various types of biological data using machine learning algorithms |
title_short |
Inferring genetic network based on various types of biological data using machine learning algorithms |
title_full |
Inferring genetic network based on various types of biological data using machine learning algorithms |
title_fullStr |
Inferring genetic network based on various types of biological data using machine learning algorithms |
title_full_unstemmed |
Inferring genetic network based on various types of biological data using machine learning algorithms |
title_sort |
inferring genetic network based on various types of biological data using machine learning algorithms |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/96486950789473109001 |
work_keys_str_mv |
AT chenglongchuang inferringgeneticnetworkbasedonvarioustypesofbiologicaldatausingmachinelearningalgorithms AT zhuāngqīnlóng inferringgeneticnetworkbasedonvarioustypesofbiologicaldatausingmachinelearningalgorithms AT chenglongchuang yǐduōzhǒngshēngwùxùnxīyǔzhìhuìxíngyǎnsuànfǎjìnxíngjīyīnwǎnglùyùcè AT zhuāngqīnlóng yǐduōzhǒngshēngwùxùnxīyǔzhìhuìxíngyǎnsuànfǎjìnxíngjīyīnwǎnglùyùcè |
_version_ |
1717740735501959168 |