Protein-Ligand Docking and Structure Prediction Using Evolutionary Computation Approaches

Ph.D. === Feng Chia University === Graduate Institute of Information Engineering === 95 === Evolutionary algorithms (EAs) are powerful optimization tools and have been widely used in bioinformatics. For a complex prediction problem that involves a large number of tuning parameters, the prediction accuracy is dominated by the optimization performance of the u...

Bibliographic Details
Main Authors: Hung-Ming Chen, 陳宏銘
Other Authors: Shinn-Ying Ho
Format: Others
Language: en_US
Published: 2007
Online Access: http://ndltd.ncl.edu.tw/handle/23881120666961758413
id ndltd-TW-095FCU05392015
record_format oai_dc
spelling ndltd-TW-095FCU05392015 2015-12-11T04:04:31Z http://ndltd.ncl.edu.tw/handle/23881120666961758413 Protein-Ligand Docking and Structure Prediction Using Evolutionary Computation Approaches 用於蛋白質與小分子嵌合及結構預測的演化式計算方法 Hung-Ming Chen 陳宏銘 Ph.D. Feng Chia University Graduate Institute of Information Engineering 95 Shinn-Ying Ho 何信瑩 2007 學位論文 ; thesis 90 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Ph.D. === Feng Chia University === Graduate Institute of Information Engineering === 95 === Evolutionary algorithms (EAs) are powerful optimization tools and have been widely used in bioinformatics. For a complex prediction problem that involves a large number of tuning parameters, the prediction accuracy is dominated by the optimization performance of the evolutionary algorithm used. In this dissertation, we use several efficient evolutionary computation approaches to solve the following prediction problems: design of fuzzy rule-based classifiers, flexible protein-ligand docking, and protein structural class prediction.

First, we propose an evolutionary approach to designing accurate classifiers with a compact fuzzy rule base using a scatter partition of the feature space, in which all elements of the fuzzy classifier design problem are encoded as parameters of a complex optimization problem. An intelligent genetic algorithm (IGA) is used to solve this design problem effectively despite its many tuning parameters. The merits of the proposed method are threefold: 1) it has a high search ability and efficiently finds fuzzy rule-based systems with high fitness values, 2) the obtained fuzzy rules are highly interpretable, and 3) the obtained compact classifiers have high classification accuracy on unseen test patterns. A performance comparison and statistical analysis of experimental results using ten-fold cross-validation on 11 well-known data sets with numerical attribute values show that the IGA-based method, without heuristics, is efficient in designing accurate and compact fuzzy classifiers. An application of the fuzzy classifier to a prediction problem in gene expression analysis is then introduced.

Flexible protein-ligand docking can be formulated as a parameter optimization problem whose objective is to find the translation, orientation, and conformation of a ligand relative to the active site of a target protein that give the lowest energy. For highly flexible ligands with many rotatable bonds, the optimization problem becomes more difficult because the conformation space is extremely large. We propose a novel optimization algorithm, Swarm Optimization for flexible DOCKing (SODOCK), based on particle swarm optimization (PSO), for solving flexible protein-ligand docking problems. Computer simulation results show that SODOCK obtains more accurate results than several state-of-the-art docking methods.

Finally, we propose an evolutionary feature selection approach based on an inheritable intelligent genetic algorithm for the prediction of protein structural class. Adding physicochemical properties to the protein features can improve the prediction accuracy of a suitable classifier. However, selecting useful features from hundreds of physicochemical properties is very difficult. The proposed evolutionary feature selection method can obtain high-quality feature subsets from amino acid composition and the physicochemical properties in AAindex. The experimental results show that the obtained feature subsets improve the prediction accuracies of the naive Bayes classifier, support vector machine (SVM), and logistic regression, compared with the same classifiers using amino acid composition features alone. The average prediction accuracy of these classifiers with the obtained feature subsets is also superior to that achieved with an existing 66-dimensional feature set designed by experts.
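The docking formulation in the abstract (a parameter vector holding translation, orientation, and torsion angles, searched by particle swarm optimization) can be illustrated with a minimal sketch. This is a generic global-best PSO, not SODOCK itself: the function `toy_energy` is a hypothetical placeholder for a real docking energy function, and the vector layout (3 translation values, 3 orientation angles, 4 torsions), the bounds, and the coefficients `w`, `c1`, `c2` are all assumptions chosen for illustration.

```python
import math
import random


def toy_energy(x):
    """Hypothetical stand-in for a docking energy (NOT a real scoring function).

    A simple sphere function with its minimum of 0 at the origin.
    """
    return sum(v * v for v in x)


def pso_minimize(energy, bounds, n_particles=20, n_iters=200,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Generic global-best PSO; returns (best_position, best_energy)."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Random initial positions inside the bounds, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [energy(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move the particle and clamp it to the search bounds.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = energy(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


# Assumed layout: 3 translation values, 3 orientation angles, 4 torsion angles.
N_TORSIONS = 4
BOUNDS = [(-5.0, 5.0)] * 3 + [(-math.pi, math.pi)] * (3 + N_TORSIONS)

best_x, best_e = pso_minimize(toy_energy, BOUNDS)
```

Each extra rotatable bond adds one dimension to `BOUNDS`, which is why highly flexible ligands enlarge the search space. A real docking run would replace `toy_energy` with an energy evaluation of the ligand pose against the protein.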
author2 Shinn-Ying Ho
author Hung-Ming Chen
陳宏銘
title Protein-Ligand Docking and Structure Prediction Using Evolutionary Computation Approaches
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/23881120666961758413