Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
Ras proteins play a pivotal role as oncogenes by participating in diverse signaling events, including those linked to cell growth, differentiation, and proliferation. Using experimental fitness data and implementing artificial intelligence and a computational mutagenesis technique, we developed mode...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2019-06-01
|
Series: | Heliyon |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2405844019311661 |
id |
doaj-d9fe398b50a34003b4f40bdbe80a2e2c |
---|---|
record_format |
Article |
spelling |
doaj-d9fe398b50a34003b4f40bdbe80a2e2c2020-11-25T03:16:26ZengElsevierHeliyon2405-84402019-06-0156e01884Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniquesMajid Masso0Arnav Bansal1Preethi Prem2Akhil Gajjala3Iosif I. Vaisman4Corresponding author.; School of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USARas proteins play a pivotal role as oncogenes by participating in diverse signaling events, including those linked to cell growth, differentiation, and proliferation. Using experimental fitness data and implementing artificial intelligence and a computational mutagenesis technique, we developed models that reliably predict fitness for all single residue mutants of H-ras proto-oncogene protein p21. The computational mutagenesis generated a feature vector of protein structural changes for each variant, and these data correlated well with fitness. Random forest classification and tree regression machine learning algorithms were implemented for training predictive models. Cross-validations were used to evaluate model performance, and control experiments were performed to assess statistical significance. Classification models revealed a balanced accuracy rate as high as 82%, with a Matthew's correlation of 0.63, and an area under ROC curve of 0.90. Similarly, regression models displayed Pearson's correlation reaching 0.79. On the other hand, control data sets led to performance values consistent with random guessing. Comparisons with several related state-of-the-art methods reflected favorably on our trained models. This H-Ras proof-of-principle study suggests a complementary approach for understanding mechanisms with which other proteins are involved in oncogenesis, including related Ras isoforms, and for providing useful insights into designing future diagnostic and treatment modalities.http://www.sciencedirect.com/science/article/pii/S2405844019311661BioinformaticsBiophysicsCancer researchComputational biologySystems biology |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Majid Masso Arnav Bansal Preethi Prem Akhil Gajjala Iosif I. Vaisman |
spellingShingle |
Majid Masso Arnav Bansal Preethi Prem Akhil Gajjala Iosif I. Vaisman Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques Heliyon Bioinformatics Biophysics Cancer research Computational biology Systems biology |
author_facet |
Majid Masso Arnav Bansal Preethi Prem Akhil Gajjala Iosif I. Vaisman |
author_sort |
Majid Masso |
title |
Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques |
title_short |
Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques |
title_full |
Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques |
title_fullStr |
Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques |
title_full_unstemmed |
Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques |
title_sort |
fitness of unregulated human ras mutants modeled by implementing computational mutagenesis and machine learning techniques |
publisher |
Elsevier |
series |
Heliyon |
issn |
2405-8440 |
publishDate |
2019-06-01 |
description |
Ras proteins play a pivotal role as oncogenes by participating in diverse signaling events, including those linked to cell growth, differentiation, and proliferation. Using experimental fitness data and implementing artificial intelligence and a computational mutagenesis technique, we developed models that reliably predict fitness for all single residue mutants of H-ras proto-oncogene protein p21. The computational mutagenesis generated a feature vector of protein structural changes for each variant, and these data correlated well with fitness. Random forest classification and tree regression machine learning algorithms were implemented for training predictive models. Cross-validations were used to evaluate model performance, and control experiments were performed to assess statistical significance. Classification models revealed a balanced accuracy rate as high as 82%, with a Matthew's correlation of 0.63, and an area under ROC curve of 0.90. Similarly, regression models displayed Pearson's correlation reaching 0.79. On the other hand, control data sets led to performance values consistent with random guessing. Comparisons with several related state-of-the-art methods reflected favorably on our trained models. This H-Ras proof-of-principle study suggests a complementary approach for understanding mechanisms with which other proteins are involved in oncogenesis, including related Ras isoforms, and for providing useful insights into designing future diagnostic and treatment modalities. |
topic |
Bioinformatics Biophysics Cancer research Computational biology Systems biology |
url |
http://www.sciencedirect.com/science/article/pii/S2405844019311661 |
work_keys_str_mv |
AT majidmasso fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques AT arnavbansal fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques AT preethiprem fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques AT akhilgajjala fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques AT iosifivaisman fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques |
_version_ |
1724636150722723840 |