Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques

Ras proteins play a pivotal role as oncogenes by participating in diverse signaling events, including those linked to cell growth, differentiation, and proliferation. Using experimental fitness data and implementing artificial intelligence and a computational mutagenesis technique, we developed mode...

Full description

Bibliographic Details
Main Authors: Majid Masso, Arnav Bansal, Preethi Prem, Akhil Gajjala, Iosif I. Vaisman
Format: Article
Language:English
Published: Elsevier 2019-06-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844019311661
id doaj-d9fe398b50a34003b4f40bdbe80a2e2c
record_format Article
spelling doaj-d9fe398b50a34003b4f40bdbe80a2e2c2020-11-25T03:16:26ZengElsevierHeliyon2405-84402019-06-0156e01884Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniquesMajid Masso0Arnav Bansal1Preethi Prem2Akhil Gajjala3Iosif I. Vaisman4Corresponding author.; School of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USASchool of Systems Biology, George Mason University, 10900 University Blvd. MS 5B3, Manassas, Virginia, 20110, USARas proteins play a pivotal role as oncogenes by participating in diverse signaling events, including those linked to cell growth, differentiation, and proliferation. Using experimental fitness data and implementing artificial intelligence and a computational mutagenesis technique, we developed models that reliably predict fitness for all single residue mutants of H-ras proto-oncogene protein p21. The computational mutagenesis generated a feature vector of protein structural changes for each variant, and these data correlated well with fitness. Random forest classification and tree regression machine learning algorithms were implemented for training predictive models. Cross-validations were used to evaluate model performance, and control experiments were performed to assess statistical significance. Classification models revealed a balanced accuracy rate as high as 82%, with a Matthew's correlation of 0.63, and an area under ROC curve of 0.90. Similarly, regression models displayed Pearson's correlation reaching 0.79. On the other hand, control data sets led to performance values consistent with random guessing. Comparisons with several related state-of-the-art methods reflected favorably on our trained models. This H-Ras proof-of-principle study suggests a complementary approach for understanding mechanisms with which other proteins are involved in oncogenesis, including related Ras isoforms, and for providing useful insights into designing future diagnostic and treatment modalities.http://www.sciencedirect.com/science/article/pii/S2405844019311661BioinformaticsBiophysicsCancer researchComputational biologySystems biology
collection DOAJ
language English
format Article
sources DOAJ
author Majid Masso
Arnav Bansal
Preethi Prem
Akhil Gajjala
Iosif I. Vaisman
spellingShingle Majid Masso
Arnav Bansal
Preethi Prem
Akhil Gajjala
Iosif I. Vaisman
Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
Heliyon
Bioinformatics
Biophysics
Cancer research
Computational biology
Systems biology
author_facet Majid Masso
Arnav Bansal
Preethi Prem
Akhil Gajjala
Iosif I. Vaisman
author_sort Majid Masso
title Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
title_short Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
title_full Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
title_fullStr Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
title_full_unstemmed Fitness of unregulated human Ras mutants modeled by implementing computational mutagenesis and machine learning techniques
title_sort fitness of unregulated human ras mutants modeled by implementing computational mutagenesis and machine learning techniques
publisher Elsevier
series Heliyon
issn 2405-8440
publishDate 2019-06-01
description Ras proteins play a pivotal role as oncogenes by participating in diverse signaling events, including those linked to cell growth, differentiation, and proliferation. Using experimental fitness data and implementing artificial intelligence and a computational mutagenesis technique, we developed models that reliably predict fitness for all single residue mutants of H-ras proto-oncogene protein p21. The computational mutagenesis generated a feature vector of protein structural changes for each variant, and these data correlated well with fitness. Random forest classification and tree regression machine learning algorithms were implemented for training predictive models. Cross-validations were used to evaluate model performance, and control experiments were performed to assess statistical significance. Classification models revealed a balanced accuracy rate as high as 82%, with a Matthew's correlation of 0.63, and an area under ROC curve of 0.90. Similarly, regression models displayed Pearson's correlation reaching 0.79. On the other hand, control data sets led to performance values consistent with random guessing. Comparisons with several related state-of-the-art methods reflected favorably on our trained models. This H-Ras proof-of-principle study suggests a complementary approach for understanding mechanisms with which other proteins are involved in oncogenesis, including related Ras isoforms, and for providing useful insights into designing future diagnostic and treatment modalities.
topic Bioinformatics
Biophysics
Cancer research
Computational biology
Systems biology
url http://www.sciencedirect.com/science/article/pii/S2405844019311661
work_keys_str_mv AT majidmasso fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques
AT arnavbansal fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques
AT preethiprem fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques
AT akhilgajjala fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques
AT iosifivaisman fitnessofunregulatedhumanrasmutantsmodeledbyimplementingcomputationalmutagenesisandmachinelearningtechniques
_version_ 1724636150722723840