Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets

<p>Abstract</p> <p>Background</p> <p>Tuberculosis is a contagious disease caused by <it>Mycobacterium tuberculosis </it>(Mtb), affecting more than two billion people around the globe and is one of the major causes of morbidity and mortality in the developing...

Full description

Bibliographic Details
Main Authors: Periwal Vinita, Rajappan Jinuraj K, Jaleel Abdul UC, Scaria Vinod
Format: Article
Language:English
Published: BMC 2011-11-01
Series:BMC Research Notes
Online Access:http://www.biomedcentral.com/1756-0500/4/504
id doaj-14ba4993c9614f7997fb6e64cc7e0099
record_format Article
spelling doaj-14ba4993c9614f7997fb6e64cc7e00992020-11-25T02:12:51ZengBMCBMC Research Notes1756-05002011-11-014150410.1186/1756-0500-4-504Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasetsPeriwal VinitaRajappan Jinuraj KJaleel Abdul UCScaria Vinod<p>Abstract</p> <p>Background</p> <p>Tuberculosis is a contagious disease caused by <it>Mycobacterium tuberculosis </it>(Mtb), affecting more than two billion people around the globe and is one of the major causes of morbidity and mortality in the developing world. Recent reports suggest that Mtb has been developing resistance to the widely used anti-tubercular drugs resulting in the emergence and spread of multi drug-resistant (MDR) and extensively drug-resistant (XDR) strains throughout the world. In view of this global epidemic, there is an urgent need to facilitate fast and efficient lead identification methodologies. Target based screening of large compound libraries has been widely used as a fast and efficient approach for lead identification, but is restricted by the knowledge about the target structure. Whole organism screens on the other hand are target-agnostic and have been now widely employed as an alternative for lead identification but they are limited by the time and cost involved in running the screens for large compound libraries. This could be possibly be circumvented by using computational approaches to prioritize molecules for screening programmes.</p> <p>Results</p> <p>We utilized physicochemical properties of compounds to train four supervised classifiers (Naïve Bayes, Random Forest, J48 and SMO) on three publicly available bioassay screens of Mtb inhibitors and validated the robustness of the predictive models using various statistical measures.</p> <p>Conclusions</p> <p>This study is a comprehensive analysis of high-throughput bioassay data for anti-tubercular activity and the application of machine learning approaches to create target-agnostic predictive models for anti-tubercular agents.</p> http://www.biomedcentral.com/1756-0500/4/504
collection DOAJ
language English
format Article
sources DOAJ
author Periwal Vinita
Rajappan Jinuraj K
Jaleel Abdul UC
Scaria Vinod
spellingShingle Periwal Vinita
Rajappan Jinuraj K
Jaleel Abdul UC
Scaria Vinod
Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
BMC Research Notes
author_facet Periwal Vinita
Rajappan Jinuraj K
Jaleel Abdul UC
Scaria Vinod
author_sort Periwal Vinita
title Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
title_short Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
title_full Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
title_fullStr Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
title_full_unstemmed Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
title_sort predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets
publisher BMC
series BMC Research Notes
issn 1756-0500
publishDate 2011-11-01
description <p>Abstract</p> <p>Background</p> <p>Tuberculosis is a contagious disease caused by <it>Mycobacterium tuberculosis </it>(Mtb), affecting more than two billion people around the globe and is one of the major causes of morbidity and mortality in the developing world. Recent reports suggest that Mtb has been developing resistance to the widely used anti-tubercular drugs resulting in the emergence and spread of multi drug-resistant (MDR) and extensively drug-resistant (XDR) strains throughout the world. In view of this global epidemic, there is an urgent need to facilitate fast and efficient lead identification methodologies. Target based screening of large compound libraries has been widely used as a fast and efficient approach for lead identification, but is restricted by the knowledge about the target structure. Whole organism screens on the other hand are target-agnostic and have been now widely employed as an alternative for lead identification but they are limited by the time and cost involved in running the screens for large compound libraries. This could be possibly be circumvented by using computational approaches to prioritize molecules for screening programmes.</p> <p>Results</p> <p>We utilized physicochemical properties of compounds to train four supervised classifiers (Naïve Bayes, Random Forest, J48 and SMO) on three publicly available bioassay screens of Mtb inhibitors and validated the robustness of the predictive models using various statistical measures.</p> <p>Conclusions</p> <p>This study is a comprehensive analysis of high-throughput bioassay data for anti-tubercular activity and the application of machine learning approaches to create target-agnostic predictive models for anti-tubercular agents.</p>
url http://www.biomedcentral.com/1756-0500/4/504
work_keys_str_mv AT periwalvinita predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets
AT rajappanjinurajk predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets
AT jaleelabduluc predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets
AT scariavinod predictivemodelsforantitubercularmoleculesusingmachinelearningonhighthroughputbiologicalscreeningdatasets
_version_ 1724907911092633600