Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset

Drug-induced liver injury (DILI) remains one of the challenges in the safety profile of both authorized and candidate drugs, and predicting hepatotoxicity from the chemical structure of a substance remains a task worth pursuing. Such an approach is coherent with the current tendency for replacing no...

Full description

Bibliographic Details
Main Authors:	Robert Ancuceanu, Marilena Viorica Hovanet, Adriana Iuliana Anghel, Florentina Furtunescu, Monica Neagu, Carolina Constantin, Mihaela Dinu
Format:	Article
Language:	English
Published:	MDPI AG 2020-03-01
Series:	International Journal of Molecular Sciences
Subjects:	dilirank dili drug hepatotoxicity qsar nested cross-validation virtual screening in silico
Online Access:	https://www.mdpi.com/1422-0067/21/6/2114

id	doaj-e77f5cafd95e48659199a64013095f10
record_format	Article
spelling	doaj-e77f5cafd95e48659199a64013095f102020-11-25T03:29:28ZengMDPI AGInternational Journal of Molecular Sciences1422-00672020-03-01216211410.3390/ijms21062114ijms21062114Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank DatasetRobert Ancuceanu0Marilena Viorica Hovanet1Adriana Iuliana Anghel2Florentina Furtunescu3Monica Neagu4Carolina Constantin5Mihaela Dinu6Faculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, RomaniaFaculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, RomaniaFaculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, RomaniaFaculty of Medicine, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, RomaniaImmunology Laboratory, Victor Babes National Institute of Pathology, 050096 Bucharest, RomaniaImmunology Laboratory, Victor Babes National Institute of Pathology, 050096 Bucharest, RomaniaFaculty of Pharmacy, Carol Davila University of Medicine and Pharmacy, 020956 Bucharest, RomaniaDrug-induced liver injury (DILI) remains one of the challenges in the safety profile of both authorized and candidate drugs, and predicting hepatotoxicity from the chemical structure of a substance remains a task worth pursuing. Such an approach is coherent with the current tendency for replacing non-clinical tests with in vitro or in silico alternatives. In 2016, a group of researchers from the FDA published an improved annotated list of drugs with respect to their DILI risk, constituting “the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans” (DILIrank). This paper is one of the few attempting to predict liver toxicity using the DILIrank dataset. Molecular descriptors were computed with the Dragon 7.0 software, and a variety of feature selection and machine learning algorithms were implemented in the R computing environment. Nested (double) cross-validation was used to externally validate the models selected. A total of 78 models with reasonable performance were selected and stacked through several approaches, including the building of multiple meta-models. The performance of the stacked models was slightly superior to other models published. The models were applied in a virtual screening exercise on over 100,000 compounds from the ZINC database and about 20% of them were predicted to be non-hepatotoxic.https://www.mdpi.com/1422-0067/21/6/2114dilirankdilidrug hepatotoxicityqsarnested cross-validationvirtual screeningin silico
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Robert Ancuceanu Marilena Viorica Hovanet Adriana Iuliana Anghel Florentina Furtunescu Monica Neagu Carolina Constantin Mihaela Dinu
spellingShingle	Robert Ancuceanu Marilena Viorica Hovanet Adriana Iuliana Anghel Florentina Furtunescu Monica Neagu Carolina Constantin Mihaela Dinu Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset International Journal of Molecular Sciences dilirank dili drug hepatotoxicity qsar nested cross-validation virtual screening in silico
author_facet	Robert Ancuceanu Marilena Viorica Hovanet Adriana Iuliana Anghel Florentina Furtunescu Monica Neagu Carolina Constantin Mihaela Dinu
author_sort	Robert Ancuceanu
title	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_short	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_full	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_fullStr	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_full_unstemmed	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_sort	computational models using multiple machine learning algorithms for predicting drug hepatotoxicity with the dilirank dataset
publisher	MDPI AG
series	International Journal of Molecular Sciences
issn	1422-0067
publishDate	2020-03-01
description	Drug-induced liver injury (DILI) remains one of the challenges in the safety profile of both authorized and candidate drugs, and predicting hepatotoxicity from the chemical structure of a substance remains a task worth pursuing. Such an approach is coherent with the current tendency for replacing non-clinical tests with in vitro or in silico alternatives. In 2016, a group of researchers from the FDA published an improved annotated list of drugs with respect to their DILI risk, constituting “the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans” (DILIrank). This paper is one of the few attempting to predict liver toxicity using the DILIrank dataset. Molecular descriptors were computed with the Dragon 7.0 software, and a variety of feature selection and machine learning algorithms were implemented in the R computing environment. Nested (double) cross-validation was used to externally validate the models selected. A total of 78 models with reasonable performance were selected and stacked through several approaches, including the building of multiple meta-models. The performance of the stacked models was slightly superior to other models published. The models were applied in a virtual screening exercise on over 100,000 compounds from the ZINC database and about 20% of them were predicted to be non-hepatotoxic.
topic	dilirank dili drug hepatotoxicity qsar nested cross-validation virtual screening in silico
url	https://www.mdpi.com/1422-0067/21/6/2114
work_keys_str_mv	AT robertancuceanu computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT marilenavioricahovanet computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT adrianaiulianaanghel computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT florentinafurtunescu computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT monicaneagu computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT carolinaconstantin computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT mihaeladinu computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset
_version_	1724578941595811840

Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset

Similar Items