A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.

BACKGROUND:Clinical Trials (CTs) are essential for bridging the gap between experimental research on new drugs and their clinical application. Just like CTs for traditional drugs and biologics have helped accelerate the translation of biomedical findings into medical practice, CTs for nanodrugs and...

Full description

Bibliographic Details
Main Authors: Diana de la Iglesia, Miguel García-Remesal, Alberto Anguita, Miguel Muñoz-Mármol, Casimir Kulikowski, Víctor Maojo
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0110331
id doaj-e65042f6ac254494aa04b325f4a09523
record_format Article
spelling doaj-e65042f6ac254494aa04b325f4a095232021-03-03T20:11:48ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-01910e11033110.1371/journal.pone.0110331A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.Diana de la IglesiaMiguel García-RemesalAlberto AnguitaMiguel Muñoz-MármolCasimir KulikowskiVíctor MaojoBACKGROUND:Clinical Trials (CTs) are essential for bridging the gap between experimental research on new drugs and their clinical application. Just like CTs for traditional drugs and biologics have helped accelerate the translation of biomedical findings into medical practice, CTs for nanodrugs and nanodevices could advance novel nanomaterials as agents for diagnosis and therapy. Although there is publicly available information about nanomedicine-related CTs, the online archiving of this information is carried out without adhering to criteria that discriminate between studies involving nanomaterials or nanotechnology-based processes (nano), and CTs that do not involve nanotechnology (non-nano). Finding out whether nanodrugs and nanodevices were involved in a study from CT summaries alone is a challenging task. At the time of writing, CTs archived in the well-known online registry ClinicalTrials.gov are not easily told apart as to whether they are nano or non-nano CTs-even when performed by domain experts, due to the lack of both a common definition for nanotechnology and of standards for reporting nanomedical experiments and results. METHODS:We propose a supervised learning approach for classifying CT summaries from ClinicalTrials.gov according to whether they fall into the nano or the non-nano categories. Our method involves several stages: i) extraction and manual annotation of CTs as nano vs. non-nano, ii) pre-processing and automatic classification, and iii) performance evaluation using several state-of-the-art classifiers under different transformations of the original dataset. RESULTS AND CONCLUSIONS:The performance of the best automated classifier closely matches that of experts (AUC over 0.95), suggesting that it is feasible to automatically detect the presence of nanotechnology products in CT summaries with a high degree of accuracy. This can significantly speed up the process of finding whether reports on ClinicalTrials.gov might be relevant to a particular nanoparticle or nanodevice, which is essential to discover any precedents for nanotoxicity events or advantages for targeted drug therapy.https://doi.org/10.1371/journal.pone.0110331
collection DOAJ
language English
format Article
sources DOAJ
author Diana de la Iglesia
Miguel García-Remesal
Alberto Anguita
Miguel Muñoz-Mármol
Casimir Kulikowski
Víctor Maojo
spellingShingle Diana de la Iglesia
Miguel García-Remesal
Alberto Anguita
Miguel Muñoz-Mármol
Casimir Kulikowski
Víctor Maojo
A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
PLoS ONE
author_facet Diana de la Iglesia
Miguel García-Remesal
Alberto Anguita
Miguel Muñoz-Mármol
Casimir Kulikowski
Víctor Maojo
author_sort Diana de la Iglesia
title A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
title_short A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
title_full A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
title_fullStr A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
title_full_unstemmed A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
title_sort machine learning approach to identify clinical trials involving nanodrugs and nanodevices from clinicaltrials.gov.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description BACKGROUND:Clinical Trials (CTs) are essential for bridging the gap between experimental research on new drugs and their clinical application. Just like CTs for traditional drugs and biologics have helped accelerate the translation of biomedical findings into medical practice, CTs for nanodrugs and nanodevices could advance novel nanomaterials as agents for diagnosis and therapy. Although there is publicly available information about nanomedicine-related CTs, the online archiving of this information is carried out without adhering to criteria that discriminate between studies involving nanomaterials or nanotechnology-based processes (nano), and CTs that do not involve nanotechnology (non-nano). Finding out whether nanodrugs and nanodevices were involved in a study from CT summaries alone is a challenging task. At the time of writing, CTs archived in the well-known online registry ClinicalTrials.gov are not easily told apart as to whether they are nano or non-nano CTs-even when performed by domain experts, due to the lack of both a common definition for nanotechnology and of standards for reporting nanomedical experiments and results. METHODS:We propose a supervised learning approach for classifying CT summaries from ClinicalTrials.gov according to whether they fall into the nano or the non-nano categories. Our method involves several stages: i) extraction and manual annotation of CTs as nano vs. non-nano, ii) pre-processing and automatic classification, and iii) performance evaluation using several state-of-the-art classifiers under different transformations of the original dataset. RESULTS AND CONCLUSIONS:The performance of the best automated classifier closely matches that of experts (AUC over 0.95), suggesting that it is feasible to automatically detect the presence of nanotechnology products in CT summaries with a high degree of accuracy. This can significantly speed up the process of finding whether reports on ClinicalTrials.gov might be relevant to a particular nanoparticle or nanodevice, which is essential to discover any precedents for nanotoxicity events or advantages for targeted drug therapy.
url https://doi.org/10.1371/journal.pone.0110331
work_keys_str_mv AT dianadelaiglesia amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT miguelgarciaremesal amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT albertoanguita amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT miguelmunozmarmol amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT casimirkulikowski amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT victormaojo amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT dianadelaiglesia machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT miguelgarciaremesal machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT albertoanguita machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT miguelmunozmarmol machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT casimirkulikowski machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
AT victormaojo machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov
_version_ 1714823661930676224