A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.
BACKGROUND:Clinical Trials (CTs) are essential for bridging the gap between experimental research on new drugs and their clinical application. Just like CTs for traditional drugs and biologics have helped accelerate the translation of biomedical findings into medical practice, CTs for nanodrugs and...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2014-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0110331 |
id |
doaj-e65042f6ac254494aa04b325f4a09523 |
---|---|
record_format |
Article |
spelling |
doaj-e65042f6ac254494aa04b325f4a095232021-03-03T20:11:48ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-01910e11033110.1371/journal.pone.0110331A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov.Diana de la IglesiaMiguel García-RemesalAlberto AnguitaMiguel Muñoz-MármolCasimir KulikowskiVíctor MaojoBACKGROUND:Clinical Trials (CTs) are essential for bridging the gap between experimental research on new drugs and their clinical application. Just like CTs for traditional drugs and biologics have helped accelerate the translation of biomedical findings into medical practice, CTs for nanodrugs and nanodevices could advance novel nanomaterials as agents for diagnosis and therapy. Although there is publicly available information about nanomedicine-related CTs, the online archiving of this information is carried out without adhering to criteria that discriminate between studies involving nanomaterials or nanotechnology-based processes (nano), and CTs that do not involve nanotechnology (non-nano). Finding out whether nanodrugs and nanodevices were involved in a study from CT summaries alone is a challenging task. At the time of writing, CTs archived in the well-known online registry ClinicalTrials.gov are not easily told apart as to whether they are nano or non-nano CTs-even when performed by domain experts, due to the lack of both a common definition for nanotechnology and of standards for reporting nanomedical experiments and results. METHODS:We propose a supervised learning approach for classifying CT summaries from ClinicalTrials.gov according to whether they fall into the nano or the non-nano categories. Our method involves several stages: i) extraction and manual annotation of CTs as nano vs. non-nano, ii) pre-processing and automatic classification, and iii) performance evaluation using several state-of-the-art classifiers under different transformations of the original dataset. RESULTS AND CONCLUSIONS:The performance of the best automated classifier closely matches that of experts (AUC over 0.95), suggesting that it is feasible to automatically detect the presence of nanotechnology products in CT summaries with a high degree of accuracy. This can significantly speed up the process of finding whether reports on ClinicalTrials.gov might be relevant to a particular nanoparticle or nanodevice, which is essential to discover any precedents for nanotoxicity events or advantages for targeted drug therapy.https://doi.org/10.1371/journal.pone.0110331 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Diana de la Iglesia Miguel García-Remesal Alberto Anguita Miguel Muñoz-Mármol Casimir Kulikowski Víctor Maojo |
spellingShingle |
Diana de la Iglesia Miguel García-Remesal Alberto Anguita Miguel Muñoz-Mármol Casimir Kulikowski Víctor Maojo A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov. PLoS ONE |
author_facet |
Diana de la Iglesia Miguel García-Remesal Alberto Anguita Miguel Muñoz-Mármol Casimir Kulikowski Víctor Maojo |
author_sort |
Diana de la Iglesia |
title |
A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov. |
title_short |
A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov. |
title_full |
A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov. |
title_fullStr |
A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov. |
title_full_unstemmed |
A machine learning approach to identify clinical trials involving nanodrugs and nanodevices from ClinicalTrials.gov. |
title_sort |
machine learning approach to identify clinical trials involving nanodrugs and nanodevices from clinicaltrials.gov. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2014-01-01 |
description |
BACKGROUND:Clinical Trials (CTs) are essential for bridging the gap between experimental research on new drugs and their clinical application. Just like CTs for traditional drugs and biologics have helped accelerate the translation of biomedical findings into medical practice, CTs for nanodrugs and nanodevices could advance novel nanomaterials as agents for diagnosis and therapy. Although there is publicly available information about nanomedicine-related CTs, the online archiving of this information is carried out without adhering to criteria that discriminate between studies involving nanomaterials or nanotechnology-based processes (nano), and CTs that do not involve nanotechnology (non-nano). Finding out whether nanodrugs and nanodevices were involved in a study from CT summaries alone is a challenging task. At the time of writing, CTs archived in the well-known online registry ClinicalTrials.gov are not easily told apart as to whether they are nano or non-nano CTs-even when performed by domain experts, due to the lack of both a common definition for nanotechnology and of standards for reporting nanomedical experiments and results. METHODS:We propose a supervised learning approach for classifying CT summaries from ClinicalTrials.gov according to whether they fall into the nano or the non-nano categories. Our method involves several stages: i) extraction and manual annotation of CTs as nano vs. non-nano, ii) pre-processing and automatic classification, and iii) performance evaluation using several state-of-the-art classifiers under different transformations of the original dataset. RESULTS AND CONCLUSIONS:The performance of the best automated classifier closely matches that of experts (AUC over 0.95), suggesting that it is feasible to automatically detect the presence of nanotechnology products in CT summaries with a high degree of accuracy. This can significantly speed up the process of finding whether reports on ClinicalTrials.gov might be relevant to a particular nanoparticle or nanodevice, which is essential to discover any precedents for nanotoxicity events or advantages for targeted drug therapy. |
url |
https://doi.org/10.1371/journal.pone.0110331 |
work_keys_str_mv |
AT dianadelaiglesia amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT miguelgarciaremesal amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT albertoanguita amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT miguelmunozmarmol amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT casimirkulikowski amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT victormaojo amachinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT dianadelaiglesia machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT miguelgarciaremesal machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT albertoanguita machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT miguelmunozmarmol machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT casimirkulikowski machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov AT victormaojo machinelearningapproachtoidentifyclinicaltrialsinvolvingnanodrugsandnanodevicesfromclinicaltrialsgov |
_version_ |
1714823661930676224 |