A novel method for drug-adverse event extraction using machine learning

Background: An extensive amount of data derived from medical case reports regarding potential adverse events is subjected to manual review. Devising efficient strategies for identification and information extraction concerning potential adverse events are needed to support timely monitoring of the r...

Full description

Bibliographic Details
Main Authors:	Kajal Negi, Arun Pavuri, Ladle Patel, Chirag Jain
Format:	Article
Language:	English
Published:	Elsevier 2019-01-01
Series:	Informatics in Medicine Unlocked
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352914819300991

id	doaj-dc8e73d4aa324b78a1d31d6a2953324b
record_format	Article
spelling	doaj-dc8e73d4aa324b78a1d31d6a2953324b2020-11-25T01:56:24ZengElsevierInformatics in Medicine Unlocked2352-91482019-01-0117A novel method for drug-adverse event extraction using machine learningKajal Negi0Arun Pavuri1Ladle Patel2Chirag Jain3Corresponding author.; Data Science and Insights, Genpact, Bangalore, IndiaData Science and Insights, Genpact, Bangalore, IndiaData Science and Insights, Genpact, Bangalore, IndiaData Science and Insights, Genpact, Bangalore, IndiaBackground: An extensive amount of data derived from medical case reports regarding potential adverse events is subjected to manual review. Devising efficient strategies for identification and information extraction concerning potential adverse events are needed to support timely monitoring of the reports and decision making. Methods: This paper aims at providing a machine learning (ML) and natural language processing (NLP) based solution for extracting suspect drugs and adverse events. The solution is based upon two approaches: Causal Sentence Classification classifies the relationship between drug and medical condition as causal or non-causal, and Suspect Drug Identification classifies each drug present in the report as a suspect drug or non-suspect drug. Results: Causal Sentence Classification yielded a precision of 0.85 and recall of 0.84 in establishing causality between drugs and medical conditions on the testing dataset consisting of 6252 records. After evaluation on a reliable testing dataset of 3522 records, the Suspect Drug Identification successfully identified suspect drugs with a precision of 0.72 and recall of 0.77. Conclusions: The developed solution relies on semantic and syntactic based features to capture the writing style of incoming reports, and showcases the potential of ML and NLP for Pharmacovigilance. Keywords: Pharmacovigilance, Text analytics, Natural language processing, Machine learninghttp://www.sciencedirect.com/science/article/pii/S2352914819300991
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Kajal Negi Arun Pavuri Ladle Patel Chirag Jain
spellingShingle	Kajal Negi Arun Pavuri Ladle Patel Chirag Jain A novel method for drug-adverse event extraction using machine learning Informatics in Medicine Unlocked
author_facet	Kajal Negi Arun Pavuri Ladle Patel Chirag Jain
author_sort	Kajal Negi
title	A novel method for drug-adverse event extraction using machine learning
title_short	A novel method for drug-adverse event extraction using machine learning
title_full	A novel method for drug-adverse event extraction using machine learning
title_fullStr	A novel method for drug-adverse event extraction using machine learning
title_full_unstemmed	A novel method for drug-adverse event extraction using machine learning
title_sort	novel method for drug-adverse event extraction using machine learning
publisher	Elsevier
series	Informatics in Medicine Unlocked
issn	2352-9148
publishDate	2019-01-01
description	Background: An extensive amount of data derived from medical case reports regarding potential adverse events is subjected to manual review. Devising efficient strategies for identification and information extraction concerning potential adverse events are needed to support timely monitoring of the reports and decision making. Methods: This paper aims at providing a machine learning (ML) and natural language processing (NLP) based solution for extracting suspect drugs and adverse events. The solution is based upon two approaches: Causal Sentence Classification classifies the relationship between drug and medical condition as causal or non-causal, and Suspect Drug Identification classifies each drug present in the report as a suspect drug or non-suspect drug. Results: Causal Sentence Classification yielded a precision of 0.85 and recall of 0.84 in establishing causality between drugs and medical conditions on the testing dataset consisting of 6252 records. After evaluation on a reliable testing dataset of 3522 records, the Suspect Drug Identification successfully identified suspect drugs with a precision of 0.72 and recall of 0.77. Conclusions: The developed solution relies on semantic and syntactic based features to capture the writing style of incoming reports, and showcases the potential of ML and NLP for Pharmacovigilance. Keywords: Pharmacovigilance, Text analytics, Natural language processing, Machine learning
url	http://www.sciencedirect.com/science/article/pii/S2352914819300991
work_keys_str_mv	AT kajalnegi anovelmethodfordrugadverseeventextractionusingmachinelearning AT arunpavuri anovelmethodfordrugadverseeventextractionusingmachinelearning AT ladlepatel anovelmethodfordrugadverseeventextractionusingmachinelearning AT chiragjain anovelmethodfordrugadverseeventextractionusingmachinelearning AT kajalnegi novelmethodfordrugadverseeventextractionusingmachinelearning AT arunpavuri novelmethodfordrugadverseeventextractionusingmachinelearning AT ladlepatel novelmethodfordrugadverseeventextractionusingmachinelearning AT chiragjain novelmethodfordrugadverseeventextractionusingmachinelearning
_version_	1724980393352888320

A novel method for drug-adverse event extraction using machine learning

Similar Items