Dependency-based long short term memory network for drug-drug interaction extraction

Abstract Background Drug-drug interaction extraction (DDI) needs assistance from automated methods to address the explosively increasing biomedical texts. In recent years, deep neural network based models have been developed to address such needs and they have made significant progress in relation i...

Full description

Bibliographic Details
Main Authors: Wei Wang, Xi Yang, Canqun Yang, Xiaowei Guo, Xiang Zhang, Chengkun Wu
Format: Article
Language:English
Published: BMC 2017-12-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-017-1962-8
id doaj-b25d81e8e2644e3d9319c83493ff7eeb
record_format Article
spelling doaj-b25d81e8e2644e3d9319c83493ff7eeb2020-11-25T00:47:20ZengBMCBMC Bioinformatics1471-21052017-12-0118S169910910.1186/s12859-017-1962-8Dependency-based long short term memory network for drug-drug interaction extractionWei Wang0Xi Yang1Canqun Yang2Xiaowei Guo3Xiang Zhang4Chengkun Wu5School of Computer Science, National University of Defense TechnologySchool of Computer Science, National University of Defense TechnologySchool of Computer Science, National University of Defense TechnologySchool of Computer Science, National University of Defense TechnologySchool of Computer Science, National University of Defense TechnologySchool of Computer Science, National University of Defense TechnologyAbstract Background Drug-drug interaction extraction (DDI) needs assistance from automated methods to address the explosively increasing biomedical texts. In recent years, deep neural network based models have been developed to address such needs and they have made significant progress in relation identification. Methods We propose a dependency-based deep neural network model for DDI extraction. By introducing the dependency-based technique to a bi-directional long short term memory network (Bi-LSTM), we build three channels, namely, Linear channel, DFS channel and BFS channel. All of these channels are constructed with three network layers, including embedding layer, LSTM layer and max pooling layer from bottom up. In the embedding layer, we extract two types of features, one is distance-based feature and another is dependency-based feature. In the LSTM layer, a Bi-LSTM is instituted in each channel to better capture relation information. Then max pooling is used to get optimal features from the entire encoding sequential data. At last, we concatenate the outputs of all channels and then link it to the softmax layer for relation identification. Results To the best of our knowledge, our model achieves new state-of-the-art performance with the F-score of 72.0% on the DDIExtraction 2013 corpus. Moreover, our approach obtains much higher Recall value compared to the existing methods. Conclusions The dependency-based Bi-LSTM model can learn effective relation information with less feature engineering in the task of DDI extraction. Besides, the experimental results show that our model excels at balancing the Precision and Recall values.http://link.springer.com/article/10.1186/s12859-017-1962-8Relation extractionLong short term memoryDependency treeData imbalance
collection DOAJ
language English
format Article
sources DOAJ
author Wei Wang
Xi Yang
Canqun Yang
Xiaowei Guo
Xiang Zhang
Chengkun Wu
spellingShingle Wei Wang
Xi Yang
Canqun Yang
Xiaowei Guo
Xiang Zhang
Chengkun Wu
Dependency-based long short term memory network for drug-drug interaction extraction
BMC Bioinformatics
Relation extraction
Long short term memory
Dependency tree
Data imbalance
author_facet Wei Wang
Xi Yang
Canqun Yang
Xiaowei Guo
Xiang Zhang
Chengkun Wu
author_sort Wei Wang
title Dependency-based long short term memory network for drug-drug interaction extraction
title_short Dependency-based long short term memory network for drug-drug interaction extraction
title_full Dependency-based long short term memory network for drug-drug interaction extraction
title_fullStr Dependency-based long short term memory network for drug-drug interaction extraction
title_full_unstemmed Dependency-based long short term memory network for drug-drug interaction extraction
title_sort dependency-based long short term memory network for drug-drug interaction extraction
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2017-12-01
description Abstract Background Drug-drug interaction extraction (DDI) needs assistance from automated methods to address the explosively increasing biomedical texts. In recent years, deep neural network based models have been developed to address such needs and they have made significant progress in relation identification. Methods We propose a dependency-based deep neural network model for DDI extraction. By introducing the dependency-based technique to a bi-directional long short term memory network (Bi-LSTM), we build three channels, namely, Linear channel, DFS channel and BFS channel. All of these channels are constructed with three network layers, including embedding layer, LSTM layer and max pooling layer from bottom up. In the embedding layer, we extract two types of features, one is distance-based feature and another is dependency-based feature. In the LSTM layer, a Bi-LSTM is instituted in each channel to better capture relation information. Then max pooling is used to get optimal features from the entire encoding sequential data. At last, we concatenate the outputs of all channels and then link it to the softmax layer for relation identification. Results To the best of our knowledge, our model achieves new state-of-the-art performance with the F-score of 72.0% on the DDIExtraction 2013 corpus. Moreover, our approach obtains much higher Recall value compared to the existing methods. Conclusions The dependency-based Bi-LSTM model can learn effective relation information with less feature engineering in the task of DDI extraction. Besides, the experimental results show that our model excels at balancing the Precision and Recall values.
topic Relation extraction
Long short term memory
Dependency tree
Data imbalance
url http://link.springer.com/article/10.1186/s12859-017-1962-8
work_keys_str_mv AT weiwang dependencybasedlongshorttermmemorynetworkfordrugdruginteractionextraction
AT xiyang dependencybasedlongshorttermmemorynetworkfordrugdruginteractionextraction
AT canqunyang dependencybasedlongshorttermmemorynetworkfordrugdruginteractionextraction
AT xiaoweiguo dependencybasedlongshorttermmemorynetworkfordrugdruginteractionextraction
AT xiangzhang dependencybasedlongshorttermmemorynetworkfordrugdruginteractionextraction
AT chengkunwu dependencybasedlongshorttermmemorynetworkfordrugdruginteractionextraction
_version_ 1725260519624933376