Multichannel Convolutional Neural Network for Biological Relation Extraction
The plethora of biomedical relations which are embedded in medical logs (records) demands researchers’ attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of “vocabulary gap” and data s...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2016-01-01
|
Series: | BioMed Research International |
Online Access: | http://dx.doi.org/10.1155/2016/1850404 |
id |
doaj-567b1cca47ce4cbc9543f966fec97242 |
---|---|
record_format |
Article |
spelling |
doaj-567b1cca47ce4cbc9543f966fec972422020-11-24T22:20:10ZengHindawi LimitedBioMed Research International2314-61332314-61412016-01-01201610.1155/2016/18504041850404Multichannel Convolutional Neural Network for Biological Relation ExtractionChanqin Quan0Lei Hua1Xiao Sun2Wenjun Bai3Graduate School of System Informatics, Kobe University, Kobe, JapanDepartment of Computer and Information Science, Hefei University of Technology, Hefei, ChinaDepartment of Computer and Information Science, Hefei University of Technology, Hefei, ChinaGraduate School of System Informatics, Kobe University, Kobe, JapanThe plethora of biomedical relations which are embedded in medical logs (records) demands researchers’ attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of “vocabulary gap” and data sparseness and the unattainable automation process in feature extraction. To address aforementioned issues, in this work, we propose a multichannel convolutional neural network (MCCNN) for automated biomedical relation extraction. The proposed model has the following two contributions: (1) it enables the fusion of multiple (e.g., five) versions in word embeddings; (2) the need for manual feature engineering can be obviated by automated feature learning with convolutional neural network (CNN). We evaluated our model on two biomedical relation extraction tasks: drug-drug interaction (DDI) extraction and protein-protein interaction (PPI) extraction. For DDI task, our system achieved an overall f-score of 70.2% compared to the standard linear SVM based system (e.g., 67.0%) on DDIExtraction 2013 challenge dataset. And for PPI task, we evaluated our system on Aimed and BioInfer PPI corpus; our system exceeded the state-of-art ensemble SVM system by 2.7% and 5.6% on f-scores.http://dx.doi.org/10.1155/2016/1850404 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chanqin Quan Lei Hua Xiao Sun Wenjun Bai |
spellingShingle |
Chanqin Quan Lei Hua Xiao Sun Wenjun Bai Multichannel Convolutional Neural Network for Biological Relation Extraction BioMed Research International |
author_facet |
Chanqin Quan Lei Hua Xiao Sun Wenjun Bai |
author_sort |
Chanqin Quan |
title |
Multichannel Convolutional Neural Network for Biological Relation Extraction |
title_short |
Multichannel Convolutional Neural Network for Biological Relation Extraction |
title_full |
Multichannel Convolutional Neural Network for Biological Relation Extraction |
title_fullStr |
Multichannel Convolutional Neural Network for Biological Relation Extraction |
title_full_unstemmed |
Multichannel Convolutional Neural Network for Biological Relation Extraction |
title_sort |
multichannel convolutional neural network for biological relation extraction |
publisher |
Hindawi Limited |
series |
BioMed Research International |
issn |
2314-6133 2314-6141 |
publishDate |
2016-01-01 |
description |
The plethora of biomedical relations which are embedded in medical logs (records) demands researchers’ attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of “vocabulary gap” and data sparseness and the unattainable automation process in feature extraction. To address aforementioned issues, in this work, we propose a multichannel convolutional neural network (MCCNN) for automated biomedical relation extraction. The proposed model has the following two contributions: (1) it enables the fusion of multiple (e.g., five) versions in word embeddings; (2) the need for manual feature engineering can be obviated by automated feature learning with convolutional neural network (CNN). We evaluated our model on two biomedical relation extraction tasks: drug-drug interaction (DDI) extraction and protein-protein interaction (PPI) extraction. For DDI task, our system achieved an overall f-score of 70.2% compared to the standard linear SVM based system (e.g., 67.0%) on DDIExtraction 2013 challenge dataset. And for PPI task, we evaluated our system on Aimed and BioInfer PPI corpus; our system exceeded the state-of-art ensemble SVM system by 2.7% and 5.6% on f-scores. |
url |
http://dx.doi.org/10.1155/2016/1850404 |
work_keys_str_mv |
AT chanqinquan multichannelconvolutionalneuralnetworkforbiologicalrelationextraction AT leihua multichannelconvolutionalneuralnetworkforbiologicalrelationextraction AT xiaosun multichannelconvolutionalneuralnetworkforbiologicalrelationextraction AT wenjunbai multichannelconvolutionalneuralnetworkforbiologicalrelationextraction |
_version_ |
1725776551658651648 |