Multichannel Convolutional Neural Network for Biological Relation Extraction

The plethora of biomedical relations which are embedded in medical logs (records) demands researchers’ attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of “vocabulary gap” and data s...

Full description

Bibliographic Details
Main Authors: Chanqin Quan, Lei Hua, Xiao Sun, Wenjun Bai
Format: Article
Language:English
Published: Hindawi Limited 2016-01-01
Series:BioMed Research International
Online Access:http://dx.doi.org/10.1155/2016/1850404
id doaj-567b1cca47ce4cbc9543f966fec97242
record_format Article
spelling doaj-567b1cca47ce4cbc9543f966fec972422020-11-24T22:20:10ZengHindawi LimitedBioMed Research International2314-61332314-61412016-01-01201610.1155/2016/18504041850404Multichannel Convolutional Neural Network for Biological Relation ExtractionChanqin Quan0Lei Hua1Xiao Sun2Wenjun Bai3Graduate School of System Informatics, Kobe University, Kobe, JapanDepartment of Computer and Information Science, Hefei University of Technology, Hefei, ChinaDepartment of Computer and Information Science, Hefei University of Technology, Hefei, ChinaGraduate School of System Informatics, Kobe University, Kobe, JapanThe plethora of biomedical relations which are embedded in medical logs (records) demands researchers’ attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of “vocabulary gap” and data sparseness and the unattainable automation process in feature extraction. To address aforementioned issues, in this work, we propose a multichannel convolutional neural network (MCCNN) for automated biomedical relation extraction. The proposed model has the following two contributions: (1) it enables the fusion of multiple (e.g., five) versions in word embeddings; (2) the need for manual feature engineering can be obviated by automated feature learning with convolutional neural network (CNN). We evaluated our model on two biomedical relation extraction tasks: drug-drug interaction (DDI) extraction and protein-protein interaction (PPI) extraction. For DDI task, our system achieved an overall f-score of 70.2% compared to the standard linear SVM based system (e.g., 67.0%) on DDIExtraction 2013 challenge dataset. And for PPI task, we evaluated our system on Aimed and BioInfer PPI corpus; our system exceeded the state-of-art ensemble SVM system by 2.7% and 5.6% on f-scores.http://dx.doi.org/10.1155/2016/1850404
collection DOAJ
language English
format Article
sources DOAJ
author Chanqin Quan
Lei Hua
Xiao Sun
Wenjun Bai
spellingShingle Chanqin Quan
Lei Hua
Xiao Sun
Wenjun Bai
Multichannel Convolutional Neural Network for Biological Relation Extraction
BioMed Research International
author_facet Chanqin Quan
Lei Hua
Xiao Sun
Wenjun Bai
author_sort Chanqin Quan
title Multichannel Convolutional Neural Network for Biological Relation Extraction
title_short Multichannel Convolutional Neural Network for Biological Relation Extraction
title_full Multichannel Convolutional Neural Network for Biological Relation Extraction
title_fullStr Multichannel Convolutional Neural Network for Biological Relation Extraction
title_full_unstemmed Multichannel Convolutional Neural Network for Biological Relation Extraction
title_sort multichannel convolutional neural network for biological relation extraction
publisher Hindawi Limited
series BioMed Research International
issn 2314-6133
2314-6141
publishDate 2016-01-01
description The plethora of biomedical relations which are embedded in medical logs (records) demands researchers’ attention. Previous theoretical and practical focuses were restricted on traditional machine learning techniques. However, these methods are susceptible to the issues of “vocabulary gap” and data sparseness and the unattainable automation process in feature extraction. To address aforementioned issues, in this work, we propose a multichannel convolutional neural network (MCCNN) for automated biomedical relation extraction. The proposed model has the following two contributions: (1) it enables the fusion of multiple (e.g., five) versions in word embeddings; (2) the need for manual feature engineering can be obviated by automated feature learning with convolutional neural network (CNN). We evaluated our model on two biomedical relation extraction tasks: drug-drug interaction (DDI) extraction and protein-protein interaction (PPI) extraction. For DDI task, our system achieved an overall f-score of 70.2% compared to the standard linear SVM based system (e.g., 67.0%) on DDIExtraction 2013 challenge dataset. And for PPI task, we evaluated our system on Aimed and BioInfer PPI corpus; our system exceeded the state-of-art ensemble SVM system by 2.7% and 5.6% on f-scores.
url http://dx.doi.org/10.1155/2016/1850404
work_keys_str_mv AT chanqinquan multichannelconvolutionalneuralnetworkforbiologicalrelationextraction
AT leihua multichannelconvolutionalneuralnetworkforbiologicalrelationextraction
AT xiaosun multichannelconvolutionalneuralnetworkforbiologicalrelationextraction
AT wenjunbai multichannelconvolutionalneuralnetworkforbiologicalrelationextraction
_version_ 1725776551658651648