Reinforcement Learning for Distantly Supervised Relation Extraction

Relation extraction is a task of identifying semantic relations between entity pairs from plain text, which can benefit a lot of AI applications such as knowledge base construction and answer questioning. The distant supervision strategy is introduced to automatically create large-scale training dat...

Full description

Bibliographic Details
Main Authors: Tingting Sun, Chunhong Zhang, Yang Ji, Zheng Hu
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8768386/
id doaj-d3c1aca1d45c4663a2837d25602dc433
record_format Article
spelling doaj-d3c1aca1d45c4663a2837d25602dc4332021-04-05T17:10:02ZengIEEEIEEE Access2169-35362019-01-017980239803310.1109/ACCESS.2019.29303408768386Reinforcement Learning for Distantly Supervised Relation ExtractionTingting Sun0https://orcid.org/0000-0002-7383-1847Chunhong Zhang1Yang Ji2Zheng Hu3State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, ChinaState Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, ChinaState Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, ChinaState Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, ChinaRelation extraction is a task of identifying semantic relations between entity pairs from plain text, which can benefit a lot of AI applications such as knowledge base construction and answer questioning. The distant supervision strategy is introduced to automatically create large-scale training data, which inevitably suffers from noisy label problem. Recent works handle the sentence-level denoising by reinforcement learning, which regards the labels from distant supervision as the ground-truth. However, few works focus on the label-level denoising that corrects noisy labels directly. In this paper, we propose a reinforcement learning-based label denoising method for distantly supervised relation extraction. The model consists of two modules: extraction network (ENet) and policy network (PNet). The core of our label denoising is designing a policy in the PNet to obtain latent labels, where we can select the actions of using the distantly supervised labels or the predicted labels from the ENet. More concretely, the task can be modeled as an iterative process. First, the ENet predicts the relation probability, through which the model generates state representation. Second, the PNet learns the latent labels with taken actions and uses them to update the ENet. Then the optimized ENet gives the rewards to the PNet. The joint learning of two modules can obtain a reliable latent label and effectively improve the classification performance. The experimental results show that reinforcement learning is effective for noisy label correction and the proposed method can outperform the state-of-the-art relation extraction systems.https://ieeexplore.ieee.org/document/8768386/Relation extractiondistant supervisionreinforcement learningnoisy label
collection DOAJ
language English
format Article
sources DOAJ
author Tingting Sun
Chunhong Zhang
Yang Ji
Zheng Hu
spellingShingle Tingting Sun
Chunhong Zhang
Yang Ji
Zheng Hu
Reinforcement Learning for Distantly Supervised Relation Extraction
IEEE Access
Relation extraction
distant supervision
reinforcement learning
noisy label
author_facet Tingting Sun
Chunhong Zhang
Yang Ji
Zheng Hu
author_sort Tingting Sun
title Reinforcement Learning for Distantly Supervised Relation Extraction
title_short Reinforcement Learning for Distantly Supervised Relation Extraction
title_full Reinforcement Learning for Distantly Supervised Relation Extraction
title_fullStr Reinforcement Learning for Distantly Supervised Relation Extraction
title_full_unstemmed Reinforcement Learning for Distantly Supervised Relation Extraction
title_sort reinforcement learning for distantly supervised relation extraction
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Relation extraction is a task of identifying semantic relations between entity pairs from plain text, which can benefit a lot of AI applications such as knowledge base construction and answer questioning. The distant supervision strategy is introduced to automatically create large-scale training data, which inevitably suffers from noisy label problem. Recent works handle the sentence-level denoising by reinforcement learning, which regards the labels from distant supervision as the ground-truth. However, few works focus on the label-level denoising that corrects noisy labels directly. In this paper, we propose a reinforcement learning-based label denoising method for distantly supervised relation extraction. The model consists of two modules: extraction network (ENet) and policy network (PNet). The core of our label denoising is designing a policy in the PNet to obtain latent labels, where we can select the actions of using the distantly supervised labels or the predicted labels from the ENet. More concretely, the task can be modeled as an iterative process. First, the ENet predicts the relation probability, through which the model generates state representation. Second, the PNet learns the latent labels with taken actions and uses them to update the ENet. Then the optimized ENet gives the rewards to the PNet. The joint learning of two modules can obtain a reliable latent label and effectively improve the classification performance. The experimental results show that reinforcement learning is effective for noisy label correction and the proposed method can outperform the state-of-the-art relation extraction systems.
topic Relation extraction
distant supervision
reinforcement learning
noisy label
url https://ieeexplore.ieee.org/document/8768386/
work_keys_str_mv AT tingtingsun reinforcementlearningfordistantlysupervisedrelationextraction
AT chunhongzhang reinforcementlearningfordistantlysupervisedrelationextraction
AT yangji reinforcementlearningfordistantlysupervisedrelationextraction
AT zhenghu reinforcementlearningfordistantlysupervisedrelationextraction
_version_ 1721540182009708544