Entity perception of Two-Step-Matching framework for public opinions

Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by B...

Full description

Bibliographic Details
Main Authors: Ren-De Li, Hao-Tian Ma, Zi-Yi Wang, Qiang Guo, Jian-Guo Liu
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2020-09-01
Series:Journal of Safety Science and Resilience
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666449620300050
id doaj-bf3486ad18e74d3abea8eda850d6cc25
record_format Article
spelling doaj-bf3486ad18e74d3abea8eda850d6cc252021-04-02T16:18:30ZengKeAi Communications Co., Ltd.Journal of Safety Science and Resilience2666-44962020-09-01113643Entity perception of Two-Step-Matching framework for public opinionsRen-De Li0Hao-Tian Ma1Zi-Yi Wang2Qiang Guo3Jian-Guo Liu4Library and Research Center of Computer Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR ChinaSchool of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR ChinaSchool of Humanities, Shanghai University of Finance and Economics, Shanghai 200433, PR ChinaLibrary and Research Center of Computer Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR ChinaCorresponding author at: School of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR China.; School of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR China; Institute of Sina WRD Big Data, Shanghai 201204, PR ChinaEntity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.http://www.sciencedirect.com/science/article/pii/S2666449620300050Entity perceptionBiLSTM-CRF modelJaro–Winkler distance algorithmUser commentsPublic opinions
collection DOAJ
language English
format Article
sources DOAJ
author Ren-De Li
Hao-Tian Ma
Zi-Yi Wang
Qiang Guo
Jian-Guo Liu
spellingShingle Ren-De Li
Hao-Tian Ma
Zi-Yi Wang
Qiang Guo
Jian-Guo Liu
Entity perception of Two-Step-Matching framework for public opinions
Journal of Safety Science and Resilience
Entity perception
BiLSTM-CRF model
Jaro–Winkler distance algorithm
User comments
Public opinions
author_facet Ren-De Li
Hao-Tian Ma
Zi-Yi Wang
Qiang Guo
Jian-Guo Liu
author_sort Ren-De Li
title Entity perception of Two-Step-Matching framework for public opinions
title_short Entity perception of Two-Step-Matching framework for public opinions
title_full Entity perception of Two-Step-Matching framework for public opinions
title_fullStr Entity perception of Two-Step-Matching framework for public opinions
title_full_unstemmed Entity perception of Two-Step-Matching framework for public opinions
title_sort entity perception of two-step-matching framework for public opinions
publisher KeAi Communications Co., Ltd.
series Journal of Safety Science and Resilience
issn 2666-4496
publishDate 2020-09-01
description Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.
topic Entity perception
BiLSTM-CRF model
Jaro–Winkler distance algorithm
User comments
Public opinions
url http://www.sciencedirect.com/science/article/pii/S2666449620300050
work_keys_str_mv AT rendeli entityperceptionoftwostepmatchingframeworkforpublicopinions
AT haotianma entityperceptionoftwostepmatchingframeworkforpublicopinions
AT ziyiwang entityperceptionoftwostepmatchingframeworkforpublicopinions
AT qiangguo entityperceptionoftwostepmatchingframeworkforpublicopinions
AT jianguoliu entityperceptionoftwostepmatchingframeworkforpublicopinions
_version_ 1721557173458173952