Entity perception of Two-Step-Matching framework for public opinions
Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by B...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co., Ltd.
2020-09-01
|
Series: | Journal of Safety Science and Resilience |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666449620300050 |
id |
doaj-bf3486ad18e74d3abea8eda850d6cc25 |
---|---|
record_format |
Article |
spelling |
doaj-bf3486ad18e74d3abea8eda850d6cc252021-04-02T16:18:30ZengKeAi Communications Co., Ltd.Journal of Safety Science and Resilience2666-44962020-09-01113643Entity perception of Two-Step-Matching framework for public opinionsRen-De Li0Hao-Tian Ma1Zi-Yi Wang2Qiang Guo3Jian-Guo Liu4Library and Research Center of Computer Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR ChinaSchool of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR ChinaSchool of Humanities, Shanghai University of Finance and Economics, Shanghai 200433, PR ChinaLibrary and Research Center of Computer Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR ChinaCorresponding author at: School of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR China.; School of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR China; Institute of Sina WRD Big Data, Shanghai 201204, PR ChinaEntity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.http://www.sciencedirect.com/science/article/pii/S2666449620300050Entity perceptionBiLSTM-CRF modelJaro–Winkler distance algorithmUser commentsPublic opinions |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ren-De Li Hao-Tian Ma Zi-Yi Wang Qiang Guo Jian-Guo Liu |
spellingShingle |
Ren-De Li Hao-Tian Ma Zi-Yi Wang Qiang Guo Jian-Guo Liu Entity perception of Two-Step-Matching framework for public opinions Journal of Safety Science and Resilience Entity perception BiLSTM-CRF model Jaro–Winkler distance algorithm User comments Public opinions |
author_facet |
Ren-De Li Hao-Tian Ma Zi-Yi Wang Qiang Guo Jian-Guo Liu |
author_sort |
Ren-De Li |
title |
Entity perception of Two-Step-Matching framework for public opinions |
title_short |
Entity perception of Two-Step-Matching framework for public opinions |
title_full |
Entity perception of Two-Step-Matching framework for public opinions |
title_fullStr |
Entity perception of Two-Step-Matching framework for public opinions |
title_full_unstemmed |
Entity perception of Two-Step-Matching framework for public opinions |
title_sort |
entity perception of two-step-matching framework for public opinions |
publisher |
KeAi Communications Co., Ltd. |
series |
Journal of Safety Science and Resilience |
issn |
2666-4496 |
publishDate |
2020-09-01 |
description |
Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions. |
topic |
Entity perception BiLSTM-CRF model Jaro–Winkler distance algorithm User comments Public opinions |
url |
http://www.sciencedirect.com/science/article/pii/S2666449620300050 |
work_keys_str_mv |
AT rendeli entityperceptionoftwostepmatchingframeworkforpublicopinions AT haotianma entityperceptionoftwostepmatchingframeworkforpublicopinions AT ziyiwang entityperceptionoftwostepmatchingframeworkforpublicopinions AT qiangguo entityperceptionoftwostepmatchingframeworkforpublicopinions AT jianguoliu entityperceptionoftwostepmatchingframeworkforpublicopinions |
_version_ |
1721557173458173952 |