Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
Short text is widely seen in applications including Internet of Things (IoT). The appropriate representation and classification of short text could be severely disrupted by the sparsity and shortness of short text. One important solution is to enrich short text representation by involving cognitive...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-07-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/10/14/4893 |
id |
doaj-e03af171213f4046b7c3b88a766569c1 |
---|---|
record_format |
Article |
spelling |
doaj-e03af171213f4046b7c3b88a766569c12020-11-25T03:28:13ZengMDPI AGApplied Sciences2076-34172020-07-01104893489310.3390/app10144893Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and KnowledgeWenfeng Hou0Qing Liu1Longbing Cao2Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, AustraliaFaculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, AustraliaFaculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, AustraliaShort text is widely seen in applications including Internet of Things (IoT). The appropriate representation and classification of short text could be severely disrupted by the sparsity and shortness of short text. One important solution is to enrich short text representation by involving cognitive aspects of text, including semantic concept, knowledge, and category. In this paper, we propose a named Entity-based Concept Knowledge-Aware (ECKA) representation model which incorporates semantic information into short text representation. ECKA is a multi-level short text semantic representation model, which extracts the semantic features from the word, entity, concept and knowledge levels by CNN, respectively. Since word, entity, concept and knowledge entity in the same short text have different cognitive informativeness for short text classification, attention networks are formed to capture these category-related attentive representations from the multi-level textual features, respectively. The final multi-level semantic representations are formed by concatenating all of these individual-level representations, which are used for text classification. Experiments on three tasks demonstrate our method significantly outperforms the state-of-the-art methods.https://www.mdpi.com/2076-3417/10/14/4893short text representationsemantic representationshort text classificationknowledge graphconvolutional neural networkattention network |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wenfeng Hou Qing Liu Longbing Cao |
spellingShingle |
Wenfeng Hou Qing Liu Longbing Cao Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge Applied Sciences short text representation semantic representation short text classification knowledge graph convolutional neural network attention network |
author_facet |
Wenfeng Hou Qing Liu Longbing Cao |
author_sort |
Wenfeng Hou |
title |
Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge |
title_short |
Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge |
title_full |
Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge |
title_fullStr |
Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge |
title_full_unstemmed |
Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge |
title_sort |
cognitive aspects-based short text representation with named entity, concept and knowledge |
publisher |
MDPI AG |
series |
Applied Sciences |
issn |
2076-3417 |
publishDate |
2020-07-01 |
description |
Short text is widely seen in applications including Internet of Things (IoT). The appropriate representation and classification of short text could be severely disrupted by the sparsity and shortness of short text. One important solution is to enrich short text representation by involving cognitive aspects of text, including semantic concept, knowledge, and category. In this paper, we propose a named Entity-based Concept Knowledge-Aware (ECKA) representation model which incorporates semantic information into short text representation. ECKA is a multi-level short text semantic representation model, which extracts the semantic features from the word, entity, concept and knowledge levels by CNN, respectively. Since word, entity, concept and knowledge entity in the same short text have different cognitive informativeness for short text classification, attention networks are formed to capture these category-related attentive representations from the multi-level textual features, respectively. The final multi-level semantic representations are formed by concatenating all of these individual-level representations, which are used for text classification. Experiments on three tasks demonstrate our method significantly outperforms the state-of-the-art methods. |
topic |
short text representation semantic representation short text classification knowledge graph convolutional neural network attention network |
url |
https://www.mdpi.com/2076-3417/10/14/4893 |
work_keys_str_mv |
AT wenfenghou cognitiveaspectsbasedshorttextrepresentationwithnamedentityconceptandknowledge AT qingliu cognitiveaspectsbasedshorttextrepresentationwithnamedentityconceptandknowledge AT longbingcao cognitiveaspectsbasedshorttextrepresentationwithnamedentityconceptandknowledge |
_version_ |
1724585615557656576 |