Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge

Short text is widely seen in applications including Internet of Things (IoT). The appropriate representation and classification of short text could be severely disrupted by the sparsity and shortness of short text. One important solution is to enrich short text representation by involving cognitive...

Full description

Bibliographic Details
Main Authors: Wenfeng Hou, Qing Liu, Longbing Cao
Format: Article
Language:English
Published: MDPI AG 2020-07-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/14/4893
id doaj-e03af171213f4046b7c3b88a766569c1
record_format Article
spelling doaj-e03af171213f4046b7c3b88a766569c12020-11-25T03:28:13ZengMDPI AGApplied Sciences2076-34172020-07-01104893489310.3390/app10144893Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and KnowledgeWenfeng Hou0Qing Liu1Longbing Cao2Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, AustraliaFaculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, AustraliaFaculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, AustraliaShort text is widely seen in applications including Internet of Things (IoT). The appropriate representation and classification of short text could be severely disrupted by the sparsity and shortness of short text. One important solution is to enrich short text representation by involving cognitive aspects of text, including semantic concept, knowledge, and category. In this paper, we propose a named Entity-based Concept Knowledge-Aware (ECKA) representation model which incorporates semantic information into short text representation. ECKA is a multi-level short text semantic representation model, which extracts the semantic features from the word, entity, concept and knowledge levels by CNN, respectively. Since word, entity, concept and knowledge entity in the same short text have different cognitive informativeness for short text classification, attention networks are formed to capture these category-related attentive representations from the multi-level textual features, respectively. The final multi-level semantic representations are formed by concatenating all of these individual-level representations, which are used for text classification. Experiments on three tasks demonstrate our method significantly outperforms the state-of-the-art methods.https://www.mdpi.com/2076-3417/10/14/4893short text representationsemantic representationshort text classificationknowledge graphconvolutional neural networkattention network
collection DOAJ
language English
format Article
sources DOAJ
author Wenfeng Hou
Qing Liu
Longbing Cao
spellingShingle Wenfeng Hou
Qing Liu
Longbing Cao
Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
Applied Sciences
short text representation
semantic representation
short text classification
knowledge graph
convolutional neural network
attention network
author_facet Wenfeng Hou
Qing Liu
Longbing Cao
author_sort Wenfeng Hou
title Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
title_short Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
title_full Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
title_fullStr Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
title_full_unstemmed Cognitive Aspects-Based Short Text Representation with Named Entity, Concept and Knowledge
title_sort cognitive aspects-based short text representation with named entity, concept and knowledge
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2020-07-01
description Short text is widely seen in applications including Internet of Things (IoT). The appropriate representation and classification of short text could be severely disrupted by the sparsity and shortness of short text. One important solution is to enrich short text representation by involving cognitive aspects of text, including semantic concept, knowledge, and category. In this paper, we propose a named Entity-based Concept Knowledge-Aware (ECKA) representation model which incorporates semantic information into short text representation. ECKA is a multi-level short text semantic representation model, which extracts the semantic features from the word, entity, concept and knowledge levels by CNN, respectively. Since word, entity, concept and knowledge entity in the same short text have different cognitive informativeness for short text classification, attention networks are formed to capture these category-related attentive representations from the multi-level textual features, respectively. The final multi-level semantic representations are formed by concatenating all of these individual-level representations, which are used for text classification. Experiments on three tasks demonstrate our method significantly outperforms the state-of-the-art methods.
topic short text representation
semantic representation
short text classification
knowledge graph
convolutional neural network
attention network
url https://www.mdpi.com/2076-3417/10/14/4893
work_keys_str_mv AT wenfenghou cognitiveaspectsbasedshorttextrepresentationwithnamedentityconceptandknowledge
AT qingliu cognitiveaspectsbasedshorttextrepresentationwithnamedentityconceptandknowledge
AT longbingcao cognitiveaspectsbasedshorttextrepresentationwithnamedentityconceptandknowledge
_version_ 1724585615557656576