Enriching basic features via multilayer bag-of-words binding for Chinese question classification

Question classification helps to generate more accurate answers in question answering system. For an efficient question classifier, one of the most important tasks is to fully mine useful features. Aiming at solving the problem of lacking of rich syntax and semantic features in Chinese question clas...

Full description

Bibliographic Details
Main Authors: Sichun Yang, Chao Gao
Format: Article
Language:English
Published: Wiley 2019-03-01
Series:CAAI Transactions on Intelligence Technology
Subjects:
Online Access:https://digital-library.theiet.org/content/journals/10.1049/trit.2017.0016
id doaj-0ee744d072f341c796f371144ba7c9ee
record_format Article
spelling doaj-0ee744d072f341c796f371144ba7c9ee2021-04-02T13:07:37ZengWileyCAAI Transactions on Intelligence Technology2468-23222019-03-0110.1049/trit.2017.0016TRIT.2017.0016Enriching basic features via multilayer bag-of-words binding for Chinese question classificationSichun Yang0Chao Gao1School of Computer Science and Technology, Anhui University of TechnologyDepartment of Computer and Software Engineering, Anhui Institute of Information TechnologyQuestion classification helps to generate more accurate answers in question answering system. For an efficient question classifier, one of the most important tasks is to fully mine useful features. Aiming at solving the problem of lacking of rich syntax and semantic features in Chinese question classification, an operator called MBWB (multilayer bag-of-words binding) is proposed to extract potential features by binding part-of-speech, word sense, named entity and other basic features to bag-of-words, respectively. Through performing MBWB operator on two kinds of bag-of-words, i.e. A_BOW and T_BOW, the corresponding A_MBWB and T_MBWB features are generated automatically. The MBWB operator can explore potential information contained in basic features, and enrich syntactic and semantic representation of questions. Experimental results on the Chinese question set show that the classification accuracy gets significantly improved when combining two kinds of MBWB features with basic features.https://digital-library.theiet.org/content/journals/10.1049/trit.2017.0016natural language processingquestion answering (information retrieval)support vector machineslearning (artificial intelligence)pattern classificationChinese question classificationmultilayer bag-of-words bindingpotential featuresword senseMBWB operatorclassification accuracyMBWB featuresaccurate answersquestion answering systemsemantic featuresbasic featuresquestion classifier
collection DOAJ
language English
format Article
sources DOAJ
author Sichun Yang
Chao Gao
spellingShingle Sichun Yang
Chao Gao
Enriching basic features via multilayer bag-of-words binding for Chinese question classification
CAAI Transactions on Intelligence Technology
natural language processing
question answering (information retrieval)
support vector machines
learning (artificial intelligence)
pattern classification
Chinese question classification
multilayer bag-of-words binding
potential features
word sense
MBWB operator
classification accuracy
MBWB features
accurate answers
question answering system
semantic features
basic features
question classifier
author_facet Sichun Yang
Chao Gao
author_sort Sichun Yang
title Enriching basic features via multilayer bag-of-words binding for Chinese question classification
title_short Enriching basic features via multilayer bag-of-words binding for Chinese question classification
title_full Enriching basic features via multilayer bag-of-words binding for Chinese question classification
title_fullStr Enriching basic features via multilayer bag-of-words binding for Chinese question classification
title_full_unstemmed Enriching basic features via multilayer bag-of-words binding for Chinese question classification
title_sort enriching basic features via multilayer bag-of-words binding for chinese question classification
publisher Wiley
series CAAI Transactions on Intelligence Technology
issn 2468-2322
publishDate 2019-03-01
description Question classification helps to generate more accurate answers in question answering system. For an efficient question classifier, one of the most important tasks is to fully mine useful features. Aiming at solving the problem of lacking of rich syntax and semantic features in Chinese question classification, an operator called MBWB (multilayer bag-of-words binding) is proposed to extract potential features by binding part-of-speech, word sense, named entity and other basic features to bag-of-words, respectively. Through performing MBWB operator on two kinds of bag-of-words, i.e. A_BOW and T_BOW, the corresponding A_MBWB and T_MBWB features are generated automatically. The MBWB operator can explore potential information contained in basic features, and enrich syntactic and semantic representation of questions. Experimental results on the Chinese question set show that the classification accuracy gets significantly improved when combining two kinds of MBWB features with basic features.
topic natural language processing
question answering (information retrieval)
support vector machines
learning (artificial intelligence)
pattern classification
Chinese question classification
multilayer bag-of-words binding
potential features
word sense
MBWB operator
classification accuracy
MBWB features
accurate answers
question answering system
semantic features
basic features
question classifier
url https://digital-library.theiet.org/content/journals/10.1049/trit.2017.0016
work_keys_str_mv AT sichunyang enrichingbasicfeaturesviamultilayerbagofwordsbindingforchinesequestionclassification
AT chaogao enrichingbasicfeaturesviamultilayerbagofwordsbindingforchinesequestionclassification
_version_ 1721566268694200320