Enriching basic features via multilayer bag-of-words binding for Chinese question classification
Question classification helps to generate more accurate answers in question answering system. For an efficient question classifier, one of the most important tasks is to fully mine useful features. Aiming at solving the problem of lacking of rich syntax and semantic features in Chinese question clas...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2019-03-01
|
Series: | CAAI Transactions on Intelligence Technology |
Subjects: | |
Online Access: | https://digital-library.theiet.org/content/journals/10.1049/trit.2017.0016 |
id |
doaj-0ee744d072f341c796f371144ba7c9ee |
---|---|
record_format |
Article |
spelling |
doaj-0ee744d072f341c796f371144ba7c9ee2021-04-02T13:07:37ZengWileyCAAI Transactions on Intelligence Technology2468-23222019-03-0110.1049/trit.2017.0016TRIT.2017.0016Enriching basic features via multilayer bag-of-words binding for Chinese question classificationSichun Yang0Chao Gao1School of Computer Science and Technology, Anhui University of TechnologyDepartment of Computer and Software Engineering, Anhui Institute of Information TechnologyQuestion classification helps to generate more accurate answers in question answering system. For an efficient question classifier, one of the most important tasks is to fully mine useful features. Aiming at solving the problem of lacking of rich syntax and semantic features in Chinese question classification, an operator called MBWB (multilayer bag-of-words binding) is proposed to extract potential features by binding part-of-speech, word sense, named entity and other basic features to bag-of-words, respectively. Through performing MBWB operator on two kinds of bag-of-words, i.e. A_BOW and T_BOW, the corresponding A_MBWB and T_MBWB features are generated automatically. The MBWB operator can explore potential information contained in basic features, and enrich syntactic and semantic representation of questions. Experimental results on the Chinese question set show that the classification accuracy gets significantly improved when combining two kinds of MBWB features with basic features.https://digital-library.theiet.org/content/journals/10.1049/trit.2017.0016natural language processingquestion answering (information retrieval)support vector machineslearning (artificial intelligence)pattern classificationChinese question classificationmultilayer bag-of-words bindingpotential featuresword senseMBWB operatorclassification accuracyMBWB featuresaccurate answersquestion answering systemsemantic featuresbasic featuresquestion classifier |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sichun Yang Chao Gao |
spellingShingle |
Sichun Yang Chao Gao Enriching basic features via multilayer bag-of-words binding for Chinese question classification CAAI Transactions on Intelligence Technology natural language processing question answering (information retrieval) support vector machines learning (artificial intelligence) pattern classification Chinese question classification multilayer bag-of-words binding potential features word sense MBWB operator classification accuracy MBWB features accurate answers question answering system semantic features basic features question classifier |
author_facet |
Sichun Yang Chao Gao |
author_sort |
Sichun Yang |
title |
Enriching basic features via multilayer bag-of-words binding for Chinese question classification |
title_short |
Enriching basic features via multilayer bag-of-words binding for Chinese question classification |
title_full |
Enriching basic features via multilayer bag-of-words binding for Chinese question classification |
title_fullStr |
Enriching basic features via multilayer bag-of-words binding for Chinese question classification |
title_full_unstemmed |
Enriching basic features via multilayer bag-of-words binding for Chinese question classification |
title_sort |
enriching basic features via multilayer bag-of-words binding for chinese question classification |
publisher |
Wiley |
series |
CAAI Transactions on Intelligence Technology |
issn |
2468-2322 |
publishDate |
2019-03-01 |
description |
Question classification helps to generate more accurate answers in question answering system. For an efficient question classifier, one of the most important tasks is to fully mine useful features. Aiming at solving the problem of lacking of rich syntax and semantic features in Chinese question classification, an operator called MBWB (multilayer bag-of-words binding) is proposed to extract potential features by binding part-of-speech, word sense, named entity and other basic features to bag-of-words, respectively. Through performing MBWB operator on two kinds of bag-of-words, i.e. A_BOW and T_BOW, the corresponding A_MBWB and T_MBWB features are generated automatically. The MBWB operator can explore potential information contained in basic features, and enrich syntactic and semantic representation of questions. Experimental results on the Chinese question set show that the classification accuracy gets significantly improved when combining two kinds of MBWB features with basic features. |
topic |
natural language processing question answering (information retrieval) support vector machines learning (artificial intelligence) pattern classification Chinese question classification multilayer bag-of-words binding potential features word sense MBWB operator classification accuracy MBWB features accurate answers question answering system semantic features basic features question classifier |
url |
https://digital-library.theiet.org/content/journals/10.1049/trit.2017.0016 |
work_keys_str_mv |
AT sichunyang enrichingbasicfeaturesviamultilayerbagofwordsbindingforchinesequestionclassification AT chaogao enrichingbasicfeaturesviamultilayerbagofwordsbindingforchinesequestionclassification |
_version_ |
1721566268694200320 |