Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method

Word sense disambiguation (WSD) is one of tricky tasks in natural language processing (NLP) as it needs to take into full account all the complexities of language. Because WSD involves in discovering semantic structures from unstructured text, automatic knowledge acquisition of word sense is profoun...

Full description

Bibliographic Details
Main Authors:	Xu Huang, Xiulan Hao, Qing Shen, Bin Shao
Format:	Article
Language:	English
Published:	European Alliance for Innovation (EAI) 2015-07-01
Series:	EAI Endorsed Transactions on Scalable Information Systems
Subjects:	Rough Set (RS) Instance-based learning (IL) Word Sense Disambiguation (WSD) Knowledge Acquisition Natural Language Processing (NLP)
Online Access:	http://eudl.eu/doi/10.4108/sis.2.5.e3

id	doaj-21864666060b42648c8591d6b944e800
record_format	Article
spelling	doaj-21864666060b42648c8591d6b944e8002020-11-25T02:20:20ZengEuropean Alliance for Innovation (EAI)EAI Endorsed Transactions on Scalable Information Systems2032-94072015-07-01251710.4108/sis.2.5.e3Incremental Knowledge Acquisition for WSD: A Rough Set and IL based MethodXu Huang0Xiulan Hao1Qing Shen2Bin Shao3School of Information Engineering, Huzhou University, Huzhou, Zhejiang, 313000, China; Department of Control Science and Engineering, Zhejiang University, Hangzhou, Zhejiang, 310058, ChinaSchool of Information Engineering, Huzhou University, Huzhou, Zhejiang, 313000, ChinaSchool of Information Engineering, Huzhou University, Huzhou, Zhejiang, 313000, ChinaSchool of Information Engineering, Huzhou University, Huzhou, Zhejiang, 313000, ChinaWord sense disambiguation (WSD) is one of tricky tasks in natural language processing (NLP) as it needs to take into full account all the complexities of language. Because WSD involves in discovering semantic structures from unstructured text, automatic knowledge acquisition of word sense is profoundly difficult. To acquire knowledge about Chinese multi-sense verbs, we introduce an incremental machine learning method which combines rough set method and instance based learning. First, context of a multi-sense verb is extracted into a table; its sense is annotated by a skilled human and stored in the same table. By this way, decision table is formed, and then rules can be extracted within the framework of attributive value reduction of rough set. Instances not entailed by any rule are treated as outliers. When new instances are added to decision table, only the new added and outliers need to be learned further, thus incremental leaning is fulfilled. Experiments show the scale of decision table can be reduced dramatically by this method without performance decline.http://eudl.eu/doi/10.4108/sis.2.5.e3Rough Set (RS)Instance-based learning (IL)Word Sense Disambiguation (WSD)Knowledge AcquisitionNatural Language Processing (NLP)
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xu Huang Xiulan Hao Qing Shen Bin Shao
spellingShingle	Xu Huang Xiulan Hao Qing Shen Bin Shao Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method EAI Endorsed Transactions on Scalable Information Systems Rough Set (RS) Instance-based learning (IL) Word Sense Disambiguation (WSD) Knowledge Acquisition Natural Language Processing (NLP)
author_facet	Xu Huang Xiulan Hao Qing Shen Bin Shao
author_sort	Xu Huang
title	Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method
title_short	Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method
title_full	Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method
title_fullStr	Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method
title_full_unstemmed	Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method
title_sort	incremental knowledge acquisition for wsd: a rough set and il based method
publisher	European Alliance for Innovation (EAI)
series	EAI Endorsed Transactions on Scalable Information Systems
issn	2032-9407
publishDate	2015-07-01
description	Word sense disambiguation (WSD) is one of tricky tasks in natural language processing (NLP) as it needs to take into full account all the complexities of language. Because WSD involves in discovering semantic structures from unstructured text, automatic knowledge acquisition of word sense is profoundly difficult. To acquire knowledge about Chinese multi-sense verbs, we introduce an incremental machine learning method which combines rough set method and instance based learning. First, context of a multi-sense verb is extracted into a table; its sense is annotated by a skilled human and stored in the same table. By this way, decision table is formed, and then rules can be extracted within the framework of attributive value reduction of rough set. Instances not entailed by any rule are treated as outliers. When new instances are added to decision table, only the new added and outliers need to be learned further, thus incremental leaning is fulfilled. Experiments show the scale of decision table can be reduced dramatically by this method without performance decline.
topic	Rough Set (RS) Instance-based learning (IL) Word Sense Disambiguation (WSD) Knowledge Acquisition Natural Language Processing (NLP)
url	http://eudl.eu/doi/10.4108/sis.2.5.e3
work_keys_str_mv	AT xuhuang incrementalknowledgeacquisitionforwsdaroughsetandilbasedmethod AT xiulanhao incrementalknowledgeacquisitionforwsdaroughsetandilbasedmethod AT qingshen incrementalknowledgeacquisitionforwsdaroughsetandilbasedmethod AT binshao incrementalknowledgeacquisitionforwsdaroughsetandilbasedmethod
_version_	1724872107121180672

Incremental Knowledge Acquisition for WSD: A Rough Set and IL based Method

Similar Items