A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts

碩士 === 國立雲林科技大學 === 資訊工程系碩士班 === 101 === NTCIR-9 RITE task evaluates systems which automatically detect entailment, paraphrase, and contradiction in texts. We developed a preliminary system for the NTCIR-9 RITE task based on rules. In NTCIR-10, we tried machine learning approaches. We transformed th...

Full description

Bibliographic Details
Main Authors: Nai-Hsuan Han, 韓乃軒
Other Authors: Lun-Wei Ku
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/44594957024872746237
id ndltd-TW-101YUNT5392011
record_format oai_dc
spelling ndltd-TW-101YUNT53920112015-10-13T22:57:23Z http://ndltd.ncl.edu.tw/handle/44594957024872746237 A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts 語句關係判斷:機器學習特徵選取之研究 Nai-Hsuan Han 韓乃軒 碩士 國立雲林科技大學 資訊工程系碩士班 101 NTCIR-9 RITE task evaluates systems which automatically detect entailment, paraphrase, and contradiction in texts. We developed a preliminary system for the NTCIR-9 RITE task based on rules. In NTCIR-10, we tried machine learning approaches. We transformed the existing rules into features and then added additional syntactic and semantic features for SVM. The straightforward assumption was still kept in NTCIR-10: the relation between two sentences was determined by the different parts between them instead of the identical parts. Therefore, features in NTCIR-9 including sentence lengths, the content of matched keywords, quantities of matched keywords, and their parts of speech together with new features such as parsing tree information, dependency relations, negation words and synonyms were considered. We found that some features were useful for the BC subtask while some help more in the MC subtask. Lun-Wei Ku Edward T.-H. Chu 古倫維 朱宗賢 2013 學位論文 ; thesis 48 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立雲林科技大學 === 資訊工程系碩士班 === 101 === NTCIR-9 RITE task evaluates systems which automatically detect entailment, paraphrase, and contradiction in texts. We developed a preliminary system for the NTCIR-9 RITE task based on rules. In NTCIR-10, we tried machine learning approaches. We transformed the existing rules into features and then added additional syntactic and semantic features for SVM. The straightforward assumption was still kept in NTCIR-10: the relation between two sentences was determined by the different parts between them instead of the identical parts. Therefore, features in NTCIR-9 including sentence lengths, the content of matched keywords, quantities of matched keywords, and their parts of speech together with new features such as parsing tree information, dependency relations, negation words and synonyms were considered. We found that some features were useful for the BC subtask while some help more in the MC subtask.
author2 Lun-Wei Ku
author_facet Lun-Wei Ku
Nai-Hsuan Han
韓乃軒
author Nai-Hsuan Han
韓乃軒
spellingShingle Nai-Hsuan Han
韓乃軒
A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts
author_sort Nai-Hsuan Han
title A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts
title_short A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts
title_full A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts
title_fullStr A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts
title_full_unstemmed A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts
title_sort study of selecting machine learning features for detecting entailment, paraphrase and contradiction in texts
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/44594957024872746237
work_keys_str_mv AT naihsuanhan astudyofselectingmachinelearningfeaturesfordetectingentailmentparaphraseandcontradictionintexts
AT hánnǎixuān astudyofselectingmachinelearningfeaturesfordetectingentailmentparaphraseandcontradictionintexts
AT naihsuanhan yǔjùguānxìpànduànjīqìxuéxítèzhēngxuǎnqǔzhīyánjiū
AT hánnǎixuān yǔjùguānxìpànduànjīqìxuéxítèzhēngxuǎnqǔzhīyánjiū
AT naihsuanhan studyofselectingmachinelearningfeaturesfordetectingentailmentparaphraseandcontradictionintexts
AT hánnǎixuān studyofselectingmachinelearningfeaturesfordetectingentailmentparaphraseandcontradictionintexts
_version_ 1718082879805718528