A Study of Selecting Machine Learning Features for Detecting Entailment, Paraphrase and Contradiction in Texts

碩士 === 國立雲林科技大學 === 資訊工程系碩士班 === 101 === NTCIR-9 RITE task evaluates systems which automatically detect entailment, paraphrase, and contradiction in texts. We developed a preliminary system for the NTCIR-9 RITE task based on rules. In NTCIR-10, we tried machine learning approaches. We transformed th...

Full description

Bibliographic Details
Main Authors: Nai-Hsuan Han, 韓乃軒
Other Authors: Lun-Wei Ku
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/44594957024872746237
Description
Summary:碩士 === 國立雲林科技大學 === 資訊工程系碩士班 === 101 === NTCIR-9 RITE task evaluates systems which automatically detect entailment, paraphrase, and contradiction in texts. We developed a preliminary system for the NTCIR-9 RITE task based on rules. In NTCIR-10, we tried machine learning approaches. We transformed the existing rules into features and then added additional syntactic and semantic features for SVM. The straightforward assumption was still kept in NTCIR-10: the relation between two sentences was determined by the different parts between them instead of the identical parts. Therefore, features in NTCIR-9 including sentence lengths, the content of matched keywords, quantities of matched keywords, and their parts of speech together with new features such as parsing tree information, dependency relations, negation words and synonyms were considered. We found that some features were useful for the BC subtask while some help more in the MC subtask.