Feature Combination for Measuring Sentence Similarity

Sentence similarity is one of the core elements of Natural Language Processing (NLP) tasks such as Recognizing Textual Entailment, and Paraphrase Recognition. Over the years, different systems have been proposed to measure similarity between fragments of texts. In this research, we propose a ne...

Full description

Bibliographic Details
Main Author: Shareghi Nojehdeh, Ehsan
Format: Others
Published: 2013
Online Access:http://spectrum.library.concordia.ca/977146/1/Shareghi%2DNojehdeh_MCompSc_S2013.pdf
Shareghi Nojehdeh, Ehsan <http://spectrum.library.concordia.ca/view/creators/Shareghi_Nojehdeh=3AEhsan=3A=3A.html> (2013) Feature Combination for Measuring Sentence Similarity. Masters thesis, Concordia University.
id ndltd-LACETR-oai-collectionscanada.gc.ca-QMG.977146
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-QMG.9771462013-10-22T03:48:15Z Feature Combination for Measuring Sentence Similarity Shareghi Nojehdeh, Ehsan Sentence similarity is one of the core elements of Natural Language Processing (NLP) tasks such as Recognizing Textual Entailment, and Paraphrase Recognition. Over the years, different systems have been proposed to measure similarity between fragments of texts. In this research, we propose a new two phase supervised learning method which uses a combination of lexical features to train a model for predicting similarity between sentences. Each of these features, covers an aspect of the text on implicit or explicit level. The two phase method uses all combinations of the features in the feature space and trains separate models based on each combination. Then it creates a meta-feature space and trains a final model based on that. The thesis contrasts existing approaches that use feature selection, because it does not aim to find the best subset of the possible features. We show that this two step process significantly improves the results achieved by single-layer standard learning methodology, and achieves the level of performance that is comparable to the existing state-of-the-art methods. 2013-04-10 Thesis NonPeerReviewed application/pdf http://spectrum.library.concordia.ca/977146/1/Shareghi%2DNojehdeh_MCompSc_S2013.pdf Shareghi Nojehdeh, Ehsan <http://spectrum.library.concordia.ca/view/creators/Shareghi_Nojehdeh=3AEhsan=3A=3A.html> (2013) Feature Combination for Measuring Sentence Similarity. Masters thesis, Concordia University. http://spectrum.library.concordia.ca/977146/
collection NDLTD
format Others
sources NDLTD
description Sentence similarity is one of the core elements of Natural Language Processing (NLP) tasks such as Recognizing Textual Entailment, and Paraphrase Recognition. Over the years, different systems have been proposed to measure similarity between fragments of texts. In this research, we propose a new two phase supervised learning method which uses a combination of lexical features to train a model for predicting similarity between sentences. Each of these features, covers an aspect of the text on implicit or explicit level. The two phase method uses all combinations of the features in the feature space and trains separate models based on each combination. Then it creates a meta-feature space and trains a final model based on that. The thesis contrasts existing approaches that use feature selection, because it does not aim to find the best subset of the possible features. We show that this two step process significantly improves the results achieved by single-layer standard learning methodology, and achieves the level of performance that is comparable to the existing state-of-the-art methods.
author Shareghi Nojehdeh, Ehsan
spellingShingle Shareghi Nojehdeh, Ehsan
Feature Combination for Measuring Sentence Similarity
author_facet Shareghi Nojehdeh, Ehsan
author_sort Shareghi Nojehdeh, Ehsan
title Feature Combination for Measuring Sentence Similarity
title_short Feature Combination for Measuring Sentence Similarity
title_full Feature Combination for Measuring Sentence Similarity
title_fullStr Feature Combination for Measuring Sentence Similarity
title_full_unstemmed Feature Combination for Measuring Sentence Similarity
title_sort feature combination for measuring sentence similarity
publishDate 2013
url http://spectrum.library.concordia.ca/977146/1/Shareghi%2DNojehdeh_MCompSc_S2013.pdf
Shareghi Nojehdeh, Ehsan <http://spectrum.library.concordia.ca/view/creators/Shareghi_Nojehdeh=3AEhsan=3A=3A.html> (2013) Feature Combination for Measuring Sentence Similarity. Masters thesis, Concordia University.
work_keys_str_mv AT shareghinojehdehehsan featurecombinationformeasuringsentencesimilarity
_version_ 1716608384742981632