An iterative approach for the global estimation of sentence similarity.

Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must...

Full description

Bibliographic Details
Main Authors: Tomoyuki Kajiwara, Danushka Bollegala, Yuichi Yoshida, Ken-Ichi Kawarabayashi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5595307?pdf=render
id doaj-0c3316de02714932b5e25fd148d38417
record_format Article
spelling doaj-0c3316de02714932b5e25fd148d384172020-11-25T00:09:03ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01129e018088510.1371/journal.pone.0180885An iterative approach for the global estimation of sentence similarity.Tomoyuki KajiwaraDanushka BollegalaYuichi YoshidaKen-Ichi KawarabayashiMeasuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must be satisfied by all pairs of sentences in a given corpus. Namely, (a) if two sentences share many features in common, then it is likely that the remaining features in each sentence are also related, and (b) if two sentences contain many related features, then those two sentences are themselves similar. The two constraints are utilized in an iterative bootstrapping procedure that simultaneously updates both word and sentence similarity scores. Experimental results on SemEval 2015 Task 2 dataset show that the proposed iterative approach for measuring sentence semantic similarity is significantly better than the non-iterative counterparts.http://europepmc.org/articles/PMC5595307?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Tomoyuki Kajiwara
Danushka Bollegala
Yuichi Yoshida
Ken-Ichi Kawarabayashi
spellingShingle Tomoyuki Kajiwara
Danushka Bollegala
Yuichi Yoshida
Ken-Ichi Kawarabayashi
An iterative approach for the global estimation of sentence similarity.
PLoS ONE
author_facet Tomoyuki Kajiwara
Danushka Bollegala
Yuichi Yoshida
Ken-Ichi Kawarabayashi
author_sort Tomoyuki Kajiwara
title An iterative approach for the global estimation of sentence similarity.
title_short An iterative approach for the global estimation of sentence similarity.
title_full An iterative approach for the global estimation of sentence similarity.
title_fullStr An iterative approach for the global estimation of sentence similarity.
title_full_unstemmed An iterative approach for the global estimation of sentence similarity.
title_sort iterative approach for the global estimation of sentence similarity.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must be satisfied by all pairs of sentences in a given corpus. Namely, (a) if two sentences share many features in common, then it is likely that the remaining features in each sentence are also related, and (b) if two sentences contain many related features, then those two sentences are themselves similar. The two constraints are utilized in an iterative bootstrapping procedure that simultaneously updates both word and sentence similarity scores. Experimental results on SemEval 2015 Task 2 dataset show that the proposed iterative approach for measuring sentence semantic similarity is significantly better than the non-iterative counterparts.
url http://europepmc.org/articles/PMC5595307?pdf=render
work_keys_str_mv AT tomoyukikajiwara aniterativeapproachfortheglobalestimationofsentencesimilarity
AT danushkabollegala aniterativeapproachfortheglobalestimationofsentencesimilarity
AT yuichiyoshida aniterativeapproachfortheglobalestimationofsentencesimilarity
AT kenichikawarabayashi aniterativeapproachfortheglobalestimationofsentencesimilarity
AT tomoyukikajiwara iterativeapproachfortheglobalestimationofsentencesimilarity
AT danushkabollegala iterativeapproachfortheglobalestimationofsentencesimilarity
AT yuichiyoshida iterativeapproachfortheglobalestimationofsentencesimilarity
AT kenichikawarabayashi iterativeapproachfortheglobalestimationofsentencesimilarity
_version_ 1725413185041727488