An iterative approach for the global estimation of sentence similarity.
Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2017-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5595307?pdf=render |
id |
doaj-0c3316de02714932b5e25fd148d38417 |
---|---|
record_format |
Article |
spelling |
doaj-0c3316de02714932b5e25fd148d384172020-11-25T00:09:03ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01129e018088510.1371/journal.pone.0180885An iterative approach for the global estimation of sentence similarity.Tomoyuki KajiwaraDanushka BollegalaYuichi YoshidaKen-Ichi KawarabayashiMeasuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must be satisfied by all pairs of sentences in a given corpus. Namely, (a) if two sentences share many features in common, then it is likely that the remaining features in each sentence are also related, and (b) if two sentences contain many related features, then those two sentences are themselves similar. The two constraints are utilized in an iterative bootstrapping procedure that simultaneously updates both word and sentence similarity scores. Experimental results on SemEval 2015 Task 2 dataset show that the proposed iterative approach for measuring sentence semantic similarity is significantly better than the non-iterative counterparts.http://europepmc.org/articles/PMC5595307?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tomoyuki Kajiwara Danushka Bollegala Yuichi Yoshida Ken-Ichi Kawarabayashi |
spellingShingle |
Tomoyuki Kajiwara Danushka Bollegala Yuichi Yoshida Ken-Ichi Kawarabayashi An iterative approach for the global estimation of sentence similarity. PLoS ONE |
author_facet |
Tomoyuki Kajiwara Danushka Bollegala Yuichi Yoshida Ken-Ichi Kawarabayashi |
author_sort |
Tomoyuki Kajiwara |
title |
An iterative approach for the global estimation of sentence similarity. |
title_short |
An iterative approach for the global estimation of sentence similarity. |
title_full |
An iterative approach for the global estimation of sentence similarity. |
title_fullStr |
An iterative approach for the global estimation of sentence similarity. |
title_full_unstemmed |
An iterative approach for the global estimation of sentence similarity. |
title_sort |
iterative approach for the global estimation of sentence similarity. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2017-01-01 |
description |
Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must be satisfied by all pairs of sentences in a given corpus. Namely, (a) if two sentences share many features in common, then it is likely that the remaining features in each sentence are also related, and (b) if two sentences contain many related features, then those two sentences are themselves similar. The two constraints are utilized in an iterative bootstrapping procedure that simultaneously updates both word and sentence similarity scores. Experimental results on SemEval 2015 Task 2 dataset show that the proposed iterative approach for measuring sentence semantic similarity is significantly better than the non-iterative counterparts. |
url |
http://europepmc.org/articles/PMC5595307?pdf=render |
work_keys_str_mv |
AT tomoyukikajiwara aniterativeapproachfortheglobalestimationofsentencesimilarity AT danushkabollegala aniterativeapproachfortheglobalestimationofsentencesimilarity AT yuichiyoshida aniterativeapproachfortheglobalestimationofsentencesimilarity AT kenichikawarabayashi aniterativeapproachfortheglobalestimationofsentencesimilarity AT tomoyukikajiwara iterativeapproachfortheglobalestimationofsentencesimilarity AT danushkabollegala iterativeapproachfortheglobalestimationofsentencesimilarity AT yuichiyoshida iterativeapproachfortheglobalestimationofsentencesimilarity AT kenichikawarabayashi iterativeapproachfortheglobalestimationofsentencesimilarity |
_version_ |
1725413185041727488 |