The development of a fuzzy semantic sentence similarity measure

A problem in the field of semantic sentence similarity is that existing sentence similarity measures cannot accurately represent the effect that perception-based (fuzzy) words, which are commonly used in natural language, have on sentence similarity. This research project developed a new sentence similari...

Bibliographic Details
Main Author: Chandran, Gautam David
Published: Manchester Metropolitan University 2013
Online Access: https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.765088
id ndltd-bl.uk-oai-ethos.bl.uk-765088
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-765088 | 2019-03-05T15:45:08Z | The development of a fuzzy semantic sentence similarity measure | Chandran, Gautam David | 2013 | Manchester Metropolitan University | https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.765088 | http://e-space.mmu.ac.uk/617190/ | Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
description A problem in the field of semantic sentence similarity is that existing sentence similarity measures cannot accurately represent the effect that perception-based (fuzzy) words, which are commonly used in natural language, have on sentence similarity. This research project developed a new sentence similarity measure to solve this problem. The new measure, Fuzzy Algorithm for Similarity Testing (FAST), is a novel ontology-based similarity measure that uses concepts from fuzzy sets and computing with words to represent fuzzy words accurately. Through human experimentation, fuzzy sets were created for six categories of words based on their levels of association with particular concepts. These fuzzy sets were then defuzzified, and the results were used to create new ontological relations between the fuzzy words they contain; from these relations a new fuzzy ontology was created. These relationships underpin a new ontology-based fuzzy semantic text similarity algorithm that is able to show the effect of fuzzy words on computed sentence similarity, as well as the effect that fuzzy words have on non-fuzzy words within a sentence. To evaluate FAST, two new test datasets were created through questionnaire-based human experimentation. This involved developing a robust methodology for creating usable fuzzy datasets (including an automated method that was used to create one of the two fuzzy datasets). FAST was evaluated through experiments conducted using the new fuzzy datasets. The results showed that FAST correlated more closely with human test results than two existing sentence similarity measures did, demonstrating its success in representing the similarity between pairs of sentences containing fuzzy words.
author Chandran, Gautam David
title The development of a fuzzy semantic sentence similarity measure
publisher Manchester Metropolitan University
publishDate 2013
url https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.765088