A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences
This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to “artificial language”, such as computer programming languages, is the language used by the general public for daily communication. Traditional information r...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2014-01-01
|
Series: | The Scientific World Journal |
Online Access: | http://dx.doi.org/10.1155/2014/437162 |
id |
doaj-215f6934fc3342c984ee6c2dd629822b |
---|---|
record_format |
Article |
spelling |
doaj-215f6934fc3342c984ee6c2dd629822b2020-11-25T00:48:42ZengHindawi LimitedThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/437162437162A Grammar-Based Semantic Similarity Algorithm for Natural Language SentencesMing Che Lee0Jia Wei Chang1Tung Cheng Hsieh2Department of Computer and Communication Engineering, Ming Chuan University, Taoyuan 333, TaiwanDepartment of Engineering Science, National Cheng Kung University, Tainan 701, TaiwanDepartment of Visual Communication Design, Hsuan Chuang University, Hsinchu 300, TaiwanThis paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to “artificial language”, such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure.http://dx.doi.org/10.1155/2014/437162 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ming Che Lee Jia Wei Chang Tung Cheng Hsieh |
spellingShingle |
Ming Che Lee Jia Wei Chang Tung Cheng Hsieh A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences The Scientific World Journal |
author_facet |
Ming Che Lee Jia Wei Chang Tung Cheng Hsieh |
author_sort |
Ming Che Lee |
title |
A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences |
title_short |
A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences |
title_full |
A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences |
title_fullStr |
A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences |
title_full_unstemmed |
A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences |
title_sort |
grammar-based semantic similarity algorithm for natural language sentences |
publisher |
Hindawi Limited |
series |
The Scientific World Journal |
issn |
2356-6140 1537-744X |
publishDate |
2014-01-01 |
description |
This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to “artificial language”, such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure. |
url |
http://dx.doi.org/10.1155/2014/437162 |
work_keys_str_mv |
AT mingchelee agrammarbasedsemanticsimilarityalgorithmfornaturallanguagesentences AT jiaweichang agrammarbasedsemanticsimilarityalgorithmfornaturallanguagesentences AT tungchenghsieh agrammarbasedsemanticsimilarityalgorithmfornaturallanguagesentences AT mingchelee grammarbasedsemanticsimilarityalgorithmfornaturallanguagesentences AT jiaweichang grammarbasedsemanticsimilarityalgorithmfornaturallanguagesentences AT tungchenghsieh grammarbasedsemanticsimilarityalgorithmfornaturallanguagesentences |
_version_ |
1725254855080017920 |