A Polarity Capturing Sphere for Word to Vector Representation
Embedding words from a dictionary as vectors in a space has become an active research field, due to its many uses in several natural language processing applications. Distances between the vectors should reflect the relatedness between the corresponding words. The problem with existing word embeddin...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-06-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/10/12/4386 |
id |
doaj-b2c100689b7a4e59b732dfa4d22501a7 |
---|---|
record_format |
Article |
spelling |
doaj-b2c100689b7a4e59b732dfa4d22501a72020-11-25T03:02:45ZengMDPI AGApplied Sciences2076-34172020-06-01104386438610.3390/app10124386A Polarity Capturing Sphere for Word to Vector RepresentationSandra Rizkallah0Amir F. Atiya1Samir Shaheen2Faculty of Engineering, Department of Computer Engineering, Cairo University, Giza Governorate 12613, EgyptFaculty of Engineering, Department of Computer Engineering, Cairo University, Giza Governorate 12613, EgyptFaculty of Engineering, Department of Computer Engineering, Cairo University, Giza Governorate 12613, EgyptEmbedding words from a dictionary as vectors in a space has become an active research field, due to its many uses in several natural language processing applications. Distances between the vectors should reflect the relatedness between the corresponding words. The problem with existing word embedding methods is that they often fail to distinguish between synonymous, antonymous, and unrelated word pairs. Meanwhile, polarity detection is crucial for applications such as sentiment analysis. In this work we propose an embedding approach that is designed to capture the polarity issue. The approach is based on embedding the word vectors into a sphere, whereby the dot product between any vectors represents the similarity. Vectors corresponding to synonymous words would be close to each other on the sphere, while a word and its antonym would lie at opposite poles of the sphere. The approach used to design the vectors is a simple relaxation algorithm. The proposed word embedding is successful in distinguishing between synonyms, antonyms, and unrelated word pairs. It achieves results that are better than those of some of the state-of-the-art techniques and competes well with the others.https://www.mdpi.com/2076-3417/10/12/4386word to vectorword embeddingsantonymy detectionpolarity |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sandra Rizkallah Amir F. Atiya Samir Shaheen |
spellingShingle |
Sandra Rizkallah Amir F. Atiya Samir Shaheen A Polarity Capturing Sphere for Word to Vector Representation Applied Sciences word to vector word embeddings antonymy detection polarity |
author_facet |
Sandra Rizkallah Amir F. Atiya Samir Shaheen |
author_sort |
Sandra Rizkallah |
title |
A Polarity Capturing Sphere for Word to Vector Representation |
title_short |
A Polarity Capturing Sphere for Word to Vector Representation |
title_full |
A Polarity Capturing Sphere for Word to Vector Representation |
title_fullStr |
A Polarity Capturing Sphere for Word to Vector Representation |
title_full_unstemmed |
A Polarity Capturing Sphere for Word to Vector Representation |
title_sort |
polarity capturing sphere for word to vector representation |
publisher |
MDPI AG |
series |
Applied Sciences |
issn |
2076-3417 |
publishDate |
2020-06-01 |
description |
Embedding words from a dictionary as vectors in a space has become an active research field, due to its many uses in several natural language processing applications. Distances between the vectors should reflect the relatedness between the corresponding words. The problem with existing word embedding methods is that they often fail to distinguish between synonymous, antonymous, and unrelated word pairs. Meanwhile, polarity detection is crucial for applications such as sentiment analysis. In this work we propose an embedding approach that is designed to capture the polarity issue. The approach is based on embedding the word vectors into a sphere, whereby the dot product between any vectors represents the similarity. Vectors corresponding to synonymous words would be close to each other on the sphere, while a word and its antonym would lie at opposite poles of the sphere. The approach used to design the vectors is a simple relaxation algorithm. The proposed word embedding is successful in distinguishing between synonyms, antonyms, and unrelated word pairs. It achieves results that are better than those of some of the state-of-the-art techniques and competes well with the others. |
topic |
word to vector word embeddings antonymy detection polarity |
url |
https://www.mdpi.com/2076-3417/10/12/4386 |
work_keys_str_mv |
AT sandrarizkallah apolaritycapturingsphereforwordtovectorrepresentation AT amirfatiya apolaritycapturingsphereforwordtovectorrepresentation AT samirshaheen apolaritycapturingsphereforwordtovectorrepresentation AT sandrarizkallah polaritycapturingsphereforwordtovectorrepresentation AT amirfatiya polaritycapturingsphereforwordtovectorrepresentation AT samirshaheen polaritycapturingsphereforwordtovectorrepresentation |
_version_ |
1724688668218621952 |