A Polarity Capturing Sphere for Word to Vector Representation

Embedding words from a dictionary as vectors in a space has become an active research field, due to its many uses in several natural language processing applications. Distances between the vectors should reflect the relatedness between the corresponding words. The problem with existing word embeddin...

Full description

Bibliographic Details
Main Authors: Sandra Rizkallah, Amir F. Atiya, Samir Shaheen
Format: Article
Language:English
Published: MDPI AG 2020-06-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/12/4386
id doaj-b2c100689b7a4e59b732dfa4d22501a7
record_format Article
spelling doaj-b2c100689b7a4e59b732dfa4d22501a72020-11-25T03:02:45ZengMDPI AGApplied Sciences2076-34172020-06-01104386438610.3390/app10124386A Polarity Capturing Sphere for Word to Vector RepresentationSandra Rizkallah0Amir F. Atiya1Samir Shaheen2Faculty of Engineering, Department of Computer Engineering, Cairo University, Giza Governorate 12613, EgyptFaculty of Engineering, Department of Computer Engineering, Cairo University, Giza Governorate 12613, EgyptFaculty of Engineering, Department of Computer Engineering, Cairo University, Giza Governorate 12613, EgyptEmbedding words from a dictionary as vectors in a space has become an active research field, due to its many uses in several natural language processing applications. Distances between the vectors should reflect the relatedness between the corresponding words. The problem with existing word embedding methods is that they often fail to distinguish between synonymous, antonymous, and unrelated word pairs. Meanwhile, polarity detection is crucial for applications such as sentiment analysis. In this work we propose an embedding approach that is designed to capture the polarity issue. The approach is based on embedding the word vectors into a sphere, whereby the dot product between any vectors represents the similarity. Vectors corresponding to synonymous words would be close to each other on the sphere, while a word and its antonym would lie at opposite poles of the sphere. The approach used to design the vectors is a simple relaxation algorithm. The proposed word embedding is successful in distinguishing between synonyms, antonyms, and unrelated word pairs. It achieves results that are better than those of some of the state-of-the-art techniques and competes well with the others.https://www.mdpi.com/2076-3417/10/12/4386word to vectorword embeddingsantonymy detectionpolarity
collection DOAJ
language English
format Article
sources DOAJ
author Sandra Rizkallah
Amir F. Atiya
Samir Shaheen
spellingShingle Sandra Rizkallah
Amir F. Atiya
Samir Shaheen
A Polarity Capturing Sphere for Word to Vector Representation
Applied Sciences
word to vector
word embeddings
antonymy detection
polarity
author_facet Sandra Rizkallah
Amir F. Atiya
Samir Shaheen
author_sort Sandra Rizkallah
title A Polarity Capturing Sphere for Word to Vector Representation
title_short A Polarity Capturing Sphere for Word to Vector Representation
title_full A Polarity Capturing Sphere for Word to Vector Representation
title_fullStr A Polarity Capturing Sphere for Word to Vector Representation
title_full_unstemmed A Polarity Capturing Sphere for Word to Vector Representation
title_sort polarity capturing sphere for word to vector representation
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2020-06-01
description Embedding words from a dictionary as vectors in a space has become an active research field, due to its many uses in several natural language processing applications. Distances between the vectors should reflect the relatedness between the corresponding words. The problem with existing word embedding methods is that they often fail to distinguish between synonymous, antonymous, and unrelated word pairs. Meanwhile, polarity detection is crucial for applications such as sentiment analysis. In this work we propose an embedding approach that is designed to capture the polarity issue. The approach is based on embedding the word vectors into a sphere, whereby the dot product between any vectors represents the similarity. Vectors corresponding to synonymous words would be close to each other on the sphere, while a word and its antonym would lie at opposite poles of the sphere. The approach used to design the vectors is a simple relaxation algorithm. The proposed word embedding is successful in distinguishing between synonyms, antonyms, and unrelated word pairs. It achieves results that are better than those of some of the state-of-the-art techniques and competes well with the others.
topic word to vector
word embeddings
antonymy detection
polarity
url https://www.mdpi.com/2076-3417/10/12/4386
work_keys_str_mv AT sandrarizkallah apolaritycapturingsphereforwordtovectorrepresentation
AT amirfatiya apolaritycapturingsphereforwordtovectorrepresentation
AT samirshaheen apolaritycapturingsphereforwordtovectorrepresentation
AT sandrarizkallah polaritycapturingsphereforwordtovectorrepresentation
AT amirfatiya polaritycapturingsphereforwordtovectorrepresentation
AT samirshaheen polaritycapturingsphereforwordtovectorrepresentation
_version_ 1724688668218621952