Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings

Word embeddings trained on natural corpora (e.g., newspaper collections, Wikipedia or the Web) excel in capturing thematic similarity (“topical relatedness”) on word pairs such as ‘coffee’ and ‘cup’ or ‘bus’ and ‘road’. However, they are less successful on pairs showing taxonomic similarity, like ‘c...

Bibliographic Details
Main Authors: Maldonado Alfredo, Klubička Filip, Kelleher John
Format: Article
Language: English
Published: De Gruyter 2019-10-01
Series: Open Computer Science
Online Access: https://doi.org/10.1515/comp-2019-0009