Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings

Big data approaches to psychology have become increasing popular (Jones, 2017). Two of the main developments of this line of research is the advent of distributional models of semantics (e.g., Landauer and Dumais, 1997), which learn the meaning of words from large text corpora, and the collection of...

Full description

Bibliographic Details
Main Author: Brendan T. Johns
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-02-01
Series:Frontiers in Psychology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fpsyg.2019.00268/full
id doaj-4080ddef113e4d70be70cf5179a6a571
record_format Article
spelling doaj-4080ddef113e4d70be70cf5179a6a5712020-11-24T23:12:19ZengFrontiers Media S.A.Frontiers in Psychology1664-10782019-02-011010.3389/fpsyg.2019.00268429879Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word MeaningsBrendan T. JohnsBig data approaches to psychology have become increasing popular (Jones, 2017). Two of the main developments of this line of research is the advent of distributional models of semantics (e.g., Landauer and Dumais, 1997), which learn the meaning of words from large text corpora, and the collection of mega datasets of human behavior (e.g., The English lexicon project; Balota et al., 2007). The current article combines these two approaches, with the goal being to understand the consistency and preference that people have for word meanings. This was accomplished by mining a large amount of data from an online, crowdsourced dictionary and analyzing this data with a distributional model. Overall, it was found that even for words that are not an active part of the language environment, there is a large amount of consistency in the word meanings that different people have. Additionally, it was demonstrated that users of a language have strong preferences for word meanings, such that definitions to words that do not conform to people’s conceptions are rejected by a community of language users. The results of this article provides insights into the cultural evolution of word meanings, and sheds light on alternative methodologies that can be used to understand lexical behavior.https://www.frontiersin.org/article/10.3389/fpsyg.2019.00268/fulldistributional semanticssemantic memorybig datacorpus studiesknowledge acquisition
collection DOAJ
language English
format Article
sources DOAJ
author Brendan T. Johns
spellingShingle Brendan T. Johns
Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings
Frontiers in Psychology
distributional semantics
semantic memory
big data
corpus studies
knowledge acquisition
author_facet Brendan T. Johns
author_sort Brendan T. Johns
title Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings
title_short Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings
title_full Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings
title_fullStr Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings
title_full_unstemmed Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings
title_sort mining a crowdsourced dictionary to understand consistency and preference in word meanings
publisher Frontiers Media S.A.
series Frontiers in Psychology
issn 1664-1078
publishDate 2019-02-01
description Big data approaches to psychology have become increasing popular (Jones, 2017). Two of the main developments of this line of research is the advent of distributional models of semantics (e.g., Landauer and Dumais, 1997), which learn the meaning of words from large text corpora, and the collection of mega datasets of human behavior (e.g., The English lexicon project; Balota et al., 2007). The current article combines these two approaches, with the goal being to understand the consistency and preference that people have for word meanings. This was accomplished by mining a large amount of data from an online, crowdsourced dictionary and analyzing this data with a distributional model. Overall, it was found that even for words that are not an active part of the language environment, there is a large amount of consistency in the word meanings that different people have. Additionally, it was demonstrated that users of a language have strong preferences for word meanings, such that definitions to words that do not conform to people’s conceptions are rejected by a community of language users. The results of this article provides insights into the cultural evolution of word meanings, and sheds light on alternative methodologies that can be used to understand lexical behavior.
topic distributional semantics
semantic memory
big data
corpus studies
knowledge acquisition
url https://www.frontiersin.org/article/10.3389/fpsyg.2019.00268/full
work_keys_str_mv AT brendantjohns miningacrowdsourceddictionarytounderstandconsistencyandpreferenceinwordmeanings
_version_ 1725601377144537088