Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.

Some disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are no...

Full description

Bibliographic Details
Main Authors: Jan Ketil Arnulf, Kai Rune Larsen, Øyvind Lund Martinsen, Chih How Bong
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4153608?pdf=render
id doaj-52fb3da0c5e14649b2d093bc69916ffd
record_format Article
spelling doaj-52fb3da0c5e14649b2d093bc69916ffd2020-11-24T22:08:10ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0199e10636110.1371/journal.pone.0106361Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.Jan Ketil ArnulfKai Rune LarsenØyvind Lund MartinsenChih How BongSome disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are now available as new research methods. Language processing algorithms were used to calculate the semantic similarity among all items in state-of-the-art surveys from Organisational Behaviour research. These surveys covered areas such as transformational leadership, work motivation and work outcomes. This information was used to explain and predict the response patterns from real subjects. Semantic algorithms explained 60-86% of the variance in the response patterns and allowed remarkably precise prediction of survey responses from humans, except in a personality test. Even the relationships between independent and their purported dependent variables were accurately predicted. This raises concern about the empirical nature of data collected through some surveys if results are already given a priori through the way subjects are being asked. Survey response patterns seem heavily determined by semantics. Language algorithms may suggest these prior to administering a survey. This study suggests that semantic algorithms are becoming new tools for the social sciences, opening perspectives on survey responses that prevalent psychometric theory cannot explain.http://europepmc.org/articles/PMC4153608?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Jan Ketil Arnulf
Kai Rune Larsen
Øyvind Lund Martinsen
Chih How Bong
spellingShingle Jan Ketil Arnulf
Kai Rune Larsen
Øyvind Lund Martinsen
Chih How Bong
Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
PLoS ONE
author_facet Jan Ketil Arnulf
Kai Rune Larsen
Øyvind Lund Martinsen
Chih How Bong
author_sort Jan Ketil Arnulf
title Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
title_short Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
title_full Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
title_fullStr Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
title_full_unstemmed Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
title_sort predicting survey responses: how and why semantics shape survey statistics on organizational behaviour.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Some disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are now available as new research methods. Language processing algorithms were used to calculate the semantic similarity among all items in state-of-the-art surveys from Organisational Behaviour research. These surveys covered areas such as transformational leadership, work motivation and work outcomes. This information was used to explain and predict the response patterns from real subjects. Semantic algorithms explained 60-86% of the variance in the response patterns and allowed remarkably precise prediction of survey responses from humans, except in a personality test. Even the relationships between independent and their purported dependent variables were accurately predicted. This raises concern about the empirical nature of data collected through some surveys if results are already given a priori through the way subjects are being asked. Survey response patterns seem heavily determined by semantics. Language algorithms may suggest these prior to administering a survey. This study suggests that semantic algorithms are becoming new tools for the social sciences, opening perspectives on survey responses that prevalent psychometric theory cannot explain.
url http://europepmc.org/articles/PMC4153608?pdf=render
work_keys_str_mv AT janketilarnulf predictingsurveyresponseshowandwhysemanticsshapesurveystatisticsonorganizationalbehaviour
AT kairunelarsen predictingsurveyresponseshowandwhysemanticsshapesurveystatisticsonorganizationalbehaviour
AT øyvindlundmartinsen predictingsurveyresponseshowandwhysemanticsshapesurveystatisticsonorganizationalbehaviour
AT chihhowbong predictingsurveyresponseshowandwhysemanticsshapesurveystatisticsonorganizationalbehaviour
_version_ 1725817370339966976