Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges

The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its n...

Full description

Bibliographic Details
Main Authors: Evelina Leivada, Roberta D’Alessandro, Kleanthes K. Grohmann
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-02-01
Series:Frontiers in Psychology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fpsyg.2019.00313/full
id doaj-ab89e779598d4d268f1b9137975c1ae3
record_format Article
spelling doaj-ab89e779598d4d268f1b9137975c1ae32020-11-24T21:40:44ZengFrontiers Media S.A.Frontiers in Psychology1664-10782019-02-011010.3389/fpsyg.2019.00313429300Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental ChallengesEvelina Leivada0Roberta D’Alessandro1Kleanthes K. Grohmann2Department of Language and Culture, UiT The Arctic University of Norway, Tromsø, NorwayUtrecht Institute of Linguistics, UiL-OTS, Utrecht University, Utrecht, NetherlandsDepartment of English Studies, University of Cyprus, Nicosia, CyprusThe aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its nature and implications, (ii) offer one or more examples of how it is manifested in actual linguistic communities, and (iii) where possible, offer recommendations for addressing it effectively. Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages and how it affects longitudinal collection of big amounts of data), and interactive relations between non-dynamic features that are nevertheless subject to potentially rapid change (e.g., absence of standardized assessment tools or estimates for psycholinguistic variables). The identified challenges represent the domains of data collection and handling, participant recruitment, and experimental design. Among other issues, we discuss population limits and degree of power, inter- and intraspeaker variation, absence of metalanguage and its implications for the process of eliciting acceptability judgments, and challenges that arise from absence of local funding, conflicting regulations in relation to privacy issues, and exporting large samples of data across countries. Finally, the ten experimental challenges presented are relevant to languages from a broad typological spectrum, encompassing both spoken and sign, extant and nearly extinct languages.https://www.frontiersin.org/article/10.3389/fpsyg.2019.00313/fullfieldworkrich databig dataexperimental designdialectsign language
collection DOAJ
language English
format Article
sources DOAJ
author Evelina Leivada
Roberta D’Alessandro
Kleanthes K. Grohmann
spellingShingle Evelina Leivada
Roberta D’Alessandro
Kleanthes K. Grohmann
Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
Frontiers in Psychology
fieldwork
rich data
big data
experimental design
dialect
sign language
author_facet Evelina Leivada
Roberta D’Alessandro
Kleanthes K. Grohmann
author_sort Evelina Leivada
title Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
title_short Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
title_full Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
title_fullStr Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
title_full_unstemmed Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
title_sort eliciting big data from small, young, or non-standard languages: 10 experimental challenges
publisher Frontiers Media S.A.
series Frontiers in Psychology
issn 1664-1078
publishDate 2019-02-01
description The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its nature and implications, (ii) offer one or more examples of how it is manifested in actual linguistic communities, and (iii) where possible, offer recommendations for addressing it effectively. Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages and how it affects longitudinal collection of big amounts of data), and interactive relations between non-dynamic features that are nevertheless subject to potentially rapid change (e.g., absence of standardized assessment tools or estimates for psycholinguistic variables). The identified challenges represent the domains of data collection and handling, participant recruitment, and experimental design. Among other issues, we discuss population limits and degree of power, inter- and intraspeaker variation, absence of metalanguage and its implications for the process of eliciting acceptability judgments, and challenges that arise from absence of local funding, conflicting regulations in relation to privacy issues, and exporting large samples of data across countries. Finally, the ten experimental challenges presented are relevant to languages from a broad typological spectrum, encompassing both spoken and sign, extant and nearly extinct languages.
topic fieldwork
rich data
big data
experimental design
dialect
sign language
url https://www.frontiersin.org/article/10.3389/fpsyg.2019.00313/full
work_keys_str_mv AT evelinaleivada elicitingbigdatafromsmallyoungornonstandardlanguages10experimentalchallenges
AT robertadalessandro elicitingbigdatafromsmallyoungornonstandardlanguages10experimentalchallenges
AT kleantheskgrohmann elicitingbigdatafromsmallyoungornonstandardlanguages10experimentalchallenges
_version_ 1725924850692784128