Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.

Stretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within soci...

Full description

Bibliographic Details
Main Authors: Tyler J Gray, Christopher M Danforth, Peter Sheridan Dodds
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0232938
id doaj-a3724a9a1388474e8c6a2bf2f0699224
record_format Article
spelling doaj-a3724a9a1388474e8c6a2bf2f06992242021-03-03T21:46:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01155e023293810.1371/journal.pone.0232938Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.Tyler J GrayChristopher M DanforthPeter Sheridan DoddsStretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of 'stretchable words' found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, 'balance' and 'stretch', that capture their main characteristics, and explore their dynamics by creating visual tools we call 'balance plots' and 'spelling trees'. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics.https://doi.org/10.1371/journal.pone.0232938
collection DOAJ
language English
format Article
sources DOAJ
author Tyler J Gray
Christopher M Danforth
Peter Sheridan Dodds
spellingShingle Tyler J Gray
Christopher M Danforth
Peter Sheridan Dodds
Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
PLoS ONE
author_facet Tyler J Gray
Christopher M Danforth
Peter Sheridan Dodds
author_sort Tyler J Gray
title Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
title_short Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
title_full Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
title_fullStr Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
title_full_unstemmed Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
title_sort hahahahaha, duuuuude, yeeessss!: a two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description Stretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of 'stretchable words' found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, 'balance' and 'stretch', that capture their main characteristics, and explore their dynamics by creating visual tools we call 'balance plots' and 'spelling trees'. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics.
url https://doi.org/10.1371/journal.pone.0232938
work_keys_str_mv AT tylerjgray hahahahahaduuuuudeyeeessssatwoparametercharacterizationofstretchablewordsandthedynamicsofmistypingsandmisspellings
AT christophermdanforth hahahahahaduuuuudeyeeessssatwoparametercharacterizationofstretchablewordsandthedynamicsofmistypingsandmisspellings
AT petersheridandodds hahahahahaduuuuudeyeeessssatwoparametercharacterizationofstretchablewordsandthedynamicsofmistypingsandmisspellings
_version_ 1714815180569837568