Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.
Stretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within soci...
Main Authors: | Tyler J Gray, Christopher M Danforth, Peter Sheridan Dodds |
---|---|
Format: | Article |
Language: | English |
Published: | Public Library of Science (PLoS), 2020-01-01 |
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0232938 |
id |
doaj-a3724a9a1388474e8c6a2bf2f0699224 |
---|---|
record_format |
Article |
spelling |
doaj-a3724a9a1388474e8c6a2bf2f0699224; 2021-03-03T21:46:02Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2020-01-01; 15; 5; e0232938; 10.1371/journal.pone.0232938; Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings.; Tyler J Gray; Christopher M Danforth; Peter Sheridan Dodds; Stretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of 'stretchable words' found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, 'balance' and 'stretch', that capture their main characteristics, and explore their dynamics by creating visual tools we call 'balance plots' and 'spelling trees'. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics.; https://doi.org/10.1371/journal.pone.0232938 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tyler J Gray Christopher M Danforth Peter Sheridan Dodds |
title |
Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
Stretched words like 'heellllp' or 'heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of 'stretchable words' found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, 'balance' and 'stretch', that capture their main characteristics, and explore their dynamics by creating visual tools we call 'balance plots' and 'spelling trees'. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics. |
url |
https://doi.org/10.1371/journal.pone.0232938 |