Stylo visualisations of Middle English documents

Bibliographic Details
Main Author:	Martti Mäkinen
Format:	Article
Language:	English
Published:	Nicolas Turenne 2020-12-01
Series:	Journal of Data Mining and Digital Humanities
Subjects:	middle english; historical dialectology; diatopical variation; unattended analysis; stylometry; authorship attribution; r; non-standard spelling; [shs.langue]humanities and social sciences/linguistics; [shs]humanities and social sciences
Online Access:	https://jdmdh.episciences.org/7022/pdf

Description
Summary:	Automated approaches to identifying the authorship of a text have become commonplace in stylometric studies. The current article applies an unsupervised stylometric approach to Middle English documents using the script Stylo in R, in an attempt to distinguish between texts from different dialectal areas. The approach is based on the distribution of character 3-grams generated from the texts of the Corpus of Middle English Local Documents (MELD). The article adopts a middle ground in the study of Middle English spelling variation, between the concept of relational linguistic space and the real linguistic continuum of medieval England. Stylo can distinguish between Middle English dialects by using the less frequent character 3-grams.
ISSN:	2416-5999
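
Illustration:	The summary refers to distributions of character 3-grams. The following minimal Python sketch shows what such a distribution looks like for a single text. It is not the Stylo script and not the article's procedure; the function names and the two sample strings are invented for illustration and are not taken from MELD.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return all overlapping character n-grams of a text.

    The text is lowercased and runs of whitespace are collapsed to a
    single space (an assumption of this sketch, not a documented Stylo step).
    """
    normalised = " ".join(text.lower().split())
    return [normalised[i:i + n] for i in range(len(normalised) - n + 1)]


def relative_frequencies(text, n=3):
    """Map each character n-gram to its share of all n-grams in the text."""
    grams = char_ngrams(text, n)
    total = len(grams)
    if total == 0:
        return {}
    counts = Counter(grams)
    return {gram: count / total for gram, count in counts.items()}


# Hypothetical spellings of the same phrase, invented for this sketch.
sample_a = "the whiche londes and tenementes"
sample_b = "the qwiche landes and tenementz"

freq_a = relative_frequencies(sample_a)
freq_b = relative_frequencies(sample_b)

# 3-grams that occur in one sample but not the other hint at spelling differences.
only_in_a = sorted(set(freq_a) - set(freq_b))
only_in_b = sorted(set(freq_b) - set(freq_a))
```

Comparing such frequency tables across many documents is the kind of input an unsupervised method can cluster; how Stylo selects and weights the 3-grams is described in the article itself.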

Similar Items