Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study
This methodologically-oriented corpus-driven study focuses on distinctive patterns of language use in a specialized text type, namely Russian patient information leaflets. The study’s main goal is to identify keywords and recurrent sequences of words that account for the leaflets’ formulaicity, and...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Peoples' Friendship University of Russia (RUDN University)
2019-12-01
|
Series: | Russian journal of linguistics: Vestnik RUDN |
Subjects: | |
Online Access: | http://journals.rudn.ru/linguistics/article/viewFile/21771/17233 |
id |
doaj-8baba6c5e9404412b4bc313887ce2ca7 |
---|---|
record_format |
Article |
spelling |
doaj-8baba6c5e9404412b4bc313887ce2ca72020-11-25T02:12:31ZengPeoples' Friendship University of Russia (RUDN University)Russian journal of linguistics: Vestnik RUDN2312-91822312-92122019-12-0123365968010.22363/2312-9182-2019-23-3-659-68017905Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven StudyŁukasz Grabowski0University of Opole; University of OstravaThis methodologically-oriented corpus-driven study focuses on distinctive patterns of language use in a specialized text type, namely Russian patient information leaflets. The study’s main goal is to identify keywords and recurrent sequences of words that account for the leaflets’ formulaicity, and - as a secondary goal - to describe their discoursal functions. The keywords were identified using three methods (G2, Hedges’ g and Neozeta) and the overlap between the three metrics was explored. The overlapping keywords were qualitatively analyzed in terms of discoursal functions. As for the distinctive multi-word patterns, we focused on recurrent n-grams with the largest coverage in the corpus: these were identified using the Formulex method (Forsyth, 2015b), which provides complementary data with respect to more conservative n- gram and lexical bundles approaches. The results revealed that the most distinctive keywords were identified using Hedges’ g metric, that the largest overlap occurred between G2 and Neozeta metrics, and that the frequent use and discoursal functions of the identified lexical patterns correspond with situational contexts and communicative purposes of patient information leaflets. It is hoped that this study will provide an opportunity for a methodological reflection and inspire further corpus-driven research on distinctive recurrent lexical patterns (e.g., keywords, n-grams, lexical bundles) or - more generally - on formulaic language in texts originally written in Russian.http://journals.rudn.ru/linguistics/article/viewFile/21771/17233n-gramsformulaic languagephraseologypatient information leafletsRussian language |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Łukasz Grabowski |
spellingShingle |
Łukasz Grabowski Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study Russian journal of linguistics: Vestnik RUDN n-grams formulaic language phraseology patient information leaflets Russian language |
author_facet |
Łukasz Grabowski |
author_sort |
Łukasz Grabowski |
title |
Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study |
title_short |
Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study |
title_full |
Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study |
title_fullStr |
Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study |
title_full_unstemmed |
Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study |
title_sort |
distinctive lexical patterns in russian patient information leaflets: a corpus-driven study |
publisher |
Peoples' Friendship University of Russia (RUDN University) |
series |
Russian journal of linguistics: Vestnik RUDN |
issn |
2312-9182 2312-9212 |
publishDate |
2019-12-01 |
description |
This methodologically-oriented corpus-driven study focuses on distinctive patterns of language use in a specialized text type, namely Russian patient information leaflets. The study’s main goal is to identify keywords and recurrent sequences of words that account for the leaflets’ formulaicity, and - as a secondary goal - to describe their discoursal functions. The keywords were identified using three methods (G2, Hedges’ g and Neozeta) and the overlap between the three metrics was explored. The overlapping keywords were qualitatively analyzed in terms of discoursal functions. As for the distinctive multi-word patterns, we focused on recurrent n-grams with the largest coverage in the corpus: these were identified using the Formulex method (Forsyth, 2015b), which provides complementary data with respect to more conservative n- gram and lexical bundles approaches. The results revealed that the most distinctive keywords were identified using Hedges’ g metric, that the largest overlap occurred between G2 and Neozeta metrics, and that the frequent use and discoursal functions of the identified lexical patterns correspond with situational contexts and communicative purposes of patient information leaflets. It is hoped that this study will provide an opportunity for a methodological reflection and inspire further corpus-driven research on distinctive recurrent lexical patterns (e.g., keywords, n-grams, lexical bundles) or - more generally - on formulaic language in texts originally written in Russian. |
topic |
n-grams formulaic language phraseology patient information leaflets Russian language |
url |
http://journals.rudn.ru/linguistics/article/viewFile/21771/17233 |
work_keys_str_mv |
AT łukaszgrabowski distinctivelexicalpatternsinrussianpatientinformationleafletsacorpusdrivenstudy |
_version_ |
1724908906533093376 |