Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study

This methodologically-oriented corpus-driven study focuses on distinctive patterns of language use in a specialized text type, namely Russian patient information leaflets. The study’s main goal is to identify keywords and recurrent sequences of words that account for the leaflets’ formulaicity, and...

Bibliographic Details
Main Author: Łukasz Grabowski
Format: Article
Language: English
Published: Peoples' Friendship University of Russia (RUDN University) 2019-12-01
Series: Russian journal of linguistics: Vestnik RUDN
Subjects: n-grams; formulaic language; phraseology; patient information leaflets; Russian language
Online Access: http://journals.rudn.ru/linguistics/article/viewFile/21771/17233
id doaj-8baba6c5e9404412b4bc313887ce2ca7
record_format Article
spelling doaj-8baba6c5e9404412b4bc313887ce2ca7; 2020-11-25T02:12:31Z; eng; Peoples' Friendship University of Russia (RUDN University); Russian journal of linguistics: Vestnik RUDN; ISSN 2312-9182, 2312-9212; 2019-12-01; vol. 23, no. 3, pp. 659-680; DOI 10.22363/2312-9182-2019-23-3-659-680; 17905; Distinctive Lexical Patterns in Russian Patient Information Leaflets: A Corpus-Driven Study; Łukasz Grabowski (University of Opole; University of Ostrava)
collection DOAJ
sources DOAJ
issn 2312-9182
2312-9212
description This methodologically oriented, corpus-driven study focuses on distinctive patterns of language use in a specialized text type, namely Russian patient information leaflets. The study’s main goal is to identify keywords and recurrent sequences of words that account for the leaflets’ formulaicity and, as a secondary goal, to describe their discoursal functions. The keywords were identified using three methods (G2, Hedges’ g and Neozeta), and the overlap between the three metrics was explored. The overlapping keywords were qualitatively analyzed in terms of discoursal functions. As for the distinctive multi-word patterns, we focused on recurrent n-grams with the largest coverage in the corpus: these were identified using the Formulex method (Forsyth, 2015b), which provides complementary data with respect to the more conservative n-gram and lexical bundles approaches. The results revealed that the most distinctive keywords were identified using the Hedges’ g metric, that the largest overlap occurred between the G2 and Neozeta metrics, and that the frequent use and discoursal functions of the identified lexical patterns correspond with the situational contexts and communicative purposes of patient information leaflets. It is hoped that this study will provide an opportunity for methodological reflection and inspire further corpus-driven research on distinctive recurrent lexical patterns (e.g., keywords, n-grams, lexical bundles) or, more generally, on formulaic language in texts originally written in Russian.
topic n-grams
formulaic language
phraseology
patient information leaflets
Russian language
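
Two of the keyword metrics named in the description, G2 and Hedges' g, can be illustrated with a minimal Python sketch. It is not taken from the article: the function names `g2` and `hedges_g` are hypothetical, G2 is assumed to be the two-term log-likelihood form commonly used in corpus linguistics, and Hedges' g is assumed to be computed over per-text relative frequencies with the usual small-sample correction. Neozeta is left out because the record does not define it.

```python
import math
from statistics import mean, variance


def g2(a, b, c, d):
    """Log-likelihood keyness (assumed two-term form).

    a, b: frequency of the word in the target and reference corpus.
    c, d: total token counts of the target and reference corpus.
    """
    total = a + b
    expected_target = c * total / (c + d)
    expected_reference = d * total / (c + d)
    score = 0.0
    for observed, expected in ((a, expected_target), (b, expected_reference)):
        if observed > 0:  # a zero count contributes nothing (0 * ln 0 := 0)
            score += observed * math.log(observed / expected)
    return 2 * score


def hedges_g(x, y):
    """Hedges' g for two samples of per-text relative frequencies.

    Raises ZeroDivisionError if both samples have zero variance.
    """
    n1, n2 = len(x), len(y)
    pooled_sd = math.sqrt(
        ((n1 - 1) * variance(x) + (n2 - 1) * variance(y)) / (n1 + n2 - 2)
    )
    cohen_d = (mean(x) - mean(y)) / pooled_sd
    correction = 1 - 3 / (4 * (n1 + n2) - 9)  # small-sample bias correction
    return cohen_d * correction
```

Under these assumptions, the overlap between two metrics can be approximated by intersecting the sets of top-ranked words each metric returns; the cutoff for "top-ranked" is a free choice in this sketch and is not specified in the record.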