Text-Based Detection of the Risk of Depression

This study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents’ linguistic fingerprin...

Full description

Bibliographic Details
Main Authors: Jana M. Havigerová, Jiří Haviger, Dalibor Kučera, Petra Hoffmannová
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-03-01
Series:Frontiers in Psychology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fpsyg.2019.00513/full
id doaj-5a28e676193b42ecbb2c0a43a373c08d
record_format Article
spelling doaj-5a28e676193b42ecbb2c0a43a373c08d2020-11-25T01:20:23ZengFrontiers Media S.A.Frontiers in Psychology1664-10782019-03-011010.3389/fpsyg.2019.00513385113Text-Based Detection of the Risk of DepressionJana M. Havigerová0Jiří Haviger1Dalibor Kučera2Petra Hoffmannová3Institute of Psychology, Masaryk University, Brno, CzechiaDepartment of Informatics and Quantitative Methods, University of Hradec Králové, Hradec Králové, CzechiaDepartment of Pedagogy and Psychology, University of South Bohemia, České Budějovice, CzechiaInstitute of Psychology, Masaryk University, Brno, CzechiaThis study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents’ linguistic fingerprints were traced in four texts of different genres. Depressivity was measured using the Depression, Anxiety and Stress Scale (DASS-21). The research sample (N = 172, 83 men, 89 women) was created by quota sampling an adult Czech population. Morphological variables of the texts showing differences (M-W test) between the non-depressive and depressive groups were incorporated into predictive models. Results: Across all participants, the data best fit predictive models of depressivity using morphological characteristics from the informal text “letter from holidays” (Nagelkerke r2 = 0.526 for men and 0.670 for women). For men, models for the formal texts “cover letter” and “complaint” showed moderate fit with the data (r2 = 0.479 and 0.435). The constructed models show weak to substantial recall (0.235 – 0.800) and moderate to substantial precision (0.571 – 0.889). Morphological variables appearing in the final models vary. There are no key morphological characteristics suitable for all models or for all genres. The resulting models’ properties demonstrate that they should be suitable for screening individuals at risk of depression and the most suitable genre is informal text (“letter from holidays”).https://www.frontiersin.org/article/10.3389/fpsyg.2019.00513/fulldepressiongenremorphologyquantitative linguisticspredictive model
collection DOAJ
language English
format Article
sources DOAJ
author Jana M. Havigerová
Jiří Haviger
Dalibor Kučera
Petra Hoffmannová
spellingShingle Jana M. Havigerová
Jiří Haviger
Dalibor Kučera
Petra Hoffmannová
Text-Based Detection of the Risk of Depression
Frontiers in Psychology
depression
genre
morphology
quantitative linguistics
predictive model
author_facet Jana M. Havigerová
Jiří Haviger
Dalibor Kučera
Petra Hoffmannová
author_sort Jana M. Havigerová
title Text-Based Detection of the Risk of Depression
title_short Text-Based Detection of the Risk of Depression
title_full Text-Based Detection of the Risk of Depression
title_fullStr Text-Based Detection of the Risk of Depression
title_full_unstemmed Text-Based Detection of the Risk of Depression
title_sort text-based detection of the risk of depression
publisher Frontiers Media S.A.
series Frontiers in Psychology
issn 1664-1078
publishDate 2019-03-01
description This study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents’ linguistic fingerprints were traced in four texts of different genres. Depressivity was measured using the Depression, Anxiety and Stress Scale (DASS-21). The research sample (N = 172, 83 men, 89 women) was created by quota sampling an adult Czech population. Morphological variables of the texts showing differences (M-W test) between the non-depressive and depressive groups were incorporated into predictive models. Results: Across all participants, the data best fit predictive models of depressivity using morphological characteristics from the informal text “letter from holidays” (Nagelkerke r2 = 0.526 for men and 0.670 for women). For men, models for the formal texts “cover letter” and “complaint” showed moderate fit with the data (r2 = 0.479 and 0.435). The constructed models show weak to substantial recall (0.235 – 0.800) and moderate to substantial precision (0.571 – 0.889). Morphological variables appearing in the final models vary. There are no key morphological characteristics suitable for all models or for all genres. The resulting models’ properties demonstrate that they should be suitable for screening individuals at risk of depression and the most suitable genre is informal text (“letter from holidays”).
topic depression
genre
morphology
quantitative linguistics
predictive model
url https://www.frontiersin.org/article/10.3389/fpsyg.2019.00513/full
work_keys_str_mv AT janamhavigerova textbaseddetectionoftheriskofdepression
AT jirihaviger textbaseddetectionoftheriskofdepression
AT daliborkucera textbaseddetectionoftheriskofdepression
AT petrahoffmannova textbaseddetectionoftheriskofdepression
_version_ 1725134578093391872