Text-Based Detection of the Risk of Depression
This study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents’ linguistic fingerprin...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2019-03-01
|
Series: | Frontiers in Psychology |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fpsyg.2019.00513/full |
id |
doaj-5a28e676193b42ecbb2c0a43a373c08d |
---|---|
record_format |
Article |
spelling |
doaj-5a28e676193b42ecbb2c0a43a373c08d2020-11-25T01:20:23ZengFrontiers Media S.A.Frontiers in Psychology1664-10782019-03-011010.3389/fpsyg.2019.00513385113Text-Based Detection of the Risk of DepressionJana M. Havigerová0Jiří Haviger1Dalibor Kučera2Petra Hoffmannová3Institute of Psychology, Masaryk University, Brno, CzechiaDepartment of Informatics and Quantitative Methods, University of Hradec Králové, Hradec Králové, CzechiaDepartment of Pedagogy and Psychology, University of South Bohemia, České Budějovice, CzechiaInstitute of Psychology, Masaryk University, Brno, CzechiaThis study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents’ linguistic fingerprints were traced in four texts of different genres. Depressivity was measured using the Depression, Anxiety and Stress Scale (DASS-21). The research sample (N = 172, 83 men, 89 women) was created by quota sampling an adult Czech population. Morphological variables of the texts showing differences (M-W test) between the non-depressive and depressive groups were incorporated into predictive models. Results: Across all participants, the data best fit predictive models of depressivity using morphological characteristics from the informal text “letter from holidays” (Nagelkerke r2 = 0.526 for men and 0.670 for women). For men, models for the formal texts “cover letter” and “complaint” showed moderate fit with the data (r2 = 0.479 and 0.435). The constructed models show weak to substantial recall (0.235 – 0.800) and moderate to substantial precision (0.571 – 0.889). Morphological variables appearing in the final models vary. There are no key morphological characteristics suitable for all models or for all genres. The resulting models’ properties demonstrate that they should be suitable for screening individuals at risk of depression and the most suitable genre is informal text (“letter from holidays”).https://www.frontiersin.org/article/10.3389/fpsyg.2019.00513/fulldepressiongenremorphologyquantitative linguisticspredictive model |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jana M. Havigerová Jiří Haviger Dalibor Kučera Petra Hoffmannová |
spellingShingle |
Jana M. Havigerová Jiří Haviger Dalibor Kučera Petra Hoffmannová Text-Based Detection of the Risk of Depression Frontiers in Psychology depression genre morphology quantitative linguistics predictive model |
author_facet |
Jana M. Havigerová Jiří Haviger Dalibor Kučera Petra Hoffmannová |
author_sort |
Jana M. Havigerová |
title |
Text-Based Detection of the Risk of Depression |
title_short |
Text-Based Detection of the Risk of Depression |
title_full |
Text-Based Detection of the Risk of Depression |
title_fullStr |
Text-Based Detection of the Risk of Depression |
title_full_unstemmed |
Text-Based Detection of the Risk of Depression |
title_sort |
text-based detection of the risk of depression |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Psychology |
issn |
1664-1078 |
publishDate |
2019-03-01 |
description |
This study examines the relationship between language use and psychological characteristics of the communicator. The aim of the study was to find models predicting the depressivity of the writer based on the computational linguistic markers of his/her written text. Respondents’ linguistic fingerprints were traced in four texts of different genres. Depressivity was measured using the Depression, Anxiety and Stress Scale (DASS-21). The research sample (N = 172, 83 men, 89 women) was created by quota sampling an adult Czech population. Morphological variables of the texts showing differences (M-W test) between the non-depressive and depressive groups were incorporated into predictive models. Results: Across all participants, the data best fit predictive models of depressivity using morphological characteristics from the informal text “letter from holidays” (Nagelkerke r2 = 0.526 for men and 0.670 for women). For men, models for the formal texts “cover letter” and “complaint” showed moderate fit with the data (r2 = 0.479 and 0.435). The constructed models show weak to substantial recall (0.235 – 0.800) and moderate to substantial precision (0.571 – 0.889). Morphological variables appearing in the final models vary. There are no key morphological characteristics suitable for all models or for all genres. The resulting models’ properties demonstrate that they should be suitable for screening individuals at risk of depression and the most suitable genre is informal text (“letter from holidays”). |
topic |
depression genre morphology quantitative linguistics predictive model |
url |
https://www.frontiersin.org/article/10.3389/fpsyg.2019.00513/full |
work_keys_str_mv |
AT janamhavigerova textbaseddetectionoftheriskofdepression AT jirihaviger textbaseddetectionoftheriskofdepression AT daliborkucera textbaseddetectionoftheriskofdepression AT petrahoffmannova textbaseddetectionoftheriskofdepression |
_version_ |
1725134578093391872 |