Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.

<h4>Background</h4>Suicide is a major public health concern globally. Accurately predicting suicidal behavior remains challenging. This study aimed to use machine learning approaches to examine the potential of the Swedish national registry data for prediction of suicidal behavior.<h4...

Full description

Bibliographic Details
Main Authors: Qi Chen, Yanli Zhang-James, Eric J Barnett, Paul Lichtenstein, Jussi Jokinen, Brian M D'Onofrio, Stephen V Faraone, Henrik Larsson, Seena Fazel
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-11-01
Series:PLoS Medicine
Online Access:https://doi.org/10.1371/journal.pmed.1003416
id doaj-6d8070d6ee934c849737312fc78a86f0
record_format Article
spelling doaj-6d8070d6ee934c849737312fc78a86f02021-04-21T18:38:58ZengPublic Library of Science (PLoS)PLoS Medicine1549-12771549-16762020-11-011711e100341610.1371/journal.pmed.1003416Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.Qi ChenYanli Zhang-JamesEric J BarnettPaul LichtensteinJussi JokinenBrian M D'OnofrioStephen V FaraoneHenrik LarssonSeena Fazel<h4>Background</h4>Suicide is a major public health concern globally. Accurately predicting suicidal behavior remains challenging. This study aimed to use machine learning approaches to examine the potential of the Swedish national registry data for prediction of suicidal behavior.<h4>Methods and findings</h4>The study sample consisted of 541,300 inpatient and outpatient visits by 126,205 Sweden-born patients (54% female and 46% male) aged 18 to 39 (mean age at the visit: 27.3) years to psychiatric specialty care in Sweden between January 1, 2011 and December 31, 2012. The most common psychiatric diagnoses at the visit were anxiety disorders (20.0%), major depressive disorder (16.9%), and substance use disorders (13.6%). A total of 425 candidate predictors covering demographic characteristics, socioeconomic status (SES), electronic medical records, criminality, as well as family history of disease and crime were extracted from the Swedish registry data. The sample was randomly split into an 80% training set containing 433,024 visits and a 20% test set containing 108,276 visits. Models were trained separately for suicide attempt/death within 90 and 30 days following a visit using multiple machine learning algorithms. Model discrimination and calibration were both evaluated. Among all eligible visits, 3.5% (18,682) were followed by a suicide attempt/death within 90 days and 1.7% (9,099) within 30 days. The final models were based on ensemble learning that combined predictions from elastic net penalized logistic regression, random forest, gradient boosting, and a neural network. The area under the receiver operating characteristic (ROC) curves (AUCs) on the test set were 0.88 (95% confidence interval [CI] = 0.87-0.89) and 0.89 (95% CI = 0.88-0.90) for the outcome within 90 days and 30 days, respectively, both being significantly better than chance (i.e., AUC = 0.50) (p < 0.01). Sensitivity, specificity, and predictive values were reported at different risk thresholds. A limitation of our study is that our models have not yet been externally validated, and thus, the generalizability of the models to other populations remains unknown.<h4>Conclusions</h4>By combining the ensemble method of multiple machine learning algorithms and high-quality data solely from the Swedish registers, we developed prognostic models to predict short-term suicide attempt/death with good discrimination and calibration. Whether novel predictors can improve predictive performance requires further investigation.https://doi.org/10.1371/journal.pmed.1003416
collection DOAJ
language English
format Article
sources DOAJ
author Qi Chen
Yanli Zhang-James
Eric J Barnett
Paul Lichtenstein
Jussi Jokinen
Brian M D'Onofrio
Stephen V Faraone
Henrik Larsson
Seena Fazel
spellingShingle Qi Chen
Yanli Zhang-James
Eric J Barnett
Paul Lichtenstein
Jussi Jokinen
Brian M D'Onofrio
Stephen V Faraone
Henrik Larsson
Seena Fazel
Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.
PLoS Medicine
author_facet Qi Chen
Yanli Zhang-James
Eric J Barnett
Paul Lichtenstein
Jussi Jokinen
Brian M D'Onofrio
Stephen V Faraone
Henrik Larsson
Seena Fazel
author_sort Qi Chen
title Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.
title_short Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.
title_full Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.
title_fullStr Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.
title_full_unstemmed Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.
title_sort predicting suicide attempt or suicide death following a visit to psychiatric specialty care: a machine learning study using swedish national registry data.
publisher Public Library of Science (PLoS)
series PLoS Medicine
issn 1549-1277
1549-1676
publishDate 2020-11-01
description <h4>Background</h4>Suicide is a major public health concern globally. Accurately predicting suicidal behavior remains challenging. This study aimed to use machine learning approaches to examine the potential of the Swedish national registry data for prediction of suicidal behavior.<h4>Methods and findings</h4>The study sample consisted of 541,300 inpatient and outpatient visits by 126,205 Sweden-born patients (54% female and 46% male) aged 18 to 39 (mean age at the visit: 27.3) years to psychiatric specialty care in Sweden between January 1, 2011 and December 31, 2012. The most common psychiatric diagnoses at the visit were anxiety disorders (20.0%), major depressive disorder (16.9%), and substance use disorders (13.6%). A total of 425 candidate predictors covering demographic characteristics, socioeconomic status (SES), electronic medical records, criminality, as well as family history of disease and crime were extracted from the Swedish registry data. The sample was randomly split into an 80% training set containing 433,024 visits and a 20% test set containing 108,276 visits. Models were trained separately for suicide attempt/death within 90 and 30 days following a visit using multiple machine learning algorithms. Model discrimination and calibration were both evaluated. Among all eligible visits, 3.5% (18,682) were followed by a suicide attempt/death within 90 days and 1.7% (9,099) within 30 days. The final models were based on ensemble learning that combined predictions from elastic net penalized logistic regression, random forest, gradient boosting, and a neural network. The area under the receiver operating characteristic (ROC) curves (AUCs) on the test set were 0.88 (95% confidence interval [CI] = 0.87-0.89) and 0.89 (95% CI = 0.88-0.90) for the outcome within 90 days and 30 days, respectively, both being significantly better than chance (i.e., AUC = 0.50) (p < 0.01). Sensitivity, specificity, and predictive values were reported at different risk thresholds. A limitation of our study is that our models have not yet been externally validated, and thus, the generalizability of the models to other populations remains unknown.<h4>Conclusions</h4>By combining the ensemble method of multiple machine learning algorithms and high-quality data solely from the Swedish registers, we developed prognostic models to predict short-term suicide attempt/death with good discrimination and calibration. Whether novel predictors can improve predictive performance requires further investigation.
url https://doi.org/10.1371/journal.pmed.1003416
work_keys_str_mv AT qichen predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT yanlizhangjames predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT ericjbarnett predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT paullichtenstein predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT jussijokinen predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT brianmdonofrio predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT stephenvfaraone predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT henriklarsson predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
AT seenafazel predictingsuicideattemptorsuicidedeathfollowingavisittopsychiatricspecialtycareamachinelearningstudyusingswedishnationalregistrydata
_version_ 1714664584043823104