Creating a Chinese suicide dictionary for identifying suicide risk on social media

Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to det...

Full description

Bibliographic Details
Main Authors: Meizhen Lv, Ang Li, Tianli Liu, Tingshao Zhu
Format: Article
Language:English
Published: PeerJ Inc. 2015-12-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/1455.pdf
id doaj-59ef9497e33c44cf9c5b567a56bde735
record_format Article
spelling doaj-59ef9497e33c44cf9c5b567a56bde7352020-11-25T00:47:10ZengPeerJ Inc.PeerJ2167-83592015-12-013e145510.7717/peerj.1455Creating a Chinese suicide dictionary for identifying suicide risk on social mediaMeizhen Lv0Ang Li1Tianli Liu2Tingshao Zhu3Key Lab of Behavioral Science of Chinese Academy of Sciences (CAS), Institute of Psychology, CAS, Beijing, ChinaDepartment of Psychology, Beijing Forestry University, Beijing, ChinaInstitute of Population Research, Peking University, Beijing, ChinaKey Lab of Behavioral Science of Chinese Academy of Sciences (CAS), Institute of Psychology, CAS, Beijing, ChinaIntroduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to detect suicide risk on social media using a Chinese suicide dictionary.Methods. To build the Chinese suicide dictionary, eight researchers were recruited to select initial words from 4,653 posts published on Sina Weibo (the largest social media service provider in China) and two Chinese sentiment dictionaries (HowNet and NTUSD). Then, another three researchers were recruited to filter out irrelevant words. Finally, remaining words were further expanded using a corpus-based method. After building the Chinese suicide dictionary, we tested its performance in identifying suicide risk on Weibo. First, we made a comparison of the performance in both detecting suicidal expression in Weibo posts and evaluating individual levels of suicide risk between the dictionary-based identifications and the expert ratings. Second, to differentiate between individuals with high and non-high scores on self-rating measure of suicide risk (Suicidal Possibility Scale, SPS), we built Support Vector Machines (SVM) models on the Chinese suicide dictionary and the Simplified Chinese Linguistic Inquiry and Word Count (SCLIWC) program, respectively. After that, we made a comparison of the classification performance between two types of SVM models.Results and Discussion. Dictionary-based identifications were significantly correlated with expert ratings in terms of both detecting suicidal expression (r = 0.507) and evaluating individual suicide risk (r = 0.455). For the differentiation between individuals with high and non-high scores on SPS, the Chinese suicide dictionary (t1: F1 = 0.48; t2: F1 = 0.56) produced a more accurate identification than SCLIWC (t1: F1 = 0.41; t2: F1 = 0.48) on different observation windows.Conclusions. This paper confirms that, using social media, it is possible to implement real-time monitoring individual suicide risk in population. Results of this study may be useful to improve Chinese suicide prevention programs and may be insightful for other countries.https://peerj.com/articles/1455.pdfWeiboSuicide riskMicroblogSocial mediaChina
collection DOAJ
language English
format Article
sources DOAJ
author Meizhen Lv
Ang Li
Tianli Liu
Tingshao Zhu
spellingShingle Meizhen Lv
Ang Li
Tianli Liu
Tingshao Zhu
Creating a Chinese suicide dictionary for identifying suicide risk on social media
PeerJ
Weibo
Suicide risk
Microblog
Social media
China
author_facet Meizhen Lv
Ang Li
Tianli Liu
Tingshao Zhu
author_sort Meizhen Lv
title Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_short Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_full Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_fullStr Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_full_unstemmed Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_sort creating a chinese suicide dictionary for identifying suicide risk on social media
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2015-12-01
description Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to detect suicide risk on social media using a Chinese suicide dictionary.Methods. To build the Chinese suicide dictionary, eight researchers were recruited to select initial words from 4,653 posts published on Sina Weibo (the largest social media service provider in China) and two Chinese sentiment dictionaries (HowNet and NTUSD). Then, another three researchers were recruited to filter out irrelevant words. Finally, remaining words were further expanded using a corpus-based method. After building the Chinese suicide dictionary, we tested its performance in identifying suicide risk on Weibo. First, we made a comparison of the performance in both detecting suicidal expression in Weibo posts and evaluating individual levels of suicide risk between the dictionary-based identifications and the expert ratings. Second, to differentiate between individuals with high and non-high scores on self-rating measure of suicide risk (Suicidal Possibility Scale, SPS), we built Support Vector Machines (SVM) models on the Chinese suicide dictionary and the Simplified Chinese Linguistic Inquiry and Word Count (SCLIWC) program, respectively. After that, we made a comparison of the classification performance between two types of SVM models.Results and Discussion. Dictionary-based identifications were significantly correlated with expert ratings in terms of both detecting suicidal expression (r = 0.507) and evaluating individual suicide risk (r = 0.455). For the differentiation between individuals with high and non-high scores on SPS, the Chinese suicide dictionary (t1: F1 = 0.48; t2: F1 = 0.56) produced a more accurate identification than SCLIWC (t1: F1 = 0.41; t2: F1 = 0.48) on different observation windows.Conclusions. This paper confirms that, using social media, it is possible to implement real-time monitoring individual suicide risk in population. Results of this study may be useful to improve Chinese suicide prevention programs and may be insightful for other countries.
topic Weibo
Suicide risk
Microblog
Social media
China
url https://peerj.com/articles/1455.pdf
work_keys_str_mv AT meizhenlv creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
AT angli creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
AT tianliliu creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
AT tingshaozhu creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
_version_ 1725261416863105024