Creating a Chinese suicide dictionary for identifying suicide risk on social media
Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to det...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2015-12-01
|
Series: | PeerJ |
Subjects: | |
Online Access: | https://peerj.com/articles/1455.pdf |
id |
doaj-59ef9497e33c44cf9c5b567a56bde735 |
---|---|
record_format |
Article |
spelling |
doaj-59ef9497e33c44cf9c5b567a56bde7352020-11-25T00:47:10ZengPeerJ Inc.PeerJ2167-83592015-12-013e145510.7717/peerj.1455Creating a Chinese suicide dictionary for identifying suicide risk on social mediaMeizhen Lv0Ang Li1Tianli Liu2Tingshao Zhu3Key Lab of Behavioral Science of Chinese Academy of Sciences (CAS), Institute of Psychology, CAS, Beijing, ChinaDepartment of Psychology, Beijing Forestry University, Beijing, ChinaInstitute of Population Research, Peking University, Beijing, ChinaKey Lab of Behavioral Science of Chinese Academy of Sciences (CAS), Institute of Psychology, CAS, Beijing, ChinaIntroduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to detect suicide risk on social media using a Chinese suicide dictionary.Methods. To build the Chinese suicide dictionary, eight researchers were recruited to select initial words from 4,653 posts published on Sina Weibo (the largest social media service provider in China) and two Chinese sentiment dictionaries (HowNet and NTUSD). Then, another three researchers were recruited to filter out irrelevant words. Finally, remaining words were further expanded using a corpus-based method. After building the Chinese suicide dictionary, we tested its performance in identifying suicide risk on Weibo. First, we made a comparison of the performance in both detecting suicidal expression in Weibo posts and evaluating individual levels of suicide risk between the dictionary-based identifications and the expert ratings. Second, to differentiate between individuals with high and non-high scores on self-rating measure of suicide risk (Suicidal Possibility Scale, SPS), we built Support Vector Machines (SVM) models on the Chinese suicide dictionary and the Simplified Chinese Linguistic Inquiry and Word Count (SCLIWC) program, respectively. After that, we made a comparison of the classification performance between two types of SVM models.Results and Discussion. Dictionary-based identifications were significantly correlated with expert ratings in terms of both detecting suicidal expression (r = 0.507) and evaluating individual suicide risk (r = 0.455). For the differentiation between individuals with high and non-high scores on SPS, the Chinese suicide dictionary (t1: F1 = 0.48; t2: F1 = 0.56) produced a more accurate identification than SCLIWC (t1: F1 = 0.41; t2: F1 = 0.48) on different observation windows.Conclusions. This paper confirms that, using social media, it is possible to implement real-time monitoring individual suicide risk in population. Results of this study may be useful to improve Chinese suicide prevention programs and may be insightful for other countries.https://peerj.com/articles/1455.pdfWeiboSuicide riskMicroblogSocial mediaChina |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Meizhen Lv Ang Li Tianli Liu Tingshao Zhu |
spellingShingle |
Meizhen Lv Ang Li Tianli Liu Tingshao Zhu Creating a Chinese suicide dictionary for identifying suicide risk on social media PeerJ Suicide risk Microblog Social media China |
author_facet |
Meizhen Lv Ang Li Tianli Liu Tingshao Zhu |
author_sort |
Meizhen Lv |
title |
Creating a Chinese suicide dictionary for identifying suicide risk on social media |
title_short |
Creating a Chinese suicide dictionary for identifying suicide risk on social media |
title_full |
Creating a Chinese suicide dictionary for identifying suicide risk on social media |
title_fullStr |
Creating a Chinese suicide dictionary for identifying suicide risk on social media |
title_full_unstemmed |
Creating a Chinese suicide dictionary for identifying suicide risk on social media |
title_sort |
creating a chinese suicide dictionary for identifying suicide risk on social media |
publisher |
PeerJ Inc. |
series |
PeerJ |
issn |
2167-8359 |
publishDate |
2015-12-01 |
description |
Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to detect suicide risk on social media using a Chinese suicide dictionary.Methods. To build the Chinese suicide dictionary, eight researchers were recruited to select initial words from 4,653 posts published on Sina Weibo (the largest social media service provider in China) and two Chinese sentiment dictionaries (HowNet and NTUSD). Then, another three researchers were recruited to filter out irrelevant words. Finally, remaining words were further expanded using a corpus-based method. After building the Chinese suicide dictionary, we tested its performance in identifying suicide risk on Weibo. First, we made a comparison of the performance in both detecting suicidal expression in Weibo posts and evaluating individual levels of suicide risk between the dictionary-based identifications and the expert ratings. Second, to differentiate between individuals with high and non-high scores on self-rating measure of suicide risk (Suicidal Possibility Scale, SPS), we built Support Vector Machines (SVM) models on the Chinese suicide dictionary and the Simplified Chinese Linguistic Inquiry and Word Count (SCLIWC) program, respectively. After that, we made a comparison of the classification performance between two types of SVM models.Results and Discussion. Dictionary-based identifications were significantly correlated with expert ratings in terms of both detecting suicidal expression (r = 0.507) and evaluating individual suicide risk (r = 0.455). For the differentiation between individuals with high and non-high scores on SPS, the Chinese suicide dictionary (t1: F1 = 0.48; t2: F1 = 0.56) produced a more accurate identification than SCLIWC (t1: F1 = 0.41; t2: F1 = 0.48) on different observation windows.Conclusions. This paper confirms that, using social media, it is possible to implement real-time monitoring individual suicide risk in population. Results of this study may be useful to improve Chinese suicide prevention programs and may be insightful for other countries. |
topic |
Weibo Suicide risk Microblog Social media China |
url |
https://peerj.com/articles/1455.pdf |
work_keys_str_mv |
AT meizhenlv creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia AT angli creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia AT tianliliu creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia AT tingshaozhu creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia |
_version_ |
1725261416863105024 |