Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder
ABSTRACT Objectives Our group has investigated the occurrence of psychotic(-like) experiences (PLEs) in Twitter posts, namely auditory hallucinations. Tweets classified as potentially related to auditory hallucinations were proportionately higher between 23:00 and 5:00 in comparison to tweets not...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Swansea University
2017-04-01
|
Series: | International Journal of Population Data Science |
Online Access: | https://ijpds.org/article/view/370 |
id |
doaj-d46e1318e7084741a1d2e5c04bcf3db5 |
---|---|
record_format |
Article |
spelling |
doaj-d46e1318e7084741a1d2e5c04bcf3db52020-11-24T23:32:45ZengSwansea UniversityInternational Journal of Population Data Science2399-49082017-04-011110.23889/ijpds.v1i1.370370Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorderMladen Dinev0Maksim Belousov1Rohan Morris2Natalie Berry3Goran Nenadic4University of ManchesterUniversity of ManchesterUniversity of ManchesterUniversity of ManchesterUniversity of ManchesterABSTRACT Objectives Our group has investigated the occurrence of psychotic(-like) experiences (PLEs) in Twitter posts, namely auditory hallucinations. Tweets classified as potentially related to auditory hallucinations were proportionately higher between 23:00 and 5:00 in comparison to tweets not classified. This may indicate a clinically significant relationship between sleep and PLEs in the general population, a notion supported by the literature. Based on our previous investigation, the current study aimed to explore whether this methodology could be amended to generate datasets regarding sleep experiences in people who self-report a diagnosis of a psychotic disorder. Approach The current investigation seeks to establish if it is feasible to generate anonymised datasets regarding sleep by extracting information from the timelines of people who self-report a psychotic diagnosis. A text mining method was implemented that utilised rule-based semantic filters that aimed to identify self-reported diagnoses. This focused on occurrences of personal and possessive pronouns to detect the subjectivity of tweets, as well as potential diagnostic verb indicators and any mentions of other related factors. For each diagnostic tweet, we collected information from user timelines. A sleep-related classifier was then implemented, which used lexical features (e.g. bag-of-words, part-of-speech tags) to predict whether a given tweet refers to sleep-related experience. Results After training the classifier on the bag-of-words model, the most informative words which contributed to the performance of the classifier were: ‘sleep’, ‘can’t awake’, ‘never’, ‘stress’. Part-of-speech tags (e.g. verbs, adverbs) were also important features. The classification accuracy of the ‘bag-of-words’ model was better than the ‘part-of-speech’ model. Through the method outlined herein, we were able to improve the quality of the generated datasets in comparison to the previous investigation. This methodology also facilitated the mining of individual Twitter users timelines who stated a personal diagnosis. To this end, an additional filter was implemented to identify tweets regarding sleep experience. The potential relationship between sentiment and temporality expressed in diagnosis and sleep experiences are also discussed. Conclusions The results from this study have implications for mental health research on Twitter. Specifically, the refinements in the methodology enabled retrieval of two high quality datasets regarding psychosis and sleep. Therefore it is feasible other psychosis-related phenomena (e.g. visual hallucinations, delusions, medication) could also be applied as separate filters to create one dataset of psychosis-related experiences within individuals diagnosed with psychosis.https://ijpds.org/article/view/370 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mladen Dinev Maksim Belousov Rohan Morris Natalie Berry Goran Nenadic |
spellingShingle |
Mladen Dinev Maksim Belousov Rohan Morris Natalie Berry Goran Nenadic Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder International Journal of Population Data Science |
author_facet |
Mladen Dinev Maksim Belousov Rohan Morris Natalie Berry Goran Nenadic |
author_sort |
Mladen Dinev |
title |
Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder |
title_short |
Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder |
title_full |
Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder |
title_fullStr |
Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder |
title_full_unstemmed |
Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder |
title_sort |
using twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder |
publisher |
Swansea University |
series |
International Journal of Population Data Science |
issn |
2399-4908 |
publishDate |
2017-04-01 |
description |
ABSTRACT
Objectives
Our group has investigated the occurrence of psychotic(-like) experiences (PLEs) in Twitter posts, namely auditory hallucinations. Tweets classified as potentially related to auditory hallucinations were proportionately higher between 23:00 and 5:00 in comparison to tweets not classified. This may indicate a clinically significant relationship between sleep and PLEs in the general population, a notion supported by the literature. Based on our previous investigation, the current study aimed to explore whether this methodology could be amended to generate datasets regarding sleep experiences in people who self-report a diagnosis of a psychotic disorder.
Approach
The current investigation seeks to establish if it is feasible to generate anonymised datasets regarding sleep by extracting information from the timelines of people who self-report a psychotic diagnosis. A text mining method was implemented that utilised rule-based semantic filters that aimed to identify self-reported diagnoses. This focused on occurrences of personal and possessive pronouns to detect the subjectivity of tweets, as well as potential diagnostic verb indicators and any mentions of other related factors. For each diagnostic tweet, we collected information from user timelines. A sleep-related classifier was then implemented, which used lexical features (e.g. bag-of-words, part-of-speech tags) to predict whether a given tweet refers to sleep-related experience.
Results
After training the classifier on the bag-of-words model, the most informative words which contributed to the performance of the classifier were: ‘sleep’, ‘can’t awake’, ‘never’, ‘stress’. Part-of-speech tags (e.g. verbs, adverbs) were also important features. The classification accuracy of the ‘bag-of-words’ model was better than the ‘part-of-speech’ model.
Through the method outlined herein, we were able to improve the quality of the generated datasets in comparison to the previous investigation. This methodology also facilitated the mining of individual Twitter users timelines who stated a personal diagnosis. To this end, an additional filter was implemented to identify tweets regarding sleep experience. The potential relationship between sentiment and temporality expressed in diagnosis and sleep experiences are also discussed.
Conclusions
The results from this study have implications for mental health research on Twitter. Specifically, the refinements in the methodology enabled retrieval of two high quality datasets regarding psychosis and sleep. Therefore it is feasible other psychosis-related phenomena (e.g. visual hallucinations, delusions, medication) could also be applied as separate filters to create one dataset of psychosis-related experiences within individuals diagnosed with psychosis. |
url |
https://ijpds.org/article/view/370 |
work_keys_str_mv |
AT mladendinev usingtwittertominesleeprelatedinformationfrompeoplewhodeclareadiagnosisofapsychoticdisorder AT maksimbelousov usingtwittertominesleeprelatedinformationfrompeoplewhodeclareadiagnosisofapsychoticdisorder AT rohanmorris usingtwittertominesleeprelatedinformationfrompeoplewhodeclareadiagnosisofapsychoticdisorder AT natalieberry usingtwittertominesleeprelatedinformationfrompeoplewhodeclareadiagnosisofapsychoticdisorder AT gorannenadic usingtwittertominesleeprelatedinformationfrompeoplewhodeclareadiagnosisofapsychoticdisorder |
_version_ |
1725533337054871552 |