Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder

ABSTRACT Objectives Our group has investigated the occurrence of psychotic(-like) experiences (PLEs) in Twitter posts, namely auditory hallucinations. Tweets classified as potentially related to auditory hallucinations were proportionately higher between 23:00 and 5:00 in comparison to tweets not...


Bibliographic Details
Main Authors: Mladen Dinev, Maksim Belousov, Rohan Morris, Natalie Berry, Goran Nenadic
Format: Article
Language: English
Published: Swansea University 2017-04-01
Series: International Journal of Population Data Science
Online Access: https://ijpds.org/article/view/370
id doaj-d46e1318e7084741a1d2e5c04bcf3db5
record_format Article
spelling doaj-d46e1318e7084741a1d2e5c04bcf3db5; 2020-11-24T23:32:45Z; eng; Swansea University; International Journal of Population Data Science; 2399-4908; 2017-04-01; 1; 1; 10.23889/ijpds.v1i1.370; 370; Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder; Mladen Dinev (University of Manchester); Maksim Belousov (University of Manchester); Rohan Morris (University of Manchester); Natalie Berry (University of Manchester); Goran Nenadic (University of Manchester); https://ijpds.org/article/view/370
collection DOAJ
language English
format Article
sources DOAJ
author Mladen Dinev
Maksim Belousov
Rohan Morris
Natalie Berry
Goran Nenadic
author_sort Mladen Dinev
title Using Twitter to mine sleep related information from people who declare a diagnosis of a psychotic disorder
publisher Swansea University
series International Journal of Population Data Science
issn 2399-4908
publishDate 2017-04-01
description ABSTRACT
Objectives Our group has investigated the occurrence of psychotic(-like) experiences (PLEs) in Twitter posts, namely auditory hallucinations. Tweets classified as potentially related to auditory hallucinations were proportionately more frequent between 23:00 and 5:00 than tweets not so classified. This may indicate a clinically significant relationship between sleep and PLEs in the general population, a notion supported by the literature. Building on that investigation, the current study aimed to explore whether the methodology could be amended to generate datasets regarding sleep experiences in people who self-report a diagnosis of a psychotic disorder.
Approach The current investigation seeks to establish whether it is feasible to generate anonymised datasets regarding sleep by extracting information from the timelines of people who self-report a psychotic diagnosis. A text mining method was implemented that used rule-based semantic filters to identify self-reported diagnoses. The filters focused on occurrences of personal and possessive pronouns to detect the subjectivity of tweets, as well as potential diagnostic verb indicators and any mentions of other related factors. For each diagnostic tweet, we collected information from the user's timeline. A sleep-related classifier was then implemented, which used lexical features (e.g. bag-of-words, part-of-speech tags) to predict whether a given tweet refers to a sleep-related experience.
Results After training the classifier on the bag-of-words model, the most informative words which contributed to the performance of the classifier were: ‘sleep’, ‘can’t awake’, ‘never’, ‘stress’. Part-of-speech tags (e.g. verbs, adverbs) were also important features. The classification accuracy of the ‘bag-of-words’ model was better than that of the ‘part-of-speech’ model. Through the method outlined herein, we were able to improve the quality of the generated datasets in comparison to the previous investigation. This methodology also facilitated mining the timelines of individual Twitter users who stated a personal diagnosis. To this end, an additional filter was implemented to identify tweets regarding sleep experience. The potential relationship between sentiment and temporality expressed in diagnosis and sleep experiences is also discussed.
Conclusions The results from this study have implications for mental health research on Twitter. Specifically, the refinements in the methodology enabled retrieval of two high-quality datasets regarding psychosis and sleep. It is therefore feasible that other psychosis-related phenomena (e.g. visual hallucinations, delusions, medication) could also be applied as separate filters to create one dataset of psychosis-related experiences within individuals diagnosed with psychosis.
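The rule-based filter described in the Approach (a first-person or possessive pronoun, a diagnostic verb indicator, and a mention of a diagnosis) could be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the word lists, the function names `tokenize` and `is_self_reported_diagnosis`, and the requirement that all three cues appear in the same tweet are assumptions made for the example, since the abstract does not give the actual rules.

```python
import re

# Hypothetical lexicons chosen for illustration only; the study's actual
# rules and term lists are not given in the abstract.
FIRST_PERSON = {"i", "i'm", "i've", "im", "ive", "my", "me", "mine"}
DIAGNOSTIC_VERBS = {"diagnosed", "have", "suffer", "suffering", "got"}
DIAGNOSIS_TERMS = {
    "schizophrenia", "schizophrenic", "psychosis",
    "psychotic", "schizoaffective",
}

# Lowercase words, keeping apostrophes so that "i'm" stays one token.
TOKEN_RE = re.compile(r"[a-z']+")


def tokenize(text):
    """Lowercase the text, normalise curly apostrophes, and split into words."""
    return TOKEN_RE.findall(text.lower().replace("\u2019", "'"))


def is_self_reported_diagnosis(tweet):
    """Return True when the tweet contains all three cues (assumed rule):
    a first-person or possessive pronoun, a diagnostic verb, and a
    diagnosis term."""
    tokens = set(tokenize(tweet))
    return (
        bool(tokens & FIRST_PERSON)
        and bool(tokens & DIAGNOSTIC_VERBS)
        and bool(tokens & DIAGNOSIS_TERMS)
    )


if __name__ == "__main__":
    examples = [
        "I was diagnosed with schizophrenia last year",
        "My brother has psychosis",
        "Sleep is important",
    ]
    for text in examples:
        print(is_self_reported_diagnosis(text), text)
```

A set-intersection rule like this ignores word order, so a tweet such as "My brother was diagnosed with psychosis" would still pass. Distinguishing such cases presumably requires the finer-grained semantic rules the abstract alludes to.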
url https://ijpds.org/article/view/370