A Novel Co-Training-Based Approach for the Classification of Mental Illnesses Using Social Media Posts

Context: Recently, research communities in certain domains have shown growing interest in using social media networks to gain constructive knowledge for decision making and automation, for example to aid software development activities, cryptocurrency usage, network community detection and...

Bibliographic Details
Main Authors: Subhan Tariq, Nadeem Akhtar, Humaira Afzal, Shahzad Khalid, Muhammad Rafiq Mufti, Shahid Hussain, Asad Habib, Ghufran Ahmad
Format: Article
Language: English
Published: IEEE 2019-01-01
Series: IEEE Access
Subjects: Mental disease; reddit; anxiety; depression; bipolar; ADHD
Online Access: https://ieeexplore.ieee.org/document/8901145/
id doaj-863cb3a4b3e64b83a0ea9adb3f272399
record_format Article
spelling doaj-863cb3a4b3e64b83a0ea9adb3f272399
2021-03-29T23:01:36Z
eng
IEEE
IEEE Access
2169-3536
2019-01-01
volume 7
pages 166165-166172
DOI 10.1109/ACCESS.2019.2953087
IEEE Xplore document 8901145
Subhan Tariq (0) https://orcid.org/0000-0001-9554-8367
Nadeem Akhtar (1) https://orcid.org/0000-0003-2475-5590
Humaira Afzal (2) https://orcid.org/0000-0001-9054-8798
Shahzad Khalid (3) https://orcid.org/0000-0003-0899-7354
Muhammad Rafiq Mufti (4) https://orcid.org/0000-0002-1267-5510
Shahid Hussain (5) https://orcid.org/0000-0002-1006-8952
Asad Habib (6) https://orcid.org/0000-0003-2846-5347
Ghufran Ahmad (7) https://orcid.org/0000-0002-0077-9638
Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan
Department of Computer Science and IT, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
Department of Computer Science, Bahauddin Zakariya University, Multan, Pakistan
Department of Computer Engineering, Bahria University, Islamabad, Pakistan
Department of Computer Science, COMSATS University Islamabad, Vehari Campus, Vehari, Pakistan
Department of Computer Science, Kohat University of Science and Technology, Kohat, Pakistan
Department of Computer Science, Kohat University of Science and Technology, Kohat, Pakistan
Department of Computer Science, COMSATS University Islamabad, Islamabad, Pakistan
collection DOAJ
language English
format Article
sources DOAJ
author Subhan Tariq
Nadeem Akhtar
Humaira Afzal
Shahzad Khalid
Muhammad Rafiq Mufti
Shahid Hussain
Asad Habib
Ghufran Ahmad
author_sort Subhan Tariq
title A Novel Co-Training-Based Approach for the Classification of Mental Illnesses Using Social Media Posts
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Context: Recently, research communities in certain domains have shown growing interest in using social media networks to gain constructive knowledge for decision making and automation, for example to aid software development activities, cryptocurrency usage, and network community detection and recommendation. Within eHealth, among other areas, the use of social media and big data analytics has become a hot topic for predicting whether a patient has a mental illness such as depression, schizophrenia, eating disorders, anxiety, or addictive behaviors. Problem: Traditional methods need either sufficient historical data or regular monitoring of patient activities to identify a patient with a mental illness. Method: To address this issue, we propose a methodology to classify patients with chronic mental illnesses (i.e., Anxiety, Depression, Bipolar, and ADHD (Attention Deficit Hyperactivity Disorder)) based on data extracted from Reddit, a well-known network community platform. The proposed method uses Co-training (a type of semi-supervised learning) and incorporates the discriminative power of the widely used classifiers Random Forest (RF), Support Vector Machine (SVM), and Naïve Bayes (NB). We used the Reddit API to download posts and their top five associated comments to construct the feature space. Results: The experimental results indicate that Co-training-based classification outperforms the state-of-the-art classifiers by a margin of 3% on average against each state-of-the-art technique. In future, the proposed method could be applied to classification problems in other domains by extracting data from social media.
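The co-training idea named in the description can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the record does not say how the two views are formed, so the sketch assumes view 0 is the post text and view 1 is the text of its comments; a tiny dependency-free Naive Bayes (`TinyNB`, a hypothetical helper) stands in for the RF, SVM, and NB classifiers; and the example texts, the two-class setup, the function names, and the `rounds`, `per_round`, and `threshold` parameters are all invented for illustration.

```python
import math
from collections import Counter, defaultdict


class TinyNB:
    """Multinomial Naive Bayes over whitespace tokens (add-one smoothing)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict_proba(self, text):
        total = sum(self.class_counts.values())
        scores = {}
        for label, n in self.class_counts.items():
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            score = math.log(n / total)
            for word in text.lower().split():
                if word in self.vocab:  # words never seen in training are skipped
                    score += math.log((self.word_counts[label][word] + 1) / denom)
            scores[label] = score
        top = max(scores.values())
        exp = {label: math.exp(s - top) for label, s in scores.items()}
        norm = sum(exp.values())
        return {label: v / norm for label, v in exp.items()}


def train_views(labeled):
    """Fit one classifier per view: index 0 = post text, index 1 = comment text."""
    labels = [y for _, y in labeled]
    return [TinyNB().fit([x[v] for x, _ in labeled], labels) for v in (0, 1)]


def co_train(labeled, unlabeled, rounds=5, per_round=1, threshold=0.6):
    """Grow the labeled set with each view's most confident pseudo-labels."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        picked = {}
        for view, model in enumerate(train_views(labeled)):
            scored = []
            for i, example in enumerate(pool):
                proba = model.predict_proba(example[view])
                label = max(proba, key=proba.get)
                scored.append((proba[label], i, label))
            # Each view nominates its most confident pool examples.
            for conf, i, label in sorted(scored, reverse=True)[:per_round]:
                if conf >= threshold:
                    picked.setdefault(i, label)  # on conflict, the first view wins
        if not picked:
            break
        labeled += [(pool[i], label) for i, label in picked.items()]
        pool = [x for i, x in enumerate(pool) if i not in picked]
    return train_views(labeled), labeled


def predict(models, example):
    """Combine both views by multiplying their class probabilities."""
    combined = {}
    for view, model in enumerate(models):
        for label, p in model.predict_proba(example[view]).items():
            combined[label] = combined.get(label, 1.0) * p
    return max(combined, key=combined.get)


# Toy data, invented for illustration: (post text, comment text) pairs.
seed = [
    (("i feel anxious and panic before exams",
      "panic attacks are hard breathing helps"), "anxiety"),
    (("cannot focus and keep losing my keys",
      "medication helps me focus with adhd"), "adhd"),
]
unlabeled = [
    ("i panic and feel anxious at night", "breathing exercises calm panic attacks"),
    ("keep losing focus and my keys at work", "adhd medication helps with focus"),
]

models, grown = co_train(seed, unlabeled)
print(predict(models, ("panic at night", "breathing helps")))
```

Each round retrains both view classifiers on the current labeled set, lets each nominate the pool examples it is most confident about, and moves those into the labeled set with their predicted labels. A real pipeline would replace `TinyNB` with stronger classifiers and the toy pairs with text fetched from Reddit.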
topic Mental disease
reddit
anxiety
depression
bipolar
ADHD
url https://ieeexplore.ieee.org/document/8901145/