Assessing the severity of positive valence symptoms in initial psychiatric evaluation records: Should we use convolutional neural networks?

BACKGROUND AND OBJECTIVE: Efficiently capturing the severity of positive valence symptoms could aid in risk stratification for adverse outcomes among patients with psychiatric disorders and identify optimal treatment strategies for patient subgroups. Motivated by the success of convolutional neural n...


Bibliographic Details
Main Authors: Hong-Jie Dai, Jitendra Jonnagaddala
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2018-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC6191093?pdf=render
id doaj-e9d4a5f6a5b34d699ab19a9283d40728
record_format Article
spelling doaj-e9d4a5f6a5b34d699ab19a9283d40728; 2020-11-24T21:50:25Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2018-01-01; 13; 10; e0204493; 10.1371/journal.pone.0204493; Assessing the severity of positive valence symptoms in initial psychiatric evaluation records: Should we use convolutional neural networks?; Hong-Jie Dai; Jitendra Jonnagaddala
collection DOAJ
language English
format Article
sources DOAJ
author Hong-Jie Dai
Jitendra Jonnagaddala
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description BACKGROUND AND OBJECTIVE: Efficiently capturing the severity of positive valence symptoms could aid in risk stratification for adverse outcomes among patients with psychiatric disorders and help identify optimal treatment strategies for patient subgroups. Motivated by the success of convolutional neural networks (CNNs) in classification tasks, we studied the application of various CNN architectures and their performance in predicting the severity of positive valence symptoms in patients with psychiatric disorders based on initial psychiatric evaluation records. METHODS: Psychiatric evaluation records contain unstructured text and semi-structured data such as question-answer pairs. For a given record, we tokenise and normalise the semi-structured content. Pre-processed tokenised words are represented as one-hot encoded word vectors. We then apply different configurations of convolutional and max pooling layers to automatically learn important features from various word representations. We conducted a series of experiments to explore the effect of different CNN architectures on the classification of psychiatric records. RESULTS: Our best CNN model achieved a mean absolute error (MAE) of 0.539 and a normalised MAE of 0.785 on the test dataset, which is comparable to the other well-known text classification algorithms studied in this work. Our results also suggest that the normalisation step has a great impact on the performance of the developed models. CONCLUSIONS: We demonstrate that normalisation of the semi-structured content improves the MAE across all CNN configurations. Without advanced feature engineering, CNN-based approaches can provide a comparable solution for classifying positive valence symptom severity in initial psychiatric evaluation records. Although word embeddings are well known for their ability to capture similarity between words in a relatively low-dimensional space, our experimental results show that pre-trained embeddings do not improve classification performance. This may be because the word embeddings fail to capture problem-specific contextual semantic information, which implies that the quality of the embedding employed is critical for obtaining an accurate CNN model.
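The pipeline named in the description (one-hot word vectors, a convolutional layer, max pooling, evaluation by MAE) can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the function names, the toy sentence, the hand-set filter weights, and the 0 to 3 severity scale are all assumptions made for illustration, and the normalised MAE is left out because the record does not define it.

```python
from typing import Dict, List, Sequence


def build_vocab(docs: Sequence[Sequence[str]]) -> Dict[str, int]:
    """Assign each distinct token an index in order of first appearance."""
    vocab: Dict[str, int] = {}
    for doc in docs:
        for tok in doc:
            vocab.setdefault(tok, len(vocab))
    return vocab


def one_hot(tokens: Sequence[str], vocab: Dict[str, int]) -> List[List[float]]:
    """Represent each token as a one-hot vector of length len(vocab)."""
    rows = []
    for tok in tokens:
        row = [0.0] * len(vocab)
        if tok in vocab:  # unknown tokens stay all-zero
            row[vocab[tok]] = 1.0
        rows.append(row)
    return rows


def conv1d_max_pool(x: List[List[float]],
                    kernel: List[List[float]],
                    bias: float = 0.0) -> float:
    """Slide one filter (width x vocab_size) over the token axis,
    apply ReLU, and return the maximum activation over all positions."""
    width = len(kernel)
    activations = []
    for start in range(len(x) - width + 1):
        total = bias
        for k in range(width):
            total += sum(a * b for a, b in zip(x[start + k], kernel[k]))
        activations.append(max(0.0, total))  # ReLU
    return max(activations) if activations else 0.0


def mean_absolute_error(y_true: Sequence[float],
                        y_pred: Sequence[float]) -> float:
    """Average absolute difference between gold and predicted scores."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical normalised token sequence from a question-answer pair.
tokens = ["patient", "denies", "alcohol", "use"]
vocab = build_vocab([tokens])

# Hand-set width-2 filter that responds to the bigram "denies alcohol".
# In a trained CNN these weights would be learned, not written by hand.
kernel = [[0.0, 1.0, 0.0, 0.0],
          [0.0, 0.0, 1.0, 0.0]]

feature = conv1d_max_pool(one_hot(tokens, vocab), kernel)

# Assumed ordinal severity labels on a 0-3 scale, for illustration only.
mae = mean_absolute_error([0, 1, 2, 3], [0, 2, 2, 1])
```

In a full model, many such filters of different widths would each produce one pooled feature, and a final layer would map those features to a severity score.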
url http://europepmc.org/articles/PMC6191093?pdf=render