Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data

Introduction: Clinical trials of blended Internet-based treatments deliver a wealth of data from various sources, such as self-report questionnaires, diagnostic interviews, treatment platform log files and Ecological Momentary Assessments (EMA). Mining these complex data for clinically relevant patt...

Full description

Bibliographic Details
Main Authors: Artur Rocha, Rui Camacho, Jeroen Ruwaard, Heleen Riper
Format: Article
Language:English
Published: Elsevier 2018-06-01
Series:Internet Interventions
Online Access:http://www.sciencedirect.com/science/article/pii/S2214782917300763
id doaj-663523cc8bd24b8a83ae01c967ba165c
record_format Article
spelling doaj-663523cc8bd24b8a83ae01c967ba165c2020-11-24T22:49:00ZengElsevierInternet Interventions2214-78292018-06-0112176180Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log dataArtur Rocha0Rui Camacho1Jeroen Ruwaard2Heleen Riper3INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal; Corresponding author.DEI & Faculdade de Engenharia & LIAAD-INESC TEC, Universidade do Porto, PortugalVrije Universiteit Amsterdam, Department of Clinical Psychology, De Boelelaan 1081, Amsterdam 1081 HV, NetherlandsVrije Universiteit Amsterdam, Department of Clinical Psychology, De Boelelaan 1081, Amsterdam 1081 HV, NetherlandsIntroduction: Clinical trials of blended Internet-based treatments deliver a wealth of data from various sources, such as self-report questionnaires, diagnostic interviews, treatment platform log files and Ecological Momentary Assessments (EMA). Mining these complex data for clinically relevant patterns is a daunting task for which no definitive best method exists. In this paper, we explore the expressive power of the multi-relational Inductive Logic Programming (ILP) data mining approach, using combined trial data of the EU E-COMPARED depression trial. Methods: We explored the capability of ILP to handle and combine (implicit) multiple relationships in the E-COMPARED data. This data set has the following features that favor ILP analysis: 1) Time reasoning is involved; 2) there is a reasonable amount of explicit useful relations to be analyzed; 3) ILP is capable of building comprehensible models that might be perceived as putative explanations by domain experts; 4) both numerical and statistical models may coexist within ILP models if necessary. In our analyses, we focused on scores of the PHQ-8 self-report questionnaire (which taps depressive symptom severity), and on EMA of mood and various other clinically relevant factors. Both measures were administered during treatment, which lasted between 9 to 16 weeks. Results: E-COMPARED trial data revealed different individual improvement patterns: PHQ-8 scores suggested that some individuals improved quickly during the first weeks of the treatment, while others improved at a (much) slower pace, or not at all. Combining self-reported Ecological Momentary Assessments (EMA), PHQ-8 scores and log data about the usage of the ICT4D platform in the context of blended care, we set out to unveil possible causes for these different trajectories. Discussion: This work complements other studies into alternative data mining approaches to E-COMPARED trial data analysis, which are all aimed to identify clinically meaningful predictors of system use and treatment outcome. Strengths and limitations of the ILP approach given this objective will be discussed. Keywords: Multi-relational data mining, Internet intervention, Moodbuster, Log data, Ecological momentary assessmenthttp://www.sciencedirect.com/science/article/pii/S2214782917300763
collection DOAJ
language English
format Article
sources DOAJ
author Artur Rocha
Rui Camacho
Jeroen Ruwaard
Heleen Riper
spellingShingle Artur Rocha
Rui Camacho
Jeroen Ruwaard
Heleen Riper
Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
Internet Interventions
author_facet Artur Rocha
Rui Camacho
Jeroen Ruwaard
Heleen Riper
author_sort Artur Rocha
title Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
title_short Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
title_full Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
title_fullStr Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
title_full_unstemmed Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
title_sort using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data
publisher Elsevier
series Internet Interventions
issn 2214-7829
publishDate 2018-06-01
description Introduction: Clinical trials of blended Internet-based treatments deliver a wealth of data from various sources, such as self-report questionnaires, diagnostic interviews, treatment platform log files and Ecological Momentary Assessments (EMA). Mining these complex data for clinically relevant patterns is a daunting task for which no definitive best method exists. In this paper, we explore the expressive power of the multi-relational Inductive Logic Programming (ILP) data mining approach, using combined trial data of the EU E-COMPARED depression trial. Methods: We explored the capability of ILP to handle and combine (implicit) multiple relationships in the E-COMPARED data. This data set has the following features that favor ILP analysis: 1) Time reasoning is involved; 2) there is a reasonable amount of explicit useful relations to be analyzed; 3) ILP is capable of building comprehensible models that might be perceived as putative explanations by domain experts; 4) both numerical and statistical models may coexist within ILP models if necessary. In our analyses, we focused on scores of the PHQ-8 self-report questionnaire (which taps depressive symptom severity), and on EMA of mood and various other clinically relevant factors. Both measures were administered during treatment, which lasted between 9 to 16 weeks. Results: E-COMPARED trial data revealed different individual improvement patterns: PHQ-8 scores suggested that some individuals improved quickly during the first weeks of the treatment, while others improved at a (much) slower pace, or not at all. Combining self-reported Ecological Momentary Assessments (EMA), PHQ-8 scores and log data about the usage of the ICT4D platform in the context of blended care, we set out to unveil possible causes for these different trajectories. Discussion: This work complements other studies into alternative data mining approaches to E-COMPARED trial data analysis, which are all aimed to identify clinically meaningful predictors of system use and treatment outcome. Strengths and limitations of the ILP approach given this objective will be discussed. Keywords: Multi-relational data mining, Internet intervention, Moodbuster, Log data, Ecological momentary assessment
url http://www.sciencedirect.com/science/article/pii/S2214782917300763
work_keys_str_mv AT arturrocha usingmultirelationaldataminingtodiscriminateblendedtherapyefficiencyonpatientsbasedonlogdata
AT ruicamacho usingmultirelationaldataminingtodiscriminateblendedtherapyefficiencyonpatientsbasedonlogdata
AT jeroenruwaard usingmultirelationaldataminingtodiscriminateblendedtherapyefficiencyonpatientsbasedonlogdata
AT heleenriper usingmultirelationaldataminingtodiscriminateblendedtherapyefficiencyonpatientsbasedonlogdata
_version_ 1725677700883939328