Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care

Background: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the ques...

Full description

Bibliographic Details
Main Authors: Odile Sauzet, Oliver Razum, Teresia Widera, Patrick Brzoska
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-06-01
Series:Frontiers in Public Health
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fpubh.2019.00146/full
id doaj-8632272bed784d5f92ee2f0dfc073654
record_format Article
spelling doaj-8632272bed784d5f92ee2f0dfc0736542020-11-24T22:04:14ZengFrontiers Media S.A.Frontiers in Public Health2296-25652019-06-01710.3389/fpubh.2019.00146456304Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health CareOdile Sauzet0Odile Sauzet1Oliver Razum2Teresia Widera3Patrick Brzoska4Department of Epidemiology and International Public Health, School of Public Health, Bielefeld University, Bielefeld, GermanyCentre for Statistics, Bielefeld University, Bielefeld, GermanyDepartment of Epidemiology and International Public Health, School of Public Health, Bielefeld University, Bielefeld, GermanyBundesarbeitsgemeinschaft für Rehabilitation e.V., Frankfurt, GermanyHealth Services Research Unit, Faculty of Health, School of Medicine, Witten/Herdecke University, Witten, GermanyBackground: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the questionnaire and those who will always provide a completely positive evaluation irrespective of the item. The aim of the present study is to compare various statistical approaches to the analysis of such data using data from a rehabilitation patient survey of the German Statutory Pension Insurance Scheme as an example.Method: We used data from 272,806 respondents who participated in the survey from 2008 to 2011. We illustrate four models: linear regression, logistic regression, a two-part model based on the assumption of two underlying populations and quantile regression, which does not require any distributional assumptions. For each model we consider the relationship of the satisfaction score with two covariates.Results: While providing correct estimates of the mean values (marginal effects), the assumptions of the linear model are violated which can lead to false interpretations. A two-part regression which consists of a logistic regression followed by a linear regression conditional on not being fully satisfied is a useful alternative. For research questions focusing on specific parts of the distribution, logistic regression as well as quantile regression are to be considered.Discussion: Data with a spike represents a statistical challenge but a range of modeling approaches is available to provide sound interpretations and correct answers to research questions.https://www.frontiersin.org/article/10.3389/fpubh.2019.00146/fullsatisfaction surveytwo-part regression modelsquantile regressiondata with spikelinear regresion
collection DOAJ
language English
format Article
sources DOAJ
author Odile Sauzet
Odile Sauzet
Oliver Razum
Teresia Widera
Patrick Brzoska
spellingShingle Odile Sauzet
Odile Sauzet
Oliver Razum
Teresia Widera
Patrick Brzoska
Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
Frontiers in Public Health
satisfaction survey
two-part regression models
quantile regression
data with spike
linear regresion
author_facet Odile Sauzet
Odile Sauzet
Oliver Razum
Teresia Widera
Patrick Brzoska
author_sort Odile Sauzet
title Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
title_short Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
title_full Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
title_fullStr Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
title_full_unstemmed Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
title_sort two-part models and quantile regression for the analysis of survey data with a spike. the example of satisfaction with health care
publisher Frontiers Media S.A.
series Frontiers in Public Health
issn 2296-2565
publishDate 2019-06-01
description Background: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the questionnaire and those who will always provide a completely positive evaluation irrespective of the item. The aim of the present study is to compare various statistical approaches to the analysis of such data using data from a rehabilitation patient survey of the German Statutory Pension Insurance Scheme as an example.Method: We used data from 272,806 respondents who participated in the survey from 2008 to 2011. We illustrate four models: linear regression, logistic regression, a two-part model based on the assumption of two underlying populations and quantile regression, which does not require any distributional assumptions. For each model we consider the relationship of the satisfaction score with two covariates.Results: While providing correct estimates of the mean values (marginal effects), the assumptions of the linear model are violated which can lead to false interpretations. A two-part regression which consists of a logistic regression followed by a linear regression conditional on not being fully satisfied is a useful alternative. For research questions focusing on specific parts of the distribution, logistic regression as well as quantile regression are to be considered.Discussion: Data with a spike represents a statistical challenge but a range of modeling approaches is available to provide sound interpretations and correct answers to research questions.
topic satisfaction survey
two-part regression models
quantile regression
data with spike
linear regresion
url https://www.frontiersin.org/article/10.3389/fpubh.2019.00146/full
work_keys_str_mv AT odilesauzet twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare
AT odilesauzet twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare
AT oliverrazum twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare
AT teresiawidera twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare
AT patrickbrzoska twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare
_version_ 1725829755250409472