Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care
Background: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the ques...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2019-06-01
|
Series: | Frontiers in Public Health |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fpubh.2019.00146/full |
id |
doaj-8632272bed784d5f92ee2f0dfc073654 |
---|---|
record_format |
Article |
spelling |
doaj-8632272bed784d5f92ee2f0dfc0736542020-11-24T22:04:14ZengFrontiers Media S.A.Frontiers in Public Health2296-25652019-06-01710.3389/fpubh.2019.00146456304Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health CareOdile Sauzet0Odile Sauzet1Oliver Razum2Teresia Widera3Patrick Brzoska4Department of Epidemiology and International Public Health, School of Public Health, Bielefeld University, Bielefeld, GermanyCentre for Statistics, Bielefeld University, Bielefeld, GermanyDepartment of Epidemiology and International Public Health, School of Public Health, Bielefeld University, Bielefeld, GermanyBundesarbeitsgemeinschaft für Rehabilitation e.V., Frankfurt, GermanyHealth Services Research Unit, Faculty of Health, School of Medicine, Witten/Herdecke University, Witten, GermanyBackground: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the questionnaire and those who will always provide a completely positive evaluation irrespective of the item. The aim of the present study is to compare various statistical approaches to the analysis of such data using data from a rehabilitation patient survey of the German Statutory Pension Insurance Scheme as an example.Method: We used data from 272,806 respondents who participated in the survey from 2008 to 2011. We illustrate four models: linear regression, logistic regression, a two-part model based on the assumption of two underlying populations and quantile regression, which does not require any distributional assumptions. For each model we consider the relationship of the satisfaction score with two covariates.Results: While providing correct estimates of the mean values (marginal effects), the assumptions of the linear model are violated which can lead to false interpretations. A two-part regression which consists of a logistic regression followed by a linear regression conditional on not being fully satisfied is a useful alternative. For research questions focusing on specific parts of the distribution, logistic regression as well as quantile regression are to be considered.Discussion: Data with a spike represents a statistical challenge but a range of modeling approaches is available to provide sound interpretations and correct answers to research questions.https://www.frontiersin.org/article/10.3389/fpubh.2019.00146/fullsatisfaction surveytwo-part regression modelsquantile regressiondata with spikelinear regresion |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Odile Sauzet Odile Sauzet Oliver Razum Teresia Widera Patrick Brzoska |
spellingShingle |
Odile Sauzet Odile Sauzet Oliver Razum Teresia Widera Patrick Brzoska Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care Frontiers in Public Health satisfaction survey two-part regression models quantile regression data with spike linear regresion |
author_facet |
Odile Sauzet Odile Sauzet Oliver Razum Teresia Widera Patrick Brzoska |
author_sort |
Odile Sauzet |
title |
Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care |
title_short |
Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care |
title_full |
Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care |
title_fullStr |
Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care |
title_full_unstemmed |
Two-Part Models and Quantile Regression for the Analysis of Survey Data With a Spike. The Example of Satisfaction With Health Care |
title_sort |
two-part models and quantile regression for the analysis of survey data with a spike. the example of satisfaction with health care |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Public Health |
issn |
2296-2565 |
publishDate |
2019-06-01 |
description |
Background: Results of patient satisfaction questionnaires can contain a spike at the value corresponding to a complete satisfaction. A possible interpretation is that there are two types of respondents, those who are willing to provide a negative evaluation to one or more items proposed in the questionnaire and those who will always provide a completely positive evaluation irrespective of the item. The aim of the present study is to compare various statistical approaches to the analysis of such data using data from a rehabilitation patient survey of the German Statutory Pension Insurance Scheme as an example.Method: We used data from 272,806 respondents who participated in the survey from 2008 to 2011. We illustrate four models: linear regression, logistic regression, a two-part model based on the assumption of two underlying populations and quantile regression, which does not require any distributional assumptions. For each model we consider the relationship of the satisfaction score with two covariates.Results: While providing correct estimates of the mean values (marginal effects), the assumptions of the linear model are violated which can lead to false interpretations. A two-part regression which consists of a logistic regression followed by a linear regression conditional on not being fully satisfied is a useful alternative. For research questions focusing on specific parts of the distribution, logistic regression as well as quantile regression are to be considered.Discussion: Data with a spike represents a statistical challenge but a range of modeling approaches is available to provide sound interpretations and correct answers to research questions. |
topic |
satisfaction survey two-part regression models quantile regression data with spike linear regresion |
url |
https://www.frontiersin.org/article/10.3389/fpubh.2019.00146/full |
work_keys_str_mv |
AT odilesauzet twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare AT odilesauzet twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare AT oliverrazum twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare AT teresiawidera twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare AT patrickbrzoska twopartmodelsandquantileregressionfortheanalysisofsurveydatawithaspiketheexampleofsatisfactionwithhealthcare |
_version_ |
1725829755250409472 |