Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments
Background: Commonly used response scales in patient-reported outcome (PRO) measures include the visual analogue scale, 11-point numeric rating scale, 5-point numeric rating scale, 5-point verbal rating scale, and 5-point verbal-numeric rating scale. Although prior studies have explored the interpr...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
The University of Arizona.
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10150/306918 |
id |
ndltd-arizona.edu-oai-arizona.openrepository.com-10150-306918 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-arizona.edu-oai-arizona.openrepository.com-10150-3069182015-10-23T05:29:07Z Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments Mutebi, Alex Coons, Stephen Joel Slack, Marion K. Coons, Stephen Joel Slack, Marion K. Roe, Denise J. Skrepnek, Grant H. Warholak, Terri J. Response Scales Pharmaceutical Sciences Patient Reported Outcome Instruments Background: Commonly used response scales in patient-reported outcome (PRO) measures include the visual analogue scale, 11-point numeric rating scale, 5-point numeric rating scale, 5-point verbal rating scale, and 5-point verbal-numeric rating scale. Although prior studies have explored the interpretation of response scale labels and compared scores resulting from the response scale, many questions remain. Purpose: To identify sets of verbal descriptors interpreted with the least variation and to explore whether the response scales provide interval level data and whether the scales are interchangeable. Methods: Subject recruitment and screening was through an online drug-drug interaction service (MediGuard.org). Via an online survey platform, subjects used a scale (0 = lowest possible and 10=highest possible) to assign interpretation scores to verbal descriptors. Repeated measures analysis of variance informed the test interval data between scores. Subjects also completed repeated administrations of four symptom-specific item stems with different response scales. Ordinal regression informed the analysis of scores assigned to verbal descriptors, comparison of probabilities of responding in given categories across scales, and prediction of response category on one scale conditional on observed response on another scale. Cut-points informed tests for interval level data. Results: The sample (n=350) comprised 223 females and 127 males with a mean (SD) age of 56.9 (12.1) years. Number of health conditions per subject ranged from 1 to 12 (median = 5). Age, sex, level of education, and number of health conditions were associated with the interpretation of verbal descriptors. Scores assigned to "poor," "fair," "good," "very good," "excellent," "somewhat," "sometimes," and "quite a bit," had the largest variation. The probability of responding in the same categories on the different response scales was significantly different across scales before and after collapsing categories. No scale yielded interval level data. The 11-NRS data tended more towards interval level than the data from other scales. Conclusions: Using different response scales with verbal descriptors in non-randomized studies may introduce bias. Differential item functioning and subgroup analyses should be investigated in the development and use of these response scales. The scales are not interchangeable. Compared with other scales the 11-NRS produced data approaching interval level. Collapsing categories entails significant probabilities of misclassification. 2013 text Electronic Dissertation http://hdl.handle.net/10150/306918 en_US Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. The University of Arizona. |
collection |
NDLTD |
language |
en_US |
sources |
NDLTD |
topic |
Response Scales Pharmaceutical Sciences Patient Reported Outcome Instruments |
spellingShingle |
Response Scales Pharmaceutical Sciences Patient Reported Outcome Instruments Mutebi, Alex Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments |
description |
Background: Commonly used response scales in patient-reported outcome (PRO) measures include the visual analogue scale, 11-point numeric rating scale, 5-point numeric rating scale, 5-point verbal rating scale, and 5-point verbal-numeric rating scale. Although prior studies have explored the interpretation of response scale labels and compared scores resulting from the response scale, many questions remain. Purpose: To identify sets of verbal descriptors interpreted with the least variation and to explore whether the response scales provide interval level data and whether the scales are interchangeable. Methods: Subject recruitment and screening was through an online drug-drug interaction service (MediGuard.org). Via an online survey platform, subjects used a scale (0 = lowest possible and 10=highest possible) to assign interpretation scores to verbal descriptors. Repeated measures analysis of variance informed the test interval data between scores. Subjects also completed repeated administrations of four symptom-specific item stems with different response scales. Ordinal regression informed the analysis of scores assigned to verbal descriptors, comparison of probabilities of responding in given categories across scales, and prediction of response category on one scale conditional on observed response on another scale. Cut-points informed tests for interval level data. Results: The sample (n=350) comprised 223 females and 127 males with a mean (SD) age of 56.9 (12.1) years. Number of health conditions per subject ranged from 1 to 12 (median = 5). Age, sex, level of education, and number of health conditions were associated with the interpretation of verbal descriptors. Scores assigned to "poor," "fair," "good," "very good," "excellent," "somewhat," "sometimes," and "quite a bit," had the largest variation. The probability of responding in the same categories on the different response scales was significantly different across scales before and after collapsing categories. No scale yielded interval level data. The 11-NRS data tended more towards interval level than the data from other scales. Conclusions: Using different response scales with verbal descriptors in non-randomized studies may introduce bias. Differential item functioning and subgroup analyses should be investigated in the development and use of these response scales. The scales are not interchangeable. Compared with other scales the 11-NRS produced data approaching interval level. Collapsing categories entails significant probabilities of misclassification. |
author2 |
Coons, Stephen Joel |
author_facet |
Coons, Stephen Joel Mutebi, Alex |
author |
Mutebi, Alex |
author_sort |
Mutebi, Alex |
title |
Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments |
title_short |
Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments |
title_full |
Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments |
title_fullStr |
Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments |
title_full_unstemmed |
Comparing Response Scaling Formats Used in Patient-Reported Outcome (PRO) Instruments |
title_sort |
comparing response scaling formats used in patient-reported outcome (pro) instruments |
publisher |
The University of Arizona. |
publishDate |
2013 |
url |
http://hdl.handle.net/10150/306918 |
work_keys_str_mv |
AT mutebialex comparingresponsescalingformatsusedinpatientreportedoutcomeproinstruments |
_version_ |
1718106336337592320 |