Implications of using Likert data in multiple regression analysis

Bibliographic Details
Main Author: Owuor, Charles Ochieng
Language: English
Published: 2009
Online Access: http://hdl.handle.net/2429/13778
dates 2001; 2001-05; 2009-10-08T21:23:38Z
type Electronic Thesis or Dissertation
source UBC Retrospective Theses Digitization Project [http://www.library.ubc.ca/archives/retro_theses/]
collection NDLTD
sources NDLTD
description Many measures obtained in educational research are Likert-type questionnaire responses, and these variables are sometimes used in ordinary least-squares regression. However, the assumptions of regression imply that the criterion is continuous. Little research has examined how much information is lost, and how inappropriate it is, when Likert variables are used in ordinary least-squares multiple regression. This study therefore examined how Likert-type responses in the criterion and predictor variables, across a range of scale points, affect the accuracy of regression models under normal and skewed observed response patterns. The design used three predictors and one criterion, with eight levels of Likert-type categorization, from two to nine scale points, applied to both the predictors and the criterion. The largest bias in the estimates of the model R-squared, the relative Pratt index, and the Pearson correlation coefficient occurred for two- and three-point Likert scales. Beyond four scale points, the bias did not decrease substantially. The type of correlation matrix had no effect on model fit. Skewed response distributions, however, produced large biases in both R-squared and the Pearson correlation, but not in the relative Pratt index, which was unaffected by the response distribution. The practical contribution of the study is that it shows how much information is lost to bias, and how far accuracy is compromised, when Likert data are used in linear regression models in education and social science research.
It is recommended that researchers and practitioners recognize the extent of the bias in ordinary least-squares regression models with Likert data, which results in a substantial loss of information. For assessing variable importance, the relative Pratt index should be used, given that it is robust to Likert conditions and response distributions. Finally, when interpreting regression results reported in the research literature, one should recognize that reported R-squared values are underestimated and that Pearson correlations are typically underestimated, sometimes substantially.
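The kind of comparison the abstract describes can be illustrated with a small simulation. The sketch below is not the thesis's actual design: the sample size, the predictor correlation of 0.3, the regression weights, the equal-width thresholds on [-3, 3], and the function names `to_likert`, `r2_and_pratt`, and `simulate` are all assumptions chosen for illustration, and only the symmetric (normal) response pattern is covered. It fits ordinary least squares to continuous data and to the same data cut into k scale points, then reports the change in R-squared and the relative Pratt indices.

```python
import numpy as np

def to_likert(x, k):
    """Cut a continuous variable into k ordered categories coded 1..k.

    Assumed scheme: standardize, then use equal-width thresholds on [-3, 3].
    """
    z = (x - x.mean()) / x.std()
    edges = np.linspace(-3.0, 3.0, k + 1)[1:-1]  # k - 1 interior thresholds
    return np.digitize(z, edges) + 1

def r2_and_pratt(X, y):
    """Return R-squared and relative Pratt indices from a standardized OLS fit."""
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    ys = (y - y.mean()) / y.std()
    beta, *_ = np.linalg.lstsq(Xs, ys, rcond=None)  # standardized coefficients
    r = Xs.T @ ys / len(ys)                          # predictor-criterion correlations
    r2 = float(beta @ r)                             # R-squared = sum of beta_j * r_j
    return r2, beta * r / r2                         # relative Pratt indices sum to 1

def simulate(k, n=20000, seed=0):
    """Compare continuous and k-point Likert versions of the same data."""
    rng = np.random.default_rng(seed)
    cov = np.full((3, 3), 0.3) + 0.7 * np.eye(3)     # assumed predictor correlations
    X = rng.multivariate_normal(np.zeros(3), cov, size=n)
    y = X @ np.array([0.4, 0.3, 0.2]) + rng.normal(size=n)  # assumed weights
    r2_cont, pratt_cont = r2_and_pratt(X, y)
    X_lik = np.column_stack([to_likert(X[:, j], k) for j in range(3)])
    y_lik = to_likert(y, k)
    r2_lik, pratt_lik = r2_and_pratt(X_lik, y_lik)
    return r2_cont, r2_lik, pratt_cont, pratt_lik

# Two to nine scale points, as in the abstract.
for k in range(2, 10):
    r2_cont, r2_lik, _, pratt_lik = simulate(k)
    print(k, round(r2_lik - r2_cont, 3), np.round(pratt_lik, 3))
```

Each printed row gives the number of scale points, the difference between the Likert and continuous R-squared, and the relative Pratt indices for the three predictors. A skewed condition could be approximated by replacing the symmetric thresholds with asymmetric ones, which is again an assumption rather than the thesis's procedure.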