Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection

The current study employed a Monte Carlo design to examine whether samplebased and formula-based estimates of cross-validated R2 differ in accuracy when predictor selection is and is not performed. Analyses were conducted on three datasets with 5, 10, or 15 predictors and different predictor-criteri...

Full description

Bibliographic Details
Main Author: Kircher, Andrew J.
Format: Others
Published: TopSCHOLAR® 2015
Subjects:
Online Access:http://digitalcommons.wku.edu/theses/1472
http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2475&context=theses
id ndltd-WKU-oai-digitalcommons.wku.edu-theses-2475
record_format oai_dc
spelling ndltd-WKU-oai-digitalcommons.wku.edu-theses-24752015-05-23T05:39:48Z Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection Kircher, Andrew J. The current study employed a Monte Carlo design to examine whether samplebased and formula-based estimates of cross-validated R2 differ in accuracy when predictor selection is and is not performed. Analyses were conducted on three datasets with 5, 10, or 15 predictors and different predictor-criterion relationships. Results demonstrated that, in most cases, a formula-based estimate of the cross-validated R2 was as accurate as a sample-based estimate. The one exception was the five predictor case wherein the formula-based estimate exhibited substantially greater bias than the estimate from a sample-based cross validation study. Thus, formula-based estimates, which have an enormous practical advantage over a two sample cross validation study, can be used in most cases without fear of greater error. 2015-05-01T07:00:00Z text application/pdf http://digitalcommons.wku.edu/theses/1472 http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2475&context=theses Masters Theses & Specialist Projects TopSCHOLAR® Predictor Criterion Formula Based Applied Behavior Analysis Psychology
collection NDLTD
format Others
sources NDLTD
topic Predictor
Criterion
Formula
Based
Applied Behavior Analysis
Psychology
spellingShingle Predictor
Criterion
Formula
Based
Applied Behavior Analysis
Psychology
Kircher, Andrew J.
Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection
description The current study employed a Monte Carlo design to examine whether samplebased and formula-based estimates of cross-validated R2 differ in accuracy when predictor selection is and is not performed. Analyses were conducted on three datasets with 5, 10, or 15 predictors and different predictor-criterion relationships. Results demonstrated that, in most cases, a formula-based estimate of the cross-validated R2 was as accurate as a sample-based estimate. The one exception was the five predictor case wherein the formula-based estimate exhibited substantially greater bias than the estimate from a sample-based cross validation study. Thus, formula-based estimates, which have an enormous practical advantage over a two sample cross validation study, can be used in most cases without fear of greater error.
author Kircher, Andrew J.
author_facet Kircher, Andrew J.
author_sort Kircher, Andrew J.
title Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection
title_short Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection
title_full Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection
title_fullStr Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection
title_full_unstemmed Estimation of the Squared Population Cross-Validity Under Conditions of Predictor Selection
title_sort estimation of the squared population cross-validity under conditions of predictor selection
publisher TopSCHOLAR®
publishDate 2015
url http://digitalcommons.wku.edu/theses/1472
http://digitalcommons.wku.edu/cgi/viewcontent.cgi?article=2475&context=theses
work_keys_str_mv AT kircherandrewj estimationofthesquaredpopulationcrossvalidityunderconditionsofpredictorselection
_version_ 1716804098699821056