Model comparison and assessment by cross validation

Cross validation (CV) is widely used for model assessment and comparison. In this thesis, we first review and compare three v-fold CV strategies: best single CV, repeated and averaged CV, and double CV. The mean squared errors of the CV strategies in estimating the best predictive performance are illustrated using simulated and real data examples. The results show that repeated and averaged CV is a good strategy and outperforms the other two for finite samples, in terms of both the mean squared error in estimating prediction accuracy and the probability of choosing an optimal model.

In practice, when many models must be compared, the repeated and averaged CV strategy is not computationally feasible. We develop an efficient sequential methodology for model comparison based on CV that also takes the randomness in CV into account. The number of models is reduced via an adaptive, multiplicity-adjusted sequential algorithm, where poor performers are quickly eliminated. By exploiting matching of individual observations, it is sometimes even possible to establish the statistically significant inferiority of some models with just one execution of CV. This adaptive and computationally efficient methodology is demonstrated on a large cheminformatics data set from PubChem.

Cross-validated mean squared error (CVMSE) is widely used to estimate the prediction mean squared error (MSE) of statistical methods. For linear models, we show how CVMSE depends on the number of folds v used in cross validation, the number of observations, and the number of model parameters. We establish that the bias of CVMSE in estimating the true MSE decreases with v and increases with model complexity. In particular, the bias may be very substantial for models with many parameters relative to the number of observations, even if v is large. These results are used to correct CVMSE for its bias. We compare our proposed bias correction with that of Burman (1989) through simulated and real examples, and we illustrate that correcting for the bias of CVMSE may change the results of model selection.
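
The repeated and averaged strategy discussed above is simple to state in code. The sketch below is a minimal illustration, not the thesis's own implementation: it repeats a v-fold split over several random shuffles of the data and averages the resulting CV mean squared errors for an ordinary least-squares fit. The function name, the choice v = 5, the repeat count, and the simulated data are all assumptions made for the example.

    import numpy as np

    def repeated_averaged_cv_mse(X, y, v=5, repeats=20, seed=0):
        """Average the v-fold CV mean squared error over several random
        shuffles of the data (the repeated and averaged CV strategy)."""
        rng = np.random.default_rng(seed)
        n = len(y)
        cv_mses = []
        for _ in range(repeats):
            idx = rng.permutation(n)
            folds = np.array_split(idx, v)
            sq_errs = np.empty(n)
            for fold in folds:
                train = np.setdiff1d(idx, fold)
                # Ordinary least squares fit on the training folds only.
                beta, *_ = np.linalg.lstsq(X[train], y[train], rcond=None)
                sq_errs[fold] = (y[fold] - X[fold] @ beta) ** 2
            cv_mses.append(sq_errs.mean())
        # Averaging over repeats smooths out the fold-assignment
        # randomness that a single ("best single") CV run is exposed to.
        return float(np.mean(cv_mses))

    # Hypothetical usage: 100 observations, intercept plus 3 predictors.
    rng = np.random.default_rng(1)
    X = np.column_stack([np.ones(100), rng.normal(size=(100, 3))])
    y = X @ np.array([1.0, 2.0, 0.0, -1.0]) + rng.normal(size=100)
    print(repeated_averaged_cv_mse(X, y))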
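The abstract gives only a high-level account of the sequential screening algorithm, but the matching idea it mentions can be illustrated directly: when every model is scored on the same held-out observations in one CV run, the per-observation loss differences against the current best model form matched pairs that can be tested immediately. The sketch below uses a one-sided paired test with a normal approximation and a Bonferroni adjustment as a stand-in for the thesis's multiplicity adjustment; the function name and thresholds are illustrative.

    import numpy as np
    from math import erf, sqrt

    def surviving_models(losses, alpha=0.05):
        """losses: (n_models, n_obs) array of squared errors from ONE
        shared CV run, so each column refers to the same held-out
        observation.  Drop models whose matched loss differences
        against the apparent best are significantly positive."""
        n_models, n_obs = losses.shape
        best = int(np.argmin(losses.mean(axis=1)))
        keep = [best]
        for m in range(n_models):
            if m == best:
                continue
            d = losses[m] - losses[best]                # matched differences
            t = d.mean() / (d.std(ddof=1) / sqrt(n_obs))
            p = 1.0 - 0.5 * (1.0 + erf(t / sqrt(2.0)))  # one-sided p-value
            if p >= alpha / (n_models - 1):             # Bonferroni adjustment
                keep.append(m)                          # cannot yet be eliminated
        return sorted(keep)

Models eliminated this way never need another CV execution, which is the source of the computational savings the abstract claims.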
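The thesis's own bias correction for CVMSE is not reproduced in the abstract, but the Burman (1989) correction it is compared against has a widely quoted closed form, stated here from the literature as a reference point rather than from the thesis itself. With \(\bar{L}(\cdot)\) denoting average squared error over all n observations, \(\hat\beta\) the fit to the full data, and \(\hat\beta^{(-s)}\) the fit with fold s held out, Burman's adjusted v-fold estimate is

    \mathrm{CVMSE}_{\mathrm{adj}}
      = \mathrm{CVMSE}
      + \bar{L}\bigl(\hat\beta\bigr)
      - \frac{1}{v} \sum_{s=1}^{v} \bar{L}\bigl(\hat\beta^{(-s)}\bigr).

The last two terms estimate how much worse a model trained on only a (v-1)/v fraction of the data is than one trained on all of it, which is the gap driving the bias described above: it shrinks as v grows and grows with the number of parameters.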

Bibliographic Details
Main Author: Shen, Hui
Format: Others
Language: English
Published: University of British Columbia, 2008
Subjects: Model assessment; Cross validation
Online Access: http://hdl.handle.net/2429/1286