Variable selection in multiple linear regression: The influence of individual cases

The influence of individual cases in a data set is studied when variable selection is applied in multiple linear regression. Two different influence measures, based on the C_p criterion and Akaike's information criterion, are introduced. The relative change in the selection criterion when an in...

Full description

Bibliographic Details
Main Authors: SJ Steel, DW Uys
Format: Article
Language:English
Published: Operations Research Society of South Africa (ORSSA) 2007-12-01
Series:ORiON
Online Access:http://orion.journals.ac.za/pub/article/view/52
Description
Summary:The influence of individual cases in a data set is studied when variable selection is applied in multiple linear regression. Two different influence measures, based on the C_p criterion and Akaike's information criterion, are introduced. The relative change in the selection criterion when an individual case is omitted is proposed as the selection influence of the specific omitted case. Four standard examples from the literature are considered and the selection influence of the cases is calculated. It is argued that the selection procedure may be improved by taking the selection influence of individual data cases into account.
ISSN:2224-0004