Mis-specification and goodness-of-fit in logistic regression

The logistic regression model has become a standard model for binary outcomes in many areas of application and is widely used in medical statistics. Much work has been carried out to examine the asymptotic behaviour of the distribution of Maximum Likelihood Estimates (MLE) for the logistic regressio...


Bibliographic Details
Main Author: Badi, Nuri H. Salem
Published: University of Newcastle upon Tyne 2014
Subjects: 519.5
Online Access: http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.618239
id ndltd-bl.uk-oai-ethos.bl.uk-618239
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-618239 2016-08-04T03:36:43Z Mis-specification and goodness-of-fit in logistic regression Badi, Nuri H. Salem 2014 The logistic regression model has become a standard model for binary outcomes in many areas of application and is widely used in medical statistics. Much work has been carried out to examine the asymptotic behaviour of the distribution of Maximum Likelihood Estimates (MLE) for the logistic regression model, although the most widely known properties apply only if the assumed model is correct. There has been much work on goodness-of-fit tests to address the last point. The first part of this thesis investigates the behaviour of the asymptotic distribution of the MLE under a form of model mis-specification, namely when covariates from the true model are omitted from the fitted model. When the incorrect model is fitted, the maximum likelihood estimates converge to the least false values. In this work, key integrals cannot be evaluated explicitly, but we use properties of the skew-Normal distribution and the approximation of the Logit by a suitable Probit function to obtain a good approximation for the least false values. The second part of the thesis investigates the assessment of a particular goodness-of-fit test, namely the information matrix (IM) test, as applied to binary data models. Kuss (2002) claimed that the IM test has reasonable power compared with other statistics. In this part of the thesis we investigate this claim, consider the distribution of the moments of the IM statistic and the asymptotic distribution of the IM test (IMT) statistic. We had difficulty in reproducing the results claimed by Kuss (2002) and considered that this was probably due to the near singularity of the variance of IMT. We define a new form of the IMT statistic, IMTR, which addresses this issue. 519.5 University of Newcastle upon Tyne http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.618239 http://hdl.handle.net/10443/2376 Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 519.5
spellingShingle 519.5
Badi, Nuri H. Salem
Mis-specification and goodness-of-fit in logistic regression
description The logistic regression model has become a standard model for binary outcomes in many areas of application and is widely used in medical statistics. Much work has been carried out to examine the asymptotic behaviour of the distribution of Maximum Likelihood Estimates (MLE) for the logistic regression model, although the most widely known properties apply only if the assumed model is correct. There has been much work on goodness-of-fit tests to address the last point. The first part of this thesis investigates the behaviour of the asymptotic distribution of the MLE under a form of model mis-specification, namely when covariates from the true model are omitted from the fitted model. When the incorrect model is fitted, the maximum likelihood estimates converge to the least false values. In this work, key integrals cannot be evaluated explicitly, but we use properties of the skew-Normal distribution and the approximation of the Logit by a suitable Probit function to obtain a good approximation for the least false values. The second part of the thesis investigates the assessment of a particular goodness-of-fit test, namely the information matrix (IM) test, as applied to binary data models. Kuss (2002) claimed that the IM test has reasonable power compared with other statistics. In this part of the thesis we investigate this claim, consider the distribution of the moments of the IM statistic and the asymptotic distribution of the IM test (IMT) statistic. We had difficulty in reproducing the results claimed by Kuss (2002) and considered that this was probably due to the near singularity of the variance of IMT. We define a new form of the IMT statistic, IMTR, which addresses this issue.
author Badi, Nuri H. Salem
author_facet Badi, Nuri H. Salem
author_sort Badi, Nuri H. Salem
title Mis-specification and goodness-of-fit in logistic regression
title_short Mis-specification and goodness-of-fit in logistic regression
title_full Mis-specification and goodness-of-fit in logistic regression
title_fullStr Mis-specification and goodness-of-fit in logistic regression
title_full_unstemmed Mis-specification and goodness-of-fit in logistic regression
title_sort mis-specification and goodness-of-fit in logistic regression
publisher University of Newcastle upon Tyne
publishDate 2014
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.618239
work_keys_str_mv AT badinurihsalem misspecificationandgoodnessoffitinlogisticregression
_version_ 1718370493066641408
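
Illustrative note on the description field: the abstract mentions approximating the Logit by a Probit function to obtain least false values when a covariate is omitted. The Python sketch below shows that idea in the simplest case only, and it is not the thesis's derivation. It assumes a true model logit P(Y=1 | Z) = b0 + b1*Z with a single omitted covariate Z ~ N(0, 1) and a fitted intercept-only model, for which the least false intercept is the logit of the marginal probability E[expit(b0 + b1*Z)]. The scaling constant 1/1.702 is a commonly used choice and an assumption here (the thesis may use a different constant), and all function names are hypothetical. The skew-Normal results mentioned in the abstract are not used.

```python
import math

# Assumed scaling: expit(x) is close to Phi(C * x) with C = 1/1.702.
C = 1.0 / 1.702


def expit(x):
    """Logistic function, the inverse of the logit."""
    return 1.0 / (1.0 + math.exp(-x))


def logit(p):
    """Log-odds of a probability p in (0, 1)."""
    return math.log(p / (1.0 - p))


def norm_cdf(x):
    """Standard Normal distribution function Phi(x)."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def marginal_prob_numeric(b0, b1, n=4001, lim=8.0):
    """E[expit(b0 + b1*Z)] for Z ~ N(0, 1), by the trapezoidal rule."""
    h = 2.0 * lim / (n - 1)
    total = 0.0
    for i in range(n):
        z = -lim + i * h
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * expit(b0 + b1 * z) * math.exp(-0.5 * z * z)
    return total * h / math.sqrt(2.0 * math.pi)


def marginal_prob_probit(b0, b1):
    """Closed-form approximation: replace expit(x) by Phi(C*x), then use
    E[Phi(a + b*Z)] = Phi(a / sqrt(1 + b^2)) for Z ~ N(0, 1)."""
    return norm_cdf(C * b0 / math.sqrt(1.0 + (C * b1) ** 2))


def least_false_intercept(b0, b1, approx=False):
    """Limit of the intercept-only MLE when Z is omitted from the fit."""
    if approx:
        p = marginal_prob_probit(b0, b1)
    else:
        p = marginal_prob_numeric(b0, b1)
    return logit(p)


if __name__ == "__main__":
    b0, b1 = 1.0, 1.5  # illustrative values, not taken from the thesis
    print("numeric least false intercept:", least_false_intercept(b0, b1))
    print("probit-based approximation:  ",
          least_false_intercept(b0, b1, approx=True))
```

Running the script prints the two values side by side for one illustrative parameter pair. In this setting the limiting intercept is pulled towards zero relative to b0, which is the attenuation expected when a relevant covariate is left out of the fitted model.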