The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data

碩士 === 國立臺北大學 === 統計學系 === 93 === Latent class models (LCM) has found important applications in the social and behavioral sciences for modeling responses to sets of categorical variables and non-response is typical when collecting data. In this study, the non-response mainly included “skip pattern”...

Full description

Bibliographic Details
Main Authors: Pei-Ying Lin, 林珮瑩
Other Authors: TING-HSIANG LIN
Format: Others
Language:en_US
Published: 2005
Online Access:http://ndltd.ncl.edu.tw/handle/59095725789280361776
id ndltd-TW-093NTPU0337025
record_format oai_dc
spelling ndltd-TW-093NTPU03370252015-10-13T15:29:20Z http://ndltd.ncl.edu.tw/handle/59095725789280361776 The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data 無回答資料對隱藏類別分析模型選擇的影響:遺失值和跳題型式上之應用 Pei-Ying Lin 林珮瑩 碩士 國立臺北大學 統計學系 93 Latent class models (LCM) has found important applications in the social and behavioral sciences for modeling responses to sets of categorical variables and non-response is typical when collecting data. In this study, the non-response mainly included “skip pattern” and “missing data”. The primary purpose of this research is to evaluate the effects of several factors (latent class proportions, conditional probabilities, missing data rates, skip pattern rates, sample sizes and number of items) for non-response data across LCM. Some researches have studied latent class with missing items. However, there is no research on the latent class model selection when both skip and missing items occur simultaneously. Therefore, we are interested in investigating in the effect of non-response data on model selection in the latent class analysis. We simulated these non-response data and used the eight information criteria to evaluate the accuracy rates for selecting the best model. The results showed the main factors in model selections of these information criteria are latent class proportions, conditional probabilities, number of sample sizes and items. In addition, when we contrasted with comparison data, the levels of skip and missing rates are also significant. We recommend to use AIC, BIC* and CAIC* when sample size is small and the number of items is four and under more extreme different conditional probabilities. But if item is increased to eight and sample size is still small, only BIC* is not suitable and the other seven indices perform better. AIC and HT-AIC don not improve with large sample sizes and more items situation. With fewer samples and items, BIC and CAIC have very low accuracy rates. The adjusted BIC (BIC*) and adjust CAIC (CAIC*) have noteworthy improvements for selecting correct model. TING-HSIANG LIN 林定香 2005 學位論文 ; thesis 91 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺北大學 === 統計學系 === 93 === Latent class models (LCM) has found important applications in the social and behavioral sciences for modeling responses to sets of categorical variables and non-response is typical when collecting data. In this study, the non-response mainly included “skip pattern” and “missing data”. The primary purpose of this research is to evaluate the effects of several factors (latent class proportions, conditional probabilities, missing data rates, skip pattern rates, sample sizes and number of items) for non-response data across LCM. Some researches have studied latent class with missing items. However, there is no research on the latent class model selection when both skip and missing items occur simultaneously. Therefore, we are interested in investigating in the effect of non-response data on model selection in the latent class analysis. We simulated these non-response data and used the eight information criteria to evaluate the accuracy rates for selecting the best model. The results showed the main factors in model selections of these information criteria are latent class proportions, conditional probabilities, number of sample sizes and items. In addition, when we contrasted with comparison data, the levels of skip and missing rates are also significant. We recommend to use AIC, BIC* and CAIC* when sample size is small and the number of items is four and under more extreme different conditional probabilities. But if item is increased to eight and sample size is still small, only BIC* is not suitable and the other seven indices perform better. AIC and HT-AIC don not improve with large sample sizes and more items situation. With fewer samples and items, BIC and CAIC have very low accuracy rates. The adjusted BIC (BIC*) and adjust CAIC (CAIC*) have noteworthy improvements for selecting correct model.
author2 TING-HSIANG LIN
author_facet TING-HSIANG LIN
Pei-Ying Lin
林珮瑩
author Pei-Ying Lin
林珮瑩
spellingShingle Pei-Ying Lin
林珮瑩
The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
author_sort Pei-Ying Lin
title The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
title_short The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
title_full The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
title_fullStr The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
title_full_unstemmed The effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
title_sort effect of non-response data on model selection in latent class analysis: an application of missing and skip response data
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/59095725789280361776
work_keys_str_mv AT peiyinglin theeffectofnonresponsedataonmodelselectioninlatentclassanalysisanapplicationofmissingandskipresponsedata
AT línpèiyíng theeffectofnonresponsedataonmodelselectioninlatentclassanalysisanapplicationofmissingandskipresponsedata
AT peiyinglin wúhuídázīliàoduìyǐncánglèibiéfēnxīmóxíngxuǎnzédeyǐngxiǎngyíshīzhíhétiàotíxíngshìshàngzhīyīngyòng
AT línpèiyíng wúhuídázīliàoduìyǐncánglèibiéfēnxīmóxíngxuǎnzédeyǐngxiǎngyíshīzhíhétiàotíxíngshìshàngzhīyīngyòng
AT peiyinglin effectofnonresponsedataonmodelselectioninlatentclassanalysisanapplicationofmissingandskipresponsedata
AT línpèiyíng effectofnonresponsedataonmodelselectioninlatentclassanalysisanapplicationofmissingandskipresponsedata
_version_ 1717766140295380992