Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm
Multiple correspondence analysis (MCA) in the presence of missing data is usually performed by removing the records that have missing or not available (NA) data; sometimes, an entire row or column of a data matrix is removed, which is not ideal because relevant information on an individual or variab...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universidad Nacional de Colombia
2019-10-01
|
Series: | Dyna |
Subjects: | |
Online Access: | https://revistas.unal.edu.co/index.php/dyna/article/view/80261 |
id |
doaj-76e9f42d91e1411983a6ad1dc0bf66ce |
---|---|
record_format |
Article |
spelling |
doaj-76e9f42d91e1411983a6ad1dc0bf66ce2020-11-25T02:40:04ZengUniversidad Nacional de Colombia Dyna0012-73532346-21832019-10-018621124925710.15446/dyna.v86n211.8026152397Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithmAndrés Felipe Ochoa Muñoz0Víctor Manuel Gonzalez Rojas1Campo Elías Pardo Turriago2Universidad del ValleUniversidad del ValleUniversidad Nacional de Colombia - Sede BogotáMultiple correspondence analysis (MCA) in the presence of missing data is usually performed by removing the records that have missing or not available (NA) data; sometimes, an entire row or column of a data matrix is removed, which is not ideal because relevant information on an individual or variable of the study is lost. In some cases, it is assumed that the missing data are a category of the qualitative variable, resulting in a greater variance dispersion in the new axes. Possible solutions to this problem can be the imputation of the missing data or using an algorithm suited to the presence of this type of data. This work is focused on performing the MCA method in the presence of missing data, without using imputation techniques, by using the available data principle of the nonlinear estimation by iterative partial least squares (NIPALS) algorithm [25].https://revistas.unal.edu.co/index.php/dyna/article/view/80261multiple correspondence analysismissing datanipalsavailable data principle |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Andrés Felipe Ochoa Muñoz Víctor Manuel Gonzalez Rojas Campo Elías Pardo Turriago |
spellingShingle |
Andrés Felipe Ochoa Muñoz Víctor Manuel Gonzalez Rojas Campo Elías Pardo Turriago Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm Dyna multiple correspondence analysis missing data nipals available data principle |
author_facet |
Andrés Felipe Ochoa Muñoz Víctor Manuel Gonzalez Rojas Campo Elías Pardo Turriago |
author_sort |
Andrés Felipe Ochoa Muñoz |
title |
Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm |
title_short |
Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm |
title_full |
Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm |
title_fullStr |
Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm |
title_full_unstemmed |
Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm |
title_sort |
missing data in multiple correspondence analysis under the available data principle of the nipals algorithm |
publisher |
Universidad Nacional de Colombia |
series |
Dyna |
issn |
0012-7353 2346-2183 |
publishDate |
2019-10-01 |
description |
Multiple correspondence analysis (MCA) in the presence of missing data is usually performed by removing the records that have missing or not available (NA) data; sometimes, an entire row or column of a data matrix is removed, which is not ideal because relevant information on an individual or variable of the study is lost. In some cases, it is assumed that the missing data are a category of the qualitative variable, resulting in a greater variance dispersion in the new axes. Possible solutions to this problem can be the imputation of the missing data or using an algorithm suited to the presence of this type of data. This work is focused on performing the MCA method in the presence of missing data, without using imputation techniques, by using the available data principle of the nonlinear estimation by iterative partial least squares (NIPALS) algorithm [25]. |
topic |
multiple correspondence analysis missing data nipals available data principle |
url |
https://revistas.unal.edu.co/index.php/dyna/article/view/80261 |
work_keys_str_mv |
AT andresfelipeochoamunoz missingdatainmultiplecorrespondenceanalysisundertheavailabledataprincipleofthenipalsalgorithm AT victormanuelgonzalezrojas missingdatainmultiplecorrespondenceanalysisundertheavailabledataprincipleofthenipalsalgorithm AT campoeliaspardoturriago missingdatainmultiplecorrespondenceanalysisundertheavailabledataprincipleofthenipalsalgorithm |
_version_ |
1724783219818102784 |