<b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance

Current study employs Monte Carlo simulation in the building of a significance test to indicate the principal components that best discriminate against outliers. Different sample sizes were generated by multivariate normal distribution with different numbers of variables and correlation structures....

Full description

Bibliographic Details
Main Authors: Manoel Vitor de Souza Veloso, Marcelo Angelo Cirillo
Format: Article
Language:English
Published: Universidade Estadual de Maringá 2016-04-01
Series:Acta Scientiarum: Technology
Subjects:
Online Access:http://periodicos.uem.br/ojs/index.php/ActaSciTechnol/article/view/26046
id doaj-128d5bfba3d54a1aa04207741f436b1a
record_format Article
spelling doaj-128d5bfba3d54a1aa04207741f436b1a2020-11-24T20:43:19ZengUniversidade Estadual de MaringáActa Scientiarum: Technology1806-25631807-86642016-04-0138219320010.4025/actascitechnol.v38i2.2604613477<b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distanceManoel Vitor de Souza Veloso0Marcelo Angelo Cirillo1Universidade Federal de AlfenasUniversidade Federal de LavrasCurrent study employs Monte Carlo simulation in the building of a significance test to indicate the principal components that best discriminate against outliers. Different sample sizes were generated by multivariate normal distribution with different numbers of variables and correlation structures. Corrections by chi-square distance of Pearson´s and Yates's were provided for each sample size. Pearson´s correlation test showed the best performance. By increasing the number of variables, significance probabilities in favor of hypothesis H0 were reduced. So that the proposed method could be illustrated, a multivariate time series was applied with regard to sales volume rates in the state of Minas Gerais, obtained in different market segments.http://periodicos.uem.br/ojs/index.php/ActaSciTechnol/article/view/26046contaminated samplesMonte Carlosignificance testp-value
collection DOAJ
language English
format Article
sources DOAJ
author Manoel Vitor de Souza Veloso
Marcelo Angelo Cirillo
spellingShingle Manoel Vitor de Souza Veloso
Marcelo Angelo Cirillo
<b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance
Acta Scientiarum: Technology
contaminated samples
Monte Carlo
significance test
p-value
author_facet Manoel Vitor de Souza Veloso
Marcelo Angelo Cirillo
author_sort Manoel Vitor de Souza Veloso
title <b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance
title_short <b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance
title_full <b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance
title_fullStr <b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance
title_full_unstemmed <b>Principal components in the discrimination of outliers: A study in simulation sample data corrected by Pearson's and Yates´s chi-square distance
title_sort <b>principal components in the discrimination of outliers: a study in simulation sample data corrected by pearson's and yates´s chi-square distance
publisher Universidade Estadual de Maringá
series Acta Scientiarum: Technology
issn 1806-2563
1807-8664
publishDate 2016-04-01
description Current study employs Monte Carlo simulation in the building of a significance test to indicate the principal components that best discriminate against outliers. Different sample sizes were generated by multivariate normal distribution with different numbers of variables and correlation structures. Corrections by chi-square distance of Pearson´s and Yates's were provided for each sample size. Pearson´s correlation test showed the best performance. By increasing the number of variables, significance probabilities in favor of hypothesis H0 were reduced. So that the proposed method could be illustrated, a multivariate time series was applied with regard to sales volume rates in the state of Minas Gerais, obtained in different market segments.
topic contaminated samples
Monte Carlo
significance test
p-value
url http://periodicos.uem.br/ojs/index.php/ActaSciTechnol/article/view/26046
work_keys_str_mv AT manoelvitordesouzaveloso bprincipalcomponentsinthediscriminationofoutliersastudyinsimulationsampledatacorrectedbypearsonsandyatesschisquaredistance
AT marceloangelocirillo bprincipalcomponentsinthediscriminationofoutliersastudyinsimulationsampledatacorrectedbypearsonsandyatesschisquaredistance
_version_ 1716820243601424384