Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions

Background: In data analysis and machine learning, we often need to identify and quantify the correlation between variables. Although Pearson’s correlation coefficient has been widely used, its value is reliable only for linear relationships and Distance correlation was introduced to address this sh...

Full description

Bibliographic Details
Main Authors: Nezamoddin N. Kachouie, Wejdan Deebani
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/22/4/440
id doaj-513939282c384e75b9d466b194b9c08f
record_format Article
spelling doaj-513939282c384e75b9d466b194b9c08f2020-11-25T02:04:02ZengMDPI AGEntropy1099-43002020-04-012244044010.3390/e22040440Association Factor for Identifying Linear and Nonlinear Correlations in Noisy ConditionsNezamoddin N. Kachouie0Wejdan Deebani1Department of Mathematical Sciences, Florida Institute of Technology, Melbourne, FL 32901, USADeparments of Mathematics, College of Science and Arts, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi ArabiaBackground: In data analysis and machine learning, we often need to identify and quantify the correlation between variables. Although Pearson’s correlation coefficient has been widely used, its value is reliable only for linear relationships and Distance correlation was introduced to address this shortcoming. Methods: Distance correlation can identify linear and nonlinear correlations. However, its performance drops in noisy conditions. In this paper, we introduce the Association Factor (AF) as a robust method for identification and quantification of linear and nonlinear associations in noisy conditions. Results: To test the performance of the proposed Association Factor, we modeled several simulations of linear and nonlinear relationships in different noise conditions and computed Pearson’s correlation, Distance correlation, and the proposed Association Factor. Conclusion: Our results show that the proposed method is robust in two ways. First, it can identify both linear and nonlinear associations. Second, the proposed Association Factor is reliable in both noiseless and noisy conditions.https://www.mdpi.com/1099-4300/22/4/440association factorPearson’s correlationdistance correlationmaximal information coefficient (MIC)detrended fluctuation analysis (DFA)nonlinear relation
collection DOAJ
language English
format Article
sources DOAJ
author Nezamoddin N. Kachouie
Wejdan Deebani
spellingShingle Nezamoddin N. Kachouie
Wejdan Deebani
Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions
Entropy
association factor
Pearson’s correlation
distance correlation
maximal information coefficient (MIC)
detrended fluctuation analysis (DFA)
nonlinear relation
author_facet Nezamoddin N. Kachouie
Wejdan Deebani
author_sort Nezamoddin N. Kachouie
title Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions
title_short Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions
title_full Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions
title_fullStr Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions
title_full_unstemmed Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions
title_sort association factor for identifying linear and nonlinear correlations in noisy conditions
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2020-04-01
description Background: In data analysis and machine learning, we often need to identify and quantify the correlation between variables. Although Pearson’s correlation coefficient has been widely used, its value is reliable only for linear relationships and Distance correlation was introduced to address this shortcoming. Methods: Distance correlation can identify linear and nonlinear correlations. However, its performance drops in noisy conditions. In this paper, we introduce the Association Factor (AF) as a robust method for identification and quantification of linear and nonlinear associations in noisy conditions. Results: To test the performance of the proposed Association Factor, we modeled several simulations of linear and nonlinear relationships in different noise conditions and computed Pearson’s correlation, Distance correlation, and the proposed Association Factor. Conclusion: Our results show that the proposed method is robust in two ways. First, it can identify both linear and nonlinear associations. Second, the proposed Association Factor is reliable in both noiseless and noisy conditions.
topic association factor
Pearson’s correlation
distance correlation
maximal information coefficient (MIC)
detrended fluctuation analysis (DFA)
nonlinear relation
url https://www.mdpi.com/1099-4300/22/4/440
work_keys_str_mv AT nezamoddinnkachouie associationfactorforidentifyinglinearandnonlinearcorrelationsinnoisyconditions
AT wejdandeebani associationfactorforidentifyinglinearandnonlinearcorrelationsinnoisyconditions
_version_ 1724945012628652032