An introduction to new robust linear and monotonic correlation coefficients

Background: The most common measure of association between two continuous variables is the Pearson correlation (Maronna et al. in Safari an OMC. Robust statistics, 2019. https://login.proxy.bib.uottawa.ca/login?url=https://learning.oreilly.com/library/view/-/9781119214687/?ar&orpq&email=^u)....

Full description

Bibliographic Details
Main Authors: Bailey, S. (Author), Bursac, Z. (Author), Singh, K.P (Author), Tabatabai, H. (Author), Tabatabai, M. (Author), Wilus, D. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03645nam a2200541Ia 4500
001 10.1186-s12859-021-04098-4
008 220427s2021 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a An introduction to new robust linear and monotonic correlation coefficients 
260 0 |b BioMed Central Ltd  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-021-04098-4 
520 3 |a Background: The most common measure of association between two continuous variables is the Pearson correlation (Maronna et al. in Safari an OMC. Robust statistics, 2019. https://login.proxy.bib.uottawa.ca/login?url=https://learning.oreilly.com/library/view/-/9781119214687/?ar&orpq&email=^u). When outliers are present, Pearson does not accurately measure association and robust measures are needed. This article introduces three new robust measures of correlation: Taba (T), TabWil (TW), and TabWil rank (TWR). The correlation estimators T and TW measure a linear association between two continuous or ordinal variables; whereas TWR measures a monotonic association. The robustness of these proposed measures in comparison with Pearson (P), Spearman (S), Quadrant (Q), Median (M), and Minimum Covariance Determinant (MCD) are examined through simulation. Taba distance is used to analyze genes, and statistical tests were used to identify those genes most significantly associated with Williams Syndrome (WS). Results: Based on the root mean square error (RMSE) and bias, the three proposed correlation measures are highly competitive when compared to classical measures such as P and S as well as robust measures such as Q, M, and MCD. Our findings indicate TBL2 was the most significant gene among patients diagnosed with WS and had the most significant reduction in gene expression level when compared with control (P value = 6.37E-05). Conclusions: Overall, when the distribution is bivariate Log-Normal or bivariate Weibull, TWR performs best in terms of bias and T performs best with respect to RMSE. Under the Normal distribution, MCD performs well with respect to bias and RMSE; but TW, TWR, T, S, and P correlations were in close proximity. The identification of TBL2 may serve as a diagnostic tool for WS patients. A Taba R package has been developed and is available for use to perform all necessary computations for the proposed methods. © 2021, The Author(s). 
650 0 4 |a computer simulation 
650 0 4 |a Computer Simulation 
650 0 4 |a Continuous variables 
650 0 4 |a Correlation coefficient 
650 0 4 |a Correlation estimator 
650 0 4 |a Correlation measures 
650 0 4 |a Correlation methods 
650 0 4 |a Correlation of Data 
650 0 4 |a Diagnosis 
650 0 4 |a Dissimilarity measures 
650 0 4 |a Gene expression 
650 0 4 |a Gene expression 
650 0 4 |a Gene expression levels 
650 0 4 |a HTTP 
650 0 4 |a human 
650 0 4 |a Humans 
650 0 4 |a Linear associations 
650 0 4 |a Mean square error 
650 0 4 |a Median correlation 
650 0 4 |a Minimum covariance determinant 
650 0 4 |a Minimum covariance determinant correlation 
650 0 4 |a Normal distribution 
650 0 4 |a Patient monitoring 
650 0 4 |a Pearson correlation 
650 0 4 |a Quadrant correlation 
650 0 4 |a Root mean square errors 
650 0 4 |a Spearman correlation 
650 0 4 |a Weibull distribution 
650 0 4 |a Williams syndrome 
700 1 |a Bailey, S.  |e author 
700 1 |a Bursac, Z.  |e author 
700 1 |a Singh, K.P.  |e author 
700 1 |a Tabatabai, H.  |e author 
700 1 |a Tabatabai, M.  |e author 
700 1 |a Wilus, D.  |e author 
773 |t BMC Bioinformatics