Skip to content
BY 4.0 license Open Access Published by De Gruyter Open Access December 16, 2020

Two symmetric and computationally efficient Gini correlations

  • Courtney Vanderford , Yongli Sang and Xin Dang EMAIL logo
From the journal Dependence Modeling

Abstract

Standard Gini correlation plays an important role in measuring the dependence between random variables with heavy-tailed distributions. It is based on the covariance between one variable and the rank of the other. Hence for each pair of random variables, there are two Gini correlations and they are not equal in general, which brings a substantial difficulty in interpretation. Recently, Sang et al (2016) proposed a symmetric Gini correlation based on the joint spatial rank function with a computation cost of O(n2) where n is the sample size. In this paper, we study two symmetric and computationally efficient Gini correlations with the computational complexity of O(n log n). The properties of the new symmetric Gini correlations are explored. The influence function approach is utilized to study the robustness and the asymptotic behavior of these correlations. The asymptotic relative efficiencies are considered to compare several popular correlations under symmetric distributions with different tail-heaviness as well as an asymmetric log-normal distribution. Simulation and real data application are conducted to demonstrate the desirable performance of the two new symmetric Gini correlations.

MSC 2010: 62G35; 62G20

References

[1] Blitz, R.C. and J.A. Brittain (1964). An extension of the Lorenz diagram to the correlation of two variables. Metron 23(1-4), 137–143.Search in Google Scholar

[2] Blomqvist, N. (1950). On a measure of dependence between two random variables. Ann. Math. Statist. 21(4), 593–600.10.1214/aoms/1177729754Search in Google Scholar

[3] Croux, C. and C. Dehon (2010). Influence function of the Spearman and Kendall correlation measures. Stat. Methods Appl. 19(4), 497–515.10.1007/s10260-010-0142-zSearch in Google Scholar

[4] Dang, X., H. Sang, and L. Weatherall (2019). Gini covariance matrix and its affine equivariant version. Statist. Papers 60, 641–666.10.1007/s00362-016-0842-zSearch in Google Scholar

[5] Devlin, S.J., R. Gnanadesikan, and J.R. Kettenring (1975). Robust estimation and outlier detection with correlation coefficients. Biometrika 62(3), 531–545.10.1093/biomet/62.3.531Search in Google Scholar

[6] Fang, K.T. and T. W. Anderson (1990). Statistical Inference in Elliptically Contoured and Related Distributions. Allerton Press, New York.Search in Google Scholar

[7] Fisher, R.A. (1922). On the mathematical foundations of theoretical statistics. Philos. Trans. Roy. Soc. A 222, 309–368.10.1098/rsta.1922.0009Search in Google Scholar

[8] Fontanari, A., P. Cirillo, and C.W. Oosterlee (2020). Lorenz-generated bivariate Archimedean copulas. Depend. Model. 8, 186–209.10.1515/demo-2020-0011Search in Google Scholar

[9] Furman, E. and R. Zitikis (2017). Beyond the Pearson correlation: heavy-tailed risks, weighted Gini correlations, and a Gini-type weighted insurance pricing model. Astin Bull. 47(3), 919–942.10.1017/asb.2017.20Search in Google Scholar

[10] Genest, C., J. Nešlehová, and N. Ghorbal (2010). Spearman’s footrule and Gini’s gamma: a review with complements. J. Nonparametr. Stat. 22(8), 937–954.10.1080/10485250903499667Search in Google Scholar

[11] Gini, C. (1914). L’Ammontare e la Composizione della Ricchezza delle Nazioni. UTET, Torino.Search in Google Scholar

[12] Gribkova, N. and R. Zitikis (2019). Weighted allocations, their concomitant-based estimators, and asymptotics. Ann. Inst. Statist. Math. 71(4), 811–835.10.1007/s10463-018-0660-2Search in Google Scholar

[13] Hampel, F.R. (1974). The influence curve and its role in robust estimation. J. Amer. Statist. Assoc. 69(346), 383–393.10.1080/01621459.1974.10482962Search in Google Scholar

[14] Hampel, F.R., E.M. Ronchetti, P.J. Rousseeuw, and W.A. Stahel (1986). Robust Statistics. Wiley, New York.Search in Google Scholar

[15] Jing, B.-Y., J. Yuan, and W. Zhou (2009). Jackknife empirical likelihood. J. Amer. Statist. Assoc. 104(487), 1224–1232.10.1198/jasa.2009.tm08260Search in Google Scholar

[16] Joe, H. (1997). Multivariate Models and Dependence Concepts. Chapman & Hall, London.10.1201/9780367803896Search in Google Scholar

[17] Kendall, M.G. (1938). A new measure of rank correlation. Biometrika 30(1-2), 81–93.10.1093/biomet/30.1-2.81Search in Google Scholar

[18] Knight, W.R. (1966). A computer method for calculating Kendall’s tau with ungrouped data. J. Amer. Statist. Assoc. 61(314), 436–439.10.1080/01621459.1966.10480879Search in Google Scholar

[19] Lai, C.D., J.C.W. Rayner, and T.P. Hutchinson (1999). Robustness of the sample correlation - the bivariate lognormal case. J. Appl. Math. Decis. Sci. 3(1), 7–19.10.1155/S1173912699000012Search in Google Scholar

[20] Lindskog, F., A. McNeil, and U. Schmock (2003). Kendall’s tau for elliptical distributions. In G. Bol, G. Nakhaeizadeh, S.T. Rachev, T. Ridder, and K.-H. Vollmer (Eds.), Credit Risk, pp. 149–156. Springer, Heidelberg.10.1007/978-3-642-59365-9_8Search in Google Scholar

[21] Mari, D.D. and S. Kotz (2001). Correlation and Dependence. Imperial College Press, London.10.1142/p226Search in Google Scholar

[22] Moran, P.A.P. (1948). Rank correlation and permutation distributions. Proc. Cambridge Philos. Soc. 44(1), 142–144.10.1017/S0305004100024087Search in Google Scholar

[23] Mostafa, M.D. and M.W. Mahmoud (1964). On the problem of estimation for the bivariate lognormal distribution. Biometrika 51(3-4), 522–527.10.1093/biomet/51.3-4.522Search in Google Scholar

[24] Nelsen, R.B. (1998). Concordance and Gini’s measure of association. J. Nonparametr. Statist. 9(3), 227–238.10.1080/10485259808832744Search in Google Scholar

[25] Ng, E.W. and M. Geller (1969). A table of integrals of the error functions. J. Res. Nat. Bur. Standards Sect. B 73B(1), 1–20.10.6028/jres.073B.001Search in Google Scholar

[26] Reimherr, M. and D.L. Nicolae (2013). On quantifying dependence: A framework for developing interpretable measures. Statist. Sci. 28(1), 116–130.10.1214/12-STS405Search in Google Scholar

[27] Rényi, A. (1959). On measures of dependence. Acta Math. Acad. Sci. Hungar. 10, 441–451.10.1007/BF02024507Search in Google Scholar

[28] Sang, Y., X. Dang, and H. Sang (2016). Symmetric Gini covariance and correlation. Canad. J. Statist. 44(3), 323–342.10.1002/cjs.11287Search in Google Scholar

[29] Sang, Y., X. Dang, and Y. Zhao (2019). Jackknife empirical likelihood methods for Gini correlations and their equality testing. J. Statist. Plann. Inference 199, 45–59.10.1016/j.jspi.2018.05.004Search in Google Scholar

[30] Scarsini, M. (1984). On measures of concordance. Stochastica 8(3), 201–218.Search in Google Scholar

[31] Schechtman, E. and S. Yitzhaki (1987). A measure of association based on Gini’s mean difference. Comm. Statist. Theory Methods 16(1), 207–231.10.1080/03610928708829359Search in Google Scholar

[32] Schechtman, E. and S. Yitzhaki (2003). A family of correlation coefficients based on the extended Gini index. J. Econ. Inequal. 1, 129–146.10.1023/A:1026152130903Search in Google Scholar

[33] Schechtman, E., S. Yizhaki, and Y. Artsev (2008). The similarity between mean-variance and mean-Gini: Testing for equality of Gini correlations. Adv. Invest. Anal. Portfolio Manag. 3, 97–122.Search in Google Scholar

[34] Serfling, R.J. (1980). Approximation Theorems of Mathematical Statistics. Wiley, New York.10.1002/9780470316481Search in Google Scholar

[35] Shao, J. and D. Tu (1995). The Jackknife and Bootstrap. Springer, New York.10.1007/978-1-4612-0795-5Search in Google Scholar

[36] Shevlyakov G.L. and H. Oja (2016). Robust Correlation. Wiley, Chichester.10.1002/9781119264507Search in Google Scholar

[37] Shevlyakov G.L. and P.O. Smirnov (2011). Robust estimation of the correlation coefficient: An attempt of survey. Austrian J. Stat. 40(1-2), 147–156.Search in Google Scholar

[38] Slutsky, E. (1925). Über stochastische Asymptoten und Grenzwerte. Metron 5(3), 3–89.Search in Google Scholar

[39] Spearman, C. (1904). “General intelligence”, objectively determined and measured. Amer. J. Psych. 15(2), 201–292.10.2307/1412107Search in Google Scholar

[40] Witting, H. and U. Müller-Funk (1995). Mathematische Statistik II. Teubner, Stuttgart.10.1007/978-3-322-90152-1Search in Google Scholar

[41] Xu, W., Y.S. Huang, M. Niranjan, and M. Shen (2010). Asymptotic mean and variance of Gini correlation for bivariate normal samples. IEEE Trans. Signal Process. 58(2), 522–534.10.1109/TSP.2009.2032448Search in Google Scholar

[42] Yitzhaki, S. and I. Olkin (1991). Concentration indices and concentration curves. In K. Mosler and M. Scarsini (Eds.), Stochastic Orders and Decision Under Risk, pp. 380–392. Institute of Mathematical Statistics, Hayward CA.10.1214/lnms/1215459867Search in Google Scholar

[43] Yitzhaki, S. and E. Schechtman (2013). The Gini Methodology. Springer, New York.10.1007/978-1-4614-4720-7Search in Google Scholar

Received: 2020-10-07
Accepted: 2020-11-26
Published Online: 2020-12-16

© 2020 Courtney Vanderford et al., published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 24.3.2023 from https://www.degruyter.com/document/doi/10.1515/demo-2020-0020/html
Scroll Up Arrow