Kendall's W

Kendall's W (also known as Kendall's coefficient of concordance) is a non-parametric statistic. It is a normalization of the statistic of the Friedman test, and can be used for assessing agreement among raters. Kendall's W ranges from 0 (no agreement) to 1 (complete agreement).

Suppose, for instance, that a number of people have been asked to rank a list of political concerns, from most important to least important. Kendall's W can be calculated from these data. If the test statistic W is 1, then all the survey respondents have been unanimous, and each respondent has assigned the same order to the list of concerns. If W is 0, then there is no overall trend of agreement among the respondents, and their responses may be regarded as essentially random. Intermediate values of W indicate a greater or lesser degree of unanimity among the various responses.

While tests using the standard Pearson correlation coefficient assume normally distributed values and compare two sequences of outcomes at a time, Kendall's W makes no assumptions regarding the nature of the probability distribution and can handle any number of distinct outcomes.

W is linearly related to the mean of Spearman's rank correlation coefficients taken over all pairs of the rankings from which it is calculated. For m rankings without ties, that mean equals (mW - 1)/(m - 1).
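The linear relation between W and the mean pairwise Spearman coefficient, mean rho = (mW - 1)/(m - 1), can be checked numerically. The sketch below assumes complete rankings without ties and uses the standard untied formula W = 12S / (m^2 (n^3 - n)). The function names and the sample data are illustrative only and do not come from any particular library.

```python
from itertools import combinations

def kendalls_w(rankings):
    """Kendall's W for m complete, untied rankings of the same n objects.

    rankings[j][i] is the rank that judge j gives to object i.
    """
    m = len(rankings)
    n = len(rankings[0])
    totals = [sum(judge[i] for judge in rankings) for i in range(n)]
    mean_total = sum(totals) / n
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))

def spearman_rho(a, b):
    """Spearman's rho for two untied rankings of the same n objects."""
    n = len(a)
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

def mean_pairwise_spearman(rankings):
    """Average Spearman's rho over all unordered pairs of judges."""
    pairs = list(combinations(rankings, 2))
    return sum(spearman_rho(a, b) for a, b in pairs) / len(pairs)

# Hypothetical data: four judges ranking five objects.
rankings = [
    [1, 2, 3, 4, 5],
    [2, 1, 3, 5, 4],
    [1, 3, 2, 4, 5],
    [5, 4, 3, 2, 1],
]
m = len(rankings)
w = kendalls_w(rankings)
mean_rho = mean_pairwise_spearman(rankings)

# The two printed values should agree up to floating-point error.
print(mean_rho, (m * w - 1) / (m - 1))
```

One consequence of the relation is that W = 0 corresponds to a mean pairwise correlation of -1/(m - 1) rather than 0, which is the most negative average correlation that m rankings can have.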

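Suppose that object i is given the rank r_{i,j} by judge j, where there are n objects and m judges in total. Then the total rank given to object i is

$$R_i = \sum_{j=1}^{m} r_{i,j},$$

and the mean value of these total ranks is

$$\bar{R} = \frac{1}{n} \sum_{i=1}^{n} R_i.$$

The sum of squared deviations, S, is defined as

$$S = \sum_{i=1}^{n} (R_i - \bar{R})^2,$$

and then Kendall's W is defined as

$$W = \frac{12 S}{m^2 (n^3 - n)}.$$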

If W is 1, all the judges have been unanimous: each has assigned the same order to the objects, so the total ranks are as spread out as possible and S reaches its maximum value, m^2 (n^3 - n)/12. If W is 0, the total ranks of all objects are equal; there is no overall trend of agreement among the judges, and their responses may be regarded as essentially random. Intermediate values of W indicate a greater or lesser degree of unanimity among the judges.
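The definition can be sketched in a few lines of Python. This is a minimal illustration that assumes complete rankings without ties (each judge uses every rank from 1 to n exactly once). The function name `kendalls_w` and the sample rankings are hypothetical.

```python
def kendalls_w(rankings):
    """Kendall's W for m complete, untied rankings of the same n objects.

    rankings[j][i] is the rank r_{i,j} that judge j gives to object i.
    Assumes n >= 2 and that every judge ranks all n objects.
    """
    m = len(rankings)          # number of judges
    n = len(rankings[0])       # number of objects
    # Total rank R_i of each object, summed over judges.
    totals = [sum(judge[i] for judge in rankings) for i in range(n)]
    # Mean of the total ranks.
    mean_total = sum(totals) / n
    # Sum of squared deviations S.
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))

# Three judges in complete agreement.
unanimous = [[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]
# Two judges with exactly opposite orders: every total rank is 5.
opposed = [[1, 2, 3, 4], [4, 3, 2, 1]]
# Three judges who mostly, but not fully, agree.
mixed = [[1, 2, 3, 4], [2, 1, 3, 4], [1, 3, 2, 4]]

w_unanimous = kendalls_w(unanimous)  # 1.0
w_opposed = kendalls_w(opposed)      # 0.0
w_mixed = kendalls_w(mixed)          # strictly between 0 and 1
print(w_unanimous, w_opposed, w_mixed)
```

Tied ranks need a corrected denominator, which this sketch deliberately leaves out.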

Legendre discusses a variant of the W statistic that accommodates ties in the rankings, and describes methods for significance tests based on W. Legendre also compared, by simulation, the Friedman test and its permutation version. That simulation study was limited, however, because it considered neither the copula aspect nor the F test. Kendall's W is a rank-based measure, so it is not affected by the marginal distributions of the underlying variables, only by the copula of their multivariate distribution. Marozzi extended Legendre's simulation study to cover both the copula aspect and the F test. That study found the Friedman test to be too conservative and less powerful than both the F test and the permutation test for concordance, which always have the correct size and behave alike. Of these two, the F test should be preferred because it is computationally much easier. Surprisingly, the power functions of the tests are not much affected by the type of copula.
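The three tests mentioned above can be sketched as follows. This is an illustrative sketch, not the code used in either study. It assumes untied rankings, the usual conversions chi-square = m(n - 1)W and F = (m - 1)W / (1 - W), and the commonly cited degrees of freedom n - 1 - 2/m and (m - 1)(n - 1 - 2/m) for the F statistic. The permutation scheme, which reshuffles each judge's ranks independently, is one common choice. All function names and the data are hypothetical.

```python
import random

def kendalls_w(rankings):
    """Kendall's W for m complete, untied rankings of the same n objects."""
    m = len(rankings)
    n = len(rankings[0])
    totals = [sum(judge[i] for judge in rankings) for i in range(n)]
    mean_total = sum(totals) / n
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))

def friedman_chi2(w, m, n):
    """Friedman's chi-square statistic recovered from W."""
    return m * (n - 1) * w

def f_statistic(w, m, n):
    """F statistic derived from W, with its approximate degrees of freedom.

    Undefined when w == 1 (division by zero).
    """
    f = (m - 1) * w / (1 - w)
    df1 = n - 1 - 2 / m
    df2 = df1 * (m - 1)
    return f, df1, df2

def permutation_p_value(rankings, n_perm=999, seed=0):
    """One-sided permutation p-value for W.

    Each judge's ranks are shuffled independently, which mimics the null
    hypothesis that the judges' rankings are unrelated.
    """
    rng = random.Random(seed)
    observed = kendalls_w(rankings)
    hits = 0
    for _ in range(n_perm):
        shuffled = [rng.sample(judge, len(judge)) for judge in rankings]
        # Small tolerance guards against floating-point ties.
        if kendalls_w(shuffled) >= observed - 1e-12:
            hits += 1
    # The observed arrangement counts as one of the permutations.
    return (hits + 1) / (n_perm + 1)

# Hypothetical data: four judges ranking five objects.
rankings = [
    [1, 2, 3, 4, 5],
    [2, 1, 3, 5, 4],
    [1, 3, 2, 4, 5],
    [1, 2, 4, 3, 5],
]
m, n = len(rankings), len(rankings[0])
w = kendalls_w(rankings)
chi2 = friedman_chi2(w, m, n)
f, df1, df2 = f_statistic(w, m, n)
p_perm = permutation_p_value(rankings)
print(w, chi2, (f, df1, df2), p_perm)
```

Turning the chi-square and F statistics into p-values requires the corresponding distribution functions, for example from a statistics library. The permutation p-value needs no distributional assumption but costs one evaluation of W per permutation.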

...
Wikipedia