I. Introduction
Holmes and Anderson (Reference Holmes and Anderson2017a) (henceforth HA) introduce two extensive data sets on world alcohol consumption and expenditure and with them investigate, among other things, the possible convergence of national alcohol consumption patterns.Footnote 1 The general argument here is that, with increasing globalization and interactions between cultures, country consumption patterns are converging. Such convergence might raise concerns if consumers in developing countries copy trends in high-income countries that might be considered undesirable from a public health perspective. Tobacco and sugar consumption (the latter also in terms of sweetened non-alcoholic drinks) are obvious concerns, but so too is the consumption of alcohol.
Producers also monitor consumer trends and focus not just on overall levels of consumption but also on changes in composition or mix that might indicate changes in consumer preferences or behaviours. HA report that alcoholic drink producers are clearly aware that wine's share of global recorded alcohol consumption volume has more than halved between 1961 and 2015, falling from 35% to 15%, while beer's share has increased from 29% to 42%, and spirits’ share rose from 36% to 43%, over the same period.
The extent to which alcohol consumption patterns are converging across countries has thus been the subject of numerous studies but, prior to HA, such studies have necessarily been limited, because of the paucity of available data, to analysing specific regions or groups of high-income countries or just one or two types of alcohol. HA use their extensive set of annual data from 1961 to 2014 on more than 50 countries and regions to analyse convergence of recorded alcohol consumption and of the beverage mix using a range of convergence indices.
The aim of this article is to add to HA's analysis by utilising the compositional characteristics of wine, beer, and spirits shares in total alcohol consumption to measure convergence over time in national alcohol consumption mixes toward the (changing) world average mix.
II. Compositional Data Analysis of Alcohol Shares
The initial focus here is on the wine, beer, and spirits share of total alcohol consumption for country i in year t. If these shares are denoted w i,t, b i,t, and s i,t then, as well as the non-negativity constraints w i,t ≥ 0, b i,t ≥ 0, and s i,t ≥ 0, there is also the adding-up constraint w i,t + b i,t + s i,t = 1. If there are N countries and T years of observations, then this data may be represented by the set of N × 3 matrices

where w t + b t + s t = ι = [ 1 1| … 1 ]′, the N × 1 unit vector. The sub-matrix

then lies in the two-dimensional simplex 𝒮 2 embedded in three-dimensional real space ℛ 3 with s t = ι − w t − b t being the vector of “fill-up” values and ${\bi X}_t = \left[ {\matrix{ {{\bi X}_t^{\left( 2 \right)}} & {{\bi s}_t} \cr}} \right]$.
There are several difficulties with analysing Xt within the simplex sample space, these being a consequence of the summation constraint rendering standard covariance and correlation analysis invalid. These were foreseen by Pearson (Reference Pearson1897) but were brought to the attention of applied statisticians by Aitchison (Reference Aitchison1982), who subsequently followed this article with his book (Aitchison, Reference Aitchison2003). For example, consider the crude covariance matrix of the shares for a particular year (the subscript t is now omitted for notational convenience)

with accompanying crude correlations

Now consider the covariance between w and w + b + s, σ w,w+b+s. Because of the unit sum constraint this is the covariance between w and ι, which is clearly zero. However, this then implies that, since $\sigma _{w,w + b + s} = \sigma _w^2 + \sigma _{wb} + \sigma _{ws}$, it must be the case that

The right-hand side of equation (1) must be negative except for the trivial situation where the wine share is constant across all countries. Thus at least one of the covariances on the left must be negative or, equivalently, that there must be at least one negative element in the first row of the crude covariance matrix K. The same negative bias will similarly occur in the other rows of K so that all three covariances must be negative.
In terms of the correlations, it would normally be the case that the three correlations ρ wb, ρ ws, and ρ bs could take any values in the interval –1 to + 1 subject to the non-negative definiteness condition

For example, since 1 + 2ρ 2 − 3ρ 3 ≥ 0 for 0 ≤ ρ ≤ 1, it should be perfectly feasible for all three correlations to be equal and positive. Such a set of correlations for a three-part composition is, however, prohibited by the negative bias property.
These difficulties also affect the coefficient of variation employed by HA as one of their indicators of convergence. The coefficients of variation for the three shares are defined as

where μ w , μ b, and μ s are the mean shares. Given the unit sum constraint it is straightforward to show that since, for example, μ s = 1 − μ b − μ w and $\sigma _s^2 = - \left( {\sigma _{bs} + \sigma _{ws}} \right)$, then

and dependencies therefore exist between the three coefficients of variation, thus rendering problematic any conclusions about convergence based on these statistics.
Because of the difficulties in interpreting covariances and correlations calculated from Xt within the simplex sample space, Aitchison (Reference Aitchison1982, Reference Aitchison2003) thus proposed mapping ${\bi X}_t^{\left( 2 \right)} $ from 𝒮 2 to the two-dimensional real space ℛ 2 and then examining the statistical properties of the transformed data within ℛ 2.Footnote 2 Several transformations have been proposed for doing this, the most popular being the additive log-ratio, defined, for the general three-shares composition

where

as

The inverse transformation, known as the additive-logistic, is


where

Although it is important to note that a d (·) is invariant to the choice of fill-up share, so that nothing in the statistical analysis hinges on which of the shares is chosen for this role, it is possible to avoid choosing a fill-up by using the centred log-ratio transformation, defined as

where

is the vector of geometric means.
Although the centred log-ratio transformation introduces a singularity since Zt ι = 0, which has limited its application in favour of the additive log-ratio transformation, it does have a symmetric, albeit singular, covariance matrix

In the application considered here this is a major advantage over the additive log-ratio transform, for a measure of the total variability of Xt is given by the trace of Γt

where λ 1 > λ 2 are the eigenvalues of Γt (Aitchison, Reference Aitchison2003, pp. 190–191, Definition 8.1 and Property 8.1).
III. Convergence of Alcohol Shares through Time
The trace of the centred log-ratio covariance matrix, tr(Γt), thus provides a measure of the total variation in alcohol shares in year t, and computation of this statistic across the T years for which data on alcohol shares are available will therefore allow an assessment of the extent of convergence to the (changing) mean shares through time.
Annual data from 1961 to 2014 on wine, beer, and spirits shares of total alcohol consumption volume is available for up to 53 countries and regions each year in Tables 53(a)–(c) of Anderson and Pinilla (Reference Anderson and Pinilla2017).Footnote 3 For each year t = 1961, · · · , 2014 the centred log-ratios were calculated for the set of country shares available and the covariance matrix Γt constructed. From this the eigenvalues were computed and tr(Γt) obtained. The sequence of 54 traces are plotted in Figure 1 with the fitted quadratic trend

superimposed to emphasise the decline over time in this measure of total variability.

Figure 1 Trace of Log-Ratio Covariance Matrix, 1961–2014, with Fitted Quadratic Trend Superimposed
It is thus clear that, using this metric of total variation based on the compositional character of shares data, that the variation in alcohol shares around the global mean shares has declined substantially over the half-century from the early 1960s, thus confirming, in a statistically valid way, the findings of HA.
IV. Conclusions
In this article we have utilised the compositional properties of country alcohol share data to compute a statistically valid measure of variation in alcohol shares for each year from 1961 to 2014. This measure shows a quadratically declining trend in variation over this period and thus confirms HA's finding that the cross-country alcohol mix has converged substantially towards the global alcohol mix over the last half century or so.