Hostname: page-component-7b9c58cd5d-dlb68 Total loading time: 0 Render date: 2025-03-15T11:52:27.221Z Has data issue: false hasContentIssue false

Is There Convergence in National Alcohol Consumption Patterns? Evidence from a Compositional Time Series Approach

Published online by Cambridge University Press:  26 February 2018

Terence C. Mills*
Affiliation:
School of Business and Economics, Loughborough University, Ashby Road, Loughborogh, Leics LE11 3TU, U.K; e-mail: t.c.mills@lboro.ac.uk.
Rights & Permissions [Opens in a new window]

Abstract

Holmes and Anderson (2017a) introduce two extensive data sets on world alcohol consumption and expenditure and with them investigate, among other things, the possible convergence of national alcohol consumption patterns using wine, beer, and spirit shares. Such share data define a composition, on which conventional statistical analysis using covariances and correlations is invalid. This note reanalyses the data using techniques appropriate for a composition and introduces a statistic that can validly track the variation in national shares around the global mean through time. This variability statistic shows that such convergence of national alcohol patterns has clearly taken place over the period 1961 to 2014 and thus confirms Holmes and Anderson's findings using a valid statistical approach. (JEL Classifications: C18, D12, L66)

Type
Articles
Copyright
Copyright © American Association of Wine Economists 2018 

I. Introduction

Holmes and Anderson (Reference Holmes and Anderson2017a) (henceforth HA) introduce two extensive data sets on world alcohol consumption and expenditure and with them investigate, among other things, the possible convergence of national alcohol consumption patterns.Footnote 1 The general argument here is that, with increasing globalization and interactions between cultures, country consumption patterns are converging. Such convergence might raise concerns if consumers in developing countries copy trends in high-income countries that might be considered undesirable from a public health perspective. Tobacco and sugar consumption (the latter also in terms of sweetened non-alcoholic drinks) are obvious concerns, but so too is the consumption of alcohol.

Producers also monitor consumer trends and focus not just on overall levels of consumption but also on changes in composition or mix that might indicate changes in consumer preferences or behaviours. HA report that alcoholic drink producers are clearly aware that wine's share of global recorded alcohol consumption volume has more than halved between 1961 and 2015, falling from 35% to 15%, while beer's share has increased from 29% to 42%, and spirits’ share rose from 36% to 43%, over the same period.

The extent to which alcohol consumption patterns are converging across countries has thus been the subject of numerous studies but, prior to HA, such studies have necessarily been limited, because of the paucity of available data, to analysing specific regions or groups of high-income countries or just one or two types of alcohol. HA use their extensive set of annual data from 1961 to 2014 on more than 50 countries and regions to analyse convergence of recorded alcohol consumption and of the beverage mix using a range of convergence indices.

The aim of this article is to add to HA's analysis by utilising the compositional characteristics of wine, beer, and spirits shares in total alcohol consumption to measure convergence over time in national alcohol consumption mixes toward the (changing) world average mix.

II. Compositional Data Analysis of Alcohol Shares

The initial focus here is on the wine, beer, and spirits share of total alcohol consumption for country i in year t. If these shares are denoted w i,t, b i,t, and s i,t then, as well as the non-negativity constraints w i,t ≥ 0, b i,t ≥ 0, and s i,t ≥ 0, there is also the adding-up constraint w i,t + b i,t + s i,t = 1. If there are N countries and T years of observations, then this data may be represented by the set of N × 3 matrices

$${\bi X}_t = \left[ {\matrix{ {\matrix{ {w_{1,t}} \cr {\matrix{ {w_{2,t}} \cr \vdots \cr {w_{N,t}} \cr}} \cr}} & {\matrix{ {\matrix{ {b_{1,t}} \cr {b_{2,t}} \cr \vdots \cr}} \cr {b_{N,t}} \cr}} & {\matrix{ {\matrix{ {s_{1,t}} \cr {s_{2,t}} \cr \vdots \cr}} \cr {s_{N,t}} \cr}} \cr}} \right] = \left[ {\matrix{ {{\bi w}_t} & {{\bi b}_t} & {{\bi s}_t} \cr}} \right]\, \quad\quad t = 1,2, \cdots, T,$$

where w t + b t + s t = ι = [  1    1|    …    1 ]′, the N × 1 unit vector. The sub-matrix

$${\bi X}_t^{\left( 2 \right)} = \left[ {\matrix{ {{\bi w}_t} & {{\bi b}_t} \cr}} \right],$$

then lies in the two-dimensional simplex 𝒮 2 embedded in three-dimensional real space 3 with s t = ιw tb t being the vector of “fill-up” values and ${\bi X}_t = \left[ {\matrix{ {{\bi X}_t^{\left( 2 \right)}} & {{\bi s}_t} \cr}} \right]$.

There are several difficulties with analysing Xt within the simplex sample space, these being a consequence of the summation constraint rendering standard covariance and correlation analysis invalid. These were foreseen by Pearson (Reference Pearson1897) but were brought to the attention of applied statisticians by Aitchison (Reference Aitchison1982), who subsequently followed this article with his book (Aitchison, Reference Aitchison2003). For example, consider the crude covariance matrix of the shares for a particular year (the subscript t is now omitted for notational convenience)

$${\bi {K}} = \left[ {\matrix{ {\sigma _w^2} & {\sigma _{wb}} & {\sigma _{ws}} \cr {\sigma _{wb}} & {\sigma _b^2} & {\sigma _{bs}} \cr {\sigma _{ws}} & {\sigma _{bs}} & {\sigma _s^2} \cr}} \right],$$

with accompanying crude correlations

$$\rho _{wb} = \sigma _{wb}/\sigma _w\sigma _b\, \quad\quad \rho _{ws} = \sigma _{ws}/\sigma _w\sigma _s\, \quad\quad \rho _{bs} = \sigma _{bs}/\sigma _b\sigma _s.$$

Now consider the covariance between w and w + b + s, σ w,w+b+s. Because of the unit sum constraint this is the covariance between w and ι, which is clearly zero. However, this then implies that, since $\sigma _{w,w + b + s} = \sigma _w^2 + \sigma _{wb} + \sigma _{ws}$, it must be the case that

(1)$$\sigma _{wb} + \sigma _{ws} = - \sigma _w^2.$$

The right-hand side of equation (1) must be negative except for the trivial situation where the wine share is constant across all countries. Thus at least one of the covariances on the left must be negative or, equivalently, that there must be at least one negative element in the first row of the crude covariance matrix K. The same negative bias will similarly occur in the other rows of K so that all three covariances must be negative.

In terms of the correlations, it would normally be the case that the three correlations ρ wb, ρ ws, and ρ bs could take any values in the interval –1 to + 1 subject to the non-negative definiteness condition

$$1 + 2\rho _{wb}\rho _{ws}\rho _{ws} - \rho _{wb}^2 - \rho _{ws}^2 - \rho _{bs}^2 \ge 0.$$

For example, since 1 + 2ρ 2 − 3ρ 3 ≥ 0 for 0 ≤ ρ ≤ 1, it should be perfectly feasible for all three correlations to be equal and positive. Such a set of correlations for a three-part composition is, however, prohibited by the negative bias property.

These difficulties also affect the coefficient of variation employed by HA as one of their indicators of convergence. The coefficients of variation for the three shares are defined as

$$cv\left( w \right) = \sigma _w/\mu _w\, \quad\quad cv\left( b \right) = \sigma _b/\mu _b\,\quad\quad cv\left( s \right) = \sigma _s/\mu _s,$$

where μ w , μ b, and μ s are the mean shares. Given the unit sum constraint it is straightforward to show that since, for example, μ s = 1 − μ b − μ w and $\sigma _s^2 = - \left( {\sigma _{bs} + \sigma _{ws}} \right)$, then

$$cv\left( s \right) = \sqrt { - \left( {\sigma _{bs} + \sigma _{ws}} \right)} /\left( {1 - \mu _b - \mu _w} \right) = \sqrt { - \left( {\sigma _{bs} + \sigma _{ws}} \right)} /\left( {1 - \sigma _bcv\left( b \right) - \sigma _wcv\left( w \right)} \right),$$

and dependencies therefore exist between the three coefficients of variation, thus rendering problematic any conclusions about convergence based on these statistics.

Because of the difficulties in interpreting covariances and correlations calculated from Xt within the simplex sample space, Aitchison (Reference Aitchison1982, Reference Aitchison2003) thus proposed mapping ${\bi X}_t^{\left( 2 \right)} $ from 𝒮 2 to the two-dimensional real space 2 and then examining the statistical properties of the transformed data within 2.Footnote 2 Several transformations have been proposed for doing this, the most popular being the additive log-ratio, defined, for the general three-shares composition

$${\bi X}_t = \left[ {\matrix{ {{\bi x}_{1,t}} & {{\bi x}_{2,t}} & {{\bi x}_{3,t}} \cr}} \right] = \left( {\matrix{ {{\bi X}_t^{\left( 2 \right)}} & {{\bi x}_{3,t}} \cr}} \right),$$

where

$${\bi x}_{i,t} = \left( {\matrix{ {x_{i1,t}} & \cdots & {x_{iN,t}} \cr}} \right)^{\prime}\,\quad\quad i = 1,\; 2,\; 3,$$

as

$${\bi Y}_t = \left[ {\matrix{ {{\bi y}_{1,t}} & {{\bi y}_{2,t}} \cr}} \right] = a_2\left( {{\bi X}_t^{\left( 2 \right)}} \right) = \left[ {\matrix{ {log\left( {{\bi x}_{1,t}/{\bi x}_{3,t}} \right)} & {log\left( {{\bi x}_{2,t}/{\bi x}_{3,t}} \right)} \cr}} \right].$$

The inverse transformation, known as the additive-logistic, is

$${\bi X}_t^{\left( 2 \right)} = a_2^{ - 1} \left( {{\bi Y}_{\bi t}} \right) = \left[ {\matrix{ {exp\left( {{\bi y}_{1,t}} \right)/{\bi y}_t} & {\matrix{ {exp\left( {{\bi y}_{2,t}} \right)/{\bi y}_t} & {exp\left( {{\bi y}_{3,t}} \right)/{\bi y}_t} \cr}} \cr}} \right]$$
$${\bi x}_{3,t} = 1/{\bi y}_t,$$

where

$${\bi y}_t = 1 + exp\left( {{\bi y}_{1,t}} \right) + exp\left( {{\bi y}_{2,t}} \right).$$

Although it is important to note that a d (·) is invariant to the choice of fill-up share, so that nothing in the statistical analysis hinges on which of the shares is chosen for this role, it is possible to avoid choosing a fill-up by using the centred log-ratio transformation, defined as

$${\bi Z}_t = c_d\left( {{\bi X}_t^{\left( 2 \right)}} \right) = \left[ {\matrix{ {log\left( {{\bi x}_{1,t}/g\left( {{\bi X}_t} \right)} \right)} & {log\left( {{\bi x}_{2,t}/g\left( {{\bi X}_t} \right)} \right)} & {log\left( {{\bi x}_{3,t}/g\left( {{\bi X}_t} \right)} \right)} \cr}} \right],$$

where

$$g\left( {{\bi X}_t} \right) = \left[ {\matrix{ {{\left( {x_{11,t}x_{21,t}x_{31,t}} \right)}^{1/3}} \cr {{\left( {x_{12,t}x_{22,t}x_{32,t}} \right)}^{1/3}} \cr {\matrix{ \vdots \cr {{\left( {x_{1N,t}x_{2N,t}x_{3N,t}} \right)}^{1/3}} \cr}} \cr}} \right],$$

is the vector of geometric means.

Although the centred log-ratio transformation introduces a singularity since Zt ι = 0, which has limited its application in favour of the additive log-ratio transformation, it does have a symmetric, albeit singular, covariance matrix

$${\bi \Gamma} _t = \left[ {\gamma _{ij,t}} \right] = \left[ {cov\left\{ {log\left( {{\bi x}_{i,t}/g\left( {{\bi X}_t} \right)} \right),log\left( {{\bi x}_{\,j,t}/g\left( {{\bi X}_t} \right)} \right)} \right\}:i,j = 1,2,3} \right].$$

In the application considered here this is a major advantage over the additive log-ratio transform, for a measure of the total variability of Xt is given by the trace of Γt

$$tr\left( {{\bi \Gamma} _t} \right) = \lambda _1 + \lambda _2,$$

where λ 1 > λ 2 are the eigenvalues of Γt (Aitchison, Reference Aitchison2003, pp. 190–191, Definition 8.1 and Property 8.1).

III. Convergence of Alcohol Shares through Time

The trace of the centred log-ratio covariance matrix, tr(Γt), thus provides a measure of the total variation in alcohol shares in year t, and computation of this statistic across the T  years for which data on alcohol shares are available will therefore allow an assessment of the extent of convergence to the (changing) mean shares through time.

Annual data from 1961 to 2014 on wine, beer, and spirits shares of total alcohol consumption volume is available for up to 53 countries and regions each year in Tables 53(a)–(c) of Anderson and Pinilla (Reference Anderson and Pinilla2017).Footnote 3 For each year t = 1961, · · · , 2014 the centred log-ratios were calculated for the set of country shares available and the covariance matrix Γt constructed. From this the eigenvalues were computed and tr(Γt) obtained. The sequence of 54 traces are plotted in Figure 1 with the fitted quadratic trend

$$\widehat{{tr\left( {{\bi \Gamma} _t} \right)}} = 9.99 - 0.230t + 0.0015t^2,$$

superimposed to emphasise the decline over time in this measure of total variability.

Figure 1 Trace of Log-Ratio Covariance Matrix, 1961–2014, with Fitted Quadratic Trend Superimposed

It is thus clear that, using this metric of total variation based on the compositional character of shares data, that the variation in alcohol shares around the global mean shares has declined substantially over the half-century from the early 1960s, thus confirming, in a statistically valid way, the findings of HA.

IV. Conclusions

In this article we have utilised the compositional properties of country alcohol share data to compute a statistically valid measure of variation in alcohol shares for each year from 1961 to 2014. This measure shows a quadratically declining trend in variation over this period and thus confirms HA's finding that the cross-country alcohol mix has converged substantially towards the global alcohol mix over the last half century or so.

Footnotes

The author is grateful for the anonymous comments of a referee.

1 References to these data sets are Holmes and Anderson (Reference Holmes and Anderson2017b) and Anderson and Pinilla (Reference Anderson and Pinilla2017). The data series used here were extracted from the megafile_of_global_wine _data_1835_to_2016_031117.xlxs, which is available at www.adelaide.edu.au/wine-econ/databases.

2 The following results were presented by Aitchison (Reference Aitchison2003) for the general d-dimensional simplex; see also Mills (Reference Mills2007, Reference Mills2009, Reference Mills2010).

3 A small number of countries have occasional zero shares in some years. The presence of a zero share obviously precludes using a log-ratio transformation. Although a literature does exist on methods for dealing with zero shares (see Mills, Reference Mills2007, for a survey), they all essentially replace the zero share with a small positive value and reduce the other shares by appropriate amounts. Because of the small number of shares affected, zero shares were replaced by 0.001%, with the other two shares for that country being decreased by 0.0005% in each case.

References

Aitchison, J. (1982). The statistical analysis of compositional data (with discussion). Journal of the Royal Statistical Society, Series B 44(2), 139177.Google Scholar
Aitchison, J. (2003). The Statistical Analysis of Compositional Data. Caldwell, NJ: Blackburn Press.Google Scholar
Anderson, K., and Pinilla, V. (with the assistance of A. J. Holmes) (2017). Annual Database of Global Wine Markets, 1835 to 2016. Wine Economics Research Centre, University of Adelaide. Available at www.adelaide.edu.au/wine-econ/databases/global-wine-history.Google Scholar
Holmes, A. J., and Anderson, K. (2017a). Convergence in national alcohol consumption patterns: New global indicators. Journal of Wine Economics, 12(2), 117148.CrossRefGoogle Scholar
Holmes, A. J., and Anderson, K. (2017b). Annual Database of National Beverage Consumption Volumes and Expenditures, 1950 to 2015. Wine Economics Research Centre, University of Adelaide. Available at www.adelaide.edu.au/wine-econ/databases/alcohol-consumption.Google Scholar
Mills, T. C. (2007). Modelling compositional economic data. Indian Economic Journal, 55(3), 99115.Google Scholar
Mills, T. C. (2009). Forecasting obesity trends in England. Journal of the Royal Statistical Society, Series A 172(1), 107117.Google Scholar
Mills, T. C. (2010). Forecasting compositional time series. Quality and Quantity, 44(4), 673690.Google Scholar
Pearson, K. (1897). Mathematical contributions to the theory of evolution. On a form of spurious correlation which may arise when indices are used in the measurement of organs. Proceedings of the Royal Society of London, Series A 60, 489498.Google Scholar
Figure 0

Figure 1 Trace of Log-Ratio Covariance Matrix, 1961–2014, with Fitted Quadratic Trend Superimposed