In search of an environmental Kuznets curve in sulphur dioxideconcentrations: a Bayesian model averaging approach

JEFFREY BEGUN; THEO S. EICHER

doi:10.1017/S1355770X08004531

In search of an environmental Kuznets curve in sulphur dioxideconcentrations: a Bayesian model averaging approach

Published online by Cambridge University Press: 27 August 2008

JEFFREY BEGUN and

THEO S. EICHER

Show author details

JEFFREY BEGUN: Affiliation:
University of Washington, Box 353330, Seattle, WA 98195-3330, USA. Email: jbegun@u.washington.edu
THEO S. EICHER: Affiliation:
University of Washington, Box 353330, Seattle WA 98195-3330, USA. Email: te@u.washington.edu

Article contents

Abstract
Introduction
Searching for an EKC in SO2 concentrations
Model uncertainty in the income/environment relationship
Motivating EKC candidate regressors
Empirical results
Conclusion
Footnotes
References

Rights & Permissions

Abstract

The exact specification and motivation for an environmental Kuznets curve (EKC)is the subject of a vast literature in environmental economics. A remarkablydiverse set of econometric approaches and candidate regressors have beenproposed, which highlights the degree of model uncertainty surrounding therelationship between environmental quality and pollution. We introduce Bayesianmodel averaging (BMA) to the EKC analysis to examine: (a) whether a sulphurdioxide EKC exists, and if so (b) which income/pollution specification issupported by the data. BMA addresses model uncertainty as part of the empiricalstrategy by incorporating the uncertainty about the validity of competingtheories into the posterior distribution. We find only weak support for an EKC,which disappears altogether when we address issues relating to the extremeoversampling of two industrialized countries in the sample. In contrast, ourresults highlight the relative importance of political economy and site-specificvariables (specifically executive constraints and precipitation variation) inexplaining pollution outcomes. Trade is shown to play an important indirectrole, as it moderates the influence of the composition effect on pollution.

Type: Research Article
Information: Environment and Development Economics , Volume 13 , Issue 6: SUSTAINABLE DEVELOPMENT , December 2008 , pp. 795 - 822

DOI: https://doi.org/10.1017/S1355770X08004531 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2008

1. Introduction

A vast empirical literature has sought to establish a robust relationship between economic development and environmental quality. Grossman and Krueger (Reference Grossman and Krueger1995) and Selden and Song (Reference Selden and Song1994) documented an inverted U-shaped curve between income and pollution that is similar to Kuznets's (Reference Kuznets1955) inverted U-shaped relationship between income and inequality. In subsequent research, a large number of authors failed to confirm an ‘environmental Kuznets curve’ (EKC), either in the original Grossman and Krueger dataset, or in updated and expanded pollution datasets (e.g., Harbaugh et al., Reference Harbaugh, Levinson and Wilson2002, or Deacon and Norman, Reference Deacon and Norman2006). The conflicting empirical results have given rise to intense efforts to further explore the income/pollution relationship either by introducing formal models (see, e.g., Antweiler et al., Reference Antweiler, Copeland and Taylor2001), or by adding further control variables to reduced-form regressions (see Dasgupta et al., Reference Dasgupta, Laplante, Wang and Wheeler2002 for a survey).

The EKC is thus a case study of extreme model uncertainty where the true model is unknown and several competing approaches exist that hypothesize about the exact relationship between environmental quality and income. In light of such model uncertainty, inference procedures based on a single regression model overstate the precision of coefficient estimates. Precision is overestimated because the uncertainty surrounding the validity of a theory has not been taken into account (Raftery, Reference Raftery1995). The problem is particularly prevalent in the EKC literature since a number of well-founded approaches exist and researchers face an abundance of possible candidate regressors.

The Bayesian solution to model uncertainty is to base inferences on all competing models, each weighted by the posterior probability that the model is indeed the true model. The procedure delivers a posterior distribution for each candidate regressor, whose mean is a weighted estimate derived from all relevant models. In environmental economics, prominent examples of BMA applications include the modeling of population determinants for deer (Farnsworth et al., Reference Farnsworth, Hoeting, Hobbs and Miller2006), fish (Fernandez et al., Reference Fernández, Ley and Steel2002) as well as pollution mortality (Koop and Tole, Reference Koop and Tole2006). To our knowledge, we are the first to apply Bayesian model averaging to resolve the model uncertainty surrounding the EKC relationship.

Our strategy is to group EKC approaches into two categories. First we examine reduced-form approaches to the EKC, where many possible determinants of pollution are introduced. This branch of the literature is vast, but suffers from the criticism that the direct and indirect effects of variables cannot be disentangled. The approach therefore cannot identify intervening factors that lead to an apparent relationship between income and pollution. As an alternative, we examine specific theories that have been proposed as the underlying determinants of an EKC, and scrutinize whether the data support theory-based candidate regressors.Footnote ¹ In this case, we have a clearly predetermined set of regressors that are expected to affect pollution concentrations.

Before we summarize our results, it is important to note that the updated SO₂ data that have been extended and cleaned of previous errors no longer exhibit the EKC relationship that Grossman and Krueger (Reference Grossman and Krueger1995) discovered (see, e.g., Harbaugh et al., Reference Harbaugh, Levinson and Wilson2002). Our results below can therefore be seen as an effort to find robust evidence for an EKC in this dataset by eliminating possible omitted variable bias. We find only limited evidence for an income/pollution relationship once we account for model uncertainty. Instead, robustly related regressors in both reduced-form and theory-based approaches are those relating to political economy, site-specific effects, and trade (the individual proxies for each category are motivated in sections 3 and 4). Societies that are more open in terms of political participation are shown to exhibit significantly lower air pollution. The theory-based approach highlights the power of both direct and indirect effects (where indirect refers to interactions where one variable moderates the effect of another variable). Following Antweiler et al. (Reference Antweiler, Copeland and Taylor2001), we show that the interaction between trade and capital intensity is also of crucial importance for explaining the evolution of SO₂ concentrations across countries and time.

The number of regressors that are robustly related to pollution in the BMA approach, as well as in the best model identified by BMA, is only a fraction of the 17 possible candidate regressors motivated by reduced-form approaches. Compared to our selection of theory-based specifications, BMA finds as few as a third of the 18 regressors suggested by the most comprehensive theory-based specification. Nevertheless, the best model suggested by BMA has an adjusted R ² three times greater than the preferred theory-based specification of Antweiler et al. (Reference Antweiler, Copeland and Taylor2001).Footnote ² This provides evidence that such a complex theory may not be necessary and alternative theories, such as the Green Solow model (see Brock and Taylor, Reference Brock, Taylor, Aghion and Durlauf2005), should not be discarded simply because they rely on only a fraction of the regressors.

2. Searching for an EKC in SO₂ concentrations

2.1 Data considerations

One prominent EKC relationship in the literature relates air quality to economic development.Footnote ³ In this paper we focus on sulphur dioxide (SO₂) concentrations obtained from the Global Environmental Monitoring System (GEMS). The data are updated, error-corrected, and maintained by the EPA in its Aerometric Information Retrieval System (AIRS).Footnote ⁴ The GEMS/AIRS data are perhaps the most widely used dataset to investigate the EKC, with reported SO₂ concentrations from stations in up to 44 countries from 1971 to 2006.Footnote ⁵

Our income measure is real GDP per capita in constant 1996 dollars from the Penn World Tables 6.1 (Heston et al., Reference Heston, Summers and Aten2002). In our estimation of the Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) approach we use their income measure (GNP). There are several reasons to use concentrations data, although emissions data are also widely available. First, ground-level sulphur dioxide concentrations are the relevant criterion for the direct environmental/health impact. Second, cross-country emissions data are generated by emissions models and not based on actual SO₂ measurements. Cross-country emissions data are generated using strict input–output coefficients based on energy and manufacturing models.Footnote ⁶ In that sense these data perfectly track the production characteristics of an economy, but not necessarily the actual factors that affect environmental quality. Third, the majority of SO₂ EKC papers use concentrations and we seek to be comparable with our results.

The SO₂ concentrations data are, however, highly unbalanced in two dimensions: location and time. Few countries report data over the entire time period, and many countries report pollution concentrations for less than a decade. Often several years of data are missing between observations not only on the station level, but also on the city and country levels. Even in countries with extensive locational coverage, such as the United States, the time series for each monitoring station is highly unbalanced. When heavily oversampled countries have lower average pollution, or have added new monitoring stations with lower pollution over time, it is important to examine the robustness of the results at different levels of data aggregation.Footnote ⁷ Juxtaposing different levels of aggregation as well as reduced-form and structural results also provides a unique opportunity to examine the robustness of regressors across all specifications.

A significant number of papers in the literature have documented an EKC (or its absence) without explicitly discussing the fact that the dataset is so extremely unbalanced. The data are unbalanced in terms of location since a few countries are represented with a large number of reporting stations, while many other nations are featured only once. A full 38 per cent of the original 2,555 station-level observations originate in the US and Canada. The imbalance is exacerbated early and late in the sample as the US supplies 69 per cent of the data before 1974 and after 1993. Therefore, we restrict our analysis to 1974–1993, which reduces the dataset by 219 observations (almost exclusively from the US). Figure 1 provides a breakdown of the 2,168 observations by country of origin.

Figure 1. Distribution of SO₂/observations in GEMS/AIRS data 1974–1993

Notes: Obs < 20 refers to a number of countries that feature less than 20 observations (Greece, Italy, Thailand, Venezuela, Malaysia, Indonesia, South Korea, Peru, Pakistan). Obs 21–30 refers to countries with between 21 to 30 observations (Argentina, Finland, Sweden, Chile, France, Denmark).

Source: US–EPA maintained GEMS/AIRS dataset http://www.epa.gov/airs/aexec.html

The construction of the appropriate station-level covariates is also problematic. None of the covariates suggested by the literature actually speaks to specific characteristics at the station level. At best one can correlate city-level characteristics (such as temperature and precipitation variation) with observed concentrations. However, even the city-level data are still highly unbalanced, and it is unclear whether results are driven by information in the data or by oversampling and missing information across time. In that sense, aggregating to the country level is our preferred approach; it suffers, however, from the disadvantage that few of the controls actually speak to the characteristics of the area in which the concentrations are measured. For these reasons we test for an EKC at the station, city, and country levels.

2.2 The EKC in the raw income pollution data

The first surprise for researchers using the newest version of GEMS, which has been purged of errors and extended to include updated data, is that it no longer provides evidence for the fundamental EKC relationship. Figure 2 plots the raw data for every station in every year that an observation is recorded. In addition, the figure traces the predicted values from the most fundamental regression that includes only log median SO₂ concentrations as the dependent variable and real GDP per capita as a third-order polynomial.Footnote ⁸ In Grossman and Krueger (Reference Grossman and Krueger1995), a similar plot using earlier data from the same source was prominently inverted-U shaped.

Figure 2. Relationship between median SO₂ concentrations and income (by measuring station 1974–1993)

Note: Fitted values are for the fixed-effects regression $Log{\rm}SO_{2{\it it}} = \alpha _i + \beta {\it GDP}_{\it it} + \delta {\it GDP}_{\it it}^2 + \gamma {\it GDP}_{\it it}^3 + \varepsilon _{it}.$

Source: US–EPA maintained GEMS/AIRS dataset http://www.epa.gov/airs/aexec.html

Instead of an EKC, the updated GEMS data in figure 2 show a simple relationship between development and environmental quality that has SO₂ concentrations gradually declining with income. The lack of an EKC in the raw SO₂ data has previously been noticed (on the country level) by astute researchers who suggested that the global data mask country-level phenomena. Deacon and Norman (Reference Deacon and Norman2006) provide evidence for 23 countries that the country-level experience may in fact look very different from the original global station-level data. Since technology, factor abundance and the political response to interest groups are also national concepts, we aggregate the data in search of an EKC at the individual country level. Plotting country-level SO₂ concentrations over time confirms Deacon and Norman's (Reference Deacon and Norman2006) result that most countries' SO₂ concentrations do not follow an EKC path.Footnote ⁹

The lack of an EKC at the station or individual country level might also be an artifact of the extremely unbalanced time dimensions of the dataset. To balance the sample intertemporally, we follow Selden and Song (Reference Selden and Song1994) and take five-year averages.Footnote ¹⁰ In the averaged dataset, the US prominence is reduced to 24 per cent of the observations at the station level.Footnote ¹¹ Therefore, averaging helps address our oversampling concerns, and in the country-level data the entire locational imbalance that leads to oversampling concerns is eliminated. Averaging across time and aggregating by country does not resolve the mystery of the missing EKC in the raw data, however. Plotting station-level data and predicted values obtained by the same method as in figure 2, the country-level data in figure 3 maintain the negative relationship between pollution and income.

Figure 3. Relationship between median SO₂ concentrations and income (five-year averages 1974–1993)

Note: Five-year averages of log median SO₂ concentrations, aggregated from the station to the country level.

Source: US–EPA maintained GEMS/AIRS dataset http://www.epa.gov/airs/aexec.html

3. Model uncertainty in the income/environment relationship

Two simple explanations can address the absence of an EKC in the raw data presented in figures 2 and 3. Either the relationship does not exist, or the model is misspecified. By neglecting to include crucial covariates, the misspecification due to omitted variable bias may overwhelm the power of the GDP regressors. Perhaps in an effort to explore the latter line of reasoning, a number of papers in the literature feature a remarkably diverse range of different model specifications to uncover evidence in favor of an EKC.

Below, we first focus on the most prominent reduced-form approaches that commonly include variables to sharpen the EKC model specification such as international trade, capital intensity, precipitation variation, temperature, population density, investment, education, and institutions. These diverse approaches represent the level of model uncertainty that surrounds the EKC relationship. Standard robustness analysis would juxtapose various models and select on the basis of P-value. As Miller (Reference Miller1984, Reference Miller1990) points out, the difficulty is that a P-value based on a model selected from a larger set of possibilities no longer carries the same interpretation as when only two models are considered (the null and the alternative). Also, several models may seem reasonable given the data but lead to different conclusions. This can happen especially in cases when the dataset is large (for striking examples see Kass and Raftery, Reference Kass and Raftery1995; Raftery, Reference Raftery1996). The Bayesian approach to model selection and accounting for model uncertainty overcomes these difficulties. The next subsection provides a brief overview of BMA and identifies how the procedure addresses EKC model uncertainty.

3.1 Addressing model uncertainty in the income/environment relationship

When inferences are based on one model alone, the ambiguity involved in model selection dilutes information about effect sizes and predictions since ‘part of the evidence is spent to specify the model’ (Leamer, Reference Leamer1978: 91). Model averaging was first operationalized by Leamer (Reference Leamer1983) in so-called ‘extreme bound analysis’ (EBA). EBA has two limitations. First, in the absence of an efficient search, EBA arbitrarily restricts the set of candidate regressors (and hence the model space).Footnote ¹² EBA it is not anchored in foundations of statistical theory and in practical applications it has been shown to be biased towards selecting too few ‘effective’ regressors (see Sala-i-Martin, Reference Sala-i-Martin1997; Sala-i-Martin et al., Reference Sala-i-Martin, Doppelhoffer and Miller2004).

BMA inference is based on an unrestricted search of the model space spanned by all candidate regressors. BMA also requires that each model is weighed according to its quality. This quality weight is given by the posterior model probability, which is interpreted as the probability that any given model is the true model. Extreme bound analysis weighs models equally and thus attributes equal power of inference to exceptionally weak or strong models. Sala-i-Martin (Reference Sala-i-Martin1997) does introduce an ad hoc weighting scheme to BMA; his results highlight the sensitivity of EBA to the weights, and therefore the need to derive such weights using actual statistical theory. Hjort and Claeskens (Reference Hjort and Claeskens2003) point out that for good reasons BMA ‘dominates the literature on accounting for model uncertainty in statistical inference’. Raftery and Zheng (Reference Raftery and Zheng2003) summarize the main theoretical results proving that BMA: (a) minimizes the total error rate (the sum of type I and type II error probabilities); (b) produces point estimates and predictions that minimize mean squared error (MSE); and (c) yields predictive distributions that have optimal predictive performance relative to other approaches. The authors also outline the differences between Bayesian model averaging and frequentist model averaging, as well as the conceptual problems involved in frequentist model averaging.

It is therefore not surprising that averaging over all models can be analytically proven to provide better average predictive performance than any given regression, any single selected model (using selection procedures such as stepwise regression), or any subset of models (Madigan and Raftery, Reference Madigan and Raftery1994). Eicher et al. (Reference Eicher, Papageorgiou and Raftery2007a) provide concrete examples of this phenomenon using growth and simulated data and show not only that BMA attains the theory-predicted superior inference, but also that the quality of models discovered by alternative methods, such as the ‘general-to-specific’ (GETS) procedure (suggested by Hendry and Krolzig, Reference Hendry and Krolzig2001) is far inferior to BMA's.

The basic model averaging idea originated with Jeffreys (Reference Jeffreys1961) and Leamer (Reference Leamer1978), whose insights were developed and operationalized by Draper (Reference Draper1995) and Raftery (Reference Raftery1995). BMA was first introduced to economics by Fernandez et al. (Reference Fernández, Ley and Steel2001), with an application to economic growth. Here we restrict ourselves to sketching the basic BMA structure before we discuss the results (for an extensive discussion of BMA see Hoeting et al., Reference Hoeting, Madigan, Raftery and Volinsky1999).

The basic variable selection setup can be concisely summarized as follows. Given a dependent variable, Y (SO₂ concentrations), a number of observations, n, and a set of candidate regressors, X ₁, X ₂,. . ., X _k, the variable selection problem is to find the ‘best’ model

(1)

$\begin{eqnarray} Y = \alpha + \sum\nolimits_{j = 1}^p {\beta _j X_j + \varepsilon}, \end{eqnarray}$

where X ₁, X ₂, . . ., X _p is a subset of X ₁, X ₂, . . ., X _k, and β_j is a vector of regression coefficients to be estimated. Let M = {M ₁, . . ., M _k denote the set of all models considered, and let θ_k ~ (β_k, σ²) be a vector of parameters in M_k. The likelihood function of model M_k, pr (D|θ_k, M _k), given the data, D, then summarizes all information about θ_k that is provided by the data.

For any likelihood function consisting of two or more parameters, we can define the integrated likelihood as the probability of the data given model M_k. The integrated likelihood of model M_k, pr(D|M _k), is the likelihood function times the prior density, pr(θ_k|M _k), integrated over the parametersFootnote ¹³

(2)

$\begin{eqnarray} pr\left({D|M_{k}} \right) = \int {pr\left({D|\theta _k, M_{k}} \right)pr\left({\theta _k |M_{k}} \right)}\, d\theta _k. \end{eqnarray}$

The integrated likelihood is the crucial ingredient in deriving the appropriate model weight used in the model averaging process. Given the prior probability that M_k is the true model, pr(M_k), the posterior probability of a model, pr(M _k), is defined as the model's share in the total posterior mass

(3)

$\begin{eqnarray} pr\left({M_{k} |D} \right) = \frac{{pr\left({D|M_k} \right)pr\left({M_k} \right)}}{{\sum\nolimits_{l = 1}^K {pr\left({D|M_l} \right)pr\left({M_l} \right)}}}. \end{eqnarray}$

Equation (3) thus represents the individual model's weight in the averaging process. Posterior model probabilities are also the weights used to establish posterior means and variances, which have been derived by Raftery (Reference Raftery, Bollen and Long1993)

(4)

$\begin{eqnarray} E\left[ {\beta |D} \right] = \sum\nolimits_{k = 0}^K {\,\hat \beta _k pr\left({M_k |D} \right)}, \end{eqnarray}$

(5)

$\begin{eqnarray}{\it Var}\left[ {\beta |D} \right] = \sum\nolimits_{k = 0}^K {\left({Var\left[ {\beta |D,M_k} \right] + \hat \beta _k^2} \right)pr\left({M_k |D} \right)} - E\left[ {\beta |D} \right]^2, \end{eqnarray}$

where $\hat \beta _k$ is the OLS estimate for M_k, $\hat \beta _k = E\left({\beta |D,M_k} \right)$ . Hoeting (Reference Hoeting1994) derives the full expression for the definitive posterior distribution. Hence, the posterior means and variances are simply the first and second moments of each individual model, weighted by the model's ‘quality’, as given by its posterior probability. BMA thus incorporates model uncertainty into the posterior distribution such that the variance of the weighted model average is greater than the variance for any single model as long as there is disagreement across models. Intuitively, the different models are used to describe different parts of the data, rather than to pretend that a single model can describe all the data. An individual model does not account for the uncertainty about the model actually being the true model, and hence parameter estimates' variances overstate the confidence in the estimate.Footnote ¹⁴

In addition to the posterior means and standard deviations, BMA provides the posterior inclusion probability of a candidate regressor, $pr\left({\beta _k \ne 0|D} \right)$ , by summing the posterior model probabilities across those models that include the regressor. Posterior inclusion probabilities provide a probability statement regarding the importance of a regressor that directly addresses researchers' prime concern: what is the probability that the coefficient has a non-zero effect on the dependent variable.Footnote ¹⁵

4. Motivating EKC candidate regressors

Before we can employ BMA, each candidate regressor must be motivated to justify its inclusion alongside GDP measures, since each regressor can only be included if it corresponds to a well-established theory or line of research. As mentioned in the introduction, a regressor may be motivated by several theories. Numerous covariates have been introduced in the past to explain sulphur dioxide concentrations in reduced-form specifications. These regressors can be grouped into five different categories: (1) site-specific controls, (2) political economy proxies, (3) production structure, (4) trade measures, and (5) technology proxies. Note that references to our selection of theory-based approaches are limited in this paper to single equation specifications. We ruled out theory-based approaches based on energy demand decomposition that underlie the emissions literature (see Ang, Reference Ang and van den Bergh1999, and Hoekstra and Van den Bergh, Reference Hoekstra and Van den Bergh2002 for reasons explained in our discussion on emissions data above). However, the distinctions between separate effects suggested by theory (e.g., scale, technology, trade) can also be explored using a system of equations that explores the dynamics interaction and transition of the variables (see, e.g., Stern, Reference Stern2005; Constantini and Martini, Reference Constantini and Martini2007). The limitation of BMA is the single equation approach. Reduced-form approaches constitute the vast majority of EKC papers (Stern, Reference Stern2004); they cannot, however, identify the true effect of regressors, be it direct or indirect. This requires fully specified models.

Since concentrations are reported at the station level, a compelling argument can be made that any analysis of the income–pollution relationship must include regressors that control for site-specific factors (e.g., temperature and precipitation variation). Such regional differences affect nature's ability to cleanse SO₂ from the atmosphere. While variables such as temperature and precipitation variation are unlikely to be correlated with our economic variables, their inclusion is standard in the literature and meant to improve the accuracy of the estimates. Our site-specific controls for temperature and rainfall are obtained from Antweiler et al. (Reference Antweiler, Copeland and Taylor2001).Footnote ¹⁶

Aside from station characteristics, we must also control for effects that are common-to-world but nevertheless time varying. Such components reflect secular changes in global awareness of environmental problems, innovations, diffusion of technology and the evolution of world prices. We follow the standard practice in the literature and assume that these common components are captured by a linear time trend. In addition, we add a dummy for nations that signed the 1985 Helsinki Protocol, which aimed to reduce sulphur dioxide emissions by at least 30 per cent.

Income alone does not create direct pressure to improve environmental outcomes, but the democratic fabric of a society that allows political participation and threatens consequences for polluting dictators has been found to be an important determinant. The past literature introduced variables that indicate when more open and democratic societies have different attitudes towards the environment. The conjecture is that for a given level of income, more open societies experience less pollution.Footnote ¹⁷ Torras and Boyce (Reference Torras and Boyce1998) posit that richer individuals gain ‘power’ to demand better overall environmental quality. Likewise, Barrett and Graddy (Reference Barrett and Graddy2000) propose that wealthier citizens demand an increase in the non-material aspects of their standard of living. The degree to which policy responds to such desires is closely linked to the ability of individuals to assemble, organize and voice their concerns. In the same vain, Panatayotou (Reference Panayotou1997) provides evidence that strong property rights ‘flatten’ the EKC by generating less pollution for any given income level. Some authors employ the Freedom House indices to measure political rights (e.g., Shafik and Bandyopadhyay, Reference Shafik and Bandyopadhyay1992; Torras and Boyce, Reference Torras and Boyce1998; Barrett and Graddy, Reference Barrett and Graddy2000), while others use Knack and Keefer's (Reference Knack and Keefer1995) ‘Respect/Enforcement of Contracts’ (Panayotou, Reference Panayotou1997). Harbaugh et al. (Reference Harbaugh, Levinson and Wilson2002) use an index of democratization from the Jaggers and Gurr (Reference Jaggers and Gurr1995) Polity III dataset. Alternatively, Leitão (Reference Leitão2006) introduces measures of corruption.

The institutions and growth literature has since established the Polity IV ‘Constraint on Executive’ (Marshall and Jaggers, Reference Marshall and Jaggers2003) as the best measure to capture the above-mentioned effects. Acemoglu et al. (Reference Acemoglu, Johnson and Robinson2001) have shown convincingly that the degree of constraint on the executive is a fundamental determinant of all political rights. We thus choose this measure as our political rights proxy. Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) include a site-specific dummy for Communist regimes, which is interacted with per capita income as a political proxy. We leave it to BMA to identify whether executive constraints or the Communist regimes dummy proxies most effectively for political economy effects.

Since a key hypothesis is that political pressure builds as richer agents demand greater environmental quality, education is also seen as a major factor in the pollution/development relationship. Torras and Boyce (Reference Torras and Boyce1998) include adult literacy rates, noting that literacy allows for greater informational access and a more even distribution of power within society. Our measure of education is years of education from Barro and Lee (Reference Barro and Lee2000). Years of education should be a better proxy for access to information since basic literacy implies only knowledge of rudimentary reading and writing skills. We use average years of education over the prior three years to account for the fact that it takes some time to translate educational achievement into environmental activism.Footnote ¹⁸

International trade has also been associated with the EKC relationship. Arrow et al. (Reference Arrow, Bolin, Costanza, Dasgupta, Folke, Holling, Jansson, Levin, Mäler, Perrings and Pimentel1995) and Stern et al. (Reference Stern, Common and Barbier1996) mention that an EKC might be partly due to trade and the resulting global distribution of polluting industries. The authors hypothesize that free trade allows developing countries to specialize in goods that are intensive in their relatively abundant factors: labor and natural resources. Developed countries, in turn, are likely to specialize in human capital and capital intensive goods. In contrast, Shafik and Bandyopadhyay (Reference Shafik and Bandyopadhyay1992) point out that trade might exert two contrasting influences on developing countries. Following Antweiler et al. (Reference Antweiler, Copeland and Taylor2001), we use trade volume (exports plus imports) as a per cent of GDP as our measure of openness to trade. Aside from the above-mentioned trade effect, increased openness may lead to increased competition, which could cause more investment in efficient and cleaner technologies to meet the environmental standards of developed nations. More directly, investment can be motivated with reference to embodied technology, where cleaner technologies are embodied in more recent vintages of capital. To control for such an effect, we follow Harbaugh et al. (Reference Harbaugh, Levinson and Wilson2002) and include not only trade, but also a measure of investment in our analysis. Alternatively, trade-induced dynamic comparative advantage has also been tied to the composition of output that is associated with different stages of development. We use the human-capital-adjusted capital intensity proxy from Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) to account for such effects.

Another important covariate often included in the literature is population density (Grossman and Krueger, Reference Grossman and Krueger1995; Panayotou, Reference Panayotou1997; Barrett and Graddy, Reference Barrett and Graddy2000; Antweiler et al., Reference Antweiler, Copeland and Taylor2001; Harbaugh et al., Reference Harbaugh, Levinson and Wilson2002). Panayotou argues that population density may have an ambiguous effect since more dense areas can expect greater use of coal and non-commercial fuels, but densely populated countries may also be more concerned about lowering pollution concentrations. We follow Harbaugh and include national population density in order to have a relatively accurate time-series measure of population density for both developed and developing countries.

Second-generation EKC models include variables motivated by fully specified models that yield precise, testable EKC implications and relationships. The essential features of EKC models include determinants of scale, composition, and technique effects outlined by Panayotou (Reference Panayotou1997). Prominent theoretical precursors that have led to the state of the art, fully specified, open-economy EKC model in Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) are Stokey (Reference Stokey1998) (endogenous abatement), Bovenberg and Smulders (Reference Bovenberg and Smulders1995) and Aghion and Howitt (Reference Aghion and Howitt1998) (endogenous growth/technique), and Jones and Manuelli (Reference Jones and Manuelli2001) (endogenous policy). Development causes a positive scale effect since increased output per unit of capital leads to increased pollution. To account for the scale effect we follow Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) and employ a measure of city-level economic intensity (national GDP per capita times city population density). It is generally held that in rapidly growing middle-income countries pollution due to the scale effect might be the dominant EKC force (Perman and Stern, Reference Perman and Stern2003).

The technique effect diminishes the scale effect as technological progress permits a lowering of emissions per unit of output, which presumably would also impact SO₂ concentrations. Lagged per capita income is used to proxy for the technique effect since countries with higher incomes in the past should be able to afford better technology today (see Antweiler et al., Reference Antweiler, Copeland and Taylor2001). Diffusion of technology itself motivates the idea that time-related effects reduce environmental impacts in countries at all levels of development (Aghion and Howitt, Reference Aghion and Howitt1998; Perman and Stern, Reference Perman and Stern2003).Footnote ¹⁹ These effects are usually proxied with a year dummy.

To isolate either the scale or technique effect, we must control for changes in the composition of output. A change in output composition can mitigate the scale effect further if the share of less pollution-intensive industries rises as income increases. This occurs when development and human capital accumulation generate shifts toward cleaner industries (services or information technology) so that the ensuing change in the composition of output reduces environmental degradation (Panayotou, Reference Panayotou1993). A specific model was first presented by Copeland and Taylor (Reference Copeland and Taylor2003), who showed that the reliance on capital accumulation in early stages of development, as opposed to human capital accumulation in later stages, can generate an EKC. Following Antweiler et al. (Reference Antweiler, Copeland and Taylor2001), we capture the composition effect by controlling for differences in the human-capital-adjusted capital–labor ratio. In the absence of such controls, the relationship between pollution and income is a mixture of scale, composition, and technique effects, which is hard to interpret.

In addition to a simple income/output-induced composition effect, Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) also account for a trade-induced composition effect. While the reduced-form literature takes the effect of trade as ambiguous, Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) stipulate that a trade-induced composition effect depends on a country's comparative advantage, which in turn is determined by income per capita and capital abundance. Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) suggest interacting the trade measure with determinants of comparative advantage (the capital–labor ratio and income per capita). Since comparative advantage is a relative concept, these variables are measured relative to their corresponding world averages. Since theory cannot identify a turning point (the endowment levels where trade causes a switch from exporting to importing pollution-intensive products), we adopt the flexible approach of Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) by estimating interactions with different functional forms.

Note that theory does not necessarily imply such an elaborate structure. Brock and Taylor (Reference Brock, Taylor, Aghion and Durlauf2005) point out that the EKC is compatible with many different theories. The simplest of all is perhaps the ‘Green Solow model’ where pollution policy remains unchanged throughout the development process and where transitional dynamics alone suffice to generate an EKC. The Green Solow model exhibits no composition effects, no changes in pollution abatement, no evolution of the political process, and no international trade. BMA is a natural statistical tool to examine the support that competing theories receive from the data, and to address the model uncertainty in the literature.

5. Empirical results

Tables 1–3 report the reduced-form results, while table 4 reports the results for our candidate regressors that were motivated by our selection of theory-based approaches. The main results are robust to specifications of GDP in logs, different GDP lag structures (zero-, three-, and ten-year lags), alternative ‘U-curve’ specifications such as Anand and Kanbur's (Reference Anand and Kanbur1993) specifications based on inverse GDP, and specifications based on concentrations per capita. The tables report results at the station, city, and country levels.

Table 1. Reduced-form BMA results (by station)

Notes: P ≠ 0 is the posterior inclusion probability that a regressor's posterior mean is different from zero. *, **, ***, indicate 90, 95, 99 per cent confidence levels.

Table 2. Reduced-form BMA results (by city)

Notes: P ≠ 0 is the posterior inclusion probability that a regressor's posterior mean is different from zero. *, **, ***, indicate 90, 95, 99 per cent confidence levels.

Table 3. Reduced-form BMA results (by country)

Notes: P ≠ 0 is the posterior inclusion probability that a regressor's posterior mean is different from zero. *, **, ***, indicate 90, 95, 99 per cent confidence levels.

Table 4. Structural BMA results

Notes: P ≠ 0 is the posterior inclusion probability that a regressor's posterior mean is different from zero. *, **, ***, indicate 95, 99, 99.9 per cent confidence levels. Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) do not report standard errors.

The first columns of tables 1–3 report the posterior inclusion probability (the probability that the coefficient estimate is different from zero). P ≠ 0 is thus a measure of confidence that a regressor enters with a non-zero coefficient into the true regression model. The posterior inclusion probability is a scale-free probability measure of the relative importance of variables; it can therefore be transparently applied to inform policy decisions, in addition to the posterior mean and standard deviation. Jeffreys (Reference Jeffreys1961) and Raftery (Reference Raftery1995) add the interpretational refinement that P ≠ 0 > 50 per cent indicates that the data provide weak evidence that a regressor is included in the true model; P ≠ 0 > 75 per cent implies positive evidence; P ≠ 0 > 95 per cent provides strong evidence; and P ≠ 0 > 99 per cent gives very strong evidence. Inclusion probabilities close to 100 per cent signal that a particular regressor is included in almost all good models so that it contributes prominently to explaining the dependent variable, even in the presence of significant model uncertainty.

We find only limited support for income as a key driver of SO₂ concentrations. Only the highly unbalanced station- and city-level datasets in tables 1 and 2 report positive evidence of an EKC relationship between income and SO₂ concentrations. At the station level, lagged GDP has a much higher inclusion probability than current GDP, implying that contemporaneous economic activity is less important in determining SO₂ concentrations than the indirect effects of rising income over time. Nevertheless, fundamental variables, not income, are the crucial determinants of pollution levels. Precipitation variation and executive constraints both exhibit 100 per cent inclusion probabilities, while the income polynomials range around 80 per cent. We find that less variation in precipitation, increased temperature and greater executive constraints reduce SO₂ concentrations. The only economic variable that registers as significant in the reduced-form station-level results is trade intensity. Here the evidence is decisive that trade reduces pollution.

The best single regression model selected by BMA at the station level has an adjusted R ² of 0.240, and contains seven variables that exhibit at least weak evidence in terms of inclusion probabilities. The city-level results in table 2 are nearly identical to those at the station level, except that the previously weakly significant temperature variable is no longer relevant. Although the best model at the city level is based on fewer regressors and observations (both would lead one to expect a worse fit), its adjusted R ² increases to 0.303. The improvement in explanatory power may result from the fact that the aggregated dataset is less prone to oversampling.

A major change in the results occurs when we aggregate the data to the country level. Table 3 no longer provides evidence that income has an influence on pollution. Nevertheless, all other variables that have been shown to be robustly related to pollution remain strongly significant and their posterior means are surprisingly stable. Executive constraints, trade, and local weather variations are central to explaining the country-level pollution variability. Interestingly, at the country level, education and technology (proxied by the year variable) now have high inclusion probabilities, providing strong evidence that these candidate regressors belong in the true model. As we aggregate from the station to the city and finally to the country level, the adjusted R ² of the best model systematically increases (although the number of observations drops from 623 to 109). While the adjusted R ² is only 0.240 for the best model with station-level data, it nearly doubles to 0.465 at the country level.

The results for the regressors motivated by the theory-based approach of Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) are presented in table 4. Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) use station-level results as a benchmark, since their specification is the most extensive, theory-based empirical implementation of the EKC hypothesis. Alternative levels of aggregation (and hence different degrees of oversampling) generate remarkably stable outcomes in terms of posterior inclusion probabilities and posterior means. BMA identifies between six (station- and city-level) and eight (country-level) candidate regressors with weak to decisive evidence of a non-zero impact on pollution. Common across all levels of aggregation is that there exists no evidence of an EKC. Instead, the prominence of site-specific and political economy variables carries over from our structural results at all levels of aggregation. There is strong and decisive evidence that non-economic factors, such as temperature, precipitation variation, and executive constraints, affect SO₂ concentrations in the same fashion as in the reduced-form BMA tables 1–3.

Of all the variables that receive positive evidence in table 4 at the station level, only executive constraints does not appear in the theory-specified Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) model (they include the ‘Communist’ variable to capture political economy effects). Executive constraints thus remains highly significant not only in the reduced-form, but also in the structural analysis. This provides strong support for the Jones and Manuelli (Reference Jones and Manuelli2001) approach to pollution that emphasizes the political process, not income, as the driving force in the development/pollution relationship.

BMA produces a number of surprises. The major difference between the reduced-form specifications and the Antweiler theory-based results is that the trade intensity effect is lost entirely. While it does register as significant in Antweiler et al. (Reference Antweiler, Copeland and Taylor2001), BMA provides no evidence that trade intensity alone carries any explanatory power at the station, city, or country level. Nevertheless, the BMA results do suggest that trade plays an important indirect role in determining pollution since it is revealed that trade moderates the composition effect. Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) find that trade dampens the pure EKC effect as the trade/income interactions in their regression are highly significant. In the BMA approach, in contrast, this income effect is found only at the station and city levels, and only in non-linear form. In the less unbalanced country-level dataset, BMA indicates that trade's main role is to moderate the composition effect. The interaction between trade and capital intensity shows that the composition effect has a different impact on countries depending on their level of development. The greater the level of development – as proxied by the human-capital-augmented capital–labor ratio – the lower the implied concentrations for open economies.

The second trade-related variable that receives support in BMA at all levels of aggregation is the interaction between trade, income, and capital intensity. The positive estimate throughout provides strong evidence that more human/physical-capital-intensive countries have higher sulphur dioxide concentrations, even after we control for trade and income effects. This is because the three-way interaction between trade, income, and the human-capital-adjusted capital–labor ratio has a positive posterior mean. The relatively large role of the composition effect and the trade-based interactions suggests that countries do not follow a deterministic income–pollution path.

BMA does not uncover strong evidence for a pure composition effect, since capital intensity alone cannot be shown to affect pollution (in contrast to Antweiler et al., Reference Antweiler, Copeland and Taylor2001). In addition, since city-GDP/km² is not significant at any level of aggregation, BMA provides no evidence for a scale effect (the scale effect is only mildly significant in Antweiler et al.'s Reference Antweiler, Copeland and Taylor2001 work). Oversampling does influence the strength of the technique effect (proxied by year), as BMA provides evidence that a technique effect reduces pollution at the country level. A similar pattern is observed in our reduced-form analysis, where the same variable gains explanatory power only at higher levels of aggregation. These findings are in line with Stern (Reference Stern2002) who finds evidence for the important role of negative time effects in explaining declining SO₂ concentrations.

Perhaps the most important result is that the best model chosen by BMA contains less than half of the 23 candidate regressors that have been motivated by the literature. At the station level, seven significant regressors account for about one-and-a-half times more variation in the dependent variable (adjusted R ² = 0.235) than the 18 regressors (12 significant) suggested by the Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) specification (adjusted R ² = 0.144). This suggests that a number of regressors identified by Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) may be significant only because the empirical strategy did not account for model uncertainty. The BMA estimates at different levels of aggregation are surprisingly stable, however, and their adjusted R ² increases steadily from the station to the city to the country level, from 0.235 to 0.301 to 0.479, respectively (although the sample size declines). Also, the relevant regressors at the city and station levels are just about identical, although the country-level results do feature two additional regressors to explain pollution (year and education). The coefficient on education is counterintuitive just as in the reduced-form BMA results. It is supposed to proxy for the hypothesis that better-educated citizens demand better environmental quality. However, measures of education have been shown to be fragile in both growth regressions and in development accounting (see Krueger and Lindahl, Reference Krueger and Lindahl2001). Perhaps the same issues contaminate the effect of the regressors here.

6. Conclusion

This paper reexamines the evidence for an environmental Kuznets curve using the updated GEMS/AIRS data on SO₂ concentrations. The literature on the income–pollution relationship is characterized by unusual model uncertainty as both the number of proposed theories and the range of possible candidate regressors is large. We apply a theoretically founded method to address model uncertainty. Bayesian model averaging examines all models, weighs them by their relative quality, and then generates the probability that a candidate regressor is related to the dependent variable.

Our results are presented at three levels of aggregation. The station-level results are subject to severe oversampling as pollution from thousands of observations from local stations are linked to one and the same measure of income in a country. Hence we also aggregate the data to the city and country level. The results are remarkably robust. Political economy and site-specific variables explain a large share of the observed pollution. International trade is also shown to be robustly related to pollution. In our reduced-form analysis, trade is found to lower pollution. When the model is specified using full-fledged theories (Antweiler et al., Reference Antweiler, Copeland and Taylor2001), we show that trade has no direct effect, but that it moderates the composition effect. We provide evidence that as countries become richer and increase their physical and human capital, trade leads to cleaner environments. It unfortunately also implies that poor, labor-intensive, open economies experience increasing pollution levels.

Overall, we find only weak evidence for an EKC, which disappears when we address oversampling of the data or move to a fully specified theory-based approach. There may be several reasons the EKC fails to hold up in our work. The foremost, perhaps, is that many countries in the GEMS/AIRS data may already be on the flat or downward-sloping portion of the EKC during the sample period. Smulders et al. (Reference Smulders, Bretschger and Egli2005) label these portions of the EKC the ‘alarm phase’ and the ‘cleaning-up phase’ that indicate a government response to public concerns. Given that the reduction in sulphur dioxide concentrations may also be based on governments reacting to their citizens' demands, it is not surprising that we find that policy variables such as executive constraints play a crucial role in determining pollution levels.

Appendix

Table A-1. Summary statistics

Note: The ‘t–3’ subscript refers to an average of the past three years' data.

Table A-2. Correlations

Footnotes

¹ Theory-based in this context refers to regressors that have been proposed in the past to test these specific theories. Some regressors have been motivated by several alternative theories.

² The comparison of different BMA specifications in this paper may involve not only different regressors, but also different samples sizes. Adjusted R-squared compensates for both by expressing the explained variation after accounting for differences in sample sizes and the number of regressors. More complex measures for model comparison are available that balance the opposing pressures of goodness of fit and complexity (see Burnham and Anderson (Reference Burnham and Anderson2002) for an exhaustive discussion). One might be suspicious that sample size was driving our results if models with larger samples sizes performed better. The opposite is the case here.

³ Alternative measures have been used. Evidence for an EKC has been found, for example, for water quality (Grossman and Krueger, Reference Grossman and Krueger1995), deforestation (Cropper and Griffith, Reference Cropper and Griffith1994; Panayotou, Reference Panayotou, Ahmed and Doeleman1995), and water withdrawal for agriculture (Rock, Reference Rock1998; Goklany, Reference Goklany2002). Some researchers have found an EKC for carbon dioxide (Roberts and Grimes, Reference Roberts and Grimes1997) though others have found that CO₂ increases monotonically with income (Shafik and Bandyopadhyay, Reference Shafik and Bandyopadhyay1992).

⁴ Our raw GEMS/AIRS data are identical to that of Antweiler et al. (Reference Antweiler, Copeland and Taylor2001), who kindly shared their data that include median concentrations.

⁵ See, for example, Grossman and Krueger (Reference Grossman and Krueger1995), Panayotou (Reference Panayotou1997), Torras and Boyce (Reference Torras and Boyce1998), Barrett and Graddy (Reference Barrett and Graddy2000), Harbaugh et al. (Reference Harbaugh, Levinson and Wilson2002), and Deacon and Norman (Reference Deacon and Norman2006). De Groot et al. (Reference De Groot, Minliang and Withagen2004) use concentrations/GDP.

⁶ The data generate ‘estimates’ of SO₂ emissions based on national statistics and the use of sulphur bearing fuels and metals, weighted by their sulphur content.

⁷ The alternative approach to oversampling would be to use data weighting schemes that rebalance the data so that, for example, data from heavily oversampled countries receive a lower weight. Such schemes require knowledge of the true sampling distribution, however, and knowledge of the determinants and structure of the selection probabilities for each station (e.g., why certain country's states were so much more likely to be sampled).

⁸ We employ fixed-effects regressions throughout. It has been argued that the random-effects EKC cannot be estimated consistently (Mundlak, Reference Mundlak1978; Hsiao, Reference Hsiao1986; Stern, Reference Stern2004). Since the very premise of the EKC is that specific local, regional, or national characteristics are crucial, the random-effects approach suffers from inconsistency due to omitted variables. In addition, we have no desire to imply that a possible EKC in our data holds beyond the countries in this sample. Hence, we take the view that represented countries are not simply random draws from a larger EKC country/station population. An additional advantage of the fixed-effects approach is that it controls for many time-invariant, site-specific, and country-specific factors.

⁹ The plot is included in the working paper version of this paper (Begun and Eicher, Reference Begun and Eicher2007).

¹⁰ Averaging data over five-year intervals is also common in the economic growth literature and allows us to address the error associated with business cycle fluctuations that are inherent in income data (see Barro, Reference Barro1990).

¹¹ Countries that lack SO₂ data for at least two five-year periods are excluded. These countries (and their number of observations) are Austria (2), Kenya (4), Switzerland (2), Ghana (3), and the Czech Republic (21). Poland (86 observations), and Iraq (9 observations) are excluded from the reduced-form analysis since we lack PWT 6.1 GDP data for these countries. Hong Kong (40 observations) is excluded because it lacks Polity IV data, and Yugoslavia (63 observations) was dropped by Antweiler et al. (Reference Antweiler, Copeland and Taylor2001) because it lacks human capital data.

¹² Depending on the search algorithm, BMA might also restrict the model size (see Eicher et al., Reference Eicher, Papageorgiou and Raftery2007a). However, the Raftery bicreg algorithm is based on the leaps and bounds algorithm, which does not restrict the model space in our particular application.

¹³ The software used to implement BMA is available at http://www.research.att.com/~volinsky/software/bicreg. It utilizes the unit information prior, which is so diffuse and uncontroversial that it can be derived from frequentist statistics. It is a multivariate normal prior whose mean is centered at the maximum likelihood estimate and whose variance equals the expected information matrix for one observation (Kass and Wasserman, Reference Kass and Wasserman1995).

¹⁴ An additional dimension of model uncertainty relates to the uncertainty regarding the correct econometric approach. BMA cannot resolve this uncertainty. For example, List and Gallett (Reference List and Gallet1999) and Millimet et al. (Reference Millimet, List and Stengos2003) explore non-parametric modeling strategies using emissions rather than concentrations. Martínez-Zarzoso and Bengochea-Morancho (Reference Martinez-Zarzoso and Bengochea-Morancho2004) use a mean group approach that consists of estimating separate regressions for each country and calculating averages of the country-specific coefficients in the context of CO₂ pollution. Dijkgraaf and Vollebergh (Reference Dijkgraaf and Vollebergh2005) also reject an EKC in CO₂ and find homogeneity among countries to be a problematic assumption. Alternative modeling strategies usually compare their results to parametric approaches. Unfortunately, a full-fledged comparison of the parametric BMA results vs. alternative modeling strategies on the basis of predictive performance along the lines of Eicher et al. (Reference Eicher, Papageorgiou and Raftery2007a) is beyond the scope of this paper. Alternatives might be to explore a parametric approach together with parameter heterogeneity. This approach has been examined in the growth context by Eicher et al. (Reference Eicher, Papageorgiou and Roehn2007b), where theory identifies clear groups of heterogeneous countries (e.g. Africa or Asia). We are not aware that the environmental quality literature features such an established set of country clusters to test for slope heterogeneity.

¹⁵ The posterior inclusion probability thus also carries an important interpretation that is more informative than standard P-values. P-values are often interpreted as indicating that a coefficient has a chance of being equal to zero, or that the null hypothesis has a chance of being true. However, P-values only indicate something much more obscure: if the null hypothesis, θ = 0, were true, then the probability of collecting data as extreme as or more extreme than what is observed is equal to the P-value.

¹⁶ Grossman and Krueger (Reference Grossman, Krueger and Garber1993, Reference Fernández, Ley and Steel1995) also include site-geography variables such as proximity to oceans or deserts. Our fixed-effects regressions account for these implicitly.

¹⁷ Of course one could also conjecture that income is simply an indicator for deeper mechanisms that drive environmental quality. Thus, exploring all controls that have been mentioned in the past literature, other than income, is an important robustness analysis. Since BMA explores the entire model space, it is uniquely qualified to identify any combination of such deeper mechanisms, which could reduce and even negate the explanatory power of income – if income is only an indicator. If income remains significant, we either have not discovered the correct mechanism, or income may not be a simple indicator.

¹⁸ Barro and Lee (Reference Barro and Lee2000) is the most commonly used dataset for international comparisons. While often criticized, no better global dataset has been developed. For OECD countries de la Fuente and Doménech (Reference De la Fuente and Doménech2001, Reference De la Fuente and Doménech2006) show that one can increase the signal-to-noise ratio by exploiting additional data sources, and by eliminating breaks in the data that seem to be a function of data collection.

¹⁹ In contrast to simple fixed- and random-effects regressions, where collinearity between GDP and lagged GDP variables might compromise the explanatory power of either variable, BMA averages across relevant models and thus potentially mitigates the effects of collinearity.

Note: The ‘t–3’ subscript refers to an average of the past three years' data.

References

Acemoglu, D., Johnson, S., and Robinson, J.A. (2001), ‘The colonial origins of comparative development: an empirical investigation’, American Economic Review 91: 1369–1401.CrossRef Google Scholar

Aghion, P. and Howitt, P. (1998), ‘Capital accumulation and innovation as complementary factors in long-run growth’, Journal of Economic Growth 3: 111–130.Google Scholar

Anand, S. and Kanbur, S.M.R. (1993), ‘Inequality and development: a critique’, Development Economics 41: 19–43.CrossRef Google Scholar

Ang, B.W. (1999), ‘Decomposition methodology in energy demand and environmental analysis’, in van den Bergh, J.C.J.M. (ed.), Handbook of Environmental and Resource Economics, Cheltenham: Edward Elgar Publishers.Google Scholar

Antweiler, W., Copeland, B.R., and Taylor, M.S. (2001), ‘Is free trade good for the environment?’, American Economic Review 91: 877–908.CrossRef Google Scholar

Arrow, K., Bolin, B., Costanza, R., Dasgupta, P., Folke, C., Holling, C.S., Jansson, B.-O., Levin, S., Mäler, K.-G., Perrings, C., and Pimentel, D. (1995), ‘Economic growth, carrying capacity, and the environment’, Science 268: 520–521.CrossRef Google Scholar PubMed

Barrett, S. and Graddy, K. (2000), ‘Freedom, growth, and the environment’, Environment and Development Economics 5: 433–456.CrossRef Google Scholar

Barro, R.J. (1990), ‘Government spending in a simple model of endogenous growth’, Journal of Political Economy 98: S103–S125.CrossRef Google Scholar

Barro, R.J. and Lee, J.-W. (2000), ‘International data on educational attainment: updates and implications’, CID Working Paper No. 42.Google Scholar

Begun, J. and Eicher, T.S. (2007), ‘In search of a sulphur dioxide environmental Kuznets curve: a Bayesian model averaging approach’, University of Washington Working Paper UWEC 2007–18.Google Scholar

Bovenberg, A.L. and Smulders, S. (1995), ‘Environmental quality and pollution-saving technological change in a two-sector endogenous growth model’, Journal of Public Economics 57: 369–391.CrossRef Google Scholar

Brock, W.A. and Taylor, M.S. (2005), ‘Economic growth and the environment: a review of theory and empirics’, in Aghion, P. and Durlauf, S.N. (eds), Handbook of Economic Growth, Volume 1B, Amsterdam: Elsevier.Google Scholar

Burnham, K.P. and Anderson, D.R. (2002), Model Selection and Multimodel Inference: A Practical–Theoretic Approach, 2nd edn, New York: Springer-Verlag.Google Scholar

Constantini, V. and Martini, C. (2007), ‘A modified environmental Kuznetz curve for sustainable development assessment using panel data’, FEEM Working Paper 148.CrossRef Google Scholar

Copeland, B.R. and Taylor, M.S. (2003), Trade and the Environment: Theory and Evidence, Princeton, NJ: Princeton University Press.CrossRef Google Scholar

Cropper, M. and Griffith, C. (1994), ‘The interaction of population growth and environmental quality’, American Economic Review 84: 250–254.Google Scholar

Dasgupta, S., Laplante, B., Wang, H., and Wheeler, D. (2002), ‘Confronting the environmental Kuznets curve’, Journal of Economic Perspectives 16: 147–168.CrossRef Google Scholar

De Groot, H.L.F., Minliang, Z., and Withagen, C.A. (2004), ‘The dynamics of China's regional development and pollution: an empirical investigation into the existence of an environmental Kuznets curve for China’, Environment and Development Economics 9: 507–537.CrossRef Google Scholar

De la Fuente, A. and Doménech, R. (2001), ‘Schooling data, technological diffusion, and the neoclassical model’, American Economic Review 91: 323–327.CrossRef Google Scholar

De la Fuente, A. and Doménech, R. (2006), ‘Human capital in growth regressions: how much difference does data quality make?’, Journal of the European Economic Association 4: 1–36.CrossRef Google Scholar

Deacon, R.T. and Norman, C.S. (2006), ‘Does the environmental Kuznets curve describe how individual countries behave?’, Land Economics 82: 291–315.CrossRef Google Scholar

Dijkgraaf, E. and Vollebergh, H. (2005), ‘A test for parameter homogeneity in CO₂ panel EKC estimations’, Environment and Resource Economics 32: 229–239.CrossRef Google Scholar

Draper, D. (1995), ‘Assessment and propagation of model uncertainty (with discussion)’, Journal of the Royal Statistical Society, Series B 57: 45–97.Google Scholar

Eicher, T.S., Papageorgiou, C., and Raftery, A.E. (2007)a, ‘Bayesian model averaging in economics’, Working Paper.Google Scholar

Eicher, T.S., Papageorgiou, C., and Roehn, O. (2007)b, ‘Unraveling the fortunes of the fortunate: an iterative Bayesian model averaging (IBMA) approach’, Journal of Macroeconomics 29: 494–514.CrossRef Google Scholar

Farnsworth, M.L., Hoeting, J.A., Hobbs, N.T., and Miller, M.W. (2006), ‘Linking chronic wasting disease to mule deer movement scales: a hierarchical Bayesian approach’, Ecological Applications 16: 1026–1036.CrossRef Google Scholar

Fernández, C., Ley, E., and Steel, M.F.J. (2001), ‘Model uncertainty in cross-country growth regressions’, Journal of Applied Econometrics 16: 563–576.CrossRef Google Scholar

Fernández, C., Ley, E., and Steel, M.F.J. (2002), ‘Bayesian modeling of catch in a north-west Atlantic fishery’, Journal of the Royal Statistical Society: Series C 51: 257–280.Google Scholar

Goklany, I. (2002), ‘Comparing twentieth century trends in US and global agricultural water and land use’, Water International 27: 321–329.CrossRef Google Scholar

Grossman, G.M. and Krueger, A.B. (1993), ‘Environmental impacts of a North American free trade agreement,’ in Garber, P.M. (ed.), The US–Mexico Free Trade Agreement, Cambridge, MA: MIT Press, pp. 13–56.CrossRef Google Scholar

Grossman, G.M. and Krueger, A.B. (1995), ‘Economic growth and the environment’, Quarterly Journal of Economics 110: 353–377.CrossRef Google Scholar

Harbaugh, W.T., Levinson, A., and Wilson, D.M. (2002), ‘Reexamining the empirical evidence for an environmental Kuznets curve’, Review of Economics and Statistics 83: 541–551.CrossRef Google Scholar

Hendry, D.F. and Krolzig, H.-M. (2001), Automatic Econometric Model Selection Using PcGets, London: Timberlake Consultants Press.Google Scholar

Heston, A., Summers, R., and Aten, B. (2002), Penn World Table Version 6.1, Center for International Comparisons at the University of Pennsylvania (CICUP), http://pwt.econ.upenn.edu/php_site/pwt_index.php.Google Scholar

Hjort, N.L. and Claeskens, G. (2003), ‘Frequentist model average estimators’, Journal of the American Statistical Association 98: 879–899.CrossRef Google Scholar

Hoekstra, R. and Van den Bergh, J.C.J.M. (2002), ‘Structural decomposition analysis of physical flows in the economy’, Environmental and Resource Economics 23: 357–378.CrossRef Google Scholar

Hoeting, J.A. (1994), ‘Accounting for model uncertainty in linear regression’, University of Washington Ph.D. dissertation.Google Scholar

Hoeting, J.A., Madigan, D., Raftery, A.E., and Volinsky, C.T. (1999), ‘Bayesian model averaging: a tutorial’, Statistical Science 14: 382–417.Google Scholar

Hsiao, C. (1986), Analysis of Panel Data, Econometric Society Monographs No. 11, Cambridge: Cambridge University Press.Google Scholar

Jaggers, K. and Gurr, T.R. (1995), ‘Polity III: regime change and political authority, 1800–1994’, http://privatewww.essex.ac.uk/~ksg/Polity.html.CrossRef Google Scholar

Jeffreys, H. (1961), Theory of Probability, 3rd edn, Oxford: The Clarendon Press.Google Scholar

Jones, L.E. and Manuelli, R.E. (2001), ‘Endogenous policy choice: the case of pollution and growth’, Review of Economic Dynamics 4: 369–405.CrossRef Google Scholar

Kass, R.E. and Raftery, A.E. (1995), ‘Bayes factors’, Journal of the American Statistical Association 90: 773–795.CrossRef Google Scholar

Kass, R.E. and Wasserman, L. (1995), ‘A reference Bayesian test for nested hypotheses and its relationship to the Schwartz criterion’, Journal of the American Statistical Association 90: 928–934.CrossRef Google Scholar

Knack, S. and Keefer, P. (1995), ‘Institutions and economic performance: cross-country tests using alternative institutional measures’, Economics and Politics 7: 207–227.CrossRef Google Scholar

Koop, G. and Tole, L. (2006), ‘An investigation of thresholds in air pollution–mortality effects’, Environmental Modelling and Software 21: 1662–1673.CrossRef Google Scholar

Krueger, A.B. and Lindahl, M. (2001), ‘Education for growth: why and for whom?’, Journal of Economic Literature 39: 1101–1136.CrossRef Google Scholar

Kuznets, S. (1955), ‘Economic growth and income inequality’, American Economic Review 45: 1–28.Google Scholar

Leamer, E. (1978), Specification Searches, New York: Wiley.Google Scholar

Leamer, E. (1983), ‘Let's take the con out of econometrics’, American Economic Review 73: 31–43.Google Scholar

Leitão, A. (2006), ‘Corruption and the environmental Kuznets curve: empirical evidence for sulfur’, Working Paper, Universidade Católica Portuguesa.Google Scholar

List, J.A. and Gallet, C.A. (1999), ‘The environmental Kuznets curve: does one size fit all?’, Ecological Economics 31: 409–424.CrossRef Google Scholar

Madigan, D. and Raftery, A.E. (1994), ‘Model selection and accounting for model uncertainty in graphical models using Occam's window’, Journal of the American Statistical Association 89: 1535–1546.CrossRef Google Scholar

Marshall, M.G. and Jaggers, K. (2003), ‘Polity IV Project: Political Regime Characteristics and Transitions, 1800–2003’, http://www.cidcm.umd.edu/inscr/polity/.Google Scholar

Martinez-Zarzoso, I. and Bengochea-Morancho, A. (2004), ‘Pooled mean group esti-mation for an environmental Kuznets curve for CO₂’, Economics Letters 82: 121–126.CrossRef Google Scholar

Miller, A.J. (1984), ‘Selection of subsets of regression variables (with discussion)’, Journal of the Royal Statistical Society Series A 147: 389–425.CrossRef Google Scholar

Miller, A.J. (1990), Subset Selection in Regression, New York: Chapman & Hall.CrossRef Google Scholar

Millimet, D.L., List, J.A., and Stengos, T. (2003), ‘The environmental Kuznets curve: real progress or misspecified models?’, Review of Economics and Statistics 85: 1038–1047.CrossRef Google Scholar

Mundlak, Y. (1978), ‘On the pooling of time series and cross section data’, Econometrica 46: 69–85.CrossRef Google Scholar

Panayotou, T. (1993), ‘Empirical tests and policy analysis of environmental degradation at different stages of economic development’, Working Paper WP238, Technology and Employment Programme, International Labor Office, Geneva.Google Scholar

Panayotou, T. (1995), ‘Environmental degradation at different stages of economic development’, in Ahmed, I. and Doeleman, J.A. (eds), Beyond Rio: The Environmental Crisis and Sustainable Livelihoods in the Third World, London: Macmillan, pp. 13–36.CrossRef Google Scholar

Panayotou, T. (1997), ‘Demystifying the environmental Kuznets curve: turning a black box into a policy tool’, Environment and Development Economics 2: 465–484.CrossRef Google Scholar

Perman, R. and Stern, D.I. (2003), ‘Evidence from panel unit root and cointegration tests that the environmental Kuznets curve does not exist’, Australian Journal of Agricultural and Resource Economics 47: 325–347.CrossRef Google Scholar

Raftery, A.E. (1993), ‘Bayesian model selection in structural equation models’, in Bollen, K.A. and Long, J.S. (eds), Testing Structural Equation Models, Beverly Hills: Sage, pp. 163–180.Google Scholar

Raftery, A.E. (1995), ‘Bayesian model selection in social research (with discussion)’, Sociological Methodology 25: 111–196.CrossRef Google Scholar

Raftery, A.E. (1996), ‘Approximate Bayes factors and accounting for model uncertainty in generalized linear models’, Biometrika 83: 251–266.CrossRef Google Scholar

Raftery, A.E. and Zheng, Y. (2003), ‘Long-run performance of Bayesian model averaging’, Journal of the American Statistical Association 98: 931–938.CrossRef Google Scholar

Roberts, J. and Grimes, P. (1997), ‘Carbon intensity and economic development 1962–91: a brief exploration of the environmental Kuznets curve’, World Development 25: 191–198.CrossRef Google Scholar

Rock, M.T. (1998), ‘Freshwater use, freshwater scarcity, and socioeconomic development’, Journal of Environment and Development 7: 278–301.CrossRef Google Scholar

Sala-i-Martin, X.X. (1997), ‘I just ran two million regressions’, American Economic Review 87: 178–183.Google Scholar

Sala-i-Martin, X.X., Doppelhoffer, G., and Miller, R. (2004), ‘Determinants of long-term growth: a Bayesian averages of classical estimates (BACE) approach’, American Economic Review 94: 813–835.CrossRef Google Scholar

Selden, T.M. and Song, D. (1994), ‘Environmental quality and development: is there a Kuznets curve for air pollution?’, Journal of Environmental Economics and Environmental Management 27: 147–162.CrossRef Google Scholar

Shafik, N. and Bandyopadhyay, S. (1992), ‘Economic growth and environmental quality’, World Bank Policy Research Working Paper WPS 904.Google Scholar

Smulders, S., Bretschger, L., and Egli, H. (2005), ‘Economic growth and the diffusion of clean technologies: explaining environmental Kuznets curves’, Economics Working Paper Series, Working Paper 05/42, WIF – Institute of Economic Research.CrossRef Google Scholar

Stern, D.I. (2002), ‘Explaining changes in global sulfur emissions: an econometric decomposition approach’, Ecological Economics 42: 201–220.CrossRef Google Scholar

Stern, D.I. (2004), ‘The rise and fall of the environmental Kuznets curve’, World Development 32: 1419–1439.CrossRef Google Scholar

Stern, D.I. (2005), ‘Beyond the environmental Kuznets curve: diffusion of sulfur emissions abating technology’, Journal of Environment and Development 14: 101–124.CrossRef Google Scholar

Stern, D.I., Common, M.S., and Barbier, E.B. (1996), ‘Economic growth and environmental degradation: the environmental Kuznets curve and sustainable development’, World Development 24: 1151–1160.CrossRef Google Scholar

Stokey, N.L. (1998), ‘Are there limits to growth?’, International Economic Review 39: 1–31.CrossRef Google Scholar

Torras, M. and Boyce, J. (1998), ‘Income, inequality, and pollution: a reassessment of the environmental Kuznets curve’, Ecological Economics 25: 147–160.CrossRef Google Scholar

Figure 1. Distribution of SO2/observations in GEMS/AIRS data 1974–1993Notes: Obs < 20 refers to a number of countries that feature less than 20 observations (Greece, Italy, Thailand, Venezuela, Malaysia, Indonesia, South Korea, Peru, Pakistan). Obs 21–30 refers to countries with between 21 to 30 observations (Argentina, Finland, Sweden, Chile, France, Denmark).Source: US–EPA maintained GEMS/AIRS dataset http://www.epa.gov/airs/aexec.html

Figure 2. Relationship between median SO2 concentrations and income (by measuring station 1974–1993)Note: Fitted values are for the fixed-effects regression $Log{\rm}SO_{2{\it it}} = \alpha _i + \beta {\it GDP}_{\it it} + \delta {\it GDP}_{\it it}^2 + \gamma {\it GDP}_{\it it}^3 + \varepsilon _{it}.$Source: US–EPA maintained GEMS/AIRS dataset http://www.epa.gov/airs/aexec.html

Figure 3. Relationship between median SO2 concentrations and income (five-year averages 1974–1993)Note: Five-year averages of log median SO2 concentrations, aggregated from the station to the country level.Source: US–EPA maintained GEMS/AIRS dataset http://www.epa.gov/airs/aexec.html