Hostname: page-component-745bb68f8f-lrblm Total loading time: 0 Render date: 2025-02-11T02:00:34.175Z Has data issue: false hasContentIssue false

Implications of the WTP–WTA Disparity for Benefit–Cost Analysis

Published online by Cambridge University Press:  17 March 2015

James K. Hammitt*
Affiliation:
Harvard University (Center for Risk Analysis), 718 Huntington Avenue, Boston, MA 02115, USA, e-mail: jkh@harvard.edu Toulouse School of Economics (LERNA), 21 allée de Brienne, 31000 Toulouse, France, Phone: +1 617 432 4343, Fax: +1 617 432 0190
Rights & Permissions [Opens in a new window]

Abstract

Differences between estimated willingness to accept compensation (WTA) and willingness to pay (WTP) that are larger than can be explained by standard economic theory raise questions about which measures should be used for benefit–cost analysis (BCA). These differences do not create a new problem but accentuate an existing one: the fact that the Kaldor–Hicks compensation test is ambiguous when its two components conflict. This conflict is more likely when the difference between WTA and WTP measures of a change is large. In many cases, the same individuals receive benefits and incur costs from a policy change and their preferences for the policy cannot depend on whether they ask whether their WTP for the benefit exceeds the cost they will incur or their WTA to forgo the benefit exceeds the cost they will save. In cases where benefits and costs are incurred by different people, it seems more useful to evaluate the fundamental question – whether the benefits to some justify the harms to others – than to obscure this question through a technical debate about valuation measures.

Type
Articles
Copyright
© Society for Benefit-Cost Analysis 2015 

Benefit–cost analysis (BCA) evaluates a policy intervention by comparing the monetary value of the benefits to individuals who gain with the monetary value of the harms to individuals who lose. But there are two measures each of the monetary value of a benefit and of a harm to an individual: willingness to pay (WTP) for a benefit or to avoid a harm and willingness to accept compensation (WTA) to forgo a benefit or accept a harm. Under standard economic theory, in many circumstances the WTP and WTA measures of a benefit, and of a harm, should be nearly equal in magnitude and so the choice of which to employ should have little effect on the result of a BCA. In other circumstances, the two measures may be quite different and the choice between them may matter. Moreover, several decades of research in stated-preference and experimental economics has shown that empirical estimates of WTP and WTA often differ substantially, even in contexts where standard theory suggests they should be nearly equal. The disparity between WTP and WTA raises the question of which measures should be used, and when, in BCA.Footnote 1

1. WTP and WTA

To fix ideas, consider a project that will affect the quantity of some public or other good $G$ available to individuals. Individuals cannot affect the quantity of $G$ they consume or have available, but it can be altered by public policy.

Figure 1 illustrates an individual’s indifference curves between $G$ and wealth or income that can be allocated for spending on all other goods. Consider first an individual at point $O$ with wealth $w$ and quantity $q$ for whom the policy will increase the quantity of $G$ to $q^{+}$ . Her WTP (compensating variation) for the increase is the amount $A$ , because the point $Y$ lies on the same indifference curve as her initial position $O$ . Her WTA to forgo the increase in the good (equivalent variation) is $B$ , because increasing her wealth from $w$ to $w+B$ holding quantity fixed at $q$ puts her at $Z$ , which is on the same indifference curve as increasing $q$ to $q^{+}$ while holding wealth fixed at $w$ (point $X$ ).

Figure 1 Four measures of value.

Next consider an individual who will be harmed by the policy. Assume the policy will reduce his quantity of $G$ from an initial level $q$ to a smaller level $q^{-}$ (although Figure 1 is drawn assuming the two individuals have identical wealth and initial levels of $G$ , this is not necessary and not true in general). The policy loser’s WTA (compensating variation) for the reduction in $G$ is $D$ and his WTP to prevent the reduction (equivalent variation) is $C$ .Footnote 2

If the indifference curves are vertically parallel, then the WTA and WTP measures for each individual are identical (i.e., $A=B$ and $C=D$ ).Footnote 3 Vertically parallel indifference curves imply that the slopes of the indifference curves may depend on $q$ but do not depend on $w$ ; in other words, the marginal rate of substitution between wealth and $G$ may depend on the quantity of $G$ but not on wealth. This implies that the income elasticities of WTP and of WTA are zero.Footnote 4

If WTA to forgo an improvement is greater than WTP for the improvement from $q$ to $q^{+}$ , then the higher indifference curve is steeper on average between $Z$ and $X$ than the original indifference curve between $O$ and $Y$ . Equivalently, the income elasticities of WTP and of WTA are larger (on average) when income is larger (for values of $G$ between $q$ and $q^{+}$ ).

For many goods, it is reasonable to assume that the income elasticity of WTP is positive, so WTA is larger than WTP. But for small changes in the quantity of $G$ , it seems intuitive that the indifference curves must be nearly parallel (they cannot cross), which implies WTP must be nearly equal to WTA. This intuition is not strictly accurate, however. Hanemann (Reference Hanemann1991) shows that the difference between WTP and WTA can be arbitrarily large, even for a small change in the quantity of $G$ , if $G$ has no close market substitutes. The intuition is that if there is a close market substitute for $G$ (either a single good or a basket of goods), then an individual for whom the quantity $q$ is not optimal can adjust by purchasing more or less of the substitute market good, so her consumption of $G$ plus its market substitute is optimal. If the market substitute is available at a fixed price, this priceFootnote 5 equals her marginal WTP and WTA for a change in $G$ and her indifference curves between wealth and $G$ are parallel straight lines. In contrast, if there is no close market substitute for $G$ , the consumer cannot adjust for a non-optimal quantity and her indifference curves may be arbitrarily far from vertically parallel.Footnote 6

The empirical finding that estimated WTA is often much larger than estimated WTP implies either that the indifference curves are not approximately vertically parallel over the relevant domain, or that the empirical method is not accurately estimating these monetary values. A number of hypotheses have been suggested to explain this disparity (Mitchell & Carson, Reference Mitchell and Carson1989). In addition to Hanemann’s (Reference Hanemann1991) result concerning the absence of market substitutes for the good, other explanations include income effects and transaction costs (Randall & Stoll, Reference Randall and Stoll1980), uncertainty about and caution in revealing one’s true WTP or WTA, commitment costs (loss of opportunity to learn about the value of a good before acting; see Zhao & Kling, Reference Zhao and Kling2004), and limited incentives to learn about preferences for a hypothetical transaction (Guzman & Kolstad, Reference Guzman and Kolstad2007).

Other explanations depart from the standard economic model such as reference dependence, loss aversion, or other endowment effects (Thaler, Reference Thaler1980; Kahneman, Knetsch & Thaler, Reference Kahneman, Knetsch and Thaler1990). For example, if people characterize a change in $G$ as in the domain of gains or losses relative to some reference level and are more averse to losses than to gains, then the value of a specific change in $G$ may depend on the reference level (Knetsch, Reference Knetsch2010; Knetsch, Riyanto & Zong, Reference Knetsch, Riyanto and Zong2012). Reference dependence implies that individuals do not have indifference curves that are defined over consequences alone, like those in Figure 1, but rather sets of indifference curves over consequences that are conditional on alternative reference levels.

Alternatively, some authors claim that the disparity is due to experimental-design features and elicitation techniques such as the use of elicitation methods that are not incentive compatible (e.g. Plott & Zeiler, Reference Plott and Zeiler2005, Reference Plott and Zeiler2007). Isoni, Loomes and Sugden (Reference Isoni, Loomes and Sugden2011) find that strong experimental procedures eliminate the disparity for some goods, but not for others. There is no consensus on why a large gap between the two values is often observed.

An important question for BCA is whether the observed differences between WTA and WTP represent individuals’ informed, normative preferences or reflect cognitive errors or limitations of the methods used to estimate these values (Hammitt, Reference Hammitt2013). Three meta-analyses of the empirical literature reveal that the difference between WTP and WTA varies systematically with characteristics of the good that is valued and the valuation method. In their pioneering meta-analysis of 45 studies, Horowitz and McConnell (Reference Horowitz and McConnell2002) evaluated the effects of the type of good that was valued and the strength of the experimental design. They found the type of good is systematically related to the WTP–WTA disparity, with smaller disparities for ordinary private goods than for public goods or other goods that are not usually available in markets. They found little evidence that weak experimental design contributes to a larger disparity; e.g., they found no systematic difference in the disparity between studies using hypothetical and real transactions and, surprisingly, that the disparity is significantly smaller for studies using non-incentive-compatible elicitation mechanisms. They also found a smaller disparity in studies using student subjects. The second meta-analysis, of 39 studies by Sayman and Onculer (Reference Sayman and Onculer2005), found that the use of an incentive-compatible elicitation mechanism decreases the disparity.

The third meta-analysis, of 76 studies by Tunçel and Hammitt (Reference Tunçel and Hammitt2014), reinforces and refines these conclusions. Tunçel and Hammitt found a systematic relationship between the magnitude of the disparity and the type of good valued, with a geometric-mean ratio of WTA to WTP of about 6 and 5 for environmental goods and for health and safety, respectively, and of about 1.5 for ordinary private goods and for travel or leisure time. They found the disparity is smaller for experiments using incentive-compatible elicitation mechanisms and also smaller when participants have prior experience buying or selling the good or when they have gained experience through participating in multiple elicitation rounds as part of the study. They also found the disparity is on average smaller in more recent studies; in part this reflects improvements in experimental design but a statistically significant effect remains even after controlling for design features. Tunçel and Hammitt tested Hanemann’s (Reference Hanemann1991) suggestion that the disparity could be larger for goods lacking close substitutes by including a variable reflecting their judgment about whether market substitutes exist. They found some evidence supporting this hypothesis, but the existence of substitutes is confounded with the type of good, as almost all ordinary private goods have close substitutes. They found no statistically significant association of the WTA/WTP ratio with the absence of market substitutes when restricting the analysis to goods that are not ordinary private goods.

2. Implications for BCA

Large positive differences between estimated WTA and WTP measures of the value of a benefit, or the value of a harm, tend to accentuate the ambiguity that can arise under the conventional justification for BCA, the Kaldor–Hicks compensation test. The compensation test is composed of two criteria: the Kaldor criterion uses compensating variations; it classifies the policy intervention as socially beneficial if the total WTP of those who gain exceeds the total WTA of those who are harmed. A large difference between WTA and WTP will make this criterion more difficult to satisfy, hence favoring the status quo over the policy intervention. In contrast, the Hicks criterion uses equivalent variations; it classifies the policy intervention as socially beneficial if the total WTP to prevent the intervention of those who are harmed is less than the total WTA to forgo the intervention of those who gain. A large difference between WTA and WTP makes this criterion easier to satisfy, hence favoring the intervention over the status quo. Combining these two effects, a large difference between WTA and WTP makes it more likely that the Kaldor–Hicks compensation test is ambiguous: gainers cannot compensate losers for the intervention and losers cannot compensate gainers to forgo the intervention.

One resolution to a contradiction between the Kaldor and Hicks criteria in a specific case is to recognize that the appropriate test may depend on property rights (Mitchell & Carson, Reference Mitchell and Carson1989; Freeman, Reference Freeman1993). If those who would be harmed by a policy intervention have a legal right to avoid that harm, then the Kaldor criterion is appropriate; if those who would benefit have a right to that gain, then the Hicks criterion is appropriate. As an example, assume that hunters (or hikers) use some parcel of undeveloped land and a dam is proposed that would flood the land. If the hunters have a legal right to their use of the land, because of ownership, custom, or other reasons, the dam should not be built unless the WTP of those who gain exceeds the hunters’ WTA for their loss of access. Alternatively, if the hunters are trespassers with no right to their use, the dam should be built unless their WTP to prevent construction exceeds the WTA of those who benefit from the dam.

Knetsch (Reference Knetsch2010) suggests a modification of this property-rights approach, in which the reference point (in the example, whether the hunters are entitled to their use) depends not necessarily on legal rights but on what people identify as the normal or expected state or on their legitimate expectations. For example, he suggests that in evaluating whether to remediate a toxic-waste spill, the situation without the spill is the natural reference and so the Hicks criterion is relevant: the test for whether the spill should be remediated is whether WTP to avoid cleanup by the responsible party exceeds WTA to forgo cleanup by the beneficiaries, not whether the beneficiaries’ WTP for remediation exceeds the costs to the responsible party.

For many public policies, resolving the ambiguity by designating either the Kaldor or the Hicks criterion as definitive based on considerations of rights or legitimate expectations seems inadequate. In some important cases, the benefits and costs of a policy are imposed on the same people. For example, the costs of policies to improve product or transportation safety are likely to be passed on to the beneficiaries through higher prices of the specific goods or services. Similarly, the benefits of reduced air pollution will tend to be broadly distributed over a regional population and the costs in the form of higher prices for electricity or fuels may also be broadly distributed over the same population. Even in the example of remediating a toxic-waste spill, while the responsible party may bear the cost of cleanup, the long-run effect may be that firms exercise more care to prevent toxic-waste spills and pass the costs on to their consumers; so in the long run the gainers and losers may substantially overlap.Footnote 7 Although the population distributions of benefits and costs need not be the same, in each of these cases it is likely that most individuals receive some benefits and bear some costs, so it is necessary to evaluate the net effect on each person to determine who benefits and who is harmed. Estimating benefits and costs separately is not sufficient.

In cases where both benefits and costs are incurred by the same people, each individual must prefer either the policy or its absence (or be indifferent between them). If an individual’s judgment about whether he favors the policy is dependent on whether he asks himself whether his WTP for the benefit is less than the cost he will bear or his WTA compensation to forgo the project exceeds the cost he will save, then he is incoherent and may be easily manipulated by clever policy advocates.

An individual’s choice can be correctly framed using either WTP or WTA, but the relevant comparison is not the comparison between $A$ and $B$ that is illustrated in Figure 1. If the individual has initial quantity of the good $q$ and wealth $w$ , he can compare his WTP for the increase in $G$ from $q$ to $q^{+}$ (the quantity $A$ in Figure 1) with the cost he will bear. Alternatively, he can compare his WTA to forgo the increase in $G$ with the cost he will save. This is not the amount $B$ illustrated in Figure 1, but is his WTA to forgo the increase from $q$ to $q^{+}$ when his wealth is $w$ less than the cost he will bear. Suppose the cost he will bear is $A$ , so his WTP for the increase in $G$ equals the cost and he is indifferent to the policy. Then his WTA to forgo the increase from $q$ to $q^{+}$ when his wealth is $w-A$ is exactly $A$ , and this alternative framing also reveals him to be indifferent to the policy.

With reference-dependent preferences, if WTP for the increase uses $q$ as the reference and WTA to forgo the increase uses $q^{+}$ as the reference then the results of these alternative framings may be inconsistent – but that suggests the individual must somehow reconcile the different perspectives. Alternatively, if the reference-dependent preferences are normative, this introduces another source of uncertainty into BCA: the analyst will generally not know the individual’s reference level and hence how the change in $G$ should be valued. The reference level can be interpreted as an ancillary condition that may affect an individual’s choices but is not known by the analyst, leading to ambiguity about the preferences that are revealed by his choices (Bernheim & Rangel, Reference Bernheim and Rangel2009).

In other cases, the individuals who benefit or are harmed by a policy may be distinct and the Kaldor and Hicks criteria may conflict. This conflict can arise even if the differences between WTA and WTP are small, but will arise for a larger set of policies when the differences are large. These are cases in which the policy has a distributional effect that is comparable to the efficiency effect, in the sense that whether the estimated net benefits are positive or negative is sensitive to whether one treats the gainers’ or losers’ consequences as primary (e.g., as being favored by legal right or legitimate expectation). In these cases, debating the relative merits of the Kaldor and Hicks criteria seems to obscure rather than elucidate the fundamental question, which is whether the benefits to the gainers justify imposing the harms on the losers. It seems likely that a better resolution may be achieved by addressing this question directly. The answer to whether the distributional consequence is justified may, of course, depend on legal rights and legitimate expectations.

3. Conclusion

The empirical finding that WTA measures of value often exceed WTP measures to a degree that is inconsistent with standard economic theory does not create a new problem for BCA, but accentuates an existing problem. The fundamental problem is the need to compare benefits provided to some people with harms imposed on others. This problem does not arise in all cases. In many cases, the benefits and costs of a policy may accrue to the same people (e.g., when product costs are passed forward to consumers). In these cases, there may still be important distributional effects (when the benefits and costs are not identically distributed), but the effect on each individual depends on both the benefits and costs she incurs. Whether each individual prefers the policy cannot logically depend on whether she asks whether her WTP for the benefit she will receive exceeds the cost she will pay or whether her WTA to forgo the benefit exceeds the cost she will save.

Lacking consensus on a method to compare changes in utility between individuals as required for a social welfare function (Adler, Reference Adler2012), BCA attempts to compare benefits and harms to different people using money as a metric. But this leaves it vulnerable to the problem that the monetary value of a change is not uniquely defined; e.g., it depends on whether the value is measured from the situation with or without the policy. Hence the Kaldor–Hicks compensation test is in some cases ambiguous, when the Kaldor and Hicks criteria yield opposing conclusions. An increase in the difference between WTA and WTP measures of the value of some change will tend to expand the set of policy interventions for which the Kaldor–Hicks compensation test is ambiguous. It seems more useful in these cases to address the fundamental question – whether the benefits to those who gain from the policy justify the harms to those who lose – than to engage in technical debate about WTP and WTA.

Acknowledgments

I thank Glenn Blomquist, Jack Knetsch, and Tuba Tunçel for helpful discussion and comments.

Footnotes

1 In practice, when there is a distinction, WTP measures are typically used, even when WTA is arguably more relevant. This preference reflects a judgment that WTP estimates are more credible (US Office of Management and Budget, 2003).

2 Note that $S$ is on the same indifference curve as the initial point $O$ and $R$ is on the same indifference curve as $T$ . WTP to prevent a reduction in $q$ is a curious framing: it suggests the respondent is entitled to $q$ but must pay to avoid a reduction in it, which raises the question of what it means to be entitled to  $q$ .

3 Another comparison that can be made between WTP and WTA concerns different changes in the quantity of $G$ . Beginning at the point $O$ , WTP for an increase in $G$ from $q$ to $q^{+}$ is $A$ and WTA for a decrease from $q$ to $q^{-}$ is $D$ . If the increments $q^{+}-q$ and $q-q^{-}$ are equal in magnitude, then WTA is larger than WTP so long as the indifference curve is convex (as illustrated in Figure 1). Although these are alternative measures of the value of the same increment in $G$ , they do not measure the value of the same change; e.g., the change between being sighted in one eye or in both is not the same as the change between being sighted in one eye or blind.

4 Recall that the income elasticities of WTP and of WTA are not the same as the income elasticity of demand (Flores & Carson, Reference Flores and Carson1997) and hence estimates of the income elasticity of demand are not necessarily informative about the income elasticity of WTP. Income elasticity of any outcome $H$ is defined as the proportional change in $H$ divided by the proportional change in income. Assume an increase in income increases demand for some good so that the new demand curve is higher than but parallel to the initial demand curve. Because the curves are parallel, the absolute increases in marginal WTP and in the quantity demanded do not depend on the initial point, but the proportional increases do. Comparing a point $L$ with large marginal WTP and small quantity demanded with another point $M$ on the original demand curve with small marginal WTP and large quantity demanded, the proportional increase in marginal WTP will be smaller at $L$ than at $M$ and the proportional change in quantity demanded will be larger at $L$ than at $M$ . It is quite possible that the income elasticity of marginal WTP is smaller than the income elasticity of demand at $L$ but larger at $M$ .

5 The price must be measured in dollars per the quantity of the market good that substitutes for one unit of $G$ .

6 In an extreme case, a consumer’s indifference curves could be right angles defined by the utility function $U(q,w)=\min (q,{\it\alpha}w)$ , where ${\it\alpha}$ is a parameter. If $q<{\it\alpha}w$ , her marginal utility of wealth is zero. An increase in $q$ allows her to reach a higher indifference curve and her WTP for that increase in $q$ is positive but there is no increase in wealth that permits her to achieve the higher indifference curve and so her WTA for that change in $q$ is infinite.

7 Even when the same people receive the benefits and bear the costs of a policy, the population distributions of benefits and costs need not be identical and there may be gainers and losers. For example, higher-income consumers may value the health and safety benefits of product-safety standards more than lower-income consumers, though the increase in price is the same for all. Linneman (Reference Linneman1980) describes how the Consumer Product Safety Commission’s 1973 mattress-flammability standard benefits careless and high-income consumers at the expense of careful and low-income consumers.

References

Adler, M. (2012). Well-Being and Fair Distribution: Beyond Cost-Benefit Analysis. Oxford: Oxford University Press.Google Scholar
Bernheim, B. D. & Rangel, A. (2009). Beyond Revealed Preference: Choice-Theoretic Foundations for Behavioral Welfare Economics. Quarterly Journal of Economics, 124, 51104.Google Scholar
Flores, N. E. & Carson, R. T. (1997). The Relationship between the Income Elasticities of Demand and Willingness to Pay. Journal of Environmental Economics and Management, 33, 287295.Google Scholar
Freeman, A. M. III (1993). The Measurement of Environmental and Resource Values: Theory and Methods. Washington, DC: Resources for the Future.Google Scholar
Guzman, R. M. & Kolstad, C. D. (2007). Researching Preferences, Valuation and Hypothetical Bias. Environmental and Resource Economics, 37, 465487.CrossRefGoogle Scholar
Hammitt, J. K. (2013). Positive Versus Normative Justifications for Benefit-Cost Analysis: Implications for Interpretation and Policy. Review of Environmental Economics and Policy, 7, 199218.CrossRefGoogle Scholar
Hanemann, W. M. (1991). Willingness to Pay and Willingness to Accept: How Much Can They Differ? The American Economic Review, 81, 635647.Google Scholar
Horowitz, J. K. & McConnell, K. E. (2002). A Review of WTP/WTA Studies. Journal of Environmental Economics and Management, 44, 426447.Google Scholar
Isoni, A., Loomes, G. & Sugden, R. (2011). The Willingness to Pay-Willingness to Accept Gap, the ‘Endowment Effect’, Subject Misconceptions, and Experimental Procedures for Eliciting Valuations: Comment. American Economic Review, 101, 9911011.Google Scholar
Kahneman, D., Knetsch, J. L. & Thaler, R. H. (1990). Experimental Tests of the Endowment Effect and the Coase Theorem. Journal of Political Economy, 98, 13251348.CrossRefGoogle Scholar
Knetsch, J. L. (2010). Values of Gains and Losses: Reference States and Choice of Measure. Environmental and Resource Economics, 46, 179188.CrossRefGoogle Scholar
Knetsch, J. L., Riyanto, U. E. & Zong, J. (2012). Gain and Loss Domains and the Choice of Welfare Measure of Positive and Negative Changes. Journal of Benefit-Cost Analysis, 3(4), Article 1.CrossRefGoogle Scholar
Linneman, P. (1980). The Effects of Consumer Safety Standards: The 1973 Mattress Flammability Standard. Journal of Law and Economics, 23, 461479.CrossRefGoogle Scholar
Mitchell, R. C. & Carson, R. T. (1989). Using Surveys to Value Public Goods: The Contingent Valuation Method. Washington, DC: Resources for the Future.Google Scholar
Plott, C. R. & Zeiler, K. (2005). The Willingness to Pay–Willingness to Accept Gap, the ‘Endowment Effect’, Subject Misconceptions, and Experimental Procedures for Eliciting Valuations. American Economic Review, 95, 530545.Google Scholar
Plott, C. R. & Zeiler, K. (2007). Exchange Asymmetries Incorrectly Interpreted as Evidence of Endowment Effect Theory and Prospect Theory? American Economic Review, 97, 14491466.Google Scholar
Randall, A. & Stoll, J. R. (1980). Consumer’s Surplus in Commodity Space. American Economic Review, 70, 449455.Google Scholar
Sayman, S. & Onculer, A. (2005). Effects of Study Design Characteristics on the WTP/WTA Disparity: A Meta Analytical Framework. Journal of Economic Psychology, 26, 289312.CrossRefGoogle Scholar
Thaler, R. H. (1980). Toward a Positive Theory of Consumer Choice. Journal of Economic Behavior and Organization, 1, 3960.CrossRefGoogle Scholar
Tunçel, T. & Hammitt, J. K. (2014). A New Meta-Analysis on the WTP/WTA Disparity. Journal of Environmental Economics and Management, 68, 175187.CrossRefGoogle Scholar
US Office of Management and Budget (OMB) (2003). Circular A-4, Regulatory Analysis. Washington, DC: OMB.Google Scholar
Zhao, J. & Kling, C. L. (2004). Willingness to Pay, Compensating Variation, and the Cost of Commitment. Economic Inquiry, 42, 503517.Google Scholar
Figure 0

Figure 1 Four measures of value.