The one player guessing game: a diagnosis on the relationship between equilibrium play, beliefs, and best responses

Ciril Bosch-Rosa; Thomas Meissner

doi:10.1007/s10683-020-09642-2

The one player guessing game: a diagnosis on the relationship between equilibrium play, beliefs, and best responses

Published online by Cambridge University Press: 14 March 2025

Ciril Bosch-Rosa and

Thomas Meissner

Show author details

Ciril Bosch-Rosa*: Affiliation:
Chair of Macroeconomics, Technische Universität Berlin, Berlin, Germany
Thomas Meissner: Affiliation:
Department of Microeconomics and Public Economics, Maastricht University, Maastricht, The Netherlands
*: cirilbosch@gmail.com

Article contents

Abstract
Introduction
Experimental design
Results
Conclusion
Footnotes
References

Rights & Permissions

Abstract

Experiments involving games have two dimensions of difficulty for subjects in the laboratory. One is understanding the rules and structure of the game and the other is forming beliefs about the behavior of other players. Typically, these two dimensions cannot be disentangled as belief formation crucially depends on the understanding of the game. We present the one-player guessing game, a variation of the two-player guessing game (Grosskopf and Nagel 2008), which turns an otherwise strategic game into an individual decision-making task. The results show that a majority of subjects fail to understand the structure of the game. Moreover, subjects with a better understanding of the structure of the game form more accurate beliefs of other player’s choices, and also better-respond to these beliefs.

Keywords

Guessing game Strategic thinking Cognitive sophistication

JEL classification

D84: Expectations • Speculations D12: Consumer Economics: Empirical Analysis G11: Portfolio Choice • Investment Decisions C91: Laboratory, Individual Behavior

Type: Original Paper
Information: Experimental Economics , Volume 23 , Issue 4 , December 2020 , pp. 1129 - 1147

DOI: https://doi.org/10.1007/s10683-020-09642-2 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © The Author(s) 2020

1 Introduction

Subjects in laboratory experiments consistently deviate from equilibrium behavior (Camerer Reference Camerer2003). Many models of bounded rationality try to explain these deviations through errors in belief formation (e.g., Nagel Reference Nagel1995; Ho et al. Reference Ho, Camerer and Weigelt1998; Weizsäcker Reference Weizsäcker2003). Another explanation is that subjects fail to fully understand the structure of the game (Chou et al. Reference Chou, McConnell, Nagel and Plott2009 refer to this as an absence of “game form recognition”). Generally, when analyzing deviations from equilibrium behavior, one would expect both of these effects to play a role. However it is typically hard (if not impossible) to distinguish between the two, as correct belief formation crucially depends on a correct understanding of the structure of the game. With the help of a novel one-player guessing game experiment, we are able to disentangle these two effects, thus improving the understanding of why subjects deviate from equilibrium behavior.

An extensive literature has attempted to analyze both belief formation and understanding the structure of the game. Costa-Gomes and Crawford (Reference Costa-Gomes and Crawford2006) present subjects with a series of two-player dominance-solvable games and conclude that most subjects understand the games, but play non-equilibrium strategies due to their “simplified models of others’ decisions.” In Costa-Gomes and Weizsäcker (Reference Costa-Gomes and Weizsäcker2008) the authors look at subject’s actions and their stated beliefs, and find that subjects rarely best respond to their stated beliefs. However, Rey-Biel (Reference Rey-Biel2009) observes that in simplified versions of the games studied in Costa-Gomes and Weizsäcker (Reference Costa-Gomes and Weizsäcker2008), Nash Equilibrium is a better predictor of subject behavior than any other model based on level-K reasoning.

Another strand of the literature focuses on whether subjects understand the structure of the game. Using two-player guessing games, Chou et al. (Reference Chou, McConnell, Nagel and Plott2009) find that subjects are surprisingly unable to understand the experimental setup they are participating in. By using different sets of instructions for the same game, and by introducing hints, they show that subjects do not deviate from equilibrium because of cognitive biases, but rather because of a lack of game form recognition, which they define as the relationships between possible choices, outcomes, and payoffs. Fragiadakis et al. (Reference Fragiadakis, Knoepfle and Niederle2016) let subjects play a two-player guessing game repeatedly against random opponents, and subsequently ask subjects to replicate or best respond to their previous choices. They find that while behavior of only 30% of subjects is consistent with a set of commonly used models (including equilibrium play and level-K), they also identify subjects who play strategically but are not identified by commonly used models. Finally, Agranov et al. (Reference Agranov, Caplin and Tergiman2015) develop an experimental protocol that allows them to track the decision-making process of subjects in a beauty contest game. The results show that around 45% of subjects consider playing weakly dominated strategies at some point in their decision-making process.

In this experiment, we use a one-player guessing game which allows to measure how well subjects understand the structure of the two-player guessing game (Grosskopf and Nagel Reference Grosskopf and Nagel2008). In this “game” subjects play the role of both players in a two-player guessing game. That is, they are asked to pick not one but two numbers between 0 and 100, and are paid according to the proximity of each of their choices to two thirds of the average of both choices. This setup switches off the belief channel, but still demands subjects to understand the structure of the game.Footnote ¹ By comparing their actions in a two-player guessing game to the choices made in the one-player guessing game we can disentangle the effects of beliefs from “game form recognition,” and analyze to what extent understanding the structure of the game determines their belief formation and their best-responses.Footnote ²

Our experimental results show that a majority of subjects fails to fully solve the one-player guessing game, and that subjects with a better understanding of the structure of the one-player guessing game play values closer the Nash Equilibrium in the two-player guessing game. This implies that an important part of non-equilibrium play is likely due to the inability of subjects to fully understand the structure of the game. Additionally, we observe that subjects with a better understanding of the one-player guessing game form more accurate beliefs, are better at best-responding to their own beliefs, and tend to better adjust their beliefs according to the population they face. These results confirm the intuition that understanding the structure of the game is crucial for belief formation.

2 Experimental design

The experiment consists of four different parts: Subjects first play the one-player guessing game (1PG), followed by the two-player guessing game (2PG). After this, we elicit subjects’ beliefs about other subjects’ two-player guessing game choices. A subset of subjects then participated in an additional belief elicitaton task (“What-if” belief elicitation). At the end of the experiment, all subjects are asked to answer a battery of cognitive ability tests. In the following we describe each part of the experiment in more detail.

2.1 The one-player guessing game (1PG)

The one-player guessing game, first introduced in Bosch-Rosa et al. (Reference Bosch-Rosa, Meissner and Bosch-Domènech2018), allows to test whether subjects can solve the two-player guessing game introduced by Grosskopf and Nagel (Reference Grosskopf and Nagel2008) free of any strategic concerns.Footnote ³

In essence, subjects play the role of both players in a two-player guessing game, i.e. they play the two player guessing game “against themselves.” Accordingly, each subject (i) picks two numbers $x_{i} \in [0, 100]$ and $y_{i} \in [0, 100]$ and is paid depending on the absolute distance of each chosen number to the “target value” which is two thirds of the average of both numbers. The further away each chosen number is from this target value, the lower is the payoff. Formally the experimental payoff for choosing number $x_{i}$ and $y_{i}$ is:

(1)

(2)

Subjects are paid for both choices, so their combined payoff is:

(3)

\begin{matrix} Π_{i}^{1 P G} = max (π_{i}^{1 P G} (x_{i}) + π_{i}^{1 P G} (y_{i}), 0) . \end{matrix}

The payoff function is maximized at $(y_{i} = 0, x_{i} = 0)$ . This solution can be found through logical induction by starting with a random value $x_{0, i}$ , and then calculating the “best response” which is $y_{1, i}^{'} = \frac{1}{2} x_{0, i}$ . Following this, a “best response to the best response” can be calculated ( $x_{1, i}^{'} = \frac{1}{2} y_{1, i}^{'}$ ) and so on until reaching the fixed point ( $x_{\infty, i}^{'} = 0$ , $y_{\infty, i}^{'} = 0$ ).

By turning the two-player guessing game into an algebraic problem with no strategic uncertainty, we can separate those subjects who can solve the mathematical problem associated with the guessing game from those who cannot.Footnote ⁴

2.2 The two-player guessing game (2PG)

The two-player guessing game that we use is an adaptation of the one presented in Grosskopf and Nagel (Reference Grosskopf and Nagel2008) and Nagel et al. (Reference Nagel, Bühren and Frank2016). Subjects are matched in pairs and asked to simultaneously pick a number $z_{i} \in [0, 100]$ . In Grosskopf and Nagel (Reference Grosskopf and Nagel2008) the winner is whoever picks the number closer to 2/3 of the average of both numbers, so unlike in games with $N > 2$ subjects, now $z_{i} = 0$ is a (unique) weakly dominant strategy. In our version of the 2PG, the payments are based on the (absolute) distance of each individual pick to 2/3 of the average of both numbers. Formally, the payment for player i depends on the choices of player j and her own in the following way:Footnote ⁵

(4)

This small change in payoffs dramatically changes the game as now the equilibrium is reached through iterated deletion of strictly dominated strategies, and zero is no longer a weakly dominant strategy. Now the best response is to choose 1/2 of the number a player believes the other player chooses.Footnote ⁶

We opted for this modification of the original game for two reasons. First, it allows us to de facto ask subjects for a point estimate of their belief about the other subject’s choice, and secondly, and more important, it makes the game comparable to the 1PG. Note that while certainly not standard, distance-based payoff structures are widely used in the literature. Güth et al. (Reference Güth, Kocher and Sutter2002) first utilized such a payoff structure, arguing that it more closely resembles the financial decision-making situations that beauty contests are often intended to emulate. Since then a number of experiments have used distance-based beauty contests.Footnote ⁷ Most relevant for our experiment is Nagel et al. (Reference Nagel, Bühren and Frank2016), who directly compare distance-based and tournament incentives in two player guessing games and find no significant differences across the choices of subjects.

2.3 Belief elicitation

After subjects had played the 1PG and the 2PG (with no feedback in both cases) we elicited their beliefs about the other players’ decisions in the 2PG. Similar to Lahav (Reference Lahav2015), subjects were asked to distribute a total of 19 “tokens” into 20 “bins”.

Each token represented a subject in the session (each session consisted of 20 subjects), and each bin had a range of 4 integers that players could play in the 2PG (i.e. the first bin had the range [0,4], the second [5,9], and so on). See Fig. 11 in "Electronic supplementary material Appendix C" for a screen-shot of the experimental interface.

To incentivize subjects, we used a linear scoring rule that paid €0.10 for each token that overlapped with the choice of any other subject in the 2PG. For instance, if a subject put three tokens in the bin “5–9” and in her session only 2 subjects had actually played any value within this range, then she would receive a total of 20 cents for the tokens allocated in that bin. If, on the other hand, she placed 5 token in the bin “0–4” and 10 subjects had played a value in this range, then she would be paid 50 cents for the tokens allocated in that bin.

Formally, define $b_{ij}$ as the number of tokens that subject i deposited in bin j, and $p_{- i j}$ as the number of subjects other than player i that chose a value that falls within bin j in the 2PG. Then the payoff for belief formation for subject i is:

(5)

The resulting distribution of beliefs provides an estimate of what subjects think about other subjects’ choices, and allows us to analyze how subject best-respond to their own beliefs.Footnote ⁸

2.3.1 “What if” belief elicitation

Since playing the 1PG could have an influence on the beliefs subjects form in the 2PG, we asked a subset of 40 subjects to additionally guess the choices of players in a 2PG who had not previously taken part in the 1PG.Footnote ⁹ The incentives for this elicitation task are the same as the ones described above, and the data came from a random pick of 19 subjects from a sample of 80 subjects who we had invited two weeks earlier to participate in a 2PG without previously taking part in the 1PG.

2.4 Cognitive ability

Gill and Prowse (Reference Gill and Prowse2016) show that subjects who score higher in a Raven Test (Raven Reference Raven1960) choose numbers closer to equilibrium, earn more, and converge quicker to equilibrium in a three-player guessing game.Footnote ¹⁰ Since we are interested in studying the ability of subjects to solve the guessing game, we also tested the cognitive ability of our subjects. In particular, all subjects answered a Raven Test and played “Race-to-60,” a variant of the Race game (see e.g. Gneezy et al. Reference Gneezy, Rustichini and Vostroknutov2010; Levitt et al. Reference Levitt, List and Sally2011).Footnote ¹¹ The Raven Test is a multiple choice test in which subjects must pick an element that best completes a missing element in a matrix of geometrical shapes (see an example in Fig. 12 of "Electronic supplementary material Appendix C"). The score of this test has been found to correlate with measures of strategic sophistication and the ability of subjects to solve novel problems (Carpenter et al. Reference Carpenter, Just and Shell1990). It is increasingly used in economic research due to its simplicity and the lack of required technical skills.

Since logical induction is a central element of the guessing games, we test this ability with the “Race-to-60” game. In this game, each participant and a computerized player sequentially choose numbers between 1 and 10, which are added up. Whoever is first to push the sum to or above 60 wins the game. The game is solvable by backward induction, and the first mover can always win by picking numbers such that the common pool adds up to the sequence : [5; 16; 27; 38; 49; 60]. Subjects always move first and therefore, independent of the computer’s backward induction ability, can always win the game.Footnote ¹²

3 Results

A total of 80 subjects participated in this experiment. All subjects were recruited through ORSEE (Greiner Reference Greiner2015) and were mostly undergraduate students with a variety of backgrounds, ranging from anthropology to electrical engineering or architecture. Sessions lasted one and a half hours and were run at the Experimental Economics Laboratory of the Technische Universität Berlin. Subjects who had previously participated in guessing game experiments were not invited. The experiment was programmed and conducted using z-Tree (Fischbacher Reference Fischbacher2007). For detailed results on the cognitive ability tests, see "Electronic supplementary material Appendix A"

3.1 The one player guessing game

Fig. 1 Scatter plot of the choices made by each subject. Darker dots refer to subjects that fully solved the game (0,0). See Fig. 13 in "Electronic supplementary material Appendix C" for a zoom in plot depicting only the choices from [0,50]

In Fig. 1 we present the results of the 1PG in a scatter plot. Recall that in this case subjects have to pick two numbers, $(x_{i}, y_{i})$ ; the first number is depicted on the horizontal axis, the second on the vertical axis. The diagonal dashed line marks the points where a subject picked the same number for $x_{i}$ and $y_{i}$ . The solid circle indicates subjects who fully solved the game (0,0).

As can be seen, only a minority ( $\approx 31 %$ ) of subjects is able to fully solve the 1PG, i.e., pick zero for both numbers. In the remainder of this paper we will use this ability to fully solve the game as our primary measure of understanding of the structure of the guessing game.

Result 1

Only 31% of our subjects fully understand the one-player guessing game.

Another interesting observation in Fig. 1 is that subjects who play numbers closer together also play numbers closer to the origin. This is relevant, as in the 1PG there are two ways in which a subject (who has not fully solved the game) can improve her payoffs: by picking numbers closer to zero, and/or by picking numbers that are closer to each other. A Spearman test confirms the correlation between higher average of both choices and the distance between them (Spearman $ρ = 0.83$ , p value $< 0.001$ ). As subjects with high payoffs played both numbers that were close to each other, and to zero, one could interpret the payoffs of the 1PG as a measure of (partial) understanding of the structure of the guessing game. Therefore, we will use the payoffs of the 1PG as a secondary measure to complement to our primary measure of understanding, “Solved 1PG”/“Not solved 1PG”.

Fig. 2 Distribution of choices in the 2PG with and without 1PG experience

3.2 The two player guessing game

The left panel of Fig. 2 shows the distribution of choices in the 2PG, for subjects who have played the 1PG before. The distribution appears to be quite different from the typical distribution one sees with guessing game “first timers.” The mass of the distribution is close to zero with 50% of subjects playing Nash Equilibrium.Footnote ¹³ The mean is 13.47 and the median choice is 2. As mentioned in Sect. 2.3.1 we also collected data on 80 subjects who played the 2PG without previously taking part in the 1PG. Choices of these subjects are presented in the right panel of Fig. 2. While a relatively large number of subjects with no prior 1PG experience also play Nash Equilibrium (28.75%), the overall distribution of 2PG choices for those subjects is significantly shifted to the right compared to that of subjects with prior 1PG experience (Kolmogorov Smirnov test, p value $< 0.001$ , see Fig. 14 in "Electronic supplementary material Appendix C" for cumulative density plots). This shift results in a mean and median choice without 1PG experience of 27.8 and 26 respectively.

The difference in behavior between both groups could be the result of two phenomena: introspective learning from having played the 1PG (Weber Reference Weber2003), or a change in the beliefs of subjects that previously played the 1PG given that they are facing a more “experienced” pool of subjects (Agranov et al. Reference Agranov, Potamites, Schotter and Tergiman2012).Footnote ¹⁴ In Sect. 3.4.1 we show that shifts of beliefs are relatively small. Therefore, we attribute most of the difference in behavior to introspective learning. So, while most subjects are not able to fully solve the 1PG, there appears to be some learning that carries over to the 2PG.

3.3 Relationship between the 1PG and the 2PG

Figure 3 shows the decisions of subjects in the 2PG on the vertical axis, and their payoffs for the 1PG on the horizontal axis. Subjects who fully solved the 1PG (solid circles) mostly chose zero in the 2PG (24/25, 96%), and picked significantly lower numbers in the 2PG than subjects who did not fully solve the 1PG (Mann–Whitney U Test, p value $< 0.001$ ). In line with this, we also observe that subjects who earn higher payoffs in the 1PG play lower numbers in the 2PG (Spearman $ρ = - 0.745$ , p value $< 0.001$ )

Fig. 3 Distribution of choices in the 2PG (vertical axis) and payoff in the 1PG (horizontal axis)

Result 2

Subjects with a better understanding of the structure of the one-player guessing game play numbers closer to the Nash Equilibrium in the two-player guessing game.

But, is playing numbers near the Nash Equilibrium the best strategy in the 2PG? To answer this question we construct ${\bar{Π}}_{i}^{2 P G}$ . This variable represents the payoff that each subject i would have gotten had she played against the average choice of all other subjects j except herself (i.e., $j \neq i$ ). Formally ${\bar{Π}}_{i}^{2 P G}$ is defined as:

(6)

Figure 4 illustrates the relationship of ${\bar{Π}}_{i}^{2 P G}$ with both, the payoffs of the 1PG ( $Π_{i}^{1 P G}$ , left panel) and choice in the 2PG ( $z_{i}$ , right panel). Interestingly, subjects who fully solved the 1PG don’t have the highest ${\bar{Π}}_{i}^{2 P G}$ . This is because they play Nash Equilibrium, when payoffs would have been maximized by playing a number close to 9 as can be seen in the right plot. Overall, subjects who fully solved the game did not earn a significantly different payoff compared to subjects who did not fully solve the game (Mann–Whitney U test p value $= 0.465$ ).

Analyzing our secondary measure of understanding of the structure of the game ( $Π_{i}^{1 P G}$ ) reveals a more nuanced pattern: it appears that there is a non-monotonic relationship between understanding of the structure of the game and expected payoffs in the 2PG (see Fig. 15 in "Electronic supplementary material Appendix C" for a close-up of Fig. 4.). Regressing ${\bar{Π}}_{i}^{2 P G}$ on $Π_{i}^{1 P G}$ and ${(Π_{i}^{1 P G})}^{2}$ yields coefficients that are significantly positive and negative respectively. This gives statistical support to the fitted quadratic function in the left panel of Fig. 4, implying that increased understanding leads to increased expected payoffs, but that this relationship reverses for very high levels of understanding.

Fig. 4 Relationship of payoff in the 1PG ( $Π_{i}^{1 P G}$ ) and ${\bar{Π}}_{i}^{2 P G}$ (left panel). The line in the left panel is a fitted quadratic function. The right panel shows the relationship of choice in the 2PG ( $z_{i}$ ) and ${\bar{Π}}_{i}^{2 P G}$ . In both panels the darker dots indicate subjects who fully solved the 1PG

Table 1 Regression of ${\bar{Π}}_{i}^{2 P G}$ on the payoff in the 1PG ( $Π_{i}^{1 P G}$ ) and its square value ${(Π_{i}^{1 P G})}^{2}$

	${\bar{Π}}_{i}^{2 P G}$
$Π_{i}^{1 P G}$	$5.519 * * *$
$Π_{i}^{1 P G}$	(0.904)
${(Π_{i}^{1 P G})}^{2}$	$- 1.565 * * *$
${(Π_{i}^{1 P G})}^{2}$	(0.353)
Constant	$- 3.389 * * *$
Constant	(0.500)
$N$	80
Adj. $R^{2}$	0.580
Joint test p value	0.000

Standard errors in parentheses

*p < 0.05; **p < 0.01; ***p < 0.001

Result 3

The relationship between understanding of the structure of the one-player guessing game and payoffs in the two-player guessing game follows a non-monotonic pattern.

3.4 Subjective beliefs

Fig. 5 Relationship between the payoff in the 1PG ( $Π_{i}^{1 P G}$ ) and number of correct tokens deposited in the belief elicitation phase (left panel) and distribution of tokens across bins (right panel). Darker dots refer to subjects who fully solved the 1PG

On the left panel of Fig. 5, we plot the number of tokens subjects have placed correctly in the belief elicitation task against the payoff in the 1PG. Subjects who fully solved the 1PG placed a larger number of tokens correctly (Mann–Whitney U test, p value $= 0.001$ ).Footnote ¹⁵ This result is confirmed by the strong correlation that we observe between the 1PG payoff and the number tokens placed correctly (Spearman $ρ = 0.583$ , p value < 0.001). On the right panel of Fig. 5 we plot the distribution of tokens (horizontal axis) against the payoff in the 1PG (vertical axis). While subjects who did not fully solve the 1PG spread out their tokens across most of the strategy space, subjects who fully solved the 1PG expect their counterparts to play numbers closer to the Nash Equilibrium (Mann–Whitney U test, p value $< 0.001$ ). Again, the correlation between the distance of tokens to NE and payoffs in the 1PG confirms this result (Spearman $ρ = - 0.359$ with p value $= 0.001$ ) (Table 1).

To test how the accuracy of beliefs relates to the understanding of the structure of the game, we plot the mean of the belief distribution of each subject against their payoff in the 1PG (vertical axis) on the left panel of Fig. 6.Footnote ¹⁶ The vertical dotted line marks the mean choice across all subjects in the 2PG (13.63). The right panel of Fig. 6 plots the absolute distance of individual mean beliefs to mean 2PG play against earnings in the 1PG. Two things are clear from the graph: First, the mean beliefs of some subjects differ quite a bit from mean actual play in the 2PG. Second, subjects who fully solved the 1PG have a lower absolute difference of their mean beliefs and mean choice of all subjects in the 2PG (Mann–Whitney U test p value $= 0.030$ ). This result is supported using our secondary measure of understanding (Spearman $ρ = - 0.473$ with p value $< 0.001$ ).

Fig. 6 In the left panel we present the relationship between the payoff in the 1PG ( $Π_{i}^{1 P G}$ ) and the mean value of the distributed tokens (horizontal axis). The vertical dotted line marks the mean of all choices in the 2PG (which is 13.63). The right panel illustrates the relationship between the payoff in the 1PG ( $Π_{i}^{1 P G}$ , vertical axis) and the absolute distance between mean choice of subjects in the 2PG and the mean value of the distributed tokens

Result 4

Subjects with a better understanding of the structure of the one-player guessing game form more accurate beliefs about their counterparts’ choices in the two-player guessing game.

Fig. 7 Difference between actual choice and optimal choice conditional on beliefs ( $Δ z_{i}^{*}$ ) versus payoff in the 1PG ( $Π_{i}^{1 P G}$ )

Additionally, we analyze whether choices in the 2PG are best responses to the stated beliefs (i.e., the token distribution). To do so we compute the choice in the 2PG that would maximize the payoff of a subject conditional on her stated beliefs being correct:

(7)

\begin{matrix} z_{i}^{*} (B) = \underset{{\tilde{z}}_{i}}{arg max} \sum_{j = 1}^{20} \frac{b_{ij}}{19} (2 - 0.10 ({\tilde{z}}_{i} - \frac{2}{3} \frac{{\tilde{z}}_{i} + {\bar{b}}_{j}}{2})), \end{matrix}

where $z_{i}^{*} (B_{i})$ is the choice of subject i that maximizes her payoffs given her beliefs $B_{i} = (b_{i 1}, b_{i 2}, \dots, b_{i 20})$ , $b_{ij}$ is the number of tokens that subject i put in bin j, and ${\bar{b}}_{j}$ is the average value of the bin (so for example, for the first bin [0,4], ${\bar{b}}_{1} = 2$ , for the second [5,9], ${\bar{b}}_{2} = 7$ , etc.).Footnote ¹⁷ We then create an individual variable $Δ z_{i}^{*} = | z_{i} - z_{i}^{*} (B_{i}) |$ which is the absolute difference between actual choice of subject i in the 2PG minus the optimal choice conditional on her stated beliefs. Figure 7 illustrates the relation of $Δ z_{i}^{*}$ and the payoffs for the 1PG. It appears that subjects who fully solved the 1PG are better at best responding to their own beliefs and therefore have a lower $Δ z_{i}^{*}$ (Mann–Whitney U test p value $= 0.001$ ). This is confirmed by a significantly negative correlation between 1PG payoffs and $Δ z_{i}^{*}$ (Spearman $ρ = - 0.531$ , p value $< 0.001$ ). These results imply that better understanding of the structure of the guessing game improves the ability to best respond to own beliefs.

Result 5

Subjects with a better understanding of the structure of the one-player guessing game choose numbers closer to the best response of their beliefs in the two-player guessing game.

3.4.1 “What-if” beliefs

As there could be some influence of having played the 1PG on the beliefs in the 2PG, we asked 40 subjects to use 19 tokens to guess the choices of 19 subjects that had played the 2PG “a couple of weeks ago, without having previously played the 1PG”. We will refer to these distributions as “what-if” distributions, as opposed to the elicited distributions in the belief elicitation part of the experiment which we will refer to as “original” distributions.

Fig. 8 Distribution of “original” and “what-if” beliefs for the 2PG

Fig. 9 Payoff in the 1PG on the horizontal axis, and change in the mean between belief distributions ( $Δ B_{i}$ ) on the vertical axis. Any value of $Δ B_{i}$ above zero is a change in the mean away from the Nash Equilibrium

We plot the resulting aggregated distributions in Fig. 8. At first glance, the differences between what-if and original distributions appear to be small.Footnote ¹⁸ However, when comparing the means and variances of subjects’ individual distributions, we find that both mean and variance are significantly higher in the what-if distributions (Wilcoxon matched-pairs signed-ranks test, p value $= 0.039$ and p value $= 0.009$ for the difference in means and variance respectively). This indicates that subjects adjust their beliefs depending on the population they face.Footnote ¹⁹

To get a better understanding of how subjects change their beliefs in the 2PG when faced with different populations, we plot the difference in means between the what-if and original distributions ( $Δ B_{i}$ ) against the individual payoff from the 1PG ( $Π^{1 P G}$ ) in Fig. 9. In this figure, any value above the horizontal dotted line indicates a shift of the what-if distribution, with respect to the original one, away from the NE.

As can be seen in Fig. 9, whenever subjects who fully solved the 1PG adjust their beliefs, they seem to do so in the right direction (i.e., away from the NE). However, we cannot reject the null hypothesis of no differences in the distribution means ( $Δ B_{i}$ ) between subjects who fully solved the 1PG and those who did not (Mann–Whitney U Test, p value $= 0.683$ ). Using our secondary measure of understanding, we find a significant correlation between payoffs in the 1PG and $Δ B_{i}$ (Pearson $ρ = 0.308$ , p value $= 0.053$ ), but we cannot reject the hypothesis that this relationship is not monotonic (Spearman $ρ = 0.262$ , p value $= 0.101$ ). Therefore, if we interpret a higher payoff in the 1PG as a better understanding of the structure of the game, then it would appear that a better understanding is associated with better adjustment of beliefs in response to facing an inexperienced population.Footnote ²⁰

Result 6

There is weak evidence suggesting that subjects with a better understanding of the structure of the one-player guessing game are better at adjusting their beliefs in response to facing an inexperienced population in the two-player guessing game.

4 Conclusion

In laboratory experiments, subjects often deviate from equilibrium play. These deviations can be the result of either subjects not understanding the structure of the underlying game or from not forming the correct beliefs about the strategies of their counterparts. One strand of the literature has tried to explain these deviations as errors in belief formation (e.g., Costa-Gomes and Crawford Reference Costa-Gomes and Crawford2006; Ho et al. Reference Ho, Camerer and Weigelt1998). Yet, some recent research shows that subjects might not fully understand the experimental environment.

In this paper we use an individual decision-making task that allows us to uncouple subjects’ understanding of the game from their belief formation, and thus to establish to what extent understanding of the structure of the game contributes to non-equilibrium play in our experiment.

We find that a majority of subjects fail to fully understand the structure of the game. Moreover, subjects who understand the structure of the game play closer to the Nash Equilibrium, are better at best-responding to their own beliefs, and seem to modify their beliefs (correctly) depending on the population they are facing. This result is inconsistent with models of the Level-K type (e.g., Costa-Gomes and Crawford Reference Costa-Gomes and Crawford2006) which assume that agents fully understand the structure of the game and only play out of equilibrium due to flaws in belief formation. Our findings suggest, otherwise, that out of equilibrium play is not only the result of a limited ability to form correct beliefs, but that it also results from the inability of subjects to fully understand the game’s structure.

In light of these results, we believe the 1PG could be a useful “quick and easy” test for researchers interested in, or aiming to control for, understanding of the structure of guessing games. More generally, we believe that the reduction of strategic games into one-player forms could be a useful tool in the analysis of other games too.Footnote ²¹ Such transformation would allow researchers to study the degree of understanding that subjects have of the structure of the game, and to control for any deviations from Nash equilibrium play independent of errors in belief formation.Footnote ²²

Finally, a potential extension of this experiment could be to vary the number of selves subjects play in the 1PG and compare play to standard guessing games populated by the same number of strategic players. On the one hand, increasing the number of selves and strategic players increases the complexity of the game, and may therefore make understanding the structure of the game and belief formation more difficult. On the other hand, increasing the number of selves and strategic players may lead subjects to better understand the unraveling mechanic of the game. We leave it to future extensions of this work to test whether the general findings in this paper would also hold under such conditions.

Acknowledgements

Open Access funding provided by Projekt DEAL. The authors would like to thank David Freeman, Dan Friedman, Yoram Halevy, Frank Heinemann, Rosemarie Nagel, Luba Petersen, Pedro Rey Biel, Christian Seel, and especially Marina Agranov and Yves Breitmoser for their comments. Additionally, the authors acknowledge financial support from the Deutsche Forschungsgemeinschaft (DFG) through CRC 649 “Economic Risk”. The first author thanks the Deutsche Forschungsgemeinschaft (DFG) for funding through the CRC TRR 190 “Rationality and Competition”.

Footnotes

Electronic supplementary material The online version of this article (https://doi.org/10.1007/s10683-020-09642-2) contains supplementary material, which is available to authorized users.

¹ The latter feature would be missing when simply telling subjects what the other player of a two-player guessing game will play.

² For convenience we will henceforth refer to the one-player guessing game as a game, even though strictly speaking it is not one.

³ The idea of letting people play games with themselves, in order to test understanding of the structure of a game is not entirely new. See for instance Blume and Gneezy (Reference Blume and Gneezy2010) or Petersen and Winn (Reference Petersen and Winn2014).

⁴ A penalty which increases in the distance to the target guess is crucial here, as with the more commonly utilized tournament incentives one of the two selves would always win, and the game would become trivial. Additionally, this payoff structure allows us to study the best response of subjects to their beliefs in the 2PG as 0 is not anymore a unique weakly dominant strategy. See Sects. 2.2 and 3.4 for more details.

⁵ Note that we limited the minimum payoff to zero in order to avoid potential losses for the subject.

⁶ See Nagel et al. (Reference Nagel, Bühren and Frank2016) for a lengthier discussion on the implications that the different payoff structures have in the 2PG.

⁷ See for instance Hommes et al. (Reference Hommes, Sonnemans, Tuinstra and Van de Velden2004), Costa-Gomes and Crawford (Reference Costa-Gomes and Crawford2006), Mauersberger and Nagel (Reference Mauersberger, Nagel, Hommes and LeBaron2018), or Duffy (Reference Duffy, Kagel and Roth2016) who reviews some of these experiments among other types of macroeconomic experiments.

⁸ There is some discussion about how to best incentivize subjects to state their true beliefs. In particular, there is mixed evidence on whether incentive compatibility matters or not (Schotter and Trevino Reference Schotter and Trevino2014). Methods to elicit beliefs beyond first moments, such as ours, are typically difficult for subjects to understand. Using a non-linear scoring rule would introduce an additional level of complexity. Hence, while not incentive compatible for risk neutral subjects, we opted for this approach because we believe it provides the best compromise between tractability for subjects and incentivization efficacy.

⁹ In the instructions we told subjects they had to guess the choices made by 19 subjects who had played the 2PG a couple of weeks ago, without having previously played the 1PG. See "Electronic supplementary material Appendix D" for the instructions read to subjects.

¹⁰ Note however that Georganas et al. (Reference Georganas, Healy and Weber2015) find only limited evidence of correlations between a variety of cognitive ability tasks and level-k reasoning ability.

¹¹ While in Gill and Prowse (Reference Gill and Prowse2016) subjects go through all 60 matrices of the original Raven Test, in our case subjects just took part in three of the hardest blocks of 12 matrices.

¹² As in Bosch-Rosa et al. (Reference Bosch-Rosa, Meissner and Bosch-Domènech2018) the backward induction ability of the computer increased with the rounds of the game. In the first round the computer could do only one step of backward induction, in the second it could do two, in the third three, and so on.

¹³ Note that in the 2PG there are two Pareto ranked Nash Equilibria in pure strategies: (1) both players choose zero and (2) both players choose one. The latter is because the best response to the counterpart picking 1 would be picking 1/2. Since in our experiment subjects can only chose integers, both 0 and 1 are best responses. Such Pareto-ranked equilibria will always exist in laboratory experiments as it is impossible to implement a truly continuous choice.

¹⁴ Arguably, some learning could be due to a switch to simple heuristics (e.g. “play zero”) rather than real learning about the structure of the game. Using games that reverse the end of the strategy space containing the equilibrium such as Ho et al. (Reference Ho, Camerer and Weigelt1998) or Rick and Weber (Reference Rick and Weber2010), or games with interior equilibria as in Costa-Gomes and Crawford (Reference Costa-Gomes and Crawford2006) would allow to distinguish between these different kinds of learning. We thank the editor for this comment.

¹⁵ Again, in order to avoid noise due to session specific outliers, we compute the number of correct tokens by comparing individual beliefs to the distribution of the 2PG choices we collected across all sessions. For more details see "Electronic supplementary material Appendix B".

¹⁶ The individual means of the belief distributions are calculated as m i = ∑ j = 1 20 b ij 19 b ¯ j , where b ij is the number of tokens that subject i put in bin j, and b ¯ j is the average value of the bin (so for example, for the first bin [0,4], b ¯ 1 = 2 , for the second [5,9], b ¯ 2 = 7 , etc.).

¹⁷ We pick this instead of the lowest value of the bin, because it is a more stringent test to our “high ability” subjects.

¹⁸ The mean bin for the original distributions is 5.18, while the mean of the what-if distributions is 5.77 (bin number 5 contained values 20–24 while bin 6 contained values 25–29.).

¹⁹ See Fig. 16 of "Electronic supplementary material Appendix C" for a graphical representation of these results.

²⁰ An important consideration in within-subject designs are experimenter demand effects. In our setup is not clear in what direction such effects may work. One may conjecture that, if subjects aim to appear rational, the interaction of experimenter demand and the extra opportunity for introspective learning may shift the what-if distribution closer to the Nash Equilibrium. As can be seen in Fig. 9 and see Fig. 16 of "Electronic supplementary material Appendix C", the shift in distributions seems to be in the opposite direction. This makes us somewhat optimistic that experimenter demand was either small or non-existent.

²¹ For instance in ring-networks (e.g., Kneeland Reference Kneeland2015) or minimum effort games (e.g., Van Huyck et al. Reference Van Huyck, Battalio and Beil1990).

²² We would like to thank an anonymous referee for suggesting this wider interpretation of our results.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Agranov, M, Caplin, A, & Tergiman, C (2015). Naive play and the process of choice in guessing games. Journal of the Economic Science Association, 1, 146–157. 10.1007/s40881-015-0003-5CrossRef Google Scholar

Agranov, M, Potamites, E, Schotter, A, & Tergiman, C (2012). Beliefs and endogenous cognitive levels: An experimental study. Games and Economic Behavior, 75, 449–463. 10.1016/j.geb.2012.02.002CrossRef Google Scholar

Blume, A, & Gneezy, U (2010). Cognitive forward induction and coordination without common knowledge: An experimental study. Games and Economic Behavior, 68, 488–511. 10.1016/j.geb.2009.07.011CrossRef Google Scholar

Bosch-Rosa, C, Meissner, T, & Bosch-Domènech, A (2018). Cognitive bubbles. Experimental Economics, 21, 132–153. 10.1007/s10683-017-9529-0CrossRef Google Scholar

Camerer, C (2003). Behavioral game theory: Experiments in strategic interaction, Princeton: Princeton University Press.Google Scholar

Carpenter, PA, Just, MA, & Shell, P (1990). What one intelligence test measures: A theoretical account of the processing in the Raven Progressive Matrices Test. Psychological Review, 97, 404 10.1037/0033-295X.97.3.404CrossRef Google Scholar

Chou, E, McConnell, M, Nagel, R, & Plott, CR (2009). The control of game form recognition in experiments: Understanding dominant strategy failures in a simple two person “guessing” game. Experimental Economics, 12, 159–179. 10.1007/s10683-008-9206-4CrossRef Google Scholar

Costa-Gomes, MA, & Crawford, VP (2006). Cognition and behavior in two-person guessing games: An experimental study. American Economic Review, 96, 1737–1768. 10.1257/aer.96.5.1737CrossRef Google Scholar

Costa-Gomes, MA, & Weizsäcker, G (2008). Stated beliefs and play in normal-form games. The Review of Economic Studies, 75, 729–762. 10.1111/j.1467-937X.2008.00498.xCrossRef Google Scholar

Duffy, J Kagel, JH, & Roth, AE (2016). Macroeconomics: A survey of laboratory research. The handbook of experimental economics, Princeton: Princeton University Press.Google Scholar

Fischbacher, U (2007). z-Tree: Zurich toolbox for ready-made economic experiments. Experimental Economics, 10, 171–178. 10.1007/s10683-006-9159-4CrossRef Google Scholar

Fragiadakis, D. E., Knoepfle, D. T., & Niederle, M. (2016). Who is strategic? Working paper.Google Scholar

Georganas, S, Healy, PJ, & Weber, RA (2015). On the persistence of strategic sophistication. Journal of Economic Theory, 159, 369–400. 10.1016/j.jet.2015.07.012CrossRef Google Scholar

Gill, D, & Prowse, V (2016). Cognitive ability, character skills, and learning to play equilibrium: A level-k analysis. Journal of Political Economy, 124, 1619–1676. 10.1086/688849CrossRef Google Scholar

Gneezy, U, Rustichini, A, & Vostroknutov, A (2010). Experience and insight in the race game. Journal of Economic Behavior & Organization, 75, 144–155. 10.1016/j.jebo.2010.04.005CrossRef Google Scholar

Greiner, B (2015). Subject pool recruitment procedures: Organizing experiments with ORSEE. Journal of the Economic Science Association, 1, 114–125. 10.1007/s40881-015-0004-4CrossRef Google Scholar

Grosskopf, B, & Nagel, R (2008). The two-person beauty contest. Games and Economic Behavior, 62, 93–99. 10.1016/j.geb.2007.03.004CrossRef Google Scholar

Güth, W, Kocher, M, & Sutter, M (2002). Experimental “beauty contests” with homogeneous and heterogeneous players and with interior and boundary equilibria. Economics Letters, 74, 219–228. 10.1016/S0165-1765(01)00544-4CrossRef Google Scholar

Ho, T-H, Camerer, C, & Weigelt, K (1998). Iterated dominance and iterated best response in experimental “p-beauty contests”. The American Economic Review, 88, 947–969.Google Scholar

Hommes, C, Sonnemans, J, Tuinstra, J, & Van de Velden, H (2004). Coordination of expectations in asset pricing experiments. The Review of Financial Studies, 18, 955–980. 10.1093/rfs/hhi003CrossRef Google Scholar

Kneeland, T (2015). Identifying higher-order rationality. Econometrica, 83, 2065–2079. 10.3982/ECTA11983CrossRef Google Scholar

Lahav, Y (2015). Eliciting beliefs in beauty contest experiments. Economics Letters, 137, 45–49. 10.1016/j.econlet.2015.08.024CrossRef Google Scholar

Levitt, SD, List, JA, & Sally, E (2011). Checkmate: Exploring backward induction among chess players. The American Economic Review, 101, 975–990. 10.1257/aer.101.2.975CrossRef Google Scholar

Mauersberger, F, Nagel, R Hommes, C, & LeBaron, B (2018). Levels of reasoning in Keynesian Beauty Contests: A generative framework. Handbook of computational economics, Amsterdam: Elsevier 541–634.Google Scholar

Nagel, R (1995). Unraveling in guessing games: An experimental study. The American Economic Review, 85, 1313–1326.Google Scholar

Nagel, R, Bühren, C, & Frank, B (2016). Inspired and inspiring: Hervé Moulin and the discovery of the beauty contest game. Mathematical Social Sciences, 90, 191–207. 10.1016/j.mathsocsci.2016.09.001CrossRef Google Scholar

Petersen, L, & Winn, A (2014). Does money illusion matter? Comment. American Economic Review, 104, 1047–62 10.1257/aer.104.3.1047CrossRef Google Scholar

Raven, JC (1960). Guide to the standard progressive matrices: Sets A, B, C, D and E, London: HK Lewis.Google Scholar

Rey-Biel, P (2009). Equilibrium play and best response to (stated) beliefs in normal form games. Games and Economic Behavior, 65, 572–585. 10.1016/j.geb.2008.03.003CrossRef Google Scholar

Rick, S, & Weber, RA (2010). Meaningful learning and transfer of learning in games played repeatedly without feedback. Games and Economic Behavior, 68, 716–730. 10.1016/j.geb.2009.10.004CrossRef Google Scholar

Schotter, A, & Trevino, I (2014). Belief elicitation in the laboratory. Annual Review of Economics, 6, 103–128. 10.1146/annurev-economics-080213-040927CrossRef Google Scholar

Van Huyck, JB, Battalio, RC, & Beil, RO (1990). Tacit coordination games, strategic uncertainty, and coordination failure. The American Economic Review, 80, 234–248.Google Scholar

Weber, RA (2003). Learning with no feedback in a competitive guessing game. Games and Economic Behavior, 44, 134–144. 10.1016/S0899-8256(03)00002-2CrossRef Google Scholar

Weizsäcker, G (2003). Ignoring the rationality of others: Evidence from experimental normal-form games. Games and Economic Behavior, 44, 145–171. 10.1016/S0899-8256(03)00017-4CrossRef Google Scholar

Fig. 2 Distribution of choices in the 2PG with and without 1PG experience

Fig. 3 Distribution of choices in the 2PG (vertical axis) and payoff in the 1PG (horizontal axis)

Fig. 4 Relationship of payoff in the 1PG (Πi1PG) and Π¯i2PG (left panel). The line in the left panel is a fitted quadratic function. The right panel shows the relationship of choice in the 2PG (zi) and Π¯i2PG. In both panels the darker dots indicate subjects who fully solved the 1PG

Table 1 Regression of Π¯i2PG on the payoff in the 1PG (Πi1PG) and its square value (Πi1PG)2

Fig. 5 Relationship between the payoff in the 1PG (Πi1PG) and number of correct tokens deposited in the belief elicitation phase (left panel) and distribution of tokens across bins (right panel). Darker dots refer to subjects who fully solved the 1PG

Fig. 6 In the left panel we present the relationship between the payoff in the 1PG (Πi1PG) and the mean value of the distributed tokens (horizontal axis). The vertical dotted line marks the mean of all choices in the 2PG (which is 13.63). The right panel illustrates the relationship between the payoff in the 1PG (Πi1PG, vertical axis) and the absolute distance between mean choice of subjects in the 2PG and the mean value of the distributed tokens

Fig. 7 Difference between actual choice and optimal choice conditional on beliefs (Δzi∗) versus payoff in the 1PG (Πi1PG)

Fig. 8 Distribution of “original” and “what-if” beliefs for the 2PG

Fig. 9 Payoff in the 1PG on the horizontal axis, and change in the mean between belief distributions (ΔBi) on the vertical axis. Any value of ΔBi above zero is a change in the mean away from the Nash Equilibrium

Bosch-Rosa and Meissner supplementary material

File 1.2 MB

Article contents

The one player guessing game: a diagnosis on the relationship between equilibrium play, beliefs, and best responses

Abstract

Keywords

JEL classification

1 Introduction

2 Experimental design

2.1 The one-player guessing game (1PG)

2.2 The two-player guessing game (2PG)

2.3 Belief elicitation

2.3.1 “What if” belief elicitation

2.4 Cognitive ability

3 Results

3.1 The one player guessing game

Result 1

3.2 The two player guessing game

3.3 Relationship between the 1PG and the 2PG

Result 2

Result 3

3.4 Subjective beliefs

Result 4

Result 5

3.4.1 “What-if” beliefs

Result 6

4 Conclusion

Acknowledgements

Footnotes

References

Bosch-Rosa and Meissner supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests