Mind the Gap: Boltzmannian versus Gibbsian Equilibrium

Charlotte Werndl; Roman Frigg

doi:10.1086/694088

Mind the Gap: Boltzmannian versus Gibbsian Equilibrium

Published online by Cambridge University Press: 01 January 2022

Charlotte Werndl and

Roman Frigg

Article contents

Abstract
Introduction
Equilibrium in Gibbs and Boltzmann
Example 1: The Baker’s Gas
Example 2: The Ising Model
Existence under Different Conditions
When Boltzmann and Gibbs Agree
Conclusion
Footnotes
References

Rights & Permissions

Abstract

There are two main theoretical frameworks in statistical mechanics, one associated with Boltzmann and the other with Gibbs. Despite their well-known differences, there is a prevailing view that equilibrium values calculated in both frameworks coincide. We show that this is wrong. There are important cases in which the Boltzmannian and Gibbsian equilibrium concepts yield different outcomes. Furthermore, the conditions under which equilibriums exists are different for Gibbsian and Boltzmannian statistical mechanics. There are, however, special circumstances under which it is true that the equilibrium values coincide. We prove a new theorem providing sufficient conditions for this to be the case.

Type: Physical Sciences
Information: Philosophy of Science , Volume 84 , Issue 5 , December 2017 , pp. 1289 - 1302

DOI: https://doi.org/10.1086/694088 [Opens in a new window]
Copyright: Copyright © The Philosophy of Science Association

1. Introduction

There are two main theoretical frameworks in statistical mechanics (SM), one associated with Boltzmann and the other with Gibbs. One of the crucial differences is the characterization of equilibrium. While in the Boltzmannian framework equilibrium corresponds to the largest macroregion, the Gibbsian framework associates equilibrium with the stationary probability distribution of maximum entropy. The Boltzmannian picture is usually accepted as correct when it comes to answering foundational questions, in particular in connection with the approach to equilibrium. At the same time the Gibbsian framework is dismissed as “thoroughly misguided” (Goldstein Reference Goldstein, Bricmont, Dürr, Galavotti, Ghirardi, Petruccione and Zangh2001, 39) and stands accused of introducing theoretical instruments that we “neither have nor … need” (Lebowitz Reference Lebowitz1993, 38). Yet, the Gibbsian framework is the undisputed workhorse of the practitioner.

This would be not be particularly worrisome if the two formalisms were equivalent or intertranslatable as, for instance, the Schrödinger and the Heisenberg picture in quantum mechanics. But they are not. Not only do they disagree on how equilibrium is conceptualized; they do not even study the same entities. While the Boltzmannian framework investigates individual systems, the object of study in the Gibbs approach are ensembles, and there is no obvious way to translate results from one framework into the other. This creates an awkward situation: how can we think that the Boltzmannian framework gives the right foundational account of SM and at the same time rely on the Gibbsian framework for calculations if the two accounts are in fact at odds with each other?

A common response is to play down the severity of the problem with the argument that the two frameworks lead to the same results. One can, so the argument goes, use the Gibbs formalism as an effective tool for the calculation of equilibrium values of a wide spectrum of physical quantities because these coincide with the values that would come out of the Boltzmann machinery if one was able to make the calculations. However, the reasoning behind this assertion remains unclear. Either it is simply asserted as an obvious truism (Davey [Reference Davey2009] and Wallace quoted in Werndl and Frigg [Reference Werndl and Friggforthcominga]), or arguments are given only for special cases (Malament and Zabell Reference Malament and Zabell1980; Lavis Reference Lavis2005). So the claim that Gibbsian and Boltzmannian equilibrium values generally coincide has the status of an article of faith. Faith need not be wrong, but it can be. The project for this article is to investigate whether the purported equivalence of results really holds.

Our answer is negative. If understood as a general proposition, the claim is provably wrong. After briefly introducing the Boltzmannian and the Gibbsian notions of equilibrium (sec. 2), we present two examples in which Boltzmannian and Gibbsian equilibrium values come apart: the baker’s gas (sec. 3) and the Ising model (sec. 4). The differences between the two frameworks have gone unnoticed partly because Boltzmannian and Gibbsian calculations are seldom carried out on the same systems. We make a first step toward rectifying this situation by discussing our examples from both theoretical vantage points, which makes visible how the two frameworks differ. Things get worse still. Not only do equilibrium values fail to coincide; equilibriums do not necessarily exist under the same conditions. There are systems that have a Boltzmannian equilibrium but fail to have a Gibbsian equilibrium and vice versa (sec. 5). This raises the question whether there is even a grain of truth in common wisdom, and the good news is that there is. We prove a new theorem establishing that under certain special circumstances the results of the two formalisms indeed coincide (sec. 6). We describe these circumstances in some detail and give examples. We end by offering some conclusions (sec. 7).

2. Equilibrium in Gibbs and Boltzmann

In this section we briefly introduce the Boltzmannian and the Gibbsian notions of equilibrium (for details, see Uffink Reference Uffink, Butterfield and Earman2007; Frigg Reference Frigg and Rickles2008). Throughout the article we consider systems with a phase space X. Let μ_X denote the probability measure on X. Further, let T_t(x) denote the state of the system after t time units that starts in x. The dynamics T_t is usually deterministic. Sometimes, for instance, in the Ising model, dynamics are considered that are stochastic processes {Z_t}. For ease of presentation we state general definitions and results for the deterministic case, but all definitions have stochastic equivalents and the results equally also hold in the stochastic case (see Werndl and Frigg [Reference Werndl, Frigg, Massimi, Romeijn and Schurz2017] for a discussion of stochastic systems).

In Gibbsian SM the object of study is an ensemble, an infinite collection of independent systems that are all governed by the same equations but are in different states. The ensemble is described by a probability density ρ(x, t), $x \in X$ , reflecting the probability of finding the state of a system chosen at random from the ensemble in a certain part of X at time t. Physical observables are associated with real-valued functions $f : X \times t \to ℝ$ . The phase average of such a function is

\begin{matrix} (1) & \bar{f} (t) = \int_{X} f (x, t) ρ (x, t) dx . \end{matrix}

According to the standard understanding of the formalism, phase averages are observed in experiments. The Gibbs entropy of a distribution is

\begin{matrix} (2) & S_{G} [ρ] = - k \int_{X} ρ (x, t) \log [ρ (x, t)] dx, \end{matrix}

where k is Boltzmann’s constant. A distribution ρ(x, t) is stationary if it does not depend on time: $ρ (x, t) = ρ (x)$ for all t. In Gibbsian SM equilibrium is the property of an ensemble. The ensemble is in equilibrium if the distribution is stationary and has maximum entropy given the constraints imposed on the system. The most common equilibrium distributions are the microcanonical, canonical, and grand-canonical distributions (Uffink Reference Uffink, Butterfield and Earman2007).

It is customary to begin a presentation of Boltzmannian SM with the combinatorial argument. However, it is now recognized that combinatorial considerations do not provide a good general definition of Boltzmannian equilibrium (Uffink Reference Uffink, Butterfield and Earman2007), and we therefore work with our own alternative definition (Werndl and Frigg Reference Werndl and Frigg2015a, Reference Werndl and Frigg2015b).Footnote ¹ Consider the same dynamical system as above and assume that the measure μ_X is stationary. At the macrolevel the system is characterized by a set of macrovariables {v ₁, …, v_l} $(l \in ℕ)$ . They are measurable functions $v_{i} : X \to ℝ$ , associating a value with each $x \in X$ . We use capital letters V_i to denote the values of v_i. A particular set of values {v ₁, …, v_l} defines a macrostate $M_{V_{1}}, \dots,_{V_{l}}$ .Footnote ² The region of phase space corresponding to the macrostate $M_{V_{1}}, \dots,_{V_{l}}$ , its macroregion, is denoted by $X_{M_{V_{1}}, \dots,_{V_{l}}}$ .

The equilibrium macrostate is defined as the state in which a system spends most of its time. Let LF_R be the fraction of time a system spends in region $R \in X$ in the long run:

\begin{matrix} (3) & {LF}_{R} (x) = \lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} 1_{A} (T_{τ} (x)) d τ, \end{matrix}

where 1_A(x) is the characteristic function of R: $1_{A} (x) = 1$ for $x \in R$ and 0 otherwise. The notion of ‘most’ can be read in two different ways, leading to two different notions of equilibrium. The first introduces a lower bound of 1/2 for the fraction of time spent in equilibrium, leading to the notion of an α-ε-equilibrium:

Let α be a real number in the interval (1/2, 1], and let ε be a very small positive real number.Footnote ³ If there is a macrostate $M_{V_{1}^{*}}, \dots,_{V_{l}^{*}}$ satisfying the following condition, then it is the α-ε-equilibrium state of S. There exists a set $Y \subseteq X$ such that $μ_{X} (Y) \geq 1 - ε$ , and all initial states $x \in Y$ satisfy ${LF}_{X_{M_{V_{1}^{*}, \dots, V_{l}^{*}}}} (x) \geq α$ .

The second reading takes ‘most’ to refer to the fact that the system spends more time in equilibrium than in any other state (this can be less that 50% of its time). This provides the γ-ε-equilibrium:

Let γ be a real number in (0,1] and let ε be a small positive real number ( $ε < γ$ ). If there is a macrostate $M_{V_{1}^{*}}, \dots,_{V_{l}^{*}}$ satisfying the following condition, it is the γ-ε-equilibrium state of S. There exists a set $Y \subseteq X$ such that $μ_{X} (Y) \geq 1 - ε$ and for all initial conditions $x \in Y$ : ${LF}_{X_{M_{V_{1}^{*}, \dots, V_{l}^{*}}}} (x) \geq {LF}_{X_{M}} (x) + γ$ for all macrostates $M \neq M_{V_{1}^{*}}, \dots,_{V_{l}^{*}}$ .

It is obvious that on both notions of equilibrium the value associated with the equilibrium macrostate is the observed value in equilibrium. It can be proven that equilibrium states thus defined are the largest states in the system in the following sense: their measure is larger than $α (1 - ε) > 1 / 2$ for an α-ε-equilibrium, and their measure is $γ - ε$ larger than the measure of any other macroregion for a γ-ε-equilibrium.Footnote ⁴

3. Example 1: The Baker’s Gas

The baker’s gas consists of N identical particles that evolve independently according to the baker’s transformation (Lavis Reference Lavis2005). Its microstates are of the form $x = (b_{1}, c_{1}, \dots, b_{N}, c_{N})$ , where $b_{i} \in [0, 1]$ is the momentum and $c_{i} \in [0, 1]$ is the position coordinate of the ith particle. The system’s phase space therefore is $X = {[0, 1]}^{2 N}$ , which is endowed with the uniform probability measure μ_X (the 2N-dimensional Lebesgue measure). Time is discrete, and the evolution to the next time step is given by applying the baker’s transformation to each coordinate: $x = (\dots b_{i}, c_{i} \dots)$ evolves into $Λ (x) = (\dots θ (b_{i}, c_{i}) \dots)$ , where

\begin{matrix} (4) & θ (b_{i}, c_{i}) = 2 b_{i}, \frac{c_{i}}{2} if 0 \leq b_{i} \leq \frac{1}{2} and 2 b_{i} - 1, \frac{c_{i} + 1}{2} otherwise . \end{matrix}

Let us begin with the Boltzmannian treatment of the baker’s gas. Here μ_X is the stationary measure. We now use combinatorial considerations to construct the system’s macrostate. Consider a partition of the unit square (the phase space for one particle) into cells of equal size δω whose dividing lines run parallel to the position and momentum axes. This results in a finite partition $Ω_{bg} ≔ {ω_{1}^{bg}, \dots, ω_{k}^{bg}}$ , $k \in ℕ$ . The coarse-grained microstate of a particle is the cell in which a particle’s state lies. An arrangement is given by a specification of the coarse-grained microstates of all the particles. A distribution is a specification of how many particles’ states lie in a given cell. Consider the distribution $D_{bg} = (N_{1}, N_{2}, \dots, N_{k})$ , where N_i is the number of particles in cell ω_i. The number G(D_bg) of arrangements that lead to the same distribution D_bg is $G (D_{bg}) = N! / N_{1}! N_{2}! \dots, N_{k}!$ .

One can now define a partition on X by grouping together in one cell all points that have the same distribution. It is easy to see that the cell X_u corresponding to the uniform distribution, that is, where $N_{i} = N / k$ ,Footnote ⁵ is larger than any other cell. We now introduce a macrovariable V as follows: $V (x) = 0$ for $x \in X_{u}$ , and for all other cells of the partition V(x) takes values between 10⁶ and 10⁸ so that no two cells have the same value. The baker’s gas is ergodic (Lavis Reference Lavis2005) and, hence, spends more time in X_u than in any other cell. Therefore, the long-run fraction of time for which the value of V is 0 is larger than the long-run fraction for any other value. Thus, the macrostate defined by $V = 0$ is a γ-0-equilibrium, and $V = 0$ is the Boltzmannian equilibrium value.Footnote ⁶

Let us now turn to the Gibbsian treatment of the baker’s gas. The stationary maximum entropy distribution is the uniform distribution $ρ (x) = 1$ . The phase space average $\bar{V}$ for the macrovariable V will be greater than $(1 - μ_{X} (X_{u})) \times 10^{6}$ . Lavis (Reference Lavis2005) has shown that $μ_{X} (X_{u}) \approx 0.47$ (for large N), and hence $0.53 \times 10^{6} = 530, 000$ is a lower bound for $\bar{V}$ .

So we find Boltzmannian and Gibbsian equilibrium values that are very different. The macrovariable of this system is admittedly contrived, so one might argue that the problem does not arise in practice. The grain of truth in this remark is that much depends on the choice of the macrovariable, and for sufficiently restrictive classes of macrovariables the problem can indeed be avoided (see sec. 6 for a characterization of such classes). However, there are real physical systems that do not fall into these classes. Thus, the relevant contrast is not between ‘mathematical contrivance’ and ‘sensible physics’. The example we discuss in the next section is a case in point.

4. Example 2: The Ising Model

The two-dimensional Ising model is an important system in SM. We here consider a version of the model with nearest-neighbor interactions in the absence of an external field. Despite being only two-dimensional, this model provides a realistic description of crystals such as K₂NF₄ and RB₂MnF₄, which have strong horizontal and weak vertical interactions (Baxter Reference Baxter1982).

Consider a regular two-dimensional lattice with N grid points. The lattice is assumed to lie on a two-dimensional torus, so that every grid point has exactly four nearest neighbors (allowing us to neglect border effects). At every grid point there is a spin pointing either up $(σ = 1)$ or down $(σ = −1)$ . The system’s microstate is given by

\begin{matrix} (5) & σ = {σ_{1}, \dots, σ_{N}}, \end{matrix}

and its Hamiltonian is

\begin{matrix} (6) & H (σ) = - J \sum_{nn} σ_{i} σ_{j} . \end{matrix}

The sum is over all nearest-neighbor pairs, and the constant $J \geq 0$ is the energy associated with the nearest-neighbor interaction (Baxter Reference Baxter1982).

We treat the model stochastically, and hence we first introduce probabilities. The probability distribution has the form of a Gibbsian distribution, but it is important to emphasise that at this point this is nothing more than a formal definition that is neither Gibbsian nor Boltzmannian. We begin with the partition function:

\begin{matrix} (7) & Z = \sum_{σ} e^{- β H (σ)} . \end{matrix}

The sum is taken over all possible configurations σ of the model, and $β = 1 / kT$ is a constant, where T is the temperature and k is Boltzmann’s constant. The probability of finding the system in a certain configuration σ is given by the canonical distribution:

\begin{matrix} (8) & P (σ) = \frac{e^{- β H (σ)}}{Z} . \end{matrix}

For large values of β (low temperature), the probabilities of the lower energy states are dominant. For small values of β (high temperature), the probability distribution is flattened out and all configurations are more or less equally likely (Baxter Reference Baxter1982; Cipra Reference Cipra1987).

The Boltzmannian treatment is as follows. The probability P(σ) is the stationary measure that defines the dynamics of the stochastic dynamical system of section 2. It can be shown that this dynamics is in fact an irreducible Markov chain, and irreducibility is the stochastic equivalent to ergodicity (Berger Reference Berger2001). As the relevant macrovariable we choose the internal energy:

\begin{matrix} (9) & E (σ) = \sum_{n n} - J σ_{i} σ_{j}, \end{matrix}

where the sum is over all nearest-neighbor interactions. Let $\bar{σ}$ be the microstate for which $σ_{i} = 1$ for all i, and let $\hat{σ}$ be the microstate for which $σ_{j} = −1$ for all j. Then, $A ≔ {\bar{σ}, \hat{σ}}$ is the macroregion for which the internal energy has value $E = −2 JN$ (because there are 2N nearest-neighbor pairs for a quadratic lattice with N sites for $N \geq 9$ ). As noted above, the larger the β, the larger the probabilities of the lower energy states. Since the dynamics is irreducible, for large enough β the system spends most of its time in A. Hence, A is a Boltzmannian γ-0-equilibrium, and the value of the internal energy in equilibrium is −2JN.

Let us now turn to the Gibbsian treatment. Here P(σ) is the stationary measure of maximum entropy. As above the value of E is lowest for $\bar{σ}$ and $\hat{σ}$ , namely, −2JN. Since all other microstates have higher energy, the Gibbsian phase average of E over all microstates is higher than −2JN (Baxter Reference Baxter1982; Cipra Reference Cipra1987; Secular Reference Secular2015).

It follows that the equilibrium value of the internal energy in a Boltzmann equilibrium (−2JN) is lower than in Gibbsian equilibrium. This difference arises because the macrovariable takes the lowest value in the largest macroregion and a higher value in all other macroregions.

The difference between the equilibrium values is not negligible. By way of illustration consider the case of a two-dimensional lattice with four grid points: $σ = (σ_{1}, σ_{2}, σ_{3}, σ_{4})$ , where (σ₁, σ₂) constitute the first row and (σ₃, σ₄) constitute the second row. Let $β = 1 / 2$ and $J = 1$ . There are four nearest-neighbor pairs. The Boltzmannian equilibrium macrostate is {(1, 1, 1, 1), (−1, −1, −1, −1)}, and the equilibrium value is −4. The value from phase space averaging can be obtained as follows. There are two states that have energy −4 and whose probability is $e^{4 / 2} / Z \approx 7.389 / Z$ : (1, 1, 1, 1), (−1, −1, −1, −1). There are 12 states that have energy 0 and whose probability is $e^{0 / 2} / Z = 1 / Z$ : (−1, 1, 1, 1), (1, −1, 1, 1), (1, 1, −1, 1), (1, 1, 1, −1), (−1, −1, −1, 1), (−1, −1, 1, −1), (−1, 1, −1, −1), (1, −1, −1, −1), (1, 1, −1, −1), (−1, −1, 1, 1), (1, −1, 1, −1), (−1, 1, −1, 1). Finally, there are two states that have energy 4 and whose probability is $e^{- 4 / 2} / Z \approx 0.1353 / Z$ : (−1, 1, −1, 1), (1, −1, 1, −1). Hence, the phase average is

\begin{matrix} (10) & \frac{−4 \times 2 \times 7.389 + 0 \times 12 \times 1 + 4 \times 2 \times 0.1353}{2 \times 7.389 + 12 \times 1 + 2 \times 0.1353} = −2.1454. \end{matrix}

So the Gibbsian equilibrium value (−2.1454) is almost half of the Boltzmannian equilibrium value (−4).

Nothing depends on the choice $N = 4$ . It remains the case for large N that Gibbsian phase averages will be different from the Boltzmannian equilibrium values. Indeed this situation remains unchanged even in the limit $N \to \infty$ , where one also finds that the Boltzmannian and Gibbsian values differ.Footnote ⁷

5. Existence under Different Conditions

We have shown that Gibbsian and Boltzmannian equilibrium values can fail to coincide. And things get worse: there are systems that have a Boltzmannian equilibrium but fail to have a Gibbsian equilibrium and vice versa.

5.1. Gibbs Does Not Imply Boltzmann

Consider the baker’s gas (sec. 3) with an even number of particles with one macrovariable V indicating whether there are more particles on the left side of the container than on the right side: V assumes value 1 if this is not the case, and it assumes value 0 if there are more particles on the right. The corresponding macrostates are M ₁ and M ₀. Both macroregions have the same measure, namely, 1/2. The dynamics is ergodic, and therefore the system spends half of the time in M ₀ and half of the time in M ₁. For this reason the system has no Boltzmannian equilibrium (neither of the α-ε nor of the γ-ε kind). However, there exists a Gibbs equilibrium: the uniform measure μ_B is the stationary distribution of maximum entropy.

The Ising model (sec. 4) provides another example. Consider again the above case with four sites and assume $β = 0$ .Footnote ⁸ Suppose that there are two macrostates: (i) one in which all the molecules point upward, all the molecules point downward, or there are an equal number of molecules that point upward and downward (eight microstates in total) and (ii) one in which this is not the case (again eight microstates in total). The usual dynamics considered for the Ising models are irreducible Markov chains. Recall that irreducibility is the stochastic equivalent to ergodicity, and hence, as in the previous example, there is no Boltzmannian equilibrium because both macroregions have the same measure 1/2. However, a Gibbsian equilibrium exists. The probability P is the canonical distribution, which is the stationary distribution of maximum entropy.

5.2. Boltzmann Does Not Imply Gibbs

Boltzmannian equilibrium is defined for systems with a stationary measure μ_X, and so there always exists at least one stationary distribution. Nevertheless, a stationary distribution of maximum entropy need not exist, as the following example shows.

Consider a system with phase space $[0, 2] \times [0, 1]$ . The dynamics T_t of the system consists of two ‘copies’ of the baker’s gas in the following sense: the restriction of T_t to $[0, 1] \times [0, 1]$ is the standard baker’s transformation (see sec. 3) and the restriction of T_t to $(1, 2] \times [0, 1]$ is the baker’s transformation with the position coordinate shifted by one unit to the right. The measure of the system is $1 / 3 \times 1_{[0, 1] \times [0, 1]} + 2 / 3 \times 1_{(1, 2] \times [0, 1]}$ , where 1_A is the characteristic function of A. Since the baker’s system is ergodic, clearly, the phase space consists of two components ( $[0, 1] \times [0, 1]$ and $(1, 2] \times [0, 1]$ ), restricted to each of which the dynamics is ergodic. Suppose that the macrovariable takes the value 0 if the position coordinate is in [1/4, 7/4], 100 if the position coordinate is in [0, 1/4), −100 if the position coordinate is in (7/4, 2]. Clearly, there exists a Boltzmannian equilibrium. The value 0 corresponds to a 0.75-0-equilibrium: since the system is ergodic on each component and the 0 macrostate takes up three-quarters on each component, three-quarters of the time the system takes the value 0.

Yet suppose that the class of distributions of interest are all densities over $[0, 2] \times [0, 1]$ of the form $α 1_{[0, 1] \times [0, 1]} + β 1_{(1, 2] \times [0, 1]}$ , where α, $β \geq 0$ , $α + β = 1$ , and the uniform distribution (the case $α = β = 1 / 2$ ) is excluded. Then there is no Gibbsian equilibrium. It is clear that all distributions are stationary (because μ_B of the baker’s gas is stationary), but by construction, there is no distribution of maximum entropy: the closer α and β are to 1/2, the higher the entropy (2). Yet there is no maximum since the uniform distribution ( $α = β = 1 / 2$ ) is not among the class of distributions under consideration.

6. When Boltzmann and Gibbs Agree

This section focuses on special cases in which the Boltzmannian and Gibbsian calculations agree. We first consider the main case discussed in the literature. Then we present a new theorem specifying a set of conditions under which Boltzmannian and Gibbsian values coincide. We show that many standard examples satisfy these conditions and that they fail in the cases discussed in previous sections.

Suppose that the relevant observable is such that it takes the value of the phase average nearly everywhere on phase space. Furthermore, assume that a Boltzmannian equilibrium exists. In such a situation it will be the case that the value of the observable in the Boltzmannian equilibrium macrostate is equal to the phase average. The question is under what circumstances something like this is the case. One such situation is described by Khinchin (Reference Khinchin1949). He argued that phase functions have to satisfy strong symmetry requirements and therefore ought to have small dispersion for systems with a large number of constituents (we refer to this as the ‘Khinchin condition’). This, or something like it, is often taken to be an explanation of why calculations in both frameworks coincide (Malament and Zabell Reference Malament and Zabell1980; Davey Reference Davey2009; Wallace quoted in Werndl and Frigg [Reference Werndl and Friggforthcominga]).

This is correct, but the question is how far it goes. The point to note is that the conditions rule out not only artificial examples but also realistic physical models. Examples of systems with macrovariables that do not satisfy the Khinchin condition include the Ising model with the internal energy or the magnetization as macrovariables, the six-vertex model with the internal energy or the polarization as macrovariables (cf. Baxter Reference Baxter1982), and the Kac ring with the standard macrostate structure of the number of black and white balls.

So we need other conditions next to the Khinchin condition. Let us look at a situation in which a system has both a Boltzmann and a Gibbs equilibrium. In this case the following theorem provides sufficient conditions for the equilibrium values of both equilibriums to coincide:

Equilibrium Equivalence Theorem (EET). Suppose that the system (X, T _t, μ_X) is composed of $N \geq 1$ constituents. That is, the state $x \in X$ is given by the N coordinates $x = (x_{1}, \dots, x_{N})$ ; $X = X_{1} \times X_{2} \dots \times X_{N}$ , where $X_{i} = X_{oc}$ for all $1 \leq i \leq N$ (X_oc is the one-constituent space). Let μ_X be the product measure $μ_{X_{1}} \times μ_{X_{2}} \dots \times μ_{X_{N}}$ , where $μ_{X_{i}} = μ_{X_{oc}}$ is the measure on X_oc. Suppose that an observable κ is defined on the one-particle space X_oc and takes the values κ₁, … , κ_k with equal probability 1/k, $k \leq N$ .Footnote ⁹ Suppose that the macrovariable K is the sum of the one-component observable; that is, $K (x) = \sum_{i = 1}^{N} κ (x_{i})$ . Then the value corresponding to the largest macroregion as well as the value obtained by phase space averaging is

\frac{N}{k} (κ_{1} + κ_{2} + \dots κ_{k}) .

The proof is stated in full in the appendix; an intuitive sketch is as follows. Since the observable on the one-constituent space takes the values {κ₁,…, κ_k} with equal probability, combinatorial considerations show that the Boltzmannian equilibrium macroregion (i.e., the macroregion of largest size) is the one where there are N/k constituents taking the value κ₁, N/k constituents taking the value κ₂, …, N/k constituents taking the value κ_k. That is, the Boltzmannian equilibrium value is $(κ_{1} + \dots + κ_{k}) N / k$ . The Gibbsian equilibrium value obtained by phase averaging is also $(κ_{1} + \dots + κ_{k}) N / k$ because one simply takes the average over all sequences of the {κ₁, …, κ_k} and the κ_i are equally probable.

The proof does not make any assumptions about the dynamics of the system; in particular, it does not assume that the system is ergodic. The crucial assumptions of the theorem are (i) that the macrovariable is a sum of the observable on the constituent space and that the system’s measure is the product measure of its constituents and (ii) that the macrovariable on the constituent space corresponds to a partition with cells of equal probability. This theorem is important because it applies to many examples in SM. Consider, for instance, the baker’s gas discussed in section 3. The gas involves a partition of the unit square (the phase space for one particle) into cells ${ω_{1}^{bg}, \dots, ω_{k}^{bg}}$ , $k \in ℕ$ , of equal size δω. Suppose that a particle in $ω_{i}^{bg}$ takes value κ_i for all $i = 1, \dots, k$ . Now the macrovariables often considered for the baker’s gas are of the form $K (x) = \sum_{i = 1}^{N} κ (x_{i})$ (e.g., Lavis Reference Lavis2005).Footnote ¹⁰ Then the assumptions of EET are satisfied. Thus, the Boltzmannian equilibrium value is the same as the value obtained by Gibbsian phase space averaging, namely, $(κ_{1} + \dots + κ_{k}) N / k$ .

The Kac ring with the standard macrostate structure given by the magnetization and the ideal gas with the standard macrostates afford further examples of systems satisfying the conditions of the theorem (these examples are discussed in Werndl and Frigg [Reference Werndl and Frigg2015b]). In the Boltzmannian framework, one often considers macrovariables of the type assumed in the theorem (Frigg Reference Frigg and Rickles2008, 110). In these cases the Gibbsian and Boltzmannian equilibrium calculations lead to the same results.

The examples discussed earlier in the article, showing that equilibrium calculations in Boltzmannian and Gibbsian SM do not lead to the same results, violate the relevant assumptions. In the baker’s system with the macrovariables as discussed in section 3 and in the Ising model with the internal energy as an observable significant chunks of the phase space are taken up by nonequilibrium states, which results in the Khinchin condition not being satisfied. These two examples do not satisfy the assumptions of EET either because the macrovariables of both the baker’s gas and the Ising model are not the sum of a one-component macrovariable whose outcomes have equal probability. What does the work in these two examples is that significant parts of the phase space are taken up by nonequilibrium states and that the Boltzmannian equilibrium state has the lowest value of the macrovariable. This results in the phase average being different from the lowest value of the macrovariable (the Boltzmannian equilibrium value).

These examples show that Gibbsian and Boltzmannian calculations can but need not provide the same results.Footnote ¹¹ An important task of the foundations of SM is to classify under which conditions the two frameworks lead to the same results and under which conditions they do not. The Khinchin condition and EET provide partial answers to this question. The answers are partial because both offer only sufficient but not necessary conditions for Gibbsian and Boltzmannian results being the same.

7. Conclusion

There is widespread belief that Gibbsian and Boltzmannian SM provide the same equilibrium values. We argued that this is false. There are important cases in which the Boltzmannian and Gibbsian equilibrium calculations will lead to different results. Furthermore, the conditions under which an equilibrium exists are different for Gibbsian and Boltzmannian SM. It is, however, true that the equilibrium values coincide under special circumstances. We proved a new theorem giving conditions under which this is the case.

This raises the question of what happens in cases in which the two do not coincide. Is the Gibbsian or the Boltzmannian equilibrium the correct one? For the Ising model the correct empirical conclusions follow from the Boltzmannian and not the Gibbsian calculations, and we suspect that this will be the same for other cases. Which framework provides the correct values and under which circumstances is a question for future research.

Appendix. Proof of the Equilibrium Equivalence Theorem

First, we determine the value of the largest macroregion. Recall that N is a multiple of k and that the observable on the one-constituent space takes the values {κ₁, …, κ_k} with equal probability. Hence, for the macroregion of largest size there are N/k particles taking the value κ₁, N/k particles taking the value κ₂, …, N/k particles taking the value κ_k. Therefore, the value of the largest macroregion is $(κ_{1} + \dots + κ_{k}) N / k$ .

Let us now determine the phase average. The proof is by mathematical induction. We will show that the sum over all sequences of length N whose elements are in {κ₁, …, κ_k} is $k^{N - 1} N (κ_{1} + \dots + κ_{k})$ . Because each sequence has equal probability 1/k ^N, the desired result follows that the phase average is

\begin{matrix} (A1) & \frac{1}{k^{N}} k^{N - 1} N (κ_{1} + \dots + κ_{k}) = \frac{N}{k} (κ_{1} + \dots + κ_{k}) . \end{matrix}

For $N = 1$ , because $k \leq N$ , $k = 1$ , and the sum over all sequences of length N is $κ_{1} = 1^{1 - 1} 1 κ_{1} = κ_{1}$ .

For $N \to N + 1$ , we need to determine the sum over all sequences of length $N + 1$ . One obtains sequences of length $N + 1$ by adding to sequences of length N one element at the end of the sequences. One can add k possible elements at the end, and so a contribution to the sum over all sequences of length N + 1 is k times the sum of the sequences of length N; that is,

\begin{matrix} (A2) & k \times k^{N - 1} N (κ_{1} + \dots + κ_{k}) . \end{matrix}

What is still missing is the contribution from the element at the end. There are k^N sequences of length N to which a κ_i can be added, and hence this contribution is

\begin{matrix} (A3) & k^{N} (κ_{1} + \dots + κ_{k}) . \end{matrix}

Adding these two contributions leads to

\begin{matrix} (A4) & k \times k^{N - 1} N (κ_{1} + \dots + κ_{k}) + k^{N} (κ_{1} + \dots + κ_{k}) = k^{N} (N + 1) (κ_{1} + \dots + κ_{k}) . \end{matrix}

Footnotes

1. Combinatorial considerations are of course still useful to construct macrostates, and we rely on them below.

2. Macrostates can of course also be defined through an interval of values rather then only one value.

3. We assume that (1−α)ε>1/2.

4. Proofs for the deterministic case are given in Werndl and Frigg (Reference Werndl and Frigg2015a) and for the stochastic case in Werndl and Frigg (Reference Werndl, Frigg, Massimi, Romeijn and Schurz2017). The problem of the existence of such equilibrium states is discussed in Werndl and Frigg (Reference Werndl and Friggforthcomingb).

5. We assume that N=k×r for some r∈ℕ.

6. It is not an α-ε-equilibrium because the equilibrium macroregion takes up less than half of the phase space for large N (Lavis Reference Lavis2005).

7. More specifically, one finds that for arbitrarily large N the macrovalue closest to the value obtained by Gibbs phase space averaging will always be different from the macrovalue representing the Boltzmannian equilibrium.

8. This corresponds to an infinite temperature, which is often considered as an approximation to very high temperatures (e.g., Baxter Reference Baxter1982). We work with β=0 for reasons of simplicity; similar examples could be given for low but nonzero values of β.

9. Here, N is assumed to be a multiple of k, i.e., N=k×s for some s∈ℕ.

10. Note that this macrovariable is very different from the one in sec. 3.

11. This discrepancy is not an artifact of the use of our own definition of the Boltzmannian equilibrium. The same results would follow if one adopted the standard definition of the equilibrium as the largest macroregion.

References

Baxter, Rodney J. 1982. Exactly Solved Models in Statistical Mechanics. London: Academic Press.Google Scholar

Berger, Arno. 2001. Chaos and Chance: An Introduction to Stochastic Aspects of Dynamics. New York: de Gruyter.CrossRef Google Scholar

Cipra, Barry. 1987. “An Introduction to the Ising Model.” American Mathematical Monthly 94:937–54.CrossRef Google Scholar

Davey, Kevin. 2009. “What Is Gibbs’s Canonical Distribution?” Philosophy of Science 76 (5): 970–83.CrossRef Google Scholar

Frigg, Roman. 2008. “A Field Guide to Recent Work on the Foundations of Statistical Mechanics.” In The Ashgate Companion to Contemporary Philosophy of Physics, ed. Rickles, Dean, 99–196. London: Ashgate.Google Scholar

Goldstein, Sheldon. 2001. “Boltzmanns Approach to Statistical Mechanics.” In Chance in Physics: Foundations and Perspectives, ed. Bricmont, Jean, Dürr, Detlef, Galavotti, Maria Carla, Ghirardi, Gian Carlo, Petruccione, Francesco, and Zangh, Nino, 39–54. Berlin: Springer.CrossRef Google Scholar

Khinchin, Alexandr I. 1949. Mathematical Foundations of Statistical Mechanics. Mineola, NY: Dover.Google Scholar

Lavis, David. 2005. “Boltzmann and Gibbs: An Attempted Reconciliation.” Studies in History and Philosophy of Modern Physics 36:245–73.CrossRef Google Scholar

Lebowitz, Joel L. 1993. “Boltzmann’s Entropy and Time’s Arrow.” Physics Today 46:32–38.CrossRef Google Scholar

Malament, David B., and Zabell, Sandy L. 1980. “Why Gibbs Phase Averages Work: The Role of Ergodic Theory.” Philosophy of Science 56:339–49.Google Scholar

Secular, Paul. 2015. “Monte-Carlo Simulation of Small 2D Ising Lattice with Metropolis Dynamics.” http://secular.me.uk/physics/ising-model.pdf.Google Scholar

Uffink, Jos. 2007. “Compendium of the Foundations of Classical Statistical Physics.” In Philosophy of Physics, ed. Butterfield, Jeremy and Earman, John, 923–1047. Amsterdam: North Holland.CrossRef Google Scholar

Werndl, Charlotte, and Frigg, Roman. 2015a. “Reconceptionalising Equilibrium in Boltzmannian Statistical Mechanics.” Studies in History and Philosophy of Modern Physics 49:19–31.CrossRef Google Scholar

Werndl, Charlotte, and Frigg, Roman 2015b. “Rethinking Boltzmannian Equilibrium.” Philosophy of Science 82:1224–35.CrossRef Google Scholar

Werndl, Charlotte, and Frigg, Roman 2017. “Boltzmannian Equilibrium in Stochastic Systems.” In EPSA15 Selected Papers: The 5th Conference of the European Philosophy of Science Association in Düsseldorf, ed. Massimi, Michela, Romeijn, Jan-Willem, and Schurz, Gerhard. Berlin: Springer.Google Scholar

Werndl, Charlotte, and Frigg, Roman Forthcominga. “Transcript of the Talk and Discussion at the Oxford Workshop in Honour of David Wallace.” In Proceedings of the Quantum Foundations of Statistical Mechanics Workshop in Oxford, ed. Daniel Bendingham, Owen Marony, and Christopher Timpson. Oxford: Oxford University Press.Google Scholar

Werndl, Charlotte, and Frigg, Roman Forthcomingb. “When Does a Boltzmannian Equilibrium Exist?” In Quantum Foundations of Statistical Mechanics, ed. Daniel Bedingham, Owen Maroney, and Christopher Timpson. Oxford: Oxford University Press.Google Scholar

Article contents

Mind the Gap: Boltzmannian versus Gibbsian Equilibrium

Abstract

1. Introduction

2. Equilibrium in Gibbs and Boltzmann

3. Example 1: The Baker’s Gas

4. Example 2: The Ising Model

5. Existence under Different Conditions

5.1. Gibbs Does Not Imply Boltzmann

5.2. Boltzmann Does Not Imply Gibbs

6. When Boltzmann and Gibbs Agree

7. Conclusion

Appendix. Proof of the Equilibrium Equivalence Theorem

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests