1. Boltzmann versus Gibbs
In a recent article, Charlotte Werndl and Roman Frigg (Reference Werndl and Frigg2017) discuss the relationship between the Boltzmannian and Gibbsian framework of statistical mechanics, addressing, in particular, the question whether the two formulations yield equivalent predictions for equilibrium values of macroscopic observables defined in terms of appropriate macrovariables. In the Boltzmannian framework, the macrovariables take (approximately) constant values on the equilibrium region of phase space that are thus revealed by a suitable measurement on a system in equilibrium (a system, that is, whose actual microstate is in the equilibrium region). The role of the stationary phase space measure is to establish that it is, in some sense, very likely to find the system in an equilibrium state. In the Gibbsian framework, equilibrium is a property of an ensemble, represented by a stationary distribution ρ on phase space Γ, and it is often (although maybe somewhat carelessly) said that the prediction for a measurement of a macrovariable f on an individual ensemble system is given by the phase average
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df1.png?pub-status=live)
where are the phase space coordinates. This quantity is also called the ensemble average or expectation value of f.
There are many situations in which the Gibbsian phase average agrees—within appropriate error bounds—with the Boltzmannian equilibrium value. Werndl and Frigg mention a criterion that they call the “Khinchin condition” and that they characterize briefly as the phase function having “small dispersion for systems with a large number of constituents” (Reference Werndl and Frigg2017, 1298). Indeed, in less technical terms, a sufficiently small dispersion of the macrovariable means precisely that typical values (the Boltzmann equilibrium value) are close to the average value (the Gibbsian equilibrium value). Another way to formulate the Khinchin condition—now from a Boltzmannian perspective—is to say that there exists a unique Boltzmann equilibrium whose corresponding macroregion exhausts almost the entire phase space volume. Formally
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df2.png?pub-status=live)
where is very small compared to ξ, and ε is very small compared to 1 (and 1A denotes the characteristic function of the set A). For then, the macrovariable f takes an (approximately) constant value (the Boltzmannian equilibrium value
) on a set of measures close to 1 (the Boltzmannian equilibrium region Γeq). Hence, the phase average (1) will be close to the Boltzmannian equilibrium value, provided f is somewhat well behaved and its values do not suddenly “explode” outside the equilibrium region. Rigorously
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df3.png?pub-status=live)
and thus if
, that is, unless the nonequilibrium values of f are many orders of magnitude larger than the equilibrium value.
It is important to emphasize that (unless one considers a thermodynamic limit) “the Boltzmannian equilibrium value” refers, in general, to a small range of values of f. This kind of coarse-graining is both essential for probabilistic estimates and physically called for since the relevant measurements have limited accuracy. For Werndl and Frigg, in contrast, every single value of the macrovariable f defines a different “macrostate,” implying that, for them, even the slightest variation in the respective physical quantity (pressure, density, energy, etc.) leads to a system being “out of equilibrium.” What the authors call the “Boltzmann equilibrium” is thus not the equilibrium as defined by Boltzmann or used in Boltzmannian statistical mechanics, and it is unfortunate—since it is misleading—that they refer to it by the same name. For our further discussion, we will thus refer to it as the “Werndl-Frigg equilibrium” instead.
The existence of a dominant Boltzmann equilibrium (in the sense of Boltzmann) is, in fact, the generic case in statistical mechanics, giving rise to the claim that Boltzmann and Gibbs make (in general) equivalent predictions for systems in the respective equilibriums. If one agrees that the Boltzmannian formulation is the more fundamental one, this also explains why Gibbsian phase averaging yields (in general) accurate predictions for individual measurements.
Nonetheless, there are some famous and well-studied cases in which the Khinchin condition (2) does not hold. For instance, in the two-dimensional Ising model without an external field, it makes sense to speak of two Boltzmann equilibriums below the critical temperature, corresponding to a positive or negative magnetization, respectively. The Gibbsian distribution ρ is, however, symmetric under a flip of all spins, hence yielding an average magnetization of zero. There is nothing inconsistent or mysterious about this fact, as long as one keeps in mind that the Gibbsian value refers, in the first place, to an ensemble average. In particular, in statistical mechanics, one does not try to draw interesting conclusions about the Ising model from such phase averages. Instead, one usually studies so-called phase transitions at the critical temperature by fixing either +1 or −1 boundary conditions (referring to the polarization of spins at the edge of the lattice) thus implicitly picking one of the two magnetization states.
2. Boltzmann versus Werndl and Frigg
In their article, Werndl and Frigg (Reference Werndl and Frigg2017) mention the magnetization in the Ising model only briefly but present instead other examples for which they claim a disagreement between Boltzmannian and Gibbsian equilibrium values. One such example is based on the “baker’s gas,” a mathematical model for the ideal gas that the authors have used in various publications to argue that the Boltzmann equilibrium fails to be dominant, that is, exhaust a majority of the phase space volume as stated in equation (2). To this end, the authors partition the one-particle phase space (corresponding to the unit square in the baker’s model) into k cells and claim that the macrostate corresponding to the Boltzmann equilibrium is the “uniform distribution” for which each cell contains exactly N/k particles. It is then easy to see that while the phase space volume associated with this distribution is greater than the phase space volume associated with any other particular arrangement of particles over the cells, it will not exhaust a majority of phase space volume for large N. Werndl and Frigg exploit this fact—amounting to an apparent violation of the Khinchin condition (2)—by introducing an artificial macrovariable, weighing the previously defined “macroregions” in such a way that the Gibbsian phase average differs significantly from the value associated with the uniform distribution.
However, as emphasized above, the authors’ reference to the “Boltzmann equilibrium” is a misnomer, since they use a notion of equilibrium that does not correspond to the concept introduced by Boltzmann and used in Boltzmannian statistical mechanics (thereby repeating the arguments of Lavis [Reference Lavis2005] already criticized by Lazarovici and Reichert [Reference Lazarovici and Reichert2015]). Exact uniform distributions, where each cell contains exactly N/k particles, are very special configurations: their measure actually goes to zero for large N. But configurations for which the density of particles in each cell differs only slightly from 1/k are macroscopically indistinguishable and coarse-grain to the same macrostate in Boltzmann’s sense. Otherwise, we would have to say, for instance, that a gas is “out of equilibrium” if the left-hand side of the volume contains even a single particle more than the right-hand side.
Compare this with Boltzmann’s discussions of the Maxwellian velocity distribution as the equilibrium distribution of an ideal gas. Here, Boltzmann was very explicit about the fact that “for a finite number of molecules, the Maxwell distribution will never hold exactly but only to a good approximation” (Reference Boltzmann1896, 775, my translation).
In the case of the baker’s model, the dominant Boltzmann equilibrium with respect to the considered uniform measure contains all configurations for which the relative number of particles in each cell is within (I am neglecting a k-dependent constant that would yield somewhat better bounds). Since
is a tiny number for macroscopic N, corresponding to density fluctuations of less than one-tenth of a billionth of a percent, these configurations look macroscopically uniform and constitute the relevant equilibrium state. The Khinchin condition (2) is thus satisfied, and the Boltzmannian and Gibbsian equilibrium values will be equivalent for any sensible macrovariable as specified above.
3. Law of Large Numbers versus Equilibrium Equivalence Theorem
By the same token, all other examples presented by Werndl and Frigg (Reference Werndl and Frigg2017) may show a disagreement between the Gibbsian equilibrium and their own but shed no light on the relationship with the real Boltzmann equilibrium. Nevertheless, the authors go on to conclude that it is “an important task of the foundations of SM [statistical mechanics] … to classify under which conditions the two frameworks lead to the same results and under which conditions they do not” and present a “new theorem giving conditions” under which “Boltzmannian” and Gibbsian equilibrium values coincide (1300). I quote their result in full (1298):
Equilibrium Equivalence Theorem (EET). Suppose that the system (X, Tt, μX) is composed of constituents. That is, the state
is given by the N coordinates
;
, where
for all
(Xoc is the one-constituent space). Let μX be the product measure
, where
is the measure on Xoc. Suppose that an observable k is defined on the one-particle space Xoc and takes the values κ1, …, κk with equal probability
,
. Suppose that the macrovariable k is the sum of the one-component observable; that is,
. Then the value corresponding to the largest macroregion as well as the value obtained by phase space averaging is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df107.png?pub-status=live)
This theorem tries to identify the Werndl-Frigg equilibrium value with the Gibbs average. It does not relate to the Boltzmann equilibrium. That is because the “largest macroregion” of Werndl and Frigg refers to the set on which the macrovariable takes exactly the average value, and this is, again, in contrast to the Boltzmannian framework, in which a small range of (for all practical purposes indistinguishable) values coarse-grains to one and the same macrostate.
It should be noted, however, that part of the conditions specified by Werndl and Frigg are sufficient (although by no means necessary) for the equivalence of Gibbsian and Boltzmannian equilibrium values. In fact, under these conditions, the equivalence is a standard exercise in statistical mechanics, based on the law of large numbers (LLN)—the fundamental theorem of probability theory. The LLN states that, for a family of independent and identically distributed random variables,
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df4.png?pub-status=live)
for any , where σ2 is the variance of κ. For the macrovariable
then, which is extensive and growing with N, we can set
for
, so that, in terms of K (and writing
as a “phase average”), the LLN estimate becomes
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df5.png?pub-status=live)
Note that the bound is small compared to
, which is of order N; that is, it is the relative deviation
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df106.png?pub-status=live)
that becomes vanishingly small for large particle numbers. In particular, for a macroscopic system, we have (from Avogadro’s constant), and setting
, we can conclude that K deviates from
by less than 1 millionth of a percent on a set of measure (approximately) 0, 999,999.Footnote 1
Summing up in less technical terms, the weak LLN states precisely that for large N (which is the relevant case in statistical mechanics), phase space is dominated by an equilibrium region, on which the value of the macrovariable K is very close to the expectation value (= phase average). And this is precisely the empirical equivalence of Boltzmannian and Gibbsian equilibrium values. The LLN also yields immediately the Khinchin condition, both in the sense of small dispersion (which is actually how the LLN is usually proven) and in the form of equation (2) (to be compared with [5]).
The LLN holds, in fact, under much more general assumptions than those of the EET. In the form of equation (5), the result holds for any sum of uncorrelated and identically distributed random variables with finite variance. This is to say that neither the uniform distribution nor the discreteness of the random variables is required, and they do not even have to be defined on a one-constituent space. The conditions are nonetheless very strong and hardly apply to anything but idealized models. In fact, when Werndl and Frigg write that “the crucial assumptions of the theorem are (i) that the macrovariable is a sum of the observable on the constituent space and (ii) that the macrovariable on the constituent space corresponds to a partition with cells of equal probability” (Reference Werndl and Frigg2017, 1299), they overlook the truly critical assumption in both their theorem and the LLN. While i is often justified, and the “equal probability” in ii is unnecessary when one argues with the LLN, the crucial and very strong assumption is the statistical independence of the constituent variables (their joint distribution factorizes), which does not hold, in most realistic cases, because of interactions or conserved quantities. (It is precisely this loss of statistical independence that becomes significant in the Ising below the critical temperature.) Controlling correlations—or justifying in some other way a LLN—is really the key issue in many of the hard problems in statistical mechanics.
I want to emphasize again that while the LLN applies immediately under their stated assumptions, the theorem proven by Werndl and Frigg is not a LLN statement because the authors have a different notion of “Boltzmann equilibrium” in mind. The LLN, as much of statistical mechanics, is about estimates. Werndl and Frigg do not do estimates. Instead, in their article, the Werndl-Frigg equilibrium value and its equality to the phase average is supposed to be exact. This is why, in addition to considering standard conditions for the LLN, the authors assume a particularly simple and symmetric distribution of the macrovariable for which the average coincides with the most likely value. The EET is then a straightforward exercise in combinatorics. Its physical relevance, however, is questionable, to say the least. For one, the measure of what Werndl and Frigg call the “largest macroregion,” that is, the set on which K takes precisely the value , actually goes to zero for large N. Interpreting this measure probabilistically, it is thus extremely unlikely for a system to be in this Werndl-Frigg equilibrium. Second, while the macrovariable considered by the authors is discrete, different values can be very close for large N. Therefore, many different “macrostates” in the sense of Werndl and Frigg will be empirically indistinguishable, given the limited resolution of measurements on macroscopic systems. This is a crucial difference between macrostates in the sense of Werndl and Frigg and macrostates in the sense of Boltzmann. In general, it should be stressed that the relevant notion of equivalence for Boltzmannian and Gibbsian equilibrium predictions can only be empirical equivalence, that is, that the respective values agree to a sufficiently good approximation.
The LLN is, in fact, the paradigm that one should have in mind when one thinks about the Boltzmann equilibrium: there is a certain range of typical values for the relevant quantities, and the larger the particle number N, the more weight (phase space measure or probability, if you wish) is concentrated on an ever smaller range of values around the mean.
It is also a standard result in probability theory that the variance (= dispersion squared) for a sum of independent random variables (as considered by Werndl and Frigg) is additive. This is to say, in particular, that typical fluctuations are of the order , and we will not have a dominant equilibrium region if the coarse-graining into macrostates is finer than that. This is a mathematical fact, not a foundational issue.Footnote 2
To be clear, a “macrovariable,” qua mathematical object, is usually some nice function of the microscopic variables—think, for instance, of the energy H(q, p) as a function of the particles’ positions and momenta in a canonical ensemble. But such a variable is in general too fine-grained to consider all its different values as macroscopically distinct. There is thus not a one-to-one correspondence between the possible values of the macrovariable and Boltzmannian macrostates. Boltzmannian macrostates are supposed to be observationally distinguishable by relevant means.
It should also be noted that while the Boltzmannian equilibrium value is thus necessarily “unsharp,” the Gibbsian equilibrium value, if identified with the phase average (1), is a definite real number by definition. It would be a mistake, however, to read this as an “infinitely precise” prediction of the Gibbsian theory. Instead, one can, for instance, compute the ensemble variance
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220325095958164-0493:S003182480001521X:S003182480001521X_df6.png?pub-status=live)
and identify the Gibbsian prediction, to be compared with the Boltzmannian one, with .
4. Conclusion
In sum, I arrive at the following conclusions about the arguments of Werndl and Frigg (Reference Werndl and Frigg2017) and their proposed Equilibrium Equivalence Theorem. (1) Contrary to what the authors announce in their article, their theorem does not provide any new conditions for the agreement of Boltzmannian and Gibbsian equilibrium values. They have instead stated conditions (plus some unnecessary ones) that are standard for the LLN, from which the Khinchin condition readily follows.
(2) The authors’ suggestion that “we need other conditions next to the Khinchin condition” seems ill motivated to begin with (Reference Werndl and Frigg2017, 1298). They present examples in which what they identify as the “Boltzmann equilibrium” fails to be dominant—so that the Khinchin condition (2) is not satisfied—but those are also examples in which they find the associated equilibrium value to disagree with the Gibbsian one. In fact, a good case can be made that the Khinchin condition—in the sense of “uniqueness and dominance of the Boltzmann equilibrium”—is not only sufficient but also necessary for the empirical equivalence of Boltzmannian and Gibbsian equilibrium values.Footnote 3 (For otherwise, we have either no Boltzmann equilibrium—and thus no Boltzmann equilibrium value—or multiple Boltzmann equilibriums, so that the Gibbsian phase average will correspond to an average of the Boltzmann equilibrium values rather than any one in particular.) However, instead of arguing this point in greater detail, a more important remark may be the following: if the Khinchin condition does not hold, it means that there is a high probability of finding macrovalues that differ significantly from the Gibbsian phase average, so that this phase average, as a prediction for individual measurements, is highly dubious in the first place.Footnote 4
(3) Werndl and Frigg have written a series of articles (Reference Werndl and Frigg2015a, Reference Werndl and Frigg2015b, Reference Werndl and Frigg2017) attacking the premise of a dominant Boltzmann equilibrium and addressing the problems—like the nonequivalence of Boltzmannian and Gibbsian equilibrium values—that ensue. In this note, I have argued that the apparent foundational problems arise merely from their own definition of “macrostates” and “Boltzmann equilibrium” that does not correspond to the way in which physicists, since Boltzmann, have actually argued. Since the authors generally fail to consider an appropriate coarse-graining of the microscopic state space—as suggested by both physical considerations and elementary results in probability theory—their counterexamples to the existence of a dominant Werndl-Frigg equilibrium have no relevance for the Boltzmannian framework. No Boltzmannian—least of all Boltzmann himself—every claimed that one could partition phase space in any arbitrary matter and end up with a dominant equilibrium state.
In general, I am skeptical of this ad absurdum approach to the foundations of statistical mechanics—constructing artificial counterexamples that create artificial problems. In my opinion, it misses the point of the discipline, which is not an “axiomatic” theory but an effective framework for the description of complex systems that requires some degree of pragmatism and good physical sense. A crucial difference is that, in an axiomatic theory, any counterexample can point to foundational issues, while in statistical mechanics, such counterexamples point merely to inadequate use.