1. Introduction
The word “gauge” is ubiquitous in modern physics. Our best physical theories are described, in various contexts, as “gauge theories.” The “gauge argument” allegedly reveals the underlying “logic of nature” (Martin Reference Martin2002). Our theories regularly exhibit “gauge freedom,” “gauge structure,” and “gauge dependence.” Unfortunately, however, it is far from clear that the term has some univocal meaning across the many contexts in which it appears. It is a bit like “liberal” in American political discourse: it shows up everywhere, and no one knows what it means.
Here I focus on two strands of usage.Footnote 1 On the first strand, a “gauge theory” is a theory that exhibits excess structure or, in Earman’s words, “descriptive fluff” (Reference Earman2004).Footnote 2 On this way of thinking about gauge, there is a mismatch between the mathematical structure used in the theory and the structure we take the world to have, in such a way that (perhaps) one could remove some structure from the theory without affecting its descriptive or representational power. Most famously, Earman and Norton (Reference Earman and Norton1987) argue that the so-called hole argument shows that general relativity is a gauge theory in just this sense; some have taken this as motivation for moving to a different, perhaps undiscovered, formalism for representing space-time (Earman Reference Earman1989).Footnote 3 Similar considerations have motivated some views on Yang-Mills theory (Healey Reference Healey2007; Rosenstock and Weatherall Reference Rosenstock and Weatherall2015).
The second strand of usage concerns a specific class of theories. Here one uses “gauge theory” to refer to various generalizations of classical electromagnetism that share a certain mathematical structure.Footnote 4 For instance, Trautman (Reference Trautman and Held1980) defines “gauge theory” as follows: “For me, a gauge theory is any physical theory of a dynamic variable which, at the classical level, may be identified with a connection on a principal bundle” (26). This turns out to be a large class containing most of our fundamental theories, including all Yang-Mills theories, general relativity, and Newton-Cartan theory. It is in this context that physicists seem to speak most often of gauge theories, usually as a synonym for “Yang-Mills theories.”
It is easy to imagine that the two strands are closely related and, in particular, that all gauge theories in the second sense are also gauge theories in the first sense. But as I argue below, this is a mistake. In particular, I will articulate a precise sense in which electromagnetism—the paradigmatic example on both strands—may be understood to have excess structure, and thus to be a gauge theory in the first sense. I then consider whether other theories, such as Yang-Mills theory and general relativity, have excess structure in the same sense. I will argue that they do not. It follows that on at least one precise sense of what it means for a theory to have excess structure, the two strands of usage described above come apart.
2. Two Approaches to Electromagnetism
In what follows, we consider electromagnetism on the fixed background of Minkowski space-time, .Footnote 5 In this setting, there are two ways of characterizing models of ordinary electromagnetism.Footnote 6
On one characterization, the principal dynamical variable is the electromagnetic field, represented by a two-form on space-time. The electromagnetic field is required to satisfy Maxwell’s equations, which may be expressed as
and
, where
is the Minkowski derivative operator and
is a smooth vector field representing the charge-current density on space-time. A model of the theory on this characterization might be written as a triple
, where
is any closed two form (i.e., any two form satisfying the first of Maxwell’s equations).Footnote 7 Call this formulation of the theory EM1.
On the second characterization, the dynamical field is the four-vector potential, represented by a one-form Aa on space-time. This field is required to satisfy a single differential equation: , where again Ja is the charge-current density. A model of the theory may again be represented by a triple,
, where Aa is any one form.Footnote 8 I call this formulation of the theory EM2.
These two formulations are systematically related. Given any model of EM2, I can always define an electromagnetic field by
. Since any Fab thus defined is exact, it must also be closed, and thus the resulting triple
is a model of EM1; moreover, this Fab is associated with the same charge-current density as Aa. Conversely, given any model
of EM1, since Fab is closed, it must also be exact, and thus there exists a one-form Aa such that
.Footnote 9 The triple
is then a model of EM2, again with the same charge-current density.
However, there is an important asymmetry in this relationship. Given any model of EM2, there exists a unique corresponding model of EM1, because any smooth one form has a unique exterior derivative. But the converse is not true: given a model of EM1, there will generally be many corresponding models of EM2, since if Aa is such that
, then
, for any smooth scalar field χ, also satisfies
, because for any smooth scalar field,
. Transformations
of this form are sometimes known as gauge transformations.
It is this asymmetry that, I claim, supports the common view that electromagnetism has excess structure. The idea is that EM1 and EM2 both have all of the resources necessary to represent classical electromagnetic phenomena. Indeed, in both cases, one may take the empirical content of electromagnetism to be fully exhausted by the electromagnetic field associated with a given model—either directly in the case of EM1 or as derived above in the case of EM2. But there are prima facie distinct models of EM2 associated with the same electromagnetic field. Thus, it would seem that these models of EM2, although they differ in their mathematical properties, should be taken to have the same representational capacities. Intuitively, then, whatever structure distinguishes models of EM2 related by gauge transformations must be irrelevant to the representational capacities of the models, at least as long as the empirical content is exhausted by the associated electromagnetic field. It is in this sense that electromagnetism—or really, EM2—has excess structure.
3. Comparing Structure
In the next section, I make the intuitive argument just given precise. First, however, I take a detour through pure mathematics. Mathematical objects often differ in how much structure they have. For instance, topological spaces have more structure than sets: a topological space (X, τ) consists in a set X, along with something more, namely, a collection τ of open subsets of X satisfying certain properties. Similarly, an inner product space has more structure than a vector space, and a Lie group has more structure than a smooth manifold. In this section, I use some basic category theory to capture these judgments as mathematical relationships between the theories.Footnote 10
To begin, recall that various mathematical theories may be associated with categories. For instance, there is a category Set, whose objects are sets and whose arrows are functions. There is a category Top, whose objects are topological spaces and whose arrows are continuous functions. There are also functors between such categories. For instance, there is a functor that takes every topological space (X, τ) to the set X and takes every continuous map
to the function
. Functors of this sort are often called “forgetful,” because, intuitively speaking, they take objects of one category and forget something about them: in this case, they take topological spaces (X, τ) and forget about τ.
How can one tell whether a given functor is forgetful? There is a simple but insightful theory available, due to Baez, Bartel, and Dolan (Reference Baez, Bartel and Dolan2004; see also Barrett Reference Barrett2013). It requires a few further definitions, concerning properties that a functor may have. First, we will say that F is full if for all objects A, B of
, the map
induced by F is surjective. Similarly, F is faithful if for all pairs of objects in
, the induced map on arrows is injective. And F is essentially surjective if for every object X of
, there is some object A of
and arrows
and
such that
and
. (Such an arrow f is an isomorphism, so essentially surjective functors are surjective on objects “up to isomorphism.”)
If a functor is full, faithful, and essentially surjective, then the functor is said to realize an equivalence of categories. In such cases, F forgets nothing. Otherwise a functor is forgetful. In particular, a functor forgets (only) structure if it is faithful and essentially surjective but not full. A functor forgets (only) properties if it is full and faithful but not essentially surjective. And a functor forgets (only) stuff if it is full and essentially surjective but not faithful. In general, a given functor may be forgetful in more than one of these ways, but not in any other ways: any functor may be written as the composition of three functors that forget (no more than) structure, properties, and stuff, respectively.
The best way to make this categorization plausible is by considering examples. For instance, the functor we have already considered, , forgets only structure. This is because every set corresponds to some topological space (or other), which means that F is essentially surjective. Similarly, any distinct continuous functions
must be distinct as functions, so F is faithful. But F is not full, because not every function
is continuous, given topologies on X and X′. So in this case, the classification captures the pretheoretic intuition with which we began.
Similarly, we can define categories Grp and AbGrp, whose objects are groups and Abelian groups, respectively, and whose arrows are group homomorphisms; then, there is a functor that takes Abelian groups and group homomorphisms to themselves. This functor is full and faithful, since it just acts as the identity on group homomorphisms between Abelian groups. But it is not essentially surjective because not every group is Abelian. So this functor forgets only properties—namely, the property of being Abelian. And finally, we can define a functor
, where 1 is the category with one object and one arrow (the identity on the one object). This functor takes every set to the unique object of 1 and every arrow to the unique arrow of 1. It is clearly full and essentially surjective, but not faithful, so it forgets only stuff. To see how, note that we may think of 1 as the category with the empty set as its only object; thus, H forgets all of the elements of the sets.
This classification of functors gives us a criterion for when a mathematical theory T 1 may be said to have more structure than another theory T 2, namely, when there exists a functor from the category associated with T 1 to the category associated with T 2 that forgets structure. Given two categories, there may be multiple functors between them, and it may be that not all such functors forget structure, even if there exists one that does. This means that comparative judgments of “amount of structure” between theories should be understood as relative to a choice of functor. This flexibility is a virtue: it allows us to explore various ways in which theories may be related.
4. A Diagnostic Tool
I now return to the question of interest. To begin, I use the criterion just developed to make the intuitive argument at the end of section 2 precise. I define two categories, corresponding to the two formulations of electromagnetism already discussed, and then define a functor between them that captures the relationship already discussed between EM1 and EM2.
The first category, , has models
of EM1 as objects and as arrows has maps that suitably preserve this structure. For present purposes, we take these to be isometries of Minkowski space-time that preserve the electromagnetic field, so that given two models,
and
, an arrow
will be an isometry of
such that
. Likewise, we may define a category
whose objects are models
of EM2 and whose arrows are isometries of Minkowski space-time that preserve the four-vector potential. Given these categories, the map defined above, taking models
of EM2 to models
of EM1, becomes a functor
that take arrows of
—which, recall, are isometries of Minkowski space-time with an additional property—to the arrow of
corresponding to the same isometry. (This action on arrows is well defined because, given any arrow
of
,
.)Footnote 11 We then have the following result.
Proposition 1. F forgets only structure.
Proof. F is clearly faithful and essentially surjective. To see that it is not full, consider the object of
. The object
of
maps to A. But now consider any nonconstant scalar field ψ. Then
is a model of
, and
. If F were full, then, there would have to be an arrow
such that
, but this is impossible, since by construction, there are no arrows between X and X′. QED
Proposition 1 provides a precise sense in which EM2 has more structure than EM1: the functor realizing the natural relationship between the theories forgets (only) structure. Recall that the intuitive argument was that there are distinct models of EM2 corresponding to a single model of EM1, and thus there must be features of the models of EM2 that distinguish them, without making any difference to their empirical content. The present argument, meanwhile, is that there are models of EM2 that fail to be isomorphic—by the standard of isomorphism used in defining EM2—even though the corresponding models of EM1 are isomorphic or even identical. This is captured in the formalism by the fact that there are arrows in , which we may interpret as “structure-preserving maps” between models of EM1, that are not structure-preserving maps between models of EM2. The structure that these maps do not preserve is the structure that, on the intuitive argument, distinguished models of EM2.
I take this to be strong evidence that the formal criterion given by forgetful functors captures the sense in which electromagnetism has excess structure. And since electromagnetism is the paradigmatic example of a gauge theory, I take this to be the sense of “excess structure” associated with the first strand of usage.Footnote 12 Of course, there may be other senses in which a theory might be thought to have excess structure, but I will not consider that question further here. Rather, I stipulate that the criterion developed here is salient and turn to a different question. Do gauge theories in the second sense—that is, the theories Trautman identifies—have excess structure?
No. First, consider electromagnetism, formulated now as a theory whose dynamical variable is a connection on a principal bundle over Minkowski space-time—that is, electromagnetism formulated as a gauge theory in Trautman’s sense. Call this theory EM3. Models of EM3 may be written (P, ωα), where P is the total space of the (unique, trivial) principal bundle over Minkowski space-time and ωα is a principal connection.Footnote 13 This theory is closely related to both EM1 and EM2 as already discussed: given any (global) section
, we may define a four-vector potential Aa as the pullback along σ of ωα:
. Similarly, we may define an electromagnetic field tensor Fab as the pullback along σ of the curvature of the connection, defined by
, where d is the exterior derivative on P:
. Thus, Aa and Fab may be thought of as representatives on M of the connection and curvature on P. In general, Aa will depend on the choice of section σ, whereas Fab will not depend on that choice because U(1) is an Abelian group.
Given this characterization of the theory, we can define yet another category, , as follows: the objects of
are models of EM3, and the arrows are principal bundle isomorphisms
that preserve both the connection on P and the metric on M: that is, pairs of diffeomorphisms
and
such that
,
,
, and
for any x ∈ P and any g ∈ U(1). Then we may define a functor
as follows:
acts on objects as
, for any global section
, and
acts on arrows as
. (Again, this action on arrows is well defined. Choose any section
. Then
is also a section of P. But since
and
are independent of the choice of section,
.)
Proposition 2. forgets nothing.
Proof. First, consider an arbitrary object of
and pick some Aa such that
. Then given any section
and any one-form Aa, we can define a connection ωα on P by assigning to each point
the one-form π*(Aa) and then extending this field on σ[M] to all of P by requiring it to be equivariant. It follows that
is essentially surjective. Now consider any two objects
and
of
and suppose there exist arrows
such that
. It follows that
is a vertical bundle automorphism such that
. But by the equivariance of
, this is only possible if
. Since Ψ and
are bijective, it follows that
. Thus,
is faithful. Finally, suppose there is an arrow
. Then ψ is an isometry of Minkowski space-time such that, for any section
,
. Fix
. It follows that there exists a section
such that
. We may then define a diffeomorphism
as follows. For any
, there exists some
such that
. This relationship determines a smooth map
satisfying
for any
. We then define Ψ by
for all x ∈ P. This map
is a diffeomorphism such that
and
. Thus,
is a principal bundle isomorphism. Moreover, since for any
,
, it follows by the equivariance of ωα and
that
. So
is full. QED
This result shows that EM3 does not have excess structure in the sense that EM2 does. To extend this to other gauge theories in the second sense, however, requires more work. The reason is that the criterion we have been using requires us to have two formulations, both of which are taken to be descriptively adequate and empirically equivalent, which we then compare. In other cases of interest, though, such as non-Abelian Yang-Mills theory or general relativity, it is not clear that we have a plausible second theory to consider.
Still, there is something one can say. It concerns the role of “gauge transformations” between models of EM2, as described at the end of section 2. These are maps that relate models of EM2 that have the same representational capacities even though they are not isomorphic. The criterion of excess structure described here, meanwhile, requires the existence of a functor between categories of models that fails to be full—or in other words, a standard of comparison between the theories relative to which one formulation has “more” arrows than the other or, alternatively, relative to which one of the formulations is “missing” arrows.Footnote 14 This suggests a rule of thumb for whether a theory, or a formulation of a theory, has excess structure in the sense described here, namely, the theory has models that are not isomorphic but that nevertheless we interpret as having precisely the same representational content. Indeed, whereas the criterion discussed above tells us when one theory or formulation has more structure than another, this second criterion evaluates whether any alternative formulation could have less structure and still do the same descriptive work—at least without equivocating between physical situations we now think are distinct.
How can we put this rule of thumb to work? Suppose you are given a theory and a collection of maps taking models to models with the same representational capacities—that is, one is presented with a candidate “gauge theory” and a class of “gauge transformations.” One may then ask: are these gauge transformations naturally construed as isomorphisms of the models of the theory, understood as mathematical objects? If the answer is yes, then it would seem that these maps do not signal excess structure, since these maps would not be “missing” from a natural category of models; conversely, if the answer is no, then there likely is excess structure in the formulation.
Applying this diagnostic to some examples of gauge theories in the second sense above, we immediately see that the moral concerning EM3 generalizes to other Yang-Mills theories. For instance, models of an arbitrary Yang-Mills theory with structure group G may be written , where P is a principal G-bundle over some space-time (M, gab) and
is a principal connection on P.Footnote 15 In this setting, a “gauge transformation” is often defined as a (vertical) principal bundle automorphism
relating models
and
(see, e.g., Bleecker Reference Bleecker1981, sec. 3.2). But these maps are just a special class of connection- and metric-preserving principal bundle isomorphisms, and so although they do map between models with the same representational resources, they are not “extra” maps, in the sense of the gauge transformations of EM2. So Yang-Mills theory does not have excess structure in the sense discussed here.
Likewise, for general relativity, we characterize models of the theory as relativistic space-times, (M, gab). Here “gauge transformations” are often taken to be diffeomorphisms relating models
and
(see, e.g., Wald Reference Wald1984; Earman and Norton Reference Earman and Norton1987). But once again, these maps are just isometries (i.e., they are just isomorphisms of Lorentzian manifolds). So here, too, there is no excess structure.
5. Conclusion
I have isolated two strands of usage of the expression “gauge theory” in physics and philosophy of physics. According to one, a gauge theory is a theory that has excess structure; according to the other, a gauge theory is any theory whose dynamical variable is a connection on a principal bundle. I then endeavored to make precise the sense in which the paradigmatic example of a gauge theory (according to both strands)—classical electromagnetism—may be construed as having excess structure. From this discussion, I extracted a general criterion for when a theory has excess structure. From this criterion, I argued that gauge theories in the second sense need not have excess structure—and indeed, Yang-Mills theory and general relativity should not be construed as having excess structure in the sense that one formulation of electromagnetism does.