1 Introduction
The Axiom of Choice is a staple of modern mathematics which requires little introduction nowadays. However, as the reader is probably aware, the Axiom of Choice also has some controversial implications, especially in analysis (e.g., the existence of sets of reals which are not Lebesgue-measurable or without the Baire property). The controversial aspect of such results, and of the Axiom of Choice itself, ultimately stems from the nonconstructiveFootnote 1 nature of this axiom. Appealing to the Axiom of Choice means that we are unable to properly construct and define our objects, but merely prove their existence. To some people this approach is lacking substance; they would argue that the objects we construct in mathematics should be explicit. At any rate, effective proofs, or proofs which explicitly exhibit or construct some object, are perceived almost universally as being more satisfactory than mere existence proofs.
By weakening the Axiom of Choice to one of its countable variants we obtain the Principle of Dependent Choice
$(\mathsf {DC}),$
namely the statement that every partial order without maximal points has an infinite chain. This axiom is regarded by most mathematicians as innocuous, as it does not give rise to the construction of pathological objects in analysis and elsewhere in mathematics.Footnote
2
By its very formulation
$\mathsf {DC}$
is simply telling us that we can construct things recursively in
$\omega $
steps just by showing that the construction may proceed through each successive step. Even if the object promised to exist is still nonconstructive, this level of nonconstructiveness is felt to be a minimum that we should be happy to tolerate if we are to develop any reasonable mathematics of infinite objects.Footnote
3
In fact, and precisely because performing recursive constructions over the natural numbers is something we do all the time in many areas of mathematics,
$\mathsf {DC}$
promises to be a particularly successful axiom in those areas. This is actually true for classical analysis, most of which can be carried out in the theory
$\mathsf {ZF}+\mathsf {DC}$
.
The above considerations on the naturalness of
$\mathsf {DC}$
are meant as a motivational preamble to the work in the present paper. This is a paper in two parts. The first part studies to what extent is
$\mathsf {ZF}+\mathsf {DC}$
an adequate base to develop the theory of proper forcing, whereas the second part deals with the problem of deriving generic absoluteness for (some substantial class of) forcing notions in the theory
$\mathsf {ZF}+\mathsf {DC}$
supplemented with sufficiently strong large cardinal axioms. These two lines of research may look rather disconnected, but they share a goal, namely to see whether
$\mathsf {ZF}+\mathsf {DC}$
, which we have hinted at being a successful foundation for large portions of classical mathematics, is also a reasonable vantage point from which to look into more purely set-theoretic areas.
Our motivation for studying properness in
$\mathsf {ZF}+\mathsf {DC}$
originated in the earlier realization, by the second author, that
$\mathsf {DC}$
enables one to prove, and in fact is equivalent to, the existence of countable elementary submodels of
$V_\alpha $
, for all infinite ordinals
$\alpha $
;Footnote
4
in particular,
$\mathsf {ZF}+\mathsf {DC}$
provides a nonvacuous playground in which to study properness. On the other hand, the goal of our study of generic absoluteness over the base theory
$\mathsf {ZF}+\mathsf {DC}$
was to elucidate whether or not the use of the Axiom of Choice in its full power is an overkill for proving generic absoluteness for
$L(\mathbb {R})$
, or even for the Chang model, from large cardinals. This question is motivated by the role played by
$\mathsf {ZF}+\mathsf {DC}$
as a valid base theory for real analysis and the fact that most of real analysis lives in
$L(\mathbb {R})$
.Footnote
5
As already mentioned,
$\mathsf {ZF}+\mathsf {DC}$
is “the right theory for classical analysis.” In this paper we argue that it is also the right basic theory for handling much of the forcing technology relevant to the real line. This argument is provided through the results in the first and second part of this paper. In the first part of the paper we observe that Dependent Choice is enough—in fact the minimum needed—of the Axiom of Choice to develop the theory of proper forcing, and that proper forcing must preserve Dependent Choice. We prove the consistency of the Proper Forcing Axiom
$(\mathsf {PFA})$
by forcing, with a proper forcing notion, over any model of
$\mathsf {ZF}+\mathsf {DC}$
in which there is a supercompact cardinal. We also show that assuming
$\mathsf {PFA}$
, a stronger version of Dependent Choice, namely
$\mathsf {DC}_{\omega _1}$
, must hold. As a consequence, all typical applications of
$\mathsf {PFA}$
in
$\mathsf {ZFC}$
at the level of
$H(\omega _2)$
(see, for example, [Reference Baumgartner, Kunen and Vaughan3] or [Reference Moore22]) also go through in
$\mathsf {ZF}+\mathsf {DC}$
; in particular,
$\mathsf {PFA}$
implies, in
$\mathsf {ZF}+\mathsf {DC}$
, that
$2^{\aleph _0}=\aleph _2$
and that
$H(\omega _2)$
has a definable well-ordering (Theorem 5.3). We do not know how to prove that the Proper Forcing Axiom does not imply the Axiom of Choice,Footnote
6
but we do suggest a possible way to prove such a statement assuming some generic absoluteness of the Proper Forcing Axiom (together with reasonable large cardinal assumptions).
The second part of the paper begins with a (rather lengthy) discussion of the foundational significance of the generic absoluteness phenomenon. We then observe that generic absoluteness for the Chang model fails relative to forcing notions destroying
$\mathsf {DC}$
, and move on to proving that if
$\mathsf {ZF}+\mathsf {DC}$
holds and the supercompact cardinals form a stationary class in
$\mathrm {Ord}$
, then the theory of the Chang model is generically absolute with respect to forcing that preserves Dependent Choice (Theorem 8.6)Footnote
7
,
Footnote
8
and in fact with respect to
$\mathsf {DC}$
-preserving symmetric submodels of
$\mathsf {DC}$
-preserving forcing extensions (Theorem 9.1). We conclude the paper with some foundational remarks motivated by the proof of this last result.
1.1 The basic definitions
All our proofs will be carried out in
$\mathsf {ZF}$
. We will redefine the notion of
$H(\kappa )$
to work in a context where the Axiom of Choice may fail. We then use this definition when discussing proper forcing. In our paper a notion of forcing is a partially ordered set
$\mathbb {P}$
with a maximum, denoted by
$1_{\mathbb {P}}$
(or just
$1$
). If
$p,q\in \mathbb {P}$
, we say that q extends p, or that it is a stronger condition, if
$q\leq p$
. If two conditions have a common extension, we say that they are compatible. Rather than writing
$\mathrel {\Vdash }_{\mathbb {P}}$
to denote the forcing relation corresponding to a notion of forcing
$\mathbb {P}$
, we will omit the subscript and simply write
$\mathrel {\Vdash }$
whenever
$\mathbb {P}$
is clear from the context. A forcing
$\mathbb {P}$
is weakly homogeneous if for every two conditions p and q there is an automorphism
$\pi $
such that
$\pi p$
and q are compatible. If
$\mathbb {P}$
is weakly homogeneous, then for every
$p\in \mathbb {P}$
and every sentence
$\varphi $
in the forcing language for
$\mathbb {P}$
,
$p\mathrel {\Vdash }\varphi $
if and only if
$1\mathrel {\Vdash }\varphi $
. Also, if
$\mathbb {P}$
and
$\mathbb {Q}$
are weakly homogeneous forcing notions, then so is
$\mathbb {P}\times \mathbb {Q}$
.
We denote
$\mathbb {P}$
-names by
$\dot x$
, and if x is a set in the ground model, we denote the canonical name for x by
$\check x$
. If
$\dot x$
and
$\dot y$
are
$\mathbb {P}$
-names, we say that
$\dot y$
appears in
$\dot x$
if there is some condition p such that
$\langle p,\dot y\rangle \in \dot x$
. We then recursively define
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu1.png?pub-status=live)
Not assuming the Axiom of Choice, the terminology, definitions and notations surrounding cardinals become vague. We consider cardinals in the broader sense: if x can be well-ordered, then
$|x|$
is the least ordinal that has a bijection with x; otherwise,
$|x|$
is the Scott cardinal of x:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu2.png?pub-status=live)
for the least
$\alpha $
for which the intersection is nonempty. Greek letters (with the notable exception of
$\varphi $
and
$\pi $
) will denote ordinals, and specifically
$\aleph $
numbers when we assume that they denote infinite cardinals.
We write
$|x|\leq |y|$
to denote that there is an injection from x into y, and
$|x|\leq ^*|y|$
to denote that x is empty or else there is a surjection from y onto x. The notation
$|x|<|y|$
denotes the statement
$|x|\leq |y|$
and
$|y|\nleq |x|$
, and
$|x|<^*|y|$
is defined from
$\leq ^*$
similarly.Footnote
9
Finally, for a set x, the Hartogs number of x is
$\aleph (x)=\min \{\kappa \mid \kappa \nleq |x|\}$
and the Lindenbaum number of x is
$\aleph ^*(x)=\min \{\kappa \mid \kappa \nleq ^*|x|\}$
.
2 Hereditary sets
Structures of the form
$H(\kappa )$
, where
$\kappa $
is a cardinal, are one of the key constructions in set theory. These are usuallyFootnote
10
defined as
$\{x\mid | \operatorname {\mathrm {tcl}}(x)|<\kappa \}$
. Assuming that
$\kappa $
is a regular cardinal, these sets are models of
$\mathsf {ZFC}^-$
, namely all the axioms of
$\mathsf {ZFC}$
except the Power Set Axiom (which can hold for “small sets” when
$\kappa $
is sufficiently large). In almost all the definitions we have we can replace
$H(\kappa )$
with
$V_\alpha $
for “sufficiently nice
$\alpha ,$
” meaning one where enough of Replacement is satisfied. But this requires more care in noting what Replacement axioms are being used, and virtually nobody wants that. Not when a nice alternative like
$H(\kappa )$
is available.
In the absence of the Axiom of Choice, however, we want to have
$H(\kappa )$
reflect the fact that choice fails. This cannot be done when taking the same definition at face value, since it implies that if
$x\in H(\kappa )$
then x can be well-ordered. This means that we need to modify the definition of
$H(\kappa )$
.
Definition 2.1. Given a set x,
$H(x)$
is the class
$\{y\mid |x|\nleq^*| \operatorname {\mathrm {tcl}}(y)|\}$
. We denote by
$H({<}x)$
the union of the classes
$H(y)$
such that
$|y|<^*|x|$
.
Note that for an infinite cardinal
$\kappa $
,
$H(\kappa )=H({<}\kappa ^+)$
and if
$\kappa $
is a strong limit cardinal, then
$H({<}\kappa )=H(\kappa )$
.
Proposition 2.2. Let
$\kappa>\omega $
be an uncountable cardinal. The following properties hold:
-
1.
$H(\kappa )$ is a transitive set of height
$\kappa $ .
-
2.
$H(\kappa )=V_\kappa $ if and only if
$\kappa $ is a strong limit cardinal; in other words,
$\kappa \nleq ^*|V_\alpha |$ for all
$\alpha <\kappa $ .
-
3. If
$\kappa $ is regular, then
$H(\kappa )\models \mathsf {ZF}^-$ , and if it also a strong limit then
$H(\kappa )\models \mathsf {ZF}_2$ .
Proof. Clearly,
$H(\kappa )$
is a transitive class. We want to argue that
$H(\kappa )\subseteq V_\kappa $
, which implies that
$H(\kappa )$
is a set. For this we need the following claim: for a set x the following are equivalent:
-
1. The von Neumann rank of x is
$\alpha $ .
-
2.
$\alpha =\min \{\beta \mid \operatorname {\mathrm {tcl}}(x)\cap V_{\beta +1}\setminus V_\beta =\varnothing \}$ .
In other words, if x has rank
$\alpha $
, then its transitive closure contains elements of any rank below
$\alpha $
.
We prove this equivalence by
$\in $
-induction on x. Suppose that
$ \operatorname {\mathrm {rank}}(x)=\alpha $
. Then for all
$\beta <\alpha $
there is some
$y\in x$
such that
$\alpha> \operatorname {\mathrm {rank}}(y)\geq \beta $
. By the induction hypothesis,
$ \operatorname {\mathrm {tcl}}(y)\cap V_{\gamma +1}\setminus V_\gamma \neq \varnothing $
for all
$\gamma <\beta $
, and therefore
$ \operatorname {\mathrm {tcl}}(x)\cap V_{\gamma +1}\setminus V_\gamma \neq \varnothing $
; moreover
$y\in \operatorname {\mathrm {tcl}}(x)\cap V_{\beta +1}\setminus V_\beta $
, so indeed
$\alpha $
is the minimal ordinal satisfying (b). On the other hand, assuming (b) it is clear that
$ \operatorname {\mathrm {rank}}(x)\geq \alpha $
. But if
$ \operatorname {\mathrm {rank}}(x)>\alpha $
, then for some
$y\in x$
the rank of y is at least
$\alpha $
, in which case the above shows that
$ \operatorname {\mathrm {tcl}}(x)\cap V_{\alpha +1}\setminus V_\alpha \neq \varnothing $
, so
$\alpha $
is not the minimal ordinal satisfying (b).
Using this equivalence we argue that if
$ \operatorname {\mathrm {rank}}(x)>\kappa $
, then
$x\notin H(\kappa )$
. This is clear, since
$ \operatorname {\mathrm {tcl}}(x)$
can be mapped onto
$\kappa $
by mapping each element to its rank. So indeed
$H(\kappa )\subseteq V_\kappa $
, and so
$H(\kappa )$
is a transitive set.
If
$\kappa $
is a strong limit cardinal, then by definition
$V_\alpha \in H(\kappa )$
for all
$\alpha <\kappa $
, and equality follows.
Finally, if
$\kappa $
is regular, and
$x\in H(\kappa )$
, then every function from x to
$\kappa $
is bounded, in particular, if
$f\colon x\to H(\kappa )$
, then
$f\in H(\kappa )$
as well, so
$H(\kappa )$
satisfies second-order Replacement, and by transitivity of
$H(\kappa )$
we get all the other axioms of
$\mathsf {ZF}^-$
. And of course, if
$\kappa $
is also a strong limit cardinal, the above implies that
$H(\kappa )=V_\kappa \models \mathsf {ZF}_2$
.□
Unfortunately, despite what we said at the beginning of this section,
$\mathsf {ZF}$
does not guarantee that there are any uncountable regular cardinals. This means, in turn, that we cannot guarantee that
$H(\kappa )$
will satisfy Replacement axioms. Nevertheless, models of the form
$H(\kappa )$
are somewhat more natural than models of the form
$V_\alpha $
, as they provide us with an ordinal structure which is slightly more “reasonable” and they will also satisfy Mostowski’s collapse lemma (as opposed to
$V_{\omega +\omega }$
, for example, which will not).
Remark 2.3. It is interesting to note that the requirement that
$\kappa $
is an
$\aleph $
is crucial for the proof that
$H(\kappa )$
is even a set. The definition can be used with any cardinal x, but if x is not well-ordered, then
$|x|\nless ^*\kappa $
for any ordinal
$\kappa $
, meaning that
$\mathrm {Ord}\subseteq H(x)$
. In the definition, we could have artificially cropped
$H(x)$
when x is not well-orderable, in order to ensure that
$H(x)$
remains a set, but we want to somehow reflect the structure of the universe in a natural way and this cropping seems artificial and not justified by the mathematics at this point.
Question 2.4. Is there any particular use for
$H(x)$
for an arbitrary x in
$\mathsf {ZF}$
? Do the proper classes
$H(x)$
all satisfy any particular weak set theory of interest?
Proposition 2.5. The class
$\{H({<}\kappa )\mid \kappa \text { is an }\aleph \text {-number}\}$
is a continuous filtration of the universe.
Proof. Trivially this class is a continuous sequence of transitive sets. If x is any set, then taking
$\kappa =\aleph ^*( \operatorname {\mathrm {tcl}}(x))$
implies that
$x\in H(\kappa )=H({<}\kappa ^+)$
. □
Remark 2.6. It is consistent that
$\aleph ^*(\mathbb {R})=\kappa $
is a limit cardinal, in which case
$\mathbb {R}\in H(\kappa )$
but
$\mathbb {R}\notin H({<}\kappa )$
. This is a consequence of the Axiom of Determinacy (see exercise 28.16 in [Reference Kanamori14]), for example, but the consistency of this statement does not require any large cardinals.Footnote
11
3 Dependent choice
Definition 3.1. Let T be a tree, i.e., a well-founded partial order whose lower cones are chains.
-
1. We say that T is
$\kappa $ -closed if for every
$\alpha <\kappa $ , every increasing sequence of nodes in T indexed by
$\alpha $ has an upper bound.
-
2. We say that
$B\subseteq T$ is a branch if it is a maximal chain, and a cofinal branch is a branch which meets all the levels of the tree.
Definition 3.2. Let
$\kappa $
be an infinite cardinal. We denote by
$\mathsf {DC}_\kappa $
the following statement: Every
$\kappa $
-closed tree without maximal elements has a chain of order type
$\kappa $
. We use
$\mathsf {DC}_{<\kappa }$
to abbreviate
$\forall \lambda <\kappa ,\mathsf {DC}_\lambda $
, and for
$\kappa =\aleph _0$
we simply write
$\mathsf {DC}$
.Footnote
12
Among the basic consequences of
$\mathsf {DC}$
we have the Axiom of Choice for countable families of nonempty sets, the countability of every countable union of countable sets, the regularity of
$\omega _1$
, and many others.
Lemma 3.3. Every well-orderable tree has a maximal chain. In particular, if T is a
$\kappa $
-closed well-orderable tree without maximal elements, then T has a chain of order type at least
$\kappa $
.
Proof. Enumerate T and proceed by recursion. □
Theorem 3.4. The following are equivalent:
-
(1)
$\mathsf {DC}$ .
-
(2) The Löwenheim–Skolem theorem for countable languages: every structure in a countable language has a countable elementary submodel.
-
(3) For every
$\alpha \geq \omega $ and every countable
$A\subseteq V_\alpha $ there is a countable elementary submodel M of
$V_\alpha $ such that
$A\subseteq M$ .
-
(4) For every
$\alpha \geq \omega $ there is a countable elementary submodel
$M\prec \langle V_\alpha ,\in \rangle $ .
The observation that (1) and (2) are equivalent is easy to make, but is attributed to George Boolos, and appears in [Reference Boolos and Jeffrey4] as exercise 13.3.Footnote 13
Proof. The proof that (1) implies (2) is the usual proof of the Löwenheim–Skolem theorem, noting that it does not use more than
$\mathsf {DC}$
. Namely, given a countableFootnote
14
structure M in a relational language
$\mathcal L$
(every constant and function can be seen as a relation), we define
$N\subseteq M$
by recursion:
-
(1)
$N_0=\varnothing $ .
-
(2) Suppose that
$N_k$ was defined and is countable. For every formula of the form
$\exists y\varphi (y,\vec x)$ , and every finite sequence
$\vec n$ from
$N_k$ , choose
$y_{\varphi ,\vec n}$ such that
$M\models \varphi [y_{\varphi ,\vec n},\vec n]$ . Note that there are only countably many formulae and countably many finite sequences, so we need to make countably many choices, which is doable by
$\mathsf {DC}$ . Then we define
$$ \begin{align*}N_{k+1}=N_k\cup\{y_{\varphi,\vec n}\mid\varphi\text{ is a formula}, \vec n\in N_k^{|\vec n|}\}.\end{align*} $$
Finally, let
$N=\bigcup _{k\in \omega } N_k$
. Then N is a countable substructure in the language
$\mathcal L$
. It is not hard to verify that
$N\prec M$
. We needed
$\mathsf {DC}$
, rather than just the axiom of choice for countable families of sets, since the definition of each
$N_k$
depends on the previous choices made.
The implications
$(2)\implies (3)$
and
$(3)\implies (4)$
are easy and trivial, respectively. The last implication is obtained by noting that if
$\mathsf {DC}$
fails, then there is some
$\alpha $
such that
$\mathsf {DC}$
fails in
$V_\alpha $
. Taking a countable elementary submodel M of
$V_\alpha $
, there is some
$T\in M$
such that T is a counterexample to
$\mathsf {DC}$
. By elementarity,
$T\cap M$
is a subtree of T without maximal elements. Hence, since
$T\cap M$
is countable, we may easily build an infinite branch of
$T\cap M$
by recursion using a well-order on
$T\cap M$
(Lemma 3.3). But this is a contradiction. □
Of course, in the above equivalence, we can replace
$V_\alpha $
by
$H(\aleph _\alpha )$
, or any other filtration of the universe.
Part.1 CCC, properness, and forcing axioms without choice
4 Proper forcing
Definition 4.1. Let
$\mathbb {P}$
be a notion of forcing. We say that
$\mathbb {P}$
is proper if for any sufficiently large
$\kappa $
Footnote
15
and every countable elementary submodel
$M\prec (H(\kappa ),\in ,\mathbb {P})$
, if
$p\in \mathbb {P}\cap M$
, then p has an extension which is M-generic. Namely, there is some
$q\leq p$
such that for any open dense
$D\in M$
,
$D\cap M$
is predense below q.
Without choice properness can be a bit quirky.
Proposition 4.2. The following are equivalent:
-
(1)
$\mathsf {DC}$ .
-
(2)
$ \operatorname {\mathrm {Col}}(\omega ,\omega _1)$ is not proper.
-
(3) There exists a forcing which is not proper.
Proof. Assume
$\mathsf {DC}$
holds and let
$\mathbb {P}$
be a proper forcing. Suppose
$\dot f$
is a name such that for some condition
$p\in \mathbb {P}$
,
$p\mathrel {\Vdash }\dot f\colon \check \omega \to \check \omega _1$
. Let M be a countable elementary submodel of a large enough
$H(\kappa )$
such that
$\mathbb {P},p,\dot f\in M$
. Since M is an elementary submodel,
$\omega \in M$
, and therefore for each
$n<\omega $
, there is a predense set
$D_n\in M$
whose conditions all decide the value of
$\dot f(\check n)$
, which by elementarity is an ordinal in M. Let
$q\leq p$
be an M-generic condition. Then q must force that the range of
$\dot f$
must in fact lie in
$M\cap \omega _1$
. However,
$\mathsf {DC}$
implies that
$\omega _1$
is regular, and so this set is bounded. In particular, this means that q forces that
$\dot f$
has a bounded range and therefore is not surjective. In turn this means that for any name
$\dot f$
there is no condition forcing that
$\dot f$
is a surjection from
$\omega $
onto
$\omega _1$
. Therefore
$\mathsf {DC}$
implies that no proper forcing can collapse
$\omega _1$
and, in particular,
$ \operatorname {\mathrm {Col}}(\omega ,\omega _1)$
is not a proper forcing.
The implication from (2) to (3) is trivial. Finally, if
$\mathsf {DC}$
fails, then the definition of properness holds vacuously by the definition of “sufficiently large
$\kappa ,$
” thus making every forcing proper. □
Remark 4.3. We could have replaced “sufficiently large” by explicitly requiring
$\kappa $
to be any fixed cardinal such that
$H(\kappa )$
has both the power set of
$\mathbb {P}$
and the basic tools needed for defining the forcing relation. One could also define a hierarchy by saying that
$\mathbb {P}$
is “proper below
$\kappa ,$
” where
$\kappa $
is fixed in the definition of properness, and then define proper as “for all
$\kappa $
,
$\mathbb {P}$
is proper below
$\kappa .$
” However, we feel that the above is a good definition as it is flexible enough to allow for a certain degree of freedom in choosing
$\kappa $
and it provides us with this nice characterization of
$\mathsf {DC}$
.
On the other hand, if one decides that it is best to require something explicit about
$\kappa $
, or stick to the “properness below
$\kappa $
” definition, then the failure of
$\mathsf {DC}$
at
$H(\omega _1)$
(e.g., if there is a Dedekind-finite set of reals) will imply that every forcing is proper, whereas this will not be the case if
$\mathsf {DC}$
first fails at some point above
$H(\omega _1)$
. So we would be getting an odd equivalence of “every forcing is proper” given that choice of ‘properness’; avoiding this oddity is a good motivation for picking the definition of properness as we did above.
Proposition 4.4. Every
$\sigma $
-closed forcing is proper.
Proof. Let
$\mathbb {P}$
be a
$\sigma $
-closed forcing. If M is a countable elementary submodel of some
$H(\kappa )$
with
$\mathbb {P}\in M$
, then enumerate the dense open sets in M as
$\{D_n\mid n\in \omega \}$
and enumerate
$\mathbb {P}\cap M$
as
$\{p_n\mid n\in \omega \}$
, and for any
$p\in \mathbb {P}\cap M$
, recursively construct a decreasing sequence such that
$q_0=p$
and
$q_{n+1}$
is the least
$p_k$
such that
$p_k\in D_n$
and
$p_k\leq q_n$
. Since
$\mathbb {P}$
is
$\sigma $
-closed, there is some lower bound q of all the
$q_n$
’s and it is clear that q is an M-generic condition. □
One ambition might be to prove that every ccc forcing is proper, but without the Axiom of Choice chain conditions are tricky to even formulate. Mirna Džamonja suggested in private communication to the authors to follow Mekler’s footsteps in [Reference Mekler20] and formulate ccc, at least in the presence of
$\mathsf {DC}$
, by saying that every condition is M-generic for any countable elementary submodel M. Of course, under this definition it is trivial that every ccc forcing is proper.
Theorem 4.5. Let X be any set, and let
$ \operatorname {\mathrm {Add}}(\omega ,X)$
denote the forcing whose conditions are finite partial functions
$p\colon X\times \omega \to 2$
, ordered by reverse inclusion. Then
$\mathsf {DC}$
implies that
$ \operatorname {\mathrm {Add}}(\omega ,X)$
is a ccc forcing, for any X.
Of course we can immediately assume that X is not empty, otherwise this is an atomic forcing, which is entirely uninteresting (and it is certainly proper). This theorem is also immediately true for any well-orderable set X, as then
$ \operatorname {\mathrm {Add}}(\omega ,X)$
is also well-orderable and therefore admits maximal antichains and the standard
$\mathsf {ZFC}$
arguments work with it without using choice. Nevertheless, we will only use the assumption that X is not empty in the following proof.
Proof. Suppose that
$M\prec H(\kappa )$
for any large enough
$\kappa $
such that X,
$ \operatorname {\mathrm {Add}}(\omega ,X)$
and D are all in M, where D is a dense subset of
$ \operatorname {\mathrm {Add}}(\omega ,X)$
. Let p be any condition in
$ \operatorname {\mathrm {Add}}(\omega ,X)$
, and let
$p'=p\cap M\in M$
. Then there is some
$q\in D\cap M$
which is compatible with
$p'$
by predensity of D. Moreover,
$ \operatorname {\mathrm {dom}} (p\setminus p')\cap M=\varnothing $
, so p is also compatible with q. In other words,
$\varnothing $
, the maximum condition of
$ \operatorname {\mathrm {Add}}(\omega ,X)$
, is M-generic, and so every condition is M-generic, for every M. □
Theorem 4.6. If
$\mathbb {P}$
is proper and
$\mathsf {DC}$
holds, then
$1\mathrel {\Vdash }\mathsf {DC}$
.
Proof. Let
$\dot T$
be a
$\mathbb {P}$
-name for a tree of height
$\omega $
without maximal nodes and let
$p\in \mathbb {P}$
. Let M be a countable elementary submodel of some sufficiently large
$H(\kappa )$
such that
$\dot T$
, p,
$\mathbb {P}\in M$
.
Let q be an M-generic condition extending p and let
$\dot T_*=M\cap \dot T$
. For each
$n<\omega $
, let
$D_n$
be the collection of conditions
$p\in \mathbb {P}$
for which there is some
$\mathbb {P}$
-name
$\dot t$
such that
$p\mathrel {\Vdash }``\dot t$
is a node in
$\dot T$
of level
$n.$
” For every n,
$D_n$
is a dense subset of
$\mathbb {P}$
in M. Hence,
$D_n\cap M$
is predense below q for each n, and as a consequence q forces that
$\dot T_*$
is a subtree of
$\dot T$
of height
$\omega $
. By a similar argument, we also have that q forces
$\dot T_*$
not to have any maximal nodes. But then, since of course
$\dot T_*$
is forced to be countable, q forces that
$\dot T_*$
has a branch by Lemma 3.3. We have shown that every condition in
$\mathbb {P}$
can be extended to a condition adding a branch through
$\dot T$
, which means that
$1\mathrel {\Vdash }\mathsf {DC}$
. □
The above theorem shows that one cannot violate
$\mathsf {DC}$
with a proper forcing, which is somewhat similar to the fact that one cannot violate
$\mathsf {AC}$
with any forcing. This fact provides an argument that
$\mathsf {ZF}+\mathsf {DC}$
is a good base theory for working with proper forcing. This argument is strengthened by the following theorem.
Theorem 4.7.
-
(1) Two-step iterations of proper forcings are proper.
-
(2) Countable support iterations of proper forcings are proper.
Proof. If
$\mathsf {DC}$
fails, then this holds vacuously. If
$\mathsf {DC}$
holds, the usual proofs as given in the context of
$\mathsf {ZFC}$
also work here, noting that all relevant countable choices in the
$\mathsf {ZFC}$
proofs can equally be made in the present context; for a reference see theorem 2.7 in [Reference Abraham, Foreman and Kanamori1]. However, the following caveat is needed here.
In
$\mathsf {ZFC}$
we usually define the iteration
$\mathbb {P}\ast \dot {\mathbb {Q}}$
as a partial order on the collection of ordered pairs of the form
$\langle p,\dot q\rangle $
, where
$p\in \mathbb {P}$
and
$\dot q$
is a
$\mathbb {P}$
-name such that
$1\mathrel {\Vdash }\dot q\in \dot {\mathbb {Q}}$
. We could replace this with
$p\mathrel {\Vdash }\dot q\in \dot {\mathbb {Q}}$
, and while for a two-step iteration this is entirely inconsequential, for a countable support iteration this matters a lot.
Moving from
$p\mathrel {\Vdash }\dot q\in \dot {\mathbb {Q}}$
to
$1\mathrel {\Vdash }\dot q\in \dot {\mathbb {Q}}$
usually requires the Axiom of Choice. But we can circumvent this as follows: if
$\dot q$
is a name which appears in
$\dot {\mathbb {Q}}$
and
$p\mathrel {\Vdash }\dot q\in \dot {\mathbb {Q}}$
, then define
$\dot q_*$
as
$\bigcup \{1_{{\mathbb {Q}}}\mathbin \upharpoonright p'\mid p'\perp p\}\cup \dot q\mathbin \upharpoonright p$
, and easily
$1\mathrel {\Vdash }\dot q_*\in \dot {\mathbb {Q}}$
and
$p\mathrel {\Vdash }\dot q_*=\dot q$
. □
5 The proper forcing axiom
Definition 5.1. The Proper Forcing Axiom (
$\mathsf {PFA}$
) states that if
$\mathbb {P}$
is a proper forcing, then for every collection
$\mathcal {D}=\{D_\alpha \mid \alpha <\omega _1\}$
of dense open subsets of
$\mathbb {P}$
there exists a
$\mathcal {D}$
-generic filter.
Proposition 5.2.
$\mathsf {PFA}$
implies
$\mathsf {DC}_{\omega _1}$
.
Proof. Let T be a
$\sigma $
-closed tree of height
$\omega _1$
and with no maximal elements. Then by Proposition 4.4, T is proper. Consider
$D_\alpha =T\setminus T\mathbin \upharpoonright \alpha $
, for
$\alpha <\omega _1$
. Then
$D_\alpha $
is a dense open set, so by
$\mathsf {PFA}$
there is a generic filter meeting each
$D_\alpha $
, which is a branch in T. □
Theorem 5.3.
$\mathsf {PFA}$
implies that
$2^{\aleph _0}=2^{\aleph _1}=\aleph _2$
and that there is a well-order of
$H(\aleph _2)$
of length
$\omega _2$
definable over
$H(\aleph _2)$
from parameters.
Proof. First note that by
$\mathsf {DC}_{\omega _1}$
, we have that
$\aleph _1\leq 2^{\aleph _0}$
, and applying
$\mathsf {PFA}$
to the binary tree
$2^{<\omega }$
we get that
$\aleph _1<2^{\aleph _0}$
. Moreover, by
$\mathsf {DC}_{\omega _1}$
we can choose a ladder system on
$\omega _1$
, i.e., a sequence
$\langle C_\alpha \mid \alpha <\omega _1\text { is a limit ordinal }\rangle $
such that each
$C_\alpha $
is a cofinal subset of
$\alpha $
of order type
$\omega $
. It then also follows that there is a sequence
$\langle S_\alpha \mid \alpha <\omega _1\rangle $
of pairwise disjoint stationary subsets of
$\omega _1$
.
Next we want to point out that Moore’s proof that the Mapping Reflection Principle (
$\mathsf {MRP}$
) implies that
$2^{\aleph _0}=2^{\aleph _1}=\aleph _2$
in [Reference Moore22] can be repeated under
$\mathsf {PFA}$
even without choice. The following changes with respect to the proof as presented in Moore’s paper are necessary:
-
(1) Require that
$ \operatorname {\mathrm {cf}}(\theta )>\omega _1$ in the statement of
$\mathsf {MRP}$ (Definition 2.3).
-
(2) Replace
$H((2^{\omega _1})^+)$ with
$H(\aleph ^*(2^{\omega _1}))$ in Lemma 4.4.
-
(3) Note that the existence of a ladder system and
$\mathsf {DC}_{\omega _1}$ are enough for the proofs of Proposition 4.2 and Theorem 4.3 to go through.□
Thus,
$\mathsf {PFA}$
implies
$H(\omega _2)\models \mathsf {AC}$
and hence that all the richness of the usual
$\mathsf {ZFC}$
combinatorics is available to this structure.
Remark 5.4. One could argue that the correct formulation of forcing axioms (and
$\mathsf {PFA}$
specifically) should require that
${<}2^{\aleph _0}$
dense open sets admit a filter meeting them all, and add that
$\aleph _1\neq 2^{\aleph _0}$
to avoid trivialities. In that case in models of
$\mathsf {ZF}+\mathsf {DC}+(|x|<2^{\aleph _0}\rightarrow |x|\leq \aleph _0)$
, e.g., in Solovay’s model (see [Reference Solovay26]) or models of
$\mathsf {AD}$
, this type of axiom holds vacuously since by
$\mathsf {DC}$
every countable family of dense open sets admits a filter meeting them, and every collection of strictly less than
$2^{\aleph _0}$
sets is necessarily countable. We feel that this vacuity is a good reason not to adopt such a definition.
Of course, we can replace
${<}2^{\aleph _0}$
by
${<}^*2^{\aleph _0}$
, namely require that there is a surjective map from the continuum onto the family of dense open sets but there is no surjection in the other direction. Since there is a surjection from the continuum onto
$\omega _1$
, and we require that
$2^{\aleph _0}\neq \aleph _1$
, there is no surjection in the other direction. Therefore formulating
$\mathsf {PFA}$
this way implies that every collection of
$\aleph _1$
dense open sets admits a generic filter. This is enough to prove
$\mathsf {DC}_{\omega _1}$
, and the above arguments then imply that
$2^{\aleph _0}=\aleph _2$
.
Even weakening
$\mathsf {PFA}$
to Martin’s Axiom (assuming
$\mathsf {DC}$
and defining ccc as “every condition is M-generic for every countable model M”), the corresponding forcing axiom applied to
$ \operatorname {\mathrm {Add}}(\omega ,X)$
, which is a ccc forcing by Theorem 4.5, would imply that if there is a surjection from the continuum onto X, then either there is a surjection from X onto the reals or there is an injection from X into the continuum. And in particular
$\aleph _1<2^{\aleph _0}$
.
We finish this section with the outline of the consistency of
$\mathsf {PFA}$
relative to a supercompact cardinal in
$\mathsf {ZF}+\mathsf {DC}$
.
Definition 5.5. (Woodin, definition 220 in [Reference Woodin29]) For an ordinal
$\alpha $
we say that
$\kappa $
is
$V_\alpha $
-supercompact if there exists some
$\beta>\alpha $
and an elementary embedding
$j\colon V_\beta \to N$
such that:
-
(1) N is a transitive set and
$N^{V_\alpha }\subseteq N$ ,
-
(2) the critical point of j is
$\kappa $ (in particular j is nontrivial),
-
(3)
$\alpha <j(\kappa )$ .
If
$\kappa $
is
$V_\alpha $
-supercompact for all
$\alpha $
, we say that it is a supercompact cardinal.
Assuming the Axiom of Choice holds, this definition is of course equivalent to the standard definition by deriving fine and normal measures from the elementary embeddings.
Proposition 5.6. If
$\kappa $
is a supercompact cardinal, then the countable support iteration of length
$\kappa $
using lottery sums of counterexamples to
$\mathsf {PFA}$
of minimal rank forces
$\mathsf {PFA}$
in
$\mathsf {ZF}+\mathsf {DC}$
.
This approach to the consistency of
$\mathsf {PFA}$
is due to Philipp Schlicht and is unpublished. It appears in Julian Schölder’s masters thesis, [Reference Schölder24], where it is also extended. For the convenience of the reader, we sketch below the idea behind the proof.
Proof. We define an iteration
$\mathbb {P}_\alpha $
for
$\alpha \leq \kappa $
, which is a countable support iteration, and in which the
$\alpha $
th iterand is the lottery sum of all names of counterexamples to
$\mathsf {PFA}$
of minimal rank
${<}\kappa $
(if these exist; otherwise it is the trivial forcing).
If
$\mathsf {PFA}$
does not hold in the generic extension, let
${\mathbb {Q}}$
be a rank-minimal counterexample and
${\mathcal {D}}=\{D_\alpha \mid \alpha <\omega _1\}$
be a collection of dense open sets witnessing that
$\mathsf {PFA}$
fails. Let
$\dot {\mathbb {Q}}$
and
$\dot {\mathcal {D}}$
be names for
${\mathbb {Q}}$
and
${\mathcal {D}}$
, respectively, and let
$p\in \mathbb {P}_\kappa $
force that
$\dot {\mathbb {Q}}$
and
$\dot {\mathcal {D}}$
form a counterexample to
$\mathsf {PFA}$
of minimal rank. Choose
$\eta $
to be a large enough ordinal such that the above is true in
$V_\eta $
and there is an elementary embedding
$j\colon V_\eta \to N$
with critical point
$\kappa $
and such that
$j(\kappa )$
is above the rank of
$\dot {\mathbb {Q}}$
and
$\dot {\mathcal {D}}$
, where N is transitive and sufficiently closed. In particular we may assume that
$\dot {\mathbb {Q}}$
,
$\dot {\mathcal {D}}\in N$
.
By the definition of our iteration and the closure of N, we may extend p, in
$j(\mathbb {P}_\kappa )$
, to a condition
$p'$
which, at stage
$\kappa $
, selects
$\dot {\mathbb {Q}}$
among the choices of the lottery at that stage. Let H be a
$j({\mathbb {Q}})$
-generic filter over N such that
$p'\in H$
and let
$G=H\cap \mathbb {P}_\kappa $
. By a standard lifting argument,
$j\colon V_\eta \to N$
lifts to an elementary embedding from
$V_\eta [G]$
to
$N[H]$
, which we will also denote by j. Let I be the
$\dot {\mathbb {Q}}^G$
-generic filter over V given by H. We may assume that
$N[H]$
contains
$j``V_\alpha $
for some
$\alpha $
containing all relevant names of conditions in
$\dot {\mathbb {Q}}$
, and hence that it contains
$I':=j``I$
. But the interpretation of
$I'$
by H generates a filter of
$j(\dot {\mathbb {Q}})^H$
meeting all members of
$j(\dot {\mathcal {D}})^H$
. It follows that
$p'$
does not force, in
$j(\mathbb {P}_\kappa )$
and over N, that
$j(\dot {\mathbb {Q}})$
and
$j(\dot {\mathcal {D}})$
form a counterexample to
$\mathsf {PFA}$
, and therefore neither does p. But then, by elementarity of j, p does not force, in
$\mathbb {P}_\kappa $
and over
$V_\eta $
, that
$\dot {\mathbb {Q}}$
and
$\dot {\mathcal {D}}$
form a counterexample to
$\mathsf {PFA}$
, which is a contradiction. □
6 Does PFA imply the Axiom of Choice?
Probably not. In fact, if
$\delta <\kappa $
are such that
$\delta $
is supercompact and
$\kappa $
is a Reinhardt cardinal, then forcing
$\mathsf {PFA}$
via a forcing
$\mathbb {P}\subseteq V_\delta $
as in Proposition 5.6 will preserve the Reinhardtness of
$\kappa $
.
We would have liked, of course, to produce a model of
$\mathsf {PFA}$
without the Axiom of Choice starting from a more reasonable assumption (in the region of supercompact cardinals). However, this seems to be a much more difficult task than one would originally expect it to be. We believe that outlining this difficulty is constructive, and raises some interesting questions about the absoluteness of
$\mathsf {PFA}$
even between models of
$\mathsf {ZFC}$
.
The “obvious construction” would be to start with a model of
$\mathsf {ZFC}+\mathsf {PFA}$
, and then construct a symmetric extension ‘à la the first model Cohen’ (see chap. 5 in [Reference Jech13]), using
$ \operatorname {\mathrm {Add}}(\omega _2,\omega _2)$
as our forcing,Footnote
16
thus obtaining an intermediate model which is closed under
$\omega _1$
-sequences. Using the König–Yoshinobu Theorem from [Reference König and Yoshinobu17], the full generic extension must satisfy
$\mathsf {PFA}$
, so the intermediate model should too.
However, when getting down to brass tacks, one sees that the question is this: If
$\mathbb {P}$
is a proper forcing, is
$\mathbb {P}$
also proper in a larger model which agrees on the same
$\omega _1$
-sequences?
This leads us to a very interesting question, which even in the context of
$\mathsf {ZFC}$
has no obvious answer:
Question 6.1. Suppose that
$\mathbb {P}$
is an
$\omega _2$
-directed-closed forcing and
$1\mathrel {\Vdash }\mathsf {PFA}$
. Does that mean that
$\mathsf {PFA}$
holds in the ground model?
Of course, to make our example, we would need to argue with a proof in a model of
$\mathsf {ZF}+\mathsf {DC}_{\omega _1}$
, rather than a model of
$\mathsf {ZFC}$
. But we are optimistic that a proof in
$\mathsf {ZFC}$
will also work—mutatis mutandis—in
$\mathsf {ZF}+\mathsf {DC}_{\omega _1}$
.
When approaching the above question, it seems that an easy way to solve it would be to prove the following statement: if
${\mathbb {Q}}$
is an
$\omega _2$
-directed-closed forcing and
$\mathbb {P}$
is proper, then
$1_{{\mathbb {Q}}}\mathrel {\Vdash }_{{\mathbb {Q}}}``\check {\mathbb {P}}$
is proper.” Yasuo Yoshinobu proved that even in
$\mathsf {ZFC}$
, this question admits a negative answer.
Theorem 6.2. (Yoshinobu [Reference Yoshinobu32];
$\mathsf {ZFC}$
) Given a regular cardinal
$\kappa>\omega $
, there is a forcing of the form
$ \operatorname {\mathrm {Add}}(\omega ,1)\ast \operatorname {\mathrm {Col}}(\omega _1,2^\kappa )\ast \dot {\mathbb {Q}}$
, where
$\dot {\mathbb {Q}}$
is a ccc forcing, which collapses
$\omega _1$
after forcing with
$ \operatorname {\mathrm {Add}}(\kappa ,1)$
.
Part.2 Generic absoluteness without choice
7 Motivation and limitations
As mathematicians, we would like to have a foundation of mathematics T which, apart from having other virtues, is powerful enough that it answers all, or most of, the questions we may be naturally interested in when doing mathematics. There is widespread agreement that set theory provides a universal language to do mathematics: we imagine an entity, V, that we think of as ‘the set-theoretic universe’, i.e., the collection of all pure sets,Footnote
17
and observe that every question in mathematics seems to be translatable into a question about what holds true in the set-theoretic universe. Hence, it is just natural to try to find such a foundation of mathematics phrased in the language of set theory. To be clear, this foundation of mathematics should be a first order theory T in the language of set theory which, besides having the desired virtues—e.g., we would like to have some evidence that the description of V given by T is in fact consistent—is such that we can prove, for a variety of natural mathematical statements
$\sigma $
we may potentially be interested in, that every model of T is a model of
$\sigma $
(or that every model of T is a model of
$\lnot \sigma $
).
The standard axiomatic system for set theory,
$\mathsf {ZFC}$
, has the virtuesFootnote
18
one would expect from a satisfactory foundation of mathematics. However, and since
$\mathsf {ZFC}$
is computable and interprets Peano arithmetic, Gödel’s incompleteness theorem tells us that this theory is incomplete if it is consistent, and that in fact there are arithmetical statements that the theory does not decide. Moreover, any consistent strengthening of
$\mathsf {ZFC}$
obtained by adding finitely many axioms to it will be affected by the incompleteness phenomenon in exactly the same way.
Large cardinal axioms are a family of ‘natural’ principles, intending to supplement
$\mathsf {ZFC}$
and asserting the existence of some very high cardinals
$\kappa $
with strong properties of some sort.Footnote
19
They tend to be phrased as reflection principles for V, or as principles which one can proveFootnote
20
to be equivalent to second order statements about V asserting the existence of elementary embedding, different from the identity, from V into some submodel M, and where M has strong closure properties within V. These axioms tend to be linearly ordered, in the sense that for any two such axioms
$\psi _0$
and
$\psi _1$
, it is usually the case that one of the corresponding theories, say
$\mathsf {ZFC}+\psi _1$
, proves the consistency of
$\mathsf {ZFC}+\psi _0$
, and in fact the existence of a rank-initial segment of the universe satisfying
$\mathsf {ZFC}+\psi _0$
, or vice versa.Footnote
21
Thus, these axioms form a hierarchy of more and more daring principles, stretching from just
$\mathsf {ZFC}$
, all the way up to inconsistency.Footnote
22
Historically, the motivation behind large cardinal axioms has come from a variety of sources. In any case, a general underlying motivation for these axioms is the intuition that the set-theoretic universe ought to be very large. We should point out that here we are dealing with a truly open-ended family of principles.Footnote
23
In particular, there is no ‘official’ definition of large cardinal axiom, but set-theorists surely recognize such axioms when they meet them.
It seems to be an empirical fact that every consistent theory T occurring in mathematical practice can be interpreted in
$\mathsf {ZFC}+\psi $
for some large cardinal axiom
$\psi $
, where this is understood at least in the weak sense of there being a large cardinal axiom
$\psi $
such that
$\mathsf {ZFC}+\psi $
proves the consistency of T.Footnote
24
Moreover, given that large cardinal axioms tend to be linearly ordered in the way we mentioned, the sets of
$\Pi ^0_1$
sentences implied by consistent large cardinal axioms (tend to) form a linearly ordered sequence, under inclusion, of sets of true
$\Pi ^0_1$
sentences. These two points together suggest the appealing picture of an open-ended increasing sequence of
$\Pi ^0_1$
-theories of arithmetic, each one of them characterized as the set of
$\Pi ^0_1$
theorems of
$\mathsf {ZFC}$
together with some consistent large cardinal axiom, and whose union is the true
$\Pi ^0_1$
-theory of
$\mathbb N$
. The suggestion that the
$\Pi ^0_1$
-theory of
$\mathbb N$
is actually given by the union of these
$\Pi ^0_1$
-theories is justified by the contention that every true
$\Pi ^0_1$
sentence ought to be implied by some consistent reasonable theory T possibly occurring in mathematical practice,Footnote
25
together with the empirical fact, mentioned above, that if
$\sigma $
is a
$\Pi ^0_1$
theorem of such a theory T, then there is a large cardinal axiom
$\psi $
with the property that every model M of
$\mathsf {ZFC}+\psi $
contains a model N of T. N need not be an
$\omega $
-model from the point of view of M, but the fact that
$\sigma $
holds in N entails that it also holds in M. We could in fact use this picture as a guiding principle when thinking about the upper reaches of the large cardinal hierarchy.Footnote
26
The bad news is that there are very natural mathematical questions beyond
$\Pi ^0_1$
-arithmetic, the answer to which is independent from any genuine large cardinal axiom. The most prominent such question, already asked by Cantor, is perhaps the question whether
$\mathsf {CH}$
, i.e.,
$2^{\aleph _0}=\aleph _1$
, is true or, more generally, the Continuum Problem, i.e., the question “How many real numbers are there”?Footnote
27
Using Cohen’s method of forcing one can prove that it is consistent, relative to
$\mathsf {ZFC}$
, to assume that
$2^{\aleph _0}$
takes the value
$\aleph _1$
, or
$\aleph _2$
, or
$\aleph _{\omega _1}$
, or indeed
$\aleph _\alpha $
for many other choices of
$\alpha $
. In fact, the same is true if we replace
$\mathsf {ZFC}$
with
$\mathsf {ZFC}$
together with any large cardinal axiom: one can, for example, force both
$2^{\aleph _0}=\aleph _1$
and
$2^{\aleph _0}=\aleph _2$
by a rather small forcing
${\mathbb {P}}$
, and all large cardinals there might be in the universe with retain their large cardinal properties in the extension by
${\mathbb {P}}$
(as was shown by Azriel Levy and Robert Solovay in [Reference Lévy and Solovay19]).
Set-forcing or, more generally, class-forcing, is indeed an extremely powerful method for, starting with some model M of (sufficiently many) axioms of
$\mathsf {ZFC}$
, producing another model of set theory satisfying some interesting statement (see [Reference Shelah25]). This is effected by building a generic extension
$M[G]\supseteq M$
of M. The other main method we have for producing models of set theory is to construct carefully chosen submodels of the universe (inner model theory). In fact, as things stand at present, the forcing method and the construction of inner models of set theory are essentially the only methods we have for proving independence results, over base set theories—of the form
$\mathsf {ZFC}$
+
$\psi $
for some large cardinal axiom
$\psi $
—about the infinite.Footnote
28
It is therefore natural, from an empiricist standpoint, to seek to distil mathematical truth by isolating axioms—beyond
$\mathsf {ZFC}$
—which make reasonable portions of the truth predicate of the universe immune, to some extent, to the method of forcing.
$V=L$
is just such an axiom, and in fact it completely rules out any intromission whatsoever of the method of forcing (this includes generic extensions via class-forcing); the reason being, of course, that no nontrivial forcing extension of L will satisfy
$V=L$
. But
$V=L$
is not regarded as a good axiom by most set-theorists, as it is a minimality hypothesis incompatible with most large cardinal axioms (e.g., if
$V=L$
, then there are no measurable cardinals). In fact, it is usually the case, for typical (canonical) inner models M, that the view that
$V=M$
is seen to impose undesirable minimality constraints on the universe—most notably incompatibility with large cardinal axioms—that disqualify the answers to mathematical questions derived from
$V=M$
as the true answers. This in effect leaves the construction of forcing extensions as the only method we feel we should worry about when it comes to deciding on the truth value—in the context of
$\mathsf {ZFC}$
enhanced with large cardinal axioms—of statements about the infinite.Footnote
29
Arguably, the picture presented in the previous paragraph could change quite dramatically if Woodin’s Ultimate-L project eventually succeeds (see [Reference Woodin31]). This project aims at constructing a canonical inner model M for a supercompact cardinal, and such an inner model would in fact absorb most ‘classical’ large cardinals there might exist in V above the supercompact cardinal of M. If this inner model can in fact be constructed, then the axiom
$V=\text {Ultimate-}L$
would be compatible with at least most large cardinal hypotheses, and it would be immune to the method of forcing in the same way that the axiom
$V=L$
is. We should point out that we already have axioms which are compatible with all consistent large cardinal axioms and which, when added to
$\mathsf {ZFC}$
, also completely neutralize the method of set-forcing. The assertion that there is a proper class of supercompact cardinals which are indestructible under directed closed forcing is one such axiom.Footnote
30
Let us call this axiom
$\mathsf {A}$
. In fact, if
$\kappa $
is a supercompact cardinal and
$\mathbb P$
is a forcing notion of size less than
$\kappa $
, then
$\kappa $
is not indestructible by
${<}\kappa $
-directed closed forcing after forcing with
${\mathbb {P}}$
[Reference Hamkins7].Footnote
31
In any case,
$V=\text {Ultimate-}L$
would be incomparably more effective than an axiom like
$\mathsf {A}$
. Indeed,
$V=\text {Ultimate-}L$
would effectively answer many questions about the set-theoretic universe (e.g., it would imply that
$2^{\aleph _0}=\aleph _1$
and in fact
$\diamondsuit _{\omega _1}$
holds, etc.), whereas
$\mathsf {A}$
has nothing to say about these facts: if there is a proper class of supercompact cardinals, then one can construct a class-sized partial order forcing
$\mathsf {A}$
together with
$\mathsf {CH}$
, or with
$2^{\aleph _0}=\aleph _2$
, etc. Nevertheless, and however things turn out with
$V=\text {Ultimate-}L$
, one can naturally object to this axiom, and of course also to axioms like
$\mathsf {A}$
, on the ground that they only neutralize set-forcing for vacuous reasons; namely, because the axiom fails in any nontrivial generic extension by set-forcing. That was of course also the case for the axiom
$V=L$
. We are not impressed with a strategy for arguing for the truth of some statement if this strategy consists in dogmatically banning any methods to show the independence of the statement, especially if these methods are compatible with the standard views about V (e.g., maximality, etc.).
The above methodological considerations justify the search for principles,
$\psi $
, which, when added to our standard foundation with large cardinals T, decide some significant fragment of the truth predicate of V not in first order logic, but in the stronger logic naturally associated to the forcing method. In other words, we want
$\psi $
to imply,Footnote
32
for every member
$\sigma $
of some reasonable class
$\Sigma $
of sentences, that every forcing extension
$M[G]$
of every model M of T satisfies
$\sigma $
(or that every forcing extensions
$M[G]$
of every model M of T satisfies
$\lnot \sigma $
). We then say that
$\psi $
proves generic absoluteness for
$\Sigma $
. Such principles
$\psi $
would be natural axioms in that they would provide an effectively complete picture of V, at least for sentences in
$\Sigma $
, in the sense of this picture being complete with respect to, essentially, our only method for proving independence over T. Furthermore, it would be desirable that these principles do not render our strong logic degenerate, i.e., it would be desirable to have that
$\psi $
holds in many generic extensions
$M[G]$
of models M of T.
Fortunately we do have axioms which are compatible with all consistent large cardinal hypotheses, which imply some substantial amount of generic absoluteness, and which furthermore do so for interesting (nondogmatic) reasons. In fact, as Woodin proved [Reference Woodin28], large cardinals themselves do exactly that; for example, the existence, in
$\mathsf {ZFC}$
, of a proper class of Woodin cardinals has exactly this effect, as it renders the theory of the inner model
$L(\mathbb R)$
—and in fact of the Chang model (see Definition 7.1)—invariant under generic extensions by set-forcing, and as of course the axiom survives any set-forcing extension. Hence, large cardinals not only seem to decide first order arithmetic, in first order logic. They are in fact proved to also decide second order arithmetic, and actually the theory of the Chang model, in the logic of forcibility (stronger than first order logic but the next natural logic to give our attention to given our available methods). Moreover, this theory is already completely decided by the axiom asserting the existence of a proper class of Woodin cardinals. This fact provides a powerful additional argument in favor, in the context of
$\mathsf {ZFC}$
, of large cardinal axioms, at least in the region of a proper class of Woodin cardinals.
What we have said so far should be enough to motivate those subscribing to the universist approach, namely the point of view that there is one real universe, which our axioms should aim at describing. But generic absoluteness is important for those subscribing to the pluralist or the multiversist approaches as well. In this type of approaches, one shifts the focus of attention from the notion of one unique universe of set theory to a plurality of competing universes of set theory, naturally generated by applying (some of) our methods for proving independence over a standard foundation. The independence phenomenon has made the actual set-theoretic universe into an elusive, or inaccessible, entity. Hence, set-theorists have been effectively forced to familiarizing themselves with the corresponding multiverse(s), which after all are the object they can access. Given the prevalence of the forcing method in set theory, the generic multiverse, i.e., the collection of universes of set theory generated from a given universe by iterating the operations of taking set-generic extensions and ground models, has been the main object of attention in multiverse set theory. It is a remarkable fact about the generic multiverse that if we start with a model M of
$\mathsf {ZFC}$
and build a set-forcing extension
$M[G]$
of it, the ground model M is definable in
$M[G]$
from some parameter. In fact, we can uniformly define all the ground models, from parameters, and in fact all the ground models of all generic extensions. Hence, we can simulate talk about the generic multiverse within any of its members M; for example, given any n, we can define
$\Sigma _n$
-truth in the generic multiverse, i.e., the
$\Sigma _n$
-sentences which are true in all members of the generic multiverse, inside M (see [Reference Woodin, Kennedy and Kossak30] for more details).
The generic multiverse both provides a concrete mathematical arena to debate foundational views in mathematicsFootnote 33 and, because of its foundational value, it has also become a mathematical object of interest. To cite an example of mathematical work done on the generic multiverse with an eye on foundations, Hamkins and Löwe have studied the modal logic of forcing, where this is understood as the modal logic corresponding to looking at the generic multiverse as a Kripke frame where the accessibility relation is given by ‘N is a set-generic extension of M’ (see [Reference Hamkins and Löwe8]).
Using the generic multiverse language, the generic absoluteness phenomenon can be used to provide a bridge between various possible positions in the universist vs. multiversist debate in the foundations of set theory. The first illustration of this that comes to mind is the following: As we mentioned, the presence, in some members of the generic multiverse (equivalently, in all of its members), of a proper class of sufficiently strong large cardinals (e.g., Woodin cardinals) implies that the truth of the Chang model is invariant across the generic multiverse, whereas this is not the case for
$\mathsf {CH}$
. This suggests seeing the theory of the Chang model as ‘more definite’ or ‘truer’ than the theory of, say,
$H(\omega _2)$
(where
$\mathsf {CH}$
lives). That in turn suggests seeing the Chang model as ‘more real’ than
$H(\omega _2)$
. According to such a view, reality about the set-theoretic universe would come in degrees, with a fragment of the universe being more real than another if there is less variation of the theory of the former, across the generic multiverse, than of the theory of the latter.Footnote
34
It is now a good time to define Chang models in general.
Definition 7.1. Let
$\kappa $
be an ordinal. The
$\kappa $
-Chang model, denoted by
$\mathcal C_\kappa $
, is the
$\subseteq $
-minimal model of
$\mathsf {ZF}$
containing all the ordinals and closed under
$\kappa $
-sequences. This model can always be constructed as
$L(\mathcal {P}_{\kappa ^+}(\mathrm {Ord}))$
. In the case where
$\kappa =\omega $
, we omit
$\omega $
from the terminology and notation, and refer to it as the Chang model.
In the following section we will prove that under suitable large cardinal assumptions, the theory of the Chang model is generically absolute under forcings which preserve
$\mathsf {DC}$
. In other words, assuming sufficiently strong large cardinals exist, if
${\mathbb {P}}$
is a forcing which preserves
$\mathsf {DC}$
, e.g., a proper forcing, and
$G\subseteq {\mathbb {P}}$
is a V-generic filter, then the theory of the Chang model in V and
$V[G]$
is the same, even if we add reals as parameters. The foundational motivation for this work is the following: As has already been mentioned, most of real analysis lives inside
$L(\mathbb {R})$
, and therefore in the Chang model,Footnote
35
and can be carried out in
$\mathsf {ZF}+\mathsf {DC}$
. This means that
$\mathsf {ZF}+\mathsf {DC}$
is a very natural meta-theory for analysis. On the other hand, we have made a case for the desirability of generic absoluteness. These two points together make it natural to enquire whether or not full
$\mathsf {AC}$
is an overkill when it comes to deriving generic absoluteness for
$L(\mathbb {R})$
or even for the Chang model (modulo large cardinals), in the same way that, say,
$\mathsf {CH}$
or
$\lnot \mathsf {CH}$
are indeed overkills when deriving generic absoluteness for these inner models with
$\mathsf {ZFC}$
as base theory.
It is important to realize that nontrivial instances of generic absoluteness for the Chang model may be blocked unless we restrict our considerations to forcing notions preserving
$\mathsf {DC}$
:
Proposition 7.2. The Chang model cannot have generic absoluteness for its
$\Sigma _2$
theory in
$\mathsf {ZF}$
, even in the presence of large cardinals.
Proof. Suppose that V is a model of
$\mathsf {ZFC}$
and
$\kappa $
is a supercompact cardinal (or extendible, or any other not-yet inconsistent large cardinal). Perform a Feferman–Levy style construction taking the symmetric collapse of
$ \operatorname {\mathrm {Col}}(\omega _1,{<}\aleph _{\omega _1})$
(see sec. 4 in [Reference Hayut and Karagila10] for details).
This provides us with a model M of
$\mathsf {ZF}+\mathsf {DC}$
where
$\kappa $
is still a supercompact cardinal in the sense of Definition 5.5,Footnote
36
and
$M\models \operatorname {\mathrm {cf}}(\omega _2)=\omega _1$
; moreover any countable sequence of ordinals in M belongs to the ground model V. Therefore the Chang model
$\mathcal C$
in M is the same as in V.
Working in M, let G be an M-generic filter for
$ \operatorname {\mathrm {Col}}(\omega ,\omega _1)$
. Then in
$M[G]$
we have that
$\omega _1^{M[G]}=\omega _2^M$
and therefore
$M[G]\models \operatorname {\mathrm {cf}}(\omega _1)=\omega $
. This fact is reflected to
$\mathcal C$
since it is witnessed by a countable sequence of ordinals, despite the fact that
$ \operatorname {\mathrm {Col}}(\omega ,\omega _1)$
has size
$\aleph _1$
(without using any choice!) and thus is strictly less than
$\kappa $
. However,
$ \operatorname {\mathrm {cf}}(\omega _1)=\omega $
is a
$\Sigma _2$
statement over the Chang model. □
Remark 7.3. The above proof shows of course that if
$\kappa $
is, say, a supercompact cardinal in the
$\mathsf {ZFC}$
model V, then there is a symmetric extension M satisfying
$\mathsf {DC}$
in which
$\kappa $
remains supercompact and such that
$\Sigma _2$
-generic absoluteness for the Chang models fails between M and the
$ \operatorname {\mathrm {Col}}(\omega , \omega _1)$
-extension of M (which in the above situation kills
$\mathsf {DC}$
). Hence, there is no hope for proving
$\Sigma _2$
-generic absoluteness for the Chang model in general over the base theory
$\mathsf {ZF}+\mathsf {DC}$
unless we restrict to forcings preserving
$\mathsf {DC}$
, e.g., proper forcings.
Question 7.4. Can generic absoluteness for
$L(\mathbb {R})$
fail, in the presence of large cardinals, without the requirement that
$\mathsf {DC}$
is preserved? Moreover, is it at all possible, assuming the existence of
$\mathbb {R}^\#$
, that
$\omega _1$
be singular in
$L(\mathbb {R})$
?
8 Generic absoluteness for the Chang model
Given an infinite regular cardinal
$\lambda $
and a nonempty set X,
$ \operatorname {\mathrm {Col}}(\lambda ,{<}X)$
is the forcing, ordered by reverse inclusion, of all functions p such that
$| \operatorname {\mathrm {dom}} p|<\lambda $
,
$ \operatorname {\mathrm {dom}} p\subseteq \lambda \times X$
, and
$p(\alpha ,x)\in x$
for all
$\alpha <\lambda $
and
$x\in X$
. It is straightforward to see that
$ \operatorname {\mathrm {Col}}(\omega ,{<}X)$
is weakly homogeneous and that forcing with
$ \operatorname {\mathrm {Col}}(\omega ,{<}X)$
adds surjections from
$\omega $
onto all members of X.
The following technical lemma, Lemma 8.1, is a rendering of [Reference Woodin29], theorem 226. This lemma will be crucially used in the proof of Lemma 8.2 (and of Lemma 8.4).
Lemma 8.1. Suppose
$\kappa $
is a supercompact cardinal and
${\mathbb {P}}\in V_\kappa $
is a forcing notion preserving
$\mathsf {DC}$
. There is then a strictly increasing sequence
$\langle \kappa _\xi \mid \xi <\kappa \rangle $
of strongly inaccessible cardinals below
$\kappa $
, together with a sequence
$\langle {\mathcal {Q}}_\xi \mid 1\leq \xi \leq \kappa \rangle $
of forcing notions and a sequence
$\langle \dot {\mathcal {Q}}^{\mathbb {P}}_\xi \mid 1\leq \xi \leq \kappa \rangle $
of
${\mathbb {P}}$
-names for forcing notions, satisfying the following conditions.
-
(1)
$\kappa _0$ is such that
${\mathbb {P}}\in V_{\kappa _0}$ and such that
-
•
$ \operatorname {\mathrm {Col}}(\omega _1,{<}V_{\kappa _0})$ forces
$\mathsf {DC}_{{<}\kappa _0}$ over
$V_{\kappa }$ , and
-
•
${\mathbb {P}}$ forces that
$ \operatorname {\mathrm {Col}}(\omega _1,{<}V_{\kappa _0}[\dot G_{\mathbb {P}}])$ forces
$\mathsf {DC}_{{<}\kappa _0}$ over
$V_{\kappa }[\dot G_{\mathbb {P}}]$ .
Also,
-
•
${\mathcal {Q}}_1= \operatorname {\mathrm {Col}}(\omega _1,{<}V_{\kappa _0})$ , and
-
•
$\dot {\mathcal {Q}}^{\mathbb {P}}_1$ is a
${\mathbb {P}}$ -name for
$ \operatorname {\mathrm {Col}}(\omega _1,{<}V_{\kappa _0}[\dot G_{\mathbb {P}}])$ .
-
-
(2) For every
$\xi>0$ , letting
$\gamma _\xi = \{ \kappa _{\xi '}\mid \xi '<\xi \}$ ,
$\kappa _\xi $ is such that
-
•
${\mathcal {Q}}_\xi $ forces that
$ \operatorname {\mathrm {Col}}(\gamma _\xi ,{<}V_{\kappa _\xi }[\dot G_{{\mathcal {Q}}_\xi }])$ forces
$\mathsf {DC}_{{<}\kappa _\xi }$ over
$V_{\kappa }$ , and
-
•
${\mathbb {P}}\ast \dot {{\mathcal {Q}}}_\xi ^{\mathbb {P}}$ forces that
$ \operatorname {\mathrm {Col}}(\gamma _\xi ,{<}V_{\kappa _\xi }[\dot G_{{\mathbb {P}}\ast \dot {{\mathcal {Q}}}^{\mathbb {P}}_\xi }])$ forces
$\mathsf {DC}_{{<}\kappa _\xi }$ over
$V_{\kappa }[\dot G_{\mathbb {P}}]$ .
Also,
-
•
${\mathcal {Q}}_{\xi +1}={\mathcal {Q}}_\xi \ast \operatorname {\mathrm {Col}}(\gamma _\xi ,{<}V_{\kappa _\xi }[\dot G_{{\mathcal {Q}}_\xi }])$ , and
-
•
$\dot {{\mathcal {Q}}}^{\mathbb {P}}_{\xi +1}$ is a
${\mathbb {P}}$ -name for
$\dot {{\mathcal {Q}}}^{\mathbb {P}}_\xi \ast \operatorname {\mathrm {Col}}(\gamma _\xi ,{<}V_{\kappa _\xi }[\dot G_{\dot {{\mathcal {Q}}}^{\mathbb {P}}_\xi }])$ .
-
-
(3) For every limit
$\xi $ ,
${\mathcal {Q}}_\xi $ is the Easton limit of
$\langle {\mathcal {Q}}_{\xi '}\mid \xi '<\xi \rangle $ and
$\dot {\mathcal {Q}}_\xi ^{\mathbb {P}}$ is a
${\mathbb {P}}$ -name for the Easton limit of
$\langle \dot {\mathcal {Q}}_{\xi '}^{\mathbb {P}}\mid \xi '<\xi \rangle $ .
-
(4) For every
$\xi <\kappa $ , if
$H\subseteq {\mathcal {Q}}_\xi $ is V-generic (resp., if
$H'\subseteq {\mathcal {Q}}_\xi ^{\mathbb {P}}$ is
$V[G]$ -generic), then
${\mathcal {Q}}_\kappa /H$ is
$\kappa _\xi $ -closed in
$V[H]$ (resp.,
${\mathcal {Q}}_\kappa ^{\mathbb {P}}/H$ is
$\kappa _\xi $ -closed in
$V[G][H']$ ).
-
(5) For every
$\delta <\kappa $ such that
${\mathbb {P}}\in V_{\delta }$ , if
$\delta $ is supercompact in V (in
$V^{\mathbb {P}}$ ), then
$\kappa _\delta =\delta $ and
$\delta $ remains supercompact after forcing with
${\mathcal {Q}}_\kappa $ over V (after forcing with
${\mathbb {P}}\ast \dot {{\mathcal {Q}}}^{\mathbb {P}}_\kappa $ over V).
-
(6) If
$H\subseteq {\mathcal {Q}}_\kappa $ is V-generic, then
$V_\kappa [H]\models \mathsf {ZFC}$ .
-
(7) If
$G\subseteq {\mathbb {P}}$ is V-generic and
$H\subseteq {\mathcal {Q}}^{\mathbb {P}}_\kappa $ is
$V[G]$ -generic, then
$V_\kappa [G][H]\models \mathsf {ZFC}$ .
Lemma 8.2 and its companion, Lemma 8.4, are the key ingredients in the proof of Theorem 8.5.
Lemma 8.2. Suppose
$\delta <\kappa $
are supercompact cardinals,
${\mathbb {P}}\in V_\delta $
is a forcing notion preserving
$\mathsf {DC}$
, and
$G\subseteq {\mathbb {P}}$
is a V-generic filter. Then there is, in some outer model, a
$V[G]$
-generic filter
$K\subseteq \operatorname {\mathrm {Col}}(\omega ,{<}V_\delta )$
, together with an elementary embedding
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu4.png?pub-status=live)
Proof. Let
$G\subseteq {\mathbb {P}}$
be a V-generic filter and, in
$V[G]$
, let
$\langle {\mathcal {Q}}_\xi ^{\mathbb {P}}\mid \xi \leq \kappa \rangle $
be the iteration given by Lemma 8.1. Let
$H\subseteq {\mathcal {Q}}^{\mathbb {P}}_\kappa $
be a
$V[G]$
-generic filter. By Lemma 8.1 (5) and (7),
$\delta $
remains supercompact in
$V_\kappa [G][H]$
and
$V_\kappa [G][H]\models \mathsf {ZFC}$
. Hence, by a classical
$\mathsf {ZFC}$
result of Woodin (see [Reference Woodin28]) there is, in some outer model N, a
$V_\kappa [G][H]$
-generic filter
$J\subseteq \operatorname {\mathrm {Col}}(\omega ,{<}\delta )$
, for which there is an elementary embedding
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu5.png?pub-status=live)
Let X be
$V_\delta $
as computed in V. Since
$ \operatorname {\mathrm {Col}}(\omega ,{<}X)$
and
$ \operatorname {\mathrm {Col}}(\omega , {<}\delta )$
are forcing-equivalent in
$V_\kappa [G][H]$
, we may fix a
$V_\kappa [G][H]$
-generic filter
$K_0\subseteq \operatorname {\mathrm {Col}}(\omega , {<}X)$
such that
$\mathcal C^{V_\kappa [G][H][J]}=\mathcal C^{V_\kappa [G][H][K_0]}$
.
Claim 8.3. There is some
$V_\kappa [G]$
-generic filter
$K\subseteq \operatorname {\mathrm {Col}}(\omega , {<}X)$
such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu6.png?pub-status=live)
□
Proof. Let
$H_\delta =H\cap {\mathcal {Q}}_\delta ^{\mathbb {P}}$
. Since
${\mathcal {Q}}_\kappa ^{\mathbb {P}}/H_\delta $
is
$\delta $
-closed in
$V[G][H_\delta ]$
(Lemma 8.1 (4) and (5)), each
$ \operatorname {\mathrm {Col}}(\omega ,{<}X)$
-name in
$V_\kappa [G][H]$
for an
$\omega $
-sequence of ordinals is equivalent to such a name in
$V_\kappa [G][H_\delta ]$
: every such name is equivalent to the canonical name
$\dot x$
for an
$\omega $
-sequence where, for each
$n<\omega $
,
$\dot x(n)$
consists of pairs
$\langle p, \check \alpha \rangle $
, for p a condition in
$ \operatorname {\mathrm {Col}}(\omega , {<}X)$
and
$\alpha $
an ordinal, and where the set of such p’s forms an antichain of
$ \operatorname {\mathrm {Col}}(\omega , {<}X)$
and is therefore of size less than
$\delta $
. Hence
$\mathcal C^{V_\kappa [G][H][K_0]}=\mathcal C^{V_\kappa [G][H_\delta ][K_0]}$
. But
${\mathcal {Q}}_\delta ^{\mathbb {P}}$
clearly embeds completely in
$ \operatorname {\mathrm {Col}}(\omega , {<}X)\times \operatorname {\mathrm {Col}}(\omega , {<}X)$
, and therefore also in
$ \operatorname {\mathrm {Col}}(\omega , {<}X)$
since
$ \operatorname {\mathrm {Col}}(\omega , {<}X)\cong \operatorname {\mathrm {Col}}(\omega , {<}X)\times \operatorname {\mathrm {Col}}(\omega , {<}X)$
, from which the conclusion follows. □
The above claim finishes the proof since
${\mathcal {Q}}_\kappa ^{\mathbb {P}}$
is
$\sigma $
-closed in
$V[G]$
, by Lemma 8.1 (4), and therefore
$\mathcal C^{V_\kappa [G][H]}=\mathcal C^{V_\kappa [G]}$
.
Similarly, we can prove the following lemma.
Lemma 8.4 (
$\mathsf {ZF}+\mathsf {DC}$
)
Suppose
$\delta <\kappa $
are supercompact cardinals. Then there is, in some outer model, a V-generic
$K\subseteq \operatorname {\mathrm {Col}}(\omega ,{<}V_\delta )$
, together with an elementary embedding
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu7.png?pub-status=live)
The main theorem in this section is the following.
Theorem 8.5 (
$\mathsf {ZF}+\mathsf {DC}$
)
Suppose for every closed and unbounded
$\Pi _2$
-definable class C of ordinals there is a supercompact cardinal
$\kappa \in C$
. Then, for every set-forcing
${\mathbb {P}}$
and every V-generic filter
$G\subseteq {\mathbb {P}}$
, if
$V[G]\models \mathsf {DC}$
, then the structures
$(\mathcal C^V; \in , r)_{r\in \mathbb {R}^V}$
and
$(\mathcal C^{V[G]}; \in , r)_{r\in \mathbb {R}^V}$
have the same
$\Sigma _2$
-theory.
Proof. Suppose, towards a contradiction, that
${\mathbb {P}}$
forces that
$(\mathcal C^V; \in , r)_{r\in \mathbb {R}^V}$
and
$(\mathcal C^{V[\dot G]}; \in , r)_{r\in \mathbb {R}^V}$
disagree on the truth value of some sentence
$\exists x\forall y\varphi (x, y)$
, where
$\varphi $
is a restricted formula. By our large cardinal assumption we may then fix a supercompact cardinal
$\kappa $
such that
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu8.png?pub-status=live)
and such that there is a supercompact cardinal
$\delta <\kappa $
such that
${\mathbb {P}}\in V_\delta $
.
We show that
$(\mathcal C^{V_{\kappa }}; \in , r)_{r\in \mathbb {R}^V}$
and
$(\mathcal C^{V_{\kappa }[G]}; \in , r)_{r\in \mathbb {R}^{V}}$
are elementarily equivalent, which will be a contradiction.
Let X be
$V_\delta $
as computed in V. By Lemma 8.2 and Lemma 8.4 we know that there are, in some outer model, filters
$K,K'\subseteq \operatorname {\mathrm {Col}}(\omega ,{<}X)$
which are V-generic and
$V[G]$
-generic, respectively, for which there are elementary embeddings
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20210920113016888-0761:S1755020320000143:S1755020320000143_eqnu9.png?pub-status=live)
Since
${\mathbb {P}}\times \operatorname {\mathrm {Col}}(\omega ,{<}X)$
and
$ \operatorname {\mathrm {Col}}(\omega , {<}X)$
are forcing–equivalent, there is a V-generic filter
$K''\subseteq \operatorname {\mathrm {Col}}(\omega , {<}X)$
such that
$V_\kappa [G][K']=V_\kappa [K'']$
. But then, by the weak homogeneity of
$ \operatorname {\mathrm {Col}}(\omega ,{<}X)$
, the theories of
$(\mathcal C^{V_{\kappa }}; \in , r)_{r\in \mathbb {R}^V}$
and
$(\mathcal C^{V_{\kappa }[G]}; \in , r)_{r\in \mathbb {R}^{V}}$
are the same. □
Using a similar argument one can prove the following.
Theorem 8.6 (
$\mathsf {ZF}+\mathsf {DC}$
)
Suppose there is, for every closed and unbounded class of ordinals C, a supercompact cardinal
$\kappa $
such that
$\kappa \in C$
. Then, for every set-forcing
${\mathbb {P}}$
and every V-generic filter
$G\subseteq {\mathbb {P}}$
, if
$V[G]\models \mathsf {DC}$
, then the structures
$(\mathcal C^V; \in , r)_{r\in \mathbb {R}^V}$
and
$(\mathcal C^{V[G]}; \in , r)_{r\in \mathbb {R}^V}$
are elementarily equivalent.
Note the second order character of the hypothesis of Theorem 8.6. This hypothesis can of course be taken to be an infinite scheme asserting that for every formula
$\Theta (x)$
with parameters defining a closed and unbounded class of ordinals there is some supercompact cardinal
$\kappa $
such that
$\Theta (\kappa )$
. Many reasonable large cardinal assumption could be used here in place of this. Nevertheless, it is not clear to us whether the existence of one supercompact cardinal suffices to yield the conclusion.
The following version of Theorem 8.5 for
$L(\mathbb {R})$
can be derived by the same argument as in the proof of Theorem 8.5 together with the fact that, under the existence of a supercompact cardinal,
$\mathbb {R}^\#$
exists in any set-forcing extension, and together with Woodin’s classical result that the Axiom of Determinacy holds in
$L(\mathbb {R})$
assuming
$\mathsf {ZFC}$
and the existence of infinitely many Woodin cardinals below a measurable cardinal.
Theorem 8.7 (
$\mathsf {ZF}+\mathsf {DC}$
)
Suppose there are two supercompact cardinals. Then, for every set-forcing
${\mathbb {P}}$
and every V-generic filter
$G\subseteq {\mathbb {P}}$
-generic filter G over V, if
$V[G]\models \mathsf {DC}$
, then
-
(1) the structures
$(L(\mathbb {R})^V; \in , r)_{r\in \mathbb {R}^V}$ and
$(L(\mathbb {R})^{V[G]}; \in , r)_{r\in \mathbb {R}^V}$ are elementarily equivalent, and
-
(2) the Axiom of Determinacy holds in
$L(\mathbb {R})^{V[G]}$ .
9 Some concluding remarks
The construction of symmetric submodels of forcing extensions—also known as symmetric extensions—is another way to use forcing to prove independence over base theories T. Such a symmetric submodel is a model N such that
$M\subseteq N\subseteq M[G]$
, for a ground model M satisfying our base theory and a generic filter G over M.Footnote
37
When T is
$\mathsf {ZFC}$
, we of course do not get anything new by looking at symmetric extensions. The reason is that if M and N both satisfy
$\mathsf {ZFC}$
, then N is itself a forcing extension of M. On the other hand, if we drop the Axiom of Choice from our foundation, then the construction of symmetric extensions becomes a genuine new method for proving independence. In particular, retreating to a foundation which does not incorporate Choice allows for a richer and more complex generic absoluteness project; now we can also ask if such and such hypothesis implies that all symmetric extensions of V (of some nice form) agree with V on the theory of the Chang model, or we can ask Question 9.2 below, etc.
In this context, one first outcome of foundational significance from our result is that, in the presence of sufficiently strong large cardinals, the theory of the Chang model is invariant not also under
$\mathsf {DC}$
-preserving forcing, but also under
$\mathsf {DC}$
-preserving symmetric submodels of
$\mathsf {DC}$
-preserving forcing extensions.Footnote
38
Theorem 9.1. Suppose that M satisfies
$\mathsf {ZF}+\mathsf {DC}$
and the hypothesis of Theorem 8.6. Let
$M[G]$
be a generic extension of M such that
$M[G]$
also satisfies
$\mathsf {DC}$
, and let N be a corresponding
$\mathsf {DC}$
-preserving symmetric extension. Then
$\mathcal C^N$
and
$\mathcal C^M$
are elementarily equivalent.
Proof. As we remarked in footnote 36, the relevant large cardinal hypothesis is preserved when moving from M to N. It follows, therefore, that N will satisfy the hypothesis of Theorem 8.6. By a theorem of Grigorieff in [Reference Grigorieff6],
$M[G]$
is a generic extension of N. As we assumed that all three models satisfy
$\mathsf {DC}$
, this is a
$\mathsf {DC}$
-preserving forcing extension, so by Theorem 8.6,
$\mathcal C^N$
and
$\mathcal C^{M[G]}$
are elementarily equivalent, as well as
$\mathcal C^M$
and
$\mathcal C^{M[G]}$
. As elementary equivalence is, well, an equivalence relation, it follows that
$\mathcal C^M$
and
$\mathcal C^N$
are also elementarily equivalent. □
This observation leads to the following question.
Question 9.2. Suppose that M satisfies
$\mathsf {ZF}+\mathsf {DC}$
and that the theory of the Chang model is generically absolute under
$\mathsf {DC}$
-preserving set-forcing. Let N be a
$\mathsf {DC}$
-preserving symmetric extension of M. Is
$\mathcal C^M$
elementarily equivalent to
$\mathcal C^N$
?
It should be noted that generic absoluteness, in general, does not imply the existence of a proper class of large cardinals in the ground model. In fact, it does not imply even the existence of an inaccessible cardinal. For example, suppose V is a model of
$\mathsf {ZFC}$
in which there is a supercompact cardinal
$\kappa $
but there are no inaccessible cardinals above
$\kappa $
. Suppose G is a
$ \operatorname {\mathrm {Col}}(\omega _1, {<}\kappa )$
-generic filter over V. Then generic absoluteness for the Chang model holds in
$V[G]$
but in
$V[G]$
there are no inaccessible cardinals.
The proofs of our main results in the previous section can be seen as an illustration of the thesis that, in choiceless contexts, one obtains fragments of choice from sufficiently strong large cardinal axioms (see [Reference Woodin29]). Roughly speaking, they show that if
$\mathsf {DC}$
holds and a large enough supply of supercompact cardinals is available, then one can force
$\mathsf {AC}$
, over suitable initial segments
$V_\alpha $
of the universe, while preserving large cardinals by a forcing—and here is a crucial point—which does not add
$\omega $
-sequences of ordinals and therefore leaves the corresponding versions of the Chang model unchanged. One can then appeal to the classical arguments for proving generic absoluteness for this inner model over
$\mathsf {ZFC}$
. The reason we needed to force full choice, at least over some suitable
$V_\alpha $
, is that in the classical arguments for proving generic absoluteness at this level—which we ultimately appeal to—we need Łoś’s Theorem, between V and some generic ultrapower of V, and Łoś’s Theorem is not provable in
$\mathsf {ZF}+\mathsf {DC}$
.Footnote
39
The extra large cardinal strength—with respect to the large cardinals used in the classical proofs, namely Woodin cardinals—in our present proofs, coming from the supercompact cardinals, is put to work at forcing full choice. It is perhaps interesting to note that no reduction in the large cardinals needed seems to be available to our arguments if one replaces
$\mathsf {DC}$
by some higher version
$\mathsf {DC}_\lambda $
of it, for some fixed infinite cardinal
$\lambda $
, even if we restrict to forcing notions preserving
$\mathsf {DC}_\lambda $
, as we will still need to force
$\mathsf {AC}$
in order to be able to apply Łoś’s Theorem. It is only when we throw in the Axiom of Choice that we manage to drop our large cardinal assumption from many supercompact cardinals down to Woodin cardinals.
On the other hand, and as we saw in §7,
$\mathsf {DC}$
is provably needed if we are to derive generic absoluteness for the Chang model relative to any large cardinals whatsoever. The place where our argument breaks down if we remove
$\mathsf {DC}$
from our hypotheses is that we will still be able to force
$\mathsf {ZFC}$
over initial segments
$V_\alpha $
of V, but we will definitely change the Chang model when doing so as we cannot assume, in the absence of
$\mathsf {DC}$
, that the forcing for getting
$\mathsf {AC}$
is
$\omega _1$
-distributive.
The above considerations show that it is reasonable to take
$\mathsf {ZF}+\mathsf {DC}$
, rather than
$\mathsf {ZFC}$
, as a vantage point in which to look at the generic absoluteness phenomenon for the Chang model in the presence of large cardinals. One could in fact assess the results in this second part of the paper as showing that
$\mathsf {DC}$
is the bare minimum amount of choice enabling large cardinals to do their job of fixing the theory of the Chang model. This is another argument in favour of the naturalness of
$\mathsf {ZF}+\mathsf {DC}$
: This theory not only provides the right framework for developing classical analysis, but is also the right base theory over which to derive generic absoluteness for
$L(\mathbb R)$
—where analysis lives—and even the Chang model, and thereby safeguarding truth in this inner model—and therefore truth in analysis—from the independence phenomenon.
We will conclude by pointing out an obstacle for extending this type of results. The classical arguments for deriving generic absoluteness in
$\mathsf {ZFC}$
for Chang models from large cardinals, relative to all set-forcings, work of course only for the
$\omega $
-Chang model, which is precisely what we can capture—in the sense of leaving unchanged—by a
$\omega _1$
-distributive forcing assuming
$\mathsf {DC}$
holds.Footnote
40
On the other hand it is possible to prove reasonable generic absoluteness results for higher Chang models, provided the scope of the result is restricted to apply to forcing extension via a carefully chosen—but, arguably, still natural—class of partial orders. Specifically, for any given infinite regular cardinal
$\kappa $
and any definable sufficiently nice class
$\Gamma $
of forcing notions one can define a certain strong forcing axiom
$ \operatorname {\mathrm {CFA}}(\Gamma )_\kappa $
which in particular implies the usual forcing axiom for meeting collections of
$\kappa $
-many dense subsets of forcings in
$\Gamma $
. One can then prove, under the assumption that sufficiently strong large cardinals exist and that
$ \operatorname {\mathrm {CFA}}(\Gamma )_\kappa $
holds, that the theory of the
$\kappa $
-Chang model is invariant with respect to forcing extensions via partial orders in
$\Gamma $
which, in addition, preserve our forcing axiom
$ \operatorname {\mathrm {CFA}}(\Gamma )_\kappa $
.Footnote
41
Moreover, for
$\kappa =\omega _1$
, many examples of nice classes
$\Gamma $
of forcing notions have been found for which the corresponding forcing axiom
$ \operatorname {\mathrm {CFA}}(\Gamma )_{\omega _1}$
is consistent (see [Reference Viale27] and [Reference Asperó and Viale2] for these results). It is totally unclear how one may want to try to adapt these results to a
$\mathsf {DC}_\lambda $
context. The problem is that the formulation of the relevant strong forcing axiom completely loses its power in the absence of Łoś’s Theorem, to be applied, in the proofs, to certain ultrapowers coming from towers of ideals belonging to our class
$\Gamma $
.Footnote
42
Finally, a (vague) question.
Question 9.3. Can we find reasonable axioms R for which one can prove, in
$\mathsf {ZF}+\mathsf {DC}_\lambda $
, for some
$\lambda \geq \omega _1$
, that if R holds, then no forcing notion in some natural class which moreover preserves R can change the theory of the
$\omega _1$
-Chang model?
Acknowledgments
This research was initiated when the authors were visiting fellows at the Isaac Newton Institute forMathematical Sciences during the programme ‘Mathematical, Foundational and Computational Aspects ofthe Higher Infinite’ (HIF) funded by EPSRC grant EP/K032208/1. The authors thank the organizers of theHIF programme for extending an invitation to Cambridge in October 2015.
The authors also thank both Mirna Džamonja and Yasuo Yoshinobu for their helpful suggestions. Thanksare also due to the anonymous referee whose remarks helped elevate the quality of the paper in content and exposition.
The first author acknowledges the support of EPSRC Grant EP/N032160/1. The second author was partially supported by theRoyal Society Newton International Fellowship, grant no. NF170989.