48

Inspired by the question here, I have been trying to understand the sheaf-theoretic approach to forcing, as in MacLane–Moerdijk's book "Sheaves in geometry and logic", Chapter VI.

A general comment is that sheaf-theoretic methods do not a priori produce "material set theories". Here "material set theory" refers to set theory axiomatized on the element-of relation $\in$, as usually done, in ZFC. Rather, they produce "structural set theories", where "structural set theory" refers to set theory axiomatized on sets and morphisms between them, as in the elementary theory of the category of sets ETCS. I will always add a collection (equivalently, replacement) axiom to ETCS; let's denote it ETCSR for brevity. Then Shulman in Comparing material and structural set theories shows that the theories ZFC and ETCSR are "equivalent" (see Corollary 9.5) in the sense that one can go back and forth between models of these theories. From ZFC to ETCSR, one simply takes the category of sets; in the converse direction, one builds the sets of ZFC in terms of well-founded extensional trees (modeling the "element-of" relation) labeled by (structural) sets.

So for this question, I will work in the setting of structural set theory throughout.

There are different ways to formulate the data required to build a forcing extension. One economic way is to start with an extremally disconnected profinite set $S$, and a point $s\in S$. (The partially ordered set is then given by the open and closed subsets of $S$, ordered by inclusion.) One can endow the category of open and closed subsets $U\subset S$ with the "double-negation topology", where a cover is given by a family $\{U_i\subset U\}_i$ such that $\bigcup_i U_i\subset U$ is dense. Let $\mathrm{Sh}_{\neg\neg}(S)$ denote the category of sheaves on the poset of open and closed subsets of $S$ with respect to this topology.

Then $\mathrm{Sh}_{\neg\neg}(S)$ is a boolean (Grothendieck) topos satisfying the axiom of choice, but it is not yet a model of ETCSR. But with our choice of $s\in S$, we can form the ($2$-categorical) colimit $$\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$$ called the filter-quotient construction by MacLane–Moerdijk. I'm highly tempted to believe that this is a model of ETCSR — something like this seems to be suggested by the discussions of forcing in terms of sheaf theory — but have not checked it. (See my answer here for a sketch that it is well-pointed. Edit: I see that well-pointedness is also Exercise 7 of Chapter VI in MacLane–Moerdijk.)

Questions:

  1. Is it true that $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$ is a model of ETCSR?
  2. If the answer to 1) is Yes, how does this relate to forcing?

Note that in usual presentations of forcing, if one wants to actually build a new model of ZFC, one has to first choose a countable base model $M$. This does not seem to be necessary here, but maybe this is just a sign that all of this does not really work this way.

Here is another confusion, again on the premise that the answer to 1) is Yes (so probably premature). An example of an extremally disconnected profinite set $S$ is the Stone-Cech compactification of a discrete set $S_0$. In that case, forcing is not supposed to produce new models. On the other hand, $\mathrm{Sh}_{\neg\neg}(S)=\mathrm{Sh}(S_0)=\prod_{S_0} \mathrm{Set}$, and if $s$ is a non-principal ultrafilter on $S_0$, then $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$ is exactly an ultraproduct of $\mathrm{Set}$ – which may have very similar properties to $\mathrm{Set}$, but is not $\mathrm{Set}$ itself. What is going on?

Wojowu
  • 27,379
Peter Scholze
  • 19,800
  • 3
  • 98
  • 118
  • 8
    Much safer than the shiv theoretic approach to forcing, I say. – Asaf Karagila Mar 04 '21 at 11:01
  • 2
    @AsafKaragila Shivs are more prone to applications than theory. – Alec Rhea Mar 04 '21 at 11:06
  • The countability of the base model is to have https://en.wikipedia.org/wiki/Rasiowa%E2%80%93Sikorski_lemma available, from what I understand, so that one can get the generic filter in the forcing poset. This seems to me to be linked to getting the filter in the subobjects of the terminal object of the sheaf topos, to actually perform the filterquotient construction, so that the subobject classifier in the new topos is $1+1$. (And, it's exercise VI.6.iii) in Mac Lane and Moerdijk, not 7, that you want: note the filter!) – David Roberts Mar 04 '21 at 12:32
  • I'll have more to say later, but for now, can you explain how to get your profinite set $S$ starting from a forcing poset? I'm a little confused because I think of the forcing poset as the Lindenbaum algebra of the geometric theory of the object we want to adjoin, which in general might have no models at all in the theory we start with, so that it would correspond to a locale without any points; but a topological space always has enough points, so where do they come from? – Mike Shulman Mar 04 '21 at 14:14
  • Or is it that when you say "sheaves on $S$" you really mean "sheaves on some site whose objects happen to be related to $S$ in some way but are not actually a point-set topology on the set $S$"? – Mike Shulman Mar 04 '21 at 14:15
  • 3
    If V is a model of set theory, then you can do the first part of this construction "in V" to produce some category C that V thinks is a Grothendieck topos. In some ambient universe, C is a Boolean pretopos having an underlying Stone space X. In your example, I think X is Spec(B), where B is the Boolean algebra of clopen subsets of your S? In particular S sits inside X as "the points of the spectrum that V knows about". The usual forcing construction is to take a "stalk" of C at some point of X which is "V-generic": that is, very far from belonging to S. – Jacob Lurie Mar 04 '21 at 14:24
  • The forcing poset defines a complete boolean algebra, which by Stone duality corresponds to an extremally disconnected profinite set. The subsheaves of $\ast$ in $\mathrm{Sh}(S)$ are in bijection with open and closed subsets $U\subset S$, so for all I can tell the maximal filters (used in Exercise VI.6 of MacLane--Moerdijk) are exactly given by the points $s\in S$. But Jacob's comment makes me believe I'm still stuck down in my platonic universe and need to free my mind more... – Peter Scholze Mar 04 '21 at 14:36
  • @Jacob: Just to see that I follow this, C is the class of P-names, right? – Asaf Karagila Mar 04 '21 at 15:00
  • One issue that set theorists often bring up about the sheaf-theoretic approach to forcing is the fact that for set theorists, it seems to be essential to have good control over definability properties of the forcing model in terms of the ground model (I think the main thing is that the forcing relation should be definable in the ground model?). I've gotten the impression that in the sheaf-theoretic approach, it's not obvious how to control these definability properties. My hunch is that this is not really an issue in the end, but I've never seen this point carefully addressed. – Tim Campion Mar 04 '21 at 15:16
  • The above description seems perfectly definable to me, but I don't really know what definable means. @AsafKaragila: I think it's fair to consider $C=\mathrm{Sh}(S)$ as the "structural" version of the class of P-names. – Peter Scholze Mar 04 '21 at 15:20
  • @Tim: Yes, that's exactly right. Part of what I was going for in my "long comment" is the fact we can do everything internally, so the model doesn't really play a significant role. Of course, we can do without that, given a model of ZFC, build a model of ZFC+CH or ZFC+MA, etc. The true strength of forcing is that it can be done internally, rather than externally. – Asaf Karagila Mar 04 '21 at 15:22
  • 1
    I really don't think you should call this topos $\mathrm{Sh}(S)$. That seems to be asking for trouble, since when $S$ is a topological space, "$\mathrm{Sh}(S)$" has a standard meaning, and this isn't it. – Mike Shulman Mar 04 '21 at 15:22
  • Sorry, Mike! I'll add the double negation in, but I didn't know the tex command... – Peter Scholze Mar 04 '21 at 15:23
  • @AsafKaragila And yet, set theorists seem to object when I point out that you can also do everything internally in the internal language of a sheaf topos. – Mike Shulman Mar 04 '21 at 15:23
  • Thanks, @Peter. When I was working on iterations of symmetric extensions, I realised that the "generic semi-direct product" has a strong categorical flavour. One of the other students would explain sheaves to me about twice a month. My advisor even said at some point that maybe that approach would clarify the construction, but I'm pretty sure that he was pulling my legs. I still don't fully grasp what sheaves are... :-) – Asaf Karagila Mar 04 '21 at 15:24
  • Objection, your honour, the comment by @Mike is written in an internal language to an internal sheaf topos! :-) – Asaf Karagila Mar 04 '21 at 15:25
  • 1
    I also occurs to me that the techniques used in topos theory to get a localic cover of a sheaf topos over a general site (cf. C.5.2 in the Elephant or A.4.3.1 in SAG) sound a lot like "giving names" to the objects / morphisms of the site. I wonder if this has anything to with the "names" which appear in forcing. – Tim Campion Mar 04 '21 at 15:31
  • 1
    Regarding question 1, I don't want to give a definitive answer without thinking more about it, because there are many details involved. but it should work. This construction is mentioned in example A.2.1.13 of Sketches of an elephant and in more details in Johnstone's older book "Topos theory" (section 9.4). It is not going to fully answer your question as these references don't specifically discuss ETCS nor replacement, but that should already considerably reduce what is left to prove. – Simon Henry Mar 04 '21 at 17:07

4 Answers4

28

Yes, this is a model of ETCSR. Unfortunately, I don't know of a proof of this in the literature, which is in general sadly lacking as regards replacement/collection axioms in topos theory. But here's a sketch.

As Zhen says, the filterquotient construction preserves finitary properties such as Booleanness and the axiom of choice. Moreover, a maximal filterquotient will be two-valued. But as you point out, a nondegenerate two-valued topos satisfying the (external) axiom of choice is necessarily well-pointed; I wrote out an abstract proof at https://ncatlab.org/nlab/show/well-pointed+topos#boolean_properties. Thus, $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(S)$ is a model of ETCS.

As for replacement, the proof that I know (which is not written out in the literature) goes by way of the notions of "stack semantics" and "autology" in my preprint Stack semantics and the comparison of material and structural set theories (the other half, not the part that became the paper of mine cited in the question).

Briefly, stack semantics is an extension of the internal logic of a topos to a logic containing unbounded quantifiers of the form "for all objects" or "there exists an object". (My current perspective, sketched in these slides, is that this is a fragment of the internal dependent type theory of a 2-topos of stacks -- hence the name!) This language allows us to ask whether a topos is "internally" a model of structural set theories such as ETCS or ETCSR.

It turns out that every topos is "internally (constructively) well-pointed", and moreover satisfies the internal collection axiom schema. But the internal separation axiom schema is a strong condition on the topos, which I called being "autological". If a topos is autological and also Boolean, then the logic of its stack semantics is classical; thus it is internally a model of ETCSR. Since Grothendieck toposes are autological, your $\mathrm{Sh}_{\neg\neg}(S)$ is internally a model of ETCSR.

Now we can also prove that if $\mathcal{E}$ is Boolean and autological, so is any filterquotient of it. The idea is to prove a categorical version of Łoś's theorem for the stack semantics. (I don't know whether this is true without Booleanness, which annoys me to no end, but you probably don't care. (-: ) Therefore, $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(S)$ is also autological.

Finally, another fact about autology is that a well-pointed topos is autological if and only if it satisfies the ordinary structural-set-theory axiom schemas of separation and collection. Therefore, $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(S)$ satisfies these schemas, hence is a model of ETCSR.

However, I doubt that this particular filterquotient is related to forcing at all. The point is the same one that Jacob made in a comment: when set theorists force over a countable base model to make an "actual" new model, they find an actual generic ultrafilter outside that model. A generic ultrafilter in the base model would be a point of the topos $\mathrm{Sh}_{\neg\neg}(S)$, which as Andreas pointed out in a comment, does not exist. Your "points" $x$ are not points of the topos $\mathrm{Sh}_{\neg\neg}(S)$, so it's unclear to me whether filterquotients at them have anything to do with forcing.

Let me reiterate my argument that the real content of forcing is the internal logic of the topos $\mathrm{Sh}_{\neg\neg}(S)$. In particular, if you build a model of material set theory in this internal logic, what you get is essentially the Boolean-valued model that set theorists talk about. Edit: I think the rest of this answer is off-base; see the discussion in the comments. I'm pretty sure this is the best kind of "model" you can get if you don't want to start talking about countable models of ZFC sitting inside larger ambient models.

At the moment, my best guess for a topos-theoretic gloss on the countable-transitive-model version of forcing is something like the following. Suppose that $E$ is a countable model of ETCSR, containing an internal poset $P$, which we can equip with its double-negation topology. Then treating $E$ as the base topos, we can build $\mathrm{Sh}(P,E)$, a bounded $E$-indexed elementary topos (i.e. "$E$ thinks it is a Grothendieck topos"), which contains the Boolean-valued model associated to $P$ as described above. It is the classifying topos of $P$-generic filters, hence has in general no $E$-points.

But we also have the larger topos $\rm Set$ in which $E$ is countable, and we can consider the externalization $|P|$ which is a poset in $\rm Set$, namely $|P| = E(1,P)$. Then we can build the topos $\mathrm{Sh}(|P|,\rm Set)$ which "really is" a Grothendieck topos and classifies $|P|$-generic filters. The "Rasiowa–Sikorski lemma" implies that, since $E$ is countable, in this case such a filter does actually exist in $\rm Set$, so there is a point $p:\mathrm{Set} \to \mathrm{Sh}(|P|,\rm Set)$.

Now we should also have some kind of "externalization functor" $|-| : \mathrm{Sh}(P,E) \to \mathrm{Sh}(|P|,\rm Set)$. My guess is that the set-theorists' forcing model is the "image" (whatever that means) of the Boolean-valued model in $\mathrm{Sh}(P,E)$ under the composite of this externalization functor with the inverse image functor $p^* : \mathrm{Sh}(|P|,\rm Set) \to Set$. However, I have not managed to make this precise.

Mike Shulman
  • 65,064
  • Is there any characterization of set theory that every topos satisfies? Your slide likely suggests $\mathsf{BZ}$, but your answer says that every topos satisfies the internal collection axiom schema. Are the usual collection and the internal collection different? (I feel I am conflating structural set theory and material set theory.) – Hanul Jeon Mar 04 '21 at 18:45
  • (By the way, is there any reason for choosing the hiragana yo(よ) on page 21 of your slide? Does it come from Yoneda?) – Hanul Jeon Mar 04 '21 at 18:46
  • Thanks a lot!! Actually, I tend to believe that this is forcing. Let me follow Asaf Karagila and pretend that $V[G]$ makes sense, and let me conflate it with its category of sets. Then $V[G]$ is a topos with a generic filter, so there is a natural map $\mathrm{Sh}{\neg\neg}(S)\to V[G]$; moreover, this will necessarily factor over $\varinjlim{U\ni s} \mathrm{Sh}{\neg\neg}(U)$ for a unique $s\in S$. The map $\varinjlim{U\ni s} \mathrm{Sh}_{\neg\neg}(U)\to V[G]$ is now a map of models of ETCSR, and I'm inclined to believe it must be an isomorphism. – Peter Scholze Mar 04 '21 at 19:00
  • After all, as in Zhen Lin's answer, $\mathrm{Sh}_{\neg\neg}(S)$ is obtained from $S$ by freely adjoining a generic filter, just like $V[G]$ is freely obtained by adjoining a generic filter (but somehow in an ambient model of ZFC). So I think the map is necessarily essentially surjective, and somehow I can't believe that the two things differ. – Peter Scholze Mar 04 '21 at 19:02
  • One source of confusion (for me) is the difference between the generic filter $G$, and the point $s\in S$ (which corresponds to an ultrafilter in the base model); I think the two are quite different. In $\varinjlim_{U\ni s} \mathrm{Sh}{\neg\neg}(U)$, there is a generic filter $G$ (just because there is one already in $\mathrm{Sh}{\neg\neg}(S)$), and this generic filter extends the ultrafilter corresponding to $s$ in the base model, but it's a different thing. – Peter Scholze Mar 04 '21 at 19:06
  • @PeterScholze The space $S$ is determined by a Boolean algebra $B$ (in $V$) which "$V$ thinks is complete" (every subset of $B' \subseteq B$ belonging to $V$ has a supremum). The construction $V[G]$ makes sense for any homomorphism of Boolean algebras $G: B \rightarrow { 0,1 }$ belonging the ambient universe. In particular, you can take homomorphisms $G$ which belong to $V$ (this is the construction of your original post). Alternatively, you can take homomorphisms which are "generic" meaning that they preserve suprema of subsets $B' \subseteq B$ which are contained in $V$ (this is forcing). – Jacob Lurie Mar 04 '21 at 19:24
  • @PeterScholze (ctd) There is no overlap between the two, except in the case where $S$ has isolated points (in which case you can take $G$ to "be" an isolated point of $S$, and you will get $V[G] = V$). – Jacob Lurie Mar 04 '21 at 19:26
  • Ah, I see! Thanks, Jacob. As I said, this was a point of confusion :-). – Peter Scholze Mar 04 '21 at 19:46
  • @HanulJeon Yes, the use of hiragana yo for the Yoneda embedding is a small movement, see some references at https://ncatlab.org/nlab/show/Yoneda+embedding#ReferencesNotation. – Mike Shulman Mar 04 '21 at 19:56
  • The structural axiom of collection is something that we can ask about whether a well-pointed topos satisfies. In addition, because every topos looks well-pointed in its own internal stack semantics, we can ask about whether it satisfies the collection axiom there. In the latter case, the answer is always yes. – Mike Shulman Mar 04 '21 at 19:57
  • 2
    BZC is a material set theory that corresponds, under the material-structural correspondence, to ETCS, the structural set theory of a well-pointed topos in classical logic. Similarly, the structural collection axiom corresponds to the material collection axiom. Thus, if an arbitrary topos E has a stack semantics that satisfies classical logic, if you construct a material set theory inside its stack semantics you get a model of BZC + collection, hence of ZFC. However, this condition on E is much stronger than ordinary Booleanness, indeed it's equivalent to Booleanness + autology. – Mike Shulman Mar 04 '21 at 20:02
  • Well, I'm still confused. In the model of the original post, there is a generic filter $G$, different from the ultrafilter corresponding to $s$ (but refining it). So I don't see how $G$ itself corresponds in a meaningful way to a homomorphism of Boolean algebras. However, I see that I made a mistake above in asserting the existence of $s\in S$ given $\mathrm{Sh}_{\neg\neg}(S)\to V[G]$; I guess the relevant ultrafilter will only exist in $V[G]$, and may not lie in $S$. But I still don't have a really coherent picture of what's going on. – Peter Scholze Mar 04 '21 at 20:14
  • For example: I believe that proofs involving forcing work very much at the level of $\mathrm{Sh}{\neg\neg}(S)$ (or somewhat equivalently in terms of the forcing relation), so it seems that from this perspective one could as well take $\varinjlim{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$ as a replacement for the forcing extension. It would definitely feel more concrete to me, as all the data used to define it is defined within $V$. – Peter Scholze Mar 04 '21 at 20:21
  • (For example, MacLane-Moerdijk show that the above recipee gives models violating the continuum hypothesis, and if nothing else I feel like I have a better grasp of this model than of the other presentations of forcing I've seen.) – Peter Scholze Mar 04 '21 at 20:24
  • @PeterScholze You absolutely can. That's the "Boolean-valued model" approach, and is better compatible with the perspective that $V$ is the "entire" universe of sets. (If $V$ is already the whole universe, you can't adjoin a new homomorphism $G: B \rightarrow {0,1}$ to it: they're all there already. But also no (non-isolated) point of $S$ corresponds to a "generic" filter. In topos language, the double-negation topos is Grothendieck topos with no points.) – Jacob Lurie Mar 04 '21 at 20:24
  • Well, this thing is not a Boolean-valued model anymore, or is it? – Peter Scholze Mar 04 '21 at 20:25
  • It's $\mathrm{Sh}{\neg\neg}(S)$ that is (or contains) the Boolean-valued model. But I'm also confused: while the filterquotient of $\mathrm{Sh}{\neg\neg}(S)$ doesn't seem to be the usual set-theorists' forcing model, it does seem to be itself a model of ETCSR which has the same finitary properties as the Boolean-valued model (since the filterquotient functor is logical). In set-theoretic language, I guess maybe this would correspond to quotienting the Boolean-valued model by a (non-generic) ultrafilter that does exist. Is this something that set theorists do? What is it? – Mike Shulman Mar 04 '21 at 20:38
  • @MikeShulman Well, as Peter points out in his original post, a special case is taking ultrapowers of $V$, which is definitely something that set-theorists do. If I understand correctly, you're saying that every ultrafilter gives a model of ZFC? So one can view forcing extensions and ultrapowers as specializations of a more general procedure for making models? – Jacob Lurie Mar 04 '21 at 20:45
  • 7
    Apparently "boolean ultrapowers" is the key word, and it has been heavily considered, see https://arxiv.org/pdf/1206.6075.pdf – Dustin Clausen Mar 04 '21 at 20:48
  • Thanks Dustin!! This reference is extremely (extremally?) enlightening! – Peter Scholze Mar 04 '21 at 21:09
  • In particular, the construction discussed in the OP is a forcing extension, but not of $V$, but of an elementary extension of $V$, see Theorem 22. I'm really excited, everything seems to make so much more sense now! – Peter Scholze Mar 04 '21 at 21:12
  • Yeah, this really does seem like a nice point of view. Unfortunately after looking around some more I think I may have been exaggerating when I said "heavily considered". – Dustin Clausen Mar 04 '21 at 21:21
  • @PeterScholze Isn't it true that the construction of your original post is already an elementary extension of $V$ (with no forcing extension needed)? It's certainly true when $S$ is a Stone-Cech compactification, and I think the proof of Los' theorem shows this too (the essential content is that the direct image functor from the double negation topos to the usual topos of sheaves on $S$ is a map of pretopoi). – Jacob Lurie Mar 04 '21 at 21:32
  • 2
    @MikeShulman Quotienting a Boolean-valued model by a non-generic ultrafilter is one technique in the study of non-wellfounded models of ZFC. See for example Forcing with non-wellfounded models by Paul Corazza. – Timothy Chow Mar 04 '21 at 22:07
  • @JacobLurie I don't think so -- in MacLane-Moerdijk they show how to violate the continuum hypothesis using Cohen forcing, using this construction. I guess the proof of Los' theorem shouldn't go through. I thought the proof proceeds by spreading a statement from an ultrafilter in to a neighborhood and then specializing to isolated points. If there are no isolated points, I don't see how to argue. – Peter Scholze Mar 04 '21 at 22:17
  • @PeterScholze Let $S$ be an extremally disconnected space and let $s$ be a point. For every set $X$, I can take the constant sheaf $\underline{X}$ on $S$, sheafify with respect to the double negation topology, and then take the stalk at the point $s$. I believe that this defines a functor $F: \mathrm{Set} \rightarrow \mathrm{Set}$ that preserves finite limits, coproducts, and epimorphisms, and therefore carries models of any first-order theory (like ZFC + CH) to models of the same theory. The nontrivial ingredient is in Moerdijk and MacLane, Lemma 4 on page 519. – Jacob Lurie Mar 04 '21 at 22:38
  • 1
    Alternatively: I think what I'm saying is the content of Theorem 15 in the Hamkins and Seabold paper? – Jacob Lurie Mar 04 '21 at 22:52
  • 1
    @JacobLurie Yes, but in their Theorem 22 you have $V\subset \check{V}_U\subset \check{V}_U[G]=V^{\mathbb B}/U$, where the first elementary embedding is the one of their Theorem 15, and in the OP one considers $V\subset V^{\mathbb B}/U$, I think. So it's a generic extension over an elementary embedding. – Peter Scholze Mar 04 '21 at 23:00
  • @PeterScholze Ah, I think I see the source of my confusion. There's an operation of "ultrapower by $s \in S$" on models of ZFC, but it doesn't match with the construction of your original post under the (structural set theory $\leftrightarrow$ material set theory) dictionary because it will only "see" those sheaves $\mathscr{F}$ on $S$ which are locally constant on some open set $U$ satisfying $s \in \overline{U}$? – Jacob Lurie Mar 04 '21 at 23:58
  • Exactly! In sheaf-theoretic language, the elementary embedding is gotten by sheafifying the presheaf of categories taking any $U\subset S$ to $\mathrm{Set}$, i.e. the constant sheaf of categories. This is different from sheaves with values in $\mathrm{Set}$! I think your explicit description is correct. – Peter Scholze Mar 05 '21 at 07:26
  • I haven't managed to tease out the connection from the Hamkins-Seabold paper myself yet, but I think it means at least that the last three paragraphs of my answer are wrong. @PeterScholze, since it makes so much more sense to you now, do you think you could post a new answer below, with an explanation for the rest of us? – Mike Shulman Mar 05 '21 at 08:12
22

I think the language of classifying toposes is helpful in understanding this view of forcing.

Let $P$ be a poset. The set theorists have the intuition that forcing over $P$ adjoins a generic filter of $P$ to the universe, and in a similar way the classifying topos of a theory is what you get by freely adjoining a model of that theory to the base $\textbf{Set}$. Every topos is a classifying topos of some theory, and it turns out that $\textbf{Sh} (P, \lnot \lnot)$ can be regarded as classifying generic filters of $P$ – so, in some sense, a direct translation of the set theorists' intuition!

For convenience – otherwise we will have to use extremely cumbersome circumlocutions – we assume $P$ is a meet semilattice with binary meet $\land$ and top element $\top$. With some work (= mechanical application of Diaconescu's theorem), the topos $\textbf{Psh} (P)$ can be seen to classify models of the following propositional theory:

  • We have propositional symbols $\theta_p$ for each $p \in P$.
  • We have the axiom $(\theta_{p_1} \land \cdots \land \theta_{p_n}) \implies \theta_q$ whenever $p_1 \land \cdots \land p_n \le q$ in $P$. (The case $n = 0$ means $\theta_\top$ is an axiom.)

A model of this theory consists of subobjects $\Theta_p$ of the terminal object $1$ for each $p \in P$, such that $\Theta_{p_1} \times \cdots \times \Theta_{p_n}$ is a subobject of $\Theta_q$ whenever $p_1 \land \cdots \land p_n \le q$ in $P$. Thus, in $\textbf{Set}$, models are in bijective correspondence with filters of $P$, i.e. subsets of $P$ containing $\top$ and closed under $\land$. We may therefore say that $\textbf{Psh} (P)$ is the classifying topos for filters of $P$.

Subtoposes of a classifying topos correspond to extensions of the theory: the classifying morphism of some model factors through the subtopos if and only if the model satisfies the extended theory. So the topos $\textbf{Sh} (P, \lnot \lnot)$ must classify some special filters of $P$. Say $D \subseteq P$ is dense under $q \in P$ if, for every $p \in D$ we have $p \le q$ and for every $p \le q$ we have $p' \in D$ such that $p' \le p$. With some work (= Diaconescu's theorem again), we see that $\textbf{Sh} (P, \lnot \lnot)$ classifies models satisfying the following extension:

  • We have the axiom $\theta_q \implies \bigvee_{p \in D} \theta_p$ whenever $D \subseteq P$ is dense under $q \in P$.

In $\textbf{Set}$, models correspond to generic filters of $P$, i.e. filters $F \subseteq P$ such that if $q \in F$ and $D \subseteq P$ is dense under $q$ then there is $p \in D$ such that $p \in F$. Thus, we may say that $\textbf{Sh} (P, \lnot \lnot)$ classifies generic filters of $P$. (That said, $P$ may not actually have any generic filters in $\textbf{Set}$. This is the whole point of forcing!)

Now onto the filter-quotient stuff. Set theorists actually have a few different ways of thinking about forcing. The one that comes closest to the sheaf topos construction is forcing using boolean-valued models. To extract a $2$-valued model one chooses an ultrafilter $U$ and then reinterprets a proposition as true if its valuation is in $U$ and false otherwise. More generally, if $A$ is the complete boolean algebra of valuations and $U$ is some (not necessarily ultra) filter of $A$, then we have a boolean algebra $A / U$ obtained by quotienting $A$ by the relations $a \le b$ whenever $a \land u \le b$ for some $u \in U$. If $U$ is complete (i.e. closed under infinitary meets) then $A / U$ is furthermore a complete boolean algebra, and given an $A$-valued model $M$ we obtain an $A / U$-valued model $M / U$ by applying the quotient map $A \to A / U$ to the valuations. In the case that $U = \{ a \in A : b \le a \}$ for some $b \in A$, $A / U$ is isomorphic as a poset to $A_b = \{ a \in A : a \le b \}$. For general filters $U$, it is clear that $A / U \cong \varinjlim_{b \in U^\textrm{op}} A_b$. If $U$ is an ultrafilter we also have $M / U \cong \varinjlim_{b \in U^\textrm{op}} M_b$, but here $M_b$ is defined to be the structure obtained by interpreting a proposition to be true if its valuation is $\ge b$ and false otherwise. (Warning: $M_b$ is usually not a model of the theory of interest!) The topos-theoretic filter-quotient construction is the analogue of this construction.

Here, we have some localic boolean topos $\mathcal{E}$ and we take $A$ to be the complete boolean algebra of subobjects of $1$ in $\mathcal{E}$. When $U$ is a filter, the resulting $\mathcal{E} / U$ will inherit finitary structures preserved by the inverse image functor of open embeddings – so $\mathcal{E} / U$ will have finite limits, finite colimits, power objects, exponentials etc. – but I think one has to do some work to verify that properties like, say, the failure of the continuum hypothesis are also inherited. It should go without saying that $\mathcal{E} / U$ will usually fail to be a Grothendieck topos.

Finally, regarding the internal/external distinction. All of the above is from the external perspective – in the sense that $P$ is a poset in the metatheory – and the role of $\textbf{Set}$ is special. But just as the set theorists can apply forcing to the "real" universe of sets, one can construct the topos of $\lnot\lnot$-sheaves on a poset $P$ inside a general boolean elementary topos $\mathcal{S}$. What I do not know is how to (elegantly) internalise the filter-quotient construction in this scenario. One difficulty is that $P$ is a poset internal to $\mathcal{S}$, and if $\mathcal{S}$ is not well pointed then taking the global points of $P$ loses information. It should be doable, though.

Zhen Lin
  • 14,934
  • Thanks, this is certainly helpful! Do you also have an idea on the answer to Question 1? – Peter Scholze Mar 04 '21 at 15:40
  • No, sorry. The usual yoga of filtered colimits would suggest that the filter-quotient has whatever finitary properties each of the toposes appearing in the colimit has and which are preserved by the pullback functors in question, but I think that is not enough to answer the question of replacement. – Zhen Lin Mar 04 '21 at 15:51
  • 7
    The paragraph that begins "In Set, models correspond to generic filters of $P$," though absolutely correct, could give people the impression that there are such filters in Set. For the posets of interest in forcing, there are no generic filters in the real world Set. This phenomenon is no problem (topoi without points have been around for a long time), but I want to make sure people are aware of it. – Andreas Blass Mar 04 '21 at 16:18
  • 1
    Right, of course. I just tried to fix some inaccuracies in my discussion of boolean valued models and it now occurs to me that I don't actually understand some details. In the usual setup we take a complete boolean algebra so that we can interpret quantifiers in the naïve way using infinitary meets and joins, but if we do that then wouldn't we need a complete ultrafilter in order to get a $2$-valued model? Where do interesting complete ultrafilters come from? – Zhen Lin Mar 05 '21 at 02:55
  • @ZhenLin Your questions made me realize I also don't understand this stuff. Also it reminded me of our old paper Distribution algebras and duality (with Bunge, Funk and Streicher). There it is shown that for an $\mathscr S$-topos $\mathscr E\to\mathscr S$, indexed right adjoints $\mathscr S\to\mathscr E$ classify $\mathscr S$-complete atomic Heyting algebras (which need not be internally complete). – მამუკა ჯიბლაძე Mar 05 '21 at 04:49
  • I wonder if something similar happens here, so complete ultrafilters might correspond to adjunctions which are not geometric morphisms since left adjoints are not required to preserve finite limits? – მამუკა ჯიბლაძე Mar 05 '21 at 04:50
  • @ZhenLin No you don't need complete ultrafilters! They usually don't exist (even countably complete ones), unless you assume some large cardinal hypothesis (measurability). But it is slightly confusing. Note that while $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$ is model of ETCSR, and thus thinks of itself to have all small colimits, it does not actually have all small colimits when viewed as a category in $V$. So in passing to the quotient by the ultrafilter, the new colimits are different, which is (I believe) why you do not need a map of complete Boolean algebras. – Peter Scholze Mar 05 '21 at 07:35
  • @PeterScholz I'm talking about the boolean-valued model framework. An $A$-valued structure is, roughly speaking, a structure in the topos $\textbf{Sh} (A)$ where here we use the canonical coverage of $A$. (The traditional definition is subtly different.) Extracting a $2$-valued model corresponds to applying a functor $\textbf{Sh} (A) \to \textbf{Set}$ that respects the logic of interest. For geometric logic that would be a functor that preserves finite limits and arbitrary colimits – these are in 1-1 correspondence to complete ultrafilters of $A$. I don't know how to handle first-order logic. – Zhen Lin Mar 05 '21 at 08:52
  • Yes, but the arguments here are not happening in geometric logic (so arguably classifying topoi are slightly besides the point). I agree it's confusing... – Peter Scholze Mar 05 '21 at 08:54
18

Thanks for all the enlightening answers! Let me summarize my understanding now. (Please correct me if I'm saying something stupid!)

First, as explained by Mike Shulman in his answer, the answer to Question 1) is Yes: $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$ is a model of ETCSR. So if you, like me, always wanted to have a hands-on understanding of what forcing does, I think you can go with this explicit model (and do away with all meta-mathematical baggage surrounding either boolean models, or countable base models, and generic filters).

It remains to answer Question 2): How is this related to forcing? The answer seems to be that it is a generic extension $\mathrm{Set}_s[G]$ of $\mathrm{Set}_s$ where $\mathrm{Set}_s$ is an elementary extension of $\mathrm{Set}$.

More precisely, recall the concept of "boolean ultrapowers", which is "ultrapowers with Stone-Cech compactifications replaced by a general extremally disconnected profinite set $S$". Namely, given $S\ni s$ as above, and any model $M$ of some first-order theory, we can look at the constant sheaf with value $M$ on $S$ with respect to the double-negation topology, and take its stalk at $s\in S$. (Warning: This is not really a stalk! I.e., $s$ does not define a point of the topos of double-negation sheaves. By "stalk" I just mean the colimit $\varinjlim_{U\ni s}$; equivalently, push forward to usual sheaves on the topological space $S$, and take the stalk there.) This defines a new model $M_s$ of the first-order theory. (If $S$ is the Stone-Cech compactification of $S_0$ and $s\in S$ is an ultrafilter on $S_0$, then the sheafification is taking any open and closed subset $U\subset S$, corresponding to some subset $U_0\subset S_0$, to $\prod_{U_0} M$, and then the stalk is $\varinjlim_{s\in \beta U_0} \prod_{U_0} M$, i.e. an ultraproduct.) This procedure can be applied in particular to a model of ZFC, or a model of ETCSR, with the procedures being equivalent. This construction features prominently in this beautiful article of Hamkins–Seabold.

The Boolean ultrapower $\mathrm{Set}_s$ of $\mathrm{Set}$ (at $s\in S$) is thus given by the stalk at $s\in S$ of the constant sheaf of categories with value $\mathrm{Set}$. The constant sheaf of categories with value $\mathrm{Set}$ is different from set-valued sheaves! However, there is a natural functor, in particular giving a functor $$\mathrm{Set}_s\to \varinjlim_{U\ni s}\mathrm{Sh}_{\neg\neg}(U).$$

As in Zhen Lin's answer, the topos $\mathrm{Sh}_{\neg\neg}(S)$ classifies generic filters $G$. In particular, it contains such a generic filter itself, given by $$ G=\bigsqcup_{U\subset S} U.$$ (Here, $U$ is identified with the sheaf it represents.)

Now I believe that the discussion around Theorem 22 in Hamkins-Seabold should translate into the following statement.

$\mathrm{Set}_s$ is an elementary extension of $\mathrm{Set}$, $G$ is a generic filter over $\mathrm{Set}_s$, and $\varinjlim_{U\ni s} \mathrm{Sh}_{\neg\neg}(U)$ identifies with the generic extension $\mathrm{Set}_s[G]$ of $\mathrm{Set}_s$.

(What is $\mathrm{Set}_s[G]$ here? It corresponds to the (material) forcing extension by $G$ under translating back-and-forth between structural and material set theory.)

Finally, how is all of this related to usual forcing? In usual forcing, one first makes an auxiliary enlargement $V\subset \tilde{V}$ (i.e. $\mathrm{Set}\to \tilde{\mathrm{Set}}$) so that after this enlargement, "there are new elements of $S$" in the sense that the corresponding complete Boolean algebra $B$ admits new homomorphisms $B\to \{0,1\}$. One can then redo the previous construction, but take the stalk at a "new point" $s\in \tilde{S}$ (where $\tilde{S}$ is the spectrum of $B$ in $\tilde{V}$). All of the above should still work in this setting, giving us $\mathrm{Set}\to \mathrm{Set}_s\to \mathrm{Set}_s[G]$. However, if $s$ corresponds to a $V$-generic filter, then it turns out that $\mathrm{Set}_s=\mathrm{Set}$. So in this case the forcing extension is really one of $\mathrm{Set}$ itself, not of an elementary extension.

This also answers my final confusion in the OP: If $S$ is a Stone-Cech compactification, then $\mathrm{Sh}_{\neg\neg}(S)$ is in fact the constant sheaf of categories with value $\mathrm{Set}$, so in particular $G$ already lies in $\mathrm{Set}_s$ for all $s\in S$, and $\mathrm{Set}_s[G]=\mathrm{Set}_s$. Thus, the forcing extension is trivial, but the elementary extension $\mathrm{Set}\to \mathrm{Set}_s$ to the ultrapower remains.

Why does one usually want generic filters? I think the point is that the extension $\mathrm{Set}_s\subset \mathrm{Set}_s[G]$ always preserves ordinals (in its material incarnation). By contrast, $\mathrm{Set}\subset \mathrm{Set}_s$ very much does not, and in fact $\mathrm{Set}_s$ usually has nonstandard integers; in the material incarnation, this leads to non-wellfounded models. So one uses generic filters to make $\mathrm{Set}=\mathrm{Set}_s$.

From the structural point of view, wellfoundedness is not visible, but of course it may still be disconcerting that the extension has nonstandard integers.

Peter Scholze
  • 19,800
  • 3
  • 98
  • 118
  • Could you tell more about the place where you push forward a model from double negation sheaves to ordinary sheaves on $S$? The only pushforward that seems to be available is the right adjoint to the restriction: this restriction hardly ever in this case has a left adjoint. Now the right adjoint preserves anything equational, but is not obliged to preserve any existential statements that might happen to be among axioms of the theory in question, so it might destroy models of some important theories, no? – მამუკა ჯიბლაძე Mar 05 '21 at 09:27
  • 2
    The relationship between $\textbf{Set}_s$ and $\textbf{Set}_s [G]$ is something I never thought about before, very interesting. The functor $\textbf{Set}_s \to \textbf{Set}_s [G]$ is a filtered colimit of inverse image functors so it should have pleasant properties, but I suppose it is not actually an inverse image functor in general. (I think it would have to be an equivalence if it were.) – Zhen Lin Mar 05 '21 at 09:47
  • 1
    @მამუკაჯიბლაძე I think (as mentioned by Jacob Lurie in the comments below Mike Shulman's answer) Lemma 4 on page 517 of MacLane-Moerdijk is relevant to this question. In the notation of his comment, this ensures that this endofunctor of $\mathrm{Set}$ preserves surjections, which should lead to preservation of existential axioms. – Peter Scholze Mar 05 '21 at 09:52
  • 2
    @ZhenLin: Can you cast that statement about filtered colimit of inverse image functors into the world of material set theory? – Asaf Karagila Mar 05 '21 at 09:59
  • @AsafKaragila Not really... the functor corresponds to the inclusion of the ultrapower into the extension (of the ultrapower), which in material set theory obviously has pleasant properties such as preserving finite disjoint unions and quotients and finite products and so on. If the inclusion were actually an inverse image functor then it should be the case that every set in the extension is functorially isomorphic to a set in the ultrapower, which should imply that the extension is trivial. But I haven't thought about this properly. – Zhen Lin Mar 05 '21 at 10:32
  • @ZhenLin: I suppose that you're describing the maps $i_p$, for a condition $p$, which are interpreted as $p\Vdash\dot x\in\dot y$ and $p\Vdash\dot x=\dot y$, right? (So $i_p(\dot x)={\dot y\mid p\Vdash\dot y\in\dot x}$ or something like that.) – Asaf Karagila Mar 05 '21 at 10:35
  • @ZhenLin: And so in that case, asking if the functor is itself an inverse image functor would be assigning a canonical name for every set in the generic extension, in a way that "makes sense" somehow. (Which I suspect is something that you can't definably do in ZFC, even with global choice, because it would somehow let you decode the generic filter in the ground model or something, and that should be a known thing.) – Asaf Karagila Mar 05 '21 at 10:43
  • 1
    I'm not comfortable with the forcing language / names approach, but I don't think so? Also there is the subtlety with boolean valued models I mentioned in the comments to my answer that I haven't figured out, so I'm not really sure how well this translates to material set theory. I can probably write a precise question if someone wants to try to answer... – Zhen Lin Mar 05 '21 at 11:00
  • @ZhenLin: That's fine, I suppose. But it's not that important for you to spend more time on this than you have to. Thanks! – Asaf Karagila Mar 05 '21 at 11:13
  • 1
    @PeterScholze Thank you for the explanation! And, sorry - another doubt. That lemma says that the pushforward preserves finite epimorphic families. But I don't think it preserves coproducts, even finite ones. For example, if I am not mistaken, pushforward of $1\sqcup1$ is not $1\sqcup1$: the value of the latter on $U$ is the set of open and closed subsets of $U$, while for the former it is the set of open and closed subsets of the closure of $U$, which is the same only if $U$ is itself open and closed. – მამუკა ჯიბლაძე Mar 05 '21 at 11:31
  • Accordingly, I don't think the endofunctor on Sets mentioned by @JacobLurie in that comment preserves coproducts. So then, might not there be problems with axioms involving disjunction? Also, what about infinite epimorphic families? These might become involved if there are some axiom schemata, for example. – მამუკა ჯიბლაძე Mar 05 '21 at 11:32
  • Open and closed subsets of $U$ are the same as open and closed subsets of the closure of $U$, since $S$ is extremally disconnected. Alternatively, you can compute the stalk at $s$ using only clopen neighborhoods. The functor preserves only finite coproducts, not arbitrary ones. – Jacob Lurie Mar 05 '21 at 11:37
  • @JacobLurie But $1\sqcup1\to i_i^(1\sqcup1)$ is not an isomorphism, right? So, some of its stalks must also fail to be isomorphisms, as $\operatorname{Sh}(S)$ has enough points, being just the topos of sheaves on a space. – მამუკა ჯიბლაძე Mar 05 '21 at 11:40
  • 1
    It is an isomorphism. Abstract proof: since $i_* i^$ is a left exact functor, the map $i_ i^( \mathscr{F} ) \coprod i_ i^(\mathscr{G} ) \rightarrow i_ i^* (\mathscr{F} \coprod \mathscr{G} )$ is automatically a monomorphism. The epimorphism statement is the one you have to worry about (and is the content of that Lemma in Moerdijk-MacLane). Concretely: the operation $V \mapsto \overline{V}$ and $\overline{V} \mapsto U \cap \overline{V}$ define mutually inverse bijections from clopen subsets of $U$ to clopen subsets of $\overline{U}$, because extremally disconnected spaces are bizarre. – Jacob Lurie Mar 05 '21 at 11:45
  • @JacobLurie Oh I see, thank you! So only infinite coproducts remain. Well, I think I can be persuaded that they do not create any problems... – მამუკა ჯიბლაძე Mar 05 '21 at 11:54
  • The functor definitely doesn't preserve infinite coproducts. But the axioms of ZFC do not involve infinite disjunctions... – Jacob Lurie Mar 05 '21 at 11:56
  • 1
    @Jacob: ... yet! I'm writing one, it's just taking a while! :-) – Asaf Karagila Mar 05 '21 at 11:57
  • @AsafKaragila take your time... :P – მამუკა ჯიბლაძე Mar 05 '21 at 11:58
  • 3
    Fun fact: On Hamkins' webpage the blackboard on the title photo is about boolean ultrapowers. – Peter Scholze Mar 05 '21 at 13:26
15

I can't really answer your question, since they are outside my field of expertise. But until Mike and others come to answer, let me make a long comment about the following sentence:

Note that in usual presentations of forcing, if one wants to actually build a new model of ZFC, one has to first choose a countable base model $M$.

While true, this is not quite the case in the usual applications of forcing. Indeed, we normally force over the universe. This is really the whole point of forcing: it is internal to the model you're working with, rather than external. The external part is the generic filter (which we can eliminate, if we are willing to forego well-foundedness), but everything else happens inside the model.

The usual presentation of forcing uses models, because the uses of forcing are to prove consistency results. It's easier to apply this to models, than to develop Boolean-valued logic and make arguments about that. The approach via models also clarifies how forcing works, what are the names, and what the forcing relation truly means.

On the other hand, if you're not interested in preserving well-foundedness (and therefore, by extension adding new "ordinals") you can force over the universe with a non-generic filter that meets "enough dense sets", and use it to interpret the names. This will define a class and a binary relation on it, which will interpret correctly "enough of set theory", but will not be well-founded.

This seems odd in the standard context of forcing, since one of the nice properties of forcing is that it doesn't adds ordinals, so it doesn't violate well-foundedness. But if you're looking at sheaves, there's no notion of a well-founded model, so there's no reason to worry about that.

Asaf Karagila
  • 38,140