Zulip Chat Archive

Stream: general

Topic: category theory design

Scott Morrison (Aug 12 2018 at 13:19):

I spent some time this afternoon trying out three different implementations of products/equalizers/pullbacks. If anyone has a moment to have a look at them, I'd very much appreciate it. It's all in <https://github.com/semorrison/lean-category-theory-pr/blob/limits/src/categories/universal/default.lean>.

Scott Morrison (Aug 12 2018 at 13:20):

To try them out, I proved that the category of types has equalizers, pullbacks, and binary products.
Rather beautifully, using obviously you can write exactly the same proof for all three versions:
you just specify the shape, and obviously deals with the variations in what's required to check the universal
properties.

Scott Morrison (Aug 12 2018 at 13:21):

e.g.

instance : has_equalizers (Type u) :=
{ equalizer := λ Y Z f g, { X := { y : Y // f y = g y }, ι := subtype.val, w := by obviously, h := by obviously } }

Kevin Buzzard (Aug 12 2018 at 13:23):

I feel out of my depth commenting on your work Scott. For me the proof of the pudding will be in the eating. Are we still a long way from me defining a presheaf on the adic space Spa A to be a functor from some category of open sets to the category whose objects are pairs consisting of a topological ring and some open subring with some properties?

Scott Morrison (Aug 12 2018 at 13:24):

We're ... getting there.

Scott Morrison (Aug 12 2018 at 13:25):

For me the eating is all about the fact that I don't need to write any proofs of boring statements, and _that_ I'm feeling pretty happy about.

Mario Carneiro (Aug 12 2018 at 13:25):

Could we have a coercion from span and friends to the object?

Scott Morrison (Aug 12 2018 at 13:25):

The big unknown step in getting to what you want is that we'll need to do enriched categories, and I haven't even started thinking about that.

Scott Morrison (Aug 12 2018 at 13:25):

Sure, those coercions would be nice.

Scott Morrison (Aug 12 2018 at 13:26):

I went with very short, nondescript field names for these structures, but attempted to be very uniform. To me the naming looks terrible, so if anyone has comments there I'm all ears. :-)

Mario Carneiro (Aug 12 2018 at 13:28):

it didn't cry out to me as much as previous commits, so I guess that's a step forward?

Scott Morrison (Aug 12 2018 at 13:28):

:-)

Mario Carneiro (Aug 12 2018 at 13:28):

uniform naming is good

Scott Morrison (Aug 12 2018 at 13:29):

I really love @Reid Barton's approach, which I implemented as version_2 in that file. The fact that obviously just worked out the box to prove the category of types has all these things in Reid's approach made me very happy.

Mario Carneiro (Aug 12 2018 at 13:30):

what is is_equiv?

Scott Morrison (Aug 12 2018 at 13:30):

Having obviously handle version_1 (essentially what @Mario Carneiro and I discussed earlier today) took a bit of tweaking, and in the end requires two hints.

Scott Morrison (Aug 12 2018 at 13:31):

It's defined at the top of the file, as is_iso in the category of types: an explicit inverse function

Scott Morrison (Aug 12 2018 at 13:31):

I can change it to something equiv, of course. :-)

Mario Carneiro (Aug 12 2018 at 13:31):

I'm not sold on maximum compactification of statements. We did a lot of this in the filter/topology section, and it often made proofs longer

Scott Morrison (Aug 12 2018 at 13:32):

Ok. Well, for now it's easy for me to continue experimenting with all three versions. They have very similar interface, except for how the universal properties are packaged.

Scott Morrison (Aug 12 2018 at 13:33):

And for the first thing I tried to prove, all three versions admitted (the same) "shortest possible proofs".

Scott Morrison (Aug 12 2018 at 13:34):

Just as an example, in version_2 the proof of the field h in has_equalizers that obviously finds is:

/- obviously says: -/
intros X',
dsimp at *,
fsplit,
work_on_goal 0 {
  intros a a_1,
  cases a,
  have congr_fun_a_property_a_1 := congr_fun a_property a_1,
  fsplit,
  work_on_goal 0 {
  solve_by_elim
},
  work_on_goal 0 {
  simp at *,
  solve_by_elim
}
}, work_on_goal 0 {
  ext1,
  ext1,
  ext1,
  refl
}, work_on_goal 0 {
  ext1,
  ext1,
  ext1,
  cases x,
  refl
}

Scott Morrison (Aug 12 2018 at 13:35):

where in version_0 obviously finds

/- obviously says: -/
fsplit,
work_on_goal 0 {
  intros X' k w a,
  have congr_fun_w_a := congr_fun w a,
  dsimp at *,
  fsplit,
  work_on_goal 0 {
  solve_by_elim
},
  work_on_goal 0 {
  simp at *,
  solve_by_elim
}
}, work_on_goal 0 {
  intros X' k w,
  refl
}, work_on_goal 0 {
  dsimp at *,
  fsplit,
  intros Z_1 g_1 h w,
  ext1,
  have congr_fun_w_x := congr_fun w x,
  ext1,
  solve_by_elim
}

Mario Carneiro (Aug 12 2018 at 13:36):

where is unfolding of the definitions happening?

Mario Carneiro (Aug 12 2018 at 13:36):

what is fsplit

Scott Morrison (Aug 12 2018 at 13:36):

fsplit is just split, but not reordering the new goals

Scott Morrison (Aug 12 2018 at 13:36):

sorry, needs a better name

Scott Morrison (Aug 12 2018 at 13:37):

(i.e. the goals for fsplit on a structure are the fields in order, whereas for split the dependent ones come first.)

Mario Carneiro (Aug 12 2018 at 13:41):

Another version of version_1 is to use Σ!

Mario Carneiro (Aug 12 2018 at 13:42):

which was once created but doesn't seem to exist anymore

Mario Carneiro (Aug 12 2018 at 13:42):

but you can guess what the definition was given ∃!

Mario Carneiro (Aug 12 2018 at 13:45):

one thing I like about version_0.is_binary_product is that the uniqueness is equational

Mario Carneiro (Aug 12 2018 at 13:45):

Do you know if there is an equational expression for the others?

Mario Carneiro (Aug 12 2018 at 13:46):

I don't really understand version_0.is_equalizer.uniq

Scott Morrison (Aug 12 2018 at 13:49):

Hmm... maybe I got that one wrong. Let's see. Uniqueness is meant to say if you have a function h : X' -> Y so h then f = h then g, and two different factorisations of h as h_1 then i and h_2 then i, then h_1 = h_2.

Mario Carneiro (Aug 12 2018 at 13:50):

I think for mathlib, I'll vote for version_0 with the others proved as lemmas

Mario Carneiro (Aug 12 2018 at 13:50):

I know it's wordy, but it's also perspicuous

Mario Carneiro (Aug 12 2018 at 13:51):

we should save the slick version for a theorem

Mario Carneiro (Aug 12 2018 at 13:52):

you will notice that https://en.wikipedia.org/wiki/Product_(category_theory) gives essentially version_0 as its definition

Scott Morrison (Aug 12 2018 at 13:56):

I think what I said for version_0.is_equalizer.uniq is correct, but absolutely that should be a lemma, not the official statement.

Scott Morrison (Aug 12 2018 at 14:00):

Hopefully @Reid Barton will have a look later, and comment too. He's already implemented quite a lot using something very close to version_2.

Reid Barton (Aug 14 2018 at 17:50):

Hi, I've been traveling recently (and also dealing with a broken laptop power supply) but I should be able to take a look within a couple of days.

Reid Barton (Aug 20 2018 at 18:34):

Returning to this topic. I have a few general comments.

Reid Barton (Aug 20 2018 at 18:40):

I think it's rather important to include the is_* structures (like is_pullback, etc.) and to define the structures like pullback in terms of them. The is_pullback type is useful to express concepts like "the functor G : D -> C preserves pullbacks", that is, "the image under G of a pullback diagram is again a pullback", especially in the absence of completeness assumptions on D and C. In terms of just pullback, the conclusion would have to include an existential and a bunch of equalities.

All of Scott's versions in the file linked above follow this form. I only mention point because I think Scott's library used to define pullback without is_pullback, and I've become convinced of the importance of the is_* structures from experience with applications.

Reid Barton (Aug 20 2018 at 18:55):

In the end, I don't think it's terribly important for applications which definition is adopted as the official definition, especially since projection notation makes it pretty easy for a structure to behave as though it has an entirely different set of fields. The biggest effect will be when constructing is_pullback values, since as far as I know the record { ... } notation can only be used with the actual fields of the structure. For this reason alone, maybe taking the "explicit" version as the definition would be preferable, as it has the most fields (there's not much pain in using an explicit function as an "alternative constructor", when that function has just one or two arguments).

I did find both the "explicit" and "bijective" styles useful in different situations. I'm slightly surprised that obviously was able to provide all the limit constructions for the category of sets, I mean, types, using any of the definition styles. I guess tauto is pretty good at guessing how to do product-type constructions. I suspect if you tried to build colimits, you'd need to do more hand-holding. At any rate, when I proved that topological spaces have coproducts and coequalizers, I found it easier to do the construction using the "explicit" style; when I thought about what I would need to do in the "bijective" style, it amounted to checking the "explicit" style conditions first anyways. On the other hand, for proving facts about colimits, I made use of the "bijective" style using some bij_on machinery. This wasn't that easy to set up, but I'm reasonably happy with how the proofs turned out. They seem preferable to endless diagram chases--unless possibly the computer can just do the entire diagram chasing argument, in which case I won't complain about how long it is!

Reid Barton (Aug 20 2018 at 19:00):

About constructive vs classical universal properties--I've been assuming that the constructive notions are preferable, but I haven't actually experimented with the alternative, that is, making the is_* structures into Props rather than subsingletons. All I can report here is that using the constructive notions hasn't presented me with any major obstacles.

Reid Barton (Aug 20 2018 at 19:02):

I find it a bit curious that Mario thinks the wikipedia definition is essentially Scott's "explicit" definition. To me, it's not even obvious that the "explicit" is_product structure is a subsingleton, but wikipedia definitely defines a proposition!

Reid Barton (Aug 20 2018 at 19:03):

The "explicit" definition looks like someone took the wikipedia definition and ran it through a compiler, and what came out is this structure with four fields.
In math we don't even really have a specific name for the function which takes a map into X and a map into Y and produces a map into X x Y, besides the "induced" map or something.

Reid Barton (Aug 20 2018 at 19:08):

But in any case, the order is different. First we require a property of the (potential) product diagram, "for all ..., there exists a unique ... such that blah blah", and then call the diagram a product. Then, optionally, we could name the function that is determined by this relationship, but mostly we don't bother.

Reid Barton (Aug 20 2018 at 19:09):

With a constructive unique existential quantifier, you could just translate the definition word-for-word into Lean.
But in any case, like I said above, I don't think the exact choice of definition is that important.

Reid Barton (Aug 20 2018 at 19:23):

Here's a very simple example of using the "bijective" style to prove a fact about products: namely that products are associative up to isomorphism. Suppose I have (X x Y) x Z and X x (Y x Z). Then for any object A, I get, on the one hand, a bijection between Hom(A, (X x Y) x Z) and (Hom(A, X) x Hom(A, Y)) x Hom(A, Z), and on the other hand a bijection between Hom(A, X x (Y x Z)) and Hom(A, X) x (Hom(A, Y) x Hom(A, Z)). Composing these together with the associativity isomorphism for products of sets, I get an isomorphism Hom(A, (X x Y) x Z) ~= Hom(A, X x (Y x Z)), which means that I have an isomorphism between (X x Y) x Z and X x (Y x Z), by Yoneda.

Johan Commelin (Aug 20 2018 at 19:25):

Is Yoneda already in Lean?

Reid Barton (Aug 20 2018 at 19:26):

If I were to use the explicit/first-order definition of products directly, I would need to manually build the comparison map by pairing the maps (X x Y) x Z -> X and (X x Y) x Z -> Y x Z given by, respectively, projection on the first component twice and (the pairing of the maps (X x Y) -> Y and ...), and then construct the inverse map in the same fashion, and then finally check that the compositions are the identity, which means checking that each projection is the identity, which is going to involve a whole lot of applications of the axioms of a product object.

Reid Barton (Aug 20 2018 at 19:27):

Similarly, I would prefer to show that right adjoints preserve products by a calculation along the lines of
Hom(A, G(X x Y)) = Hom(FA, X x Y) = Hom(FA, X) x Hom(FA, Y) = Hom(A, GX) x Hom(A, GY)
although I think that in this case, the first-order approach is also not so difficult

Reid Barton (Aug 20 2018 at 19:32):

I think Scott's library has some version of Yoneda. The version I used is just a couple lines to prove, although I did sweep some questions of naturality in A under the rug.

Reid Barton (Aug 20 2018 at 19:36):

(For reference, I proved that left adjoints preserve pushouts in https://github.com/rwbarton/lean-homotopy-theory/blob/lean-3.4.1/src/categories/preserves_colimits.lean#L44-L82, although the proof is still not as polished as I'd like. The basic structure is analogous to what I wrote above though.)

Johan Commelin (Aug 20 2018 at 19:49):

I would like to first have a proof that left adjoints preserve arbitrary colimits, and then deduce the fact about push-outs by specialisation. How hard do you think that will be?

Reid Barton (Aug 20 2018 at 19:59):

I had a hard time when I tried to deduce existence of pullbacks from completeness quite some time ago, but now I think I know how to go about it.

Reid Barton (Aug 20 2018 at 20:01):

One should describe the indexing category for the pullback in terms of generators and relations (of course, there are no relations in this case) because one needs to describe functors from the indexing category in terms of concrete data

Scott Morrison (Aug 21 2018 at 19:31):

Thanks, @Reid Barton, for all the remarks. I'm rewriting my file about limits again now, and I think what I'm doing is compatible with everything you've said. The "explicit" version comes first, and the other two viewpoints are available as lemmas and alternative constructors.

Scott Morrison (Aug 21 2018 at 19:34):

@Johan Commelin I have Yoneda at <https://github.com/semorrison/lean-category-theory/blob/master/src/categories/yoneda.lean>. obviously just plows through all the boring bits.

Johan Commelin (Aug 22 2018 at 05:25):

I'm also interested in what the plans are for the limit/colimit duality? Are we going to define things twice, or in terms of op? Or both? I don't have any experience with this... but I guess it is something that will pop up pretty soon.

Scott Morrison (Aug 22 2018 at 05:27):

For now I'm going to write out everything twice, with painful copy-paste-replace.

Scott Morrison (Aug 22 2018 at 05:27):

:-(

Scott Morrison (Aug 22 2018 at 05:28):

I've been thinking I should learn how to write an analogue of to_additive, or something even more general.

Scott Morrison (Aug 22 2018 at 05:28):

We certainly need to have separate definitions for limits and colimits (and all the specialisations, equalizers, coequalizers, etc), and theorems relating a limit and a colimit in the opposite category.

Scott Morrison (Aug 22 2018 at 05:29):

The only question is whether the "other half" of the definitions are written by hand. (And whether those theorems are automatically generated.)

Johan Commelin (Aug 22 2018 at 05:32):

Hmmm, to_opposite or to_dual might be a nice way of dealing with this! Especially if it could also generate the "linking" theorems.

Johan Commelin (Aug 22 2018 at 05:33):

I remember that some time ago there was a bit of chat about whether op op C was defeq to C. Would that help here?

Scott Morrison (Aug 22 2018 at 06:02):

Certainly op op C is defeq to C, since op C := C. I think you mean

variables {C : Type u₁} [𝒞 : category.{u₁ v₁} C]
include 𝒞

@[simp] lemma foo : @category_theory.opposite (Cᵒᵖ) (@category_theory.opposite C 𝒞) = 𝒞 := sorry

which is not a rfl lemma.

Scott Morrison (Aug 22 2018 at 06:04):

Of course it is true, e.g. by

@[simp] lemma foo : @category_theory.opposite (Cᵒᵖ) (@category_theory.opposite C 𝒞) = 𝒞 :=
begin
  tactic.unfreeze_local_instances,
  cases 𝒞,
  unfold category_theory.opposite,
  congr,
end

Scott Morrison (Aug 22 2018 at 06:04):

but I think the natural isomorphism is more useful.

Johan Commelin (Aug 22 2018 at 07:29):

Right, I meant your foo.

Reid Barton (Aug 22 2018 at 14:41):

Oh interesting. I read a paper on the design of a category theory library for HoTT/Coq, and they had to jump through some extra hoops to account for the fact that associativity in C^op is the eq.symm of associativity in C

Reid Barton (Aug 22 2018 at 14:41):

But I guess that doesn't show up here, because of proof irrelevance.

Reid Barton (Aug 22 2018 at 15:11):

The op op C = C question comes up if you want to dualize colimits back into limits.

Reid Barton (Aug 22 2018 at 15:12):

Let's suppose that we've defined a diagram in C to be a pushout when the corresponding diagram in C^op is a pullback

Reid Barton (Aug 22 2018 at 15:13):

Now we have some result about pushouts, for example: In a model category, a pushout of a cofibration is a cofibration.

Reid Barton (Aug 22 2018 at 15:14):

And we want to conclude the dual statement, namely that a pullback of a fibration is a fibration. The opposite of a model category is again a model category, with cofibrations and fibrations interchanged, so this is true by duality.
(For whatever reason, it's normal in the model category literature to prove things on the "left" side and deduce the corresponding statements on the "right" side by duality, so let's just assume that we want to do it this way rather than the other way around.)

Reid Barton (Aug 22 2018 at 15:16):

The obvious approach is to apply the statement we already know to C^op, since C^op is also a model category.
Then we obtain: A pushout (in C^op) of a cofibration (in C^op) is a cofibration (in C^op).
We know that a cofibration in C^op corresponds to a fibration in C. Now we have a pushout in C^op. By definition, this is a pullback in (C^op)^op.

Reid Barton (Aug 22 2018 at 15:16):

In order to conclude the statement we want, we need to know (C^op)^op = C.

Reid Barton (Aug 22 2018 at 15:18):

Which is true, they are propositionally equal.
If we have some tactic which proves the dual version of a statement from the original statement, then it can insert this propositional equality and it's no big deal.
If we want to directly define the proof of the dual statement as equal to the original proof (with C^op substituted for C), then this will only succeed if (C^op)^op = C definitionally.

Reid Barton (Aug 22 2018 at 15:20):

And it's frustratingly close to being true--the proof is just this cases 𝒞. If we had definitional equality for structures, then (C^op)^op would be definitionally equal to C, I think.

Reid Barton (Aug 22 2018 at 15:22):

If, instead of proving a theorem ("the pushout of a cofibration is a cofibration"), we were constructing a canonical isomorphism or something, I can imagine that the propositional equality (C^op)^op = C could get in the way of later reasoning about the thing we constructed

Last updated: May 02 2025 at 03:31 UTC