Zulip Chat Archive
Stream: general
Topic: Reducing environment size
Joachim Breitner (Oct 06 2023 at 09:51):
It (maybe unreasonably) bothered me a bit that the loogle deployment size contains 2.8 GB of Mathlib olean when it really doesn't need all the information therein. So I experimented with pruning from the environment the data that loogle doesn’t need.
- Removing the
valueof all theorems it went down to 1.8GB (I was actually expecting a bigger portion of the olean to be the proof terms. Hmm.) - Also removing the
valueof all defns brought it down to 1.3GB (This is not correct, allreducibledefinitions need to be kept around. But loogle is a bit buggy with regard toreducibledefinitions anyways, it should probably look through them consistently.)
That seems to be the low-hanging fruit here, overall probably not worth the complexity – but maybe someone finds these numbers interesting.
Alex J. Best (Oct 06 2023 at 09:55):
https://leanprover.zulipchat.com/#narrow/stream/270676-lean4/topic/olean.20dump.20tool/near/376045522 is quite relevant prior analysis of this. I think some centralised support for producing these reduced oleans would be useful (even if experimental)
Mario Carneiro (Oct 06 2023 at 10:00):
I would be supportive of adding various kind of "unpack everything except theorem bodies" mechanisms to leantar. The tricky part is avoiding failures related to missing data
MohanadAhmed (Oct 06 2023 at 16:17):
Joachim Breitner said:
- Removing the
valueof all theorems it went down to 1.8GB (I was actually expecting a bigger portion of the olean to be the proof terms. Hmm.)
That is a significant reduction.
Can you share a code sample showing how to remove theorem bodies?
Joachim Breitner (Oct 06 2023 at 18:37):
Last updated: May 02 2025 at 03:31 UTC