Zulip Chat Archive
Stream: Equational
Topic: New data – new pictures?
Joachim Breitner (Sep 30 2024 at 12:30):
I’m excited to see a picture that includes all new implications and anti-implications. I lost track of who is generating pictures how – is there already a picture generation script that works with the output of extract_implications
, maybe even with support for the new facts
format for antiimplication?
Vlad Tsyrklevich (Sep 30 2024 at 19:43):
cc @Andrii Kurdiumov Just looked at the tool, doesn't look like it supports parsing the new form of data for anti-implications
Vlad Tsyrklevich (Sep 30 2024 at 19:45):
Speaking of which, I should submit the script to generate implication graphs
Terence Tao (Oct 01 2024 at 00:41):
Is there a plain text file listing all the implications and anti-implications obtained thus far? I am already recieving inquiries from new people wanting to create some visualizations and it would be nice to have a text file that one could simply give them, even if it is not actively updated as new submissions come in.
Daniel Weber (Oct 01 2024 at 01:29):
direct.tar.xz these are the direct implications/nonimplications
closure.tar.xz and this is the transitive closure
Terence Tao (Oct 01 2024 at 04:32):
Are there some statistics on how many direct and transitive relations have been obtained so far, and how many remain? I am thinking of posting these stats together with the data sets on the README.md as part of the ongoing progress.
Andrii Kurdiumov (Oct 01 2024 at 04:33):
It was approximately 4.5m direct relations. Out of ~24M or so.
Andrii Kurdiumov (Oct 01 2024 at 04:34):
I have to run with latest files which sent here to give you explicit number.
Terence Tao (Oct 01 2024 at 04:35):
Eventually it will be nice to have an automated scheduled update of these stats eg on a daily basis to form a time series
Andrii Kurdiumov (Oct 01 2024 at 04:35):
That’s doable.
Andrii Kurdiumov (Oct 01 2024 at 04:37):
When it should be displayed? On the GitHub only, or in blueprint too?
Daniel Weber (Oct 01 2024 at 04:43):
In terms of equational#29 the current counts are:
implicit_proof_false: 12764328
implicit_proof_true: 4494090
unknown: 4014155
explicit_proof_false: 739207
explicit_proof_true: 21791
implicit_conjecture_true: 64
explicit_conjecture_true: 1
Amir Livne Bar-on (Oct 01 2024 at 04:44):
(deleted)
Amir Livne Bar-on (Oct 01 2024 at 04:45):
Terence Tao said:
Is there a plain text file listing all the implications and anti-implications obtained thus far? I am already recieving inquiries from new people wanting to create some visualizations and it would be nice to have a text file that one could simply give them, even if it is not actively updated as new submissions come in.
We can add this to CI: the extract_implications Lean program generates a simple compact format, we can host the results on the web site
Terence Tao (Oct 01 2024 at 04:45):
How does this square with the claim of 4.5 direct relations?
Andrii Kurdiumov (Oct 01 2024 at 04:46):
Oops. I may have to distinguish direct relationship and these which are transitive closure of these direct relationships.
Terence Tao (Oct 01 2024 at 04:47):
OK then the numbers roughly make sense
Andrii Kurdiumov (Oct 01 2024 at 04:48):
4.5m was from latter. And that from directly Lean files. Not a new format which I have to add.
Terence Tao (Oct 01 2024 at 04:51):
Anyway the upshot is that we have resolved about 80% of the implications so far. Of course that was the lowest hanging fruit, I expect the going to be slower now. (Though I have just received word that there are going to be some sizeable AI-assisted contributions coming down the pipeline shortly...)
Daniel Weber (Oct 01 2024 at 04:53):
There also hasn't been any large execution of SMT solvers yet, right?
Andrii Kurdiumov (Oct 01 2024 at 04:53):
About pictures. Does script which generates SVG is in the repo?
Terence Tao (Oct 01 2024 at 04:54):
Andrii Kurdiumov said:
When it should be displayed? On the GitHub only, or in blueprint too?
I think Github is fine for now. I'll open a task for this. (I assume that equational#126 and hence equational#29 is close to completed for now.)
Terence Tao (Oct 01 2024 at 04:56):
Daniel Weber said:
There also hasn't been any large execution of SMT solvers yet, right?
There is this blog post at https://www.philipzucker.com/ruler/ but I can't determine exactly whether there was an actual large scale data run here or just a proof of concept
Daniel Weber (Oct 01 2024 at 04:57):
I think it's ready to be merged now, yes. I now added code to compute these counts, which can be run using lake exe extract_implications outcomes --hist <files>
Amir Livne Bar-on (Oct 01 2024 at 04:58):
Another count that might be interesting is unknown, modulo equivalences. That is, if we know A=>B and B=>A, then count X=>A and X=>B as one unknown (and also in the opposite direction). This comes out to 3182453
Terence Tao (Oct 01 2024 at 05:03):
Nice, so now only 15% or so of the original total. Is it easy currently to create a text file of these 3182453 equivalences (e.g., replacing each equation with the lowest numbered equation known to be equivalent)? One could imagine it could help a large SMT or AI run if they could just feed in such a file instead of the original 22m.
Nicholas Carlini (Oct 01 2024 at 06:01):
Seconded that I would find such a list very useful, especially if it could be kept (mostly) up to date automatically.
Vlad Tsyrklevich (Oct 01 2024 at 06:17):
Andrii Kurdiumov said:
About pictures. Does script which generates SVG is in the repo?
The implication graph, or something else?
Nicholas Carlini (Oct 01 2024 at 06:20):
It might also be fun to have a graph showing the number of remaining equations as a function over time, to see how things are progressing.
Andrii Kurdiumov (Oct 01 2024 at 06:57):
@Vlad Tsyrklevich yes, implictions.svg, I think they are interesting perspective, even if very large.
Vlad Tsyrklevich (Oct 01 2024 at 07:17):
See https://leanprover.zulipchat.com/#narrow/stream/458659-Equational/topic/Some.20more.20implication.20visualizations/near/473883222 I will submit the script today
Terence Tao (Oct 01 2024 at 14:52):
Created equational#171 and equational#173 to request these statistics on a continuing basis. Actually just having a one-shot generated file of reduced outstanding implications would be useful for people to run all sorts of other algorithms on it. For instance, each of the metatheorems in https://teorth.github.io/equational_theories/blueprint/sect0002.html could be run against that list.
EDIT: I see that Amir has this separately at https://github.com/amirlb/equational_theories/tree/unknown-implications . I've modified equational#173 to request integrating this into the main repo.
Last updated: May 02 2025 at 03:31 UTC