Zulip Chat Archive
Stream: Equational
Topic: 854
Terence Tao (Oct 23 2024 at 15:41):
This is a new thread to discuss the law 854 x = x ◇ ((y ◇ z) ◇ (x ◇ z))
, which was previously discussed here. It has five open implications Graphiti, to 413 x = x ◇ (x ◇ (x ◇ (y ◇ x)))
, 1045 x = x ◇ ((y ◇ (y ◇ x)) ◇ x)
, 1055 x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
, 3316 x ◇ y = x ◇ (y ◇ (x ◇ y))
, and 3925 x ◇ y = (x ◇ (y ◇ x)) ◇ y
. So understanding this equation would resolve 10 outstanding implications. Remark: 1055 implies 1045 as a special case, but otherwise the open consequents are unrelated.
It may be worth introducing the set E
of pairs (x,y)
with x ◇ y = x
. The 854 law then asserts that
(x, (y ◇ z) ◇ (x ◇ z))
lies in E
for all x
, y
, z
. So one expects E
to be somewhat "large". Note that if (y,z)
, (x,z)
, (y,x)
all lie in E
, then (x,y)
must also lie in E
. That is to say, E
is symmetric when restricted to any column {x: (x,z) ∈ E}
of E
.
Immunities:
- (L) Implication is true for linear models (even noncommutative ones)
- Implication is true assuming the reflexive axiom
x ◇ x = x
, even permitting three exceptions.
Examples:
- Left-absorptive magmas
x ◇ y = x
(i.e., every pair lies in E). - A sporadic 12-element magma
Equation 10 x = x ◇ (y ◇ x)
may play a role. It is not implied by 854, nor does it imply 854; but it does imply all of the open consequents of 854. The sporadic order 12 magma obeys 10 with the single exception 0 ◇ (2 ◇ 0) ≠ 0
.
The sporadic magma was used to resolve three other implications from 854, so this is evidence that the finite magma approach may continue to be fruitful here.
Daniel Weber (Oct 23 2024 at 16:06):
Some rules for 854 from the partial greedy ruleset:
- If then
- If then
- If then
Terence Tao (Oct 23 2024 at 17:01):
One thought: observe that in the multiplication table for the sporadic order 12 magma, 0 only appears in the first row. What this means is that this order 12 magma is actually an order 11 magma (deleting 0) that also obeys 854, with an additional element 0 adjoined.
This suggests a possible route to constructing more finite magmas that could potentially serve as counterexamples. For instance, one could try to adjoin an additional element 13 to the order 12 magma and ask an ATP to try to fill out the new entries of the multiplication table to recover 854 but refute one of the desired claims. Or, more generally, one could search for counterexamples with one element 0 that is deletable (which means that x ◇ y ≠ 0
if x, y ≠ 0
or even two elements 0
, ∞
that are sequentially deletable (thus x ◇ y ≠ ∞
if x, y ≠ ∞
, and x ◇ y ≠ 0,∞
if x, y ≠ 0,∞
).
Terence Tao (Oct 23 2024 at 17:19):
Separate thought: as mentioned in the first post here, E
has a restricted symmetry property. Suppose we assume total symmetry: that is we impose the axiom that if x ◇ y = x
, then y ◇ x = y
. Does this imply any of the outstanding claims? If so, that is another immunity. Otherwise, it could be a useful axiom to add to find more potential counterexamples.
Michael Bucko (Oct 23 2024 at 18:24):
Do we have a tptp for this one?
Douglas McNeil (Oct 23 2024 at 23:34):
In the spirit of reporting our failures as well as our successes, here are things that didn't work:
(1) Starting with the known sporadic 12 and growing a size 13 or higher example, with or without deletability constraints. I mean, it ran, but provided no new counterexamples.
(2) Trying to randomly evolve a counterexample-providing neighbour to the one we know about by emptying cells and using randomized backtracking to fill them in. It's easy to make minor changes, but there are definite "pools" you need to jump out of: you have to raise the percentage of change per step and/or run it for a long time to even get things which start to _look_ different. But it feels more like I'm just random-walking my way around the same fundamental solution than going anywhere.
Eventually the patterns of shifting numbers become kind of hypnotic and you can almost convince yourself you can grok the inner meaning of 854, if only you could put it into words.. anyway, everything which satisfied 854 satisfied everything else . :man_shrugging:
Terence Tao (Oct 23 2024 at 23:43):
Interesting that you observe that the counterexample is not rigid, that it admits small local perturbations. This is very different from how, say, a group behaves - these are highly symmetric objects, and cannot be locally perturbed. This opens up a whole new type of way to generate constructions, by doing a probabilistic construction that verifies the 854 axiom most of the time, and then some secondary cleanup operation to fix the violations.
On the other hand, your other negative result seems to show that not every 854 magma can be extended by one element. So something special must be true about the order 11 magma that the sporadic order 12 magma is an extension of. Here it is in finite magma explorer. It obeys a ton of laws (Graphiti), including 10 x = x ◇ (y ◇ x)
, which as noted above implies all the other open laws. In fact Graphiti shows that it obeys the even stronger law 433 x = x ◇ (y ◇ (x ◇ (z ◇ y)))
, which implies both 10 and 854. Perhaps one could first randomly generate a finite magma obeying 433, then try to extend that by one element in a way that keeps 854 but breaks 10, and hopefully also the other open claims?
Douglas McNeil (Oct 23 2024 at 23:52):
To be clear, I can extend the 12 to 13 and higher -- I think I got bored around 22 or so -- it's just that they're not interesting, and not merely in the sense they don't falsify anything. The majority of extensions, even when I'm deliberately weighting things to look less like their parent and randomizing fill order and everything possible to make it more exciting, tends to still produce extensions which look more like adding the more "typical" 854s to the edges. By that I mean the ones like this:
2 2 0 0 2 7 2 0
2 2 1 1 2 2 2 1
2 2 2 2 2 2 2 2
3 3 3 4 3 3 3 3
2 7 4 4 2 7 7 4
7 7 5 5 2 7 7 5
7 2 6 6 7 7 7 6
7 7 7 7 7 7 7 7
which at this point I think I could do freehand.
Douglas McNeil (Oct 23 2024 at 23:55):
Incidentally, if my TPTP coding can be trusted (ehh), assuming total symmetry implies all of 413, 1045, 1055, 3316, and 3925, which may mean 854 has even more defences IIUC.
Terence Tao (Oct 23 2024 at 23:57):
Oh. Then what I said before might not be the smartest thing to try (one could still try it). It is however very strange that you now suggest that there is a highly non-unique way to perform extensions in general, but somehow these extensions are moving the magma away from being counterexamples, rather than towards the counterexamples.
Are you trying to force counterexample behavior into the extension? One could simply try to extend the order 12 magma (say) with two new elements 12, 13 with the 413-contradicting axiom 12 ≠ 12 ◇ (12 ◇ (12 ◇ (13 ◇ 12)))
(say) together with 854 and see if it can actually build one. (It may possibly be easier to extend the order 11 magma, as it has more starting structure.)
Terence Tao (Oct 23 2024 at 23:58):
What do you mean by total symmetry? Symmetry of the magma operation (the commutative law), or symmetry of E?
Zoltan A. Kocsis (Z.A.K.) (Oct 23 2024 at 23:59):
Terence Tao said:
This is very different from how, say, a group behaves - these are highly symmetric objects, and cannot be locally perturbed. This opens up a whole new type of way to generate constructions, by doing a probabilistic construction that verifies the 854 axiom most of the time, and then some secondary cleanup operation to fix the violations.
You can also generate groups by using a probabilistic construction that verifies the group axioms most of the time, then clean up the local violations.
Douglas McNeil (Oct 24 2024 at 00:00):
Terence Tao said:
What do you mean by total symmetry? Symmetry of the magma operation (the commutative law), or symmetry of E?
I meant ((magma(X, Y) = X) => (magma(Y, X) = Y)))
Terence Tao (Oct 24 2024 at 00:02):
Ah, so symmetry of E. Okay, so that is indeed another "immunity" this problem has, hmm.
@Zoltan A. Kocsis (Z.A.K.) Heh, you're completely right - for instance I know of this nice work of Gowers and Long in this area. I guess the difference here is that the cleaned up object isn't unique. And yet none of the non-unique perturbations seem to be contradicting the open implications, which is very mysterious to me.
Zoltan A. Kocsis (Z.A.K.) (Oct 24 2024 at 00:06):
The only real issue is that you'll sometimes generate tables that verify all group axioms with extremely high probability, but happen not to be close to the multiplication table of any actual group. But the same thing is not ruled out by malleability observations concerning 854 either.
These "fake groups" happen to be exceedingly rare, but they do exist. Here's an example:
Start with ((Z/2Z)^k,+)
with your favorite k
(even k=0
, I did the construction this way because I was answering a different question at the time, and now don't have time to remove it). Let 0
denote the identity of this group. Define a new structure (H,*)
with underlying set H = (Z/2Z)^k U {1..n}
, and the operation *
defined by the following clauses:
x * y = x + y if x,y both in (Z/2Z)^k
x * y = y if x in (Z/2Z)^k and y in {1..n}
x * y = x if y in (Z/2Z)^k but x in {1..n}
x * x = 0 if x in {1..n}
x * y = min{x,y} if x,y both in {1..n} and x /= y
If you choose n large enough, then the axioms x*x=0
, x*(y*z)=(x*y)*z
, x*0=0*x = x
will all be satisfied with a
probability arbitrarily close to 1. But you need to edit (add/delete/remove) on the order of O(|H|^2)
entries to make the multiplication table of the operation *
coincide with that of a group.
Zoltan A. Kocsis (Z.A.K.) (Oct 24 2024 at 00:07):
Note that this does not contradict Gowers-Long, since they need to assume injectivity to make their result work.
Douglas McNeil (Oct 24 2024 at 00:10):
As for the extensions, what I mean concretely is that you get something like this unless you try very hard:
9 11 0 9 0 0 11 13 0 0 11 10 11 11 11
2 4 1 2 1 1 11 1 1 1 9 1 8 9 4
2 2 7 2 5 4 4 2 4 5 2 5 2 2 2
9 11 3 9 3 3 11 10 3 3 11 13 11 11 11
4 4 7 4 7 4 4 4 4 5 4 5 4 4 4
5 5 7 5 5 14 7 5 4 5 5 5 5 5 5
2 8 13 2 6 13 4 6 13 6 2 6 2 2 2
9 7 7 8 7 7 7 10 7 7 7 5 7 7 7
8 8 7 8 8 13 7 8 4 8 8 8 8 8 8
9 9 7 9 7 9 9 9 9 5 9 5 9 9 9
4 2 10 2 10 10 4 10 10 10 9 10 11 9 11
9 11 7 9 7 11 11 9 11 7 11 5 11 11 11
8 11 12 4 12 12 11 12 12 12 4 12 9 9 9
2 4 13 8 13 13 11 13 13 13 9 13 2 4 11
4 2 14 9 14 14 9 14 14 14 8 14 4 4 4
I tried building a few levels at once to give more space for differences to emerge, but it had no effect. I can try dropping to 11 and starting from there before throwing in the towel.
Terence Tao (Oct 24 2024 at 01:20):
You seem to have some intuition as to what 854 magmas "look boring" and which ones "look interesting" which I don't understand, but would love to learn more about. Do you mean that the new rows and columns added are nearly identical (except for incrementing 12 to 13, say)? Also, in your last example here, are oyu starting with an order 14 magma and adding one final row and column? I don't see a copy of the original 12-magma here so I'm not quite sure what you're doing here.
In any case, if you have some files of various magmas you've generated in this fashion, it could be useful to make them available, so that other collaborators could test various hypotheses and strategies on them.
Terence Tao (Oct 24 2024 at 03:35):
I think I'm beginning to understand what you mean by boring extensions. Consider the sporadic order 12 magma, which can be viewed as an order 11 magma with an additional element 0 adjoined, thus adding a row and column (and a new square: in the current numbering, 0 ◇ 0 = 5
). The row adjoined can contain some zeroes (0 ◇ x = 0
for several x), but the column does not (x ◇ 0 ≠ 0
for all x), which is consistent with the first rule identified by @Daniel Weber here.
One can cheaply extend this further to an order 13 magma by adding another element 0'
, thus adding another row and column, by basically duplicating the 0
row and column except for replacing every occurrence of 0
in the output with 0'
. Thus x ◇ 0' = x ◇ 0
, and 0' ◇ x
is 0 ◇ x
with any appearance of 0
replaced by 0'
. Finally, we can set 0' ◇ 0
, 0 ◇ 0'
, 0' ◇ 0'
all equal to 0 ◇ 0
. It is a routine matter to check that this new order 13 magma will obey 854 if the order 12 magma did. Unfortunately it also inherits all the other laws that we want to disprove.
There is a small amount of play in perturbing this table by adjusting some insignificant entries, but it does not seem to generate any interesting new failures of law 10, which is why it is also not disproving any of the laws we want to disprove, all of which are consequences of 10. I'll explain this more in the next comment.
Terence Tao (Oct 24 2024 at 03:56):
Here is a heuristic argument why the law 10 seems to hold almost universally on 854-magmas, which is a significant obstacle blocking us from disproving all the remaining implications. We have this one order 12 example with a single failure of law 10, and we can artificially inflate this failure a little bit by adding dummy rows and columns as above, but that isn't really helping the situation.
First, observe from this graphiti that 854 implies a ton of other laws, even disregarding the five blue conjectural ones. This has many consequences for the left-absorbent sets , namely that they have to be pretty big. For instance, we have
- (law 8).
- if and only if lies in the range of (law 378).
- For any , the elements all lie in (laws 101, 430, 823, 832, 843, 123, 1035 respectively). The open implication 412 also asserts that lies in .
- The open implication 3316 implies that if , then is also in .
- Law 10 (which is not a consequence of 854, but is "almost" true as discussed below) asserts that for all .
From 854, we see that Law 10 holds whenever , that is to say for some with . Since was already pretty big, is massive. It then becomes very hard to avoid falling into this set, and so there are very few counterexamples to Law 10. (As one subcase: Law 10 holds whenever .)
Probably the easiest target to refute here is law 1055, which is the only three-variable law to refute. In the language of law 10, the task is to find a pair violating law 10, such that . One would think that is also a very large set, and it should be rare for to fall outside this set, but unfortunately for the one order 12 magma we have (and the various extensions of it we can build), the small number of exceptions to 10 fail to obey this condition. It could just be that we were unlucky with the initial "seed" magma and that some other finite magma may solve the problem here.
Daniel Weber (Oct 24 2024 at 04:18):
A few finite magmas ruling out some rules from the partial rule set
Terence Tao (Oct 24 2024 at 04:22):
I'm a little confused, I thought all the rules in the partial ruleset had to be consequences of 854, due to the way the greedy algorithm worked?
Daniel Weber (Oct 24 2024 at 04:23):
Yes, but when the rules are minimized I attempt to relax the assumptions while still being a consequence, and if it fails to be a consequence I search for a counterexample to more quickly rule out other relaxations
Daniel Weber (Oct 24 2024 at 04:31):
It might be useful to assume something which isn't always a consequence of 854, but still doesn't immediately imply some target
Douglas McNeil (Oct 24 2024 at 04:35):
I wasn't thinking nearly at that level of detail, to put it mildly, but it squares up: I was mostly going from the regularity and how straightforward it was to extend to something which was pretty obviously itself extensible. I'd expected to have to work much harder on the backtracking, hence my desperate attempts to introduce more entropy to find something qualitatively different.
In the last example I guess the reference order 12 magma seems pretty hidden! :-) That one started as s12 before it mutated and grew. I can't remember the precise numbers offhand, but if you just randomly change two cells of an order 12 854 magma to other elements you'll still satisfy 854 over 1% of the time. That's what it was for the sporadic, anyway.
Terence Tao (Oct 24 2024 at 06:30):
Each 854 magma G
is associated with a "Cayley graph" that seems to control a lot of the algebra. Namely make a directed edge x → y
if one can write y = z ◇ x
for some z
. The first observation is that we have an equivalent characterization: x → y
if and only if y ◇ x = y
. This is thanks to law 378.
The next observation is that law 10 is equivalent to this graph being undirected: x → y
if and only if y → x
. This is an easy consequence of the previous equivalence. In practice it seems that this graph is nearly undirected, with most edges being symmetric x ↔ y
and only a few exceptions.
We have lots and lots of symmetric edges, due to all the various consequences of 854. Examples: , , , , . If , then we have and . If and , then .
The open law 1055 is the assertion that if then .
It's an amazing amount of structure for such a "non-rigid" object!
Daniel Weber (Oct 24 2024 at 06:39):
This is the graph for the 12 element magma
Figure_1.png
Daniel Weber (Oct 24 2024 at 06:58):
Can one recover the magma from this graph?
Terence Tao (Oct 24 2024 at 07:04):
That's a good question. Certainly the relation that x → y
if and only if y ◇ x = y
already fills out a big chunk of the multiplication table from the graph. At that point there is this Sudoku game one has to play. I guess one could for instance ask an ATP to reconstruct the 12 element magma from that graph, i.e., from the 854 rule plus the additional rule that x → y
if and only if y ◇ x = y
.
Terence Tao (Oct 24 2024 at 07:07):
In the example above, the edge 0 → 2
is the only directed edge; not coincidentally, 0
and 2
are the only adjacent elements that have no mutual friends, i.e., no z
such that 0, 2 → z
. This is related to the law (an easy consequence of 854) that if z → x, y
and x → y
then y → x
.
Ibrahim Tencer (Oct 24 2024 at 07:29):
I've found some models of 854. We take some function f and define x * y = f(x) if x = y, otherwise x * y = x. We also assume that x ≠ f(x) and x ≠ f(f(x)). Then a bit of case checking gives that this will satisfy 854. But this model will also satisfy 10.
I also tried modifying it: you can take some q, s, t that are all distinct (provided they aren't linked by f in some way) and redefine qs = sq = t. But if f(q) = s then 10 fails via q(qq) = qs = t, but then for any u that isn't q or s, q((qu)(qu)) = q(qq) = qs = t, so 854 fails too. I haven't checked exactly what conditions you need on q, s, and t for this to work, it requires a lot of tedious case checking, but there's an example below. Maybe there is some other way of modifying these models to get a counterexample.
Here is the unmodified model on Z/7 with f(x) = x + 1:
And the same magma but with 0 * 3 = 3 * 0 = 5:
Daniel Weber (Oct 24 2024 at 07:29):
You can use *
to write * without it turning to italics
Ibrahim Tencer (Oct 24 2024 at 07:32):
Daniel Weber said:
You can use
*
to write * without it turning to italics
Thanks
Ibrahim Tencer (Oct 24 2024 at 08:16):
Is it known if there are any other counterexamples to the implications that are of order less than 12?
Terence Tao (Oct 24 2024 at 14:55):
@Ibrahim Tencer Thanks for this construction. This also answers @Daniel Weber 's question of whether the graph determines the magma; it does not, as for all of your examples the directed graph is the complete directed graph without loops, and so the directed graph does not "see" the squaring function f(x)
. One could take this to mean that the graph is trying to tell us that the values of this function are not extremely relevant.
If you want to create a failure in 10 (which is a necessary precondition for contradicting the laws we actually want), so that x → y
but not y → x
, then one thing that must happen is that x
and y
must have disjoint incoming neighborhoods; the set of z
for which z → x
must be disjoint from that with z → y
. So quite a few edges have to be deleted from the complete directed graph before we even have a chance of contradicting 10.
Terence Tao (Oct 24 2024 at 15:09):
One small observation. Here is a reordering of the order 12 example (Finite Magma Explorer):
4 0 5 0 0 2 2 5 5 0 2 0
1 0 1 2 1 1 1 1 1 2 2 2
4 2 4 2 2 2 2 5 5 2 2 2
3 2 3 2 3 3 3 3 3 2 7 7
4 4 4 4 1 4 4 4 5 7 4 7
4 5 5 5 5 3 2 5 5 5 2 5
4 6 6 6 6 3 2 6 6 6 2 6
4 7 4 7 7 7 7 5 5 7 7 7
4 8 4 8 7 8 8 4 5 7 8 7
9 8 9 8 1 9 9 9 1 6 8 6
3 6 10 6 10 3 3 10 10 6 2 6
9 8 11 8 10 11 11 11 10 6 8 6
With this ordering, observe that the top left n x n minor only contains elements less than n for all . What this means is that this example can actually be obtained from an order 6 example (the top left 6 x 6 minor) by adjoining one more element six more times. The lone counterexample to 10 in this case is coming from but not : . So starting with the order 6 example, one has to perform six more extensions before one violates the equation 10! On the one hand, this does show that repeated extension can eventually produce results even if initially the extensions look "boring". On the other hand, the counterexample only barely involves the initial values 0,1,2,3,4,5 of the seed magma; somehow the magma has to grow enough so that the forward image of 0 (i.e., the 0 column entries 1,3,4,9) can be made distinct from the forward image of 11 (i.e., the 11 column entries 0,2,5,6,7) which is basically a precondition for a 10 violation to take place.
A tentative conclusion is then that maybe the iterated extension idea shouldn't be abandoned just yet - it may just take more time to work than previously thought, and some intelligent way to guide the extension may be needed. (One wild idea: try reinforcement learning, with a loss function that rewards approximate compliance with 854 while forcing a specific counterexample configuration to exist?)
Daniel Weber (Oct 24 2024 at 15:20):
Perhaps just telling the ATP to look for extensions might let it be more efficient
Terence Tao (Oct 24 2024 at 15:28):
One promising target might be to disprove 1055 x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
as follows. With the above renumbering we already have a violation of 10: 11 ≠ 11 ◇ (0 ◇ 11)
(of course we had this as well in the original model, with different renumbering). To disprove 1055, it suffices to find an extension of this model that contains two new elements y
, z
such that 0 = y ◇ (z ◇ 11)
. One would think that this is an easy task (one equation, two unknowns: looks underdetermined!) but it seems that doing the naive thing of just adding two more elements 12, 13 and trying to force 0 = 12 ◇ (13 ◇ 11)
right away doesn't seem to work. One approach might be to select a good candidate element of the original magma, say 7 for sake of argument, then find an extension in which 0 can eventually appear on the 7th column, e.g. one may reach this with the element 15 with 0 = 15 ◇ 7
. Then all one has to do is keep extending until 7 can appear in the 11 column, e.g. if one can extend until one has 7 = 23 ◇ 11
then one can declare victory with an order 24 magma: 11 ≠ 11 ◇ ((15 ◇ (23 ◇ 11)) ◇ 11)
.
Matthew Bolan (Oct 24 2024 at 15:30):
There is a 5 in column 3 of the first row, is this the correct reordering? The needed property still seems to be satisfied for
Terence Tao (Oct 24 2024 at 15:32):
Gah, I made an off-by-one error. You're right. The seed magma has order 6 (and I don't think it is an extension of an order 5 magma). I've adjusted the preceding comment accordingly.
Douglas McNeil (Oct 24 2024 at 15:48):
I need to go touch grass, but I think I found an order 13 case with another 10 violation.
I think the new violation is at (0, 12).
Terence Tao (Oct 24 2024 at 16:02):
My previous proposal needs revision. law 378 x ◇ y = (x ◇ y) ◇ y
does present one obstruction to the above approach: if for instance one wants to write 0 = x ◇ 7
for some x
, it is necessary that 0 = 0 ◇ 7
, which is not actually the case in this particular construction. So if we had 0 = 15 ◇ 7
and 7 = 23 ◇ 11
then we would also have 0 = 0 ◇ 7
and 7 = 7 ◇ 11
and the 1055 counterexample would already have been seen at the seed magma. So this approach doesn't work as stated.
But a revision might work. One starts with the same seed magma as before with the same 10 violation 11 ≠ 11 ◇ (0 ◇ 11)
but now extends until one gets a new element, say 15, obeying both 0 = 0 ◇ 15
and 15 = 15 ◇ 11
. That would then give a 1055 violation 11 ≠ 11 ◇ ((0 ◇ (15 ◇ 11)) ◇ 11)
. This no longer sets off the 378 objection, but unfortunately 854 also implies a ton of other laws, and I am not sure yet whether these two requirements 0 = 0 ◇ 15
and 15 = 15 ◇ 11
force some other constraint that would have been detectable in the original magma.
Douglas McNeil (Oct 24 2024 at 16:12):
4 now? When I get back I'll have to figure out which change I tried had such an effect.
And these locations seem suggestive:
[(0, 2, 0, 3), (0, 12, 0, 3), (15, 2, 15, 3), (15, 12, 15, 3)]
Terence Tao (Oct 24 2024 at 16:21):
OK, here is an interesting phenomenon: in order to contradict 1055, it is not enough to have a single violation of 10; one must have at least two.
Suppose we want to contradict 1055 x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
using a 10 counterexample such as 11 ≠ 11 ◇ (0 ◇ 11)
, so we want x = 11
and y ◇ (z ◇ x) = 0
. If we call z ◇ x
15 for sake of argument, then y ◇ 15 = 0
and z ◇ 11 = 15
, hence by law 384 we have 0 ◇ 15 = 0
and 15 ◇ 11 = 15
. But if (11,15) does not violate 10, this also forces 11 ◇ 15 = 11
, and now from the original law 854 we must have 11 = 11 ◇ ((0 ◇ 15) ◇ (11 ◇ 15)) = 11 ◇ (0 ◇ 11)
, contradicting our original hypothesis. So 11,15
must also be a violation to 10: 11 ≠ 11 ◇ (15 ◇ 11)
.
So the extensions need to generate new 10 violations. I guess this dovetails very nicely with what @Douglas McNeil is trying right now. By the way, what do the quadruples such as (0,2,0,3) mean in your notation?
Daniel Weber (Oct 24 2024 at 16:58):
Could something like a greedy approach work, but instead of setting for a fresh you set ?
Terence Tao (Oct 24 2024 at 17:00):
This sounds promising. More precisely, in situations where one has to decide what to do for , one has two options: create something novel, or set it equal to , and perhaps one can try an iteration that only generates additional rules if both of these options are blocked.
Douglas McNeil (Oct 25 2024 at 15:28):
Okay, new record of 6 violations of 10 in an 854 magma:
I may have found the trick. :fingers_crossed:
Douglas McNeil (Oct 25 2024 at 15:44):
And now 12.
I expect I could keep going for a bit, the exceptions have a pretty recognizable pattern (the syntax from before, Terence, was just (x,y,lhs,rhs)). This is probably cleaner:
Verifying equations:
✓ Equation 854 satisfied
✓ Found 12 violations of equation 10:
At (x,y)=(0,2): 0 ≠ 0 ◇ (2 ◇ 0) = 0 ◇ 2 = 3
At (x,y)=(0,12): 0 ≠ 0 ◇ (12 ◇ 0) = 0 ◇ 12 = 3
At (x,y)=(0,18): 0 ≠ 0 ◇ (18 ◇ 0) = 0 ◇ 18 = 3
At (x,y)=(15,2): 15 ≠ 15 ◇ (2 ◇ 15) = 15 ◇ 2 = 3
At (x,y)=(15,12): 15 ≠ 15 ◇ (12 ◇ 15) = 15 ◇ 12 = 3
At (x,y)=(15,18): 15 ≠ 15 ◇ (18 ◇ 15) = 15 ◇ 18 = 3
At (x,y)=(16,2): 16 ≠ 16 ◇ (2 ◇ 16) = 16 ◇ 2 = 3
At (x,y)=(16,12): 16 ≠ 16 ◇ (12 ◇ 16) = 16 ◇ 12 = 3
At (x,y)=(16,18): 16 ≠ 16 ◇ (18 ◇ 16) = 16 ◇ 18 = 3
At (x,y)=(17,2): 17 ≠ 17 ◇ (2 ◇ 17) = 17 ◇ 2 = 3
At (x,y)=(17,12): 17 ≠ 17 ◇ (12 ◇ 17) = 17 ◇ 12 = 3
At (x,y)=(17,18): 17 ≠ 17 ◇ (18 ◇ 17) = 17 ◇ 18 = 3
You can see what turns out to be happening. Originally we only had the (0,2), then I found (0,12) and then we added (15,12), and (15,2), and now the game is on.
Douglas McNeil (Oct 25 2024 at 16:03):
Yeah, it looks like in this way I can carry on (I just hit 28), because of the very dull extending pattern I've been complaining about. :-) Once we got to an "outer border" which is self-sustaining, we can carry on indefinitely. I mean, look at this thing:
5 11 3 5 0 0 11 6 0 0 11 10 3 11 6 5 9 5 3 3 3 3 3
4 2 1 4 1 1 4 1 1 1 4 1 1 4 1 4 4 4 1 1 1 1 1
2 2 7 2 8 4 4 2 4 8 2 8 7 2 2 2 2 2 7 7 7 7 7
5 11 3 5 3 3 11 1 3 3 11 1 3 11 10 9 5 5 3 3 3 3 3
4 4 7 4 7 4 4 4 4 8 4 8 7 4 4 4 4 4 7 7 7 7 7
5 5 7 5 5 4 4 5 10 5 5 5 7 5 5 5 5 5 7 7 7 7 7
5 5 10 2 6 10 4 6 10 6 5 6 10 2 6 5 5 5 10 10 10 10 10
8 7 7 8 7 7 7 1 7 7 7 8 7 7 10 5 5 5 7 7 7 7 7
8 8 7 8 8 4 4 8 10 8 8 8 10 8 8 8 8 8 7 7 7 7 7
9 9 7 9 7 9 9 9 9 8 9 8 7 9 9 9 9 9 7 7 7 7 7
9 4 10 4 10 10 9 10 10 10 4 10 10 11 10 9 4 4 10 10 10 10 10
9 11 7 9 7 11 11 9 11 7 11 8 7 11 9 9 9 9 7 7 7 7 7
12 12 7 12 8 4 4 12 4 8 12 8 7 12 12 12 12 12 7 7 7 7 7
4 9 13 5 13 13 4 13 13 13 4 13 13 5 13 5 4 5 13 13 13 13 13
9 14 14 8 14 14 14 1 14 14 14 8 14 14 8 9 5 5 14 14 14 14 14
9 11 3 9 15 15 11 6 15 15 11 10 3 11 6 9 5 9 3 3 3 3 3
9 11 3 5 16 16 11 6 16 16 11 6 3 11 6 5 5 5 3 3 3 3 3
9 11 3 5 17 17 11 6 17 17 11 6 3 11 6 5 5 5 3 3 3 3 3
18 18 7 18 8 4 4 18 4 8 18 8 7 18 18 18 18 18 7 7 7 7 7
19 19 7 19 8 4 4 19 4 8 19 8 7 19 19 19 19 19 7 7 7 7 7
20 20 7 20 8 4 4 20 4 8 20 8 7 20 20 20 20 20 7 7 7 7 7
21 21 7 21 8 4 4 21 4 8 21 8 7 21 21 21 21 21 7 7 7 7 7
22 22 7 22 8 4 4 22 4 8 22 8 7 22 22 22 22 22 7 7 7 7 7
What we haven't seen yet is any violations of 10 which aren't of the form x * (y * x) = 3. Whether this a real thing or just an artifact of our search isn't clear, my explorations have definitely been biased towards descendants of the original. Be kind of interesting if it turned out that there can only be one "3", i.e. one violating rhs x(yx) value, in an 854 magma.
Michael Bucko (Oct 25 2024 at 16:14):
Douglas McNeil schrieb:
5 11 3 5 0 0 11 6 0 0 11 10 3 11 6 5 9 5 3 3 3 3 3
4 2 1 4 1 1 4 1 1 1 4 1 1 4 1 4 4 4 1 1 1 1 1
2 2 7 2 8 4 4 2 4 8 2 8 7 2 2 2 2 2 7 7 7 7 7
5 11 3 5 3 3 11 1 3 3 11 1 3 11 10 9 5 5 3 3 3 3 3
4 4 7 4 7 4 4 4 4 8 4 8 7 4 4 4 4 4 7 7 7 7 7
5 5 7 5 5 4 4 5 10 5 5 5 7 5 5 5 5 5 7 7 7 7 7
5 5 10 2 6 10 4 6 10 6 5 6 10 2 6 5 5 5 10 10 10 10 10
8 7 7 8 7 7 7 1 7 7 7 8 7 7 10 5 5 5 7 7 7 7 7
8 8 7 8 8 4 4 8 10 8 8 8 10 8 8 8 8 8 7 7 7 7 7
9 9 7 9 7 9 9 9 9 8 9 8 7 9 9 9 9 9 7 7 7 7 7
9 4 10 4 10 10 9 10 10 10 4 10 10 11 10 9 4 4 10 10 10 10 10
9 11 7 9 7 11 11 9 11 7 11 8 7 11 9 9 9 9 7 7 7 7 7
12 12 7 12 8 4 4 12 4 8 12 8 7 12 12 12 12 12 7 7 7 7 7
4 9 13 5 13 13 4 13 13 13 4 13 13 5 13 5 4 5 13 13 13 13 13
9 14 14 8 14 14 14 1 14 14 14 8 14 14 8 9 5 5 14 14 14 14 14
9 11 3 9 15 15 11 6 15 15 11 10 3 11 6 9 5 9 3 3 3 3 3
9 11 3 5 16 16 11 6 16 16 11 6 3 11 6 5 5 5 3 3 3 3 3
9 11 3 5 17 17 11 6 17 17 11 6 3 11 6 5 5 5 3 3 3 3 3
18 18 7 18 8 4 4 18 4 8 18 8 7 18 18 18 18 18 7 7 7 7 7
19 19 7 19 8 4 4 19 4 8 19 8 7 19 19 19 19 19 7 7 7 7 7
20 20 7 20 8 4 4 20 4 8 20 8 7 20 20 20 20 20 7 7 7 7 7
21 21 7 21 8 4 4 21 4 8 21 8 7 21 21 21 21 21 7 7 7 7 7
22 22 7 22 8 4 4 22 4 8 22 8 7 22 22 22 22 22 7 7 7 7 7
So beautiful!
Douglas McNeil (Oct 25 2024 at 16:29):
^ for us visual-minded types
Terence Tao (Oct 25 2024 at 16:41):
I'm pretty sure by taking the direct product of an 854 magma with a 10 violation, and some other 854 magma (which doesn't need to have 10 violations), one can create a magma with 10 violations with many different values, although this is again a "boring" way to create more 10 violations. Still, this is encouraging that there is enough "malleability" in 854 to create a few more 10 violations, as these are essential precursors to refuting the outstanding implications. More precisely, any 10 violation can be "promoted" to a new anti-implication if one of the following happens:
- Promotion to 413 counterexample occurs if .
- Promotion to 1045 counterexample occurs if for some .
- Promotion to 1055 counterexample occurs if for some , or equivalently if , for some .
- Promotion to 3316 counterexample occurs if .
- Promotion to 3925 counterexample occurs if .
I don't have a clear idea how to force any of these to occur on top of an existing 10 violation, although 1045 or 1055 seem the most promising given that they have an existential quantifier.
Douglas McNeil (Oct 25 2024 at 17:02):
I've got alerts in to warn me if I hit anything which refuted any of the outstanding implications and didn't hit any of them, including an overnight perturbation run w/~1M magma, unfortunately.
Running out of ideas, although before I found more violations than I knew what to do with, I was considering starting with seeds with lots of 10 violations but worse 854 behaviour and trying to pull them towards 854, which is basically the opposite of what I was trying before with 854 and 413.
Could still be a snipe hunt, more's the pity.
Terence Tao (Oct 26 2024 at 17:29):
Given all the recent progress on 1485, I now propose an analogous approach to attack 854. The main idea is to introduce the notion of a "relaxed 854-magma" which is hopefully easier to construct and could (again hopefully) be used as a guide to build an actual 854-magma by a greedy construction.
A relaxed magma is a magma with multiplicity allowed in the multiplication table: a given product x ◇ y
could have multiple possible values, though there must always be at least one. See here for an example, adapted for 1485 analysis. For such a relaxed magma to be consistent with 854 x = x ◇ ((y ◇ z) ◇ (x ◇ z))
, we need the following rule:
- If one has already chosen values for
y ◇ z
,x ◇ z
and(y ◇ z) ◇ (x ◇ z)
consistent with the relaxed multiplication table, thenx
is a permissible choice ofx ◇ ((y ◇ z) ◇ (x ◇ z))
in that table.
Or to put it another way, if R(x,y,z)
denotes the relation that z
appears as one of the options for x ◇ y
in the table, we need the following axioms:
- Claim 1'. For each
x, y
, there is at least onez
for whichR(x,y,z)
holds. - Claim 854. If
R(y,z,w)
,R(x,z,u)
, andR(w,u,v)
, thenR(x,v,x)
.
On the other hand, if we would like to refute a law, say 413 x = x ◇ (x ◇ (x ◇ (y ◇ x)))
for sake of argument, we would like to have the following:
- Claim anti-413. There exists
x,y,z,w,u
suchR(y,x,z)
,R(x,z,w)
,R(x,w,u)
are true, butR(x,u,x)
is false.
Note that if we could upgrade Claim 1' to
- Claim 1. For each
x, y
, there is at exactly onez
for whichR(x,y,z)
holds.
while also obeying Claims 854 and anti-413, we would get the anti-implication 854 !-> 413. So I propose looking for finite relaxed magmas that just obey Claims 1', 854, anti-413, and similarly for the other four outstanding implications. We then have to figure out how to greedily create an actual 854 magma that follows the relaxed 854 magma as a model, but perhaps our previous experience in greedy constructions will help here. (Given the huge number of laws that 854 implies, I suspect this will need to be an automated task, similar to what @Daniel Weber did with the pure greedy approach.)
Note: the definition of a "relaxed 854 magma" may be subject to revision. It may be that one has to take every single known consequence of 854 and also place relaxed version of those laws as additional axioms, though many may be redundant. Similarly for the laws identified by Daniel Weber several days ago. Maybe this can be done by an iterative process: first generate a relaxed 854 counterexample magma with the plain 854 rule, test if any of the other laws are not obeyed, and if not add one of them as an axiom and search again.
Another note: I have the vague feeling that the closer that the relaxed magma is to an actual magma, in that there are very few entries that are ambiguous, the better a guide it will be, as there will be fewer cases to check in the greedy algorithm analysis. I also have the vague sense that it would be good to avoid idempotents R(x,x,x)
, as this may cause particular problems in the greedy algorithm.
Michael Bucko (Oct 26 2024 at 19:24):
I was trying to take the relaxed magma approach and address 854-413. I guess those kinds of operation tables aren't admitted by the FMT, but perhaps I can use this kind of relaxed format in one of the solvers?
[
[[0,1], [0], [1,2], [0], [4], [5]],
[[1], [1,2], [2], [1,4], [0], [5]],
[[2], [2], [2,3], [3], [4], [1]],
[[0], [3], [3], [3,4], [0,5], [5]],
[[4], [4], [4], [5], [4], [5]],
[[5], [5], [5], [5], [5], [5]]
]
Terence Tao (Oct 27 2024 at 02:39):
Unfortunately it doesn't seem like this table obeys 854. Take x=y=z=0
. Then one possible value w
of y ◇ z
is 1
, one possible value u
of x ◇ z
is 1
, and one possible value v
of w ◇ u = (y ◇ z) ◇ (x ◇ z)
is 2
; but then x ◇ ((y ◇ z) ◇ (x ◇ z)) = 0 ◇ 2
cannot equal x = 0
(it can only equal 1
or 2
).
But this is the sort of solution - with very few multiple values - I would like to see. If hypothetically the table did obey 854, one could then guide a search for finite 854 magmas refuting 413 by having vertices of six Classes 0-5, being multiplied according to the above table, for instance the product of a Class 0 object and a Class 0 object should be Class 0 or 1, the product of a Class 0 object and a Class 1 object should be Class 0, and so forth. If the underlying relaxed magma obeyed 854 and refuted 413, this would greatly increase the chance that any such extended magma would also obey 854 and refuted 413, though it is not guaranteed. (After a lengthy discussion over at the 1485 thread I see now that the greedy algorithm approach is likely to be significantly more complicated for 854 than it will be for 1485, so I am hoping that a heuristic approach for 854 may be sufficient to locate a concrete finite counterexample.)
Jose Brox (Oct 27 2024 at 09:45):
I've been working in another approach that seemed to be promising for 854, I will see if I can finish it soon (or at least give you the details if it gets too complicated).
Michael Bucko (Oct 27 2024 at 09:47):
Terence Tao schrieb:
Unfortunately it doesn't seem like this table obeys 854. Take
x=y=z=0
. Then one possible valuew
ofy ◇ z
is1
, one possible valueu
ofx ◇ z
is1
, and one possible valuev
ofw ◇ u = (y ◇ z) ◇ (x ◇ z)
is2
; but thenx ◇ ((y ◇ z) ◇ (x ◇ z)) = 0 ◇ 2
cannot equalx = 0
(it can only equal1
or2
).
Prof. Tao, you're right.
Michael Bucko (Oct 27 2024 at 09:49):
Wrote a simple Sage script -- trying to locate a concrete finite counterexample. It's been running for 30 minutes. I'll let it run some more, and then try to add some heuristics.
(I should probably just stick to z3 and Lean, but I wanted to try that)
854.sage.pas
UPDATE: the Sage script run for a couple of hours, and didn't find anything. I'll most likely focus on Lean+z3 and experiment with heuristics.
Terence Tao (Oct 27 2024 at 14:56):
Perhaps 413 was too hard of an initial target to refute. The easiest-looking target is 1055 x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
. Writing w = z ◇ x
, u = y ◇ w
, and v = u ◇ x
, we need to find an example with x ≠ x ◇ v
. Because of Law 378, the condition u = y ◇ w
for some w
is equivalent to u = u ◇ w
, so we may eliminate a variable. In terms of R, the axiom is now
- Anti-1055: There exists
x, z, w, u, v
such thatR(z,x,w)
,R(u,w,u)
,R(u,x,v)
are true, butR(x,v,x)
is false.
Perhaps it could be slightly easier to locate counterexamples here.
Terence Tao (Oct 27 2024 at 16:01):
Actually this suggests the following exercise for anyone with some spare ATP compute. We have five open implications from 854: 413 x = x ◇ (x ◇ (x ◇ (y ◇ x)))
, 1045 x = x ◇ ((y ◇ (y ◇ x)) ◇ x)
, 1055 x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
, 3316 x ◇ y = x ◇ (y ◇ (x ◇ y))
, and 3925 x ◇ y = (x ◇ (y ◇ x)) ◇ y
. We know that 1055 implies 1045, but otherwise the five laws have no implications between them. However, if we assume 854, it may be that there are more equivalences here. Over on the 1485 thread, for instance, the open implications from 1485 are to 151, 3456, and 3862; they are not equivalent to each other in the absence of any other axioms, but we also know that assuming 1485, they are equivalent, so really there is just one anti-implication to resolve there. So perhaps there is a similar collapse among the five open implications here, and if so that should be able to be picked up by standard ATP queries. This may help clarify which of these laws are the easiest to attack.
Michael Bucko (Oct 27 2024 at 16:17):
Running one more Sage script for 854->1055 with anti-1055 (posting for the reference- hoping to not have a bug here).
# Check 854:
def check_equation_854(table):
for x in S:
for y in S:
for z in S:
yz = table[y][z]
xz = table[x][z]
yz_xz = table[yz][xz]
lhs = x
rhs = table[x][yz_xz]
if lhs != rhs:
return False
return True
# NOT 1055:
# There exists x, z, w, u, v such that:
# w = z ◇ x
# u = u ◇ w
# v = u ◇ x
# x ≠ x ◇ v
def check_not_equation_1055(table):
for x in S:
for z in S:
w = table[z][x]
for u in S:
# Check if u = u ◇ w
if table[u][w] != u:
continue
v = table[u][x]
# Check if x != x ◇ v
if table[x][v] != x:
return True # Yay!
return False
Jose Brox (Oct 27 2024 at 16:58):
Terence Tao ha dicho:
We have five open implications from 854: 413
x = x ◇ (x ◇ (x ◇ (y ◇ x)))
, 1045x = x ◇ ((y ◇ (y ◇ x)) ◇ x)
, 1055x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
, 3316x ◇ y = x ◇ (y ◇ (x ◇ y))
, and 3925x ◇ y = (x ◇ (y ◇ x)) ◇ y
. We know that 1055 implies 1045, but otherwise the five laws have no implications between them. However, if we assume 854, it may be that there are more equivalences here.
I've been running in Prover9/Mace4 for several minutes all the implications starting or ending with 413, and no proof nor countermodel has been found. Is there perhaps any set-theoretical reason impeding finite counterexamples?
Michael Bucko (Oct 27 2024 at 17:05):
Jose Brox schrieb:
Terence Tao ha dicho:
We have five open implications from 854: 413
x = x ◇ (x ◇ (x ◇ (y ◇ x)))
, 1045x = x ◇ ((y ◇ (y ◇ x)) ◇ x)
, 1055x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
, 3316x ◇ y = x ◇ (y ◇ (x ◇ y))
, and 3925x ◇ y = (x ◇ (y ◇ x)) ◇ y
. We know that 1055 implies 1045, but otherwise the five laws have no implications between them. However, if we assume 854, it may be that there are more equivalences here.I've been running in Prover9/Mace4 for several minutes all the implications starting or ending with 413, and no proof nor countermodel has been found. Is there perhaps any set-theoretical reason impeding finite counterexamples?
Can you please share your Prover9 script with anti-1055?
Ibrahim Tencer (Oct 27 2024 at 17:08):
Douglas McNeil said:
Verifying equations:
✓ Equation 854 satisfied
✓ Found 12 violations of equation 10:
At (x,y)=(0,2): 0 ≠ 0 ◇ (2 ◇ 0) = 0 ◇ 2 = 3
At (x,y)=(0,12): 0 ≠ 0 ◇ (12 ◇ 0) = 0 ◇ 12 = 3
At (x,y)=(0,18): 0 ≠ 0 ◇ (18 ◇ 0) = 0 ◇ 18 = 3
At (x,y)=(15,2): 15 ≠ 15 ◇ (2 ◇ 15) = 15 ◇ 2 = 3
At (x,y)=(15,12): 15 ≠ 15 ◇ (12 ◇ 15) = 15 ◇ 12 = 3
At (x,y)=(15,18): 15 ≠ 15 ◇ (18 ◇ 15) = 15 ◇ 18 = 3
At (x,y)=(16,2): 16 ≠ 16 ◇ (2 ◇ 16) = 16 ◇ 2 = 3
At (x,y)=(16,12): 16 ≠ 16 ◇ (12 ◇ 16) = 16 ◇ 12 = 3
At (x,y)=(16,18): 16 ≠ 16 ◇ (18 ◇ 16) = 16 ◇ 18 = 3
At (x,y)=(17,2): 17 ≠ 17 ◇ (2 ◇ 17) = 17 ◇ 2 = 3
At (x,y)=(17,12): 17 ≠ 17 ◇ (12 ◇ 17) = 17 ◇ 12 = 3
At (x,y)=(17,18): 17 ≠ 17 ◇ (18 ◇ 17) = 17 ◇ 18 = 3You can see what turns out to be happening. Originally we only had the (0,2), then I found (0,12) and then we added (15,12), and (15,2), and now the game is on.
So in all of these examples we have yx = x and x² ≠ x. Is it known whether a counterexample to 10 must be like this?
Terence Tao (Oct 27 2024 at 18:42):
I think I've gotten a way to get abstract nonsense (or "syntactic" arguments) to get new anti-implication, namely that 854 x = x ◇ ((y ◇ z) ◇ (x ◇ z))
does not imply 3925 x ◇ y = (x ◇ (y ◇ x)) ◇ y
, but I'm not 100% sure about this, it's sort of a weird argument, mostly taking advantage of the fact that the 854 law is of the left-absorbent form x = x ◇ f(x,y,z)
for some word f(x,y,z)
.
Define the G
to be the free magma on two generators x
, y
, i.e., the magma of words in x
, y
, and let G/~
be the quotient of G
by the equivalence relation ~
defined by saying that w ~ w'
if one can rewrite w
to w'
after finitely many applications of the law 854. Then, abstract nonsense tells us that G/~
is universal for 854, at least regarding identities involving at most two variables. In particular, to disprove 3925, it suffices to show that the words x ◇ y
, (x ◇ (y ◇ x)) ◇ y
are not equal in G/~
.
Now, 854 is not a confluent law, nor do we know of a complete rewriting system for it, so the relation ~
is pretty mysterious, which is why we have not tried to make much use of this equivalence relation so far. But we can say a few things about it. If we can refute some proposed law w(x,y) = w'(x,y)
in any magma obeying 854, then this implies that w
and w'
are not equivalent. For instance, because we have at least one known counterexample to the 10 law x = x ◇ (y ◇ x)
, we know that the words x
and x ◇ (y ◇ x)
are not equivalent with respect to ~
. More generally, the large number of implications and anti-implications that we already have regarding 854 (collected very conveniently in Equation Explorer) tell us a lot about this equivalence relation already for short words. Also, all of the finite 854 examples we have been collecting will be very useful for establishing if two given long words are not equal to each other.
Also, 854 implies law 378, x ◇ y = (x ◇ y) ◇ y
. As a consequence, if we define the left Cayley graph on an 854 magma by declaring x → y
whenever y = z ◇ x
for some z
, we see that x → y
if and only if y = y ◇ x
. We can then lift this up to a graph on the free group G
: in this group, x → y
if and only if y ~ y ◇ x
, or equivalently (by law 378) if y ~ z ◇ x
for some z
.
As observed previously, law 10 asserts that this graph has some one-way edges. In particular, for the two generators x
, y
of G
, we have x → y ◇ x
(because (y ◇ x) ◇ x ~ y ◇ x
), but we do not have the edge y ◇ x → x
(because it is not true that x ◇ (y ◇ x) ~ x
.) More generally, the implications and anti-implications we already know about 854, as well as the many finite models we have, already tell us a lot about this directed graph on short and medium length words.
OK, so far this is just abstract nonsense translating things we already know into the free magma. But here is the key new fact: the free magma has a "unique factorization" type property!
Unique factorization. Suppose that a, b, c, d
are elements of the free group such that a ◇ b ~ c ◇ d
, with the relations b → a
and d → c
being false (thus, a ◇ b ~ a
and c ◇ d ~ c
are both false). Then a ~ c
and b ~ d
.
Proof. By hypothesis, it is possible to transform the word a ◇ b
into the word c ◇ d
by some finite number of rewrites using 854. Because of the form of 854, any rewrite either replaces a term e
with an equivalent term e ◇ f
for which there is an edge f → e
, or else replaces e ◇ f
with an equivalent term e
for which there is an edge f → e
. Because of this and the initial hypothesis that we do not have the edge b → a
, we see from induction that the only words that can be obtained from rewriting a ◇ b
take the form (((a' ◇ b') ◇ w(1)) ◇ ... ◇ w(n))
where a' ~ a
, b' ~ b
, n
is a natural number, and w(1),...,w(n)
are words such that we have edges
w(i+1) → (((a' ◇ b') ◇ w(1)) ◇ ... ◇ w(i))
for all i<n
. Comparing this with c ◇ d
, the words can only match if n=0
, c=a'
, d=b'
, giving the claim. □
Now we can contradict 3925. It suffices to show that the words x ◇ y
, (x ◇ (y ◇ x)) ◇ y
are not equivalent. We already know (from the failure of 10) that x
and (x ◇ (y ◇ x))
are not equivalent. From unique factorization, we are then done unless we have an edge y → x
or an edge y → (x ◇ (y ◇ x))
.
If we had an edge y → x
, then x ◇ y ~ x
. That is to say, every 854 law would obey the left absorption property x ◇ y ~ x
(Equation 4). So this is clearly not the case.
If we had an edge y → (x ◇ (y ◇ x))
, then (x ◇ (y ◇ x)) ◇ y ~ x ◇ (y ◇ x)
, and then 3295 would simplify to
x ◇ y = x ◇ (y ◇ x)
(Equation 325). But this law is not implied by 854 (this is listed on Equation Explorer, presumably one of the many 854 magmas we have generated will demonstrate this). So we are done!
[Note: the way I came up with this argument was by observing that the greedy approach to building an 854 magma would always either define a ◇ b
equal to a
, or else to some completely new element c
that one never tries to "use" again. This suggested something like a unique factorization property holding true for this greedy model, but I had trouble formalizing this intuition until I returned to the abstract nonsense world of formal words.]
Matthew Bolan (Oct 27 2024 at 19:14):
I suppose this doesn't apply to the other 4 implications because they are all of the form
Matthew Bolan (Oct 27 2024 at 19:17):
Where is the assumption that is false used?
Terence Tao (Oct 27 2024 at 19:18):
Well, 3316 has a decent chance I think; I haven't checked it, perhaps someone else is willing to give it a try.
As for the other three, it may be possible to prove something like the following: in the free magma, the relation w → x
(i.e., x ~ x ◇ w
) is only true if w ~ (y ◇ z) ◇ (x ◇ z)
for some y,z
. Then this converts the other three laws back into something involving products for which unique factorization might be useful again.
Terence Tao (Oct 27 2024 at 19:19):
One needs the falsity of to exclude the cases, e.g., to prevent from being equal to for some .
Matthew Bolan (Oct 27 2024 at 19:19):
ah, thanks
Terence Tao (Oct 27 2024 at 19:20):
The conditions and are saying somehow that the products , are "irreducible" at the top level. It seems that unique factorization means that every word in the free group has a unique "canonical reduction" in which all products at all levels are irreducible.
Terence Tao (Oct 27 2024 at 19:22):
This isn't the same as confluence, I think because the edges can be obtained in a rather complicated fashion; it isn't just using the 854 rule to shorten the word, sometimes you have to lengthen the word as well.
Matthew Bolan (Oct 27 2024 at 19:25):
So, 3316 being false boils down to showing that we don't have an edge ?
Terence Tao (Oct 27 2024 at 19:27):
Well, there are two cases. If we don't have an edge, then I think we can conclude by unique factorization since we know from the failure of 10 that is not equivalent to , and we also know there is no edge . If there is an edge, then the RHS of 3316 collapses to and I think we again win because of the failure of Equation 4. So, I guess basically the same argument used for 3925 should work for 3316.
Terence Tao (Oct 27 2024 at 19:28):
Probably we don't have an edge, by the way, and presumably one of the finite magmas we possess will demonstrate that.
Terence Tao (Oct 27 2024 at 19:29):
Or, one can use the fact (from EE) that law 53 x = x ◇ (y ◇ (x ◇ y))
is not a consequence of 854.
Terence Tao (Oct 27 2024 at 19:30):
(Now we're really using the full power of Equation Explorer!)
Terence Tao (Oct 27 2024 at 19:30):
Basically unique factorization + equation explorer seems to tell us quite a lot of information about the free magma for 854
Michael Bucko (Oct 27 2024 at 19:48):
Michael Bucko schrieb:
Running one more Sage script for 854->1055 with anti-1055 (posting for the reference- hoping to not have a bug here).
# Check 854: def check_equation_854(table): for x in S: for y in S: for z in S: yz = table[y][z] xz = table[x][z] yz_xz = table[yz][xz] lhs = x rhs = table[x][yz_xz] if lhs != rhs: return False return True # NOT 1055: # There exists x, z, w, u, v such that: # w = z ◇ x # u = u ◇ w # v = u ◇ x # x ≠ x ◇ v def check_not_equation_1055(table): for x in S: for z in S: w = table[z][x] for u in S: # Check if u = u ◇ w if table[u][w] != u: continue v = table[u][x] # Check if x != x ◇ v if table[x][v] != x: return True # Yay! return False
It was running for 3 hours. No result.
Terence Tao (Oct 27 2024 at 19:52):
OK here are a few more basic facts about the free magma.
Call a word "irreducible" if it is not of the form a ◇ b
with b → a
; that is to say, it is either one of the generators x
, y
, or else is of the form a ◇ b
with no edge of the form b → a
. Clearly if a word is reducible then it is equivalent to the shorter word a
, so on iterating we see that every word is equivalent to an irreducible word. Unique factorization then says that this irreducible word is unique up to replacing its factors by equivalent words. So every word in fact has a unique "totally irreducible" equivalent, in which every term in the word is irreducible.
A modification of the syntactic analysis shows that if one performs rewrites to an irreducible word a ◇ b
, then the only words one can reach are of the form ((a' ◇ b') ◇ w(1)) ... ◇ w(n)
where a' ~ a
, b' ~ b
, and for each i<n
, the word w(i+1)
is equivalent to a word of the form (Y ◇ Z) ◇ (X ◇ Z)
where X ~ ((a' ◇ b') ◇ w(1)) ... ◇ w(i)
. (I use capital letters here because I am reserving x
, y
for the generators of the free magma.) This completely describes all the words equivalent to a ◇ b
, and so we conclude that W → a ◇ b
if and only if W ~ (Y ◇ Z) ◇ ((a ◇ b) ◇ Z)
for some Y
, Z
. Similarly if a ◇ b
is replaced by one of the generators x
, y
. Since every word is equivalent to one of these irreducible forms, we conclude
Description of graph. Let X, W
be words in the free group. Then W → X
if and only if W ~ (Y ◇ Z) ◇ (W ◇ Z)
for some words Y
, Z
.
Informally, the only way (up to equivalence) to generate edges in this graph is from the obvious way of just applying 854.
This description may help make some headway with the other open implications 413, 1045, 1055. It won't be as easy as with 3316 and 3925 in which we only had to analyze short words though, because now the new words Y
, Z
are not initially of any bounded size. But perhaps unique factorization can be used to bring them under control.
Matthew Bolan (Oct 27 2024 at 20:08):
What is U in the description of the graph?
Terence Tao (Oct 27 2024 at 20:09):
Oops that was a typo, it should have been W.
Matthew Bolan (Oct 27 2024 at 20:48):
Is there a reason why you restrict to only the open anti-implications here? Is there some automated check done to see if known anti-implications don't contradict unknown ones? Terence Tao said:
Actually this suggests the following exercise for anyone with some spare ATP compute. We have five open implications from 854: 413
x = x ◇ (x ◇ (x ◇ (y ◇ x)))
, 1045x = x ◇ ((y ◇ (y ◇ x)) ◇ x)
, 1055x = x ◇ ((y ◇ (z ◇ x)) ◇ x)
, 3316x ◇ y = x ◇ (y ◇ (x ◇ y))
, and 3925x ◇ y = (x ◇ (y ◇ x)) ◇ y
. We know that 1055 implies 1045, but otherwise the five laws have no implications between them. However, if we assume 854, it may be that there are more equivalences here. Over on the 1485 thread, for instance, the open implications from 1485 are to 151, 3456, and 3862; they are not equivalent to each other in the absence of any other axioms, but we also know that assuming 1485, they are equivalent, so really there is just one anti-implication to resolve there. So perhaps there is a similar collapse among the five open implications here, and if so that should be able to be picked up by standard ATP queries. This may help clarify which of these laws are the easiest to attack.
Matthew Bolan (Oct 27 2024 at 20:52):
I guess I am asking if there is any work done on understanding the implication graphs for the remaining equations -- in an 854 magma, which laws are known to imply which others?
Matthew Bolan (Oct 27 2024 at 20:54):
I suppose for 854 the most interesting anti-implications we know come from this 12-element magma, which we know doesn't resolve anything open.
Amir Livne Bar-on (Oct 27 2024 at 21:04):
I formalized 854+anti1055 and 854+anti413 as integer linear programs. It takes a few minutes to run through all 5-element multi-valued magmas and prove there are no solutions. I'm running now with 6-elements, this will probably take a few hours to complete.
But the fact it works at all means that maybe we can use ILP solvers to find interesting magmas for 854. What's an interesting property? I wasn't following the discussion too closely. We can look for counter-examples to 10 with specific properties for instance.
Michael Bucko (Oct 27 2024 at 21:06):
Amir Livne Bar-on schrieb:
I formalized 854+anti1055 and 854+anti413 as integer linear programs.
Could you please share those programs? (I'm always learning from those examples)
Michael Bucko (Oct 27 2024 at 21:09):
(deleted)
Amir Livne Bar-on (Oct 27 2024 at 21:10):
Sure, here's the code: scip_854.py
Terence Tao (Oct 27 2024 at 21:15):
Matthew Bolan said:
I guess I am asking if there is any work done on understanding the implication graphs for the remaining equations -- in an 854 magma, which laws are known to imply which others?
In principle we could look at all of the 4660 equations not implied by 854, and see what the implication graph among those equations is assuming 854, but this is basically repeating the entire project, but now for 854 magmas, and seems like quite a lot of work. It could potentially give a little bit of insight as to how the five (or maybe now just three) outstanding implications relate to each other, but it seems like quite a bit of effort for relatively little reward right now.
That said, it has been proposed multiple times that this project could be extended at some point to study things like "Do Law X and Law Y jointly imply Law Z"? Certainly many of our techniques would carry over, but the project seems to be significantly larger in scale, and likely to contain many more extremely difficult implications, than our current project.
Mario Carneiro (Oct 27 2024 at 21:24):
Terence Tao said:
I think I've gotten a way to get abstract nonsense (or "syntactic" arguments) to get new anti-implication, namely that 854
x = x ◇ ((y ◇ z) ◇ (x ◇ z))
does not imply 3925x ◇ y = (x ◇ (y ◇ x)) ◇ y
, but I'm not 100% sure about this, it's sort of a weird argument, mostly taking advantage of the fact that the 854 law is of the left-absorbent formx = x ◇ f(x,y,z)
for some wordf(x,y,z)
.
It seems to work! equational#745
Terence Tao (Oct 27 2024 at 21:24):
That was incredibly fast!
Terence Tao (Oct 27 2024 at 21:24):
I was still writing up the blueprint proof lol
Mario Carneiro (Oct 27 2024 at 21:24):
Can someone help me figure out how to prove these theorems:
theorem not_l4' : ∃ (G : Type*) (_ : Magma G), Equation854 G ∧ ¬Equation4 G := sorry
theorem not_l10' : ∃ (G : Type*) (_ : Magma G), Equation854 G ∧ ¬Equation10 G := sorry
theorem not_l325' : ∃ (G : Type*) (_ : Magma G), Equation854 G ∧ ¬Equation325 G := sorry
Mario Carneiro (Oct 27 2024 at 21:25):
I thought that was the blueprint :)
Mario Carneiro (Oct 27 2024 at 21:26):
I do find this kind of technique interesting, effectively the free magma lets us "superimpose" a bunch of counterexample magmas into one magma where all of the non-implications hold simultaneously
Mario Carneiro (Oct 27 2024 at 21:27):
It would be interesting to see if it's possible to prove it without free magmas
Terence Tao (Oct 27 2024 at 21:28):
We have 'Facts' to establish not_l4'
, not_l10'
, not_l325'
that are somewhere in the Lean codebase
Terence Tao (Oct 27 2024 at 21:33):
In principle this script of @Amir Livne Bar-on should help find them
Terence Tao (Oct 27 2024 at 21:56):
For 4 and 10, one can find a counterexample by using the main result of https://github.com/teorth/equational_theories/blob/main/equational_theories/Generated/All4x4Tables/Refutation870.lean to refute for instance 1038, which is a special case of both 4 and 10.
Terence Tao (Oct 27 2024 at 22:00):
For 325, one can similarly use https://github.com/teorth/equational_theories/blob/main/equational_theories/Generated/All4x4Tables/Refutation82.lean to refute the special case 307
Mario Carneiro (Oct 27 2024 at 22:08):
But Refutation82 doesn't prove 854?
Terence Tao (Oct 27 2024 at 22:09):
This is what I see:
/-! The facts -/
@[equational_result]
theorem «Facts from FinitePoly x² + x * y % 2» :
∃ (G : Type) (_ : Magma G) (_: Finite G), Facts G [10, 11, 106, 109, 111, 371, 378, 433, 434, 442, 443, 854, 1043, 1053, 1247, 1257, 1265, 1271, 1855, 1863, 3285, 3306, 3321, 3726, 3727, 3895, 3931, 4087, 4293, 4318, 4673] [47, 102, 115, 117, 118, 124, 125, 127, 151, 203, 255, 307, 412, 416, 417, 419, 420, 463, 464, 466, 467, 473, 474, 476, 477, 500, 501, 503, 504, 510, 511, 513, 614, 822, 825, 826, 869, 870, 872, 873, 879, 880, 882, 883, 906, 907, 909, 910, 916, 917, 919, 1021, 1026, 1029, 1072, 1073, 1075, 1076, 1082, 1083, 1085, 1086, 1109, 1110, 1112, 1113, 1119, 1120, 1122, 1232, 1275, 1276, 1278, 1279, 1285, 1286, 1288, 1289, 1312, 1313, 1315, 1316, 1322, 1323, 1325, 1426, 1629, 1833, 1837, 1838, 1840, 1841, 1848, 1858, 1884, 1885, 1887, 1888, 1894, 1895, 1897, 1898, 1921, 1922, 1924, 1925, 1931, 1932, 1934, 2035, 2238, 2441, 2644, 2847, 3050, 3254, 3258, 3259, 3261, 3262, 3268, 3269, 3271, 3272, 3281, 3308, 3309, 3342, 3343, 3345, 3346, 3352, 3353, 3456, 3664, 3667, 3668, 3674, 3675, 3678, 3712, 3714, 3722, 3748, 3749, 3751, 3752, 3759, 3761, 3863, 3868, 3871, 3877, 3880, 3890, 3917, 3918, 3924, 3927, 3951, 3952, 3954, 3955, 3961, 3962, 3964, 4074, 4080, 4118, 4120, 4121, 4127, 4128, 4130, 4155, 4157, 4158, 4164, 4165, 4167, 4268, 4272, 4273, 4275, 4276, 4283, 4284, 4290, 4291, 4320, 4321, 4343, 4380, 4585, 4587, 4599, 4605, 4629, 4635, 4658] :=
⟨Fin 2, «FinitePoly x² + x * y % 2», Finite.of_fintype _, by decideFin!⟩
854 is listed there as something that is obeyed I think
Mario Carneiro (Oct 27 2024 at 22:10):
oh there are multiple Refutation82's, you meant https://github.com/teorth/equational_theories/blob/main/equational_theories/Generated/FinitePoly/Refutation82.lean
Terence Tao (Oct 27 2024 at 22:10):
Oh sorry oops I was doing all this manually. (I couldn't get Amir's script to build the big json file, something to do with Windows maximum path length or some such nonsense)
Mario Carneiro (Oct 27 2024 at 22:15):
all fixed now
Douglas McNeil (Oct 27 2024 at 22:16):
@Amir Livne Bar-on : FWIW I got the dreaded "unsatisfiable" via sat on 854 + at least one 413 violation up to order ~8 I think, maybe 9. Can't remember offhand how high I got for 10, but it wasn't enough to rule out a smaller example than 11. That might be as much a measure of how well I was coding before bed that day as anything, of course. ;-)
Amir Livne Bar-on (Oct 27 2024 at 22:19):
Kind of odd that we can't get at the file names, but anyway here's the file, generated 2 days ago on commit a1d9acd6
:
full_entries.json
Amir Livne Bar-on (Oct 27 2024 at 22:24):
Douglas McNeil said:
Amir Livne Bar-on : FWIW I got the dreaded "unsatisfiable" via sat on 854 + at least one 413 violation up to order ~8 I think, maybe 9. Can't remember offhand how high I got for 10, but it wasn't enough to rule out a smaller example than 11. That might be as much a measure of how well I was coding before bed that day as anything, of course. ;-)
Was this for magmas or for multi-magmas? In any case, sounds like SAT is a better fit for this problem than integer programming.
Douglas McNeil (Oct 27 2024 at 22:40):
Only magmas (assuming multimagmas are the multivalued magmas of the relaxation conversation?) but I don't trust negative results I produce anyway.. too easy for me to accidentally overconstrain and not realize it because I'm still getting enough of what I expect to see.
That said, I'd be happy if someone proved there were no finite counterexamples!
Terence Tao (Oct 27 2024 at 23:23):
Moot now, but I got the script to work with the json provided.
PS C:\Users\teort\Dropbox\Lean\equational_theories> py scripts/explain_implication.py Equation854 Equation10
Equation10 => Equation3935
RewriteHypothesis.Equation10_implies_Equation3935 in Generated/TrivialBruteforce/theorems/RewriteHypothesis.lean
Equation854 has a model that does not satisfy Equation3935
«Facts from FinitePoly [[2,3,0,4,0,10,0,10,0,7,7],[1,5,6,1,8,1,8,8,6,1,6],[2,9,6,2,2,2,2,2,9,2,6],[2,3,3,2,3,9,3,9,3,7,7],[4,5,4,4,8,4,5,8,4,4,4],[2,5,5,2,5,9,5,2,5,5,5],[6,5,6,6,8,6,5,8,6,6,6],[4,3,7,4,5,4,5,8,7,7,7],[8,9,6,8,8,8,8,8,9,8,6],[2,9,9,2,9,9,9,9,9,4,4],[2,9,9,2,10,10,10,10,9,2,6]]» in Generated/All4x4Tables/Refutation870.lean
PS C:\Users\teort\Dropbox\Lean\equational_theories> py scripts/explain_implication.py Equation854 Equation4
Equation4 => Equation3722
Subgraph.Equation4_implies_Equation3722 in Subgraph.lean
Equation854 has a model that does not satisfy Equation3722
«Facts from FinitePoly x² + x * y % 2» in Generated/FinitePoly/Refutation82.lean
PS C:\Users\teort\Dropbox\Lean\equational_theories> py scripts/explain_implication.py Equation854 Equation325
Equation325 => Equation307
SimpleRewrites.Equation325_implies_Equation307 in Generated/SimpleRewrites/theorems/Rewrite_yx.lean
Equation854 has a model that does not satisfy Equation307
«Facts from FinitePoly x² + x * y % 2» in Generated/FinitePoly/Refutation82.lean
Terence Tao (Oct 27 2024 at 23:54):
@Mario Carneiro It's not super urgent, but if at some point you are able to update your proof of 854 !=> 3925 to also refute 3316 that would be great (it's almost the same argument, re-using the same facts not_l4'
, not_l10
, not_l325'
; see https://teorth.github.io/equational_theories/blueprint/854-chapter.html). I could try to modify the code myself but I would probably make it much clunkier as a result.
Mario Carneiro (Oct 28 2024 at 00:04):
Mario Carneiro (Oct 28 2024 at 00:05):
one fun fact here is that X ≠ X ◇ (Y ◇ X)
and Y ≠ Y ◇ (X ◇ Y)
are actually different theorems, you have to prove them separately, making use of the fact that X
and Y
are generators of the free group. So now there are two versions of not_l10
in the proof
Amir Livne Bar-on (Oct 28 2024 at 05:57):
Amir Livne Bar-on said:
I'm running now with 6-elements.
The order 6 run took a bit more than an hour (and failed for both anti-axioms), and order 7 has been running since and is going to take a while
Michael Bucko (Oct 28 2024 at 06:09):
Terence Tao schrieb:
This script is very useful working on their learning Lean skills and exploring the graph.
(equational_theories) riccitensor@insight equational_theories % python3 scripts/explain_implication.py --entries-file ./full_entries.json Equation854 Equation1054 Equation1054 => Equation1047 SimpleRewrites.Equation1054_implies_Equation1047 in Generated/SimpleRewrites/theorems/Rewrite_wz_zx.lean Equation1047 => Equation650 Equation1047_implies_Equation650 in Generated/VampireProven/Proofs1.lean Equation650 => Equation448 Equation650_implies_Equation448 in Generated/VampireProven/Proofs11.lean Equation448 => Equation441 Equation448_implies_Equation441 in Generated/VampireProven/Proofs10.lean Equation441 => Equation4 Equation441_implies_Equation4 in Generated/VampireProven/Proofs10.lean Equation4 => Equation3 Subgraph.Equation4_implies_Equation3 in Subgraph.lean Equation854 has a model that does not satisfy Equation3 Magma2c.Facts in SmallMagmas.lean
Eric Taucher (Oct 28 2024 at 10:14):
Terence Tao said:
build the big json file, something to do with Windows maximum path length or some such nonsense
Had same/similar problem
@Richard Copley noted a work around.
I was able to get an exe by writing the arguments to a file
args.txt
then invoking leanc asleanc.exe @args.txt
.
It is noted in this topic with the specific problem being solved with the help of @Richard Copley starting with this reply.
Eric Taucher (Oct 28 2024 at 10:23):
If you want just the exact steps that worked in my case for the Windows command line length problem, they are in this reply.
Michael Bucko (Oct 28 2024 at 11:29):
Mario Carneiro schrieb:
Inspiring proof! (and thank you for your suggestions)
Asking questions related to the proof and the techniques here (while playing with not_1055)- https://leanprover.zulipchat.com/#narrow/channel/113489-new-members/topic/not_1055.20for.20beginners/near/479252733
Michael Bucko (Oct 28 2024 at 13:41):
Re 854 not 1055: could we use a similar line to the other proofs from @Mario Carneiro ?
Something like this:
- refine (find witnesses to existential qualifier) the core theorem (needs to be found) using unique factorization (another 1-3 theorems need to be found?), and a custom relation (to be found) to optimize for contradiction potential (in rather few steps)
What's the blocker here?
Also, given that we can (very efficiently) write software / train models / do atp and so on, is there any way to "bootstrap" those proofs with lines (like the one above)?
For the reference, the three theorems from Mario:
theorem not_l4 : X ≠ X ◇ Y := by
refine fun h => let ⟨G, _, h1, h2⟩ := not_l1038'; h2 fun x y => ?_
let φ := FreeMagmaWithLaws.evalHom (fun | 0 => x | _ => (y ◇ (x ◇ y)) ◇ x) <|
Law.satisfiesSet_singleton.2 <| (Law854.models_iff _).2 h1
simpa [MagmaHom.map_op, X, Y, φ] using congrArg φ h
theorem not_l10_2 : Y ≠ Y ◇ (X ◇ Y) := by
refine fun h => let ⟨G, _, h1, h2⟩ := not_l1038'; h2 fun x y => ?_
let φ := FreeMagmaWithLaws.evalHom (fun | 1 => x | _ => y ◇ (x ◇ y)) <|
Law.satisfiesSet_singleton.2 <| (Law854.models_iff _).2 h1
simpa [MagmaHom.map_op, X, Y, φ] using congrArg φ h
theorem not_l325 : X ◇ Y ≠ X ◇ (Y ◇ X) := by
refine fun h => let ⟨G, _, h1, h2⟩ := not_l307'; h2 fun x => ?_
let φ := FreeMagmaWithLaws.evalHom (fun _:ℕ => x) <|
Law.satisfiesSet_singleton.2 <| (Law854.models_iff _).2 h1
simpa [MagmaHom.map_op, X, Y, φ] using congrArg φ h
And the custom relation:
def Rel (x y : G) := ∃ z, y = z ◇ x
local infix:50 " ⇝ " => Rel
theorem rel_iff {x y} : x ⇝ y ↔ y = y ◇ x := ⟨fun ⟨_, H⟩ => H ▸ l378 .., fun H => ⟨_, H⟩⟩
Mario Carneiro (Oct 28 2024 at 13:52):
It would be a good benchmark to try to get an ATP to at least reproduce the disproof of 3925 if you give it all the facts we used (the non-theorems, the unique factorization theorem and the main law and possibly the derived laws too)
Ibrahim Tencer (Oct 28 2024 at 13:59):
Mario Carneiro said:
one fun fact here is that
X ≠ X ◇ (Y ◇ X)
andY ≠ Y ◇ (X ◇ Y)
are actually different theorems, you have to prove them separately, making use of the fact thatX
andY
are generators of the free group. So now there are two versions ofnot_l10
in the proof
I know nothing about Lean but is there not some way to combine the proofs? At the very least by using an automorphism of the free group or something.
Mario Carneiro (Oct 28 2024 at 14:07):
yes, but that is literally what those lines are already doing
Mario Carneiro (Oct 28 2024 at 14:08):
if it came up more than a few times we could process it a bit more to get the line count down
Mario Carneiro (Oct 28 2024 at 14:09):
in particular, at the very least you need to prove that i != j
in order to instantiate that for arbitrary generators Leaf i
and Leaf j
instead of X := Leaf 0
and Y := Leaf 1
Terence Tao (Oct 28 2024 at 15:05):
Here is a different way to view the recent proofs of the anti-implications. Let be the free magma on some generators, say . To avoid some confusion I will denote the free magma operation on by rather than . Define a relaxed magma with the same carrier as but with the multi-valued magma operation
thus the product of and is either the usual product , or just the left entry . Intuitively, this models a "sometimes left-absorptive" magma operation.
Finally, let be the free magma on the same generators modulo the law 854. Then unique factorization tells us that there is an injective homomorphism from to , that maps each word to its unique totally irreducible representation, where totally irreducible means that it is either a generating variable, or of the form where are totally irreducible and . The existence of this injective homomorphism is coming from the (sometimes) left-absorptive nature of the law 854; we would not expect this behavior for say 1485 for instance.
Now consider for instance the law 3925 . In order for this to hold in the relaxed magma for a given choice of , a case check shows that one of the following must hold for the various interpretations used for in order for this identity to hold:
- . (Equation 10)
- . (Equation 4)
- (Equation 325).
Hence if 3925 is to hold in the injective copy of , one of these three laws must hold in itself for the generating variables , and hence be true in every 854-obeying magma for all choices of variables by universality; but we have counterexamples to refute all three possibilities. A similar argument refutes 3316.
This doesn't help us with the other three open anti-implications, but I think it clarifies a little bit what is going on. We're still using custom designed magmas to refute implications; it's just that now we found one which is a relaxed magma rather than a genuine magma.
Michael Bucko (Oct 28 2024 at 15:36):
So the path for 3925 to hold in M854 was (at least) through 10, 4 or 325, but we can find counterexamples that deny those paths, so 854 can't imply 3925.
We might need to find more aggressive ways of finding those paths / counterexamples (compute seems to be a major limitation). We're currently considering relaxed magmas - could relaxed implication chains be a thing?
So far, my transformer check didn't work (DeepSeek). Tactic generation based on the existing proofs based on the DeepSeek model didn't work.
I'm switching to LeanDojo & will be experimenting (code + llm + lean combined).
Amir Livne Bar-on (Oct 28 2024 at 15:48):
There are two reasons why this proof search might fail: either the prover is not good enough to generate proofs, or the other ones are more difficult/ very different from the training data.aybe we can check whether it can prove one of the non-implications after seeing the other two?
Michael Bucko (Oct 28 2024 at 16:10):
Amir Livne Bar-on schrieb:
There are two reasons why this proof search might fail: either the prover is not good enough to generate proofs, or the other ones are more difficult/ very different from the training data.aybe we can check whether it can prove one of the non-implications after seeing the other two?
I always provide them with more information. But even that didn't help the current Lean tactic generators. I shared one notebook that uses DeepSeek for generating tactics in a different thread. Check it out, esp. using a longer context.
On the ATP side, I still need more time, but I guess we simply need to give them all the facts we know (Mario mentioned that too). This also means that we need to generate those facts efficiently.
@Adam Topaz will soon put his pretrained transformer on HF, and we might be able to use Spaces and Autotrain to experiment.
EDIT: What I am using now is Cursor with LLMs. And soon, hopefully today, LeanDojo.
Matthew Bolan (Oct 28 2024 at 17:08):
I'd like to think aloud about what unique factorization tells us about whether 854 implies 413 (x = x ◇ (x ◇ (x ◇ (y ◇ x)))). IIUC, by unique factorization, iff for some words . Now, as by equation 49, by equation 10, and by equation 4, (x ◇ (x ◇ (y ◇ x)))) is "totally irreducible", so I think unique factorization gives us strong constraints on what a and b can be. Is this kind of analysis correct and worth pursuing further?
Terence Tao (Oct 28 2024 at 17:12):
Yes, I think so far so good. The next logical step is to divide into two cases, depending on whether or not. As you point out, the key issue is to constrain and , which currently can be words of unbounded length. In some cases we can "eliminate quantifiers" and remove an unbounded variable by using Law 378, which converts a statement such as "There exists such that " to just "", but this trick doesn't cover all use cases.
Matthew Bolan (Oct 28 2024 at 18:13):
Can we even say ?
Terence Tao (Oct 28 2024 at 18:42):
I think the free 854 magma contains no idempotents, because we have models of 854 with no idempotents (e.g., the original sporadic order 12 example). So in particular we do know that is not equivalent to its square. That's a useful additional technique, by the way: if law fails for every choice of variable in some model, then it also fails in the universal magma for every choice of variable.
Here's maybe some more notation to fix conventions. As before, is the free magma on say two generators (we can add more generators as needed, e.g., to study 1055), with operation . In this magma, no two words are equal; for instance, is not equal to , despite this equality holding for all 854 magmas. In particular does not obey 854. We can view the free magma for 854 as a modified submagma of , where the carrier consists of all the completely irreducible words in , and the operation is given by , depending on whether is true or not (or equivalently, whether is reducible or irreducible). One can think of as a "reduced" version of , in which one uses the law 854 to try to reduce the product if possible. We don't have full understanding of the relation , and as such we don't fully know how to compute for all , but thanks to EE and existing finite models, we understand a very large fraction of this relation. For instance, we know enough to say that is completely irreducible (because none of the laws (equation 49), (equation 10), or (equation 4) are consequences of 854), so in particular , but it is currently open whether is completely irreducible: could be equal to either or , depending on whether 413 is a consequence of 854 or not. Roughly speaking, we can fill out all small entries of the multiplication table except for those involving 413, 1045, or 1055.
One thing we do know is that for , that (i.e., ) if and only if for some . Unfortunately we do not have any bounds of the length of in terms of , so this does not reduce the verification of the relation to any finite search yet. But it is something.
Matthew Bolan (Oct 28 2024 at 18:50):
That's great if we know models with no idempotents, I ran into the above case at least once.
Matthew Bolan (Oct 28 2024 at 18:50):
Where can I find such models, is it the 12 element one again? Oh I missed the part of your message where you said this
Douglas McNeil (Oct 28 2024 at 18:59):
Order 12 is overkill for non-idempotency, even [[2, 0, 0],[1, 0, 1],[2, 2, 1]] is 854. :-)
Terence Tao (Oct 28 2024 at 19:02):
That could be a useful little model to keep in mind to quickly establish some non-implications . In particular, since in this order 3 model the relation is just the relation, we see that we will have whenever , are words that can map to the same element of this order 3 model. (This generalizes the previous observation that , i.e., no idempotents.)
Terence Tao (Oct 28 2024 at 19:05):
Algebraically, this model is with equal to if , and if , or more compactly . Even more algebraically, .
Terence Tao (Oct 28 2024 at 19:08):
I wonder if the other finite magma models we have are extensions of this model, for instance can the order 12 model have their 12 elements be grouped into three classes so that the multiplication table looks like a block version of the order 3 model? (Class 0 ◇ Class 0 = Class 2, Class 0 ◇ Class 1 = Class 0, etc.) If so, this may be a good way to try to focus the search for finite models.
Ibrahim Tencer (Oct 28 2024 at 19:34):
Douglas McNeil said:
Order 12 is overkill for non-idempotency, even [[2, 0, 0],[1, 0, 1],[2, 2, 1]] is 854. :-)
Yes and this is one example of the class of models that are left-absorptive except on the diagonal, all of which have no idempotents.
Terence Tao (Oct 28 2024 at 19:37):
I guess one should plot the complementary graph of non-edges for say the order 12 model. If this complementary graph breaks up into three or more connected components, then there is a chance for this graph to be an extension of the order 3 model, or more generally one of Ibrahim's models. It looks like there may be a few too many non-edges for this to be the case, though.
Matthew Bolan (Oct 28 2024 at 20:17):
Continuing my above thoughts, we know that x=x◇(x◇(x◇(y◇x)))) iff (x◇(x◇(y◇x))))=(a◇b)◇(x◇b) for some elements a,b. I think we cannot have . If this were the case, then together with unique factorization imply , and thus . Now and the fact about idempotents means we don't have an arrow , so , violating the fact 854 does not imply equation 10.
In the case , I also think we cannot have . Suppose otherwise. Then together with unique factorization tells me and , so , which we found impossible above.
In the final case, and tells me , but I haven't found a contradiction coming from this yet.
Matthew Bolan (Oct 28 2024 at 21:04):
Checked this last chain of equalities against the 12 element magma and 3 element one mentioned here, no dice. To falsify it we want to find a magma such that there exist such that for all the equalities x⋄(x⋄(y⋄x))=(a⋄b)⋄(x⋄b)=(a⋄b)=a do not hold. I think there is actual content in the first two cases I did (For example, it looks like the 12 element magma doesn't rule them out either), so this last case has a chance of being dealt with by something we know about?
Matthew Bolan (Oct 28 2024 at 21:07):
Maybe I'll break out the SAT solver on that after dinner, though I should probably stop obsessing over this and do the math my advisor actually wants me to do soon
Michael Bucko (Oct 28 2024 at 21:09):
(deleted)
Terence Tao (Oct 28 2024 at 21:13):
Obviously your "day job" comes first :). But this may well be an actual reduction of the problem. It feels like one is now adding a lot more axioms to constrain the magma and so we may be able to find counterexamples faster and for larger magmas than we did before. And it can't be harder than the original problem, since if your chain of equalities always hold for some a,b then by 854 we conclude that 413 holds as well.
Terence Tao (Oct 28 2024 at 21:37):
Pushing your final case analysis a bit further. You've already pinned down completely as , which is great. The remaining conditions in the scenario are that (you wrote but I think you meant ). The second condition can be unfolded as for some . Now we can split into cases:
- . Then by unique factorization (and because was already irreducible) we must have , , and . The scenario here is thus that there exists such that , , and . Conversely, if this scenario holds, then is such that , so that giving 413, so this is a scenario that we actually have to rule out.
- . So now , so the conditions are , , and . Conversely, if obey these conditions, then for some , and , hence and we again get 413. So again we need to rule out this scenario.
It's possible that we can find one finite magma that rules out scenario 1, and a separate finite magma that rules out scenario 2, and then by the magic of free magmas, we get to rule out 413 holding universally. We could keep splitting these scenarios into further and further subscenarios - if some of them start getting ruled out by finite examples then this approach has some potential, as long as somehow our ability to find finite counterexamples "outruns" the branching of the scenarios.
Terence Tao (Oct 28 2024 at 21:42):
If we can somehow nudge the case analysis towards scenarios where there are many more constraints than free variables then it raises the chance that e.g., the order 3 magma can be used to rule those scenarios apart. For instance, from the pigeonhole principle and the order 3 magma we can't have four words such that for all . In fact, the undirected version of the graph on must be 3-colorable!
Matthew Bolan (Oct 28 2024 at 21:54):
Perhaps I've done something wrong but it looks like the 12 element magma falsifies your condition 1 (x=1, y=0). I see you've just edited though.
Terence Tao (Oct 28 2024 at 21:55):
I'll check with my own Python code. This may take a while.
Terence Tao (Oct 28 2024 at 22:00):
One thing is that homomorphisms preserve the relation but not necessarily the relation.
Douglas McNeil (Oct 28 2024 at 22:36):
[ehh, foolishness!]
Terence Tao (Oct 28 2024 at 22:43):
OK, according to my python code, in the 12 element magma if one sets x=1, y=0, then a=2, and the only d that obeys , is d=2. Now, it is true that we have in the 12-element magma in this case, but unfortunately this doesn't guarantee that this happens back in the free magma ; the homomorphism doesn't work in both directions. Nevertheless, this magma is putting a little bit of a constraint on the possible values of in ; it has to map to under the homomorphism that sends to and to . This could possibly be useful later under some more case analysis. Also, the fact that 11 out of the 12 options of were ruled out by this one randomly selected magma gives some hope; maybe a tweaked version of this magma rules out all options.
Matthew Bolan (Oct 28 2024 at 22:49):
Ah, i definitely missed this fact about not being preserved by homomorphism in my original claim about the 12 element magma.
Terence Tao (Oct 28 2024 at 22:51):
Yeah, I didn't notice it either until we had this discussion. So it seems that we would greatly prefer to start seeing a lot of constraints in our scenarios than a lot of constraints, as we can use finite magma models (or any other model we happen to get our hands on) to then create contradictions. On the other hand, constraints are good for other reasons - they guarantee irreducibility of certain products, allowing one to use unique factorization.
Douglas McNeil (Oct 28 2024 at 22:53):
Let me try again (my previous mistake was that I only looked at violating the adjacent equalities in x⋄(x⋄(y⋄x))=(a⋄b)⋄(x⋄b)=(a⋄b)=a, which was a lot easier than ensuring all the pairs were distinct). What properties should magmas have (in a ⋄ sense, not a → sense) which would help at this point?
Terence Tao (Oct 28 2024 at 22:55):
I guess that, given that directly finding a 413 counterexample is hard, we can assume that 413 holds (and similarly for 1055/1045) to try to speed up the ATP. Possibly we could even assume that 10 holds, though one might first want to ask an ATP if 10+854 is strong enough to rule out any of the scenarios we are proposing.
Terence Tao (Oct 28 2024 at 23:18):
To be a bit more precise, consider the following two properties that an 854 magma may or may not satisfy.
- Scenario_1: For every there exists such that and , where .
- Scenario_2: For every there exists such that , , and , where .
(So I'm explicitly dropping the condition from Scenario_1, as we don't have it for anything except the free magma .) By the previous discussion, either of these properties implies 413, and if we show that each of these properties fail for at least one magma, then we are done. So we have split the original problem into two easier (or at least not harder) problems.
Unfortunately the 12-element magma doesn't obey Scenario_1 (the best choices for a counterexample are when x=1, but even then there is always one choice of d that blocks you), and the situation is similarly blocked for Scenario_2 (again x=1 is the best option, but I always get 7 options for b,d). Maybe other magmas fare better, though.
Terence Tao (Oct 28 2024 at 23:21):
I guess we want to try magmas where the relation is relatively sparse?
Matthew Bolan (Oct 29 2024 at 01:50):
I tried all magmas with <= 4 elements in both my original condition and both your scenarios. No luck.
I notice that the 12 element magma satisfies a few laws that 854 is known not to imply - are there more interesting finite magmas falsifying these laws I should try?
Pace Nielsen (Oct 29 2024 at 02:31):
Continuing the case analysis of @Matthew Bolan it seems like we would take . Note that 413 is equivalent to saying that . Thus, with that choice,
Pace Nielsen (Oct 29 2024 at 02:35):
Or, at least, that is a major case we would need to rule out.
Matthew Bolan (Oct 29 2024 at 02:41):
So you're using 413 in the first equality to replace with ?
Matthew Bolan (Oct 29 2024 at 02:43):
That probably destroys any hope of this condition being progress then.
Terence Tao (Oct 29 2024 at 02:53):
Hmm, the same analysis shows that "Scenario 1" is also implied by 413 (take , ). So, yes, unfortunately we have not actually gained any ground with this approach yet :sad:
Pace Nielsen (Oct 29 2024 at 03:54):
I believe the same analysis also applies to the other two open implications from 854.
Matthew Bolan (Oct 29 2024 at 04:33):
Yeah, even though I did nothing, the remaining laws still come down to understanding whatever is going on with the equivalence class of , and we at least know all short words in this class, so it still seems promising to try analysis of this type. I think the reason why I did nothing is that it is somehow not very interesting to do rewrites only in subwords of your equation you already know to be irreducible. A proof of 413 ( x=x◇(x◇(x◇(y◇x))))) from 854 eventually needs to do a rewrite that gets rid of the leftmost multiplication on the righthand side, before then all you have done is replace the irreducible word x and the irreducible word (x◇(x◇(y◇x)))) by things equivalent to them.
Terence Tao (Oct 29 2024 at 04:35):
Here is a small observation that lets us create some more 854 magmas. Given an 854 magma with some marked elements , one has a relation on it by declaring if . This then lifts back to a relation on the free magma generated by and , using the homomorphism that maps to respectively. One can then say that a word in is -reduced if it is a single variable or of the form where , and completely -reduced if every subword of is -reduced. Then the set of completely -reduced words is again an 854 magma, where the operation is defined to equal if , and otherwise. Informally, is the freest 854 magma one can make that has the "same" graph as .
This magma is intermediate between the free magma and the original 854 magma , as there are canonical homomorphisms from to , and from to . It is typically infinite even when is finite, and has a slightly better chance of disproving laws than the original magma . On the one hand, it has a more explicit description than (because we can compute , whereas the corresponding relation on is only partly understood), so one can check various identities to be true here (in particular, I think this gives a more explicit counterexample for the two anti-implications we recently proved, taking to be the product of all the finite magmas used as ingredients in the proof). On the other hand, I don't think it will disprove say 413 if the original magma did not disprove 413. But maybe one can tweak the relation to get more play here.
Example: take to be the order 3 magma and mark . Then , hence , and so . On the other hand , hence , so . But , so . In particular, as one repeatedly squares , one just gets longer and longer words, so in particular is infinite even though is finite. (Squaring three times will get you back to in , but squaring three times instead gives a rather long word.)
Terence Tao (Oct 29 2024 at 04:52):
Perhaps a toy problem is to find a new proof of 854 !=> 10 that doesn't go through a large explicit magma like the order 12 example (or the polynomial example that we actually use in the Lean codebase to disprove 10).
Terence Tao (Oct 29 2024 at 05:04):
Just abstracting an observation implicitly made by @Pace Nielsen : In any 854 magma,and any x, a
, the following are equivalent:
- for some
- for some
- for some .
- .
Proof: Equivalence of 1 and 2, or 5 and 6, is by definition. 3 trivially implies 2, and the converse comes from Law 378. Since , 5 implies 4. If 4 is true, then , giving 7. Law 854 shows that 7 implies 2. If 2 is true, then , giving 6. Finally, to show the equivalence of 1 and 8, use the already established equivalence of 1 and 5, together with law 378 which gives .
Matthew Bolan (Oct 29 2024 at 05:15):
I notice that we can say a little bit about how many rewrites two equivalent words are away from eachother. My earlier argument was saying something about the form some of the rewrites in a proof of 413 need to take, and Pace's obstruction to my argument was a direct application of the rewrite I am already trying to do, so maybe there's some sort of descent argument here? If the shortest possible sequence of rewrites actually needs to turn a proper subword into then we're done.
Terence Tao (Oct 29 2024 at 16:09):
Right, if we had some way to measure the "complexity" of an edge, and could show that any proof of a target edge required first locating an edge of equal or "higher" complexity, and similarly for any edge of higher complexity than the target edge, then we are done.
Matthew Bolan (Oct 29 2024 at 16:57):
Is it possible to prove equation 8 ( ) in just 2 rewrites? Toying around with things it feels like (I have not proven anything formal yet, so please take this with a grain of salt!) to prove for a totally irreducible word, I always pass through a proof that there exists some auxiliary word such that and , which would mean proving these latter two edges takes fewer rewrites than proving the edge we initially cared about, so in particular . In law , the role of can be played by , which allows both of my auxilliary edges to be resolved in a single rewrite. Ah, this last statement was false for , this still takes two rewrites
Matthew Bolan (Oct 29 2024 at 17:01):
Hmm, something is wrong, maybe I've forgotten a condition on Z. Fixed the thing that concerned me, though now there's more questions about whether such a statement holds for law 8.
Terence Tao (Oct 29 2024 at 17:01):
Hmm, perhaps out of all the words with , we can try to order them by the number of previous results of the form are needed to prove them? In particular, what words can we prove such a thing without using previous results? Clearly a word of the form would work; what else is there?
Matthew Bolan (Oct 29 2024 at 17:17):
Nothing right? If you forbid yourself such rewrites, you forbid yourself to change the righthand side to anything. To turn the lefthand side into just will then take such a rewrite. Or did you mean rewrites which shorten the word or something?
Terence Tao (Oct 29 2024 at 17:49):
I guess I was thinking of rewrites that specifically involve the original term . Maybe that isn't the right metric though. Consider the 413 task of trying to see if we can prove . We know that the equivalence is false for two generators . So if is to be provable by rewrites, at least one of the rewrites at some point must be able to look inside the term and use its specific structure somehow. But if we can show that every rewrite applied to either leaves the term untouched, or else replaces with an equivalent term, then we know that it can't ever reach because the same argument would then also give the false statement . If there was some metric on "rewrite distance" that could detect this sort of thing then we could be in business; I had thought "number of uses of a type law" might be one such metric, but on further reflection this probably isn't the right one to use.
Matthew Bolan (Oct 29 2024 at 19:47):
Does vampire think anything of the form is true?
Matthew Bolan (Oct 29 2024 at 19:59):
Also, what are the known proofs law 10 fails? Is it only the 12 element magma?
Matthew Bolan (Oct 29 2024 at 19:59):
Probably I should be trying to do this descent stuff for 10 before attempting 413.
Michael Bucko (Oct 29 2024 at 21:18):
This operation table satisfies 854, and satisfies 413. Changing (7,7) to 6 does not satisfy not 413 anymore (unfortunately, 845 gets refuted too).
1 0 0 0 0 0 0 0
1 1 1 1 1 1 1 1
2 2 1 2 2 2 2 2
3 3 3 3 3 3 3 3
4 4 4 4 3 4 4 4
5 5 5 5 5 1 5 5
6 6 6 6 6 6 6 6
7 7 7 7 7 7 7 3
Matthew Bolan (Oct 29 2024 at 21:21):
I get that this satisfies quite a few laws, including law 10. Am I misunderstanding? I notice that both laws you list begin with 413, so perhaps you have an active filter on finite magma explorer.
Michael Bucko (Oct 29 2024 at 21:24):
Matthew Bolan schrieb:
I get that this satisfies quite a few laws, including law 10. Am I misunderstanding? I notice that both laws you list begin with 413, so perhaps you have an active filter on finite magma explorer.
You're right. I was filtering for 413. Removing "only" in the prev post.
Michael Bucko (Oct 29 2024 at 21:26):
Also, reframed to "Changing (7,7) to 6 does not satisfy not 413 anymore (unfortunately, 845 gets refuted too)."
Btw. we should be able to filter using multiple equations. That'd enable us to see the implication status in one go, without the need for double filtering.
Matthew Bolan (Oct 29 2024 at 21:31):
Terence Tao said:
Perhaps a toy problem is to find a new proof of 854 !=> 10 that doesn't go through a large explicit magma like the order 12 example (or the polynomial example that we actually use in the Lean codebase to disprove 10).
By the way, I did wind up programming something which can do computations in this sort of magma (at least for when the seed magma is finite). No idea what to do with it though.
Michael Bucko (Oct 29 2024 at 21:31):
The diagonal of the operation table can be quite freely modified to keep 845 satisfied. Are there any configs of the diagonal that give us higher chances of refuting 413?
Michael Bucko (Oct 29 2024 at 21:34):
(deleted)
Matthew Bolan (Oct 29 2024 at 21:35):
The law is 854, not 845
Terence Tao (Oct 29 2024 at 23:32):
The three laws we want to disprove can be expressed in notation as follows:
- 413:
- 1045:
- 1055: .
On the other hand, law 10 asserts that the relation is symmetric: iff .
With this in mind, it is also natural to consider the reversals of the above laws:
- 413*: , i.e., .
- 1045*: , i.e. .
- 1055*: , i.e. .
Do we know the status of these laws? For instance, does Vampire determine if they are consequences of 854? What about the various finite magmas that we have? Unfortunately these laws have just a few too many operations to be directly visible in our implication graph, but presumably the methods we have already could be used here.
EDIT: I see that 1045* and 1055* are special cases of 378. The situation is less clear for 413* though.
Matthew Bolan (Oct 29 2024 at 23:38):
413* is true for all 3 and 4 element 854 magmas, according to my code
Terence Tao (Oct 30 2024 at 01:41):
I believe I've managed to make a breakthrough on 854. The standard greedy algorithm does not seem to work: picking two elements for which is currently undefined and setting it equal to some new value can lead to an explosion of other relations being forced. However, I think that if one also imposes the new unique factorization axiom, then the explosion does not occur and one can in fact make the greedy algorithm work, and thus presumably close off all three remaining implications by a suitable choice of seed. I will write up the details now and report back soon.
Terence Tao (Oct 30 2024 at 04:27):
OK, it turns out that the greedy algorithm didn't close after all, even with the unique factorization property (and some additional axioms I had to add on). I now see a specific obstruction to greedy arguments, which is the following consequence of 854:
Lemma. If and , then .
Proof. By hypothesis, and for some , then
giving the claim.
Now suppose that we try to greedily adjoin to a partially defined magma, then . The problem is that if one then has a chain in the existing multiplication table such that for and , the above lemma then forces for all . Then for any with for some , one has so now for all such . And then for any of that form we have
thus equation 10 has to hold for a large number of existing elements. This is a quite "global" constraint that makes it very difficult to run a greedy argument, and unique factorization does not seem to be effective at ruling out this type of global condition.
The only way out of this I see is if one can prevent the hypothesis from occurring for anything other than which is forced by Equation 8, so as not to make the above Lemma cause a combinatorial explosion. The implication 413 that we are trying to disprove is already of this form, though, so ruling out this scenario is at least as hard as the original problem.
Matthew Bolan (Oct 30 2024 at 04:29):
I recognize that lemma, it showed up for me when I was trying to get some "rewrite metric" to work.
Terence Tao (Oct 30 2024 at 05:07):
Curiously, for the order 12 magma this is in fact an if and only if: the assertion "For all , implies " holds if and only if . Not sure if this is true in general.
Matthew Bolan (Oct 30 2024 at 05:07):
I think I have a proof that is true in general, let me check my notes
Terence Tao (Oct 30 2024 at 05:08):
That would mean the relation is transitive, which is a little weird
Matthew Bolan (Oct 30 2024 at 05:09):
very weird, so I'd bet on me having misremembered...
Terence Tao (Oct 30 2024 at 05:10):
It's also an if and only if for the order 3 magma
Matthew Bolan (Oct 30 2024 at 05:28):
I can't find the statement in that form in my notes. I was mostly working with the specific question of showing a proof of 413 would require. The statements I did write down seem to rest heavily on it being that particular case, so I cannot translate them easily, not that I found an analog of this one anyway. I was probably just remembering the direction you already showed. I'll check the other order 3 and 4 magmas (By the way I tabulated a list of them a while ago, IDK if others have larger lists or if people want me to share my list)
Matthew Bolan (Oct 30 2024 at 05:50):
Ok I had a stupid bug in there for a while, but It seems to be true for all 3 and 4 element magmas
Matthew Bolan (Oct 30 2024 at 05:56):
854magmas.txt
Here's the python I have been using to check against these small magmas.
Terence Tao (Oct 30 2024 at 05:58):
I now have a proof of the converse.
Converse Lemma. Suppose that are such that implies for all . Then .
Proof. Since (by 378), we have , thus . On the other hand, from Equation 101 one has . Thus , giving the claim.
So now we have the following odd consequences:
Corollary. The relation defined by (or equivalently, ) is a pre-order.
Corollary. For any , the set is upwardly closed with respect to this pre-order.
Corollary. if and only if .
So perhaps we need to study this pre-order. One obvious thing to do is to first try to understand the equivalence relation defined by , i.e. and , or equivalently . Possibly we may be able to "quotient out" by this equivalence and assume that this equivalence relation is equality (a similar quotienting did help us with identifying finite magmas for an unrelated equation).
Matthew Bolan (Oct 30 2024 at 06:38):
413 says , which seems strange to me, but I don't have a good grasp on this pre-order yet. In particular, are there elements not equal to in the same class as in ?
Michael Bucko (Oct 30 2024 at 10:41):
Wrote a script for 854N413.
It's modifying the 854 as long as 854 is satisfied or 413 is refuted until it reaches (854 satisfied, 413 refuted). I'd be happy to add more exploration paths / strategies to it.
I'm running it now (if you see any bugs, please let me know).
(@Shreyas Srinivas tagged -- because of the PR related question)
Step 21148:
Current Operation Table:
7 0 0 0 0 0 0 0
0 1 1 1 1 1 1 1
2 2 1 2 2 2 2 2
3 3 3 3 3 3 3 3
4 4 4 4 3 4 4 4
5 5 5 5 5 3 5 5
6 6 6 6 6 6 6 6
7 7 7 7 7 7 7 7
Equation 854 is not satisfied.
Equation 413 is refuted.
--------------------------------------------------
Step 21149:
Current Operation Table:
0 0 0 0 0 0 0 0
0 1 1 1 1 1 1 1
2 2 1 2 2 2 2 2
3 3 3 3 3 3 3 3
4 4 4 4 3 4 4 4
5 5 5 5 5 3 5 5
6 6 6 6 6 6 6 6
7 7 7 7 7 7 7 7
Equation 854 is not satisfied.
Equation 413 is refuted.
Douglas McNeil (Oct 30 2024 at 12:04):
I suppose I can mention what I've been up to. Lots of "they'd have been very clever if they worked" ideas failed. So I fell back on something which should give me something, basically by relying on the explosion of relations instead of trying to avoid them.
I loop over all the possible "types" of 413 violation (depending on how many elements they involve and in what pattern). For each one, I then exhaustively try to grow the magma in all consistent ways and find a contradiction. Sometimes for a good choice of direction this ends pretty quickly and would work for any order magma with that pattern, and we could formalize it without much difficulty I think to show any 413-violating 854 magma can't have a violation like that.
OTOH sometimes the most efficient end actually depends upon the size as you take advantage of the magma closing in on itself and fill in the x=y line. By the time a magma's >80% full or so the paths are pretty likely to tail off.
So the hope was that this would either (1) show that every initial point terminates, which I can actually do for what I already knew, small magmas <= 7, or (2) find a counterexample (no luck yet, naturally), or as a fallback (3) find a high-density almost consistent magma which would have minimal need for relaxation.
I think I'm going to implement some kind of adaptive strategy: there are magmas which close almost instantly if you pick the right cell and take hours if you take the wrong one, and I spend almost all of my time trying to resolve a relatively few patterns.
Michael Bucko (Oct 30 2024 at 12:39):
Douglas McNeil schrieb:
I loop over all the possible "types" of 413 violation (depending on how many elements they involve and in what pattern). For each one, I then exhaustively try to grow the magma in all consistent ways and find a contradiction. Sometimes for a good choice of direction this ends pretty quickly and would work for any order magma with that pattern, and we could formalize it without much difficulty I think to show any 413-violating 854 magma can't have a violation like that.
Do you maybe have a script for that? I'd love to have a look.
Terence Tao (Oct 30 2024 at 14:37):
Matthew Bolan said:
413 says , which seems strange to me, but I don't have a good grasp on this pre-order yet. In particular, are there elements not equal to in the same class as in ?
It may be a good idea to try to work out what this pre-order looks like for small words in , assuming conjecturally that any implication that cannot be proven quickly in Vampire is false. Given your observation about 413, the pre-order may somehow be tied to violations of the 10 axiom (though the order relation is different, we have if ). Perhaps if one temporarily assumes 10 as a working axiom, the pre-order simplifies somehow? (Certainly the directed graph becomes symmetric under 10: iff .)
Matthew Bolan (Oct 30 2024 at 14:39):
We have some other laws in equation explorer that can be interpreted as saying things about this pre-order by the way
Matthew Bolan (Oct 30 2024 at 14:40):
Mostly in the "does not imply" column
Matthew Bolan (Oct 30 2024 at 14:49):
I notice that if , we have and even without assuming law 10.
Terence Tao (Oct 30 2024 at 14:49):
Law 10, while false in general, seems to be true in many special cases.
Terence Tao (Oct 30 2024 at 14:54):
I guess it comes down to law 101, which is the special case of law 10 x = x(yx)
when y = xz
for some z
. I.e. we have iff . [Edit: 101 gives only one direction of this implication, but the proof of the lemma and converse lemma gives the other.]
Terence Tao (Oct 30 2024 at 15:51):
Using the order 3 model I see the relation is actually quite rare on the free magma ; the words have to be identical on every interpretation of the generators on the order 3 model in order for this relation to have a chance of being true. In particular the 413 law is one of the first non-trivial possibilities (the other is , which I guess we could also investigate. Incidentally I think the modification 413* I proposed earlier is actually equivalent to 413 by the above discussion.)
Matthew Bolan (Oct 30 2024 at 21:37):
Here's an easy observation about this pre-order that I didn't spot until just now for some reason: If in the free magma, then . Indeed, otherwise , which is false since we have models with no idempotents.
Matthew Bolan (Oct 30 2024 at 21:47):
Maybe that's a good way to try and avoid holding? Is there some reason is hard/impossible to cook up?
Pace Nielsen (Oct 30 2024 at 21:52):
There may also be some potential to the translation-invariant approach to 854. The functional equation we get is
The sort of "quotient" quantity is playing a major role here. The equation we want to avoid from 413 is .
Using Vampire, we find that the carrier group has to be noncommutative, but we can assume if we want to simplify powers. We also find some quick consequences, such as the following (where I set )
So, lots of things must map to 1. That last equality is very close to what we want to avoid (up to inverses).
Using Vampire some more, we see that we should be able to take with . So it seems like we may be able to greatly restrict the output of f, mostly to (except for ).
Matthew Bolan (Oct 30 2024 at 22:17):
We have a model with no idempotents, and a model where always, (for example any left absorbing magma), so maybe some "quotient" of Terry's construction is worth trying?
Terence Tao (Oct 30 2024 at 22:27):
@Pace Nielsen huh, so in contrast to the other equations, 854 is not known to be immune to a pure translation invariant approach? That's worth looking into. In particular the preorder on a translation invariant model should give the group the structure of a (pre)-ordered group, which is interesting. Also the relation is now a Cayley graph and determined by a single subset of .
Terence Tao (Oct 31 2024 at 02:40):
OK, this definitely needs checking (and formalization), but I think I managed to repair the greedy argument, equational#760 . To avoid the problem coming from the partial order, I simply imposed as an additional rule that the partial order is always trivial: . Amazingly, and after a lot of case analysis, it appears that the greedy algorithm barely closes with this additional axiom (as well as the unique factorization axiom, and some further axioms along for the ride including known consequences 8, 101, 46155, 378 of 854, a no idempotence axiom, and a "monotonicity" axiom that basically prevents a word from being a strict subword for itself (but without using the language of words)). Among other things, the triviality of the order immediately rules out 413, which is , simply by starting with a seed that fails the 10 law , which as one can imagine from all of our previous attempts to build 10-violating 854 magmas is quite feasible.
It is an extremely delicate analysis, with almost all hypotheses just barely closing with no "slack", and I suspect that the greedy magma being constructed here is in fact very close to the universal magma .
In principle 1045 and 1055 should now also fall by constructing a suitable seed for each, but I have not attempted this.
Matthew Bolan (Oct 31 2024 at 02:44):
Wow! I hope it checks out. I guess my job is to look for seeds for 1045 now.
Mario Carneiro (Oct 31 2024 at 03:05):
Does monotonicity refer to the ordering relation defined on 854 or the ordering relation on ?
Terence Tao (Oct 31 2024 at 03:14):
The ordering relation on Nat. Which is admittedly artificial but it worked (i needed to ensure that words like (xy)z could not equal x unless (xy)z=xy=x
Terence Tao (Oct 31 2024 at 03:24):
IMG_1223.jpeg in case anyone is interested, this is the blackboard i needed to stare at for fifteen minutes before i realized the argument could close if the partial order was trivial
Mario Carneiro (Oct 31 2024 at 03:25):
looks like fun, I will try my hand at formalizing it tomorrow
Matthew Bolan (Oct 31 2024 at 03:44):
Ok I think I have a search for seeds running. Is there an obvious obstruction to 3 element seeds?
Matthew Bolan (Oct 31 2024 at 04:16):
Is there perhaps a typo in the seed in the proof of 12.10? For my program says 46155 fails due to not being well defined, and eyeballing it I think I agree.
Matthew Bolan (Oct 31 2024 at 04:19):
The needed 413 violation also seems to simplify to to me, which violates no idempotents, so I'm guessing there's a typo in the table. EDIT: I see, you are claiming the equality fails here by the no idempotents requirement. I'll modify my check to assume this.
Matthew Bolan (Oct 31 2024 at 04:28):
It's also complaining about the auxiliary law with and
Terence Tao (Oct 31 2024 at 05:02):
I think the entry should be deleted. (I mistakenly added it as I thought it was a consequence of the other entries and Equation 101, but I confused and .)
The derivation here is: to get , I chose , , then set equal to , and equal to . Law 378 then forces and , but then I think one can stop.
Matthew Bolan (Oct 31 2024 at 05:06):
Ok, and that explains why my program didn't find any similar examples, I was checking the naive condition for 413 rather than the condition you are, and that was destroying it since the needed products weren't defined.
Matthew Bolan (Oct 31 2024 at 05:06):
My program agrees then, and finds seeds for at least 1055, assuming no bugs.
Matthew Bolan (Oct 31 2024 at 05:08):
[[1, 0, None], [1, None, None], [None, None, None]] is supposed to be a seed for 1055.
Terence Tao (Oct 31 2024 at 05:08):
It should also be possible to directly find a slightly larger seed that violates 413 directly. I just didn't have the patience to Sudoku enough of the multiplication table to find it.
Terence Tao (Oct 31 2024 at 05:09):
Huh, that is smaller than I expected. Let me check it by hand.
Matthew Bolan (Oct 31 2024 at 05:09):
well, definite bug, so one second.
Terence Tao (Oct 31 2024 at 05:11):
Yeah it has to be at least as big as my seed magma as a 1055 example has to contain a 10 violation
Matthew Bolan (Oct 31 2024 at 05:13):
yeah, forgot to check something was well defined when I rewrote 1045 to 1055.
Matthew Bolan (Oct 31 2024 at 05:13):
well, more accurately, I was still checking the thing from 1045 was well defined.
Matthew Bolan (Oct 31 2024 at 05:14):
ugh, back to it not finding small examples. Time to rewrite to be less naive about it, or alternately someone plays sudoku.
Matthew Bolan (Oct 31 2024 at 05:15):
Wait!
[[None, 0, 0, None], [3, None, None, None], [None, 2, None, None], [3, None, None, None]]
Terence Tao (Oct 31 2024 at 05:20):
It seems to obey the axioms, but I don't see the 1055 violation
Matthew Bolan (Oct 31 2024 at 05:21):
let me get it to print it, maybe another bug. My track record is not good tonight.
Terence Tao (Oct 31 2024 at 05:23):
Incidentally the greedy algorithm is itself pretty amenable to programming on a computer, and should start happily generating an infinite 854 magma a couple entries at a time, from a given seed
Matthew Bolan (Oct 31 2024 at 05:25):
Ok It thinks the violation is with (x,y,z) = (1,0,2)
Terence Tao (Oct 31 2024 at 05:28):
Huh, it checks out I think. I get x ◇ ((y ◇ (z ◇ x)) ◇ x) = 3
distinct from x=1
.
Matthew Bolan (Oct 31 2024 at 05:28):
On to 1045
Matthew Bolan (Oct 31 2024 at 05:29):
I agree with your computation by the way
Terence Tao (Oct 31 2024 at 05:30):
Your seed would also work for the 413 anti-implication. Basically anything that contains a 10 violation will work for that one.
Matthew Bolan (Oct 31 2024 at 05:31):
oh then the 1045 seed will be a 3-for-1 special then
Matthew Bolan (Oct 31 2024 at 08:19):
1 0 0 _ _
1 _ 1 _ 1
_ 4 3 2 _
_ _ 3 _ _
_ 4 _ _ _
Matthew Bolan (Oct 31 2024 at 08:19):
Should be anti 1045
Matthew Bolan (Oct 31 2024 at 08:21):
But I have been very sloppy tonight, and am quite tired. In fact, I no longer trust that the previous 1055 example was correct for law 854 itself, I was checking that the whole term was well defined instead of just the righthand factor.
Matthew Bolan (Oct 31 2024 at 08:23):
On the bright side, to find that thing I had to implement the greedy algorithm itself, and it seems to work. It is very fun to tell it to grow a seed to size 100.
Matthew Bolan (Oct 31 2024 at 08:27):
The table
2 0 0 _
_ _ 3 _
2 2 _ 2
_ _ 3 _
also supposedly works for 1045, with the counterexample at 1 0, and it is smaller. This example is actually very annoying to see, as 3 hours ago I had a computer crash right before the search for tables with 8 entries was to begin. I could have saved myself a lot of time.
Matthew Bolan (Oct 31 2024 at 08:31):
Matthew Bolan said:
On the bright side, to find that thing I had to implement the greedy algorithm itself, and it seems to work. It is very fun to tell it to grow a seed to size 100.
Just for fun, here is a 10x10 anti-1045 seed:
1 0 0 0 _ _ _ _ _ _
1 3 1 1 _ 6 _ 1 _ _
_ 7 5 4 _ 2 2 8 _ _
3 3 _ _ 3 _ _ _ _ 3
_ _ _ 4 _ _ _ _ _ _
_ _ 5 9 _ _ 5 _ _ _
_ _ 6 _ _ 6 _ _ _ _
_ 7 _ _ _ _ _ _ 7 _
_ _ _ _ _ _ _ 8 _ _
_ _ _ 9 _ _ _ _ _ _
Jose Brox (Oct 31 2024 at 10:56):
Terence Tao ha dicho:
It is an extremely delicate analysis, with almost all hypotheses just barely closing with no "slack", and I suspect that the greedy magma being constructed here is in fact very close to the universal magma M854.
Hi again! I've been quite ill the last few days and had to disconnect. The idea I was working on had several points in common, as unique factorization and monoticity: I was trying to get a model over Nat by using exponentials in the product, e.g.
and imposing 101, 378, etc. (so e.g. ) and some other experimental tweaks and more complicated versions. I didn't get to prove the model closed. Do you think it could be valid/useful?
Mario Carneiro (Oct 31 2024 at 12:45):
@Terence Tao Not a bug in the proof exactly, but the case seems misleadingly presented:
- Equation 8: As this equation is of a left-absorbent form, the only addition to the table that could cause a violation is , that is to say the only case that needs to be ruled out is . But then would already have been defined by Equation 8, contradiction.
The proof is going by cases on , so the new case is not but rather . In this case it is not a contradiction, but rather the goal holds because of the clause .
Terence Tao (Oct 31 2024 at 14:46):
@Mario Carneiro Thanks for the catch! I have updated the blueprint accordingly (and also added @Michael Bucko 's 1045 seed; I have to travel today, but I will check it by computer on the plane).
@Jose Brox This is a very nice idea to revive the Kisielewicz type construction! Basically you have selected an ansatz in which if and otherwise, and the only difficulty is to figure out how to select the relatino . This finally resolves an issue someone (I think @Daniel Weber asked some time ago, which is whether the directed graph contained enough information to determine the magma. This wasn't literally true, but with your ansatz it becomes true, and the problem is now "just" a graph theory problem of designing a directed graph obeying a certain complicated set of axioms. My construction is highly recursive but with this perspective you might be able to find a simpler and more explicit rule. It's nice that the unique factorization and monotonicity axioms already come "for free" in this approach. More generally, I like the idea of using this type of ansatz to attack the other open implications like 1323.
Matthew Bolan (Oct 31 2024 at 14:49):
Matthew Bolan said:
The table
2 0 0 _
_ _ 3 _
2 2 _ 2
_ _ 3 _
also supposedly works for 1045, with the counterexample at 1 0, and it is smaller. This example is actually very annoying to see, as 3 hours ago I had a computer crash right before the search for tables with 8 entries was to begin. I could have saved myself a lot of time.
If it helps to verify, this table was grown in 1 step from the table
2 0 0
_ _ _
2 2 _
Terence Tao (Oct 31 2024 at 14:53):
Reminds me of the many hours I lost to Minesweeper as a student...
It's possible that by starting with the empty seed, one will by chance generate a 10 counterexample, and then with a few more iterations, get a 413 and 1045 counterexample as well, though it may depend on the order in which one takes the greedy construction. Though if @Jose Brox 's construction works out, that may be the more elegant construction in the end.
Matthew Bolan (Oct 31 2024 at 14:54):
I generated quite a few starting from the empty seed and got nothing. I had to start with 3x3 seeds to get results.
Matthew Bolan (Oct 31 2024 at 14:55):
but once you do that and grow for a few steps, counterexamples seem common.
Terence Tao (Oct 31 2024 at 14:55):
OK. There could be an "abiogenesis" problem; we've seen in practice that it's hard to generate 10 counterexamples (which, in the abiogenesis analogy, would be like "amino acids") from scratch, but once one has them, they seem to be useful for generating more complex counterexamples as well.
Douglas McNeil (Oct 31 2024 at 14:59):
In the case of 413, there are about ~160 distinct minimal seeds. About 1/2 of those can be ruled out as generating 854 compatible magmas almost immediately. Another 1/4 can with a fair chunk of work and correct choice of next coordinate to inspect can be shown to fail if you exhaust every possibility. The remaining ones.. are a pain, and while I was hoping they'd all close, I guess one of them must not if the greedy algorithm works! :sweat_smile:
Terence Tao (Oct 31 2024 at 15:07):
In retrospect, the reason why the strategy of finite model extension was not working so well was that the initial finite magmas already had a non-trivial partial order relation, which was creating a lot of constraints to extending the magma consistently (and I think, over time, the partial order became so "dense" that the extension became "boring", with no real choices in how to place the new element in relation to previous rows). Plus, when extending element at a time, there were pigeonhole principle issues that forced one to abandon unique factorization and reuse some numbers as products, which also created additional unwanted constraints.
Terence Tao (Oct 31 2024 at 15:08):
It might be nice to generate an explicit 413 seed, just for reassurance (even if we get the Lean guarantee, it would be nice to have a human-confirmable verification as well). I think all one has to do is extend a 10-violating seed a couple times and it will automatically generate a 413 violation due to the auxiliary law, but I don't have an explicit partial multiplication table exhibiting this.
Matthew Bolan (Oct 31 2024 at 15:09):
Do I misunderstand something? I thought we already had one of those
Matthew Bolan (Oct 31 2024 at 15:09):
Or do you want a full table
Terence Tao (Oct 31 2024 at 15:10):
We have a small table with a 10 violation, and a theoretical argument that any 10-violating seed can be extended to a 413-violating seed, but I don't know if any of the seeds already listed above already violate 413 (it's possible that some of the larger ones already do this, I haven't checked).
Matthew Bolan (Oct 31 2024 at 15:11):
Oh, I can easily give you one, just a moment.
Terence Tao (Oct 31 2024 at 15:12):
This is mostly just a sanity check to make sure theory and numerics are aligned. Of course once we have Lean formalization we have all the reassurance we need.
Matthew Bolan (Oct 31 2024 at 15:17):
Yeah, that makes sense. It is quite promising to me though that my program still thinks the tables your algorithm grows obey the needed requirements.
It says
3 2 4 0 5 _
1 _ 1 _ _ _
_ 2 _ _ 2 _
3 _ _ _ _ _
_ _ 4 _ _ 4
_ _ _ _ 5 _
works, with the counterexample at (x,y)=(0,1).
Matthew Bolan (Oct 31 2024 at 15:18):
let me have it also say how it grew that
Matthew Bolan (Oct 31 2024 at 15:20):
It started from
_ 2 _
1 _ 1
_ 2 _
for that one, then added 3,4,5 in that order.
Terence Tao (Oct 31 2024 at 15:29):
Wow, that is a bit bigger than I thought. Glad I didn't attempt the human Sudoku problem and just stopped when I reached a 10 violation.
Given how several new relations are generated at each step, it is something close to a miracle that the construction doesn't ever create any collisions (once one imposes the unique factorization, monotonicity, non-idempotence and auxiliary laws, each of which was not "forced" by the greedy construction, but became a tactical choice because the argument was simply not closing at all without them). It is still somewhat mysterious to me exactly what this beast of an 854 law is - it clearly has a lot of structure on one hand, but also seems very "free" on the other. Certainly this is the sort of "mathematics from Mars" I was hoping this project would uncover!
Matthew Bolan (Oct 31 2024 at 15:31):
Yeah! I'm honestly sad you resolved it so quickly, I've become quite interested by 854 magmas.
Terence Tao (Oct 31 2024 at 15:41):
Well, the outstanding implications are likely resolved, and we have a lot of ways to generate 854 models now, but we still don't fully understand the law. For instance we don't quite have a description of , though I think we are pretty close to doing so now.
Mario Carneiro (Oct 31 2024 at 15:45):
I think I found a hole in the proof. It's in case 5 of the proof of 854. The scenario is that , , , and is relevant.
Mario Carneiro (Oct 31 2024 at 15:52):
actually nevermind, I'm back on track, I just got thrown off by an early sentence
Terence Tao (Oct 31 2024 at 15:53):
Feel free to update the blueprint if you spot any poor wording
Mario Carneiro (Oct 31 2024 at 15:53):
I did have to use monotonicity though in a way that wasn't obvious from the text
Mario Carneiro (Oct 31 2024 at 15:54):
on the whole the case split is structured quite differently for me so I have to do a lot of translation
Terence Tao (Oct 31 2024 at 15:54):
I'm sure my splitting is not the optimal one, it was the first one that worked and I didn't try to polish it
Mario Carneiro (Oct 31 2024 at 15:54):
np, that's my job :)
Terence Tao (Oct 31 2024 at 15:55):
It's probably just outside of the range of aesop
to resolve each individual case, I assume?
Mario Carneiro (Oct 31 2024 at 15:56):
eh, maybe? Each case is actually tiny, but there are more of them than the text implies
Mario Carneiro (Oct 31 2024 at 15:57):
I'm feeling pretty good about the proof structure so far
Terence Tao (Oct 31 2024 at 15:58):
OK. I'm looking forward to seeing the final proof. The proof of your previous 854 argument was quite elegant, in particular the proof of unique factorization for the free magma was a lot less painful than I expected.
Terence Tao (Oct 31 2024 at 15:59):
I imagine this one will be... less elegant, though.
Mario Carneiro (Oct 31 2024 at 15:59):
I think it's still pretty slick, but it is definitely larger
Mario Carneiro (Oct 31 2024 at 16:02):
In case 6, where did b1 and b2 come from?
Mario Carneiro (Oct 31 2024 at 16:02):
I don't think the context gives us anything to tell us that b factors
Terence Tao (Oct 31 2024 at 16:02):
x * z = b and b = b1 * b2
Terence Tao (Oct 31 2024 at 16:03):
oh. let me see
Terence Tao (Oct 31 2024 at 16:03):
we are given that and , so factors
Mario Carneiro (Oct 31 2024 at 16:04):
but why?
Mario Carneiro (Oct 31 2024 at 16:04):
Is this one of our IH's?
Terence Tao (Oct 31 2024 at 16:04):
Yeah , so
Mario Carneiro (Oct 31 2024 at 16:05):
I mean what prevents the situation from being such that has no factorization, even though
Terence Tao (Oct 31 2024 at 16:05):
is equal to and manifestly factors into and
Mario Carneiro (Oct 31 2024 at 16:07):
although in that case it's not clear what unique factorization is asserting here since we just define b1 := x and b2 := z
Terence Tao (Oct 31 2024 at 16:09):
well ok it could be a notational quirk but I am implicitly using the notation , under the assumption that they are uniquely defined which is what unique factorization guarantees. One could arrange the proof differently, e.g., to define for _all_ relevant , and only use unique factorization later when it becomes important to know that is unique (not sure actually where that is used but I think it shows up somewhere).
Mario Carneiro (Oct 31 2024 at 16:10):
I gathered that, I was just confused because we only have uniqueness and not existence in this setting
Mario Carneiro (Oct 31 2024 at 16:10):
One could arrange the proof differently, e.g., to define b1⋄c=b1 for _all_ relevant b1, and only use unique factorization later when it becomes important to know that b1 is unique (not sure actually where that is used but I think it shows up somewhere).
Oh, I actually guess I did that
Terence Tao (Oct 31 2024 at 16:10):
Usually yes but in this particular case we happen to possess an explicit factorization of so we can guarantee existence in this case only.
Terence Tao (Oct 31 2024 at 16:11):
Then I guess you don't actually need unique factorization in Case 6. But I suspect it may show up elsewhere. Or perhaps as with your 1485 formalization, maybe it never shows up at all and then we can simplify the proof.
Mario Carneiro (Oct 31 2024 at 16:11):
oh yes, it definitely has
Michael Bucko (Oct 31 2024 at 16:39):
Since 854 is really reluctant when it comes to collapsing to simpler forms, and always dreams up those non-colliding features / data points, perhaps we should have a deeper look into 1043 (and 433) too.
2,4 → 1043 → 433 → 854
2,4 → 1043 → 854
2,4 → 854
Terence Tao (Oct 31 2024 at 16:47):
The family tree of inequivalent laws implied by 1043 (which includes 854 and all of its descendants, as well as the important law 10) is really quite impressive.
Mario Carneiro (Oct 31 2024 at 16:56):
It's done!
Mario Carneiro (Oct 31 2024 at 17:00):
Matthew Bolan (Oct 31 2024 at 17:07):
Are the seeds confirmed?
Michael Bucko (Oct 31 2024 at 17:12):
The fact that 1043[x = x ◇ ((y ◇ (x ◇ z)) ◇ z)]
can imply 433 [x = x ◇ (y ◇ (x ◇ (z ◇ y)))]
which can then imply 854 [x = x ◇ ((y ◇ z) ◇ (x ◇ z))]
feels more intuitive (which feels wrong) at first than 1043 [x = x ◇ ((y ◇ (x ◇ z)) ◇ z)]
being able to directly reach 854 [x = x ◇ ((y ◇ z) ◇ (x ◇ z))]
.
Perhaps we should look into property mapping, i.e. there could be a list of properties within the equations (in the implication chain) that allow for certain (aggressive, immediate, direct) simplifications.
Perhaps we could use property mapping in order to learn simplifications . (I hope to touch some math from Mars!!) One thing that comes to mind is to outsource simplification learning to diffusion models.
Mario Carneiro (Oct 31 2024 at 17:23):
Matthew Bolan said:
Are the seeds confirmed?
not yet
Mario Carneiro (Oct 31 2024 at 17:33):
Which are the seeds we want to verify, and what laws do they counter?
Matthew Bolan (Oct 31 2024 at 17:33):
They are both in the blueprint already. One counters 413, the other 1045.
Matthew Bolan (Oct 31 2024 at 17:34):
And then the 1045 will also counter 1055 implicitly
Matthew Bolan (Oct 31 2024 at 17:35):
Corollaries 12.10 and 12.11.
Terence Tao (Oct 31 2024 at 18:35):
The seed for 1045 can also be used to refute 413 if you want to economize on seeds
Terence Tao (Oct 31 2024 at 18:36):
I like how inductive types make the case analysis somewhat more manageable
Mario Carneiro (Oct 31 2024 at 18:37):
yeah when I first saw you added 5 elements in the inductive step I knew the case splits would be legendary
Mario Carneiro (Oct 31 2024 at 18:38):
luckily they break up into logical groups and this is reflected in the proof (a lot of the time things simplify for left-absorbent laws)
Terence Tao (Oct 31 2024 at 19:00):
This is certainly one case where formalization is essential. I don’t think even I would fully trust my own argument otherwise, it is so fragile
Mario Carneiro (Oct 31 2024 at 19:02):
I'm failing to validate the seed for 1045. Properties eq854, eq101, eq46155, eq378, uniq_fac, mono
fail
Matthew Bolan (Oct 31 2024 at 19:02):
That's surprising
Matthew Bolan (Oct 31 2024 at 19:03):
2 0 0 _
_ _ 3 _
2 2 _ 2
_ _ 3 _
Matthew Bolan (Oct 31 2024 at 19:03):
This was the intended table, can you confirm that's what you have?
Mario Carneiro (Oct 31 2024 at 19:03):
oh mono is obviously broken for
Mario Carneiro (Oct 31 2024 at 19:03):
my list is
[((0,0),2), ((0,1),0), ((0,2),0), ((1,2),3), ((2,0),2), ((2,1),2), ((2,3),2), ((3,2),2)]
Matthew Bolan (Oct 31 2024 at 19:04):
Looks like a typo. Try ((3,2),3)
Mario Carneiro (Oct 31 2024 at 19:04):
yep that did it
Mario Carneiro (Oct 31 2024 at 19:04):
I'll fix the blueprint
Mario Carneiro (Oct 31 2024 at 19:05):
where does it violate 413?
Matthew Bolan (Oct 31 2024 at 19:05):
Let me check.
Matthew Bolan (Oct 31 2024 at 19:13):
(deleted, bug)
Matthew Bolan (Oct 31 2024 at 19:16):
Yeah it's at . Law 413 is the same as saying , which should fail for
Matthew Bolan (Oct 31 2024 at 19:19):
Something still seems not quite right. Fixed.
Matthew Bolan (Oct 31 2024 at 19:22):
(x,y) = (1,2), sorry. I was looking at the wrong table.
Matthew Bolan (Oct 31 2024 at 19:23):
now , and by the auxilliary law we cannot have
Mario Carneiro (Oct 31 2024 at 19:32):
@[equational_result]
theorem not_413_1045 : ∃ (G : Type) (_ : Magma G), Facts G [854] [413, 1045] := ...
Terence Tao (Oct 31 2024 at 19:42):
I was just going to say that I had verified the seed in Python, but this is clearly moot now :grinning_face_with_smiling_eyes:
Terence Tao (Oct 31 2024 at 19:46):
Glad to see you could verify the seed by decide, it would have been tedious otherwise
Mario Carneiro (Oct 31 2024 at 19:47):
yes, this is the next step up the industrial strength proof scale past the one for 1485
Mario Carneiro (Oct 31 2024 at 19:49):
There are more steps past this; I'm glad you didn't give me a 12x12 seed to check
Michael Bucko (Oct 31 2024 at 20:59):
(deleted)
Terence Tao (Nov 02 2024 at 15:24):
It is perhaps of less interest now that 854 is fully resolved, but I did actually manage to get a fully explicit description of the free magma . It is the set of words (on some fixed set of generators) generated by an operation which is equal to the free operation unless a certain relation holds, in which case it is equal to . The relation can be defined recursively as follows, where we use to denote the left and right components of a word (so ), to denote the left and right components of , etc..
Definition of directed graph: holds iff one of the following statements is true:
- .
- and .
- , , and .
- , and there exists such that and .
- , and one of the claims , , or holds.
Here we adopt the convention that a statement can only hold if all the terms needed to verify it are well-defined. For instance for part 2 to hold, must be well-defined. In part 4, will need to be defined if one is selecting to equal , though if one selected to equal say instead then there is no such requirement.
One interesting feature of this relation is that it forces to be somewhat "close" to each other, in that one is a depth one or depth two subword of the other. Specifically, can only hold if one of holds. (This property is also preserved by the 854 greedy construction, and is what led me eventually to the above characterization.)
The verification that this actually obeys 854 and is universal is a lengthy case check, comparable to that for the 854 greedy construction. I put it in the blueprint at equational#779, but I don't consider it a priority for formalization given that 854 is now fully resolved by other means.
Amir Livne Bar-on (Nov 02 2024 at 20:48):
A thought about the "math from Mars" angle: equation 543 has an operation definable from the magma operation that obeys a very symmetrical set of laws, which are together equivalent to that one equation.
Did anyone try to search for something similar for 854? That is, whether there's a definable operation (or multi-valued operation? or several operations?) that is equivalent to law 854 when symmetrical or otherwise pretty equations hold for the new operation?
Terence Tao (Nov 02 2024 at 20:53):
Well, we have identified the directed graph relation and the pre-ordering on 854 magmas which, when combined with the original magma operation , do obey a large number of interesting relations. I haven't looked into the extent to which one can try to reduce the axioms to hide the original law 854 from view and instead have a number of simpler looking laws. Given the complex nature of the free 854 group, I'd say there is a limit to how much one can clean up the algebra, but I can certainly believe one can replace one weird-looking law 854 by a small number of nicer looking laws (in particular, the fact that obeys the standard pre-ordering axioms may beautify the axioms a little bit, or at least make them slightly less Martian in nature. On the other hand, this pre-order is trivial on the free group , so there's a limit to how much it can help on its own.).
Alex Meiburg (Nov 03 2024 at 17:07):
Terence Tao said:
It is perhaps of less interest now that 854 is fully resolved, but I did actually manage to get a fully explicit description of the free magma .
I can't be alone in thinking that finding explicit descriptions of free magmas for different equations is of independent interest. ("Explicit" in the sense of "a computable algorithm for testing equality of words".) Both because I think they provide another good perspective on what the equation is really doing, but also they make proving any future non-implications pretty much maximally easy.
Alex Meiburg (Nov 03 2024 at 17:20):
Actually, personally I think that this project (for each law L, write down a def
for a Magma that is provably isomorphic to the FreeMagmaWithLaws L
, such that this def
is computable) would be a great Future direction
Kevin Buzzard (Nov 04 2024 at 05:28):
There are a lot of laws though!
Eric Taucher (Nov 04 2024 at 09:45):
Alex Meiburg said:
for each law L, write down a
def
for a Magma that is provably isomorphic to theFreeMagmaWithLaws L
, such that thisdef
is computable
Could you provide an example of this? It sounds like one of the side tasks I'm working on while learning the math involved. Additionally, since I'm not a mathematician and am working on creating source code for this, could you explain it more as an algorithm?
Alex Meiburg (Nov 04 2024 at 13:13):
Kevin Buzzard said:
There are a lot of laws though!
Haha yes! It would definitely be a slow project, with probably close to zero help from automation - except maybe to show certain operations close from a SAT solver for the really hard ones
Amir Livne Bar-on (Nov 04 2024 at 13:16):
There are dozens of confluent laws so these are already solved automatically
Alex Meiburg (Nov 04 2024 at 13:17):
Oh that's true! I hadn't thought about that
Alex Meiburg (Nov 04 2024 at 13:23):
Eric Taucher said:
Could you provide an example of this? It sounds like one of the side tasks I'm working on while learning the math involved.
So Eq4512, the associative law, can be described as List x
, where x is the generators. This is the free associative magma on x. There is an existing definition in the repository of the free associative magma, but it's noncomputable: two expressions are equal _if there exists some series of expressions related by the associative law_ that lets you turn one into the other. There isn't an algorithm immediately to check if such a series exists, so this is a non-constructive version of the free associative magma.
But, by showing that this is isomorphic to List x
, we can make a constructive and computable version of it.
The FreeMagmaWithLaws gives this non-constructive definition for each law. It would be nice to find a constructive definition of as many laws as possible. List is a particularly easy one. Terence Tao's message from Nov 2 gives a constructive definition of the free Eq854 magma, which is obviously quite a bit more complicated.
As one other example: Eq543 defines an abelian group structure, and the free abelian group is DFinsupp x Int
. The magma operation is then subtraction of these DFinsupp's.
Eric Taucher (Nov 04 2024 at 13:51):
@Alex Meiburg
Thanks. Still a bit over my head, the magma parts, but enough meat on the bone to chew on. :smile:
Last updated: May 02 2025 at 03:31 UTC