Zulip Chat Archive
Stream: Equational
Topic: 1516 -> 255
Terence Tao (Oct 23 2024 at 15:23):
This is a new thread to discuss the specific question of whether 1516 implies 255 , where we recursively define . This, and its dual, would resolve two open implications. It has been discussed extensively in a previous thread; I now summarize some progress here.
Immunities:
- (T) Implication is true for translation-invariant models (even noncommutative ones).
- (L) Implication is true for linear models (even noncommutative ones).
- (F) Implication is true for finite models.
- (C) Implication is true for commutative models.
- Implication is true if squaring is surjective.
- Implication is true if one has "flip-injectivity":
a ◇ b = c ◇ d
impliesb ◇ a = d ◇ c
.
Examples:
- The smallest non-trivial finite magma is of order 5. (In view of comments:) This is the law on . In particular in this case. This is also a translation-invariant, commutative model.
- (Added in view of comments below:) Another linear model is on . In this model, , , and , and then repeats with period 3.
Some known identities/properties for 1516 magmas:
- is surjective
- . That is to say, 255 is true for square numbers.
1516 does not seem to obviously imply
- for .
Pace Nielsen (Oct 23 2024 at 16:05):
Does 1516 imply x^4 = x^7, x^4 = x^10, or x^7 = x^10?
Daniel Weber (Oct 23 2024 at 16:09):
Some rules from the greedy ruleset:
- If and then
- If then
- If then
Michael Bucko (Oct 23 2024 at 18:26):
Do we have a tptp for this one?
Pace Nielsen (Oct 23 2024 at 21:04):
The smallest non-trivial magma mentioned above is linear. I thought it might be useful to describe all linear maps
satisfying 1516.
The carrier ring must be commutative, and a reduction system is given by
Not the simplest system in the world.
The 5-element magma corresponds to the ring taking .
Terence Tao (Oct 23 2024 at 21:07):
The model on is idempotent, . Is there a slightly larger model which isn't idempotent? I don't see listed among your laws, so I suspect the answer is "yes".
Pace Nielsen (Oct 23 2024 at 21:23):
Running a search in for small primes, there are lots of examples. The smallest where the idempotent law fails is when .
Terence Tao (Oct 23 2024 at 21:26):
That law on might serve as a good test model as a first approximation of what rules might and might not be consequences of 1516. For instance here we have , , , and just repeats periodically mod 3. Which is of course consistent with your request to investigate for various .
Pace Nielsen (Oct 23 2024 at 21:36):
Just checked that the relations force all linear maps to satisfy .
Terence Tao (Oct 23 2024 at 21:37):
I think you had previously confirmed that this implication was immune to linear models (even noncommutative ones), but it doesn't hurt to double check.
Pace Nielsen (Oct 23 2024 at 21:37):
Oh yeah, I forgot for a moment that the target was . Good to have double-checked it!
Alex Meiburg (Oct 24 2024 at 01:54):
For a system with many immunities, one strategy that I haven't seen mentioned could be a LORC (linear on residue classes) function. There's a couple ways to think of it, one would be operations of the form "if x is even and y is odd [foo], else if x is odd [bar], else [baz]" and so on, but generalized to arbitrary mod k. Another way to look at it is that the magma on (Fin k)×ℤ -- or some other ring, doesn't have to be ℤ -- with a sort of semidirect product structure. The map (x,y) -> x gives a homomorphism to a k-element magma, and for all i and j, the map fun x y => ((i,x) ⋄ (j,y)).2
is linear.
Alex Meiburg (Oct 24 2024 at 01:55):
We could try this (mostly by hand I guess) by trying successively larger k, taking a k-element magma that obeys the law, and then trying to fill in the k^2 linear functions. I think a few other infinite examples people used to solve hard nonimplications follow this form with k=2, which is why I have some hope it could work further
Alex Meiburg (Oct 24 2024 at 01:58):
Although, in this case, that could get very messy fast. Hmm. The smallest nontrivial example is on 5 elements, which means you need 25 linear functions, or to solve for some 75 variables.
Alex Meiburg (Oct 24 2024 at 02:01):
Well, I plunk out the system of equations and see if there's an obvious thing to do. Maybe there's even lots of solutions! :)
Terence Tao (Oct 24 2024 at 02:03):
It's worth a shot, though the immunity that this problem has to constructions in which squaring is surjective gives me some pause.
Alex Meiburg (Oct 24 2024 at 02:08):
Do the finite magmas satisfying 1516 always have a surjective square map? The ones listed above all do
Assuming so, that means that (since the homomorphism to Fin k will lead to a surjective square map), if we don't want a LORC construction to have a surjective square, we need at least one of the ax+by+c maps to have gcd(a,b) != 1. And that's sufficient then for having it not be a surjective squaring map
Alex Meiburg (Oct 24 2024 at 02:37):
Well, the k=5 case doesn't work at least (with the ring as Z). Since it's idempotent, there's a closed submagma on (1,x), and indeed for every (i,x). Since by construction that's a linear model, it has to obey the equations people discussed earlier. I derived them as a * (b^2-b) = 1
together with either 1-b+2b^2-2b^3+b^4 = 0
or (c = 0
and -1-b+b^3 = 0
). Solving these over ℂ, we find that there are no integer solutions. (There are solutions over ℂ, in fact over the algebraic integers, but then the squaring map on this submagma is surjective.) Since the squaring map on these submagmas is surjective, it's also true of the whole magma, so it can't be a counterexample.
Terence Tao (Oct 24 2024 at 02:39):
All finite 1516 models have surjective squaring map. Proof So this is indeed a significant obstacle to LORC constructions, since they will likely have finite models thanks to the Lefschetz principle (unless one works with a noncommutative LORC).
Terence Tao (Oct 24 2024 at 02:40):
This massive amount of immunity is annoying!
Alex Meiburg (Oct 24 2024 at 03:10):
I see. Furthermore I found that it doesn't work with the p=7 case above, because just having a _single_ idempotent element actually makes the argument above fall through. In the p=7 case, 0^2 = 0, and so the submagma induced there again needs some a/b/c to work, and there are no rational solutions. So there's no LORC over Z sitting above the p=7 magma, or above any finite magma with at least one idempotent element.
And then -- I didn't quite the follow the Lefschetz part, it's not my area, but I think this is the idea? -- we could say, okay, instead of taking the ring to be Z, we could take (say) the algebraic integers, where we do have a solution for the idempotent elements. And we could then still have a map like x⋄y = 2x+4y for one of the other parts, something that isn't surjective on the algebraic integers. So in that way we could get a LORC that doesn't have a surjective squaring map. But if we had that, we could just extend it to operating on C instead of the algebraic integers, and now squaring is surjective, so it will necessarily obey 255.
Terence Tao (Oct 24 2024 at 03:27):
I guess I'm using the Lefschetz principle in a slightly nonstandard form, namely that any first-order statement that is true over a characteristic 0 algebraically closed field such as C, is also true in (algebraically closed) fields of sufficiently large finite characteristic p, and hence also in certain finite fields of large characteristic; this basically follows by combining the usual form of the Lefschetz principle with the compactness theorem in logic. There is a nice article of Serre describing this technique, for instance to prove the Ax-Grothendieck theorem (which is relevant here). I think this method shows that any polynomial or PORC type construction in any field will necessarily induce a similar construction in a finite field, which we know already to be impossible.
Alex Meiburg (Oct 24 2024 at 03:38):
Alright, that makes sense now, thanks! (It's been a while since I had occasion to think about the Ax-Grothendieck theorem, but it's a happy memory; such a fun result.) So, this also rules out PORC type constructions over any commutative ring of characteristic 0, since they could always be extended to the field of fractions. In particular, nothing over Z. Would have to look at something noncommutative, or with positive characteristic.
Terence Tao (Oct 24 2024 at 04:34):
I'm thinking that a NORC (nonlinear over residue classes, aka an extension of a finite model) has a chance (it's basically the same strategy @Bernhard Reinke outlined over on the 1323 thread). Taking the model is problematic due to squaring being idempotent there, so let's take the model instead. So our carrier is now of the form with a magma operation of the form
where we have a frightening-looking 49 separate magma operations . However, we can take advantage of dilation symmetry to make the projective ansatz for all invertible , which cuts us down to "just" 1 + 48/6 = 9 magma operations. The law 1516 can now be computed to take the form
for all . This is 49 different magma-type laws, but with the projective ansatz we cut back to 9. Firstly there is the a=b=0 case, which is just saying that obeys 1516. Then we have the case, which is
the case, which is
and then six more cases with and non-zero, which are
Bearing in mind that squaring shouldn't be surjective, it is tempting to make equal to a constant 0 to simplify the latter two rules. (We have no examples of 1516 magmas with squaring not surjective right now, so we can't easily make non-surjective.) The hope is now that the various magma operations here are sufficiently "decoupled" from each other that it becomes easier to construct counterexamples.
Terence Tao (Oct 24 2024 at 05:02):
Unfortunately, we can't simultaneously make squaring in surjective while setting squaring in equal to a constant . The former hypothesis together with (1) makes all the left multiplication operators surjective, where denotes left-multiplication . On the other hand the latter hypothesis and (2) makes surjective, hence is surjective, but from (2) and the above hypothesis we know that this map is also constant, a contradiction.
One can still hope that a greedy construction works here, but it doesn't look appetizing; one can wait to see if the simpler situation in 1323 falls to this approach first.
Bernhard Reinke (Oct 24 2024 at 05:56):
A difference to the situation in 1323 is that 255 is a law in only one variable, while for 1323 we want to refute a law in two variables. Part of my assumptions in 1323 is that one-generated submagmas are as simple as possible, namely isomorphic to the finite magma that we are extending over. We of course cannot make this assumption here.
On the other hand, if there is a counterexample, there should be a one-generated counterexample. So I am curious what happens in the following case: z
should be our generator and impose , what kind of inequalities in the terms in z
can we deduce? For example we get . Are there any other easy ones?
Pace Nielsen (Oct 24 2024 at 19:05):
In semigroups and rings, elements satisfying the periodic identity , for some integers are well-studied. If we take to be the smallest such integer it is called the index, and then the smallest such is called the period. Moreover, then if and only if and .
It seems that these types of facts carry over to the magma setting, using left-parenthesized powers, by the same arguments. Since we are trying to avoid , it is interesting to ask which of its consequences might hold.
Also, it seems that tripling of multiplication is common when working with 1516. So I'm wondering whether or not 1516 implies . (This should cover the other identities I was asking about earlier.)
Pace Nielsen (Oct 25 2024 at 16:21):
Checked these in Vampire, and it timed out.
Jose Brox (Oct 25 2024 at 16:23):
Pace Nielsen ha dicho:
Checked these in Vampire, and it timed out.
Pace Nielsen (Oct 31 2024 at 22:53):
Below, for ease of notation, I'm going to suppress the diamond operation by using concatenation and parentheses. Also, I take and as usual.
I'm wondering if some of the techniques that helped solve 854 might apply to 1516, in the following sense. Looking at the equation explorer, we see that 1516 does not imply any other equation (below 4684) except 1426 (coming from taking y=x) and the trivial equation 1. Thus, one might a priori hope that a reduced form could be found by blindly applying the 1516 reduction rule, taking anything looking like y^2(x(xy)) to x, repeatedly in subwords.
However, after a little computation, we see that this isn't the case. For instance, taking in 1516 yields
and so (as mentioned at the very beginning of this thread) we get a new relation
Thus, even though and are both reduced under "blind" applications of 1516, they are nonetheless equivalent.
So we add this new reduction rule to our list of rules, and try again. If I understand correctly (someone please correct me if I'm wrong), someone performed this sort of computation and the process of finding a normal form does not appear to converge to a simple finite set of rules. However, the hope is that by studying enough of the forced new rules, we might be able to glean a new (non-forced) principle that would let us find a reduced form for magmas "close" to 1516.
If someone has access to the program that searched for a normal form, could it be modified to spit out the first 10 (say) new reduction rules that are forced in 1516 magmas? I'm hopeful that we can notice a pattern. [Really, we only need to consider 1-generated 1516 magmas, since we are trying to refute a 1-variable law.]
Pace Nielsen (Nov 01 2024 at 16:03):
Here are the first few new reduction relations I found in addition to the ones above.
Pace Nielsen (Nov 02 2024 at 17:21):
Okay, I found something that is puzzling to me, but Vampire seems to confirm (if I entered it correctly).
Assuming 1516, if , then .
Terence Tao (Nov 02 2024 at 17:22):
I think this is just a relabeling of 1516.
Pace Nielsen (Nov 02 2024 at 17:25):
You are right.
Pace Nielsen (Nov 02 2024 at 17:32):
Does this give us an infinite sequence of reductions? Starting from , we then get (taking ) the new reduction , which in turn gives us the new rule , etc... Not sure if this stabilizes, or gives lots of independent reductions.
Pace Nielsen (Nov 02 2024 at 18:14):
We have . Vampire times out on the converse, so it seems a safe bet that it just doesn't hold. OTOH, if we impose the converse implication anyway, Vampire continues to time out on Eq255. So, to avoid an explosion of new reduction rules, it might be helpful to add the assumption
Terence Tao (Nov 02 2024 at 18:23):
This comes close to imposing the requirement that squaring is injective. If we impose that as well, does that trivialize the problem?
Terence Tao (Nov 02 2024 at 18:25):
Related to this, one could explore whether left or right cancellability has any impact on the problem. For instance, left cancellability implies injective squaring: if then , hence if left multiplication is injective.
Pace Nielsen (Nov 02 2024 at 18:54):
Vampire times out when trying to prove Eq255 (after 999s), assuming Eq1516 + left-cancellativity + the converse implication I mentioned above. So probably we can assume left-cancellativity as well. (Eq1516 + right-cancellativity does give Eq255.)
Terence Tao (Nov 02 2024 at 20:07):
By the way, left cancellability also implies the converse implication you stated anyway, since if then and hence by left cancellability.
We already knew that was surjective, so now with left cancellability these maps are now invertible. So for instance and . From 1516 the inverse is in fact explicitly given as .
Terence Tao (Nov 02 2024 at 21:29):
It is possible that left-cancellability is in fact true in the free group , which could lead to further useful properties on that group. Do we know of any consequence of 1516 of the form for some words with ?
One possible consequence of left-cancellability, in addition to injectivity of the squaring map, is a unique factorization claim when are not equivalent to squares.
Matthew Bolan (Nov 02 2024 at 21:39):
There are at least left-cancellative models of some fairly large finite sizes, for example This 9x9 one
Matthew Bolan (Nov 02 2024 at 21:41):
I suppose that's not too surprising given the known finite linear models.
Terence Tao (Nov 02 2024 at 21:43):
Most of the linear models seem to be left-cancellative as well.
Under the working hypothesis that the free group is left-cancellative, this should imply that squaring is injective, and then I think the only way to rewrite a non-square into an equivalent form is either by showing their components agree, , (i.e., one is in the "unique factorization scenario"), or else if one can first show that is rewritable into for some . If one can safely conjecture that is never a square when is not a square then there is a chance of iterating this and actually establishing some of these conjectures by some sort of induction argument.
[EDIT: actually this probably doesn't work unless one assumes that both and its left component are non-squares, but then in the absence of unique factorization, "left component" might not be well defined. Ugh, maybe the free magma here is actually quite gross...]
Terence Tao (Nov 02 2024 at 21:48):
My experience with the 854 free group suggests that once has a near-complete list of properties (such as unique factorization or left cancellability) that you (correctly) conjecture the free magma to satisfy, you can prove them all in a giant induction, but if you only have a partial list of these properties, you can't prove anything (sort of like how if you only have a portion of a working greedy ruleset, one cannot draw any useful conclusions, other than perhaps a clue as to what the next addition to the ruleset needs to be).
Terence Tao (Nov 02 2024 at 21:54):
(deleted)
Terence Tao (Nov 02 2024 at 22:07):
OK, I think here is what one can correctly say about the free magma by tracing through all possible rewrite options: if and are words with not equivalent to a square, and not equivalent to a square, then in order to prove , one must either first prove and , or else one must first prove for some with .
Basically, there are three different type of rewrites one can apply to a word . Firstly, one can rewrite into something equivalent, or into something equivalent; let me call these "low level rewrites". Or one can do a "top level expansion", replacing with for some . Or, one can do a "top level reduction", if happens to be of the form in which case it can be reduced to . With the hypotheses on or , any rewriting route from to either has to avoid all top level expansions or reductions (which is the first case, as only low level rewrites are involved), or at some point involves at some point a top level expansion followed (after some low level rewrites) by a top level reduction (which gives the second case), since the first top level operation applied to cannot be a top level reduction, and the last top level operation applied to reach cannot be a top level expansion.
Terence Tao (Nov 02 2024 at 23:33):
Here is a random thought: we know that the implication holds when squaring is surjective. What about the opposite case where squaring is constant, i.e., for all , and the equation is now for all . Does Vampire give a proof of 255 in this case? If not then this might be a significantly simpler model to analyze.
Terence Tao (Nov 02 2024 at 23:46):
OK, constant squaring implies triviality: if , then , and also , hence .
I might try a two-valued model later tonight.
Matthew Bolan (Nov 03 2024 at 00:00):
I think prover9 is telling me the only models with are trivial.
Terence Tao (Nov 03 2024 at 00:31):
Ah, that's a shame.
Pace Nielsen (Nov 03 2024 at 02:36):
Vampire is telling me that Eq1516 + left cancellativity implies the following two special forms of right cancellativity:
The second one may simplify the case analysis above.
When I was trying to work through the possible rewrite options, subject to left cancellativity, another case to handle arises from the identity
This is a bit different because of the (possible) lack of a square in the left factor.
When looking at other possible rewrite rules, the following all imply Eq255 (when put in conjunction with Eq1516): , and , and .
However, it appears safe to add the following (independent) rewrite to to Eq1516 + left cancellativity: !
Andy Jiang (Nov 03 2024 at 03:57):
Inspired by Kevin M's ideas at 1485, and some of the above discussion, I asked Vampire to prepare a list of conditions which are both not obviously implied by 1516 and does not obviously imply 255 together with 1516
tally.txt
Hope it can be helpful for inspiration if you are looking for conditions you can impose to models
? means exist and ! means for all by the way
(Note: checked by Vampire at very low time limit so could be obvious non-examples so if you are going to assume one of them please check with Vampire for longer or ask me and I can do it)
As a sample of the file above here are two conditions vampire found with above criterion which seems to only overlap on the trivial model
1) X^3 is constant
2) for some element B, (B * X)^2 is constant
The second condition together with left-cancelation implies 255. So if you are assuming left-cancellation. 1) is a possible additional assumption which you may wish to take
Andy Jiang (Nov 03 2024 at 07:50):
X^3 being constant even has a finite model with left-cancellation being
[[0,1,2,3,4,5,6],
[5,2,3,6,0,1,4],
[6,0,5,1,3,4,2],
[1,3,6,4,5,2,0],
[2,6,4,0,1,3,5],
[3,4,0,5,2,6,1],
[4,5,1,2,6,0,3]]
This is like almost literally a sodoku. Does anyone know an interpretation of this magma? (like in some ansatz)
Bernhard Reinke (Nov 03 2024 at 08:05):
Terence Tao said:
- (Added in view of comments below:) Another linear model is on . In this model, , , and , and then repeats with period 3.
maybe it is this one?
Andy Jiang (Nov 03 2024 at 08:13):
yup! thanks (sorry I didn't see it before)
Kevin M (Nov 03 2024 at 13:31):
If x^3 = constant, then all squares satisfying x^4 = x means there is an element 1*x = x for squares?
Can we deduce what this element does to non-squares?
Kevin M (Nov 03 2024 at 13:37):
Since the goal is to contradict eqn255, then if we try for x^3 = constant, that is like postulating an element that "detects" squares. 1◇x=x for squares, 1◇x!=x for non-squares.
Kevin M (Nov 03 2024 at 13:43):
Sorry, that is too strong (collapses to trivial). Just 1◇x!=x for at least one non-square.
Andy Jiang (Nov 03 2024 at 15:08):
Kevin M said:
Sorry, that is too strong (collapses to trivial). Just 1◇x!=x for at least one non-square.
ah it's interesting that collapses to trivial--could be interesting to understand the proof there
Andy Jiang (Nov 03 2024 at 16:16):
btw easy observation by vampire: such a universal cube must be idempotent b/c
(0 * 0)
=(0 * 0) * ((0 * 0) * ((0 * 0) * 0)) (substitute y=0 and x=(0 * 0) in 1516)
= (0* 0) * ((0 * 0) * 0) (by 0 is 0^3)
= (0 * 0) * 0 (by 0 is 0^3)
= 0 (by 0 is 0^3)
Andy Jiang (Nov 03 2024 at 16:20):
I guess the mechanism is really if y^3 = y then y^2 = y with no assumptions because you just substitute x= (y * y) in the equation and it collapses in a funny way
Kevin M (Nov 03 2024 at 16:44):
Maybe its worth trying to fill out a table again.
First an element to violate eqn255
0 != ((0◇0)◇0)◇0
which forces (0◇0) to be distinct so
1 = 0◇0
Then our "simplification guess" (x◇x)◇x = constant, which as you noted needs to be idempotent, so it is a distinct element.
2 = (x◇x)◇x
This forces:
2◇2 = 2
1◇0 = 2
2◇1 = 1
The element (0◇1) must be distinct, as well as the square of that. Vampire cannot decide if this is another violation of eqn255 or not. Knowing that we need an infinite magma to get a counter-example, and the square is a new element as well, maybe we can put that in here.
So yet another guess, with infinity in mind this time,
f(n+1) = f(n) ◇ (f(n) ◇ f(n))
with f(0) = 0,
could these all be considered non-squares, and violators of eqn255?
Unlike 1485 "shifts", this is more like "successor".
Since ideally all elements would be 'generated' from f(0) with multiplications, I wonder if there is some way the table is "symmetric" to only keeping the elements that can be generated by f(1). Basically, if we can somehow fill out the table such that each element has a "successor" count (or counts), like the f(n), then we can drop 0, reduce all the counts by 1, and get an equivalent table. This should be quite restrictive, so we can probably find a way to rule it out quickly if this cannot work.
Terence Tao (Nov 03 2024 at 18:40):
I spent some time trying to figure out what the free 1516 magma looked like. In the case of , the free 854 magma ended up having a carrier that was a subset of the free magma generated by a single binary operation , with the actual operation defined to equal when the pair is "irreducible", and if the pair is "reducible", with the former being the dominant case.
For 1516, because of the commutativity relation between the squaring map and the cubing map , I think the base free magma has to be take this into account, and will therefore be the free magma generated by a binary operation and two commuting unary operations . So every element of is either a generator, or is uniquely factorable into the form for some natural numbers and some smaller words . One can think of elements of as decorated trees in which the leaf nodes are generators and the branching nodes are decorated by pairs of natural numbers. For instance if are generators then would be a typical word in . So is a magma with two commuting operations , but I stress that these operations are not homomorphisms: for instance.
My starting point the observation is that 1516 can be viewed as the law , so the product of and is the "left factor" of . Accordingly I am trying to work with an ansatz in which has a carrier that is a subset of , with the operation defined to equal for "irreducible" pairs , and defined to equal when and is a "reducible triple", and is equal to if is a "sporadic triple". Intuitively, reducible triples encode all the identities of the form with irreducible that are forced by 854, and the sporadic triples encode all the other identities. My hope is that most pairs are irreducible, a few triples are reducible, and a very small number of triples are sporadic.
The hard part is to figure out all the definitions of the terms in quotes. The definition of irreducibility is clear: is irreducible if is not of the form for some reducible triple , and there is no sporadic triple of the form . To keep the operation well-defined, sporadicity also has to be a partial function (if are sporadic, then ) and disjoint from reducibility in the sense that if is reducible then cannot be sporadic. Then on splitting the 1516 law into various cases, one gets the following additional rules:
- If is irreducible, then is reducible (or one could instead classify as sporadic, if this is more convenient).
- If is coming from a reducible or sporadic triple, then one has to classify as sporadic. So for instance if is sporadic then so is .
Also, the definition of squares and cubes gives some initial sporadic triples and . This generates a few more sporadic triples: , , . The reducible triples include for irreducible pairs (plus a few coming from the sporadic triples), and propagate by the rule that if is reducible then so is .
At one point I thought I had the whole system under control with a complete description of both the reducible and sporadic triples, but there are some annoying famillies of sporadic triples that show up, such as
that reflects the identities
and
There is some hope that the families of sporadic triples are still sufficiently under control that they can be described explicitly, and shown to not collide with each other or the reducible triples, but this is becoming quite a mess of an approach - in principle still formalizable, but worse than the task of formalizing the description of which was already somewhat complex. This approach might still be possible, but I am hoping that there are some other simpler approaches to rule out 255 specifically (which, in this language, corresponds to ensuring that does not emerge as a sporadic triple, which I think will be automatic if this scheme at all closes).
Kevin M (Nov 03 2024 at 22:48):
I made a little progress investigating the following idea
- infinite sequence:
I'm not sure how this relates to Tao's idea as now CS=SC is trivialized to a constant map. (does this simplify things there, or break that approach?)
Kevin M (Nov 03 2024 at 22:48):
Anyway, I haven't found a contradiction yet, but here are some initial relations.
Square elements obey:
- (since square elements satisfy eqn255)
And all elements follow a combination of those two square relations
Kevin M (Nov 03 2024 at 22:48):
Then something interesting popped up with "left cubes".
-
or if we define the "left cube" -
That gives a way to go "backward" along the sequence as well:
So a bi-directional "shift" operation seems to have shown up!
Kevin M (Nov 03 2024 at 23:47):
Andy Jiang said:
Kevin M said:
Sorry, that is too strong (collapses to trivial). Just 1◇x!=x for at least one non-square.
ah it's interesting that collapses to trivial--could be interesting to understand the proof there
I went back to look at this, and I had made a mistake writing the statement in the prover.
Vampire does not find a contradiction when I include axioms for the following:
! [A, B] : A=mul(mul(B,B),mul(A,mul(A,B))) % eqn1516
0 != mul(mul(mul(0,0),0),0) % an element refuting eqn255
! [A] : mul(mul(A,A),A) = 2 % "right" cube is a constant
! [A] : ( (? [B] : A = mul(B,B)) | (mul(2,A) != A) ) % (non-square implies 2*x != x)
Andy Jiang (Nov 03 2024 at 23:53):
ok had some time this morning to play around with vampire and got
assuming 2=(x◇x)◇x for all x and left-cancellation
2◇(2◇x) = x (vampire seems to use left-cancellation to prove this)
(2◇(x◇x))◇x =2 (this follows from Kevin's above observation that 2 ◇ (x ◇ x) = (x ◇ x))
but did not find any simple + nontrivial homomorphisms of this magma
Edit:
((x ◇ x)◇x) = 2 = (x ◇ x) ◇(2◇(2◇x))
directly implies x=(2◇(2◇x)) by left-cancel
Andy Jiang (Nov 03 2024 at 23:58):
does suggest x -> 2◇x is some involution that fixes the squares (again assuming left-cancel + universal cube=2)
Kevin M (Nov 04 2024 at 01:26):
Generalizing the f(n) shifting, we can define:
- cube from left multiplying
From the result forced by .
This gives .
However, with everything I assumed so far, even adding left-cancelation, it does not imply shift(unshift(x))=x.
Throwing that in as yet another axiom, still no contradictions.
Maybe that is the missing piece to simplify everything?
Terence Tao (Nov 04 2024 at 02:09):
Just wanted to say that I hope this approach of imposing simplifying axioms such as constant cubes and left-cancellability will work like it did for 1485; it is almost the opposite approach of the free approach in which one wants to keep everything as distinct as possible, but this leads one to the edge of a combinatorial explosion in relations. I think it might barely close for 1516 and leave us with an in-principle describable free magma, but it is extremely complex.
One small comment is that the linear model has constant cubes. Of course we know that translation-invariant or finite models are not good for this problem, but perhaps it might still supply some intuition.
Michael Bucko (Nov 04 2024 at 12:23):
Kevin M schrieb:
Andy Jiang said:
Kevin M said:
Sorry, that is too strong (collapses to trivial). Just 1◇x!=x for at least one non-square.
ah it's interesting that collapses to trivial--could be interesting to understand the proof there
I went back to look at this, and I had made a mistake writing the statement in the prover.
Vampire does not find a contradiction when I include axioms for the following:! [A, B] : A=mul(mul(B,B),mul(A,mul(A,B))) % eqn1516 0 != mul(mul(mul(0,0),0),0) % an element refuting eqn255 ! [A] : mul(mul(A,A),A) = 2 % "right" cube is a constant ! [A] : ( (? [B] : A = mul(B,B)) | (mul(2,A) != A) ) % (non-square implies 2*x != x)
I'm quite new to Vampire. I turned this script into this format. Is that how you are also using it?
fof(eq1516, axiom, ! [A, B] : (A = mul(mul(B, B), mul(A, mul(A, B))))).
fof(refute_eq255, axiom, 0 != mul(mul(mul(0, 0), 0), 0)).
fof(right_cube_constant, axiom, ! [A] : (mul(mul(A, A), A) = 2)).
fof(non_square_implication, axiom,
! [A] : ((? [B] : (A = mul(B, B))) | (mul(2, A) != A))).
(it ran for 60s and also didn't find a contradiction)
Michael Bucko (Nov 04 2024 at 14:45):
Tried to solve Equation1516_implies_Equation255
using Duper (1,000,000 hearbeats), but it asked for more heartbeats. I'll try with 10,000,000.
Michael Bucko (Nov 04 2024 at 15:00):
Ran it with 10,000,000 heartbeats - no results.
Andy Jiang (Nov 04 2024 at 15:51):
random observation there's a size 8 model of form
(a,b,c) * (d,e,f) = (a+b+e,d+e+f+a+c,e+f+b) everything mod 2
idk if this is linear in a finite field or something
I would guess no? But I guess generalizations of these are ruled out by linear
Amir Livne Bar-on (Nov 04 2024 at 21:41):
Noting here the three linear models of order 7, they have quite different properties. One of them might turn out to be useful.
(a) has constant cubes (where cube is )
(b) has constant left-cubes (expressions of the form
(c) has
These operations can be derived from one another, for example .
Kevin M (Nov 05 2024 at 02:05):
Apologies, but I have a really basic question.
I'm having a bit of trouble understanding what needs to be proven to show that we have a well-defined infinite algebra when we only have one relation to enforce.
Let's consider this toy model:
I'll represent elements as a generator A, or 2-tuples of other elements.
-- define:
head(element) = head of tuple if element is a 2-tuple
or (A,A) if element is A
tail(element) = tail of tuple if element is a 2-tuple
or (A,(A,A)) if element is A
x * y = head(y) if head(x) = tail(x) and head(y) = head(tail(y)) and head(x) = tail(tail(y))
else (x,y)
Note:
x*A != A, because head(A) != head(tail(A)).
So ((A*A)*A)*A != A, violating eqn255.
Basically, each element is defined as the history of how it was generated (with A treated as a special "self-referential" object). Then the multiplication can access this history to enforce the one relation.
If there were multiple relations to enforce, I could see a potential issue where inconsistencies could arise due to the order these are applied or something. But here we only have a single relation. So any other relation derivable from it would have to just be multiple applications of that same relation. So isn't that already encoded in above? I realize this is very hand-wavey, but I'm not sure how to make it more precise to see what I'm missing. And I'm having trouble working on an infinite model without resolving this misunderstanding.
Andy Jiang (Nov 05 2024 at 02:31):
don't you just have to show that your magma obeys 1516? then it's enough I think
Terence Tao (Nov 05 2024 at 02:36):
Note that not all expressions of the form will have the form because need not be and need not be . Similarly for the other two conditions you propose. One can try to make the definition of the multiplication operation more complicated to handle the other cases, but it's a recursive process, every new change one makes to the magma operation generates a new edge case that potentially needs to be handled. This is basically what I tried to do a few days ago; I do think the process will eventually converge, but the rule is not going to be anywhere near as simple as what you propose.
Andy Jiang (Nov 05 2024 at 02:42):
If I calculated correctly (which is unlikely tbh) I think this particular one fails 1516 for X=(A,A) and Y=((A,A),A)
Edit: I take it back
Edit2: I think actually it does fail at those inputs...
Andy Jiang (Nov 05 2024 at 02:58):
btw what's simplest model we know of 1516 with non-square elements?
Terence Tao (Nov 05 2024 at 03:00):
I'm not sure we have one yet
Andy Jiang (Nov 05 2024 at 03:06):
I think vampire can't refute the existence of a model with left-cancel and ((X * X) * X) always = 2 with only 2 non-squares elements which are swapped by the involution x -> 2 * x. Although these conditions are not the simplest so I'm not sure how much weight to put on that
Andy Jiang (Nov 05 2024 at 05:40):
Personally feel like it might be possible to fill in the multiplication greedily row by row b/c the conditions can be written with left multiplication. Like posit two non-squares 0 and 1 which are swapped by left-multiplication by 2 and assume 2 is universal right cube and left-cancellation--I still think these conditions +1516 is not extremely strong on the table. But I haven't seriously tried so could be missing something obvious.
Andy Jiang (Nov 05 2024 at 05:54):
OK upon further thought I think it's probably not so simple...
Amir Livne Bar-on (Nov 05 2024 at 07:44):
Some more finite models for 1516:
I asked Mace4 for non-idemponent models of order 9, and got this one. I did a bit or renaming, guided by the existence of an element that satisfied and . It has the further property that , which I don't know if follows from these (it's equivalent to ). Can anyone identify this as some known algebraic structure?
| e 0 1 2 3 4 5 6 7
--+------------------
e | e 7 0 1 2 3 4 5 6
0 | 0 1 4 2 7 6 e 3 5
1 | 1 6 2 5 3 0 7 e 4
2 | 2 5 7 3 6 4 1 0 e
3 | 3 e 6 0 4 7 5 2 1
4 | 4 2 e 7 1 5 0 6 3
5 | 5 4 3 e 0 2 6 1 7
6 | 6 0 5 4 e 1 3 7 2
7 | 7 3 1 6 5 e 2 4 0
Order 8 had only idempotent models, here's an example after slight renaming (which might confuse more than help). Again I can't identify this operation.
| 0 1 2 3 4 5 6 7
--+----------------
0 | 0 2 3 4 5 6 7 1
1 | 4 1 6 0 7 3 5 2
2 | 5 3 2 7 0 1 4 6
3 | 6 7 4 3 1 0 2 5
4 | 7 6 1 5 4 2 0 3
5 | 1 4 7 2 6 5 3 0
6 | 2 0 5 1 3 7 6 4
7 | 3 5 0 6 2 4 1 7
There were no 1516 magmas of order 10 (this is a merge of results of 2 long runs and I might have had a bug). Order 11 has a single linear model which is idempotent, and Mace4 is having a lot of trouble finding a non-idempotent model, but hadn't ruled it out yet.
Amir Livne Bar-on (Nov 05 2024 at 08:23):
Checked with an ATP, 1516 and together imply 255
Andy Jiang (Nov 05 2024 at 13:04):
Matthew Bolan said:
I think prover9 is telling me the only models with are trivial.
just for kicks I asked vampire the case where there are three squares and so far on my machine it hasn't proved 255 from it yet.
Edit: Nevermind
Kevin M (Nov 05 2024 at 13:13):
Hmm... it's not the most constructive, but here is an attempt at a finite algorithm defining an eqn1516 magma that violates eqn255.
Represent elements as a generator 0, or 2-tuples of other elements.
As before we define L * R = (L,R)
unless a simplification using eqn1516 is possible.
A "simplification" will happen in L * R
when there are x,y such that L=(y ◇ y) and R=(x ◇ (x ◇ y)),
in which case L * R = x
.
Squaring will never trigger a simplification, so for all x, x * x = (x, x)
.
This means we can determine if L is a square, and get y from L unambiguously.
The simplification rule is enforcing eqn1516, x = (y ◇ y) ◇ (x ◇ (x ◇ y))
and therefore ( (x ◇ (x ◇ z)) = (y ◇ (y ◇ z)) ) <-> (x=y).
So given elements R,y if we find an x such that R=(x ◇ (x ◇ y)), it is unique.
The depth of the tuple L * R
(if viewed as defining a tree), will be less than the max depth of L or R if a simplification occurred. So given R and y, we can just search for an x with depth < max(depth(R), depth(L)) such that R=(x ◇ (x ◇ y)). If we find one, it is unique. If none is found, there is no smaller depth element to reduce to, and the result is just the tuple (L,R).
Daniel Weber (Nov 05 2024 at 13:17):
Isn't it possible that a simplification occurred in the definition of R
?
Kevin M (Nov 05 2024 at 13:52):
yes, that is what the search is intended to handle.
Daniel Weber (Nov 05 2024 at 14:01):
But then can't the depth of R
be less than the depth of x
?
Kevin M (Nov 05 2024 at 15:13):
Yes, but if L * R
can result in a simplification, we must be able to write R in the form x * (x * y)
where y comes from L = y * y = (y, y)
. And we'll search for x up to max(depth(L), depth(R)). So in some cases we will search beyond the depth of R.
Daniel Weber (Nov 05 2024 at 15:13):
But isn't it possible that y
is very small, x
is very large, but some sequence of simplifications caused R
to simplify to something smaller than x
?
Terence Tao (Nov 05 2024 at 15:43):
I think you are going down a similar path to the one I took in trying to construct something close to the free 1516 magma. I think the approach has a chance of eventually working, but is quite complex and as such I was reluctant to push for it. But since it seems to be emerging organically, perhaps I will share more of the computations that I have.
Say that is a "simplification triple" if one has . Then 1516 asserts that is a simplification triple for any , where is the squaring operator. One can hope that most of the time one can write (and also ), and that the simplification rules given by this case (i.e., ) are close to a complete set of simplifications. This was where I was a few days ago, and I'm guessing that this is the hope right now.
But there are some complications. Firstly, there is a propagation property of simplification triples:
Lemma 1. If is a simplification triple, then so is .
Proof: By hypothesis, , thus
So in addition to the original simplification triples , one also has
and so forth.
This still isn't so bad, one could just add all of these to the list of simplification rules, show that they don't conflict with each other, and hope that now we have a near-complete list of simplifying rules. I tried this, but another problem emerged. There are some "sporadic" simplifying rules that are not of the standard form (in the sense that is not expected to be of the form for some ). An example is the rule , where is the cubing map (this is related to the previously observed fact that and commute, as well as the fact that in the constant cubing case one has ). At first I thought there were only a small number of sporadic rules (a finite number of rules parameterized by one variable ), but then I found more due to this lemma:
Lemma 2. If and are simplifying triples, then we have the sporadic identity .
Proof: By hypothesis, and , thus .
For instance, since we know and are simplifying triples, we have the sporadic identity , where is the word
Okay, so we have to add a few sporadic identities in addition to the previous families of simplifying triple identities. Now do we have something close to a complete set of simplifying rules? Well, unfortunately sporadic identities beget further sporadic identities:
Lemma 3. If we have a sporadic identity of the form , then we get another sporadic identity of the form .
Proof.
So for instance starting from we have
and this can be iterated further to obtain an increasingly complex sequence of sporadic identities.
So, now are we done? Regrettably, there is another propagation vector: sporadic identities also generate new simplifying triples!
Lemma 4. If we have a sporadic identity , then we have a simplifying triple .
Proof. This is just a specialization of the previous observation that is always a simplifying triple.
So now we have a potential feedback loop in which simplifying triples and sporadic identities beget each other in a combinatorial explosion. However, with my computations it seems that this explosion is barely averted: the "simplifying triple -> sporadic identity" vector in Lemma 2 requires two simplifying triples to create a sporadic identity, and it appears that all of the additional simplifying triples beyond the standard ones do not actually generate any further sporadic identities, and so the process stops with a finite (but rather complex) collection of families of both simplifying triples and sporadic identities.
I believe that if one then writes down all these identities (as well as the commutativity law ), one can use this to build a version of Kevin's magma that actually does obey 1516 in all cases, and is in fact the free 1516 magma (because every rule one imposed is actually forced on us by 1516), but the verification of this is extremely lengthy and tedious - longer for instance than the verification of the free 854 magma in the blueprint. If it seems like this is the only way to resolve this equation I can try to get all the details into the blueprint, but I was really hoping that there was another way to proceed here.
Terence Tao (Nov 05 2024 at 16:18):
It's possible that the theory of terminating rewriting systems can be deployed to automatically check that the final set of rewriting rules is actually complete. The main issue is that this is not a finite set of rules, but has some infinite families (and even a doubly infinite family, obtained by the many applications of Lemma 3 applied a standard sporadic triple, then one application of Lemma 4, then many applications of Lemma 1). However, I think it may be possible to truncate the family and create a slightly overcomplete, but now finite, set of rewriting rules. This would no longer generate the free 1516 magma but could still be enough to resolve the implication to 255. Is there an automated way to verify completeness of a finite set of rewriting rules? In that case I can relatively easily generate a candidate set of such rules.
Kevin M (Nov 05 2024 at 16:58):
I remember a blog post of Scott Aaronson about using some automated tooling for verifying termination of rewriting rules. Let me see if I can find that.
Kevin M (Nov 05 2024 at 16:59):
Oh, it was his Collatz paper.
maybe these would be useful:
https://github.com/jwaldmann/matchbox
https://aprove.informatik.rwth-aachen.de/
Matthew Bolan (Nov 05 2024 at 17:10):
When you say "verify completeness", you mean verifying that all rules in the original rewriting system belong to the new one?
Terence Tao (Nov 05 2024 at 17:21):
Perhaps I am using the wrong terminology - I am not sufficiently familiar with the language of rewriting systems to be sure - but basically one has a magma operation on the free magma generated by a single pairing operation and two commuting unary operators , which is usually just the pairing operation but with a finite number of exceptional rules, the most common of which is
and also the defining rules for
but there are others such as
(where ), and so forth (I think I have about 20 such rules overall). I would like to show that the magma with this ruleset (a) is well-defined (in that there are no conflicting rules of the form and with ), (b) obeys 1516, and (c) does not obey 255 (this one I expect to be easy). It is (b) which is tricky because 1516 involves three magma operations and naively one has 21^3 cases to check if there are 20 different exceptions to the rule , but perhaps this can be done automatically.
If I wanted the free magma, I think I would need to specify an infinite family of such rules, but for the purposes of ruling out 255 it would be enough to have a finite set of rules that include all the rules forced by 1516, but a few unnecessary ones that simplify the ruleset. For instance, any rule obtained using two or more applications of Lemma 1 above from a simplification triple will be of the form
so I was thinking of simply adding _all_ such rules of this form to the ruleset, regardless of whether was already known to be a simplification triple, with the intuition that the inputs and here are highly related to each other and this rule is unlikely to cause significant problems as it is so rarely invoked. This way I can truncate all applications of Lemma 1 beyond the second one.
EDIT: I considered also adding a rule such as but actually this does not seem to simplify the ruleset much, as most of the rules do not actually involve . This may help explain why this alternate approach to the problem was only yielding limited progress.
Pace Nielsen (Nov 05 2024 at 17:37):
I have been experimenting with the Kisielewicz ansatz for 1516, trying to come up with (hopefully) an easier description on the free 1516 magma, or something close to it, as follows.
Think of our carrier set as the positive integers, and take as our first "rule" the equality
where is some (injective) function on the positive integers, mapping to a nice, easily recognizable image. (For instance, we might take , in which case that nice subset would be "powers of 2".) Using the 1516 identity , we can now iterate and define new rules as follows.
Taking , we get the identity . So we can take
Again, we would like to think of as a nice function, mapping to somewhere "new", something like , although we'll keep things arbitrary for now.
We can use (2), taking to get the identity . So we can take
We can use (3), taking to get . So we can take
Using (5), taking we get . So we just get
Now using (7), taking we get . This means that commute. Of course, we already knew this because squaring and cubing commute. (But it is nice to see it arise again.) If we think of as a somewhat "generic" exponentiation map on the positive integers, then the only other nice maps that commute with it are powers of it.
I tried experimenting with taking , but in those three cases eventually problems arose. I'm starting to think that this method is unfortunately unworkable when taking cubing to be a simple integer power of squaring. [Maybe someone can see an easy reason for this.]
Pace Nielsen (Nov 05 2024 at 17:49):
Maybe a better carrier set would be something like , where we treat infinity as the constant value of cubing, and right multiplication by it is the obvious "shift the second coordinate" involution?
Terence Tao (Nov 05 2024 at 19:14):
Ugh, it turns out there is a fifth vector, again from sporadic pairs to simplifying triples:
Lemma 5. If , then is a simplifying triple.
Proof. From 1516, is a simplifying triple, hence by Lemma 1 so is , and hence by Lemma 1 again so is . Substituting , we obtain the claim.
Assuming no more vectors show up, I can truncate to a ruleset of size about 60 rules, but now the generation of the rules as well as its verification both need to be computer generated, as there are just too many opportunities for mistakes here.
[UPDATE: Actually, things are not as bad as I feared. Lemma 5 is actually just Lemma 3 followed by Lemma 4, so doesn't actually generate new rules.]
My explorations did produce one unusual sporadic rule, namely
which I believe is actually forced from 1516 (perhaps this can be checked with an ATP: UPDATE, see below, this is not quite implied by 1516 unless one assumes injective squares) and which may possibly be useful in advancing the "assume " strategy. (This is an unusual rule in that it is a rule of the form where the first argument is not obviously a square. The only other rule of this type I have discovered is the rule defining .)
Terence Tao (Nov 05 2024 at 19:19):
IMG_1224.jpeg in case anyone is interested, here are the rewriting rules I found so far, which may be complete enough to verify 1516 in all cases (but now there are like 60^3 cases to check)
Terence Tao (Nov 05 2024 at 19:48):
@Pace Nielsen I think I have generated a similar list of sporadic laws involving squares and cubes. My list is
- (correcting a previous version)
and then there are about two dozen more of an irregular "simplifying triple" form (basically applying Lemma 4 or Lemma 5 to the above list, followed optionally by one application of Lemma 1), and the rest lie in about three infinite families (formed by two applications of Lemma 1, or two applications of Lemma 4). (Somehow the exercise reminds me of learning about irregular verbs and obscure tenses in French.) I can imagine that setting or causes clashes here, though I'm a little surprised that caused an issue given all the previous analysis of the case.
But actually the most frequent rules that show up don't involve any unary operator beyond squaring, so I now think that trying to fix the other unary operators in terms of squaring may not yield decisive simplifications (one may get from say 60 rules down to 40 or so, but still computer assistance would be required). For instance, the rule , where and , doesn't involve anything other than squaring, but is a relatively frequent rule that one encounters when trying to verify the (many) cases of 1516, largely because one of the inputs and the output are almost completely unconstrained. (Most of the rules have so many constraints between their inputs and outputs that their case analysis is quite easy to resolve, but the ones in which two out of the three of the inputs and outputs are relatively unconstrained cause difficulties in that they keep generating new identities out of old, as captured by Lemmas 1-5.)
Kevin M (Nov 05 2024 at 20:08):
I'm having trouble verifying number 9 and 11 with vampire.
Terence Tao (Nov 05 2024 at 20:23):
For 11, the proof should start with equation 6, , which should imply
and on multiplying both sides on the left by one should get the claim. If some of these previous claims are not proven by Vampire, that would pinpoint where the error lies.
For 9, I think I removed a square incorrectly. Is the weaker claim
proven by Vampire?
Amir Livne Bar-on (Nov 05 2024 at 20:25):
Trying to understand the finite models above, the order 8 model is isomorphic to a linear model on , with the matrix
1 1 0 0 1 0
1 1 1 1 0 1
1 0 1 1 0 0
where the convention is that the first operand is concatenated to the second to create a vector in , which is multiplied with this matrix.
Kevin M (Nov 05 2024 at 20:28):
prover9 found 11 (so I probably wrote something a bit off for vampire), but no luck with 9 still
Kevin M (Nov 05 2024 at 20:29):
okay, found a missing parens for 11 in vampire, that is all okay now.
adding S to both sides of 9, it now proves that as well.
Terence Tao (Nov 05 2024 at 20:30):
OK then I will correct my text, thanks for confirming.
Jose Brox (Nov 05 2024 at 21:57):
Pace Nielsen ha dicho:
I have been experimenting with the Kisielewicz ansatz for 1516
I also tried the Kisielewicz sword against the 1516 boss for a while, in the end I move to other things because I run into problems with all the options I tried... Basically, when confronting an ambiguity there are two possibilities: either allow the iteration of the existing functions and add a new rule for the new case, or add new functions to resolve the ambiguity. If I recall correctly, in the 1516 case there were a lot of these choices (I also tried with 854, etc.).
Jose Brox (Nov 05 2024 at 22:02):
(where Terry has a set of blackboards, I have a Kisielewicz paper's ocean :sweat_smile: :grinning_face_with_smiling_eyes:)
20241105_225837.jpg
Pace Nielsen (Nov 05 2024 at 22:08):
@Jose Brox Yeah, unless I see some simplifying principle, the Kisielwicz ansatz seems to be just reproducing some of (with less generality) Terry's computations.
Jose Brox (Nov 05 2024 at 22:09):
Jose Brox ha dicho:
I also tried the Kisielewicz sword against the 1516 boss
Although provisional and abandoned, I share here
Kisielewicz model for equation 1516.pdf
my try at the Kisielewicz model, just in case it may be useful. I changed it multiple times, so there will be mistakes.
Terence Tao (Nov 05 2024 at 22:36):
Pace Nielsen said:
Jose Brox Yeah, unless I see some simplifying principle, the Kisielwicz ansatz seems to be just reproducing some of (with less generality) Terry's computations.
Yeah, if you replace the free magma with the natural numbers, the pairing operation with say , and the squaring and cubing operations with say and then I think one can convert the model I am proposing into a (very complicated) Kisielewicz type model.
While the model is horrible to describe in human-readable terms, I think I can describe in (barely) human-readable form what properties the model has to satisfy, and propose an initial candidate for the model (as I said, it will have about 60 or so rules, but they can be "procedurally generated" from a smaller seed to a certain extent). It may therefore be possible to get to the point where the model can be first computer verified, and then formally verified. Will send an update on this later today.
Andy Jiang (Nov 06 2024 at 00:38):
Sorry naive question: why is the carrier set the free magma with no relations (i assume single generator so it's like binary trees)? I would've thought you would define it in reduced trees (relative to your rules) and show that the reduction rules are confluent (Diamond lemma sort of thing like no reduction choice should make a difference in the end). What does completeness of rewriting mean in this setting(sorry if it's a standard term I'm happy to read a reference)? Sorry if you already explained above but I didn't quite understand. Like if you do it this way with all the rules not a modified set, why do you generate the free 1516? I don't understand because your carrier contains unreduced expressions--I think I'm missing something simple
Terence Tao (Nov 06 2024 at 00:46):
This may be a better approach (restricting only to words that are reduced, and demonstrating confluence of the set of reduction laws). You are right that what I am trying to prove here is slightly stronger than is needed, which is that the magma operation on even the non-reduced words still satisfies 1516. To get to the free 1516 magma, one then has to restrict to the subset of words which can actually be generated from the magma operation, which is probably the same thing as the set of reduced words.
So perhaps the ruleset can be written in terms of just (and also and , which I still believe should be kept separate operations otherwise the commutativity property is going to cause real headaches) abandoning explicit use of the pairing map , and one might be able to check confluence of the system which would be a slightly different, but certainly viable, way to proceed.
Andy Jiang (Nov 06 2024 at 01:05):
Oh I didn't mean it as a suggestion of improvement haha I was only trying to understand! Also does theory guarantee that confluence/rewriting system completeness is decidable by some algorithm? Is the idea that two reductions can only overlap in finitely many ways and you just enumerate all the ways and each way have a common reduction?
Andy Jiang (Nov 06 2024 at 01:23):
Oh sorry is it correct to say for the rewriting system you only do a single reduction step at multiplication time with one of your rules, and the only thing you need to check is 1516 is satisfied in the constructed magma? Where you take binary trees as carrier and multiplication is check against all rules and if not matching then join into a larger tree?
Terence Tao (Nov 06 2024 at 02:04):
Here is part I of the complex proposal to build a nearly free 1516 magma. It requires one to guess a good ruleset of "sporadic rules" and "simplifying rules" that need to obey 12 axioms, but I am still working on a viable proposal for what that good ruleset might be. Most likely the first few iterations of the proposal won't quite close and so one will need to keep tweaking it. This is a very tedious and error-prone process to do by hand, so we may have to look into more automated options.
--
We work in the free algebra generated by a pairing operation and two commuting unary operators , and some set of generators (one could also work with the natural numbers with pairing , , and , but I don't think this actually simplifies anything). We assume that we have a collection of "sporadic rules" , as well as "simplifying rules" of the form for some collection of "simplifying triples" . This lets us define an operation on the free algebra by the following rules:
- (Sporadic case) If is a sporadic rule, then we define equal to .
- (Simplifying case) If is a simplifying triple, we define equal to .
- (Generic case) In all other cases, we define to equal .
In order for this to give a magma, we need to verify two axioms:
Axiom 1. If and are both sporadic rules, then .
Axiom 2. If is a simplifying triple, then there are no sporadic rules of the form .
We would like to have for all , so we impose
Axiom 3. For any , is a sporadic rule.
Now we need to verify 1516, which is . Note that can be sporadic, simplifying, or generic; and can also be sporadic, simplifying, or generic. So there are nine cases to consider.
Case 1: is generic; is generic. Then we need to show and so it will suffice to have
Axiom 4. If and are generic, then is a simplifying triple.
Remark: This axiom should somehow contribute the "most frequently encountered" simplifying triples (otherwise this method is unlikely to work).
Case 2: is simplifying; is generic. Then we have , , and generic. So we need to have
Axiom 5. If is a simplifying triple and is generic, then is a simplifying triple.
Remark: Iterating this axiom twice, we expect to see simplifying triples of the form . I am hoping we can truncate the ruleset by declaring all such triples to be simplifying triples.
Case 3: is sporadic; is generic. Then we have some sporadic rule and is generic. So it suffices to have
Axiom 6. If is a sporadic rule and is generic, then is a simplifying triple.
Remark: This axiom basically means that every sporadic rule also generates a simplifying triple.
Case 4: is generic; is simplifying.
In this case . So we need
Axiom 7. If is generic and is a simplifying triple, then is a sporadic rule.
Remark: I am hoping that this axiom only activates very rarely.
Case 5: is simplifying; is simplifying.
We now have for some simplifying triples and . So we need
Axiom 8. If and are simplifying triples, then is a sporadic rule.
Remark: I am hoping that this axiom mainly only activates when both simplifying triples arose from Axiom 4. This gives rise to the sporadic rules where and . Another consequence of this axiom is that if is sporadic, then almost certainly we will have to have the sporadic rule .
Case 6: is sporadic; is simplifying.
So now is a sporadic rule and is a simplifying triple. The axiom we need then is
Axiom 9. If is a sporadic rule and is a simplifying triple, then is a sporadic rule.
Remark: I am hoping that this axiom activates very rarely.
Case 7: is generic; is sporadic.
Here we have as a sporadic rule. So the axiom needed here is
Axiom 10. If is generic and is a sporadic rule, then is a sporadic rule.
Remark: I am hoping that this axiom only activates very rarely.
Case 8: is simplifying; is sporadic.
So now for some simplifying triple and there is a sporadic rule . So the axiom needed is
Axiom 11. If is a simplifying triple and is a sporadic rule, then is a sporadic rule.
Remark: I am hoping this axiom is activated primarily when the simplifying triple arose from Axiom 4, in which case it asserts that the sporadic rule creates the sporadic rule , which in turn creates the sporadic rule , where .
Case 9: is sporadic; is sporadic.
So now we have two sporadic rules and , and the axiom needed is
Axiom 12. If and are sporadic rules, then is a sporadic rule.
Remark: I am hoping that this axiom activates only rarely.
One could imagine setting up an automated scheme in which one starts with a good "seed" of sporadic rules and simplifying triples, and then runs the above axioms to generate more, hoping that the process terminates without creating a contradiction (a violation of Axiom 1 or Axiom 2). I have some ideas for what the seed should be, but I am still working on it.
Andy Jiang (Nov 06 2024 at 03:09):
I wonder if after you have a list of rules (maybe machine-help-generated) and you write it into Lean as like an operation on the type freely generated by S, and the magma operation (the carrier you have) and write down the multiplication--is it possible that Lean itself can with enough time verify that 1516 is satisfied by repeated uses of cases and simp tactics or something? but I'm not very familiar with the capabilities of Lean so I don't know
Terence Tao (Nov 06 2024 at 04:35):
Yes, Lean would be able to do so, but it could be extremely tedious unless we find a way to automate it.
Here is a simplification of the approach in which we abandon the concept of a "simplifying triple" by absorbing it into the "sporadic rules", which are now a larger family of rules . The magma operation is now given by just two rules:
- (Sporadic case) If is a sporadic rule, then we define equal to .
- (Generic case) In all other cases, we define to equal .
Naively, one could try to suspend all genericity hypotheses and work with the following slightly too-strong axioms:
Axiom 1 If and are both sporadic rules, then .
Axiom 2 We have the sporadic rule .
Axiom 3 For every , we have the sporadic rule .
Axiom 4 If we have the sporadic rule , then we have the sporadic rule .
Axiom 5 If we have the sporadic rule , then we have the sporadic rule .
Axiom 6 If we have the sporadic rules and , then we have the sporadic rule .
Unfortunately, the commutativity of squaring and cubing shows that these axioms are inconsistent. More precisely:
- From Axiom 1 and Axiom 4 we have .
- From 1. and Axiom 4 we have .
- From Axiom 3 we have .
- From 3. and Axiom 5 we have .
- From 4. (twice) and Axiom 6 we have .
- From 1., 5., and Axiom 6 we have .
- From 5. and Axiom 4 we have .
- From 7. and Axiom 5 we have .
- From 8. (twice) and Axiom 6 we have
- On the other hand, from 5. we have This and 9. now violate Axiom 1, since in the free magma.
To fix this we have to implement the cubing operator , and replace any occurrence of by . This requires adding the additional sporadic axiom
Axiom 2'
and then also reinstating as much of the genericity hypotheses to the axioms as we can. So, at a given state of the ruleset, we replace Axioms 3-5 with
Axiom 3' For every , if we do not currently have any rule for or for , then we have the sporadic rule .
Axiom 4' If we have the sporadic rule , but we do not currently have any rule for , then we have the sporadic rule .
Axiom 5' If we have the sporadic rule , but we currently do not have any rule for , then we have the sporadic rule .
We can then try to dynamically grow the ruleset starting from just Axiom 2 and Axiom 2' by greedily trying to verify Axioms 3', 4', 5'; the previous contradiction is now blocked from occuring. I suspect that to block a collision later on, we will also need to impose
Axiom 2'' .
(This law does not appear to be forced by 1516, but its square is, so we have to include it in this ansatz, as squaring is necessarily injective here. By the way this seems to indicate that squaring is not injective in the free magma, which means that this approach will not be able to capture that free magma precisely, but it may still get close enough to it that we can still refute 255.)
It may be possible to just run a greedy algorithm on this initial ruleset and see numerically if any collisions occur. This will not terminate currently, but it might give some hope that the strategy could work, and one can then try to add a few more seed axioms (motivated by, but not forced by, 1516) to force termination in finite time.
Jose Brox (Nov 06 2024 at 10:03):
Terence Tao ha dicho:
Pace Nielsen said:
Jose Brox Yeah, unless I see some simplifying principle, the Kisielwicz ansatz seems to be just reproducing some of (with less generality) Terry's computations.
Yeah, if you replace the free magma with the natural numbers, the pairing operation with say , and the squaring and cubing operations with say and then I think one can convert the model I am proposing into a (very complicated) Kisielewicz type model.
The thing with Kisielewicz is that you can reduce freeness at the cost of introducing more functions when a new ambiguity appears; but for this you have to make educated guesses which may prove to be incompatible with the operation several steps in the future... It is kind of similar to be building a finite-state automaton and creating a new state instead of just creating new transitions between existing states.
Pace Nielsen (Nov 06 2024 at 14:28):
Today I need to teach classes and write a letter of recommendation, but I do have some ideas about how to implement an automation to find/check if the list of sporadic rules is complete. I'll try to get to that as soon as possible, probably tomorrow though.
Andy Jiang (Nov 07 2024 at 00:39):
Do we have any toy examples of a system of rules like this for another equation we can try to implement in the meantime? Like maybe free 854 magma? Just to test any general frameworks we might be able to build
Terence Tao (Nov 07 2024 at 03:38):
I looked at the Kiseleiwicz constructions, but those are mostly modifications of the constant law rather than modifications of a free law , and also they have so many operations that the number of axioms needed is rather large. I guess one could use this procedure to discover a large number of rules of the form , which would slowly fill out the infinite family of relations in https://teorth.github.io/equational_theories/blueprint/854-chapter.html#free-relate .
Terence Tao (Nov 07 2024 at 14:47):
I guess one could try building this type of model with 1323, . Here, we work on a free magma generated by one generator and a pairing operation (i.e., trees), and the magma operation is equal to unless there is a sporadic rule . To simplify things we also assume that there are no sporadic rules for , so . If one can impose the axioms
Axiom 1 If there are sporadic rules and , then .
Axiom 2 There is no sporadic rule of the form .
Axiom 3 One has the sporadic rule .
Axiom 4 If one has the sporadic rule , then one has the sporadic rule .
Axiom 5 If one has the sporadic rule , then one has the sporadic rule .
Axiom 6 If one has the sporadic rules and , then one has the sporadic rule .
Such a system will automatically obey 1323, and if so has a good chance of not obeying 2744. So one could hope to just generate rules using Axioms 3,4,5,6. If one is super optimistic then this process terminates with a finite number of sporadic rules (in terms of arbitrary trees ) that still do not violate Axioms 1,2, in that any new sporadic rule generated is a special case of an existing rule. But there may be self-sustaining chains of rules, for instance one may discover that every sporadic rule of the form creates another rule of the form where for some transformation . If this is the case, this creates an infinite sequence of families of rules, but one can truncate it by declaring for instance that is a rule whenever for some , even if no rule of the form exists. This makes the ruleset slightly too large to be the free magma, but could help make the task of finding a complete ruleset terminate in finite time.
So maybe this is a good model problem to work on first before tackling 1516, which seems to generate many more rules and has the additional complication of squaring and cubing having to commute.
Terence Tao (Nov 07 2024 at 14:56):
OK, this one may be harder to truncate than I thought. Axiom 4 iterates itself if we keep squaring . For instance two applications of Axiom 4 tell us that if we have the sporadic rule , then we have the sporadic rule . I had thought to truncate this iteration by just adding all rules of the form as sporadic rules for any , but this violates Axiom 1 because the right-hand side involves and the left-hand side doesn't. So perhaps one just has to add the infinite family of all rules generated by Axiom 4 (starting from Axiom 3) and hope that this doesn't trigger any invocations of Axiom 5 and Axiom 6, then we get to close the system. One thing that gives me hope is that in the sporadic rules generated by Axiom 4, is longer than ; but for at least one of the sporadic rules needed for Axiom 5 or Axiom 6, is longer than . So maybe all the extra rules generated by Axiom 4 don't cause uncontrollable combinatorial explosion (and/or violation of Axiom 1 or Axiom 2).
Terence Tao (Nov 07 2024 at 15:11):
Actually, I think this just works for 1323. Setting , , and , then Axiom 3 says and Axiom 4 says that , which iterates to give an alternating infinite family of sporadic rules
etc. and I think this ruleset obeys Axioms 1, 2, 5, 6 (hence 1323) and not 2744?
Wait, Axiom 6 might not be obeyed. Checking...
Terence Tao (Nov 07 2024 at 15:24):
OK, Axiom 5 is fine (because in the hypothesis , is longer than , which is not the case for any of the axioms in the above list), and Axiom 6 is fine unless the rule is of the form (i.e., ), in which case Axiom 6 now reads
Axiom 6'. If is a sporadic rule, then is a sporadic rule.
So this unfortunately generates further infinite families of rules (now generated a free semigroup on two generators, Axiom 4 and Axiom 6') but it still might be possible to get enough control on the rules generated that no further triggering of Axiom 6 or Axiom 5 occurs, or any violation of Axiom 1 or Axiom 2, in which case we are still in business.
Terence Tao (Nov 07 2024 at 15:44):
OK, I think the process can terminate, though it's not super clean. Setting for brevity, then Axiom 6' says that implies , which iterates to . To stop proliferation, I will simply add all rules of this type as a new axiom:
Axiom 7.
Note that this does not violate Axiom 1 since and determine and hence . (In contrast, one could not truncate earlier at .)
The complete rule set is then, I conjecture, all the rules coming by starting with either Axiom 3 or Axiom 7, applying 0 or more applications of Axiom 4, and then applying 0 or 1 applications of Axiom 6'. The point is that after applying 1 application of Axiom 6', the term to the left of should no longer be a square, and so Axiom 4 can't trigger, and after 2 applications of Axiom 6', one is back in Axiom 7. One also has to check that Axiom 5 can't trigger, nor can any other case of Axiom 6 other than the one leading to 6', but I think this is just a finite case check (hopefully).
Bernhard Reinke (Nov 07 2024 at 17:04):
I am not sure I understand Axiom 7, what relations should hold between such that the conclusion holds? doesn't appear on the RHS, so I guess there are some assumptions.
Terence Tao (Nov 07 2024 at 17:09):
No, there are no assumptions on ; it is a relatively sweeping law. It is not forced by 1323, but I am hoping that we can insert it in order to close the iteration and obtain a superset of all the laws forced by 1323 that are stil self-consistent and still refute 2744. Am writing notes on this for the blueprint right now.
Bernhard Reinke (Nov 07 2024 at 17:23):
Hm, I think it we add to 1323 the law that corresponds to axiom 7 (in the sense of using the magma operation in instead of the tree building operation, this already implies 2744
Terence Tao (Nov 07 2024 at 17:24):
Oof that is a blow to the strategy. Um what if instead we add the third iterate instead, ?
Terence Tao (Nov 07 2024 at 17:27):
This makes the propagation tree slightly more complicated, because if one starts from Axiom 3 and applies Axiom 6' twice, then one can then apply Axiom 4 indefinitely, but I have some hope that that is the only other infinite family produced and so the argument may still close.
Bernhard Reinke (Nov 07 2024 at 17:31):
Terence Tao said:
Oof that is a blow to the strategy. Um what if instead we add the third iterate instead, ?
Same issue unfortunately
Terence Tao (Nov 07 2024 at 17:33):
That surprises me a little to be honest. That complex rule implies 2744 in all cases? Is this an ATP check?
Terence Tao (Nov 07 2024 at 17:33):
oh by the way I think I misspelled the law. checking... no, it's correct
Bernhard Reinke (Nov 07 2024 at 17:36):
yes, this is the tptp snippet, vampire shows 2744 immediately
fof(eq1323,axiom,
X = m(Y,m(m(m(Y,Y),X),Y))
).
fof(def_T, axiom,
t(Y,X) = m(m(m(m(Y,Y),m(Y,Y)),X),m(Y,Y))
).
fof(axiom_7, axiom,
m(t(Y,Z),t(W,t(Y,Z)))=t(t(Y,Z),W)
).
fof(eq2744,conjecture,
X = m(m(m(Y,Y),m(Y,X)),Y)
).
Bernhard Reinke (Nov 07 2024 at 17:37):
I hope this corresponds to what you had in mind (this is version for the iterated law)
Terence Tao (Nov 07 2024 at 17:39):
It looks all right but just as a sanity check - if you restrict Axiom 7 to only those Y,Z,W for which m(Z,Y)=W, is it a consequence of 1323?
Terence Tao (Nov 07 2024 at 17:41):
Anyway this makes me realize that it is far better to use ATPs to test whether various relaxations of laws are possible in order to truncate the iteration, than to simply propose them and then do a very tedious human verification
Terence Tao (Nov 07 2024 at 17:45):
One could also try the next iteration out, . If that still implies 2744 then there's something fundamental that this approach is missing. Perhaps it is that we are not forcing to be non-injective for some .
Bernhard Reinke (Nov 07 2024 at 17:46):
Terence Tao said:
It looks all right but just as a sanity check - if you restrict Axiom 7 to only those Y,Z,W for which m(Z,Y)=W, is it a consequence of 1323?
It doesn't seem to be a consequence, at least vampire runs quickly out of memory
Terence Tao (Nov 07 2024 at 17:48):
OK, then there is a typo somewhere. Can you verify that 1323 implies Axiom 6', namely that implies ?
Bernhard Reinke (Nov 07 2024 at 17:49):
yes, that is ok
Terence Tao (Nov 07 2024 at 17:49):
ok... does imply ?
Terence Tao (Nov 07 2024 at 17:50):
Ah I see my mistake
Terence Tao (Nov 07 2024 at 17:52):
should imply instead
Terence Tao (Nov 07 2024 at 17:52):
if it does, I propose as the new Axiom 7 (with no prior restriction that ) and see if this does not imply 2744
Bernhard Reinke (Nov 07 2024 at 17:55):
it is now a consequence of 1323 with the assumption , but the new Axiom 7 still implies 2744.
Terence Tao (Nov 07 2024 at 17:57):
OK... um can you try the same exercise with replaced by ?
Terence Tao (Nov 07 2024 at 17:57):
i.e. it should be a consequence of 1323 with , and (fingers crossed) does not imply 2744.
Terence Tao (Nov 07 2024 at 17:58):
My intuition here is that once all three components of a law become complex, then the law becomes dramatically less influential for proving other laws
Bernhard Reinke (Nov 07 2024 at 18:01):
Same as before, it is a consequence of 1323 with , but without it implies 2744
Bernhard Reinke (Nov 07 2024 at 18:02):
fof(eq1323,axiom,
X = m(Y,m(m(m(Y,Y),X),Y))
).
fof(def_T, axiom,
t(Y,X) = m(m(m(m(Y,Y),m(Y,Y)),X),m(Y,Y))
).
fof(axiom_7_second_iterate, axiom,
m(t(Y,Z),t(W,Y)) = t(t(Y,Z),W)
).
fof(eq2744,conjecture,
X = m(m(m(Y,Y),m(Y,X)),Y)
).
this is the tptp snippet for the last claim. I have to run now, but maybe other people can help with ATP questions if you have them
Michael Bucko (Nov 07 2024 at 18:47):
Terence Tao schrieb:
My intuition here is that once all three components of a law become complex, then the law becomes dramatically less influential for proving other laws
Interesting idea! Much less influential in terms of predictive power, or reusability for proofs? If reusability, is it reusability for humans, or any computer?
Michael Bucko (Nov 07 2024 at 18:53):
Terence Tao schrieb:
Anyway this makes me realize that it is far better to use ATPs to test whether various relaxations of laws are possible in order to truncate the iteration, than to simply propose them and then do a very tedious human verification
That could enable more gradual, smaller, and very well defined ATP tests. Such smaller tests could also be (to some extent) (co-)coordinated by LLMs. I'll open a GitHub issue - I could make a script for this. One would only have to add their LLM API key and the path to Vampire.
EDIT: https://github.com/teorth/equational_theories/issues/803
Terence Tao (Nov 07 2024 at 19:45):
@Bernhard Reinke Thanks so much! I have to attend to other things now, but will come back to this, probably on the 1323 thread rather than the 1516 thread. Anyway this seems to be a simpler model for this type of approach than 1516, so thank you @Andy Jiang for the suggestion!
Pace Nielsen (Nov 07 2024 at 21:32):
@Terence Tao I started the work of automating the search for new rules for 1516. Below I'll ignore the sporadic rules of the form
It immediately found the primary relation , which we already knew. After incorporating that identity into the search, it then found the next (again well-known) identity , where . (This is your sporadic rule 7 in the list you posted on Nov. 5 at 12:48 PM). The next few identities are of the same form---things like or slightly more complicated things like . I'm still thinking about the best way to incorporate these types of relations into the framework before pushing further. (The trouble is that we want to treat as different than , even though they become the same after replacing with a square.) I'll update everyone when I get further.
Terence Tao (Nov 07 2024 at 22:04):
Can you run the same exercise for 1323 along the lines of what I indicated above? It might be somewhat simpler. I'll make a comment over at the 1323 thread about this.
Pace Nielsen (Nov 08 2024 at 15:27):
I may be able to run the same exercise for 1323, once I've figured out 1516 fully. I did figure out a nice way to handle the identity , and am now double-checking things, and then will run a final search over generic pairs.
Pace Nielsen (Nov 08 2024 at 23:01):
Here is the full report from the automated search on law 1516.
My paradigm for running the search was that instead of thinking of as (when possible) the expression in the free magma, and instead of writing it as , I instead treated it as a new "function" following the Kisielewicz ansatz. [This is really just a renaming procedure, but it was helpful for my own understanding.]
So, for example, is a new function, and is a new function, and is a new function, and so forth. After this point I just started calling the new functions . Some of the other important functions that showed up are , , , and finally .
Now, the 1516 law is . I thought about it in the following way. Once we know that is defined (either as a new function or a composition of previously defined functions), then we generally want to define as a brand-new function , and then we must take , which is not a new function. So each defined product generally gives rise to two new defined products, both involving the new function in some way.
Let me show how this worked in practice. Suppose we have derived the equality . From this, we want to take (a brand-new function), and then we must also take .
There were two dangers in this. First, when is defined, it might also be the case that is also already (previously) defined, so we cannot adjoin a new function (i.e. we have to use the previously defined answer). This happened exactly five times (involving those five special functions above). For example, we had the equality . We can't define then as a new function, because we already had it equals .
The second danger was that when defining as , it might also already have a different definition; so there could be collisions, where the expressions don't match.
The first collision it found was . To accommodate this issue, I had to modify the composition of functions to always concentrate sequential powers of and together. The next collision it found was that . This causes problems with some new functions like , because then . After adding the rules for these new types of collisions (essentially adding a new, third infinite family of reduction rules) there were no other collisions that the program found.
There is one other issue I want to think about, but this appears to capture the full reduction system.
Pace Nielsen (Nov 09 2024 at 01:05):
Actually, there might be more to the reduction system---I'll try to finish it off tomorrow.
Terence Tao (Nov 09 2024 at 01:44):
This looks provisionally promising, and perhaps simpler than what I was proposing. Look forward to seeing the final analysis!
Terence Tao (Nov 09 2024 at 02:18):
Are the problems involving and significant enough that it is worth adding additional rules such as requiring to be constant, or do you think this does not make the anaysis substantially simpler?
Pace Nielsen (Nov 09 2024 at 20:55):
The previous runs of the automated searcher checked for closure under the obvious adjoining process: starting from , adjoin the new products and in the greediest manner possible. Beginning with just the seed , it closed up.
So, next I extended the search to guarantee that if was going to be handled, I had already handled the simpler product . The automated process then found a third collision, which was number 9 on Terry's list:
So, this rule is not automatic assuming and under greedy extension. I'll start thinking about how to implement this new relation, as well as whether taking constant would help.
Terence Tao (Nov 09 2024 at 21:01):
My thought was to impose as a new axiom.
Pace Nielsen (Nov 09 2024 at 21:02):
Let me see if Vampire says that allows 255 to fail. It would certainly be VERY easy to implement in the searcher.
Pace Nielsen (Nov 09 2024 at 21:05):
Of course Vampire says it is fine, because we already knew that it says left-cancellation is fine.
Terence Tao (Nov 10 2024 at 01:01):
I'm thinking to explore more the possibility, floated by @Bruno Le Floch over at the 1323 thread, to adapt the recent success over 1323 here, by trying to work out what's going on on the square numbers first and by imposing ATP-approved simplifying axioms, particularly on the square numbers, to make the algebraic constraints easy to solve. As Bruno pointed out, there is a new difficulty here because the target law 255 also involves square numbers, but nevertheless I think we should probe the strategy.
Perhaps we can start by imposing our previous axiom of constant cubes on our hypothetical counterexample ; I will write this axiom here as (instead of ). I'd also like to assume that the set of squares is closed under the magma operation, hopefully an ATP can confirm that this is a likely safe assumption in that it doesn't accidentally imply 255? Then we have a submagma of that obeys a lot of laws, including 255, , and perhaps some others. Are there more simplifying axioms we can put on either or that don't accidentally prove 255 for ? (For instance, perhaps we could make a unit for , since it is already a left-unit by 255. Another possible simplifying assumption is to have for some small which was also a situation we enjoyed in 1323 where we could impose .) This may suggest a good choice for what multiplication table to put on . Based on the 1323 experience I suspect an infinite carrier for may be prudent.
Is it consistent to make invertible? Then the 1516 law becomes , which would be similar to the simplified form we had for 1323. It hints that we don't need to think about what the multiplication table should be when is not a square, we should be able to fill it out greedily with an infinite carrier. (In fact, we might even be able to fill it out just assuming surjective.) We just have to work out the portion when is a square and is not a square, in such a way that we don't accidentally make for all non-squares . What constraints on this multiplication table do we have?
Andy Jiang (Nov 10 2024 at 01:48):
At some point I think i checked that vampire says squares being closed under the operation implies 255 (at least assuming C=0 and left cancellation) let me check again without those assumptions and report back (also to make sure I'm not misremembering)
Matthew Bolan (Nov 10 2024 at 01:49):
Without left cancellation I think my vampire is telling me that is fine.
Matthew Bolan (Nov 10 2024 at 01:50):
My vampire is fine with most of Terry's assumptions, but when I add being invertible it tends to find a proof of 255 (not sure what subset is doing it here), and when I try and add for small n it also tends to find a proof of 255
Terence Tao (Nov 10 2024 at 01:51):
How about if is two-valued? (One may then also need to be two-valued).
Terence Tao (Nov 10 2024 at 01:51):
Or if is injective just on the squares (i.e. is left-cancellative even if isn't)
Matthew Bolan (Nov 10 2024 at 01:59):
The latter seems fine. Not yet checked the former. Here's my TPTP
fof(def_S, axiom,
s(X)=m(X,X)
).
fof(def_C, axiom,
c(X)=m(s(X),X)
).
fof(eq1516,axiom,
X = m(s(Y),m(X,m(X,Y)))
).
fof(cconst, axiom,
c(X) = c(Y)
).
fof(smclosed, axiom,
?[Z] : m(s(X),s(Y)) = s(Z)
).
fof(leftcancelonsq, axiom,
(Z=Y) | (m(s(X),s(Z)) != m(s(X),s(Y)))
).
fof(eq255, conjecture,
X = m(c(X),X)
).
Matthew Bolan (Nov 10 2024 at 02:23):
Adding being two valued to the above TPTP (so in particular still assuming SM is left-cancellative), vampire can prove 255 for . The amount of time it takes gets longer as I increase .
Terence Tao (Nov 10 2024 at 02:29):
Hmm, interesting. Ok, getting a bit desperate here... what if we require that is three-valued for some small and no other assumptions?
(I'm really hoping we can make the image of somewhat small, as this suggests a way to build a magma by first working the multiplication table on the image of , then the image of , and so forth back to the full magma.)
Matthew Bolan (Nov 10 2024 at 02:30):
So just 1516 and S^n being three valued ?
Terence Tao (Nov 10 2024 at 02:31):
yeah. It could be that this problem is in fact immune not just to finite magmas, but to any magma that eventually squares to a finite set, and maybe this three-element experiment will support that.
Matthew Bolan (Nov 10 2024 at 02:36):
With no other assumptions it's not finding a proof of 255, even with being 2-valued.
Terence Tao (Nov 10 2024 at 02:37):
Oh okay! Now we're getting somewhere. OK... what about 2-valued, and squares closed under multiplication? If that works, then we should start studying magmas with 2-valued squares as possible candidates for that we can later extend to .
Matthew Bolan (Nov 10 2024 at 02:40):
No proof of 255 from that in a minute long run.
Terence Tao (Nov 10 2024 at 02:48):
I guess what I am concluding from this is that left cancellability is perhaps not a good thing to want to impose. 1516 is the assertion that . This will be less constraining on if we allow to be multi-valued.
I think I'm done with ATP requests for now, thanks so much for your help!
Matthew Bolan (Nov 10 2024 at 02:49):
Actually, I have one more ATP observation, which is that if I ask for itself to be 2-valued it doesn't seem to be finding a proof.
Terence Tao (Nov 10 2024 at 02:50):
Wow, I thought we had ruled that out already
Matthew Bolan (Nov 10 2024 at 02:51):
Yeah, makes me suspicious, but perhaps that was under some standing assumption.
Matthew Bolan (Nov 10 2024 at 02:51):
I'll post the TPTP once this latest run finishes
Terence Tao (Nov 10 2024 at 02:52):
Yeah you had previously ruled that out at https://leanprover.zulipchat.com/#narrow/channel/458659-Equational/topic/1516.20-.3E.20255/near/480260405
Matthew Bolan (Nov 10 2024 at 02:53):
Hmm, bug hunting time then.
Terence Tao (Nov 10 2024 at 02:55):
Things would be quite simple if we could have . There are now just three rules:
- .
- If and , then .
- If and , then .
This looks relatively easy to fill out greedily if we choose and reasonably well (not too injective - having infinite preimages, in particular, would give us a lot of useful flexibility). If there is a chance that we can get a 255 counterexample this way then we should pursue it.
Terence Tao (Nov 10 2024 at 03:06):
If anyone else has the ability to do an independent ATP run of the same question "does plus 1516 imply 255?" that would be great.
Pace Nielsen (Nov 10 2024 at 03:12):
When I ran the following check for a 2-valued S in Vampire, it implied 255 (by a fairly long proof).
fof(eq1516, axiom, X = mul(mul(Y, Y), mul(X, mul(X, Y)))).
fof(sdef, axiom, s(X) = mul(X, X)).
fof(twosquares, axiom, s(X) != e => s(X) = f).
fof(255, conjecture, X = mul(mul(mul(X, X), X), X)).
Matthew Bolan (Nov 10 2024 at 03:16):
Huh, I didn't find a proof with mine running that file, perhaps I just am not running these long enough.
Matthew Bolan (Nov 10 2024 at 03:22):
Wow, looks like my minute cutoff was just barely too short on my machine. I now can replicate.
Matthew Bolan (Nov 10 2024 at 03:22):
I guess we should run these for quite a while before we commit a lot of effort to some ansatz :sweat_smile:
Terence Tao (Nov 10 2024 at 03:23):
I'm going to try to figure out a pen and paper proof
Matthew Bolan (Nov 10 2024 at 03:29):
Prover 9's proof that all such things are trivial seems possible to understand:
proof.txt
Terence Tao (Nov 10 2024 at 03:32):
Hmm. I think it's related to the fact that there are no order 2 magmas solving 1516. The smallest we have is of order 5. Probably ATPs are not going to give us guidance as to whether 5-valued squares are possible, but one could imagine postulating that SM is one of the order 5 models (or the order 7 model) and going from there.
Matthew Bolan (Nov 10 2024 at 03:43):
Matthew Bolan said:
Actually, I have one more ATP observation, which is that if I ask for itself to be 2-valued it doesn't seem to be finding a proof.
rerunning the TPTP from here for longer and it finds a counterexample after a few minutes, so it looks like I never had a bug, I just didn't run it for long enough. Replacing 255 with just the singleton law X=Y it finds a proof in just 7 seconds.
Matthew Bolan (Nov 10 2024 at 03:44):
Sorry about all that :sweat_smile:
Terence Tao (Nov 10 2024 at 03:55):
Well, it's good for the project to get practical tips on how to use ATPs accurately!
Matthew Bolan (Nov 10 2024 at 03:56):
Definitely one thing I've just learned is to also check if vampire can prove the singleton law from your axioms.
Matthew Bolan (Nov 10 2024 at 03:59):
I'm pretty shocked the difference was so severe (especially compared with how fast prover9 solves both versions).
Terence Tao (Nov 10 2024 at 04:01):
I think I now understand what is going on from a theory point of view. Specifically,
Theorem. Let be a 1516 magma. If is finite (i.e., only finitely many squares), then squaring is surjective . In particular, is finite and obeys 255.
Proof. Let be a square, then we know is surjective from the 1516 equation . Hence, for any , we can write for some , then from 1516 we have . Hence , hence maps bijectively to by finiteness. Thus for any , we have , i.e., takes values in the squares; but is surjective, hence .
So now I am going to try taking to be an infinite model (e.g. the product of infinitely many copies of the order 5 model) and see if I can then greedily construct a 255 counterexample from that ansatz.
Matthew Bolan (Nov 10 2024 at 04:01):
Vampire has just informed me after a nearly 8 minute run that being two valued implies 255 (and a second run reveals it implies the singleton law), so we can throw away that earlier claim too.
TPTP for that (stealing from Pace)
fof(eq1516, axiom, X = mul(mul(Y, Y), mul(X, mul(X, Y)))).
fof(sdef, axiom, s(X) = mul(X, X)).
fof(twosquares, axiom, s(s(X)) != e => s(s(X)) = f).
fof(255, conjecture, X = mul(mul(mul(X, X), X), X)).
Terence Tao (Nov 10 2024 at 04:02):
Yeah, I'm pretty much migrating to models with infinitely many squares (or iterated squares) now.
Matthew Bolan (Nov 10 2024 at 04:34):
Matthew Bolan said:
The latter seems fine. Not yet checked the former. Here's my TPTP
fof(def_S, axiom, s(X)=m(X,X) ). fof(def_C, axiom, c(X)=m(s(X),X) ). fof(eq1516,axiom, X = m(s(Y),m(X,m(X,Y))) ). fof(cconst, axiom, c(X) = c(Y) ). fof(smclosed, axiom, ?[Z] : m(s(X),s(Y)) = s(Z) ). fof(leftcancelonsq, axiom, (Z=Y) | (m(s(X),s(Z)) != m(s(X),s(Y))) ). fof(eq255, conjecture, X = m(c(X),X) ).
A 10 minute run from vampire did not prove 255 starting from this. Prover9 also seems to be unable to prove 255 from it, and Mace4 finds that the order 7 model meets the axioms (but of course does not refute 255).
Terence Tao (Nov 10 2024 at 05:02):
I ended up not being able to build a non-trivial model in which was say the infinite product of the order 5 magma due to a modification of the argument when was finite. Part of the problem was that this infinite product was right cancellative. Do we know if any 1516 model which is not right cancellative?
Terence Tao (Nov 10 2024 at 05:07):
Oh wait, we have a translation invariant model! This could be promising.
Terence Tao (Nov 10 2024 at 06:16):
OK, I think I have a plan to create a 1516 magma that contradicts 255; it's a fairly tricky construction, will try to write a more detailed blueprint soon. It will contain a submagma , the set of squares, on which simplicity I will take to be the identity (so I am assuming in the overall magma). I will use to refer to elements of (squares), and for elements of (non-squares). Both sets will be infinite, but the latter is somewhat "larger" than the former in some sense. The multiplication table is built in three steps: first on (square times square), then between and (square times non-square), and finally between and (non-square times arbitrary).
Step 1: Construct a 1516 magma with which for all , and with the additional property that for all there are at least two such that (thus the right multiplication operations are doubly surjective).
Construction (sketch): I think one can do this with the integers as carrier and a translation-invariant model with to get the law, by modifying the existing greedy construction. I'll detail this in the blueprint version.
With this step, the left multiplication operators are defined as functions from to . By construction, we have for all . Also, as all elements of are squares, the are surjective for all .
Step 2: Construct a set of non-squares (disjoint from ), a squaring map , and extensions of the left multiplication operators for each , such that for all and . Furthermore: for each and , the set is infinite. (Thus, all the are "infinitely surjective" on the non-squares.)
Construction: The non-squares will be tuples of the form where and is a natural number. The squaring map will be . For each and each , we can use step 1 to find distinct from such that ; clearly the are distinct as varies. We now have . So if we define , then we have whenever is of the form for some . But we still have undefined if is not of this form, in particular, is currently not defined.
Now we greedily fill in the rest of the functions. If is some element of (so that ) and is not yet defined, we can use the surjectivity of the to write for some , then set ; then we have as required. Also, if we want to write as for some where is not defined, we can perform the previous construction if necessary to have for some , then find such that , and set where is large enough that has not yet been assigned. Then set , then , and we have created a preimage in . Iterating these procedures in a suitable order, we can obtain the claim.
Step 3: Construct for non-squares , such that and for all , but such that the 255 identity fails at least once.
Construction (Sketch): The main constraint here is , or equivalently that if $$L_x y $ equals , then must lie in the preimage . Because these preimages are infinite, we can starting with the requirement , we can then greedily fill out (ensuring that for all to avoid collisions), and there is more than enough freedom to ensure that
fails at least once.
Jose Brox (Nov 10 2024 at 08:38):
In my experience, 1 minute is sometimes not enough. I now run Prover9 for 20 minutes when the implication is relevant, just to be sure (I have been surprised once!).
Jose Brox (Nov 10 2024 at 10:12):
Terence Tao ha dicho:
Well, it's good for the project to get practical tips on how to use ATPs accurately!
From the collective experience here, one thing I think that we can say is that using both Prover9 and Vampire is proving useful: while Prover9 may be sometimes faster for equational proving (and perhaps finds simpler proofs), for more complicated first-order formulas Vampire can have a big advantage. In the 1323 case, Prover9 couldn't rule out the possibility of having only two squares after 30 minutes (at least in my machine and with high resources), while Vampire did it in 30s (in other computers). Apparently this was related to Vampire resorting to a SAT solver.
Jose Brox (Nov 10 2024 at 10:31):
Terence Tao ha dicho:
OK, I think I have a plan to create a 1516 magma M that contradicts 255
Is there anything in this strategy that prevents using it for showing that 1729 doesn't imply 817? We'd just need to swap from
to .
Terence Tao (Nov 10 2024 at 14:55):
I've added a note in the plan of the paper to discuss the topic on the practical use of ATPs including the point you made above about taking advantage of the complementary strengths of different ATPs. It occurs to me that actually we have constructed a pretty good data set for benchmarking ATP performance.
I am focusing in writing up the 1516 strategy for the blueprint. Given that the strategy was derived from a similar successful strategy for 1323 that already handled 1729 -> 917, I wouldn't be surprised if there was some way to tweak it to handle the last remaining implication. But we should coordinate with @Shreyas Srinivas on this. (That is nominally the last implication outstanding, but actually there is a cluster of implications from 713, 1289, 1447 that we might want to visit because the only proofs we have right now are large greedy algorithms that involve a SAT-solver calculation that we have not figured out how to replicate within Lean in a civilized fashion.)
Terence Tao (Nov 10 2024 at 16:23):
I wrote up the previous arguments in the blueprint. I've also opened up equational#820 to formalize it. However it may be worth trying to see if some simplifications are possible: right now the argument involves no less than three separate greedy algorithms, first to construct the squares x squares portion of the multiplication table, then squares x nonsquares, then finally nonsquares x everything. The previous ATP-powered discussions seemed to indicate however that some complexity of this sort may be inevitable, but perhaps there are some other simplifications.
Andy Jiang (Nov 11 2024 at 09:17):
Could I ask how you came up with the form that the non-squares should take? Like ZxZxN? Is there a particular constraint which this form helps on? Just want to understand your solving process thanks!
Terence Tao (Nov 11 2024 at 15:13):
Sure! My starting point was to view the equation 1516 as a constraint between the various left shift maps , and to first see what conditions they impose on the left shift maps for in the squares . Here I assume that the magma operation on has already been defined, so the question is what does on the non-squares, which at present are a completely unknown set. In order to fill out the multiplication table on the non-squares later, also know that I will need these to be surjective, and for the greedy algorithm to work I would in fact like them to really surjective, with every element having infinitely many preimages, not just at least one.
From the analysis of the case when was finite, I extracted the following argument. Let be some non-square, and let be a square. We know that is surjective, hence for some . Writing , we obtain . Thus, each non-square comes with a function from squares to squares with the property that . This function is clearly injective. When the squares are finite, this forces the function to be surjective, and now this is a problem because this has completely determined all of the left shift maps on squares, and forced them to also take values in the squares, which by surjectivity means that there are no non-squares! So we need infinitely many squares, and we also need the function to omit at least one element in its range.
So, every non-square now should come with two important square numbers - its own square , which I denoted , and an auxiliary square , for which is not automatically defined by the rule and can be free to assign to help fulfill requirements such as the (infinite) surjectivity of . For greedy algorithm purposes it would be nice to have a lot of s for each possible choice of pair . This is what led me to choose the space of triples as the carrier for the non-squares, with representing the square and thus defining the squaring map: .
The map , which partially defines the left multiplication maps via the formula , will be required to avoid . It also has to solve the equation . On comparing this equation with for all squares (here taking advantage of the property for squares), it becomes natural to choose so that , or equivalently . But it also has to avoid , so it would be nice if map was at least doubly surjective so that we can always pick a point in its preimage that is not . This was when I asked if there were non-right-cancellative models for 1516 known, and discovered that @Pace Nielsen had basically already built one using the translation-invariant method.
After verifying that this model could produce doubly surjective right multiplication maps, I now had a construction for for each . So now I still had some entries of the multiplication table to fill in: each had undefined, together with perhaps some other values as well. The main constraint to fulfill here to select a value of was to ensure that (a) was infinitely surjective, and (b) the equation (from 1516) was obeyed.
For (b) I originally had a somewhat complicated greedy algorithm approach in which would be set to some previously unused non-square for which was undefined, but I realized that it would be easier to set it to a suitable square in that obeyed a compatibility condition. This initial technique was however very suited for attaching a pre-image of to ; again one has to verify a compatibility condition, but it was straightforward to figure out the constraints and see that they could be solved.
Pace Nielsen (Nov 12 2024 at 23:25):
I was very happy to see Terry's recent solution to 1516->255!! :tada:
Although it is now moot, I figured I might as well finish my automated search for other relations. Although there is still the strong possibility that my program has some bugs in it, I've run it for a day, and after imposing the equalities , , and
(as well as the obvious rule coming from 1516 itself) it didn't find any new needed rules. I'll let it run for one more day, just in case something pops up.
My feeling after writing the program is that acts somewhat like a "zero-divisor" in a ring; I think of it as behaving something like (but without the subtraction). [Note: The displayed rule is not strictly necessary in the free 1516 magma on one symbol, and so without imposing it, then there could be some other zero-divisor-like behavior, coming from annihilating things on the left.]
Jose Brox (Nov 14 2024 at 01:49):
Pace Nielsen ha dicho:
I'll let it run for one more day, just in case something pops up.
How did this go? Did you finish the run?
Pace Nielsen (Nov 14 2024 at 01:55):
I made a sort of "kludge" to get around some weird behaviors, and it broke the program eventually. So, nothing new popped up before the program crashed.
Bernhard Reinke (Nov 15 2024 at 13:18):
I am trying to extend my (noncommutative) translation invariant 1516 !=> 1489 construction (see equational#831) to a "base" for the construction as outlined in the blueprint. I haven't fully digested the proof, but I think in Lemma 14.3 one has to exclude the case h = 0
and following that the case a=b
in 14.4. Is this a problem down the line?
Terence Tao (Nov 15 2024 at 15:19):
Oof, this is indeed necessary since if then from 1515 so one can't have two solutions. I think I can fix this by removing the diagonal from the non-squares. Will work on this now.
Terence Tao (Nov 15 2024 at 15:41):
Grr, this is a non-trivial issue.
Terence Tao (Nov 15 2024 at 16:55):
Okay, I think I fixed it in equational#837, but it was harder than I expected, in particular I need to strengthen the base construction properties (and rework the "squares x non-squares" constructions) to make the argument work. In particular the base magma now needs some non-trivial solutions to the equation , but fortunately this is easily achieved by adding such solutions to the starting seed.
Bernhard Reinke (Nov 16 2024 at 13:59):
Bernhard Reinke said:
I am trying to extend my (noncommutative) translation invariant 1516 !=> 1489 construction (see equational#831) to a "base" for the construction as outlined in the blueprint. I haven't fully digested the proof, but I think in Lemma 14.3 one has to exclude the case
h = 0
and following that the casea=b
in 14.4. Is this a problem down the line?
I finished the "base" construction. I want to have a look at another translation invariant proof instead of doing the rest of the 1516 -> 255, but it should be OK to black box the base construction
Last updated: May 02 2025 at 03:31 UTC