Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: chatgpt proved a theorem in 6 minutes and 20 seconds

Bulhwi Cha (Feb 23 2026 at 17:21):

/-- `isMin_below min a` means that `min` is a minimal element of the set `{y : α | r y a}` with
respect to `r`. -/
def Relation.isMin_below {α : Sort u} (r : α → α → Prop) (min a : α) : Prop :=
  r min a ∧ ∀ ⦃y : α⦄, r y a → ¬r y min

open Relation

/-- If `a` is accessible through a binary relation `r`, then for every descending chain `f` starting
from `a`, it is not false that `f` ends at a minimal element of the set `{y : α | r y a}` with
respect to `r`.  -/
theorem Acc.not_not_descending_chain_ends_at_min_of_acc {α : Sort u} {r : α → α → Prop} {a : α}
    (acc : Acc r a)
    {f : Nat → α}
    (hsta : f 0 = a)
    (hcon : ∀ ⦃n : Nat⦄, isMin_below r a (f n) → f (n + 1) = f n)
    (hdes : ∀ ⦃n : Nat⦄, ¬isMin_below r a (f n) → r (f (n + 1)) (f n)) :
    ¬¬∃ (c : Nat), isMin_below r a (f c) ∧ ∀ ⦃m : Nat⦄, c ≤ m → f m = f c :=
  sorry

ChatGPT 5.2 Thinking proved the original version of the above theorem in 6 minutes and 20 seconds; I spent 11 hours, 11 minutes, and 45 seconds to prove it.

ChatGPT 5.2 Thinking's proof: https://paste.sr.ht/~chabulhwi/5cc4cf5c4cd6f84c20df426d5e4f4044dbf0fadb
My proof: https://git.sr.ht/~chabulhwi/tpil-solutions/tree/ba60de58f16f4ec0400d8ba9e388984ee661d175/item/TPIL/Chapter08/Accessibility.lean#L248

I dropped out of Korea Aerospace University in 2023, and I don't know much about undergraduate mathematics, although I've been using the Lean theorem prover for four years.

I'm not sure whether I'll be able to prove undergraduate-level theorems as fast as the state-of-the-art AI agents, even after I become more knowledgeable about undergraduate mathematics.

However, I'll keep proving theorems by myself for the following reasons:

While trying to prove a theorem, I find out which lemmas are important for achieving my goal.
I understand a theorem much better after I prove it without looking at an AI agent's proof.
Reviewing an AI agent's proof isn't fun.

Still, it's impressive that GPT-5.3-Codex, Claude Sonnet 4.6, Claude Opus 4.6, and Gemini 3.1 Pro can write Lean 4 code to prove basic theorems about induction and recursion. Personally, I don't want to pay money for using these models, so I'll try to find ways to use them for free.

Thomas Waring (Feb 23 2026 at 23:35):

perhaps interestingly the gpt proof uses Classical.choice (and Quot.sound) where yours does not, which i assume was intended judging from the double negation

Bulhwi Cha (Feb 24 2026 at 01:23):

That's right. I had a feeling that I could write a constructive proof of some modified version of the theorem I intended to prove.

Last updated: Feb 28 2026 at 14:05 UTC