Zulip Chat Archive
Stream: Machine Learning for Theorem Proving
Topic: Sign Up for the Aristotle API!
(deleted) (Oct 08 2025 at 18:20):
#announce > Sign Up for the Aristotle API!
I knew this day would come. GPT-5 is already quite good.
(deleted) (Oct 08 2025 at 18:21):
Has anyone tried the API?
Alex Meiburg (Oct 08 2025 at 18:29):
If you're curious to see the kind of 'raw' outputs I've gotten from the API: 1, 2 -- those got cleaned up and merged to Mathlib -- and 3, 4 that I would like to eventually clean up and PR.
(deleted) (Oct 08 2025 at 18:34):
How long does it take to run
(deleted) (Oct 08 2025 at 18:34):
I care a lot about speed
(deleted) (Oct 08 2025 at 18:34):
Because GPT-5 can already do a lot it's just slow
Lenny Taelman (Oct 08 2025 at 19:01):
Vikram Shanker said:
[...] The API is designed to complete the sorries in your project - here are few more details: [...]
Great timing, we are having the SorryDB Hackathon in Amsterdam right now! We would be excited to test Aristotle against >3000 real-world sorries!
Chris Hughes (Oct 08 2025 at 19:12):
It proved the formula for the volume of a simplex with vertices at the origin and each of the basis vectors of R^n. I was impressed by that; I tried myself and didn't manage.
Eric Wieser (Oct 08 2025 at 19:17):
Volume in the measure theoretic sense?
Jared green (Oct 08 2025 at 19:46):
will we get periodic updates on benchmark performance?
Yury G. Kudryashov (Oct 08 2025 at 20:40):
See also #mathlib4 > Discussion thread for Aristotle API for a thread targeted at users who don't care about AI systems development.
Eric Rodriguez (Oct 08 2025 at 21:24):
just want to note that we also released our tech report on https://arxiv.org/abs/2510.01346 :)
Kevin Buzzard (Oct 08 2025 at 21:35):
Eric Rodriguez said:
just want to note that we also released our tech report on https://arxiv.org/abs/2510.01346 :)
Reference 42 you have "freiman ruzsa" presumably because of TeX's arcane biblio system. Try {F}reiman--{R}uzsa in the bibtex?
Kevin Buzzard (Oct 08 2025 at 21:36):
PS congratulations on becoming the first tech company to actually write a paper about a computer trying an IMO competition, rather than just a blog post! [edit: I'm wrong, apparently this honour goes to Seed-Prover]
Yury G. Kudryashov (Oct 08 2025 at 21:37):
AFAIK, there are tech reports from other companies too. They just forgot to announce it here.
Jason Rute (Oct 09 2025 at 02:04):
@Kevin Buzzard here is the tech report of Seed Prover which got a silver (or gold if given more time) in the IMO. https://arxiv.org/abs/2507.23726. I thought @Thomas Zhu shared it here on Zulip, but maybe I’m mistaken.
Thomas Zhu (Oct 09 2025 at 02:08):
The Seed-Prover paper was shared to #announce here: https://leanprover.zulipchat.com/#narrow/channel/113486-announce/topic/Seed.20Prover.20Achieves.20Silver-Level.20Score.20at.20IMO.202025/near/532198800.
Eric Wieser (Oct 09 2025 at 04:54):
Eric Wieser said:
Volume in the measure theoretic sense?
Could you share the statement, Chris?
Chris Hughes (Oct 09 2025 at 07:06):
theorem problem {n : ℕ} :
MeasureTheory.volume.real (convexHull ℝ ({(0 : Fin n → ℝ)} ∪
Set.range (fun i : Fin n => Pi.single i 1))) =
1 / n.factorial
Dhyan Aranha (Oct 09 2025 at 07:28):
It looks like Dan Halpern-Leistner's name is misspelled in the arxiv paper announcement.
Last updated: Dec 20 2025 at 21:32 UTC