Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: Sign Up for the Aristotle API!


(deleted) (Oct 08 2025 at 18:20):

#announce > Sign Up for the Aristotle API!

I knew this day would come. GPT-5 is already quite good.

(deleted) (Oct 08 2025 at 18:21):

Has anyone tried the API?

Alex Meiburg (Oct 08 2025 at 18:29):

If you're curious to see the kind of 'raw' outputs I've gotten from the API: 1, 2 -- those got cleaned up and merged to Mathlib -- and 3, 4 that I would like to eventually clean up and PR.

(deleted) (Oct 08 2025 at 18:34):

How long does it take to run

(deleted) (Oct 08 2025 at 18:34):

I care a lot about speed

(deleted) (Oct 08 2025 at 18:34):

Because GPT-5 can already do a lot it's just slow

Lenny Taelman (Oct 08 2025 at 19:01):

Vikram Shanker said:

[...] The API is designed to complete the sorries in your project - here are few more details: [...]

Great timing, we are having the SorryDB Hackathon in Amsterdam right now! We would be excited to test Aristotle against >3000 real-world sorries!

Chris Hughes (Oct 08 2025 at 19:12):

It proved the formula for the volume of a simplex with vertices at the origin and each of the basis vectors of R^n. I was impressed by that; I tried myself and didn't manage.

Eric Wieser (Oct 08 2025 at 19:17):

Volume in the measure theoretic sense?

Jared green (Oct 08 2025 at 19:46):

will we get periodic updates on benchmark performance?

Yury G. Kudryashov (Oct 08 2025 at 20:40):

See also #mathlib4 > Discussion thread for Aristotle API for a thread targeted at users who don't care about AI systems development.

Eric Rodriguez (Oct 08 2025 at 21:24):

just want to note that we also released our tech report on https://arxiv.org/abs/2510.01346 :)

Kevin Buzzard (Oct 08 2025 at 21:35):

Eric Rodriguez said:

just want to note that we also released our tech report on https://arxiv.org/abs/2510.01346 :)

Reference 42 you have "freiman ruzsa" presumably because of TeX's arcane biblio system. Try {F}reiman--{R}uzsa in the bibtex?

Kevin Buzzard (Oct 08 2025 at 21:36):

PS congratulations on becoming the first tech company to actually write a paper about a computer trying an IMO competition, rather than just a blog post! [edit: I'm wrong, apparently this honour goes to Seed-Prover]

Yury G. Kudryashov (Oct 08 2025 at 21:37):

AFAIK, there are tech reports from other companies too. They just forgot to announce it here.

Jason Rute (Oct 09 2025 at 02:04):

@Kevin Buzzard here is the tech report of Seed Prover which got a silver (or gold if given more time) in the IMO. https://arxiv.org/abs/2507.23726. I thought @Thomas Zhu shared it here on Zulip, but maybe I’m mistaken.

Thomas Zhu (Oct 09 2025 at 02:08):

The Seed-Prover paper was shared to #announce here: https://leanprover.zulipchat.com/#narrow/channel/113486-announce/topic/Seed.20Prover.20Achieves.20Silver-Level.20Score.20at.20IMO.202025/near/532198800.

Eric Wieser (Oct 09 2025 at 04:54):

Eric Wieser said:

Volume in the measure theoretic sense?

Could you share the statement, Chris?

Chris Hughes (Oct 09 2025 at 07:06):

theorem problem {n : } :
    MeasureTheory.volume.real (convexHull  ({(0 : Fin n  )} 
      Set.range (fun i : Fin n => Pi.single i 1))) =
      1 / n.factorial

Dhyan Aranha (Oct 09 2025 at 07:28):

It looks like Dan Halpern-Leistner's name is misspelled in the arxiv paper announcement.


Last updated: Dec 20 2025 at 21:32 UTC