Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: Aristotle achieves SOTA 96.8% proof generation on VERINA

Vikram Shanker (Dec 03 2025 at 20:31):

We, at Harmonic, are happy to present our latest benchmark result! Evaluating Aristotle on the verification part of the VERINA benchmark, we have achieved 96.8%, proving 160 of the statements and disproving 23 of them by finding a counterexample. More details on this in our blog post at harmonic.fun/news.

Notification Bot (Dec 04 2025 at 08:35):

9 messages were moved from this topic to #Machine Learning for Theorem Proving > Aristotle writing rust? by Johan Commelin.

Last updated: Feb 28 2026 at 14:05 UTC