Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: using/training llms to call powerful tactics


Jared green (Apr 02 2025 at 21:40):

what would it take to build an ai agent for theorem proving that will actually use tactics more powerful than the 'elementary' ones?

Jared green (Apr 02 2025 at 21:45):

specifically, synthesizing subgoals and then placing a power tactic that might close them

Eric Wieser (Apr 02 2025 at 21:51):

Does use H$ D.2.2 (φ _ *(L+1)-1) (L.le_sub_of_add_le (by nlinarith[((b.1* b.2+1).totient_pos).2 Nat.succ_pos']))▸(((Nat.dvd_gcd) ( this).1)) this.right count as a "powerful tactic"?

Jared green (Apr 02 2025 at 21:55):

no, the short ones like aesop, duper...

Jared green (Apr 02 2025 at 21:56):

the ones intended to be general

Jason Rute (Apr 02 2025 at 22:52):

Thor does this in Isabelle by calling Sledgehammer as a tactic more or less. Draft-Sketch-Prove (and successors) is even closer to what you want. It uses an LLM to write a proof sketch and uses SledgeHammer to fill in the holes in the sketch.

Jason Rute (Apr 02 2025 at 22:54):

But AlphaProof also synthesized fairly advanced subgoals. I think that is more important than whether or not those subgoals can be filled in with a single tactic call.

GasStationManager (Apr 02 2025 at 23:53):

I believe the solution is to have a library of prompt instructions that covers the Lean features that are not well represented in training data. I have a very primitive version in Leantool, where the system prompt introduced the LLM to features like omega, exact?, aesop, and LeanSearchClient's #moogle. This will require additional prompt engineering efforts to make it work well, e.g. adding examples.
If you have (or can write) a good prompt that teaches a Lean feature, let me know and I'd be happy to add it to LeanTool..

Jared green (Apr 03 2025 at 05:44):

i have no prompt engineering 'skills', i sort of expected something that would be trained from the start with that in mind. if you get a pretrained model to do it, i think you would just add the new power tactic to a list of mcp templates.

Leni Aniva (Apr 07 2025 at 18:28):

given how existing agents work you either need to incentivize exploration or execute the tactic when some mechanically checkable condition is met


Last updated: May 02 2025 at 03:31 UTC