Zulip Chat Archive
Stream: Machine Learning for Theorem Proving
Topic: using/training llms to call powerful tactics
Jared green (Apr 02 2025 at 21:40):
what would it take to build an ai agent for theorem proving that will actually use tactics more powerful than the 'elementary' ones?
Jared green (Apr 02 2025 at 21:45):
specifically, synthesizing subgoals and then placing a power tactic that might close them
Eric Wieser (Apr 02 2025 at 21:51):
Does use H$ D.2.2 (φ _ *(L+1)-1) (L.le_sub_of_add_le (by nlinarith[((b.1* b.2+1).totient_pos).2 Nat.succ_pos']))▸(((Nat.dvd_gcd) ( this).1)) this.right
count as a "powerful tactic"?
Jared green (Apr 02 2025 at 21:55):
no, the short ones like aesop, duper...
Jared green (Apr 02 2025 at 21:56):
the ones intended to be general
Jason Rute (Apr 02 2025 at 22:52):
Thor does this in Isabelle by calling Sledgehammer as a tactic more or less. Draft-Sketch-Prove (and successors) is even closer to what you want. It uses an LLM to write a proof sketch and uses SledgeHammer to fill in the holes in the sketch.
Jason Rute (Apr 02 2025 at 22:54):
But AlphaProof also synthesized fairly advanced subgoals. I think that is more important than whether or not those subgoals can be filled in with a single tactic call.
GasStationManager (Apr 02 2025 at 23:53):
I believe the solution is to have a library of prompt instructions that covers the Lean features that are not well represented in training data. I have a very primitive version in Leantool, where the system prompt introduced the LLM to features like omega, exact?, aesop, and LeanSearchClient's #moogle. This will require additional prompt engineering efforts to make it work well, e.g. adding examples.
If you have (or can write) a good prompt that teaches a Lean feature, let me know and I'd be happy to add it to LeanTool..
Jared green (Apr 03 2025 at 05:44):
i have no prompt engineering 'skills', i sort of expected something that would be trained from the start with that in mind. if you get a pretrained model to do it, i think you would just add the new power tactic to a list of mcp templates.
Leni Aniva (Apr 07 2025 at 18:28):
given how existing agents work you either need to incentivize exploration or execute the tactic when some mechanically checkable condition is met
Last updated: May 02 2025 at 03:31 UTC