Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: BFS-Prover-V2


Ivan Eric (Oct 01 2025 at 09:45):

The latest tactic generator model by ByteDance based on Qwen2.5. It gets 95 % on the miniF2F benchmark

Ivan Eric (Oct 01 2025 at 10:27):

https://huggingface.co/ByteDance-Seed/BFS-Prover-V2-32B

Joachim Breitner (Oct 06 2025 at 19:13):

Looks interesting, has someone here tried to use it already?

(deleted) (Oct 07 2025 at 07:48):

How about you use it

(deleted) (Oct 07 2025 at 07:50):

I expect the model to be fast

(deleted) (Oct 07 2025 at 07:50):

If it's slow it's not worth using, but I think it should be fast

Joachim Breitner (Oct 07 2025 at 08:01):

I’m hardly ever proving things these days :shrug:

Zeyu Zheng (Oct 10 2025 at 02:56):

Huỳnh Trần Khanh said:

How about you use it

Hi! You can try out our model through LLMLean following the instructions at https://bfs-prover.github.io/V2/

Bas Spitters (Oct 10 2025 at 07:22):

Looks exciting!
How does LLMLean compare to Lean-LSP-MCP? And how does Ollama compare to OpenRouter?

Notification Bot (Oct 10 2025 at 07:37):

A message was moved here from #announce > BFS-Prover-V2: A SoTA step-level prover for real use by Patrick Massot.

George Tsoukalas (Oct 10 2025 at 09:15):

What version of Lean is it trained on?

Snir Broshi (Oct 10 2025 at 12:20):

Very cool, but why is ByteDance interested in theorem proving?

(deleted) (Oct 10 2025 at 14:15):

Because theorem proving has many industrial applications

Zeyu Zheng (Oct 11 2025 at 13:39):

Bas Spitters said:

Looks exciting!
How does LLMLean compare to Lean-LSP-MCP? And how does Ollama compare to OpenRouter?

From what I understand, LLMLean embeds the LLM inside Lean (via tactics like llmstep / llmqed), letting it propose or complete proofs directly. Lean-LSP-MCP treats Lean as a separate logic engine and your agent talks through LSP/MCP interfaces.

Zeyu Zheng (Oct 11 2025 at 13:39):

Bas Spitters said:

Looks exciting!
How does LLMLean compare to Lean-LSP-MCP? And how does Ollama compare to OpenRouter?

Regarding Ollama vs OpenRouter: Ollama lets you run models locally on your machine. OpenRouter is a unified API gateway to many cloud model providers.

Zeyu Zheng (Oct 11 2025 at 13:42):

George Tsoukalas said:

What version of Lean is it trained on?

The model was trained on v4.10.0

(deleted) (Oct 11 2025 at 14:33):

Ancient Lean again. Sad

Kim Morrison (Oct 15 2025 at 10:52):

I can confirm that the install instructions work, running the LLM locally.

Kim Morrison (Oct 15 2025 at 10:53):

One thing to note is that while llmstep does print some clickable suggestions in the InfoView, there is not a "blue squiggle" underneath it when the suggestions are ready, so it is easy to miss that it has actually done something.

(This should be fixed!)

Kim Morrison (Oct 15 2025 at 21:40):

I tried BFS-Prover-V2 on the same example that I had experimented with for reap, i.e. theorem set_smul_closure_subset (as reported by @Yutong Wang in #announce > `reap`: General neural tactic now public available @ 💬 ).

Unfortunately it wasn't particularly successful. The first iteration is good, suggesting refine iUnion₂_subset fun a ha => ?_, but the next step (where reap and Yury used smul_closure_subset), BFS-Prover-V2 just says "No valid suggestions".

Nicholas Jiang (Nov 07 2025 at 00:16):

Has anyone gotten the planner in the repo working? I've been stuck on verification after the initial generation of the proof sketch. Here's the repo again for reference: https://github.com/ByteDance-Seed/BFS-Prover-V2


Last updated: Dec 20 2025 at 21:32 UTC