Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: Synthetic dataset generation


Soonho Kong (May 22 2024 at 21:17):

Hi All, I'm wondering if you can share ideas and/or existing prototypes on synthesizing diverse Lean datasets (theorems & proofs) across various mathematical domains, including number theory, algebra, and combinatorics. The goal would be to create a comprehensive dataset that could potentially serve educational purposes, such as automatically generating exercises for platforms like Lean Games, as well as training AI agents in these mathematical domains.

Kevin Buzzard (May 22 2024 at 21:23):

I would get humans to synthesize them. Presumably this isn't what you want to hear?

Kim Morrison (May 22 2024 at 23:17):

Without a very robust feedback loop between:

  • generating statements
  • deciding that they are valuable because they are both
    • true (via generating proofs) and
    • useful (via noticing that they help prove later things)

synthetic data in maths is really hard. I think people are working on such feedback loops, but there's not a lot public to point to.

Abhishek Anand (May 23 2024 at 00:52):

  1. following AlphaGeometry, one could directly generate/sample in the valid-proof space and compute the theorem from the proof.
  2. It is not obvious that the theorems need to be "useful". I dont think AlphaGeometry tried to select only useful theorems for training. The author said that most of the theorems in the synthetic dataset are uninteresting. It seems they did deduplicate the theorems.

The synthetic proof dataset generation phase of AlphaGeometry is discussed starting 21:00 in this video https://www.youtube.com/watch?v=eZbYSOpga2U

Kim Morrison (May 23 2024 at 01:07):

Yes, it is surprising that AlphaGeometry in the end could solve interesting problems, when so much of its training was "not useful" examples. But still, they were in a vastly smaller domain than just writing proofs.

Huajian Xin (May 24 2024 at 05:32):

(deleted)

Notification Bot (May 24 2024 at 05:33):

Huajian Xin has marked this topic as resolved.

Notification Bot (May 24 2024 at 05:34):

Huajian Xin has marked this topic as unresolved.

Huajian Xin (May 24 2024 at 05:34):

Hi! We submitted a new paper on arxiv and successfully achieved large-scale synthesis of formal proof data for Lean4. The trained model successfully proved 52% of theorems using the Lean4 language on the miniF2F benchmark. http://arxiv.org/abs/2405.14333

Notification Bot (May 24 2024 at 18:52):

26 messages were moved from this topic to #Machine Learning for Theorem Proving > DeepSeek-Prover by Eric Wieser.


Last updated: May 02 2025 at 03:31 UTC