Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: InternLM-Math-Plus


Zijian Wu (May 28 2024 at 03:30):

We released a series of math LLM named InternLM-Math-Plus, of sizes 1.8B, 7B, 20B, and 8x22B. Our models are equipped with COT reasoning, code-interpreter reasoning, and LEAN 4 abilities (43.4% pass@1 on MiniF2F-test set).

Demo: https://huggingface.co/spaces/internlm/internlm2-math-7b

Model: https://huggingface.co/internlm/internlm2-math-plus-mixtral8x22b

Code: https://github.com/InternLM/InternLM-Math

Arxiv: https://arxiv.org/abs/2402.06332

Huajian Xin (May 28 2024 at 04:02):

Very good job! I would like to ask if pass@1 here refers to tree search once or model generation once?

Zijian Wu (May 28 2024 at 04:07):

It is tree search once (like HyperTree Proof Search's definition). In addition, the tree search width is 32, following the setting of llemma.


Last updated: May 02 2025 at 03:31 UTC