Zulip Chat Archive

Stream: Machine Learning for Theorem Proving

Topic: OCR to LaTeX

Heather Macbeth (Apr 10 2023 at 18:12):

This is machine learning for math rather than machine learning for theorem proving, but I imagine the readership of this stream will be well-informed: what are the best tools currently available for the task of converting an image of handwritten text to LaTeX? Is there a way to create a pipeline in which a first approximation is compiled to DVI/PDF and then compared optically with the original image for feedback and adjustment?

Newell Jensen (Apr 10 2023 at 18:30):

I have used https://mathpix.com/image-to-latex in the past, worked fairly well.

Newell Jensen (Apr 10 2023 at 18:31):

Definitely saves a bunch of time even though it might not get it right 100% of the time.

Heather Macbeth (Apr 10 2023 at 18:36):

Interesting, I'll try it. Is there anything public about how it works?

Newell Jensen (Apr 10 2023 at 18:55):

Not sure, didn't look into it but others here may know more.

Siddhartha Gadgil (Apr 11 2023 at 01:40):

Mathpix seems to work well.

Zhangir Azerbayev (Apr 11 2023 at 21:20):

There is also this open source project: pix2tex. It's a vision transformer trained on wikipedia, arxiv, and the im2latex-100k dataset.

Last updated: Dec 20 2023 at 11:08 UTC