Mathematics Distillation Challenge – Equational Theories (opens in new tab)

(terrytao.wordpress.com)

110 pointspicafrost2mo ago5 comments

5 comments

Already saw it on Mastodon yesterday. A better presentation is in the other link. (https://competition.sair.foundation/competitions/mathematics...)

I understand the idea of a distillation challenge but it really feels like the poor man's solution to a problem that could be better solved by training a LLM and analysing the layers. In the end, as I value a condensed "cheat sheet", if the goal is to improve open-source model. A better approach seems to recreate the AlphaProof system, longer to do, but more efficient. The path taken by mathematicians now is agentic system with general LLM.

- Masto: https://mathstodon.xyz/@tao/116225525978210807

- AlphaProof: https://deepmind.google/blog/ai-solves-imo-problems-at-silve...

- AlphaProof description: https://www.nature.com/articles/s41586-025-09833-y

steppi2mo ago

It seems like the lift in the open-source models is being used as a proxy metric, and the core goal is a human understandable yoga [1] for approaching these kinds of equational proofs in universal algebra.

[1] https://mathoverflow.net/questions/64071/what-does-the-term-...

tux32mo ago

The goal isn't to improve the AI performance using the cheat sheet, it's to produce a cheat sheet at all that efficiently distills intuition about these 22 million results.

Presumably if it's written in plain text and useful to the AI, there may be some relevant information in there that will be interesting for humans too.

_ache_2mo ago

As I say, I understand the goal of having a cheat sheet that can distills a big chunk of math. But that distillation would have been better done by a neural network instead of the creation of a prompt (fine-tuning or pure distillation). But studying that neural network will be harder.

It's explicitly stated that the goal is to improve performance of cheap models but I assume, like you did, that they are hopping that the plain text may be useful to humans too.

steppi2mo ago

Tao does state his hopes in the article: "My hope is that the winning submissions will capture the most productive techniques for solving these problems, and/or provide general problem-solving techniques that would also be applicable to other types of mathematical problems."

I think your suggestions are actually complementary. Distillation of the larger networks capable of solving these problems and study of the layers could be part of the process for generating the cheat sheet.

j / k navigate · click thread line to collapse

5 comments

_ache_2mo ago

Already saw it on Mastodon yesterday. A better presentation is in the other link. (https://competition.sair.foundation/competitions/mathematics...)

- Masto: https://mathstodon.xyz/@tao/116225525978210807

- AlphaProof: https://deepmind.google/blog/ai-solves-imo-problems-at-silve...

- AlphaProof description: https://www.nature.com/articles/s41586-025-09833-y

steppi2mo ago

[1] https://mathoverflow.net/questions/64071/what-does-the-term-...

tux32mo ago

The goal isn't to improve the AI performance using the cheat sheet, it's to produce a cheat sheet at all that efficiently distills intuition about these 22 million results.

Presumably if it's written in plain text and useful to the AI, there may be some relevant information in there that will be interesting for humans too.

_ache_2mo ago

It's explicitly stated that the goal is to improve performance of cheap models but I assume, like you did, that they are hopping that the plain text may be useful to humans too.

steppi2mo ago

j / k navigate · click thread line to collapse