There seems to be a maximum amount of reasoning LLMs can do per token (per unit of computation). If you prompt one to spend more tokens before it outputs the final answer ("think step by step", "check your answer", ...) it becomes smarter. People have lucked into different prompting strategies that elicit this, but there are probably more.
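As a minimal sketch of the idea, here are two ways of prompting the same question: one that forces an immediate answer, and one that invites the model to spend tokens reasoning first. The `ask` function is a hypothetical placeholder for whatever LLM client you use, not a real API.

```python
# Sketch: a direct prompt vs. a "spend more tokens first" prompt.
# `ask` is a hypothetical stand-in for a real LLM API call.

QUESTION = (
    "A bat and a ball cost $1.10 together. The bat costs $1.00 "
    "more than the ball. How much does the ball cost?"
)

def direct_prompt(question: str) -> str:
    # Leaves the model almost no tokens to reason in before answering.
    return f"{question}\nAnswer with just the number."

def cot_prompt(question: str) -> str:
    # Gives the model room to spend many tokens before committing.
    return (
        f"{question}\n"
        "Think step by step, check your answer, "
        "and only then state the final answer."
    )

def ask(prompt: str) -> str:
    # Placeholder: plug in your own LLM client here.
    raise NotImplementedError

if __name__ == "__main__":
    print(direct_prompt(QUESTION))
    print(cot_prompt(QUESTION))
```

The only difference between the two calls is the instruction appended to the question; any gap in answer quality then reflects how many tokens of reasoning the prompt allowed.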
Ultimately, I feel it is fairer to benchmark LLMs by what they can be prompted into doing. After all, we let people carefully work through a problem during exams, so it seems fair to hold LLMs to the same standard.