Skip to content
Better HN
My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf] | Better HN