I think we could run for at least a decade further with no model changes/improvements, just better harnesses and infra around this agentic way of developing.
Replying to myself -- likely this did happen and it was Amazon tokenmaxing due to exec pressure and many were using MeshClaw. Goodhart’s Law: when a measurement becomes the target, it stops being a useful measurement.