> This is data.
Nope. Frequency bias (a.k.a. the frequency illusion) is a thing. If you read on Twitter that "model X got nerfed," your brain starts looking for that pattern and notices it more than usual. That confirms your suspicion, which feeds a vicious cycle. Then you tell your friends and the same thing happens to them.
None of this requires the model to get worse. It's a well-understood psychological phenomenon.
> I can tell you what it means: models performing worse at coding tasks. So people report models being worse at coding tasks
The perception of a model performing worse at some coding task is not what "different token distribution" means. You should ask AI to explain my comment ;)
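To make "different token distribution" concrete, here's a rough sketch of one way you could measure it: sample the same prompt many times at two points in time and compare the empirical distributions of, say, the first output token. The sample data and the choice of total variation distance are my assumptions for illustration, not anything a provider publishes.

```python
from collections import Counter

def total_variation(samples_a, samples_b):
    """Total variation distance between two empirical token distributions.

    0.0 means the observed distributions are identical, 1.0 means disjoint.
    """
    if not samples_a or not samples_b:
        raise ValueError("both sample lists must be non-empty")
    count_a, count_b = Counter(samples_a), Counter(samples_b)
    n_a, n_b = len(samples_a), len(samples_b)
    tokens = set(count_a) | set(count_b)
    return 0.5 * sum(abs(count_a[t] / n_a - count_b[t] / n_b) for t in tokens)

# Hypothetical first-token samples for one fixed prompt, 100 draws each.
before = ["def"] * 70 + ["import"] * 30
after = ["def"] * 50 + ["import"] * 40 + ["class"] * 10

tv = total_variation(before, after)
print(f"total variation distance: {tv:.2f}")
```

With a finite number of samples the distance is never exactly zero even for an unchanged model, so you'd want to compare it against the spread you get from two batches taken on the same day.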
Latency and tokens per second (TPS) can also hint at whether you're being served a quant.
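If you want to track that yourself, a minimal sketch: log when you sent the request and when each streamed token arrived, then compute time to first token and decode throughput. The function name and the timestamps below are made up for illustration; how you collect the timestamps depends on your client. A shift in these numbers only tells you the serving setup changed, not why.

```python
def throughput_stats(request_start, token_times):
    """Return (time_to_first_token, tokens_per_second) for one streamed response.

    request_start: time the request was sent, in seconds.
    token_times: arrival time of each streamed token, in seconds, ascending.
    """
    if not token_times:
        raise ValueError("no tokens received")
    ttft = token_times[0] - request_start
    decode_time = token_times[-1] - token_times[0]
    # Throughput over the decode phase only; undefined for a single token.
    tps = (len(token_times) - 1) / decode_time if decode_time > 0 else float("nan")
    return ttft, tps

# Hypothetical timestamps: first token after 1.0 s, then one token every 0.5 s.
ttft, tps = throughput_stats(0.0, [1.0, 1.5, 2.0, 2.5, 3.0])
print(f"TTFT: {ttft:.2f}s, TPS: {tps:.2f}")
```

Run it on the same prompts at the same time of day over a few weeks, since load alone moves both numbers around.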
Anyways you should really get some help. Praying for you!