undefined | Better HN

0 pointsotabdeveloper47d ago0 comments

I did, and I fixed Qwen's issues with trivial sampling and loop detection hacks.

If I can do this, then a company that wants to sell local models seriously could do it too.

0 comments

ninjagoo6d ago

> I did, and I fixed Qwen's issues with trivial sampling and loop detection hacks.

Wow, that's amazing! Care to share the changes? Would love to try them out.

otabdeveloper4OP6d ago

It's not amazing at all.

What's amazing is that LLM technologies are so immature that even basic engineering diligence isn't being done. (Like detecting token loops, for example.)

j / k navigate · click thread line to collapse