If I can do this, then a company that wants to sell local models seriously could do it too.
Wow, that's amazing! Care to share the changes? Would love to try them out.
What's amazing is that LLM technologies are so immature that even basic engineering diligence isn't being done. (Like detecting token loops, for example.)