Either LFM2.5-1.6B-4bit or Qwen3.5-2B-8bit or Qwen3.5-4B-4bit
Though, I don't see any references to Gemma at all in the open source code...
I would really like to know what people use these small and tiny models for. If any high-karma users are reading it, would you consider posting Ask HN?
very limited amount of use cases, perhaps. As a generalized chat assistant? I'm not sure you'd be able to get anything of value out from them, but happy to be proven otherwise. I have all of those locally already, without fine-tuning, what use case could I try right now where any of those are "very effective"?
Claude Code is a Desktop app as well.