Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
regularfry
1y ago
0 comments
Share
Qwen2.5 has a 32B release, and quantised at q5_k_m it *just about" completely fills a 4090.
It's a good model, too.
0 comments
default
newest
oldest
kristianp
1y ago
Do you also need space for context on the card to get decent speed though?
regularfry
OP
1y ago
Depends how much you need. Dropping to q4_k_m gives you 3GB back if that makes the difference.
j
/
k
navigate · click thread line to collapse