Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
busfahrer
4d ago
0 comments
Share
Great, thanks! :-) and to mirror another poster: what kind of prompt parsing (prefill) speed do you get for that model? Also how is the speed for the 27B model?
0 comments
default
newest
oldest
egorfine
4d ago
35B: 1300-1800 t/s on both Q4 and Q6.
27B: give me 20 minutes
busfahrer
OP
3d ago
Thank you, good sir!
j
/
k
navigate · click thread line to collapse