Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
regexorcist
4d ago
0 comments
Share
Curious if you tested llama.cpp and still found oMLX faster? I haven't tried the latter myself, might give it a go.
0 comments
default
newest
oldest
egorfine
4d ago
Oh yeah I did test various solutions and different settings and quants
Llama is about 1/3 slower on Apple Silicon.
j
/
k
navigate · click thread line to collapse