Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs) | Better HN
0 comments
(furiosa.ai)
9 points
olibaw
5mo ago