Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)

(furiosa.ai)

9 points | olibaw | 7mo ago | 0 comments

No comments yet.