Gemma 3 Inference: vLLM on GKE. Over 22k token/s
(medium.com)
2 points
m4r1k
11mo ago
0 comments