Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
KaiserPro
3mo ago
0 comments
Share
Depending on what you're doing its taking up to 8GPUs working in parallel to serve those queries.
0 comments
default
newest
oldest
YetAnotherNick
3mo ago
Yes but then the batch size is in 100s or even 1000s. These GPU doesn't serve just 1 user at a time.
j
/
k
navigate · click thread line to collapse