Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
vessenes
5mo ago
0 comments
Share
This is the spec for a training node. The inference requires 80GB of VRAM, so significantly less compute.
0 comments
default
newest
oldest
andai
5mo ago
The default model is ~0.5B params right?
j
/
k
navigate · click thread line to collapse