colechristensen
10h ago
No, training a state-of-the-art model involves on the order of 10 trillion tokens.
We're talking about a single step that updates the weights based on, say, between 10k and 1M tokens.
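
To put the two scales side by side, here is a rough back-of-the-envelope sketch. It uses only the order-of-magnitude figures from the comment, and it assumes (my assumption, not the commenter's) that every token is seen exactly once; `steps_for` is a hypothetical helper name.

```python
# Order-of-magnitude figure from the comment: ~10 trillion tokens per training run.
TOTAL_TRAINING_TOKENS = 10 * 10**12


def steps_for(tokens_per_step: int) -> int:
    """Weight updates needed to consume the full token budget once.

    Assumes a single pass over the data, which is a simplification.
    """
    return TOTAL_TRAINING_TOKENS // tokens_per_step


# The two ends of the per-step range mentioned in the comment.
for tokens_per_step in (10_000, 1_000_000):
    steps = steps_for(tokens_per_step)
    fraction = tokens_per_step / TOTAL_TRAINING_TOKENS
    print(f"{tokens_per_step:,} tokens/step: {steps:,} steps, "
          f"each covering {fraction:.0e} of the run")
```

Under those assumptions a single step covers somewhere between about one billionth and one ten-millionth of the total training tokens, which is the gap the comment is pointing at.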
delis-thumbs-7e
10h ago
I learned something. Thank you!