Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
redox99
3mo ago
0 comments
Share
I think it's more likely to be the old base model checkpoint further trained on additional data.
0 comments
default
newest
oldest
jumploops
3mo ago
Is that technically not a new pretrained model?
(Also not sure how that would work, but maybe I’ve missed a paper or two!)
redox99
OP
3mo ago
I'd say for it to be called a new pretrained model, it'd need to be trained from scratch (like llama 1, 2, 3).
But it's just semantics.
j
/
k
navigate · click thread line to collapse