llm_trw
2y ago
Depends. The only paper they cite for training, https://arxiv.org/pdf/2310.11453.pdf, doesn't improve training costs much, and most models are already constrained by training cost. Not everyone has $200m to throw at training another model from scratch.
arunk47
2y ago
Is there any scope for indie builders?
llm_trw
OP
2y ago
Not really. These are slightly better for memory during pre-training and fine-tuning, but not enough to make a 4090 usable even for a 7B model.
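For a rough sense of the gap, here is a back-of-the-envelope sketch. It assumes standard mixed-precision training with Adam, which is commonly estimated at about 16 bytes of state per parameter, and it ignores activations entirely. The byte counts and the helper name training_state_gb are illustrative assumptions, not figures from the paper.

```python
# Rough training-memory estimate. Every constant is an assumption
# (typical mixed-precision Adam setup), not a measurement.
BYTES_PER_PARAM = {
    "fp16 weights": 2,
    "fp16 gradients": 2,
    "fp32 master weights": 4,
    "fp32 Adam first moment": 4,
    "fp32 Adam second moment": 4,
}

def training_state_gb(n_params: float) -> float:
    """GB (1e9 bytes) of weight, gradient and optimizer state; activations ignored."""
    return n_params * sum(BYTES_PER_PARAM.values()) / 1e9

GPU_MEMORY_GB = 24  # RTX 4090
needed = training_state_gb(7e9)  # hypothetical 7B-parameter model
print(f"{needed:.0f} GB needed vs {GPU_MEMORY_GB} GB available")
```

Under those assumptions the state alone is several times larger than the card, so a modest saving on one of the terms doesn't close the gap.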