Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
madduci
3mo ago
0 comments
Share
The first users of this dataset will be Big Tech corps. Meta, Alphabet, OpenAI, Microsoft, Apple will all be happy to use this dataset for training their LLMs.
For them, 300TB is just cheap
0 comments
default
newest
oldest
ipsum2
3mo ago
They already have this data. See jukebox from OpenAI, released before chatgpt.
j
/
k
navigate · click thread line to collapse