0 points
vlovich123
3y ago
Can someone explain Microsoft’s decision to include GPL code in the training set? It seems like sticking to code under non-attribution / non-viral licenses would have kept them in the clear. Would that have been too small a data set?
az226
3y ago
It only trains on the GPL code; it doesn't reproduce entire code files verbatim. So it's fair use.
vlovich123
OP
3y ago
Except when it does