undefined | Better HN

0 pointskube-system2y ago0 comments

> If a LLM knows a song as part of its training data, then it is copyright infringement.

No it isn't. You can feed whatever you want into your LLM, including copyrighted data. The issues arise when you start reproducing or distributing copyrighted content.

0 comments

jdietrich2y ago

>You can feed whatever you want into your LLM, including copyrighted data.

That's currently the subject of considerable legal debate.

https://edition.cnn.com/2023/07/10/tech/sarah-silverman-open...

kube-systemOP2y ago

That is mostly an issue of the latter, whether the service that Meta/OpenAI offers outputs content that is a violation of copyright. Technically, derivative works are a copyright violation, but if you're not distributing them, you normally have a good fair use argument, and/or nobody knows.

j / k navigate · click thread line to collapse