The weights are a transformed, lossy and non-complete permutation of the training material. You
cannot recover most of the dataset reliably, which is what stops it from being an outright replacement for the work it's trained on.
> does not mean that you are not distributing them.
Except you literally aren't distributing them. It's like accusing me of pirating a movie because I sent a screenshot or a scene description to my friend.
> It's scary to think that "training" is becoming a thinly veiled way to strip copyright of works.
This is the way it's been for years. Google is given Fair Use for redistributing incomplete parts of copywritten text materials verbatim, since their application is transformative: https://en.wikipedia.org/wiki/Authors_Guild,_Inc._v._Google,....
Or Corellium, who won their case to use copywritten Apple code in novel and transformative ways: https://www.forbes.com/sites/thomasbrewster/2023/12/14/apple...
Copyright has always been a limited power.