FSF-calls for white papers on philosophical and legal questions around Copilot (opens in new tab)

(fsf.org)

283 pointsnon_sequitur4y ago198 comments

198 comments

The ignorance in this comment section is already giving me an aneurysm. Software licenses matter. Copyright matters. If megacorps like Microsoft can sue people into oblivion for violating their copyright terms, people can sue Microsoft into oblivion for violating theirs. I don't use MS Github, I have no skin in the game, but I hope there is at-least a $1000 award to every instance of AGPL and GPL license violation because it's unfair and illegal what they're doing.

This isn't ML, it is a ripoff and is violating clear software licensing terms. https://news.ycombinator.com/item?id=27710287

Software freedom matters, but I wouldn't expect the typical HN type to understand, since their money is made on exploiting freely-available software, putting it into proprietary little SaaS boxes, then re-selling it.

heavyset_go4y ago

> The ignorance in this comment section is already giving me an aneurysm. Software licenses matter. Copyright matters.

If anyone thinks they don't, ask why Microsoft didn't train Copilot on their Windows, Office, or Azure source repositories.

zarzavat4y ago

Microsoft (presumably) did train it on their open source repositories, since those repositories are public GitHub repos. They didn't train it on anybody's private repositories.

jazzyjackson4y ago

The point is, if they're sure they won't be recycling copyrighted code wholesale, why not include their own in the training set. Surely their internal code is higher quality than the average git repo, which must be 80% abandonware (if my personal repos are anything to go by :P)

1 more reply

paulryanrogers4y ago

But publicly accessible doesn't mean public domain. Microsoft has shared even some of their private code with others like governments. No doubt with strict licenses which they expect to be honored. AGPL and other licenses on publicly accessible code still matter.

1 more reply

atatatat4y ago

> They didn't train it on anybody's private repositories.

Are you sure?

cromka4y ago

Case closed, everybody go home.

lostmsu4y ago

Because that's extra work to wire them up? Until recently Windows wasn't even in Git.

xxpor4y ago

Software licenses have barely been tested in court, let alone how they apply to code injected and combined with other code via machine learning. You're extremely overconfident about how this will actually play out.

For one, just because your code is covered by the GPL, it doesn't mean every single line in isolation is copyrightable. It has to demonstrate creativity. That's why you don't have to worry about writing for (int i = 0; i < idx; i++) {.

ghoward4y ago

You're right that code has to demonstrate creativity for copyright. But that also means that an algorithm, even a transformative algorithm, cannot change copyright because an algorithm is not creative, by definition.

This means that the output of any algorithm on copyrighted code is still under the original copyright. I mean, we still apply the copyright of the original to the output of compilers, even though compilers can be transformative with inlining and link-time optimization, to the point that it mixes disparate code in the same way Copilot does.

In fact, I wrote some software licenses [1] that codify the fact that algorithms cannot change copyright.

[1]: https://yzena.com/licenses/

gradys4y ago

You sound very confident about this, whereas copyright lawyers I've read discuss this issue seem much less confident overall, but lean toward thinking this would be fair use.

What makes you so confident that this would not be ruled fair use?

(And for people not familiar - if ruled fair use, it doesn't matter what the license is because fair use is an exception to copyright itself.)

1 more reply

alpaca1284y ago

> it doesn't mean every single line in isolation is copyrightable

Microsoft did not just copy individual lines. They fed whole repositories into their model, ignoring the license (if it exists) even though they knew from the start that information generated by the model will be publicly available. Available usually out of context, but nonetheless - the scope of the input and intent are very clearly "everything" and "redistribution".

Just adding a filter/ML model to the output shouldn't matter. I dare you to build a Copilot clone trained from leaked internal Microsoft code and then trying to argue the output is a bit mixed up.

That is a clear violation imho.

google2341234y ago

Copilot was trained on leaked internal Microsoft code that's on github at the moment. Anyway, everyone seems perfectly ok with training langauge models on copyright text.

2 more replies

sobellian4y ago

The search engine on Github also calls up entire pages of GPL licensed code verbatim. Does it run afoul of copyright?

hodgesrm4y ago

> Software licenses have barely been tested in court...

OSS licenses have been litigated and upheld. Can't supply details of my own experience for confidentiality reasons but plenty of plaintiffs have prevailed in suits about violations of OSS license terms. My guess is the numbers are higher than you might think because a lot of the cases end in non-public settlements.

sjy4y ago

A confidential settlement does not mean that a licence has been “tested in court” or “litigated and upheld.” It means the parties thought the risk of losing was high enough to justify a settlement. The state of the law remains uncertain because cases are getting settled rather than litigated.

1 more reply

api4y ago

What about non-traditional-FOSS licenses? There is a lot of source-available not-OSI-compliant licensed software on GitHub like MongoDB, CockroachDB, etc., and that's clearly proprietary. If this thing is trained on that and generates what amount to snippets of that code then it's clearly violating those licenses.

Then there's private repositories. If they included those in the training data set that's even more actionable.

Personally I think this is software piracy at an absolutely unprecedented scale. Machine learning is just information transfer from the training data into weights in a model, a close relative of lossy data compression. Microsoft is now reselling all its GitHub users' code for profit.

Wowfunhappy4y ago

Private repositories weren't included in the training data per-github, only public repos.

sangnoir4y ago

> You're extremely overconfident about how this will actually play out.

I'd argue Microsoft too, was/is overconfident about how this would play out. I would have expected a little more caution on selecting the training data.

josefx4y ago

> it doesn't mean every single line in isolation is copyrightable.

copilot is known to reproduce entire blocks of text including non functional parts like comments.

bluGill4y ago

While they are not tested, anything other than accepting the idea kills the idea of software completely. There is lots of room to change details, but somehow copyright and the fact that the code is copied into computer memory needs to be reconciled.

xxpor4y ago

I don't see how. It might kill specific ideological licensing of software code, but the idea it'd kill software as a whole is pretty unbelievable. Software is too valuable to society.

As we're seeing, there's VERY little software where the specific algorithms or ideas in the software are what's valuable. The value comes from the ability to sell a service based on the software and operate it at scale. Like you said, how much SaaS is mostly open source stuff packaged up? Android is (sort of) open source, companies pay lots of people a lot of money to contribute to the Linux kernel where they give away the code they developed with that money, etc etc.

austincheney4y ago

A software license, like any license, is a permission to operate.

> it doesn't mean every single line in isolation is copyrightable

It is if you can prove reproduction apart from your own original work (fair use). Unlike patents copyright doesn’t protect uniqueness. It is only a shield from reproduction, and if reproduction is demonstrable to a court you are likely at risk.

https://cws.auburn.edu/OVPR/pm/tt/copyrightvplagiarism

hartator4y ago

> Software licenses matter. Copyright matters.

Some of us think is detrimental to humanity at whole.

Y_Y4y ago

Why not both?

Suppose that it's just a bad idea and shouldn't exist. Does that mean that I should release my code into the public domain? I think you could make a good case that even being totally opposed to copyright morally or pragmatically or otherwise, given that it currently is enforced in many places it's worthwhile to play along. For example, some people would prefer a world without copyright, but GPL their code, because it might prevent a greater evil.

dcow4y ago

Exactly. The copyleft side of me says you can't copyright instructions on how to bake a cake, or a fast route across a city, or a beautiful way to display colored pixels in a grid, or an efficient compression scheme for video data... because it's all intellectual, and not physical, "property". But society disagrees so a nice hack on copyright that perpetually keeps any of the above from being stolen and locked down by profit seeking psychopaths just early enough to the scene to make a buck, seems like the best interim solution.

1 more reply

sangnoir4y ago

True, but while they exist,they should be evenly applied

warkdarrior4y ago

If you abolish copyright, that will only make it easier for for-profit corporations to use FOSS. There will be nothing stopping them from using FOSS, unless people stop sharing their code altogether.

syshum4y ago

While True, if you abolish copyright then there is nothing preventing me from Installing Microsoft office on as many machines as I want never paying Microsoft a dime....

2 more replies

pabs34y ago

If Copilot is violating the GPL license family, then it is also violating the permissive licenses like MIT too.

MichaelMoser1234y ago

How so? the MIT license allows you to do everything with the code. It doesn't allow to sue the author, but that's about it. Here it is: https://opensource.org/licenses/MIT

bblough4y ago

From your link:

> The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

Not including the copyright information for the MIT-licensed code is a violation of the license.

1 more reply

swayson4y ago

Very well put and refreshing. Thank you.

c7DJTLrn4y ago

If I ever receive monetary compensation for violation of the license on my repositories, I will personally deliver it to you in cash. It won't happen.

I have a feeling Copilot is more of a tool for publicity than for development.

spywaregorilla4y ago

That statement sort of depends on how important your repos are

snarky_birdie4y ago

>I don't use MS Github, I have no skin in the game

You don't have to use Github to have a skin in the game. As long as someone has access to your open source code, no matter where it's hosted, anyone is free to upload it to Github. The open source license of your code allows that.

JTbane4y ago

>I hope there is at-least a $1000 award to every instance of AGPL and GPL license violation

So much this. If a neural network is capable of regurgitating code verbatim (with comments!), it's not a stretch to say it's a derivative work of the GPL code used to feed it.

syshum4y ago

Thank you...

api4y ago

But don't you get it? The purpose of FOSS is to provide free labor for billion dollar companies.

tomnipotent4y ago

A non-trivial amount of FOSS is contributed by programmers on the clock working for those same billion dollar companies.

ralph844y ago

Their link to why you shouldn't use GitHub[0] takes you to a page where they criticize GitHub for complying with US export controls. The FSF is a US corporation, why do they think that US export controls don't equally apply to savannah.gnu.org? And unlike FSF, GitHub has actually done the work of applying for export licenses so that developers in US-sanctioned countries can access GitHub[1].

[0] https://www.gnu.org/software/repo-criteria-evaluation.html#G... [1] https://github.blog/2021-01-05-advancing-developer-freedom-g...

fadjacent4y ago

there is a different and more important criticism listed too, githus is nonfree.

But, github could easily establish a non-us entity to host export restricted code. And for savannah, if anyone had any code they were worried about export control for their code, savannah would quickly and easily have an independent person host that repo outside the US.

judge20204y ago

To be more specific:

https://stackoverflow.com/legal/terms-of-service/public#:~:t...

> You agree that any and all content.. that you provide to the public Network... is perpetually and irrevocably licensed to Stack Overflow on a worldwide, royalty-free, non-exclusive basis pursuant to Creative Commons licensing terms (CC BY-SA 4.0)

Technically a lot of people who copy from Stack Overflow are breaking CC BY-SA 4.0 since it requires attribution AND requires distributing code that uses it under the same license ( I think - I am not your lawyer) :

https://creativecommons.org/licenses/by-sa/4.0/

lamontcg4y ago

Given how the racist twitterbot AI turned out, along with L4 autonomous driving by 2017, I suspect that Copilot is going to suffer most from an incredibly high velocity of churned out security bugs and bad code. SWEs are probably going to get fired for using it and companies will need to ban it, even if the legal problems don't take it down.

c7DJTLrn4y ago

It's useless. It is a problem looking for a solution much like most "AI" tools these days. I am frankly frustrated at everyone buying into this stunt.

GuB-424y ago

I think copilot is the wrong application of AI. It spits out what most coders would write for a specific problem. First, if many people have the same problem, than libraries are the solution, not copy-pasting. Also, just because many people do one thing doesn't mean it is the right thing to do, and you sometimes get code with security vulnerabilities.

Instead, I would like a system telling me about obscure things, traps, vulnerabilities, performance issues, etc... like the machine learning linter. The way I could see it work is by matching my code with bugfix commits. For example if several commits replaces "printf(buffer)" with "printf("%s", buffer)" and I write "printf(buffer)", I want an AI to tell me "code like yours is often replaced in commits, it may be wrong", bonus points if it can extract the reason from commit messages ("format string vulnerability") and suggest a replacement ("printf("%s", buffer)"), mega-bonus if it can point me to good explanation of the problem.

Pissing lines of code is easy, I can do it, anyone with a couple weeks of training can do it, I don't need a bot to help me with that. Thinking about everything while I am pissing my lines is hard, and I will welcome a little help.

A nice thing about that approach is that it is unlikely to result in worse code than what I would have written by myself, because it will be designed to trigger only on bad code.

lamontcg4y ago

I'm sure there's an IDE out there which will do that already without any AI. Just need to lint your code, highlight the bad stuff it finds and suggest a refactoring.

1 more reply

phillipcarter4y ago

I don't think Copilot is useless at all. Today it's actually been very helpful for me with interactive, notebooks-based programming. And it's also just an early beta right now; as the model improves and the tooling around it matures a little more, I suspect I'll be using it a lot for interactive stuff.

Notebooks programming has a flow of "execute a small bit of code, check the results, and iterate", and this fits perfectly with Copilot since you still need to check if the suggestions work.

Maybe this kind of programming is where Copilot finds a niche, maybe not. I don't know. I'm skeptical of its use in larger applications where you can't trivially check if the code you wrote (with its help) did what you want. I think there needs to be a lot more tooling built around that to really make it compelling for larger applications like that, likely in the form of more editor tooling integrations. But I think it's promising. I wrote about that a little more here: https://phillipcarter.dev/posts/four-dev-tools-areas-future/...

thamer4y ago

Have you tried it? I've been using it for weeks and I'm consistently impressed by its suggestions. It's definitely not a stunt.

oehtXRwMkIs4y ago

Do you mean solution looking for a problem?

belorn4y ago

An interesting initiative from FSF, through I suspect the answer the most of the question will be answered when someone attempts a similar projects in a more traditional copyright-restrictive area.

As an example I would like to see is a Cosinger, where the AI is trained using songs on youtube and streaming services. With the final product, a user start to sing and the algorithm attempt to sing along and give the singer suggestions for how the song should continue. I could see how a lot of musicians would be willing to pay good money for such program, and removing obligations to pay any money for the training set would make it much more feasible to create.

There are already AI's that create music (through unlikely from proprietary training sets). A Cosinger shouldn't be too far from that.

antocv4y ago

A Cosinger would be illegal unethical, profit killing, anti democracy and ultimately anti our very own freedom to own intellectual property. /s

The same difference as allowing Google to prosper while beating down ThePirateBay, another search engine.

belorn4y ago

I predict it is very likely we will see a court case where a smaller actor will take public available information as training data and get sued for copyright information. It will be interesting to see if, just like in the pirate bay case, the courts will be creative. In the TPB case, the accused was found guilty of an Swedish anti-biker gang law that was written with the intention to shut down biker bars.

When copilot came out, one thing it reminded me of was the ethical considerations of face generators in animation. The output naturally has some similarities with the training data, and it is trivial to use a limited set of actors in order to create faces with canny similarities of the actors. A question that people asked (here on HN if I recall) was if you needed permission from those actors to use in the training set, or if this would allow anyone to "steal" the face of public faces and create semi-look alike that can then be used in anything from porn to advertisement.

The law is undoubtedly going to catch up.

hartator4y ago

> We already know that Copilot as it stands is unacceptable and unjust, from our perspective.

So, why call for white papers? I don’t believe they will publish any papers that go against their views.

user-the-name4y ago

Read the rest of the paragraph. They think it is unacceptable and unjust from certain perspectives that are trivial for them. However, there are other perspectives that are worth exploring, and that is what this is about.

humanistbot4y ago

You seem to be unfamiliar with (edit: or object to) the very idea of lawyers.

pavon4y ago

Just because someone has formed strong opinions about some aspect of a subject, doesn't mean they can't be open minded about another aspect. They plainly state that they don't have clear answers about many of the questions that Copilot raises, and this isn't going to be the last time that those issues appear. It is these broader issues that they want to hold discussions about, not Copilot itself. I don't see any reason not to accept this interest as genuine.

meepmorp4y ago

They have a position and they now want to support it with arguments, and they'd like it if people would help them do that.

I think that's a backwards because it's putting the conclusion first then seeking to justify it, but to each their own.

user-the-name4y ago

No, they have a position and arguments to support it, but those have nothing to do with the machine learning aspects, just with the fact that the software is proprietary.

They are asking for views on the machine learning, which they do not have arguments or a position on.

kelnos4y ago

> I think that's a backwards because it's putting the conclusion first then seeking to justify it

Isn't that literally a lawyer's job?

meepmorp4y ago

>Isn't that literally a lawyer's job?

I guess, but then they should have their story straight before they start the astroturfing campaign.

tyingq4y ago

They know a couple of reasons for sure. They want more reasons, or more detail on other reasons for which they aren't as sure yet.

whazor4y ago

I am curious about the results.

Having tested copilot, most suggestions are based on existing code in your opened file. Furthermore, most snippets tend to be relatively short, where it feels more like a Stack Overflow answer than existing code.

Of course it is possible to make the model generate longer pieces of code that are potentially GPL. But you would have to do certain effort for it. It also tends to adopt your coding style.

But maybe the fact that there are no guarantees makes it unfair.

dcow4y ago

The difference is that Stack Overflow has taken the legal responsibility of making sure any contributions to the site are licensed in a way that allows users to copy-paste them into their own works, and has the authority to do as much. GH does not have the authority to, without authors' permission, launder their code through an AI "tumbler" and spit out shiny suggestions stripped of all license concerns.

unnah4y ago

I just checked Stackoverflow terms, and it still says that all user contributions are licensed under Creative Commons CC BY-SA 4.0, which means that copying them to your own codebase is likely to be a copyright violation. Lots of people do it, but it's a well-known legal problem.

thomzane4y ago

I am excited to see where these questions lead.

grepfru_it4y ago

Something like this?

  [GitHub Copilot License Config Menu]
  Show suggestions with the following tags:
  - [ ] GPLv3
  - [x] GPLv2
  - [ ] AGPL
  - [x] CC-BY-SA
  - [x] Apache License
  - [x] MIT License
  - [ ] No License Attribution

dunham4y ago

Would that require generating 2^n models or can models be combined?

blooalien4y ago

I would think that to combine the models, the software would need some internal method to differentiate between the licenses used by the various code sources it's pulling it's suggestion "ideas" from, and the compatibility between the licenses of those sources and your own choice of licensing for the project you're creating.

tyingq4y ago

Other picklists might be handy too, especially something that would narrow to higher quality sources.

And they need a report button with a picklist of reasons.

imoverclocked4y ago

eg, cleanliness described by different linters/static-analysis tools? Can we actually make better code suggestions by choosing examples which are known to have less super-obvious flaws?

1 more reply

remram4y ago

Those licenses require attributions. You can't just say "Copyright (c) all the projects indexed in Copilot".

judge20204y ago

I would imagine saying "Copyright (c) appropriate rights holders on planet Earth" wouldn't satisfy most license attribution claims if they were ever tested in court.

grepfru_it4y ago

There's certain information I cannot share, but you can see the general idea of what I'm throwing out here

MichaelMoser1234y ago

i actually like it that copilot is better than me at solving interview questions. https://www.youtube.com/watch?v=FHwnrYm0mNc I for one welcome our robot overlords.

i wonder if they could retrain the model on BSD or MIT licensed code only; How much of the open source code is licensed as GPL vs more permissive licenses, does anyone know?

Interesting that they want to charge for the use of co-pilot, I guess that we will see this business model more in the future.

Trollmann4y ago

Haven’t watched the video but this makes a lot of sense. I assume there are quite a lot of Leetcode solution repositories containing exact problem descriptions and LeetCode naming on GitHub. So essentially it‘s copy and pasting from these solutions.

65104y ago

My opinion: Copilot is a derived work.

lostmsu4y ago

An opinion: so is you.

65104y ago

Thats much to deep for this forum.

lights01234y ago

> It requires running software that is not free/libre (Visual Studio, or parts of Visual Studio Code)

A little nitpicky, but the only proprietary part it requires is the plugin itself, not the IDE—Copilot runs just fine with the Free build of VS Code compiled from source from GitHub, after flipping a switch to enable WIP APIs.

r2834924y ago

I think you are wrong: https://vscodium.com/

lights01234y ago

VSCodium provides Free pre-compiled binaries of VS Code from GitHub, like I was describing. What about it makes me wrong?

I did it two days ago, installing the Copilot plugin in a Free build of VS Code provided by my distro.

zekrioca4y ago

Interesting: In HN, a same link submitted at a different time get different # of upvotes.

Same link, just 13h ago, but with 5x less upvotes than the one in here: https://news.ycombinator.com/item?id=27992894

ghoward4y ago

Because the US programmers were going to bed?

zekrioca4y ago

I'd expect HN to not let duplicates to be submitted.

IvyMike4y ago

From the FAQ https://news.ycombinator.com/newsfaq.html

> Are reposts ok?

> If a story has not had significant attention in the last year or so, a small number of reposts is ok. Otherwise we bury reposts as duplicates.

> Please don't delete and repost the same story. Deletion is for things that shouldn't have been submitted in the first place.

1 more reply

ghoward4y ago

I see. Well, it actually happens all the time.

kmeisthax4y ago

>Is Copilot's training on public repositories infringing copyright? Is it fair use?

My money's on yes, but this isn't settled until SCOTUS says so.

>How likely is the output of Copilot to generate actionable claims of violations on GPL-licensed works?

This depends on how likely Copilot is to regurgitate it's training input instead of generate new code. If it only does so IF you specifically ask it to (e.g. by adding Quake source comments to deliberately get Quake input), then the likelihood of innocent users - i.e. people trying to write new programs and not just launder source code - infringing copyright is also low. However, if Copilot tends to spit out substantially similar output for unrelated inputs, then this goes up by a lot. This will require an actual investigation into the statistical properties of Copilot output, something you won't really be able to do without unrestricted access to both the Copilot model and it's training corpus.

>How can developers ensure that any code to which they hold the copyright is protected against violations generated by Copilot?

I'm going to remove the phrase "against violations generated by Copilot" as it's immaterial to the question. Copilot infringement isn't any different from, say, a developer copypasting a function or two from a GPL library.

The answer to that, is that unless the infringement is obvious, it's likely to go unpunished. Content ID systems (which, AFAIK, don't really exist for software) only do "striking similarity" analysis; but the standard for copyright infringement in the US is actually lower: if you can prove access, then you only have to prove "substantial similarity". This standard is intended to deal with people who copy things and then change them up a bit so the judge doesn't notice. There is no way to automate such a check, especially not on proprietary software with only DRM-laden binaries available.

If you have source code, then perhaps you can find some similar parts. Indeed, this is what SCO tried to do to the Linux kernel and IBM AIX; and it turned out that the "copied" code was from far older sources that were liberally licensed. (Also, SCO didn't actually own UNIX.) Oracle also tried doing this to the Java classpath in Android and got smacked down by the Supreme Court. Having the source open makes it easier to investigate; but generally speaking, you need some level of suspicion in order to make it economic to investigate copyright infringement in software.

Occasionally, however, someone's copying will be so hilariously blatant that you'll actually find it. This usually happens with emulators, because it's difficult to actually hire for reverse engineering talent and most platform documentation is confidential. Maui X-Stream plagiarized and infringed PearPC (a PowerPC Macintosh emulator) to produce "CherryOS"; Atari ported old Humongous Entertainment titles to the Wii by copying ScummVM; and several Hyperkin clone consoles feature improperly licensed SNES emulation code. In every case, the copying was obvious to anyone with five minutes and a strings binary, simply because the scope of copied code was so massive.

>Is there a way for developers using Copilot to comply with free software licenses like the GPL?

Yes - don't use it.

I know I just said you can probably get away with stealing small snippets of code. However, if your actual intent is to comply with the GPL, you should just copy, modify, and/or fork a GPL library and be honest about it.

To add onto the FSF's usual complaints about software-as-a-service and GitHub following US export laws (which, BTW, the FSF also has to do, unless Stallman plans to literally martyr himself for--- oh god he'd actually do that); I'd argue that Copilot is unethical to use regardless of concerns over plagiarism or copyright infringement. You have no guarantee that the code you're actually writing actually works as intended, and several people have already been able to get Copilot to hilariously fail on even basic security-relevant tasks. Copilot is an autocomplete system, it doesn't have the context of what your codebase looks like. There are way better autocomplete systems that already exist in both Free and non-Free code that don't require a constant Internet connection to a Microsoft server.

>Should ethical advocacy organizations like the FSF argue for change in copyright law relevant to these questions?

I'm going to say no, because copyright law is already insane as-is and we don't need to make it worse just so that the copyleft hack still works a little better.

Please, for the love of god, we do not need stronger copyrights. We need to chain this leviathan.

blibble4y ago

> My money's on yes, but this isn't settled until SCOTUS says so.

there are more countries in the world than the United States, and most of the world's developers live outside of the United States

most countries won't pay any attention to what the US Supreme Court decides

judge20204y ago

> most countries won't pay any attention to what the US Supreme Court decides

Copyright lawsuits across nation state lines are pretty much non-existent and not worth it. What matters in the U.S. is pretty much as far as anyone who cares about copyright is going to care about.

blibble4y ago

I am capable of suing Microsoft in the UK for violating my UK copyright without requiring any involvement with the US legal system

and the UK has essentially no concept of fair use

and the same will play out across every country in Europe and the 90% of the world that isn't the United States

1 more reply

pkrefta4y ago

I'm using Github to publish my code and seriously I don't care whenever Copilot was trained using it. I published it and in the end somebody can do anything with it without giving a damn about license, copyright etc - that's the truth of open-source.

grepfru_it4y ago

This was the same mentality that brought copyleft to the masses in 1984. While you may not care, there are others who do care about the sanctity of license agreements. This is an argument where staying silent means you accept this approach. Of the millions of open source projects, a large portion of the contributors ARE speaking up because they don't find this to be acceptable. I personally think copilot is the future and all this discussion is doing is going to bring a license usage feature to copilot (e.g. i want only or i do not want GPL code in my copilot suggestions)

Please continue using GitHub as you were, but maybe consider acting on your words and either removing or changing licenses within your code that does not represent your ideals. Nothing is preventing you from releasing code into the public domain, so do that!

Permit4y ago

> Of the millions of open source projects, a large portion of the contributors ARE speaking up because they don't find this to be acceptable.

Is this true? Is there really a large portion of contributors speaking up against this? I got the opposite sense, that it was a very small portion of contributors speaking up against this but I don't have any evidence one way or the other.

kelnos4y ago

> that's the truth of open-source

No, that's your opinion, which as it turns out also has no legal basis. For me, I want proper attribution from people who use my code. And for any code that I release that's under copyleft, I absolutely do want that license followed.

You seem to be fine releasing your stuff into the public domain, and that's great that you want to do that, but you don't speak for everyone.

colechristensen4y ago

Well then you're a BSD-license kind of person.

Not everybody is and that's ok too.

nitrogen4y ago

The BSD license still requires attribution and copyright notices visible to the end user.

laumars4y ago

This is why there are a multitude of different open source software licences. Because some people care more than others about the terms in which their code is used by others.

johannes12343214y ago

That is a valid position one can have.

However other people for varying reasons have other ideas ...

senko4y ago

> We already know that Copilot as it stands is unacceptable and unjust [...]. Activists wonder if there isn't something fundamentally unfair about a proprietary software company building a service off their work.

> We will read the submitted white papers, and we will publish ones that we think help elucidate the problem.

Doesn't give me hope they're aiming for unbiased opinion. I would be very surprised if any of the published papers don't closely align with FSFs apriori position.

nescioquid4y ago

It sounds like they have a legal premise and they want to work out the implications, not to open up discussion to every quibble about the FSF's values. Having an opinion on the legal issues around their licenses and values seems sort of essential to what the organization does.

The word "unbiased" seems to be doing a lot of heavy work in your comment. The FSF is inherently biased towards its project -- how is that a problem?

senko4y ago

> The word "unbiased" seems to be doing a lot of heavy work in your comment. The FSF is inherently biased towards its project -- how is that a problem?

That's straw-man, I never said (nor do I think) FSF should not be biased towards its project.

However, I would be more willing to trust the results of this call if I had confidence that all solid arguments are presented, even if they're not aligned with FSF's agenda. Hiding them won't make them disappear - you might as well get as informed as possible about the issue, especially if you care deeply about the issue and agree with the FSF.

1 more reply

kelnos4y ago

Well, sure. They're looking for legal support for their position. They're not pretending to be an unbiased, disinterested observer.

user-the-name4y ago

The part you removed is the crucial part that explains that paragraph.

ghoward4y ago

I honestly wish I was in a position to write a whitepaper for this. However, I should not for several reasons:

* I have already made my position clear in public, [1] so I could probably be identified.

* I am not a lawyer, just some bloke who attempted to write FOSS licenses to combat ML on copyrighted code. [2]

[1]: https://gavinhoward.com/2021/07/poisoning-github-copilot-and...

[2]: https://yzena.com/licenses/

slownews454y ago

Anyone feel like FSF moved from maybe engineering idealists to a very lawyer driven type org?

The big GPLv3 push and development - plenty of attacks on folks actually shipping product on GPLv2 and building communities around that model (which keeps software free but allows users of the software to do what they want with it pretty much including putting in devices that are locked down - cars / tivo's etc).

Here's an opportunity to really advance in an interesting area with ML -> something that may open up programming to more people -> may advance computers ability to program and modify their own programs in the long run.

And regardless of the FSF attorney stuff, places like china, tiny little LLC's with no assets will very likely use the wonderful amount of code on the web to develop solutions in this space, even if FSF claims everything is a violation. Where is the vision anymore from FSF.

One thing that's been sad about the FSF -> it's gone from what I would consider a forward looking idealism sort of thing -> here's how we could do / make cool stuff that let communities work together -> to now sort of a legal compliance type org that really is focused on "actionable claims" " protected against violations" etc.

Question - does the Linux community and other successful larger open source communities welcome the FSF and their attorney's into the discussion? I can hardly imagine the BSD's, the Linux folks really connecting anymore with them.

Is there space for a different group, maybe a collection of actual develops shipping code in larger communities to get together, no FSF / SFC lawyers present, to think creatively about the future? What should we be working for, what is fair to everyone, what helps society, what works around pro-social community building?

A tool that helps with cross language building blocks for common functions etc (stackoverflow on steroids) - just how bad is this?

danhor4y ago

This is more of a tangent, but I found this framing very interesting: > which keeps software free but allows users of the software to do what they want with it pretty much including putting in devices that are locked down - cars / tivo's etc

The FSF considers the user to be the one using cars/tivo's/other devices. In their view, this was a design flaw of gplv2 that it allowed locking out end-users of their devices.

For Linux this was not the case. The important part that modifications/extensions were shared (and maybe even upstreamed), while the end user access wasn't important.

The case of tivoization fractured the interest between the mostly moral "I want freedom for the end user" and the more immediately benefical "If you use my code, I want reciprocity for modifications".

I personally believe that today the latter case won, even for a lot of non-gpl software that gets lots of contributions e.g. via github for lots of different reasons, but the moral case gets more dire.

Looking at security for older (or shockingly often even current) devices, right to repair and lots of other issues concerning the effective loss of rights with more modern devices, the concerns of the FSF were often accurate, but with the increasingly hostile approach to "proprietary" IP and thus the exclusion of GPLv3 and similar licenses not palatable to the larger open source community.

The approach to IP in china is also sometimes a lot different, see https://www.bunniestudios.com/blog/?p=4297.

pabs34y ago

Some interesting links about the TiVo stuff:

https://sfconservancy.org/blog/2021/jul/23/tivoization-and-t... https://news.ycombinator.com/item?id=27937877 https://events19.linuxfoundation.org/wp-content/uploads/2017...

Apparently what TiVo did (breaking proprietary software if you modify GPLed software) is even allowed by GPLv3.

slownews454y ago

Right - FSF ended up with a user view. Problem was the developers are the one actually writing the code and picking licenses, and the FSF moved away from really talking with them. I think this was a big shift.

laumars4y ago

I’m all for advancing machine learning but given how much big corporations aggressively defend their IP, it’s a hard pill to swallow if someone shrugs off a potential misuse of open source code. The law is the law and if it’s ok for Microsoft to defend their copyrights then it’s ok for the FSF to defend my copyrighted code too. The fact that I licensed it GPL was intentional — if I didn’t give a crap what happened to the code then I’d have used BSD or similar. But I chose to place restrictions and I’m very much interested to see if training proprietary AI models are legally covered under those restrictions.

e404y ago

No. The FSF had lawyers from the beginning and always thought (I talked with RMS in the early 80's some) that enforcement was part of the plan.

slownews454y ago

Sure, but the GPLv2 was very freedom oriented. Enforcement practically was relatively sparse and more educational I thought. Ie, release the TiVo source code, but we don't care that Tivo's are locked down.

Is anyone building strong communities on AGPLv3 / GPLv3? I feel the momentum shifted towards Apache / MIT style licenses unfortunately.

blendergeek4y ago

> Is anyone building strong communities on AGPLv3 / GPLv3? I feel the momentum shifted towards Apache / MIT style licenses unfortunately.

While the corporate momentum switched to Apache/MIT licenses, there are strong communities built on AGPLv3/GPLv3.

* Nextcloud - file hosting (AGPLv3)

* Source Hut - git hosting (AGPLv3)

* StreetComplete - OpenStreetMap editing (GPLv3)

* F-Droid - Free Software "app store" for android (GPLv3)

* NewPipe - alternative Youtube frontend (GPLv3)

While these aren't necessarily used by large corporations, their individual communities are thriving and strong.

The shift toward SSPL and Commons Clause licensing is another argument in favor of AGPLv3 licensing. Amazon/Google often won't touch your AGPLv3 code (and you can still sell proprietary licenses to other companies that can't/won't use AGPLv3).

1 more reply

detaro4y ago

> but we don't care that Tivo's are locked down.

They literally made the GPLv3 because they cared about that very much.

4 more replies

jcelerier4y ago

Qt has switched to GPLv3 and is going pretty strong as a community. Can't find the figures for the official forum, but an unofficial one has 75k members.

blendergeek4y ago

> The big GPLv3 push and development - plenty of attacks on folks actually shipping product on GPLv2 and building communities around that model (which keeps software free but allows users of the software to do what they want with it pretty much including putting in devices that are locked down - cars / tivo's etc).

The users of the software are the owners of the devices. The distributors are the ones locking down the devices to prevent the users from modifying the software (often so that the distributors can control something else the users are doing).

GPL is about end-user freedom (as opposed to software distributor freedom). This is why GPLv3 exists.

slownews454y ago

GPL used to be targeted at DEVELOPERS of software - the share and share alike model. These developers would in some cases use the GPL'ed software in locked down devices (many / most android devices are pretty locked down - but developers contribute to a GPL kernel).

So yes, FSF created GPLv3 to focus on USERS freedoms, but the users are not writing the software - so it remains the devs who pick licenses.

pseudalopex4y ago

The Free Software definition always put users first.[1]

[1] https://en.wikipedia.org/wiki/The_Free_Software_Definition#T...

goodpoint4y ago

I've never met a developer who is not a user.

simion3144y ago

>And regardless of the FSF attorney stuff, places like china, ....

So your argument is if China does not care about license neither should we, the thing is I am fine with that, I know Windows source code is leaked so let's train an AI on it too

I think is a clear sign that MS did not trained on proprietary code , it means that is not legal or not safe, so the question is why GPL or other licenses are safe, I think you need the authors or the licenses to give you the permission to use the code as training data in black box, locked, proprietary algorithms.

j / k navigate · click thread line to collapse

198 comments

davisr4y ago

This isn't ML, it is a ripoff and is violating clear software licensing terms. https://news.ycombinator.com/item?id=27710287

heavyset_go4y ago

> The ignorance in this comment section is already giving me an aneurysm. Software licenses matter. Copyright matters.

If anyone thinks they don't, ask why Microsoft didn't train Copilot on their Windows, Office, or Azure source repositories.

zarzavat4y ago

Microsoft (presumably) did train it on their open source repositories, since those repositories are public GitHub repos. They didn't train it on anybody's private repositories.

jazzyjackson4y ago

1 more reply

paulryanrogers4y ago

1 more reply

atatatat4y ago

> They didn't train it on anybody's private repositories.

Are you sure?

cromka4y ago

Case closed, everybody go home.

lostmsu4y ago

Because that's extra work to wire them up? Until recently Windows wasn't even in Git.

xxpor4y ago

ghoward4y ago

In fact, I wrote some software licenses [1] that codify the fact that algorithms cannot change copyright.

[1]: https://yzena.com/licenses/

gradys4y ago

You sound very confident about this, whereas copyright lawyers I've read discuss this issue seem much less confident overall, but lean toward thinking this would be fair use.

What makes you so confident that this would not be ruled fair use?

(And for people not familiar - if ruled fair use, it doesn't matter what the license is because fair use is an exception to copyright itself.)

1 more reply

alpaca1284y ago

> it doesn't mean every single line in isolation is copyrightable

Just adding a filter/ML model to the output shouldn't matter. I dare you to build a Copilot clone trained from leaked internal Microsoft code and then trying to argue the output is a bit mixed up.

That is a clear violation imho.

google2341234y ago

Copilot was trained on leaked internal Microsoft code that's on github at the moment. Anyway, everyone seems perfectly ok with training langauge models on copyright text.

2 more replies

sobellian4y ago

The search engine on Github also calls up entire pages of GPL licensed code verbatim. Does it run afoul of copyright?

hodgesrm4y ago

> Software licenses have barely been tested in court...

sjy4y ago

1 more reply

api4y ago

Then there's private repositories. If they included those in the training data set that's even more actionable.

Wowfunhappy4y ago

Private repositories weren't included in the training data per-github, only public repos.

sangnoir4y ago

> You're extremely overconfident about how this will actually play out.

I'd argue Microsoft too, was/is overconfident about how this would play out. I would have expected a little more caution on selecting the training data.

josefx4y ago

> it doesn't mean every single line in isolation is copyrightable.

copilot is known to reproduce entire blocks of text including non functional parts like comments.

bluGill4y ago

xxpor4y ago

I don't see how. It might kill specific ideological licensing of software code, but the idea it'd kill software as a whole is pretty unbelievable. Software is too valuable to society.

austincheney4y ago

A software license, like any license, is a permission to operate.

> it doesn't mean every single line in isolation is copyrightable

https://cws.auburn.edu/OVPR/pm/tt/copyrightvplagiarism

hartator4y ago

> Software licenses matter. Copyright matters.

Some of us think is detrimental to humanity at whole.

Y_Y4y ago

Why not both?

dcow4y ago

1 more reply

sangnoir4y ago

True, but while they exist,they should be evenly applied

warkdarrior4y ago

If you abolish copyright, that will only make it easier for for-profit corporations to use FOSS. There will be nothing stopping them from using FOSS, unless people stop sharing their code altogether.

syshum4y ago

While True, if you abolish copyright then there is nothing preventing me from Installing Microsoft office on as many machines as I want never paying Microsoft a dime....

2 more replies

pabs34y ago

If Copilot is violating the GPL license family, then it is also violating the permissive licenses like MIT too.

MichaelMoser1234y ago

How so? the MIT license allows you to do everything with the code. It doesn't allow to sue the author, but that's about it. Here it is: https://opensource.org/licenses/MIT

bblough4y ago

From your link:

> The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

Not including the copyright information for the MIT-licensed code is a violation of the license.

1 more reply

swayson4y ago

Very well put and refreshing. Thank you.

c7DJTLrn4y ago

If I ever receive monetary compensation for violation of the license on my repositories, I will personally deliver it to you in cash. It won't happen.

I have a feeling Copilot is more of a tool for publicity than for development.

spywaregorilla4y ago

That statement sort of depends on how important your repos are

snarky_birdie4y ago

>I don't use MS Github, I have no skin in the game

JTbane4y ago

>I hope there is at-least a $1000 award to every instance of AGPL and GPL license violation

So much this. If a neural network is capable of regurgitating code verbatim (with comments!), it's not a stretch to say it's a derivative work of the GPL code used to feed it.

syshum4y ago

Thank you...

api4y ago

But don't you get it? The purpose of FOSS is to provide free labor for billion dollar companies.

tomnipotent4y ago

A non-trivial amount of FOSS is contributed by programmers on the clock working for those same billion dollar companies.

ralph844y ago

[0] https://www.gnu.org/software/repo-criteria-evaluation.html#G... [1] https://github.blog/2021-01-05-advancing-developer-freedom-g...

fadjacent4y ago

there is a different and more important criticism listed too, githus is nonfree.

judge20204y ago

To be more specific:

https://stackoverflow.com/legal/terms-of-service/public#:~:t...

https://creativecommons.org/licenses/by-sa/4.0/

lamontcg4y ago

c7DJTLrn4y ago

It's useless. It is a problem looking for a solution much like most "AI" tools these days. I am frankly frustrated at everyone buying into this stunt.

GuB-424y ago

A nice thing about that approach is that it is unlikely to result in worse code than what I would have written by myself, because it will be designed to trigger only on bad code.

lamontcg4y ago

I'm sure there's an IDE out there which will do that already without any AI. Just need to lint your code, highlight the bad stuff it finds and suggest a refactoring.

1 more reply

phillipcarter4y ago

Notebooks programming has a flow of "execute a small bit of code, check the results, and iterate", and this fits perfectly with Copilot since you still need to check if the suggestions work.

thamer4y ago

Have you tried it? I've been using it for weeks and I'm consistently impressed by its suggestions. It's definitely not a stunt.

oehtXRwMkIs4y ago

Do you mean solution looking for a problem?

belorn4y ago

An interesting initiative from FSF, through I suspect the answer the most of the question will be answered when someone attempts a similar projects in a more traditional copyright-restrictive area.

There are already AI's that create music (through unlikely from proprietary training sets). A Cosinger shouldn't be too far from that.

antocv4y ago

A Cosinger would be illegal unethical, profit killing, anti democracy and ultimately anti our very own freedom to own intellectual property. /s

The same difference as allowing Google to prosper while beating down ThePirateBay, another search engine.

belorn4y ago

The law is undoubtedly going to catch up.

hartator4y ago

> We already know that Copilot as it stands is unacceptable and unjust, from our perspective.

So, why call for white papers? I don’t believe they will publish any papers that go against their views.

user-the-name4y ago

humanistbot4y ago

You seem to be unfamiliar with (edit: or object to) the very idea of lawyers.

pavon4y ago

meepmorp4y ago

They have a position and they now want to support it with arguments, and they'd like it if people would help them do that.

I think that's a backwards because it's putting the conclusion first then seeking to justify it, but to each their own.

user-the-name4y ago

No, they have a position and arguments to support it, but those have nothing to do with the machine learning aspects, just with the fact that the software is proprietary.

They are asking for views on the machine learning, which they do not have arguments or a position on.

kelnos4y ago

> I think that's a backwards because it's putting the conclusion first then seeking to justify it

Isn't that literally a lawyer's job?

meepmorp4y ago

>Isn't that literally a lawyer's job?

I guess, but then they should have their story straight before they start the astroturfing campaign.

tyingq4y ago

They know a couple of reasons for sure. They want more reasons, or more detail on other reasons for which they aren't as sure yet.

whazor4y ago

I am curious about the results.

Of course it is possible to make the model generate longer pieces of code that are potentially GPL. But you would have to do certain effort for it. It also tends to adopt your coding style.

But maybe the fact that there are no guarantees makes it unfair.

dcow4y ago

unnah4y ago

thomzane4y ago

I am excited to see where these questions lead.

grepfru_it4y ago

Something like this?

  [GitHub Copilot License Config Menu]
  Show suggestions with the following tags:
  - [ ] GPLv3
  - [x] GPLv2
  - [ ] AGPL
  - [x] CC-BY-SA
  - [x] Apache License
  - [x] MIT License
  - [ ] No License Attribution

dunham4y ago

Would that require generating 2^n models or can models be combined?

blooalien4y ago

tyingq4y ago

Other picklists might be handy too, especially something that would narrow to higher quality sources.

And they need a report button with a picklist of reasons.

imoverclocked4y ago

eg, cleanliness described by different linters/static-analysis tools? Can we actually make better code suggestions by choosing examples which are known to have less super-obvious flaws?

1 more reply

remram4y ago

Those licenses require attributions. You can't just say "Copyright (c) all the projects indexed in Copilot".

judge20204y ago

I would imagine saying "Copyright (c) appropriate rights holders on planet Earth" wouldn't satisfy most license attribution claims if they were ever tested in court.

grepfru_it4y ago

There's certain information I cannot share, but you can see the general idea of what I'm throwing out here

MichaelMoser1234y ago

i actually like it that copilot is better than me at solving interview questions. https://www.youtube.com/watch?v=FHwnrYm0mNc I for one welcome our robot overlords.

i wonder if they could retrain the model on BSD or MIT licensed code only; How much of the open source code is licensed as GPL vs more permissive licenses, does anyone know?

Interesting that they want to charge for the use of co-pilot, I guess that we will see this business model more in the future.

Trollmann4y ago

65104y ago

My opinion: Copilot is a derived work.

lostmsu4y ago

An opinion: so is you.

65104y ago

Thats much to deep for this forum.

lights01234y ago

> It requires running software that is not free/libre (Visual Studio, or parts of Visual Studio Code)

r2834924y ago

I think you are wrong: https://vscodium.com/

lights01234y ago

VSCodium provides Free pre-compiled binaries of VS Code from GitHub, like I was describing. What about it makes me wrong?

I did it two days ago, installing the Copilot plugin in a Free build of VS Code provided by my distro.

zekrioca4y ago

Interesting: In HN, a same link submitted at a different time get different # of upvotes.

Same link, just 13h ago, but with 5x less upvotes than the one in here: https://news.ycombinator.com/item?id=27992894

ghoward4y ago

Because the US programmers were going to bed?

zekrioca4y ago

I'd expect HN to not let duplicates to be submitted.

IvyMike4y ago

From the FAQ https://news.ycombinator.com/newsfaq.html

> Are reposts ok?

> If a story has not had significant attention in the last year or so, a small number of reposts is ok. Otherwise we bury reposts as duplicates.

> Please don't delete and repost the same story. Deletion is for things that shouldn't have been submitted in the first place.

1 more reply

ghoward4y ago

I see. Well, it actually happens all the time.

kmeisthax4y ago

>Is Copilot's training on public repositories infringing copyright? Is it fair use?

My money's on yes, but this isn't settled until SCOTUS says so.

>How likely is the output of Copilot to generate actionable claims of violations on GPL-licensed works?

>How can developers ensure that any code to which they hold the copyright is protected against violations generated by Copilot?

>Is there a way for developers using Copilot to comply with free software licenses like the GPL?

Yes - don't use it.

>Should ethical advocacy organizations like the FSF argue for change in copyright law relevant to these questions?

I'm going to say no, because copyright law is already insane as-is and we don't need to make it worse just so that the copyleft hack still works a little better.

Please, for the love of god, we do not need stronger copyrights. We need to chain this leviathan.

blibble4y ago

> My money's on yes, but this isn't settled until SCOTUS says so.

there are more countries in the world than the United States, and most of the world's developers live outside of the United States

most countries won't pay any attention to what the US Supreme Court decides

judge20204y ago

> most countries won't pay any attention to what the US Supreme Court decides

Copyright lawsuits across nation state lines are pretty much non-existent and not worth it. What matters in the U.S. is pretty much as far as anyone who cares about copyright is going to care about.

blibble4y ago

I am capable of suing Microsoft in the UK for violating my UK copyright without requiring any involvement with the US legal system

and the UK has essentially no concept of fair use

and the same will play out across every country in Europe and the 90% of the world that isn't the United States

1 more reply

pkrefta4y ago

grepfru_it4y ago

Permit4y ago

> Of the millions of open source projects, a large portion of the contributors ARE speaking up because they don't find this to be acceptable.

kelnos4y ago

> that's the truth of open-source

You seem to be fine releasing your stuff into the public domain, and that's great that you want to do that, but you don't speak for everyone.

colechristensen4y ago

Well then you're a BSD-license kind of person.

Not everybody is and that's ok too.

nitrogen4y ago

The BSD license still requires attribution and copyright notices visible to the end user.

laumars4y ago

This is why there are a multitude of different open source software licences. Because some people care more than others about the terms in which their code is used by others.

johannes12343214y ago

That is a valid position one can have.

However other people for varying reasons have other ideas ...

senko4y ago

> We will read the submitted white papers, and we will publish ones that we think help elucidate the problem.

Doesn't give me hope they're aiming for unbiased opinion. I would be very surprised if any of the published papers don't closely align with FSFs apriori position.

nescioquid4y ago

The word "unbiased" seems to be doing a lot of heavy work in your comment. The FSF is inherently biased towards its project -- how is that a problem?

senko4y ago

> The word "unbiased" seems to be doing a lot of heavy work in your comment. The FSF is inherently biased towards its project -- how is that a problem?

That's straw-man, I never said (nor do I think) FSF should not be biased towards its project.

1 more reply

kelnos4y ago

Well, sure. They're looking for legal support for their position. They're not pretending to be an unbiased, disinterested observer.

user-the-name4y ago

The part you removed is the crucial part that explains that paragraph.

ghoward4y ago

I honestly wish I was in a position to write a whitepaper for this. However, I should not for several reasons:

* I have already made my position clear in public, [1] so I could probably be identified.

* I am not a lawyer, just some bloke who attempted to write FOSS licenses to combat ML on copyrighted code. [2]

[1]: https://gavinhoward.com/2021/07/poisoning-github-copilot-and...

[2]: https://yzena.com/licenses/

slownews454y ago

Anyone feel like FSF moved from maybe engineering idealists to a very lawyer driven type org?

A tool that helps with cross language building blocks for common functions etc (stackoverflow on steroids) - just how bad is this?

danhor4y ago

The FSF considers the user to be the one using cars/tivo's/other devices. In their view, this was a design flaw of gplv2 that it allowed locking out end-users of their devices.

For Linux this was not the case. The important part that modifications/extensions were shared (and maybe even upstreamed), while the end user access wasn't important.

The approach to IP in china is also sometimes a lot different, see https://www.bunniestudios.com/blog/?p=4297.

pabs34y ago

Some interesting links about the TiVo stuff:

https://sfconservancy.org/blog/2021/jul/23/tivoization-and-t... https://news.ycombinator.com/item?id=27937877 https://events19.linuxfoundation.org/wp-content/uploads/2017...

Apparently what TiVo did (breaking proprietary software if you modify GPLed software) is even allowed by GPLv3.

slownews454y ago

laumars4y ago

e404y ago

No. The FSF had lawyers from the beginning and always thought (I talked with RMS in the early 80's some) that enforcement was part of the plan.

slownews454y ago

Is anyone building strong communities on AGPLv3 / GPLv3? I feel the momentum shifted towards Apache / MIT style licenses unfortunately.

blendergeek4y ago

> Is anyone building strong communities on AGPLv3 / GPLv3? I feel the momentum shifted towards Apache / MIT style licenses unfortunately.

While the corporate momentum switched to Apache/MIT licenses, there are strong communities built on AGPLv3/GPLv3.

* Nextcloud - file hosting (AGPLv3)

* Source Hut - git hosting (AGPLv3)

* StreetComplete - OpenStreetMap editing (GPLv3)

* F-Droid - Free Software "app store" for android (GPLv3)

* NewPipe - alternative Youtube frontend (GPLv3)

While these aren't necessarily used by large corporations, their individual communities are thriving and strong.

1 more reply

detaro4y ago

> but we don't care that Tivo's are locked down.

They literally made the GPLv3 because they cared about that very much.

4 more replies

jcelerier4y ago

Qt has switched to GPLv3 and is going pretty strong as a community. Can't find the figures for the official forum, but an unofficial one has 75k members.

blendergeek4y ago

GPL is about end-user freedom (as opposed to software distributor freedom). This is why GPLv3 exists.

slownews454y ago

So yes, FSF created GPLv3 to focus on USERS freedoms, but the users are not writing the software - so it remains the devs who pick licenses.

pseudalopex4y ago

The Free Software definition always put users first.[1]

[1] https://en.wikipedia.org/wiki/The_Free_Software_Definition#T...

goodpoint4y ago

I've never met a developer who is not a user.

simion3144y ago

>And regardless of the FSF attorney stuff, places like china, ....

So your argument is if China does not care about license neither should we, the thing is I am fine with that, I know Windows source code is leaked so let's train an AI on it too

j / k navigate · click thread line to collapse