Disabled at 22 million commits (opens in new tab)

> So the author was purposefully trying to do the most extreme thing they could to see how git/GitHub act/break.

This is Hacker News. Hacking is about using, in particular, technology in surprising ways that were not intended by the creators.

jstummbillig2y ago

No harm, no foul, because, what, it says "Hacker" in the title? I don't know, are we 12?

The reason that hacking is even a thing: It's actually possible to break things in a responsible, non-destructive way (in contrast to most things in the physical world).

If we skip the responsible part, we are just... breaking things and incurring costs. Why should that be okay?

csiegert2y ago

It’s GitHub, not HackerHub. That the story is reported on Hacker News is irrelevant.

https://www.vice.com/en/article/a33j5a/a-redditor-archived-n...

ASalazarMX2y ago

This is Hacker News after all, not Hack-The-World News.

iLoveOncall2y ago

> I don’t blame GH at all.

I don't really see anyone blaming GitHub, not even the original post, I'm not sure why all the responses here are insinuating that?

rafark2y ago

This is why we can’t have nice things.

frankreyes2y ago

Yes.

hayd2y ago

And used Github actions to do the (infinite loop?) compute.

housemusicfan2y ago

So basically this:

https://www.youtube.com/watch?v=1kzb6uf0U0k

lucb1e2y ago

Video showing Simpsons going to an 'all you can eat' and staying until after closing time still gobbling down food without end, owner has homer thrown out.

pragmatick2y ago

This is the most blatant case of false advertising since my suit against the movie The Neverending Story.

ranting-moth2y ago

More correct title would be "GitHub stopped abuse after 22M commits".

There is absolutely nothing wrong with GH stopping that and it's very wrong to insinuate otherwise like OP is doing.

Wouldn't be surprised if GH would permaban him.

I don’t think the author is trying to insinuate that GitHub is in the wrong in any way. They explicitly say they understand the decision, and anticipated that it would happen.

I don’t want to quibble with the term “abuse”, because I think in this scenario it depends on whether intent is a factor and whether we should trust their stated intent. But depending on how you look at it, GitHub would be just as likely to benefit from hiring the author as they would from banning.

ranting-moth2y ago

Load testing someone else's system resulting in a manual staff intervention due to potential system destabilization at 6am?

It's a wordplay to call that anything else than abuse.

adamckay2y ago

> GitHub would be just as likely to benefit from hiring the author as they would from banning

For what purpose?

Creating an infinite loop that updates a file and commits it is hardly worthy of a job offer.

apetresc2y ago

Hire them? Why? There’s nothing technically clever or novel here. Anyone can create a shell script to generate random commits and push them. I’d bet even GPT-3.5 could handle that. Why should GitHub hire them?

femto1132y ago

Deliberately trying to create an extreme situation in order to find when/where/how a service breaks is inarguably "abuse" regardless of whether the intent was malign.

rat99882y ago

It is malicious as he knows he will harm the service to be able to draw whatever conclusion. This is not a case where the end justifies the means.

https://clintonwhitehouse3.archives.gov/WH/glimpse/president...

mabbo2y ago

The author used up so much of github's resources that it impacted other users. 22 million commits is probably enough that something started to hit a linear or n-log-n scaling function, setting off an alarm on some metric. Yeah, you get in trouble for that.

I'm reminded of a time in high school where my friend almost got himself banned from the school computers.

At home he had dial-up internet (it was 2003 and he lived in a very rural area). But at school he had megabits of bandwidth he could (ab)use. So he started pirating everything on the internet using a computer nobody ever used in a side-room of the library. It ran 24/7 downloading his long list of desires: games, movies, tv series, etc. He stored his spoils on his network drive, which had no limits on how much it could hold (until he got caught). He'd occasionally bring in a hard drive, copy everything that fit on it and bring it home with him on the school bus.

But all good things must end.

The network admin for the school board eventually came by and sat him down. He showed my friend a pie chart where, as he described it to me, "my name was on the portion that took up more than 2/3 of the pie". After a conversation, all the data got deleted, my friend got a stern warning, and somehow didn't get into any worse trouble than that.

Panzer042y ago

"somehow"

I don't get this attitude. Shit happens, we talk about it, we don't do it again. Not everything needs to have dire consequences.

flutas2y ago

I think he means "somehow" in the meaning of "somehow, none of the copyright holders asked the school for his information."

layer82y ago

> The author used up so much of github's resources that it impacted other users.

Note that the message only said “the potential to affect other users”. I would expect a professional service to catch such things before it actually affects other users.

justinclift2y ago

Sounds like the network admin and surrounding people had their heads screwed on properly. :)

GMoromisato2y ago

A long time ago, the math column in Scientific American decided to run a contest. It asked readers to send a post card with the biggest number they could think of. Whoever came up with the biggest number would win $1 million--divided by the winning number.

The editor of the magazine almost stopped the contest because he worried that someone might actually win real money and the magazine would be on the hook. But the author reassured him: human nature being what it is, the winning number is going to be not only larger than 1 million, but much larger than you can imagine.

And so it was. The winning number was (IIRC) some tower of exponentials that would take most of the universe to write out as decimal digits. The SciAm budget was safe.

If readers had coordinated somehow, they could have won a million dollars from SciAm and divided it among themselves. They might have made a hundred dollars each. But the author knew that such coordination would be impossible. Human nature would not allow it. Someone, somewhere, was going to send in a ridiculously large number to win. Classic Prisoner's Dilemma.

The GitHub case is the same. Human nature being what it is, someone, somewhere is always going to try to push the limits. As the developer of a SaaS development platform, this is something I'm taking to heart.

LouisSayers2y ago

The biggest number I can think of is 0.001 :D

They could have been in quite some trouble!

pritambaral2y ago

In the famous words of Calvin Coolidge, "you lose".

throwuxiytayq2y ago

This feels slightly malicious, but I can’t help but admire the curiosity that takes someone to actually see what happens if. That said, now we know, so nobody else needs to bother GitHub engineers by doing this again, hopefully.

lucb1e2y ago

Not like it would take a lot of code to check on push if the repository has more than 10k commits per day since its creation date or something, to stop such abuse. Doesn't thwart existing repositories with millions of commits (Linux is at ~2M) and gives time to formulate a long-term plan for what's allowed and what's paid or just disallowed.

So even if people were to try, I don't see that being a big bother. Not that it's not malicious to do this now

thewataccount2y ago

I don't think you can limit pure commit counts though, because you can push many commits/massive history changes in one go.

Monorepo's in particular could be impacted

blowski2y ago

I’m surprised at the reaction in these comments. Somebody curiously pushing the limits of a service to see what would happen is very much in the spirit of all hackers. Meanwhile, GitHub responded appropriately, and his write up agrees.

guraf2y ago

The reactions were predictable when you consider:

1. Some HN users might/could have been personally inconvenienced by OP's action and they prefer resenting him rather than GitHub for whatever reason

2. Many HN users get paid a lot to work on SaaS themselves, so seeing a peer (however big it is) get abused for (what appears to be) entertainment is terrifying to them

Panzer042y ago

It's a bit odd how hostile everyone here is acting. Sure, it's a bit silly, but hardly worthy of the kind of vitriol directed towards his "abuse".

Mystery-Machine2y ago

Someone potentially taking the service down for everyone, you know, just out of curiosity. Which part of this curiosity you need GitHub for? I'm curious how well GitHub handles DDoS attacks, what's their limit. Let's DDoS and find out, it will be fun!

SparkyMcUnicorn2y ago

> Someone potentially taking the service down for everyone, you know, just out of curiosity.

I think this is exactly why it's great, and it's basically turned into a GitHub advertisement. Either GitHub is simply unable to handle weird abuse methods and/or the abuse prevention is improved.

As an enterprise, wouldn't it be a bit concerning if your git host was unable to function (or respond appropriately) when presented with a random script kiddie?

This person didn't have bad intentions, but other people out there most definitely do.

ryaneager2y ago

You really think one lone repo could take down all of GitHub? If GitHub doesn’t have stops in place to prevent that then they honestly deserve it.

2h2y ago

because he is pushing the limits of a public service thats used by millions of people every day. the BEST CASE is basically what happened, GitHub finds out and disables the repo. worst case is he takes down the entire GitHub site and gets permanently banned.

dont fuck with shit I use.

ryaneager2y ago

Do you think GitHub’s architecture is so bad that one person can take it all down by committing to a single repo?

iBotPeaches2y ago

> I decided to see how many commits GitHub (and git) could take before acting kind of wonky. At ~19 million commits (and counting) to master: it’s wonky.

This just doesn't seem right to me. Why? Its obvious at some point you'll harm the service. If the goal was to test it, why not try locally with git.

xigency2y ago

A good lesson to learn - If you as a service owner aren’t testing the limits to the point of failure and enforcing sensible guardrails around that, then some random user eventually will.

TechBro86152y ago

GitHub offers the service for free and doesn't publish or enforce any specific limit on number of commits. I see nothing wrong with a user pushing as many commits to it as possible. It's not his problem when to stop it.

This is also how I feel about the Tor project getting their knickers twisted over people who do research on the live network. If the network can't handle it, then it's not resilient to attack. Asking people nicely not to do stuff that degrades your product will not make the product suddenly anti-fragile.

adamckay2y ago

It's this kind of attitude that's why we can't have nice things, though.

A service is offered for free, with no documented limits or restrictions, so you push the service to its breaking point... Just to see what happens?

lucb1e2y ago

> why not try locally with git.

Because you can't. GitHub is not open source, you'd need to steal the source code to try it locally. This comment is for educational purposes only, not trying to give OP ideas!!1

But you're right in spirit of course. Would be more interesting to install Forgejo/Gitea, GitLab, GitWeb, gitolite, TortoiseGit, etc., test them on various limits, and write that up in a nice blog post for magic internet points.

js22y ago

> "GitHub (and git)"

The "(and git)" portion can of course be tested locally. What OP will find out is that there is no more inherent limit on the number of commits in a repo than there is an inherent limit in the number of nodes in a linked list.

You can go on forever till you run out of disk space. Possibly repacking will eventually require more than available memory.

guraf2y ago

Testing git, which was a stated goal, could have been done locally.

It's obvious that the author is lying about that part, he only wanted to push GitHub to its limit, but he did say git:

> I decided to see how many commits GitHub (and git) could take before acting kind of wonky. At ~19 million commits (and counting) to master: it’s wonky.

RexM2y ago

git runs outside of GitHub, which is what the comment you responded to was saying.

Test the behavior of git locally, without testing GitHub.

https://github.com/MicrosoftDocs/azure-docs

ronsor2y ago

You can download GitHub Enterprise Server for free.

layer82y ago

> Its obvious at some point you'll harm the service.

That’s not obvious at all. One would expect a professional service to have limits in place to prevent any negative impacts.

BeefySwain2y ago

Sidestepping all of the ethical questions of embarking on this "research", I'm surprised the number was that low.

Linux[0] itself has about 1.2 million commits, so apparently Linux is within an order of magnitude of bringing GitHub to it's knees?

[0] https://github.com/torvalds/linux

eddythompson802y ago

Microsoft’s azure docs repo has 1.1M commits, and it’s many gigabytes big. I made the mistake of trying to clone it to fix an issue in the docs I ran into. Ended up just editing it on GitHub because fuck that.

vinyl72y ago

You can clone a few latest commits

  git clone -–depth [depth] [remote-url]

https://github.com/orgs/Homebrew/discussions/226

metabagel2y ago

I think it’s a rate issue, not the number of commits.

aloer2y ago

iirc remember some years ago the homebrew repo caused too much load due to their architecture where every client would pull on install or update. Or something like that.

Part of the GitHub response afaik included the info that they went as far as they could with dedicated and beefier servers but asked for a software fix.

I would think that if GitHub anticipates a normal repo growing this large they can give it the special treatment

jwilk2y ago

tikhonj2y ago

There's a rough rule of thumb that you should expect to redesign your system to handle each order of magnitude increase in scale, and I figure it applies here too—gracefully handling that size of repo would require substantial engineering work, and they have plenty of time to handle it before human-oriented open source repos get even close to the current limit.

lucb1e2y ago

I'm not sure redesigns were necessary between going 1 to 10, from 10 to 100, from 100 to 1000, from 1000 to 10'000, from 10'000 to 100'000, or from 100'000 to 1000'000 which we're now at. It sounds like a sensible engineering rule, but I'm not sure it translates to software, or at least not in this case. I don't know of any design changes made to Git since it was first created, there's no v1 and v2 repositories for example.

> There's a rough rule of thumb that you should expect to redesign your system to handle each order of magnitude increase in scale

I rather know the rule: by good engineering, you can modify a system to handle a one magnitude increase with respect what it was designed for. As soon as a two magnitude increase can occur, you better redesign the system.

Kwpolska2y ago

> I’ve also asked if they can re-enable it so I can give one more commit to say the final results on the readme then (public) archive it.

Entitled much? The author should be happy GitHub didn't just ban them for violating the ToS and intentionally trying to break things.

They asked. They didn’t demand, and they seem prepared to accept whatever GitHub decides. If I were fielding that request, I’d certainly grant it—on the condition that any deviation from the stated intent would indeed result in a ban—purely on the basis that it’s a ~free QA contribution and postmortem.

Kwpolska2y ago

Keeping the repository, even as a public archive, would still require a lot of resources on GitHub's side. The only fair thing to do here would be to apologize and ask for the repo to be deleted.

https://web.archive.org/web/20230702211636/https://programmi...

medellin2y ago

The mindset around programming and exploration in general is in a sad state. I don’t understand why we have so much hate here for things like this. Better that someone like this find it then someone who noticies it and spins up 1000s of repos to do the exact same thing.

I think the sentiment here shows the current state that software engineering has devolved into. It’s a 9-5 where you put in minimal work and get mad when someone breaks your system because you might have to do an hour of work to fix it on your weekend.

Arainach2y ago

"Devolved" implies a negative connotation, but this is a positive evolution.

flusensieb2y ago

antimora2y ago

Here is another GH abuser I found recently: https://github.com/eemailme

This account basically subscribes to thousands of repositories and monitors all activities. I am suspecting this account is harvesting user activities. I am not sure why GitHub allows this type of data harvesting.

kjs32y ago

When I was an junior admin in college, there was always at least one kid a trimester who 'experimented' with a fork-bomb on one of the shared Unix servers, and was shocked to learn that there are things you can do that you really shouldn't do. Same thing.

stiwari2y ago

I'm surprised that a lot of users here are telling OP that he was wrong. OP was well within his rights to do this, as his intention was to stop when any impact is observed, not continue with it. It is within their rights to test the system they want to use to make sure their requirements are met.

To be honest, this is why companies also should not discourage this. Imagine if a malicious group did it with multiple users at the same time. At least now they will have pro active alarms for it.

TacticalCoder2y ago

Did the dude bring GH down at some point?

chris_wot2y ago

So basically, this guy is trying a DoS of GitHub. To hell with him.

j / k navigate · click thread line to collapse

130 comments

MBCook2y ago

So the author was purposefully trying to do the most extreme thing they could to see how git/GitHub act/break.

I don’t blame GH at all.

Source: https://web.archive.org/web/20230702215522/https://sh.itjust...

> So the author was purposefully trying to do the most extreme thing they could to see how git/GitHub act/break.

This is Hacker News. Hacking is about using, in particular, technology in surprising ways that were not intended by the creators.

jstummbillig2y ago

No harm, no foul, because, what, it says "Hacker" in the title? I don't know, are we 12?

The reason that hacking is even a thing: It's actually possible to break things in a responsible, non-destructive way (in contrast to most things in the physical world).

If we skip the responsible part, we are just... breaking things and incurring costs. Why should that be okay?

csiegert2y ago

It’s GitHub, not HackerHub. That the story is reported on Hacker News is irrelevant.

https://www.vice.com/en/article/a33j5a/a-redditor-archived-n...

ASalazarMX2y ago

This is Hacker News after all, not Hack-The-World News.

iLoveOncall2y ago

> I don’t blame GH at all.

I don't really see anyone blaming GitHub, not even the original post, I'm not sure why all the responses here are insinuating that?

rafark2y ago

This is why we can’t have nice things.

frankreyes2y ago

Yes.

hayd2y ago

And used Github actions to do the (infinite loop?) compute.

housemusicfan2y ago

So basically this:

https://www.youtube.com/watch?v=1kzb6uf0U0k

lucb1e2y ago

Video showing Simpsons going to an 'all you can eat' and staying until after closing time still gobbling down food without end, owner has homer thrown out.

pragmatick2y ago

This is the most blatant case of false advertising since my suit against the movie The Neverending Story.

ranting-moth2y ago

More correct title would be "GitHub stopped abuse after 22M commits".

There is absolutely nothing wrong with GH stopping that and it's very wrong to insinuate otherwise like OP is doing.

Wouldn't be surprised if GH would permaban him.

I don’t think the author is trying to insinuate that GitHub is in the wrong in any way. They explicitly say they understand the decision, and anticipated that it would happen.

ranting-moth2y ago

Load testing someone else's system resulting in a manual staff intervention due to potential system destabilization at 6am?

It's a wordplay to call that anything else than abuse.

adamckay2y ago

> GitHub would be just as likely to benefit from hiring the author as they would from banning

For what purpose?

Creating an infinite loop that updates a file and commits it is hardly worthy of a job offer.

apetresc2y ago

femto1132y ago

Deliberately trying to create an extreme situation in order to find when/where/how a service breaks is inarguably "abuse" regardless of whether the intent was malign.

rat99882y ago

It is malicious as he knows he will harm the service to be able to draw whatever conclusion. This is not a case where the end justifies the means.

https://clintonwhitehouse3.archives.gov/WH/glimpse/president...

mabbo2y ago

I'm reminded of a time in high school where my friend almost got himself banned from the school computers.

But all good things must end.

Panzer042y ago

"somehow"

I don't get this attitude. Shit happens, we talk about it, we don't do it again. Not everything needs to have dire consequences.

flutas2y ago

I think he means "somehow" in the meaning of "somehow, none of the copyright holders asked the school for his information."

layer82y ago

> The author used up so much of github's resources that it impacted other users.

Note that the message only said “the potential to affect other users”. I would expect a professional service to catch such things before it actually affects other users.

justinclift2y ago

Sounds like the network admin and surrounding people had their heads screwed on properly. :)

GMoromisato2y ago

And so it was. The winning number was (IIRC) some tower of exponentials that would take most of the universe to write out as decimal digits. The SciAm budget was safe.

LouisSayers2y ago

The biggest number I can think of is 0.001 :D

They could have been in quite some trouble!

pritambaral2y ago

In the famous words of Calvin Coolidge, "you lose".

throwuxiytayq2y ago

lucb1e2y ago

So even if people were to try, I don't see that being a big bother. Not that it's not malicious to do this now

thewataccount2y ago

I don't think you can limit pure commit counts though, because you can push many commits/massive history changes in one go.

Monorepo's in particular could be impacted

blowski2y ago

guraf2y ago

The reactions were predictable when you consider:

1. Some HN users might/could have been personally inconvenienced by OP's action and they prefer resenting him rather than GitHub for whatever reason

2. Many HN users get paid a lot to work on SaaS themselves, so seeing a peer (however big it is) get abused for (what appears to be) entertainment is terrifying to them

Panzer042y ago

It's a bit odd how hostile everyone here is acting. Sure, it's a bit silly, but hardly worthy of the kind of vitriol directed towards his "abuse".

Mystery-Machine2y ago

SparkyMcUnicorn2y ago

> Someone potentially taking the service down for everyone, you know, just out of curiosity.

I think this is exactly why it's great, and it's basically turned into a GitHub advertisement. Either GitHub is simply unable to handle weird abuse methods and/or the abuse prevention is improved.

As an enterprise, wouldn't it be a bit concerning if your git host was unable to function (or respond appropriately) when presented with a random script kiddie?

This person didn't have bad intentions, but other people out there most definitely do.

ryaneager2y ago

You really think one lone repo could take down all of GitHub? If GitHub doesn’t have stops in place to prevent that then they honestly deserve it.

2h2y ago

dont fuck with shit I use.

ryaneager2y ago

Do you think GitHub’s architecture is so bad that one person can take it all down by committing to a single repo?

iBotPeaches2y ago

> I decided to see how many commits GitHub (and git) could take before acting kind of wonky. At ~19 million commits (and counting) to master: it’s wonky.

This just doesn't seem right to me. Why? Its obvious at some point you'll harm the service. If the goal was to test it, why not try locally with git.

xigency2y ago

A good lesson to learn - If you as a service owner aren’t testing the limits to the point of failure and enforcing sensible guardrails around that, then some random user eventually will.

TechBro86152y ago

adamckay2y ago

It's this kind of attitude that's why we can't have nice things, though.

A service is offered for free, with no documented limits or restrictions, so you push the service to its breaking point... Just to see what happens?

lucb1e2y ago

> why not try locally with git.

Because you can't. GitHub is not open source, you'd need to steal the source code to try it locally. This comment is for educational purposes only, not trying to give OP ideas!!1

js22y ago

> "GitHub (and git)"

You can go on forever till you run out of disk space. Possibly repacking will eventually require more than available memory.

guraf2y ago

Testing git, which was a stated goal, could have been done locally.

It's obvious that the author is lying about that part, he only wanted to push GitHub to its limit, but he did say git:

> I decided to see how many commits GitHub (and git) could take before acting kind of wonky. At ~19 million commits (and counting) to master: it’s wonky.

RexM2y ago

git runs outside of GitHub, which is what the comment you responded to was saying.

Test the behavior of git locally, without testing GitHub.

https://github.com/MicrosoftDocs/azure-docs

ronsor2y ago

You can download GitHub Enterprise Server for free.

layer82y ago

> Its obvious at some point you'll harm the service.

That’s not obvious at all. One would expect a professional service to have limits in place to prevent any negative impacts.

BeefySwain2y ago

Sidestepping all of the ethical questions of embarking on this "research", I'm surprised the number was that low.

Linux[0] itself has about 1.2 million commits, so apparently Linux is within an order of magnitude of bringing GitHub to it's knees?

[0] https://github.com/torvalds/linux

eddythompson802y ago

vinyl72y ago

You can clone a few latest commits

  git clone -–depth [depth] [remote-url]

https://github.com/orgs/Homebrew/discussions/226

metabagel2y ago

I think it’s a rate issue, not the number of commits.

aloer2y ago

iirc remember some years ago the homebrew repo caused too much load due to their architecture where every client would pull on install or update. Or something like that.

Part of the GitHub response afaik included the info that they went as far as they could with dedicated and beefier servers but asked for a software fix.

I would think that if GitHub anticipates a normal repo growing this large they can give it the special treatment

jwilk2y ago

tikhonj2y ago

lucb1e2y ago

> There's a rough rule of thumb that you should expect to redesign your system to handle each order of magnitude increase in scale

Kwpolska2y ago

> I’ve also asked if they can re-enable it so I can give one more commit to say the final results on the readme then (public) archive it.

Entitled much? The author should be happy GitHub didn't just ban them for violating the ToS and intentionally trying to break things.

Kwpolska2y ago

Keeping the repository, even as a public archive, would still require a lot of resources on GitHub's side. The only fair thing to do here would be to apologize and ask for the repo to be deleted.