You use VC money to run at a loss while focusing on marketing and tech evangelism, getting more and more startups (and, hopefully, established companies) using your software. As the cracks begin to show, those growing organizations have too much tied to your system; they can't afford outages and need to scale. So they pay you for the Enterprise version of your software, where you actually fix all of the flaws present in the community version.
Look at MongoDB if you need a good case study. It was incredibly hyped from about 2009-2015, people would defend it in heated online arguments, and today it's rarely considered for greenfield projects. But they're making about $100M/qtr selling subscriptions to Enterprise & Atlas, servicing the technical debt established during that hype cycle.
It's not just Docker Hub; there are services like the various programming-language package repos (npm, RubyGems, etc.) and the Linux distro package repos.
I would have put GitHub in that category, but now it's owned by MS; presumably they don't have those kinds of funding problems...
2) You can use a different registry
3) Run something like kraken[1] so machines can share already-downloaded images with each other
4) If you need an emergency response, you can docker save[2] an image on a box that has it cached, then manually distribute it and load it into other boxes
0: https://docs.docker.com/registry/recipes/mirror/
1: https://github.com/uber/kraken
2: https://docs.docker.com/engine/reference/commandline/save/
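The pull-through mirror in [0] is mostly one line of daemon config. A minimal sketch of `/etc/docker/daemon.json` (the mirror URL is a placeholder for wherever you host a `registry:2` instance configured with `proxy.remoteurl` pointing at Docker Hub):

```json
{
  "registry-mirrors": ["https://registry-mirror.internal:5000"]
}
```

After restarting the daemon, pulls of Hub images try the mirror first and fall back to Docker Hub if the mirror is unavailable, so you get caching without a new single point of failure.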
I'd also add as an option - https://goharbor.io
If you can't build and deploy a new version of your app, you can probably live with it and grab a cup of coffee.
If your server fails over and the new server can't pull the current image, your app is potentially down, and that's a lot worse.
The math here is the cost of wasted time versus the cost of running your own registry with better uptime.
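That's a back-of-envelope calculation you can actually write down. A sketch, where every number is a made-up placeholder to be replaced with your own:

```python
# Back-of-envelope: cost of registry outages vs. cost of self-hosting.
# All figures below are hypothetical placeholders -- plug in your own.

outages_per_year = 4            # Hub outages that actually block a deploy
engineers_blocked = 3           # people stuck waiting on a pull
hours_lost_per_outage = 2.0
loaded_hourly_rate = 100        # USD per engineer-hour

wasted_time_cost = (outages_per_year * engineers_blocked
                    * hours_lost_per_outage * loaded_hourly_rate)

registry_vm_per_month = 50      # small VM + storage for a private registry
upkeep_hours_per_month = 1      # patching, pruning old images
own_registry_cost = 12 * (registry_vm_per_month
                          + upkeep_hours_per_month * loaded_hourly_rate)

print(f"wasted time:  ${wasted_time_cost:,.0f}/yr")
print(f"own registry: ${own_registry_cost:,.0f}/yr")
```

If a failover can leave production down (the scenario above), add an estimate of downtime cost to the left-hand side; that's usually what tips the math toward self-hosting.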
Seems like the tech giants should load balance these images for the good of the Internet to provide some decent redundancy and for my sanity at 11.30pm.
Partial failure is just a fact of life. If this is a major issue for your process, it might be better to find ways to alter your process so it isn't. Alternatively, mirror locally.
Being honest, no build is worth losing sleep over. We are piggybacking on their service and bandwidth. For us to start building the infrastructure to cache their images doesn't make financial sense; we deploy daily, and their uptime always allows for that.
Rule #1: Host your own stuff, never rely on others.
Rule #2: Automate everything.
Launched a new one... docker pull, bam, error. Customer unsatisfied.
Incident Status: Full Service Disruption