Always use your own proxy where the egress is well within your free tier, i.e. do this: Azure|Amazon > Hetzner|Linode > Cloudflare
Why?
Because Cloudflare's cache is a massively multi-tenant LRU cache, and whilst hot files will be cached well (and with Cloudflare Tiered Cache even better - but this itself is a cost), anything else is still going to expose you to some degree of egress cost.
When I exposed AWS to the web I paid $3k per month to AWS. With Cloudflare in front of AWS I paid $300 per month to AWS. With Linode in front of AWS and behind Cloudflare I paid $20 per month to Linode and about $12 per month to AWS.
A Linode or Hetzner instance... or any other dumb cheap web server that comes with a healthy free tier of bandwidth is all you need to set up a simple nginx reverse proxy and have it cache things to disk https://docs.nginx.com/nginx/admin-guide/content-cache/conte...
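For what it's worth, the disk-cache setup those docs describe boils down to a handful of directives. A minimal sketch - hostnames, paths, zone name and sizes here are all hypothetical:

```nginx
# cache zone: metadata in RAM (10m), files on disk, evicted after 7 idle days
proxy_cache_path /var/cache/nginx levels=1:2 keys_zone=origin_cache:10m
                 max_size=50g inactive=7d use_temp_path=off;

server {
    listen 80;  # a real setup would add TLS here
    server_name files.example.com;

    location / {
        proxy_cache origin_cache;
        proxy_cache_valid 200 7d;                      # keep good responses for a week
        proxy_cache_use_stale error timeout updating;  # serve stale if the origin flaps
        proxy_pass https://origin.example.com;         # your S3/Azure origin
    }
}
```

Point DNS (or Cloudflare) at this box and the origin only sees cache misses.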
Or if caching is your biggest priority then Fastly or Akamai will shine too.
But if you're balancing all considerations and want the cheap "good enough" caching with the DDoS protection, free TLS certs, and unmetered bandwidth (assuming you aren't imgur or something)... then Cloudflare does a great job at being good enough. And for those sharp edges... drop in a proxy of your own, or layer your CDNs.
Why not directly Hetzner|Linode > Cloudflare?
I hate these 'cloud economics' optimizations that people tend to try.
(search HN and reddit for that URL, you'll see they've been around and recommended for a really long time).
see https://blog.cloudflare.com/introducing-r2-object-storage/
From the Cloudflare blog, it seems R2 would've handled this exact situation - auto-migration of cloud S3-like-storage objects - download from cloud-storage just once and cache in R2 for Cloudflare to serve.
Would love to find out if you can write to any/every region and have things replicate, or if you have to write to a single region. BunnyCDN's edge storage solution looked interesting until I found out it only supported writes to a single region.
Hoping R2 might be my savior here, otherwise will probably have to roll my own active-active minio cluster, which I'm not looking forward to maintaining. Other suggestions welcome!
The team I'm in at the moment is in the early stages of cloud adoption, but the company as a whole has fallen hook, line and sinker for AWS. Whenever I mention the cost there is always an excuse.
The main one being that you don't have to hire sysadmins anymore as that's all taken care of by AWS. Ah yes, but they have actually been replaced with a "DevOps" team, and just our department now spends > £1 million per year on AWS hosting costs. A 20% reduction in those fees could pay for a few sysadmins.
The next one is that no other vendor would be able to supply the kit. You know StackOverflow is able to run on a single webserver (https://nickcraver.com/blog/2016/02/17/stack-overflow-the-ar...). Plus many of the other providers have loads of instances available.
I mean I'm not against cloud it's just not the cheapest option if you choose one of the big 3 providers. I use a company called scaleway (https://www.scaleway.com/en/) they have all the essential cloud services you need and everything else you can run yourself in docker or k8s.
Dealing with hardware failures, hardware vendors, confusing licensing, having to know SKUs, racking new cabinets, swapping hard drives, patching servers - it's all awful work. When you go cloud only, you can be more productive instead of dealing with some of that nonsense work.
And, honestly, I miss the old days. Today, $cloud has some weird spasms where you suddenly get an influx of connection timeouts or tasks waiting for aeons to get scheduled and you just can't log in to a switch or a machine and figure out what the exact hell is going on. You just watch the evergreen $cloud status page, maybe file some tickets and pray someone bothers to investigate, or maybe live with those random hiccups "sorry $boss, everything is 100% good on our side, it's $cloud misbehaving today", adding more resilience -> complexity -> unreliability in the name of reliability to the system. Either way, with the clouds I feel handicapped, lacking the ability to diagnose things when they go wrong.
I don't miss those three days we spent fighting a kernel panic. It was about a decade ago - we outgrew the hardware and had to get a new machine with a badass-at-the-time 10GB SFP+ NIC that worked nicely for the first few weeks, but then its driver suddenly decided to throw tantrums on an almost hourly basis. I don't even remember the details - a lot of time has passed since then - but thankfully we found some patch in the depths of the LKML and the server ran like clockwork ever since. That wasn't fun, but it was a once-in-many-years incident.
Either way, I do feel that in the ancient ages hardware and software used to be so much simpler and more reliable. Like, today people start with those multi-node, high-availability, all-the-buzzwords Kubernetes-in-the-cloud monstrosities that still fail now and then (because with so many moving parts, shit's just bound to fail at an incredible rate), and in the good old days people somehow managed with a couple of servers in the rack - some proper, some just desktop towers sitting by - and with some duct tape and elbow grease those ran without incident for years and years.
Have I turned old and sour? Or maybe it's just nostalgia for my youth, and I've forgotten or diminished most of the issues while warmly remembering all the good moments?
If cloud improved QOL for ALL employees I'd agree but I think it just shifts work around and costs more.
I've met plenty of datacenter technicians who loved their work and the opportunities for growth it provided.
Some companies really know how to manage a datacenter with minimum pain. Some don't.
Each to their own, but I think you'll find there's a fairly significant portion of sysadmins who love that work!
It's basically a form of permanent debt. Faster product market fit, higher long term infrastructure costs until you have enough breathing room to start pulling it into your own datacenter. At that point you have some negotiating leverage with the cloud provider.
On the other hand, if you're not looking for explosive growth, man oh man is DigitalOcean (or any one of a number of good providers of good old VPSes / Cloud-lite) a great option.
I've worked with teams on both sides, and everyone is gonna have to deal with figuring out how to run at scale, it's just different ways of achieving that.
I've worked with teams that manage their own infrastructure with dedicated servers, and not having to think about scaling for a long time as the one beefy server could just take whatever load you threw at it.
I've also worked with teams who don't manage their own infrastructure and thought they were ready to scale without issues, but once the scale actually happened, it turned out there were more things to consider than just the number of servers you run - race conditions were everywhere, but no one had thought about that.
Definitely a case of "right tool for the right job", but I don't think it's as easy as "Self-managed: harder to scale, PaaS/Cloud: easy peazy to scale".
For ~100eur/month on Hetzner you can get a 16-core Zen 3 with 128GB RAM and 8TB of NVMe SSD.
Unless your stack is horrendously badly optimised you can serve SO MUCH traffic off that - definitely billions of postgres records without breaking a sweat.
So the scale argument somewhat disappears - if anything, people end up adding much more complexity to the product to get round the high hardware costs of the cloud (complex caching systems for example, instead of just throwing loads of hardware at the problem).
Actually, AWS won't help you here. I have literally been on a 2-day training course on Aurora with AWS, and the explanation of how to scale was exactly the same as any traditional non-cloud explanation: correct usage of indexes, partitioning data, optimising queries (especially any non-trivial query output by an ORM) and read replicas.
In terms of explosive growth, if you're talking about something like Google or TikTok, again, slapping it all in AWS will not automatically just work. There is a lot of engineering you'll need to get to their level.
I also think you haven't really looked at the SO link I sent through; with thoughtful engineering they serve a huge user base with a tiny footprint.
> DigitalOcean or anyone of a number good providers of good old VPSes / Cloud-lite
Not sure why you are dunking on DO here; they are a fully fledged cloud provider with much the same stuff you would need. You can also run up a huge bill on DO.
Depends on the team size of said startup [0]. In my opinion, tech-shops are better off using new-age cloud providers like fly.io / glitch.com / render.com / railway.app / replit.com / deno.com / workers.dev etc [1].
[0] https://tailscale.com/blog/modules-monoliths-and-microservic...
Most of the problems here will be DBA problems, like understanding query plans and such. Even with AWS RDS, I've had to upload various settings files to tweak tunables to get things working.
They still serve a lot more traffic than I do and I have hundreds of instances; thousands of containers.
A similar “scale” e-commerce site would be significantly more load, have more dynamic data, and just be overall harder to run.
You can hire a "few" sysadmins for 200k/year?
https://uk.indeed.com/jobs?q=System%20Administrator&vjk=5149...
Probably not at FAANG level salaries but I doubt there are many sysadmins working for FAANG companies anymore.
DevOps, btw, are more expensive, and in fact in the UK DevOps can be paid more than a developer. I suspect most of the DevOps working for this company are on £65k+. According to:
https://ifs.org.uk/tools_and_resources/where_do_you_fit_in
That puts those earners in the top 3%, or, from that website:
" In the below graph, the alternatively shaded sections represent the different decile groups. As you can see, you are in the 10th decile group.
In conclusion, Your income is so high that you lie beyond the far right hand side of the chart. "
I'm curious about your workload. I tend to only use cloud for workloads where it's either (1) by far the only feasible option (e.g. need GPUs for short periods of time), or else (2) basically free.
> I mean I'm not against cloud it's just not the cheapest option
This is certainly true for most workloads. It's also true that buying is better than renting, but here I am living in a rented apartment.
The logic from on high might be something like "if demand is uncertain and capex is risky, why buy when you can rent?"
Exactly this. As a low-level / embedded / non-cloud stuff dev, I've been getting up to speed through all the cloud-ification of the industry, but I'm still scared (not literally ofc) of running most things on my own on any big cloud provider (smaller ones seem more manageable).
I'm reading this and it seems like being a customer of cloud services is like walking a dangerous path filled with gotchas and caveats, just jumping from cover to cover while hiding from danger, and hoping you're safe and didn't mess it up so far, "fingers crossed".
Like this tiny detail that he didn't realize was critical, so I would fall for it too, plus another 500 small papercuts: "oh, I set the cache up, so I hope all is well". "Yeah, no you didn't - I guess you didn't think of this detail about maximum cached file size! Gotcha, game over!"
Yeah, cloud providers should have clearer communications etc., but the fact today is that they don't. So I'd never sleep well feeling 100% confident that I had covered and taken into account every minuscule detail and possible scenario that could end up being a disaster.
Another advantage is the big network he can ask for help. There's also a chance that his blog post will reach the right person at Azure and he'll get a reduced bill.
As someone who doesn't have the same network or the "fame", I am concerned about what would have happened to me in that situation.
I am still waiting for a cloud without these dark patterns. But that will never happen, because not being hostile means leaving a large amount of money on the table.
As you say they make it hard deliberately.
Edit: Turns out Azure has this:
https://docs.microsoft.com/en-us/azure/cost-management-billi...
"Predatory death-trap pricing" captures the spirit of the thing with rather more clarity. It is wholly intentional after all.
Funny enough... Oracle (OCI) does this better: you can buy Oracle "coins" 1:1 with $ and load your account with just what you think you need.
This is how mobile and landline phone companies made enormous fortunes before flat rate billing. It’s called post-paid vs pre-paid billing.
This is very straight forward from their view, before: almost no traffic = almost no costs, now: huge traffic = $$$.
On the other hand, it doesn't seem that Troy tried to talk to them about this; he seems to want to eat the costs himself, as it was his mistake. I think that's commendable. I also think, with the amount of free advertising Troy has done for them, they'd be open to it, and I can imagine we might see a follow-up post like "MS was so nice they waived my costs".
Maybe it's not a problem when you're dealing with millions of VC money, but there's no way in hell I would host anything in a bandwidth-metered cloud service when my or my own company's money is involved.
Eh, I don't know - he is, after all, a Microsoft Regional Director and MVP for Cloud (as well as security), runs courses on cloud deployment on Pluralsight, and has given talks on Azure and reducing cloud bills, so if he can get stung, it doesn't say a whole lot of good about my chances.
There are both spending limits and alerting that you could use, but the right values would be impossible for Azure to predetermine, so they rightly ask you to set them yourself.
The result is that you get a lowest common denominator type of dashboard. And hence a whole industry of providing just a prettier dashboard on top of AWS / GCP / Azure metrics.
Datadog started with a prettier dashboard for Cloudwatch data.
Cloudability started with a prettier dashboard for the Cost and Usage Report.
And also works the other way around. The individual product teams buy development environments to circumvent the console restrictions.
For example, a few years ago, the Redshift team purchased "DataRow".
From TFA it looks like that would be 10 cents per "time series". Or, as I translate it, 10 cents every 5 minutes (I think, but I haven't used Azure in some time). $1.20/hour, $28.80/day, almost $900/month. Not too hard to drop that by making the alert less frequent. (edit: I think I saw AU$ there, so maybe it is AU$900.)
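For what it's worth, that back-of-envelope math works out; a tiny script, assuming (as this comment does, and it may not match actual Azure Monitor pricing) $0.10 per 5-minute evaluation:

```python
# assumption from the comment above: $0.10 charged per alert
# evaluation, with the rule evaluated every 5 minutes
rate_per_eval = 0.10
per_hour = rate_per_eval * (60 // 5)   # 12 evaluations per hour
per_day = per_hour * 24
per_month = per_day * 30
print(per_hour, per_day, per_month)    # roughly $1.20, $28.80, $864
```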
Monitoring CPU? Another $0.10 per month. Memory? Another $0.10.
Thankfully, not $900.
As an aside, their (Azure's) pricing docs are written in the same fishy way their technical docs are written (my opinion only)...
The alert emails are way more meaningful (with the projected amount in the subject, for example) unlike the generic ones from Azure Alerts, so you see a real alert and are prompted to take immediate action.
1: https://cloudalarm.in/Home/Docs/#how-is-budget-alarm-differe...
Also, Azure has an option to alert you beforehand if it looks like you'll go over; struggling to see how your service is any better.
In a business setting, you want your service to stay up, at the cost of spike in costs if accidents or mistakes happen.
In a personal project, you want there to be hard limit on cost, and your service to go down if spikes call for it. (I'm relatively sure that no one wants their personal projects to incur a bill of thousands of dollars by accident.)
No you don’t. This is absolutely not a given. Being a “business” doesn’t mean you suddenly have unlimited budget.
The vast majority of businesses are not “web scale” and are better off taking an availability outage than suddenly handling 1,000,000x the normal volume of traffic.
If you are selling your product via your web site and you're suddenly on TV with millions watching and accessing your site, you definitely don't want the server to go down, and autoscaling plus a bit of extra cost would be great.
But it's also the case that if they did implement hard limits of some sort, you'd be reading blog posts about how AWS destroyed my project just when it was going big because someone stuck a circuit breaker foot gun in some corner and everything stopped working properly when usage spiked.
I do think there should probably be a hard circuit breaker. It should be simple and therefore inflexible. And it should come with a big warning sign. Still people will get burned because someone will set it, a project grows, and one day it goes off.
If you're using a cloud provider I'd highly recommend setting one of those up.
In Azure it's under your Subscription and then Budgets
I truly believe they want you to use a lot of their resources on a consistent, long-term basis; they don't get long-term value from people having short, one-off anomalies, so budgets and monitoring are aligned with their customers - just not total cost of ownership calculations :)
The fact is that there are low end VPS, middle end VPS, high end VPS, and dedicated servers. If you started from a low end VPS, it is very easy to gradually upgrade your VPS.
A $5/month VPS can be used to play with tons of things. I just don't get people who use free tier cloud, unless you just want to learn about the cloud hosting per se.
Making your application scalable is a significant effort that may involve different trade-offs. Your typical Prestashop or Magento e-commerce site will still max out the DB and go down, cloud or not, but with the cloud you'll end up with a huge bill in addition to your downtime.
Engineering your application to be scalable is an option that's often not made for cost/time to market reasons which is fine, but in this case the cloud will give you much less scalability than a lot of people believe.
Their interest is in keeping you as a long-term customer. So they will help you if they can. Unexpectedly high bills like that can end the relationship in no time. And 10K is not a lot on a yearly basis - that's a few months of normal usage for lots of companies. So protecting that revenue is worth something to them. That's also worth realizing when you deal with cloud providers: you are spending non-trivial amounts of money on their services, and support is part of that deal.
I've seen several cases on both Azure and AWS where bills got waived after someone opened a support ticket starting with "oops, I just did..."
He did not enable alerts.
The post applies to everyone and I’d second it. Ask nicely for a refund in these situations, the worst that can happen is they say no.
Where did they say that “only Troy Hunt shall receive a refund, for only Troy Hunt is a good faith actor, so say we all”?
Even if you run a relatively opaque cost structure business like a restaurant, you can still calculate the maximum cost of ingredients for one month, the salaries, energy, etc. if you simply use the "best case scenario" of having every seat at every table booked for all opening hours, with people ordering your most sold dishes. Cloud computing is still leagues above that in terms of cost predictability.
I once worked for a small, non-startup software company that pondered moving its servers to Azure. The Azure partner shop analysed the needs and came up with a monthly cost "between 30k and 120k per month". They were really surprised the company stuck with its non-cloud setup, because "everybody is moving into the cloud!!"
If the restaurant suddenly ordered ten thousand times more ingredients than usual, their supplier would probably call back and say "is that really what you want?" rather than just shrugging and shipping them tonnes of tomatoes with a bill for one billion dollars.
Though in those cases the billing isn't really complex or opaque, and you _can_ monitor it if you care to check your meter regularly throughout the month. But, for the electrical case anyway, you can't drill into what exactly is consuming watts without either fancy monitoring equipment or potentially tedious investigation.
In contrast, with the cloud the bill is directly proportional to the amount of inbound requests from the Internet, with no out of the box way to implement a limit (I guess you could install Apache/Nginx and enforce a limit there, but doesn't that kinda defeat the whole point of the cloud?).
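If you did go the Nginx route mentioned above, a per-client request cap is only a couple of directives. A sketch - zone name, rates and the upstream are all illustrative:

```nginx
# allow each client IP ~10 req/s, absorbing bursts of up to 20 requests
limit_req_zone $binary_remote_addr zone=per_ip:10m rate=10r/s;

server {
    location / {
        limit_req zone=per_ip burst=20 nodelay;
        proxy_pass http://backend;   # hypothetical upstream
    }
}
```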
Just ask Texans.
- when we were hit with very high traffic due to a bug or something else, most of the time it would lead to customer outages. Depending on the contract, we sometimes had to pay penalties because SLAs were not met. An outage could also lead to customers cancelling their subscriptions.
We swapped one type of problem for another.
But the overprovisioned server might still be a lot cheaper than the cloud bill. It can be totally reasonable to have a server running at 1-5% load 98% of the time if you really need the capacity for the remaining 2%.
Also, neither "scaling up" as in "re-deploying the same setup on a beefier instance" nor "scaling out" as in "let's expand to the US and have a server there" is too difficult if the setup is automated (Ansible).
Credit card chargebacks, especially.
The only thing that would really help is a hard spending limit that stops all services except storage. If your site is important, there will be such an amount of user feedback that it is impossible to miss it for long.
Or they can fail completely.
And the alerts themselves cost money if you want something reliable, so you have to weigh that against the danger. Pay-as-you-go cloud can be a maze of costing concerns.
> The only thing that would really help were a hard spending limit that stops all services except storage.
Yep. Though that is small comfort if you need to guarantee more than a couple of 9s of uptime; hopefully those with that requirement can soak up the unexpected billing blips.
Sadly, I haven't found a way to do that with AWS
Actions weren't there last time I checked (a few years ago).
No enterprise wants such limits on spend; they would be a lot more pissed if service was pulled because a spending cap set by someone sometime in the past was now exceeded. That's likely why such a feature is optional, not mandatory.
Excess/unexpected billing would be negotiated in typical sales-cycle discussions. A default hard cap, however, would result in a lot of senior people getting midnight calls for emergency budget approvals; management would get annoyed by that.
AWS has decent tools in this regard, but it pales compared to Oracle. Azure is a product I've never used at any scale (just small projects), but the fact that it actually costs money to set up alerts is gross (and morally reprehensible). Even if it's a trivial amount, that alone sours the product in my eyes. I mean, Azure is already pretty uncompetitive unless you're running on free credits, as Troy apparently is (purportedly some $13K per year, so unsure what the pitch for donations to cover a bill is about).
Oracle Cloud has an enticing free tier, but I'm too afraid to use it because it requires a credit card and I don't see any way to put a monthly cap on my budget. (I'm sure hobby projects with ~$5 - 10/month budgets isn't their target market, but I can dream :)
Edit to add the page I was reading: https://docs.oracle.com/en/cloud/get-started/subscriptions-c...
I think that the cloud provider business model that allows for uncapped maximum costs is a bit of a commercial dark pattern. What makes it somewhat more nefarious is that it is relatively easy to blame the customer.
I’m not surprised that the cloud providers are quick to refund users as it’s likely that they only do it in a fraction of cases and it buys a lot of goodwill.
It would be interesting to try to design a cloud that supports OutOfMoneyExceptions, with gradual degradation and capped liability for costs built in.
I don't actually believe so. Cloud providers are known to refund bills incurred by mistake. They make so much margins on legitimate usage by big companies & startups that it's just not worth burning developer goodwill & potentially waste efforts trying to collect a bill the customer legitimately can't pay (and will guarantee he will never use nor advocate for your service again).
https://azure.microsoft.com/en-au/pricing/details/bandwidth/
My conclusion: Troy still doesn't know how much he is paying.
Actually, wow, it seems AWS charges the same price for egress as Linode and DO. While Linode and DO do come with decent free bandwidth, this is a surprise to me.
AWS charges $0.09/GB, and Azure charges $0.0875/GB.
Maybe Troy Hunt gets a discount for being a Microsoft Regional Director and MVP. (Neither of which make him an employee of Microsoft, confusingly enough.)
https://docs.digitalocean.com/products/billing/bandwidth/
https://www.linode.com/docs/guides/network-transfer/
https://aws.amazon.com/ec2/pricing/on-demand/
https://azure.microsoft.com/en-us/pricing/details/bandwidth/
https://azure.microsoft.com/en-au/pricing/details/bandwidth/...
The AUD $0.014/GB is only for data transfer between Availability Zones.
When everything was moved to production, URL went live, nobody ever did any kind of bandwidth checking, caching, no CDN, no cost tracking. $10,000 in our first week. That's about 1/4 what our total spend on the co-located servers was for the whole year. Boss flipped his lid and wanted to kill the new guy who was on the project.
After about 2 years we got rid of all the co-located stuff and were spending about 1.5x, but we had more apps, they served heavier pages, etc.
We overspent quite heavily on our on-prem stuff for a game I helped launch, for political reasons the next game ended up running on the cloud.
The price was roughly 10x before discounts. With our heavy discounts and a wide amount of slimming down/cost optimisation (easily 3 months of work) we got it to 2.3x
There will always be a need for sysadmins/cloudops/devops for that environment, so we didn't save any headcount either.
I can't imagine getting anywhere close to parity in costs, Functions-as-a-service ended up costing more than compute instances too so we went back to compute instances in places where we thought we'd get away from it.
That said, it was a lot nicer to use!
It is important to remember that not all cloud providers play this game. For example, Hetzner Cloud explicitly states the maximum amount you are going to pay for a given instance or service in a given month. You are guaranteed not to pay more. Everybody knows why Amazon etc. refuse to do it this way.
"Your account has exceed $100 spend. Reply 'SHUTDOWN' to shutdown all services, 'STOP ALERTS' to never see this alert again, or 'DOUBLE TRIGGER' to double the alert trigger value to $200."
$100 is arbitrary, it could be any nominal sum. The idea being that the user can double the alert each time they get it just from SMS. I bet 95% of users would double their alert limit to a comfortable point. The other ~5% will be power users who customize their alerts.
The idea that these companies couldn't know what limits customers want is kinda silly. We can use the same techniques for alerts that we use in algorithms for expanding vector storage, for example. We can "amortize" alerts, so to speak.
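A toy sketch of that amortized-doubling idea (all names and numbers here are made up): if the user replies DOUBLE TRIGGER each time, a 10,000x overspend only generates on the order of log2(10,000) messages.

```python
def alerts_fired(spend, start=100.0):
    """Count alerts sent if the user doubles the threshold each time:
    thresholds go 100, 200, 400, ... so alert volume grows
    logarithmically with spend, like amortized vector growth."""
    fired, threshold = 0, start
    while spend >= threshold:
        fired += 1
        threshold *= 2
    return fired

print(alerts_fired(1000))  # thresholds 100, 200, 400, 800 crossed -> 4 alerts
```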
We have some recent case studies where we've successfully reduced cloud costs by 95%
https://www.cloudexpat.com/case-studies/
hi(at)cloudexpat.com - happy to help!
Or someone maliciously bypasses CF cache e.g. by parameters.
Cloud just is not suitable for any kind of volume egress. It's a death trap. Like going on vacation with data roaming enabled.
> I removed the direct download links from the HIBP website and just left the torrents which had plenty of seeds so it was still easy to get the data. Since then, Cloudflare upped that 15GB limit and I've restored the links for folks that aren't in a position to pull down a torrent. Crisis over.
Mice cried and stung themselves, but kept eating the cactus.
Most CDN providers have a lot of machines out on the edges of their networks, and it's understandable that they don't stuff these machines with large disks, likely preferring smaller faster SSDs. But this is a very common pitfall of CDNs that needs more attention, along with messaging on the dashboards and settings pages.
I've had problems with no warning on Cloudfront, Cloudflare, Bunny.net all from not realising that my files were beyond the CDN's cache size limit, but none of them seem to do a good job at surfacing this other than "talk to customer support".
Cloudfront does list the max size clearly in the limits and quotas page, though, and if you front your S3 bucket with Cloudfront, you could turn caching off and still get the discounted bandwidth out rates (S3 -> Cloudfront is always free, even if the file is fetched every time).
I see S3 is initial $0.09/GB, going down to $0.07 after 50TB or $0.05 after 150TB.
Cloudfront North America is $0.085 for first 10TB; but $0.110 and up for other regions. going down to $0.060 north america after 100TB, and okay $0.025 after 1PB. (but $0.050 and up in other regions even after 1PB).
So okay, Cloudfront gets cheaper egress at large scale, I guess. By about 50% though, not an order of magnitude, and could be much less depending on region.
A large bill is probably chump change for someone like Troy, for others it's a year or two of savings. The risk is not worth it.
AUD $0.014 is roughly USD $0.01, which I thought was reasonable. But on [1] only "Data transfer between Availability Zones (Egress and Ingress)" costs $0.01. Does transferring from Azure to CF count as that? Other Internet egress (routed via Routing preference: transit ISP network) starts at $0.08.
I hope someone from Azure CS can give him a custom discount.
It is also worth considering that the cost HIBP saved on cloud/serverless over the years could have been wiped out (if not more) by this single incident.
[1] https://azure.microsoft.com/en-au/pricing/details/bandwidth/...
To be clear - we would not have been able to catch this one right now :'(
Would love to hear thoughts / brainstorm ideas - is there any way we can proactively catch these types of cost spikes?
Setting up limits and alerts as part of the system creation is usually the best strategy.
Cloudflare has the same model, but they distribute the costs. The vast majority of people never use anywhere close to their share, so they subsidize the outliers and the free tier.
https://blog.cloudflare.com/the-relative-cost-of-bandwidth-a...
Convenience always costs money, there is no (big) cloud provider doing it out of their own pocket or rather not optimizing for huge profits.
It's the same as with any other service, really. So I don't understand, why some people assume it would be different here.
(Note: I am not saying that Troy Hunt assumed this, but I know people who go to the cloud because "it's cheaper". It was never cheaper on any project I worked on. It was more convenient, but in the end it was mostly more expensive.)
No matter what the traffic is, the first thing to do with any cloud service provider is to set budget alerts according to your wallet, be it one with credits or otherwise. At this point, I don't even try any new cloud service provider that doesn't offer credible budget alerts.
Another key takeaway is,
> Huh, no "CacheControl" value. But there wasn't one on any of the previous zip files either and the Cloudflare page rule above should be overriding anything here by virtue of the edge cache TTL setting anyway.
Even this could blow up. Cloud storage providers typically leave "CacheControl" unset by default, and if you want to cache something that CF doesn't cache by default (e.g. *.html using Page Rules), then you need to set CacheControl (e.g. max-age) at the cloud storage end too.
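A quick way to reason about this: parse the header you actually got back and see what a cache would plausibly do with it. A simplified sketch (real CDNs layer their own rules and size limits on top of the header):

```python
def cacheable_max_age(cache_control):
    """Return the max-age in seconds a shared cache would honour,
    or 0 if the response is effectively uncacheable.
    Simplified: ignores s-maxage, Expires, vendor-specific headers."""
    if not cache_control:
        return 0  # no header at all -> many CDNs won't cache it
    directives = [d.strip() for d in cache_control.lower().split(",")]
    if "no-store" in directives or "no-cache" in directives or "private" in directives:
        return 0
    for d in directives:
        if d.startswith("max-age="):
            return int(d.split("=", 1)[1])
    return 0

print(cacheable_max_age("public, max-age=31536000"))  # 31536000
```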
P.S. I've written about these recently on my blog titled 'Saving Cloud Costs'[1] from a frugal solopreneur PoV.
It's my opinion that it's better to work with known limitations and optimize for them.
In the case of bandwidth, work with a fixed pipe size, or do the math and set up a QoS that implements a throttle to avoid exceeding your bandwidth allotment.
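The "do the math and throttle" approach is usually a token bucket: refill tokens at your allowed rate and refuse to serve when the bucket runs dry. A minimal sketch (the `clock` parameter is only there to make it testable):

```python
import time

class TokenBucket:
    """Token-bucket throttle: refills `rate` tokens (e.g. bytes) per
    second up to `capacity`; consume() returns False once the budget
    for this interval is spent."""
    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate, self.capacity, self.clock = rate, capacity, clock
        self.tokens, self.last = capacity, clock()

    def consume(self, n):
        now = self.clock()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= n:
            self.tokens -= n
            return True
        return False
```

Size `rate` so that a full month at that rate stays inside your bandwidth allotment, and requests beyond it get a 429 or a "bandwidth exceeded" page instead of a surprise invoice.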
This happened to Murfie a couple of years ago, and that's why I had to step in to try to fix things. I'm still trying, and there are still challenges, but I won't allow landlords and cloud costs to disrupt things again.
It's time for Cloudflare to work a bit on its UX
Uh no - it's on cloudflare and azure. Why don't they have a global setting that says Max Charges Per Month: $X and it just shuts down when it hits that number? This is why I don't really like using big cloud services like this.
Turns out I wasn't setting x-ms-blob-cache-control when writing all the blobs, so that's a win right there.
(interestingly, it appears that rclone, which I was in the process of moving to, doesn't do that, so I might have to keep my custom Azure storage library around)
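For reference, at the REST level this comes down to two request headers on the blob PUT. A sketch of what such a request would carry — the one-year max-age is just an illustrative value:

```python
# Headers a Put Blob request needs so that downstream caches (like
# Cloudflare) see a long max-age. The one-year value is illustrative;
# x-ms-blob-type and x-ms-blob-cache-control are the standard
# Azure Blob REST headers for this.
def blob_put_headers(max_age_seconds=31536000):
    return {
        "x-ms-blob-type": "BlockBlob",
        "x-ms-blob-cache-control": f"public, max-age={max_age_seconds}",
    }
```

Whatever client library you use (SDK, custom code, rclone) ultimately has to emit something like this, or the blob ends up with no Cache-Control at all.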
$ shard="$(echo "${sha1}" | cut -c 1)"   # first hex digit of the SHA-1 selects the shard
$ cdb -q "pwned-passwords-v8-sha1-${shard}.cdb" "${sha1}"
But as a cloud evangelist at Microsoft, you may sing the corporate IT gospel anyway.
¹https://mro.name/agakdfa
He should have known better: there is always a risk that you don't know some detail that ends up costing you a lot of money.
Cloud bandwidth is soooooooooo expensive. If there is a risk that you have to pay this, please use a provider like Hetzner with fixed costs. If you like your serverless things, just host the big files at Hetzner.
It's suspicious that cloud providers STILL don't have any sort of "circuit breaker" infrastructure for this sort of thing - yes, you can set up alerts, but you can't say, "shut the whole thing down before the costs go above a certain threshold".
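Since providers don't offer such a breaker, the only place you can put one is in your own application, tracking what you serve yourself. A sketch under that assumption (the byte counter is hypothetical — nothing in Azure or Cloudflare feeds it for you):

```python
# Application-level egress circuit breaker. Assumes you count the
# bytes you serve yourself (hypothetical counter; providers don't
# expose a real-time one).
class EgressBreaker:
    def __init__(self, monthly_cap_bytes):
        self.cap = monthly_cap_bytes
        self.served = 0

    def allow(self, nbytes):
        """True if serving nbytes more stays under the monthly cap."""
        if self.served + nbytes > self.cap:
            return False  # serve a "bandwidth exceeded" page instead
        self.served += nbytes
        return True
```

It's crude — a denied large request can still be followed by an allowed small one — but it turns "costs above a certain threshold" into an error page rather than an invoice.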
Kind of sad that a service we are accustomed to using, and that various software integrates (whether via the HIBP API or the downloaded pwned passwords archive), rests on the shoulders of a single guy who now has to pay for his mistake.
Great that Cloudflare helps him with the service; otherwise, who knows whether we'd still have access to HIBP at this scale?
The fact is that stuff like this can happen. Considering how many variables are in play to determine the final cost of a cloud service, it is very much a double-edged sword. Sometimes you cut yourself unintentionally.
So now we all learn from this, I suggest we help him out.
If you're not Troy Hunt or another celebrity with special access to Cloudflare, I don't think you really have Cloudflare working with you to ensure that your data gets cached and your egress stays minimal for large files on a very cheap Cloudflare plan. (Based on the costs Hunt reported as catastrophic, I don't think he's paying Cloudflare for a large enterprise plan.)
(Also, it's unclear if caching large data like this is even within the ToS of Cloudflare?)
I don't think Cloudflare promises to cache any particular URLs for any particular amount of time (they respect cache headers as an upper bound, but don't promise never to evict sooner; they evict LRU according to their own policies). Cloudflare's marketed purposes include globally distributed performance, and security. I don't think they include "saving egress charges by long-term caching your data".
I have a much smaller project, but egress charges for data are an increasingly large part of my budget. I've been trying to figure out what, if anything, can be done about it. I wish I had a guaranteed, within-ToS way to get ultra-long caching for very large data files from Cloudflare at an affordable fixed-rate price. (Maybe I do? But I just haven't reassured myself of it yet.)
> In desperation, I reached out to a friend at Cloudflare… I recalled a discussion years earlier where Cloudflare had upped the cacheable size… Since then, Cloudflare upped that 15GB limit…
Since I'm looking for solutions for this same problem (delivering lots of data at very cheap prices), I am finding myself a bit annoyed that Hunt is talking about how he solved it, using tools/price-levels not available to most of us who don't have his level of access due to position.
Interestingly, Microsoft/Azure is part of the "Bandwidth Alliance" with Cloudflare, which one might initially think means there are no egress charges when delivering to Cloudflare. (That is what it means for some other alliance members, like Backblaze.) But that's clearly not the case, or this story wouldn't have happened, right? It turns out Azure gives you a fairly small egress discount when delivering to Cloudflare, and only if you set things up in a non-standard way.
as long as your human admin costs are lower than cloud services
I get that everyone has an obsession with dirt cheap providers instead of cloud solutions like aws/azure. But that doesn’t mean it’s better. Everything has pros and cons.
What would happen if a credit card limit was exceeded, a site would just stop working?
But still, I couldn't help getting the following lasting impression after reading it: these days, being able to click around the UIs of the cloud providers should be a billable skill by itself.
I took a good course on pluralsight about AWS and the first lesson was to setup a billing alert.
What would hard limits do to your infra? You can't take down / suspend DBs, EC2 instances, etc. just because you set a 1k USD limit and that's it.
Alerts are the first thing you should set up, IMHO.
I really appreciate the work that Troy is doing, but seeing much-needed money ending up at Microsoft or Amazon leaves a bitter taste. I hope at some point it will become cool again to just rent a VM or dedicated server for small projects and stop throwing so much money at the already richest people in the world.
As far as I am concerned, I just don't understand why people use cloud services.
Edit: Consider this article, and Geoff's statement about Azure credits.
https://www.theregister.com/2021/04/21/microsoft_revokes_mvp...
I really don't understand the cloud craze. Everything is more complex to debug, more expensive, and more shitty in all the possible ways you can imagine. I mean, I was not exactly a fan of the VPS craze 10-15 years ago either, but at least it wouldn't automatically ruin your bank account whenever you got a little traffic.
Kudos to the author for having so much money (thousands in one month?!) to waste. I wish I did too :)
Cloud providers love it when people do this, and they are famously easy to talk to when you get an unexpected invoice high enough that remortgaging your house would only begin to address it. But unless you're working on a side hustle that inherently needs to run in the cloud regardless of scale, or are experimenting with cloud technologies in an explicitly time-boxed toy project, I think using cloud services is the financial equivalent of handing a hobbyist craftsperson one of those chainsaw angle-grinder attachments that even professionals find hard to keep from bouncing into their body.
If you do want to use cloud services for anything you pay out of your own pocket, the first consideration should be cost management and monitoring. Your employer might have big enough pockets to shrug off a runaway compute instance you forgot about for a month, but that can quickly translate into money that can be anything from inconvenient to life altering if it comes out of your personal budget.
Or just stick with the free tier and make sure everything simply shuts down if you run out. Sure, a "bandwidth exceeded" error page might not get you as many upvotes on HN, Reddit or social media, but it also won't impair your finances.
I'm sure there becomes a point where cost of (hardware + maintenance + staffing) > (cloud + staffing), in which case sure crack on. But like you, I'll stick to a rented server for my stuff.
Their monthly cost is something between 0 and a few cents.
Stuff like Hetzner is fine, but if you know your way around AWS you can realize massive cost savings. Probably the same for Azure.
Finally, in many places 40 EUR for a pet project is actually a lot of money.
Also, as you can see in a screenshot in TFA: some services are simply dirt cheap. The storage account and its various “sub-services” are such a case. It’s hard to compete with dedicated hardware here.
Depending on your dedicated hosting provider, the traffic cost trap exists, too. Hetzner is a bit of a special case.
He has a writeup here on how he gets costs down in a big way: https://www.troyhunt.com/serverless-to-the-max-doing-big-thi...
Current workplace is considering a fully self-hosted stack as a unique selling point for the customers and segments we're in. That means we have storage and Linux admins available, as well as the tooling and know-how to run this securely and efficiently. Thus, placing large and often-downloaded files on our file stores at Hetzner is very much a no-brainer, because it adds very little workload to the teams maintaining these stores and it's cheap.
However, this can be a daunting thing if you don't have this skillset in the org. It can be learned, but that's time spent not working on the product (and it's not trivial to learn good administrative practices from the hell that google results can be). At such a point, a cloud service just costs you less man-hours. And again - it wouldn't be much time for me, but it would be a lot of time if you had to figure all of that out on the fly. That's essentially why the saying goes that cloud services save you time, but cost money.
1. When they need to adjust rapidly between different resource usage profiles, e.g. because they are growing rapidly and can't predict what the usage will be X days in advance
2. When they have huge resource requirements and don't care to invest in their own infrastructure, but can negotiate lower rates with a cloud provider
3. When their resource usage is modest but profitability is high enough that cloud expenditure is a rounding error
A month later, an NTP security vulnerability was discovered; soon the server was taken offline, and some not-so-nice 'patch your things asap' emails came in. Since then, my take is that one should spend some time, probably daily, on one's own server if one wants to maintain it.
I’m a huge Hetzner fan, and their cloud offering is definitely growing but still isn’t as convenient and featureful as it could be (and they don’t share their roadmap currently so hard to tell what they’re working on next).
I’m trying to do something about it though, working on Nimbus Web Services[0]. In my mind all we need is something to bridge the managed services gap and make it very easy to set up the basic 3 tier app with some amount of scale/performance elasticity!
[0]: https://nimbusws.com
- Security Information and Event Management - exports, alerts, OS configuration
- OS/Application Hardening - Encryption, Password/keys rotation, CIS/other baselines, Drift Management
- Backup - Encryption, (don't forget your passwords/keys are changing), retention, data protection compliance, monitoring, alerting, test days
- High Availability - replication, synchronisation, monitoring, alerts, test days
This is just the tip of the iceberg. If you operate in an environment where insurance, reputation, regulatory compliance, certification, etc. are important, then it's easy to see why PaaS solutions are desirable.
This recurring question of "why AWS/Azure instead of Hetzner/OVH ?" keeps happening because people are incorrectly comparing higher-level PaaS to lower-level IaaS without realizing it.
PaaS and IaaS are not equivalent. IaaS is not a direct drop-in replacement for PaaS to save money if the workload is using PaaS features that IaaS does not include.
The author Troy Hunt is using the higher-level Azure services like Table Storage (like AWS DynamoDB/SimpleDB) and Azure Functions (like AWS Lambda), and others. E.g. One of the article's hyperlinks talks about using Azure Functions.[1]
If he used Hetzner, he'd have to reinvent the Azure services stack with open-source projects (some of which are buggy and immature) and expend extra sysadmin/programming work for something that's not as integrated. The Azure/AWS stack includes many desirable housekeeping tools such as provisioning, monitoring, routing, etc which he'd also have to re-invent.
TLDR: People choose Azure/AWS because it has more features out of the box. You just have to figure out on a case-by-case basis if the PaaS value-add makes financial sense for your particular workload.
EDIT to downvoters: if Hetzner actually has built-in equivalents to AWS Lambda and DynamoDB, please reply with a correction because I don't want to spread misinformation.
[1] https://www.troyhunt.com/serverless-to-the-max-doing-big-thi...
I use the credit card of my employer. For my own projects I use my own server for everything. Granted, it doesn't get much traffic.
Some offers from cloud providers are pretty good. If you want to scale to more (virtual) machines, it can be more easily done with the usual providers. I also expect Amazon to know more about firewall and reverse proxy configuration, it renews my certificates automatically and has rudimentary services for monitoring of server state. There is a certain convenience to it.
Would I recommend cloud based hosting? Absolutely not. You become dependent on the provider and prices are often steep. Even if you do not know much about server security, your unsecured s3 bucket will be far more exposed than your standard db installation on your own server. Better build expertise for systems you have full control over than to invest the time on the details of AWS which are more subjected to change.
For companies the benefits are the abiltiy to get new servers at a click of a button and get rid of a server. For example, asking the ops team to setup a snapshot of a database for a few hours while I do something is super useful.
There is also the ability to use autoscaling and other features to automagically scale your system to handle traffic peaks. With dedicated servers you need to always have those resources available. It's attractive to managers that they're only paying for resources when they're using them.
There are also managed services like DynamoDb, Lambda, S3, etc that can make things easier and reduce your sysadmin work. And allow you to get up and running very quickly.
Obviously, a major downside is that the pricing is extremely vulnerable to spikes like this. I think we see an article like this every 3 months or so. This one is rather tame compared to some others that were 10x as much for a 24-hour period.
Well that's the first issue. Many people have automated large parts of their infrastructure in this way so that distributing one huge file becomes part of that whole mess. The goal is of course to keep costs down to a minimum. You can actually do a lot with little money using cloud services.
But the careful balance is that you can easily miss little details. But how does that differ from any systems administration? The details are just in new areas that didn't exist 5-10 years ago.
And the details you miss are more likely to increase cost. And when you process a lot of traffic, you're popular, that can go real fast.
20 years ago in hosting we might get a porn stash on a hacked NT4 server that would draw bandwidth. And back then a whole company might have 100Mbit fiber so you'd notice.
The reason to use cloud-style services is so you can focus on building the product quickly instead of building and maintaining architecture. But once the product is stable, a cost-reduction pass is in order.
To handle that day of getting 1 million customers, which you've been forever optimising for.
Any.. day.. now...
From the article:
> This was about AU$350 a day for a month… priced at AU$0.014 per GB
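Those two figures imply a daily egress volume worth doing as a back-of-envelope check:

```python
# Back-of-envelope from the article's figures:
# AU$350/day at AU$0.014 per GB.
daily_cost_aud = 350.0
price_per_gb_aud = 0.014
gb_per_day = daily_cost_aud / price_per_gb_aud
print(round(gb_per_day))  # 25000 GB, i.e. roughly 25 TB of egress per day
```

Roughly 25 TB a day — the kind of volume where per-GB egress pricing and flat-rate pipes live in completely different universes.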
A company could not stay in business if every one of their “unlimited 1 Gbps” customers for €40 per month actually used that bandwidth.
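The oversubscription math behind that claim is easy to check:

```python
# Why "unlimited 1 Gbps" must be oversubscribed: one fully saturated
# 1 Gbps link for a 30-day month moves roughly a third of a petabyte.
gbit_per_s = 1
seconds_per_month = 30 * 24 * 3600          # 2,592,000 s
tb_per_month = gbit_per_s * seconds_per_month / 8 / 1000  # Gbit -> GB -> TB
print(tb_per_month)  # 324.0 TB
```

324 TB for €40 would be well under €0.001 per GB — orders of magnitude below any cloud's per-GB egress rate, which only works because almost nobody saturates the link.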
Take a look at their datacenter in Germany: https://www.youtube.com/watch?v=5eo8nz_niiM