Tell HN: Azure outage

885 pointstartieret6mo ago806 comments

Azure is down for us, we can't even access the azure portal. Are other experiencing this? Our services are located in Canada/Central and US-East 2

https://downdetector.ca/status/windows-azure/

https://azure.status.microsoft/en-gb/status

806 comments

croemer6mo ago

Preliminary post incident review: https://azure.status.microsoft/en-gb/status/history/

Timeline

15:45 UTC on 29 October 2025 – Customer impact began.

16:04 UTC on 29 October 2025 – Investigation commenced following monitoring alerts being triggered.

16:15 UTC on 29 October 2025 – We began the investigation and started to examine configuration changes within AFD.

16:18 UTC on 29 October 2025 – Initial communication posted to our public status page.

16:20 UTC on 29 October 2025 – Targeted communications to impacted customers sent to Azure Service Health.

17:26 UTC on 29 October 2025 – Azure portal failed away from Azure Front Door.

17:30 UTC on 29 October 2025 – We blocked all new customer configuration changes to prevent further impact.

17:40 UTC on 29 October 2025 – We initiated the deployment of our ‘last known good’ configuration.

18:30 UTC on 29 October 2025 – We started to push the fixed configuration globally.

18:45 UTC on 29 October 2025 – Manual recovery of nodes commenced while gradual routing of traffic to healthy nodes began after the fixed configuration was pushed globally.

23:15 UTC on 29 October 2025 - PowerApps mitigation of dependency, and customers confirm mitigation.

00:05 UTC on 30 October 2025 – AFD impact confirmed mitigated for customers.

5 more replies

mystcb6mo ago

Update 16:57 UTC:

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. In addition. customers may experience issues accessing the Azure Portal. Customers can attempt to use programmatic methods (PowerShell, CLI, etc.) to access/utilize resources if they are unable to access the portal directly. We have failed the portal away from Azure Front Door (AFD) to attempt to mitigate the portal access issues and are continuing to assess the situation.

We are actively assessing failover options of internal services from our AFD infrastructure. Our investigation into the contributing factors and additional recovery workstreams continues. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:57 UTC on 29 October 2025

---

Update: 16:35 UTC:

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025

---

Azure Portal Access Issues

We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.

This message was last updated at 16:18 UTC on 29 October 2025

---

Message from the Azure Status Page: https://azure.status.microsoft/en-gb/status

planewave6mo ago

Azure Network Availability Issues

Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. We suspect that an inadvertent configuration change as the trigger event for this issue. We are taking two concurrent actions where we are blocking all changes to the AFD services and at the same time rolling back to our last known good state.

We have failed the portal away from Azure Front Door (AFD) to mitigate the portal access issues. Customers should be able to access the Azure management portal directly.

We do not have an ETA for when the rollback will be completed, but we will update this communication within 30 minutes or when we have an update.

This message was last updated at 17:17 UTC on 29 October 2025

1 more reply

cyptus6mo ago

AFD is down quite often regionally in Europe for our services. In 50%+ the cases they just don‘t report it anywhere, even if its for 2h+.

3 more replies

8cvor6j844qw_d66mo ago

I'll be interested in the incident writeup since DNS is mentioned. It will be interesting in a way if it is similar to what happened at AWS.

2 more replies

jjp6mo ago

Whilst the status message acknowledge's the issue with Front Door (AFD), it seems as though the rest of the actions are about how to get Portal/internal services working without relying on AFD. For those of us using Front Door does that mean we're in for a long haul?

2 more replies

NDizzle6mo ago

They briefly had a statement about using Traffic Manager to work with your AFD to work around this issue, with a link to learn.microsoft.com/...traffic-manager, and the link didn't work. Due to the same issue affecting everyone right now.

They quickly updated the message to REMOVE the link. Comical at this point.

1 more reply

jdc05896mo ago

yea its not just the portal. microsoft.com is down too

5 more replies

jonathanlydall6mo ago

Yet another reason to move away from Front Door.

We already had to do it for large files served from Blob Storage since they would cap out at 2MB/s when not in cache of the nearest PoP. If you’ve ever experienced slow Windows Store or Xbox downloads it’s probably the same problem.

I had a support ticket open for months about this and in the end the agent said “this is to be expected and we don’t plan on doing anything about it”.

We’ve moved to Cloudflare and not only is the performance great, but it costs less.

Only thing I need to move off Front Door is a static website for our docs served from Blob Storage, this incident will make us do it sooner rather than later.

2 more replies

eddie_catflap6mo ago

We saw issues before 16:00 UTC - approx 15:38

ThatManulTheCat6mo ago

DNS. Ofc.

rconti6mo ago

Sounds like they need to move their portal to a region with more capacity for the desired instance type. /s

Uehreka6mo ago

I noticed that Starbucks mobile ordering was down and thought “welp, I guess I’ll order a bagel and coffee on Grubhub”, then GrubHub was down. My next stop was HN to find the common denominator, and y’all did not disappoint.

pants26mo ago

Good thing HN is hosted on a couple servers in a basement. Much more reliable than cloud, it seems!

3 more replies

Havoc6mo ago

The sysadmin subreddit tends to beat hn on outage reports by an hour+ in my experience.

Bunch of on-call peeps over there that definitely know the instant something major goes down

sergiotapia6mo ago

Wow I just left a Starbucks drivethru line because it was just not moving. I guess it was because of this.

1 more reply

hypeatei6mo ago

Starbucks mobile was down during the AWS outage too...

3 more replies

Theodores6mo ago

My inner Nelson-from-the-Simpsons wishes I was on your team today, able to flaunt my flask of tea and homemade packed sandwiches. I would tease you by saying 'ha ha!' as your efforts to order coffee with IP packets failed.

I always go everywhere adequately prepared for beverages and food. Thanks to your comment, I have a new reason to do so. Take out coffees are actually far from guaranteed. Payment systems could go down, my bank account could be hacked or maybe the coffee shop could be randomly closed. Heck, I might even have an accident crossing the road. Anything could happen. Hence, my humble flask might not have the top beverage in it but at least it works.

We all design systems with redundancy, backups and whatnot, but few of us apply this thinking to our food and drink. Maybe get a kettle for the office and a backup kettle, in case the first one fails?

01284a7e6mo ago

Ha, maybe rethink the I AM NOTHING BUT A HUGE CLOUD CONSUMER thing on some fundamental levels? Like food?

port116mo ago

I noticed it when my Netatmo rigamajig stopped notifying me of bad indoor air quality. Lovely. Why does it need to go through the cloud if the data is right there in the home network…

1 more reply

garbagewoman6mo ago

Service culture is so hollow

jeffrallen6mo ago

You know you can talk to your barista and ask for a bagel, right? If you're lucky they still take cash... if you still _have_ cash. :)

1 more reply

foresterre6mo ago

It still surprises me how much essential services like public transport are completely reliant on cloud providers, and don't seem to have backups in place.

Here in The Netherlands, almost all trains were first delayed significantly, and then cancelled for a few hours because of this, which had real impact because today is also the day we got to vote for the next parlement (I know some who can't get home in time before the polls close, and they left for work before they opened).

14 more replies

Imustaskforhelp6mo ago

Google cloud run or cloudflare workers it is.

Personally I am thinking more and more about hetzner, yes I know its not an apples to orange comparison. But its honestly so good

Someone had created a video where they showed the underlying hardware etc., I am wondering if there is something like https://vpspricetracker.com/ but with geek-benchmarks as well.

This video was affiliated with scalahosting but still I don't think that there was too much bias of them and they showed at around 3:37 a graph comparison with prices https://www.youtube.com/watch?v=9dvuBH2Pc1g

Now it shows how contabo has better hardware but I am pretty sure that there might be some other issues, and honestly I feel a sense of trust with hetzner I am not sure about others.

Either hetzner or self hosting stuff personally or just having a very cheap vps and going to hetzner if need be but hetzner already is pretty cheap or I might use some free service that I know of are good as well.

Havoc6mo ago

Hetzner seems sound, but I doubt they play in the same reliability league as google

1 more reply

TiredOfLife6mo ago

One of recent (4 months ago) Cloudflare outages (I think it was even workers) was caused by Google Cloud being down and Cloudflare hosting an essential service there

2 more replies

hshdhdhehd6mo ago

Are you after nines? Maybe do multi provider?

bob10296mo ago

For some reason an Azure outage does not faze me in the same way that an AWS outage does.

I have never had much confidence in Azure as a cloud provider. The vertical integration of all the things for a Microsoft shop was initially very compelling. I was ready to fight that battle. But, this fantasy was quickly ruined by poor execution on Microsoft's part. They were able to convince me to move back to AWS by simply making it difficult to provision compute resources. Their quota system & availability issues are a nightmare to deal with compared to EC2.

At this point I'd rather use GCP over Azure and I have zero seconds of experience with it. The number of things Microsoft gets right in 2025 can be counted single-handedly. The things they do get right are quite good, but everything else tends to be extremely awful.

xmcp1236mo ago

Many years back was the first time I used Azure, evaluating it for a client.

I remember I at one point had expanded enough menus that it covered the entirety of the screen.

Never before have I felt so lost in a cloud product.

5 more replies

multiplegeorges6mo ago

> At this point I'd rather use GCP over Azure and I have zero seconds of experience with it.

TBH, GCP is very good! More people should use it.

4 more replies

aftbit6mo ago

The problem is that in some industries, Microsoft is the only option. Many of these regulated industries are just now transitioning from the data center to the cloud, and they've barely managed to get approval for that with all of the Microsoft history in their organization. AWS or GCloud are complete non-starters.

1 more reply

issafram6mo ago

I've used AWS, Azure, and recently GCP. You do NOT want to use GCP.

Nemo_bis6mo ago

Microsoft is better at regulatory capture, so Azure has many customers in the public sector. So an Azure outage probably affects the public sector more (see example above about trains).

karel-3d6mo ago

Microsoft has the regulatory capture. All the European privacy and regulatory laws are good for Azure. That's why your average European government or baking app runs most likely on Azure. (or Oracle, but more likely Azure)

otterdude6mo ago

What Amazon, Azure, and Google are showing with their platform crashes amid layoffs, while they supports governments that are Oppressing's Citizens and Ignoring the Law, is that they do not care about anything other than the bottom line.

They think they have the market captured, but I think what their dwindling quality and ethics are really going to drive is adoption of self hosting, distributed computing frameworks. Nerds are the ones who drove adoption of these platforms, and we can eventually end if we put in the work.

Seriously with container technology, and a bit more work / adoption on distributed compute systems and file storage (IPFS,FileCoin) there is a future where we dont have to use big brothers compute platform. Fuck these guys.

3 more replies

arccy6mo ago

The only reason you'd notice MS was down was if Github was down....

2 more replies

major5056mo ago

I like azure. I think they are more intuitive tua aws and good, and they have good prices for startups ( essentially free for a whole year )

danielovichdk6mo ago

What did you do when AWS was down last week?

redwood6mo ago

I read "Microsoft shop" as "Microsoft slop". Fitting. But at least they open source wash themselves so much they're practically a charity right?

WD-426mo ago

Azure outages don’t faze anyone because nobody notices when it happens.

Jamie4526mo ago

Currently standing in a half closed supermarket because the tills are down and they cant take payments

david4226mo ago

IIRC, the grocery chain I worked for used to have an offline mode to move customers out the door. But it meant that when the system came back online, if the customers card was denied, the customer got free groceries.

5 more replies

chasd006mo ago

There's a Family Dollar by my house that is down at least 2 full days per month because of bad inet connectivity. I live close enough that with a small tower on my roof i can get line of sight to theirs. I've thought about offering them a backup link off my home inet if they give me 50% of sales whenever its in use. It would be a pretty good deal for them, better some sales when their inet is down vs none.

3 more replies

pndy6mo ago

I remember last mechanical cash registers in my country in 90s and when these got replaced by early electronic ones with blue vacuum fluorescent tubes. Then everything got smaller and smaller. Now I'm pestered to "add the item to the cart" by software.

Last week I couldn't pay for flowers for grandma's grave because smartphone-sized card terminal refused to work - it stuck on charging-booting loop so I had to get cash. Tho my partner thinks she actually wanted to get cash without a receipt for herself excluding taxes

Jamie4526mo ago

Just to add - this particular supermarket wasn’t fully down, it took ages for them to press “sub total” and then pick the payment method. I suspect it was slow waiting for a request to timeout perhaps

thisOtterBeGood6mo ago

In Germany many stores still accept cash and some even only accept cash and we are ridiculed for this... Seems like one of the rare instances where this is useful :D

1 more reply

SoftTalker6mo ago

Mind-boggling that any retailer would not have the capability to at least run the checkout stations offline.

3 more replies

reaperducer6mo ago

Currently standing in a half closed supermarket because the tills are down and they cant take payments

There's a fairly large supermarket near me that has both kinds of outages.

Occasionally it can't take cards because the (fiber? cable?) internet is down, so it's cash only.

Occasionally it can't take cash because the safe has its own cellular connection, and the cell tower is down.

I was at Frank's Pizza in downtown Houston a few weeks ago and they were giving slices of pizza away because the POS terminal died, and nobody knew enough math to take cash. I tried to give them a $10 and told them to keep the change, but "keep the change" is an unknown phrase these days. They simply couldn't wrap their brains around it. But hey, free pizza!

the_black_hand6mo ago

why tf would a supermarket depend on Azure? Payment processing isn't their thing

kierenj6mo ago

Ouch, and login.microsoftonline.com too - i.e. SSO using MS accounts. We'd just rolled that out across most (all?) of our internal systems...

And microsoft.com too - that's gotta hurt

planewave6mo ago

It is interesting to see the differential across different tenants in different geographies:

- on a US tenant I am unable to access login.microsoftonline.com and the login flow stalls on any SSO authentication attempt.

- on a European tenant, probably germany-west, I am able to login and access the Azure portal.

parliament326mo ago

SSO and 365 are working fine for us, but admin portals for Azure/365 are down. Our workloads in Azure don't seem to be impacted.

manbitesdog6mo ago

Guess you have NASSO now (Not A Single Sign On)

1 more reply

ocdtrekkie6mo ago

I am still stunned people choose to do this, considering major Office 365 outages are basically a weekly thing now.

1 more reply

gmassman6mo ago

I’ve been migrating our services off of Azure slowly for the past couple of years. The last internet facing things remaining are a static assets bucket and an analytics VM running Matomo. Working with Front Door has been an abysmal experience, and today was the push I needed to finally migrate our assets to Cloudflare.

I feel pretty justified in my previous decisions to move away from Azure. Using it feels like building on quicksand…

alt2276mo ago

All the clouds hav had major outages this year.

At this point I dont believe that any one of them is any better or reliable than the others.

not_a_bot_4sho6mo ago

> I feel pretty justified in my previous decisions to move away from Azure

I felt this way about AWS last week

btmiller6mo ago

Never let a good disaster go to waste ;)

basfo6mo ago

We’re 100% on Azure but so far there’s no impact for us.

Luckily, we moved off Azure Front Door about a year ago. We’d had three major incidents tied to Front Door and stopped treating it as a reliable CDN.

They weren’t global outages, more like issues triggered by new deployments. In one case, our homepage suddenly showed a huge Microsoft banner about a “post-quantum encryption algorithm” or something along those lines.

Kinda wild that a company that big can be so shaky on a CDN, which should be rock solid.

Aperocky6mo ago

Outages are one thing, but having your content polluted seems like a more serious problem? Unless you subscribed to microsoft banners somehow.

1 more reply

qiller6mo ago

We battled https://learn.microsoft.com/en-us/answers/questions/1331370/... for over a year, and finally decided to move off since there was no any resolution. Unfortunately our API servers were still behind AFD so they were affected by today's stuff...

gianpaj6mo ago

Can't download VSCode :D

Error: visual-studio-code: Download failed on Cask 'visual-studio-code' with message: Download failed: https://update.code.visualstudio.com/1.105.1/darwin-arm64/st...

progmetaldev6mo ago

I have had intermittent issues with winget today. I use UniGetUI for a front-end, and anything tied to Microsoft has failed for me. Judging by the logs, it's mostly retrieving the listing of versions (I assume similar to what 'apt-get update' does, I'm fairly new to using winget for Windows package management).

robotnikman6mo ago

Also cant do anything right now with the repo's we have in Azure Devops, how lovely...

loopduplicate6mo ago

get vscodium then

agency6mo ago

So that's why I can't check in for my Alaska Airlines flight... https://news.microsoft.com/source/features/digital-transform...

MangoCoffee6mo ago

"BREAKING: Alaska Airlines' website, app impacted amid Microsoft Azure outage"

https://www.youtube.com/watch?v=YJVkLP57yvM

Shuddown6mo ago

Pretty much every single Microsoft domain I've tried to access loads for a looooong time before giving me some bare html. I wonder if someone can explain why that's happening.

1 more reply

kurttheviking6mo ago

I am unable to load this article...presumably for related reasons

vachina6mo ago

microsoft.com and some subdomains (answers.microsoft.com) has no A and AAA records. They screwed up big time.

https://archive.is/Q4izZ

0xbadcafebee6mo ago

That specific subdomain has issues with propagation: https://dnschecker.org/#A/answers.microsoft.com (only four resolvers return records)

The root zone and www. do not: https://dnschecker.org/#A/microsoft.com (all resolvers return records)

And querying https://www.microsoft.com/ results in HTTP 200 on the root document, but the page elements return errors (a 504 on the .css/.js documents, a 404 on some fonts, Name Not Resolved on scripts.clarity.ms, Connection Timed Out on wcpstatic.microsoft.com and mem.gfx.ms). That many different kinds of errors is actually kind of impressive.

I'm gonna say this was a networking/routing issue. The CDN stayed up, but everything else non-CDN became unroutable, and different requests traveled through different paths/services, but each eventually hit the bad network path, and that's what created all the different responses. Could also have been a bad deploy or a service stopped running and there's different things trying to access that service in different ways, leading to the weird responses... but that wouldn't explain the failed DNS propagation.

Aperocky6mo ago

wow, right after AWS suffered a similar thing.

I wonder if this is microsoft "learning" to "prevent" such an issue and instead triggered it...

"One often meets his destiny on the path he takes to avoid it" -- Master Oogway

ape46mo ago

2026: the year of your own metal in a rack

0xbadcafebee6mo ago

2027: the year of migrating from your own metal to a managed provider

2028: the year of migrating from a managed provider to the cloud

2029: the year of migrating from the cloud to your own metal in a rack

People keep thinking the solution to their problems is to do something new (that they don't fully understand).

TIL it's called Nirvana Fallacy

3 more replies

Aperocky6mo ago

I'd predict the year of linux desktop instead.

1 more reply

drewnick6mo ago

I've been doing it since 1998 in my bedroom with a dual T1 (and on to real DCs later). While I've had some outages for sure it makes me feel better I am not that divergent in uptime in the long run vs big clouds.

1 more reply

move-on-by6mo ago

Instead of cyber security awareness month, we should rename it to cloud availability awareness month.

00000000001006mo ago

Yeah just took down the prod site for one of our clients since we host the front-end out of their CDN. Just got wrapped up panic hosting it somewhere else for the past hour, very quickly reminds you about the pain of cookies...

alt2276mo ago

... and DNS caching, and browser file cache, and sessions...

Moving a website quickly is never fun.

chemodax6mo ago

For me the same. It's very confusing that status page [1] is green

[1]: https://azure.status.microsoft/en-us/status

martini3336mo ago

That status page is never red. Absolutely useless.

> There are currently no active events. Use Azure Service Health to view other issues that may be impacting your services.

Links to a page on Azure Portal which is down...

1 more reply

kylecazar6mo ago

They added a message at the same time as your comment:

"We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly."

givemeethekeys6mo ago

Surely more vibecoding will fix this problem. Time to fire more staff

whalesalad6mo ago

Yikes, http://schemas.xmlsoap.org/soap/encoding/ is running on Azure and it's down. So any SOAP/WSDL api's are dead in the water.

    HTTPSConnectionPool(host='schemas.xmlsoap.org', port=443): Max retries exceeded with url: /soap/encoding/ (Caused by SSLError(CertificateError("hostname 'schemas.xmlsoap.org' doesn't match '*.azureedge.net'")))

A service we rely on that isn't even running on Azure is inaccessible due to this issue. For an asset that probably never changes. Wild for that to be the SPOF.

160k+ results on GitHub: https://github.com/search?q=http%3A%2F%2Fschemas.xmlsoap.org...

flumpcakes6mo ago

Pretty much all Azure services seem to be down. Their status page says it's only the portal since 16:00. It would be nice if these mega-companies could update their status page when they take down a large fraction of the Internet and thousands of services that use them.

parliament326mo ago

All of our Azure workloads are up, but we don't use Azure Front Door. That seems to be the only impacted product, apart from the management portal.

1 more reply

kierenj6mo ago

FWIW, all of our databases, VMs, AKS clusters, services, jobs etc - are all working fine. Which services are down for you, maybe we can build a list?

1 more reply

wbsun6mo ago

Does their status page depend on something that is down already, so the page just fails static now hence no new updates?

jayw_lead6mo ago

Same playbook for AWS. When they admitted that Dynamo was inaccessible, they failed to provide context that their internal services are heavily dependent on Dynamo

It's only after the fact they are transparent about the impact

port116mo ago

So much of Belgium runs on Azure… it's honestly baffling how many services are down, there's no resilience built into (even large) companies anymore.

MangoCoffee6mo ago

The Internet is supposed to be decentralized. The big three seem to have all the power now (Amazon, Microsoft, and Google) plus Cloudflare/Oracle.

How did we get here? Is it because of scale? Going to market in minutes by using someone else's computers instead of building out your own, like co-location or dedicated servers, like back in the day.

kube-system6mo ago

It still is very decentralized. We are discussing this via the internet right now.

5 more replies

mrinterweb6mo ago

A lot of money and years of marketing the cloud as the responsible business decision led us here. Now that the cloud providers have vendor lock-in, few will leave, and customers will continue to wildly overpay for cloud services.

1 more reply

deaux6mo ago

From today [0].

> Big Tech lobbying is riding the EU’s deregulation wave by spending more, hiring more, and pushing more, according to a new report by NGO’s Corporate Europe Observatory and LobbyControl on Wednesday (29 October).

> Based on data from the EU’s transparency register, the NGOs found that tech companies spend the most on lobbying of any sector, spending €151m a year on lobbying — a 33 percent increase from €113m in 2023.

Gee whizz, I really do wonder how they end up having all the power!

[0] https://news.ycombinator.com/item?id=45744973

alt2276mo ago

Thats the whole point, big players like AWS and MS can go down, but here we are still talking on the internet.

Decentralisation is winning it seems.

1 more reply

nzach6mo ago

> How did we get here?

I think the response lies in the surrounding ecosystem.

If you have a company it's easier to scale your team if you use AWS (or any other established ecosystem). It's way easier to hire 10 engineers that are competent with AWS tools than it is to hire 10 engineers that are competent with the IBM tools.

And from the individuals perspective it also make sense to bet on larger platforms. If you want to increase your odds of getting a new job, learning the AWS tools gives you a better ROI than learning the IBM tools.

AndrewKemendo6mo ago

A natural monopoly is a monopoly in an industry in which high infrastructure costs and other barriers to entry relative to the size of the market give the largest supplier in an industry, often the first supplier in a market, an overwhelming advantage over potential competitors. Specifically, an industry is a natural monopoly if a single firm can supply the entire market at a lower long-run average cost than if multiple firms were to operate within it. In that case, it is very probable that a company (monopoly) or a minimal number of companies (oligopoly) will form, providing all or most of the relevant products and/or services.

https://en.wikipedia.org/wiki/Natural_monopoly

pphysch6mo ago

Consolidation is the inevitable outcome of free unregulated markets.

In our highly interconnected world, decentralization paradoxically requires a central authority to enforce decentralization by restricting M&A, cartels, etc.

1 more reply

anonymars6mo ago

Efficiency (aka cost) <---> Resiliency/redundancy

Pick your point on the scale

1 more reply

codethief6mo ago

Meredith Whittaker (of Signal) addressed your question the other day: https://mastodon.world/@Mer__edith/115445701583902092

SecretDreams6mo ago

> How did we get here?

Stonks

ApolloFortyNine6mo ago

They admit in their update blurb azure front door is having issues but still report azure front door as having no issues on their status page.

And it's very clear from these updates that they're more focused on the portal than the product, their updates haven't even mentioned fixing it yet, just moving off of it, as if it's some third party service that's down.

consp6mo ago

> as having no issues on their status page

Unsubstantiated idea: So the support contract likely says there is a window between each reporting step and the status page is the last one and the one in the legal documents giving them several more hours before the clauses trigger.

cbovis6mo ago

Looks to be affecting our pipelines that rely on Playwright as they download images from Azure e.g. https://playwright.azureedge.net/builds/chromium/1124/chromi... which aren't currently resolving.

sedatk6mo ago

The paradox of cloud provider crashes is that if the provider goes down and takes the whole world with it, it's actually good advertisement. Because, that means so many things rely on it, it's critically important, and has so many big customers. That might be why Amazon stock went up after AWS crash.

If Azure goes down and nobody feels it, does Azure really matter?

thewebguyd6mo ago

People feel it, but usually not general consumers like they do when AWS goes down.

If Azure goes down, it's mostly affecting internal stuff at big old enterprises. Jane in accounting might notice, but the customers don't. Contrast with AWS which runs most of the world's SaaS products.

People not being able to do their jobs internally for a day tends not to make headlines like "100 popular internet services down for everyone" does.

Aldipower6mo ago

Hetzner, Netcup, OVH, BunnyCDN, ClouDNS, Postmark

You name them. Other good providers you have experience with?

There is no reason for an expensive cloud. Never has been, but decision makers tried to keep their pants dry.

empath756mo ago

Friend of mine at MSFT says it's a Sev-0 outage and they can't even get to the ticket tracking system.

kierenj6mo ago

Sorry - my bad. I literally just connected an old XP VM to the internet to activate it.

kure2566mo ago

We’ve been experimenting with multi-cluster failover for Kubernetes workloads, and one open-source project that actually works really well is k8gb .

It acts as a GSLB controller inside Kubernetes — doing DNS-level health checks, region awareness, and automatic failover between clusters when one goes down.

It integrates with ExternalDNS and supports multiple DNS providers (Infoblox, Route53, Azure DNS, NS1, etc.), so it can handle failover across both on-prem and cloud clusters.

It’s not a silver bullet for every architecture, but it’s one of the few OSS projects that make multi-region failover actually manageable in practice.

blenderob6mo ago

https://azure.status.microsoft/en-us/status says everything's fine! Any place I can read more about this outage?

reid6mo ago

You're looking at it. I couldn't find any discussion elsewhere yet...

sbergot6mo ago

official status pages are useless most of the time.

1 more reply

sbergot6mo ago

now there is an information about "Azure Portal Access Issues". No word about front door being down.

amaccuish6mo ago

Seeing users having issues with the "Modern Outlook", specifically empty accounts. Switching back to the "Legacy Outlook" which functions largely without the help of the cloud fixes the issue. How ironic.

tyfon6mo ago

Seems to be down in Norway.

Even the national digital id service is down.

hexbin0106mo ago

> Even the national digital id service is down.

Can't help but smirk as my country is ramming through "Digital ID" right now

1 more reply

Steven_Vellon6mo ago

For us, it looks like most services are still working (eastus and eastus2). Our AKS cluster is still running and taking requests. Failures seem limited to management portal.

mythz6mo ago

High availability is touted as a reason for their high prices, but I swear I read about major cloud outages far more than I experience any outages at Hetzner.

prmoustache6mo ago

I think the biggest features of the big cloud vendors is that when they are down, not only you but your customers and your competitors usually have issues at the same time so everybody just shrug and have a lazy/off day at the same time. Even on call teams reall just have to wait and stay on standby because there is very little they can do. Doing a failover can be slower than waiting for the recovery, not help at all if outage is spanned accross several region, or bring aditional risks.

And more importantly nobody lose any reputation except AWS/Azure/Google.

1 more reply

graemep6mo ago

Ostensible reason.

The real reason is that outages are not your fault. Its the new version of "nobody ever got fired for buying IBM" - later it became MS, and now its any big cloud provider.

1 more reply

jmaker6mo ago

For one it’s statistics - Hetzner simply runs far fewer major services than hyperscalers. And the services they run are also more affluent, with larger customer bases, so downtimes are systemically critical. Therefore it’s louder.

On the merits though, I agree, haven’t had any serious issues with Hetzner.

bad_haircut726mo ago

Same with DigitalOcean. I run one box and it hasnt gone down for like 2 years

3 more replies

bongodongobob6mo ago

It's just the admin portal.

5 more replies

reid6mo ago

This is impacting the Azure CDN at azureedge.net. DNS A records for azureedge.net tenants are taking 2-6 seconds and often return nothing.

etyhhgfff6mo ago

It's always DNS, unless it's not DNS.

jmspring6mo ago

The outage was really weird. For me, parts of the portal worked, other parts didn't. I had access to a couple of resource groups, but no resources visible in those groups. Azure Devops Pipelines that needed do download from packages.microsoft.com didn't work.

The Microsoft status page mostly referenced the portal outage, but it was more than that.

1 more reply

AdmiralAsshat6mo ago

Some exec at Microsoft told the Azure guys to ape everything Amazon does and they took it literally.

Telemakhos6mo ago

Or, the NSA needed to upgrade their access at both.

1 more reply

jrochkind16mo ago

I was gonna say that obv AWS hacked em to even things up.

dboreham6mo ago

This is funny but also possibly true because: business/MBA types see these outages as a way to prove how critical some services are, leading to investors deciding to load up on the vendor's stock.

1 more reply

aftbit6mo ago

I still can't log into Azure Gov Cloud with

https://microsoft.com/deviceloginus

Seems like they migrated the non-Gov login but not the Gov one. C'mon Microsoft, I've got a deadline in a few days.

mystcb6mo ago

Updated 16:35 UTC

Azure Portal Access Issues

This message was last updated at 16:35 UTC on 29 October 2025

----

Azure Portal Access Issues

We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.

This message was last updated at 16:18 UTC on 29 October 2025

-- From the Azure status page

jammo6mo ago

We all need to move away from these big cloud providers. Two medium size smaller providers is enough.

-Cloudflare for R2 (object storage) and CDN (Fastly+backblaze also available). -Two VPS/Server providers with a decent reputation and mid-size (using a comparison site like https://serversearcher.com or look directly into people like Hetzner or latitude) -PlanetScale or Neon for database if you don't co-locate it, though better to use someone like digital ocean, vultr or latitude who offer databases too)

2 more replies

hedayet6mo ago

The sad thing is - $MSFT isn't even down by 1%. And IIRC, $AMZN actually went up during their previous outage.

So if we look at these companies' bottom lines, all those big wigs are actually doing something right. Sales and lobbying capacity is way more effective than reliability or good engineering (at least in the short term).

locusofself6mo ago

AMZN went up almost 4 percent between the day of the outage and the day after. Crazy market.

1 more reply

navane6mo ago

Look how important we are, is what these failures show

3 more replies

bigstrat20036mo ago

That's a good thing. Stock prices shouldn't go down because of rare incidents which don't accurately represent how successful a company is likely to be in the future.

AtNightWeCode6mo ago

I looked into this before and the stocks of these large corps simply does not move when outages happens. Maybe intra-day, I don't have that data, but in general no effect.

iamtheworstdev6mo ago

well, at this point, 90% of the market cap of FAANGS plus Microsoft is... OMG AI LLM hype

vincebowdren6mo ago

UK, and other regions too; our APAC installation in Australia is affected.

zimpenfish6mo ago

"Microsoft Azure will serve as the backbone of Asda’s digital infrastructure"[0]

Oh, that'll be why Scan & Go was down yesterday evening. I thought it was another instance of an iOS 26 update breaking their crappy code.

[0] https://corporate.asda.com/newsroom/2025/22/09/asda-announce...

Sharparam6mo ago

The learning modules on https://learn.microsoft.com/ also seem to have a lot of issues properly loading.

vpears876mo ago

At least MSFT is consistent: https://www.microsoft.com/en-us/ is down as well

CommanderData6mo ago

Likely behind Azure Front Door.

Much of Xbox is behind that too.

tartieretOP6mo ago

Microsoft posted an update on X: https://x.com/AzureSupport/status/1983569891379835372?ref_sr...

"We’re investigating an issue impacting Azure Front Door services. Customers may experience intermittent request failures or latency. Updates will be provided shortly."

llama0526mo ago

Always fun when you can't trust the main status page but have to go to some opinionated social medial website to see the actual problem.

1 more reply

progmetaldev6mo ago

I was having issues a few hours ago. I'm now able to access the portal, although I get lots of errors in the browser console, and things are loading slowly. I have services in the US-East region.

I have been having issues with GitHub and the winget tool for updates throughout the day as well. I imagine things are pulling from the same locations on Azure for some of the software I needed to update (NPM dependencies, and some .NET tooling).

borg166mo ago

i guess folks in azure wanted to show some solidarity with aws brethren

(couldn't resist adding it. i acknowledge this comment adds no value to the discussion)

aurumque6mo ago

Azure goes down all the time. On Friday we had an entire regional service down all day. Two weeks ago same thing different region. You only hear about it when it's something everyone uses like the portal, because in general nobody uses Azure unless they're held hostage.

1 more reply

glzone16mo ago

Wasn't the saying "It's always DNS" floating around somewhere?

Be interesting to understand cause here. Pretty big impact on services we use

mikestew6mo ago

Could be DNS, I'm seeing SERVFAIL trying to resolve what look to be MS servers when I'm hitting (just one example) mygoodtogo.com (trying to pay a road toll bill, and failing).

bragma6mo ago

They suggest to use Traffic Manager to router around failing FrontDoor CDN, but DNS is failing too, making the suggestion another failure.

1 more reply

buttscicles6mo ago

Interesting that everybody knows when AWS goes down but Azure needs a "Tell HN" :)

Best of luck to the teams responding to this incident.

1 more reply

alt2276mo ago

Microsoft have started putting customer status pages up on windows.net, so it must be really really bad!

For example when I try to log into our payroll provider Brightpay, it sends me here:

https://bpuk1prod1environment.blob.core.windows.net/host-pro...

reid6mo ago

Portal and Azure CDN are down here in the SF Bay Area. Tenant azureedge.net DNS A queries are taking 2-6 seconds and most often return nothing. I got a couple successful A response in the last 10 minutes.

Edit: As of 9:19 AM Pacific time, I'm now getting successful A responses but they can take several seconds. The web server at that address is not responding.

m_fayer6mo ago

And there goes https://www.microsoft.com/

eeasss6mo ago

Deglobalization in geopolitics should be followed by deglobalization in cloud providers as well. Viva la local vendors.

chrisgeleven6mo ago

"Front Door" has to be the worst product name for a CDN I've ever heard of. I used to work for a CDN too.

unethical_ban6mo ago

I wonder if many Germans are eager to sign up for AFD.

But seriously I thought it would be the console, not a CDN.

1 more reply

oliyoung6mo ago

We should've never let marketing in the door honestly, all of the product names for the big three are awful.

Microsoft CDN

There, that's it. You're selling it to (hopefully) technical people

joquarky6mo ago

It so strongly implies a counterpart.

FrostKiwi6mo ago

Surprised to see the situation getting worse, what the hell.

Had some Frontdoor operations timing out, but now I'm straight up denied with "Message: All Changes to Azure Frondoor Configuration are blocked currently."

What a mess.

jacquesm6mo ago

It is much more than azure. One of my kids needs a key for their laptop and can't reach that either. Great excuse though, 'Azure ate my homework'. What a ridiculous world we are building. Fuck MS and their account requirements for windows.

elFarto6mo ago

We saw all incoming traffic to our app drop to zero at about 15:45. I wonder how long this one will take to fix.

sech84206mo ago

Same exact time for us as well.

NDizzle6mo ago

My best guess at the moment is something global like the CDN is having problems affecting things everywhere. I'm able to use a legacy application we have that goes directly to resources in uswest3, but I'm not able to use our more modern application which uses APIM/CDN networks at all.

vs4vijay6mo ago

Service Status: https://status.cloud.microsoft/ and https://azure.status.microsoft/en-us/status

ipsum26mo ago

Status page (first link) is down for me. Second one works

1 more reply

aftergibson6mo ago

Looks like the status page is overloaded...

andhuman6mo ago

I bet it’s DNS.

andhuman6mo ago

“ Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025”

pbhjpbhj6mo ago

That was my bet too, then I looked at ISC and noticed there were PoCs released for critical BIND9 vulns yesterday ... might be related?

vinyl76mo ago

Vibe coded internet keeps getting better

avgDev6mo ago

Quick find someone who can actually read documentation and code!

the_af6mo ago

You just paste the outage error codes back to the LLM and pray it's still working and can fix whatever went wrong!

1 more reply

ApolloFortyNine6mo ago

Two hours after the initial outage, they have finally updated the Front Door status on their status page.

LouisLazaris6mo ago

The VS Code website is down: https://code.visualstudio.com/

And so is Microsoft: http://www.microsoft.com/

codethief6mo ago

https://www.microsoft.com works for me (with the www subdomain).

tonymet6mo ago

Any healthcare IT admins care to chime in? A predominantly MS industry with critical workloads.

SoftTalker6mo ago

We're on Office 365 and so far it's still responding. At least Outlook and Teams is.

jeffdn6mo ago

They don't run on Azure!

2 more replies

rvz6mo ago

Looking forward to the post mortem.

internet_points6mo ago

> What went wrong and why?

> An inadvertent tenant configuration change within Azure Front Door (AFD) triggered a widespread service disruption affecting both Microsoft services and customer applications dependent on AFD for global content delivery. The change introduced an invalid or inconsistent configuration state that caused a significant number of AFD nodes to fail to load properly, leading to increased latencies, timeouts, and connection errors for downstream services.

> As unhealthy nodes dropped out of the global pool, traffic distribution across healthy nodes became imbalanced, amplifying the impact and causing intermittent availability even for regions that were partially healthy. We immediately blocked all further configuration changes to prevent additional propagation of the faulty state and began deploying a ‘last known good’ configuration across the global fleet. Recovery required reloading configurations across a large number of nodes and rebalancing traffic gradually to avoid overload conditions as nodes returned to service. This deliberate, phased recovery was necessary to stabilize the system while restoring scale and ensuring no recurrence of the issue.

> The trigger was traced to a faulty tenant configuration deployment process. Our protection mechanisms, to validate and block any erroneous deployments, failed due to a software defect which allowed the deployment to bypass safety validations. Safeguards have since been reviewed and additional validation and rollback controls have been immediately implemented to prevent similar issues in the future.

So, so far they're saying it's a combination of bad config + their config-validator had a bug. Would love more details.

1 more reply

udfalkso6mo ago

OpenAI Clip python library fails because the model download is a hardcoded azure cdn url :(

zingababba6mo ago

This brings to mind this -> https://thenewstack.io/github-will-prioritize-migrating-to-a...

chemodax6mo ago

It seems Azure FrontDoor is affected, because our private VM works fine in different regions.

bossyTeacher6mo ago

I noticed issues on Azure so I went to the status page. It said everything was fine even though the Azure Portal was down. It took more than 10 minutes for that status page to update.

How can one of the richest companies in the world not offer a better service?

Ylpertnodi6mo ago

>How can one of the richest companies in the world not offer a better service?

Better service costs money.

Jarwain6mo ago

On our end, our VMs are still working, so our gitlab instance is still up. Our services using Azure App Services are available through their provided url. However, Front Door is failing to resolve any domains that it was responsible for.

irusensei6mo ago

I was working when I saw the portal page showing only resource groups and lots of items missing. I thought it was a weird browser cache issue.

The actual stuff I was working on (App Insights, Function App) that was still open was operational.

mattdecker1006mo ago

Unable to use Ona's GitPod through VSCode SSH - Unable to download code server from https://update.code.visualstudio.com

baconbrand6mo ago

All of our sites went down. This is my company’s busiest time of year. Hooray.

a_f6mo ago

Looks like MyGet is impacted too. Seems like they use Azure:

>What is required to be able to use MyGet? ... MyGet runs its operations from the Microsoft Azure in the West Europe region, near Amsterdam, the Netherlands.

hypeatei6mo ago

All of my employers things are hosted on Azure and running just fine and didn't go down at all. Portal access has been fixed.

Doesn't seem to be too bad of an outage unless you were relying on Azure Front Door.

randomsofr6mo ago

SSO is down, Azure Portal Down and more, seems like a major outage. Already a lot of services seem to be affected: banks, airlines, consumer apps, etc.

2 more replies

_pdp_6mo ago

With all the recent outages considered, it is time to move off the cloud.

rcarmo6mo ago

Not seeing it. I have VMs in US East and Netherlands and they're up.

tgv6mo ago

I tried to look some things up on their support pages before 1600Z, and it timed-out. The Dutch railways are also affected (they're an MS shop, IIRC).

LaserToy6mo ago

Azure portal still insists the issue is jsut with Console.

We had to bypass the Frontdoor

8cvor6j844qw_d66mo ago

Quite close to the recent AWS outage. Let me take a look if its a major one similar to AWS.

Any guess on what's causing it?

In hindsight, I guess the foresight of some organizations to go multi-cloud was correct after all.

jcims6mo ago

We're multi-cloud and it really saved a few workloads last week with the AWS issue.

It's not easy though.

1 more reply

stuff4ben6mo ago

It's always freakin DNS...

iAMkenough6mo ago

Trusting AI without sufficient review and oversight of changes to production.

2 more replies

conroydave6mo ago

cost cutting attempts

avgDev6mo ago

I am having a bunch of issues. It looks like their sites and azure are both affected.

I also got weird notification in VS2022 that my license key was upgraded to Enterprise, but we did not purchase anything.

Mr_Bees696mo ago

Might be a failsafe, if you cant get a license status, and you're aware that MS is down, just default to the highest tier.

ThatManulTheCat6mo ago

Free upgrade

dlcarrier6mo ago

Yesterday Amazon, today Microsoft. Are Google's cloud services going down tomorrow?

gtowey6mo ago

This is because Azure just copies everything AWS does. Google is a bit more innovative, they will have something else unexpected happen.

1 more reply

Insanity6mo ago

Maybe they are and no one realized yet.. :P

That said, I don't hear about GCP outages all that often. I do think AWS might be leading in outages, but that's a gut feeling, I didn't look up numbers.

3 more replies

m_fayer6mo ago

And if they don't, we'll know who the culprit is.

1 more reply

briffle6mo ago

here's hoping its Oracle's cloud instead....

CKMo6mo ago

Reasons to not use hyperscalers, exhibit 654

There's a lot of outages this month!

anon0256mo ago

It's the DNS https://dnschecker.org/#A/get.helm.sh is unreachable

I_am_tiberius6mo ago

Why are Azure App Services still working?

thimkerbell6mo ago

Does (should, could) DownDetector also say what customer-facing services are down, when some infrastructure is unworking? Or is that the info that the malefactors are seeking?

alt2276mo ago

Cant access certain banking websites in the UK, I am assuming it because of this.

https://www.natwest.com/

tpl6mo ago

Part of this outage involves outlook hanging and then blaming random addins. Pretty terrible practice by Microsoft to blame random vendors for their own outage.

syntaxing6mo ago

I absolutely love the utility aspect of LLMs but part of me is curious if moving faster by using AI is going to make these sorts of failure more and more often.

monkaiju6mo ago

If true then what "utility" is there?

1 more reply

bronco210166mo ago

Unable to access the portal and any hit to SSO for other corporate accesses is also broken. Seems like there's something wrong in their Identity services.

btbuildem6mo ago

https://login.microsoftonline.com/ is down, so that's fun

user39393826mo ago

I know how to fix this but this community is too close minded and argumentative egocentric sensitive pedantic threatened angry etc to bother discussing it

1 more reply

perks_126mo ago

Thank you. I was wondering what was going on at a company whose web app I need to access. I just checked with BuiltWith and it seems they are on Azure.

ThatManulTheCat6mo ago

Azure portal currently mostly not working (UK)... Downdetector reporting various Microsoft linked services are out (Minecraft, Microsoft 365, Xbox...)

senderista6mo ago

Even if the cloud providers have much better reliability than most on-prem infra, the failure correlation they induce negates much of the benefit.

_oleksandr_6mo ago

Based on the delay in resolving the issue, it appears MC attempted to rehire some of the DevOps engineers whom AI had previously replaced.

1 more reply

djeastm6mo ago

I'm mid-deployment, but thankfully it seems to be running ok so far. Just the portal is not working so my visibility is not good.

nartaczact6mo ago

Sounds like Shrodinger's Deploy

bragma6mo ago

They suggest to use Traffic Manager to route around failing CDNs. But DNS is not working too, making the suggestion another fail.

tecleandor6mo ago

LinkedIn has been acting funny for an hour or so, and some pages in the learn.microsoft.com domain have been failing for me too...

ZeroConcerns6mo ago

Oh, well, I'm sure Azure will be given the same pass that AWS got here recently when they had their 12-hour outage...

taeric6mo ago

I didn't realize AWS got a pass?

1 more reply

speckx6mo ago

FYI: https://status.cloud.microsoft/

chuckadams6mo ago

Which itself is^H^H was down. Wow.

speckx6mo ago

FYI: https://status.cloud.microsoft/

Boxersteavee6mo ago

503 Service Unavailable

everfrustrated6mo ago

GitHub runners (specifically the "larger" runner types) are all down for us. These are known to be hosted on Azure.

martijnvds6mo ago

This probably explains why paying for street parking in Cologne by phone/web didn't work (eternal spinner) then

zbowling6mo ago

Alaska Airlines is redircting folks to their slimmed down international site and you can't check in on mobile.

zaoui_amine6mo ago

Language models aren't perfect; they can still generate similar outputs. Invertibility is a stretch.

1 more reply

smithkl426mo ago

The iron law of uptime: "The mandatory single point of failure in every possible system is configuration."

baconbrand6mo ago

Our Azure DevOps site is still functioning and our Azure hosted databases are accessible. Everything else is cooked.

jimmyl026mo ago

pretty interesting how datadog's uptime tracker (https://updog.ai/) says all the sites are fully available.

if that's true then it's a sign that Azure's control / data plane separation is doing it's job! at least for now

jonathanlydall6mo ago

Our Azure hosted dotnet App Service is working fine, but our docs site served via Front Door went down. Can’t access anything through the Portal.

layer86mo ago

Maybe they need a downtime tracker. ;)

ycombinatornews6mo ago

So that’s why CapitalOne is out today. Even though their (incorrect) status page says all systems operational.

glzone16mo ago

I remember the saying "It's always DNS". I'm old.

Kind of mindboggling it's still sometimes DNS maybe.

alt2276mo ago

That saying is just as alive today as it ever was.

https://isitdns.com/

montague276mo ago

Guess when/who has the next outage!

Mr_Bees696mo ago

MS website seems to be up but really slow. Think xbox might still be down, Bing works for some reason tho!?

udev40966mo ago

Luckily, no one uses azure and it's fully expected from azure to go down all the time! Keep it up!

ksec6mo ago

>Last week AWS, now this.

This is not the first or second time this happened, multiple Hyperscaler failed one by one.

twodave6mo ago

Appears to be an issue in Front Door. Our back end stuff is fine but FD is bouncing everything.

NDizzle6mo ago

Yeah, I have non prod environments that don't use FD that are functioning. Routing through FD does not work. And a different app, nonprod doesn't use FD (and is working) but loads assets from the CDN (which is not working).

FD and CDN are global resources and are experiencing issues. Probably some other global resources as well.

Hate to say it, but DNS is looking like it's still the undisputed champ.

tartieretOP6mo ago

it took a good half hour after we detected the problem to see a notification on the Azure status page. Thanks to those who responded to my question as it validated the issue was global and we contacted our users t right away

qmr6mo ago

Always in these large provider outages you see people who have forgotten the old ways.

vanviegen6mo ago

Many (all?) LinkedIn profiles are also down for me. Luckily the frontpage still works. ;-)

Go cloud!

AznHisoka6mo ago

Luckily?

AtNightWeCode6mo ago

Earnings report today. A coincidence?

I can at least login to Azure. But several MS sites are down.

zaoui_amine6mo ago

Yeah, Azure is a mess today. Can't do anything without the portal.

amluto6mo ago

vscode.dev appears to be down. I think this will be my excuse to find an alternative -- I never really liked vscode.dev anyway.

(Coder is currently at the top of the experiment list. Any other suggestions?)

redwood6mo ago

Is it Cosmos DB? If so the symmetry with AWS/Dynamo would be very eerie.

macshome6mo ago

I just tried to check the Xbox services status page and it never even loaded.

chokolad6mo ago

Majority of actual Xbox services are working fine, xbox.com itself is busted.

Shuddown6mo ago

Github Codespaces (for the 5 people that use them) are also still down.

DeathArrow6mo ago

Buy cloud because you're always safe! Until you aren't.

1 more reply

major5056mo ago

Somewhere, an ex microsoft engineer that where layoff during the last week, is saying to himself “thank god, this shit is not my problem anymore”

kryogen1c6mo ago

downdetector reports coincident cloudflare outage. is microsoft using cloudflare for management plane, or is there common infra? data center problem somewhere, maybe fiber backbone? BGP?

kryogen1c6mo ago

downdetector reports coincident cloudflare outage. is microsoft using cloudflare for management plane, or is there common infra? data center problem somewhere, maybe fiber backbone? BGP?

Mr_Bees696mo ago

nope, dont see any cf issues.

voidpointer20006mo ago

Down in Sweden Central as well (all our production systems are down)

jasonthorsness6mo ago

Ahh it got me, Alaska air web site has an Azure outage banner

ThatManulTheCat6mo ago

Yudkowsky's feared Superintellignece holding Azure hostage

somerandomness6mo ago

yep having trouble logging into https://entra.microsoft.com/ as well

wingless_angel6mo ago

Please sort it out, I'll be out of a job tomorrow.

ChuckMcM6mo ago

"On Prem" is looking better and better :-).

llimos6mo ago

Yep, down from here too (in Israel).

Services too, not just the portal.

andoma6mo ago

Can confirm

xer0x6mo ago

Wow, they are still down 12 hours later. :/

1 more reply

I_am_tiberius6mo ago

Shouldn't regions be completely independent?

pred8er6mo ago

on the line with msft, they said 4 hours is what they are thinking. a workaround they are saying is to use traffic manager,

rodolphoarruda6mo ago

I could not access MS Clarity the entire day.

ukblewis6mo ago

GitHub also seems to be having trouble for me

nextworddev6mo ago

Fascinating timing given the APEC summit ;)

acd6mo ago

Putting all your eggs software in one basket

howard9416mo ago

Took out the archive.ph and .is sites too?

uuuubbbb6mo ago

Intune, Azure, Entra down in Switzerland

opengrass6mo ago

Github Actions and Codespaces degraded.

kierenj6mo ago

microsoft.com is back -

edit: it worked once, then died again. So I guess - some resolvers, or FD servers may be working!

majnata6mo ago

The Azure API is still working though.

philipallstar6mo ago

Can't get to microsoft.com even.

zelias6mo ago

Anyone have betting odds on when Google will go down next? Are we looking at all 3 providers having outages in the span of 3 weeks?

xuf6mo ago

Down here too (region West Europe)

jacquesclouseau6mo ago

My bet is on a bad config change.

croemer6mo ago

They already announced that.

rluhar6mo ago

Looks like AWS is also impacted?

zavec6mo ago

Yeah the graph for that one looks exactly the same shape. I wonder if they were depending on some azure component somehow, or maybe there were things hosted on both and the azure failure made enough things failover to AWS that AWS couldn't cope? If that was the case I'd expect to see something similar with GCP too though.

Edit: nope looks like there's actually a spike on GCP as well

1 more reply

seinecle6mo ago

Can't connect to Claude

thewisenerd6mo ago

they recently had an incident with front door reachability, wonder if it's back.

QNBQ-5W8

okokwhatever6mo ago

This cannot be a coincidence

pred8er6mo ago

looks like MS completed a failover and things are be recovering slowly

giantg26mo ago

Compare the comments and news coverage on this compared to the AWS outage... pretty telling.

nflekkhnnn6mo ago

Shut the front door!

dlcarrier6mo ago

We're quickly learning who's relying on a single cloud provider.

Insanity6mo ago

Multi cloud is really hard to get right at scale, and honestly not worth the effort for the majority of companies and use-case.

shagie6mo ago

Like AWS or GCP? https://downdetector.com/status/aws-amazon-web-services/ - https://downdetector.com/status/google-cloud/

1 more reply

joaomoreno6mo ago

Yup, see it as well.

razodactyl6mo ago

AWS, now Azure - wasn't this a plot point in Terminator where SkyNet was causing computer systems to have issues much before it finally become self-aware?

Funnily enough, AI has been training on its own data as generated by users writing AI conversations back to the internet - there's a feedback loop at play.

worik6mo ago

An important quality of the cloud is that it is always available.

Except that it is not!

Interesting times...

journal6mo ago

one day these outages will cause a starvation.

_andrei_6mo ago

https://www.reddit.com/r/cscareerquestions/comments/1ojbebq/...

tonymet6mo ago

Hello fellow boomers!

I noticed that winget is also down eg.

  winget upgrade fabric
  Failed in attempting to update the source: winget
  An unexpected error occurred while executing the command:
  InternetOpenUrl() failed.
  0x80072ee7 : unknown error

pred8er6mo ago

things seem to be coming back up now

patching-trowel6mo ago

As of now Azure Status page still shows no incident. It must be manually updated, someone has to actively decide to acknowledge an issue, and they're just... not. It undermines confidence in that status page.

baconbrand6mo ago

I have never noticed that page being updated in a timely manner.

charles_f6mo ago

It shows that some people have issues accessing the portal.

amir734jj6mo ago

It's DNS

m_a_g6mo ago

It’s not DNS

There is no way it’s DNS

It was DNS

AtNightWeCode6mo ago

From Azure status page: "Customers can consider implementing failover strategies with Azure Traffic Manager, to fail over from Azure Front Door to your origins".

What a terrible advise.

rsolva6mo ago

So that's why all of our municipality's digital services are down ... utter chaos at the political meeting I attended just now.

siva76mo ago

auth services are down

barpol6mo ago

still down

improbableinf6mo ago

What a time to be alive!

shivenigma6mo ago

what's happening? self hosting advocate groups attacking all cloud to prove their point?

zzake6mo ago

Portal is now accessible, bypassing FDN

tonyhart76mo ago

Wtf happen with US east????

llama0526mo ago

Just another day with microsoft. Honestly pretty tiring as something is always generally broken.

1 more reply

bernardo7866mo ago

now aws down again?

rawgabbit6mo ago

Meanwhile the layoffs continue https://www.entrepreneur.com/business-news/microsoft-ceo-exp...

the_af6mo ago

I especially like how Nadella speaks of layoffs as some kind of uncontrollable natural disaster, like a hurricane, caused by no-one in particular. A kind of "God works in mysterious ways".

    > “Microsoft is being recognized and rewarded at levels never seen before,” Nadella wrote. “And yet, at the same time, we’ve undergone layoffs. This is the enigma of success in an industry that has no franchise value.”
     
    > Nadella explained the disconnect between thriving financials and layoffs by stating that “progress isn’t linear” and that it is “sometimes dissonant, and always demanding.”

I've read the whole memo and it's actually worse than those excerpts. Nadella doesn't even claim these were low performers:

    > These decisions are among the most difficult we have to make. They affect people we’ve worked alongside, learned from, and shared countless moments with—our colleagues, teammates, and friends.

Ok, so Microsoft is thriving, these were friends and people "we've learned from", but they must go because... uh... "progress isn't linear". Well, thanks Nadella! That explains so much!

ctoth6mo ago

Layoffs will continue until uptime improves!

1 more reply

FeteCommuniste6mo ago

> [Satya Nadella] said that the company’s future opportunity was to bring AI to all eight billion people on the planet.

But what if I don't want AI brought to me?

4 more replies

delf6mo ago

The outage impacted GitSocial minor version bump release: https://marketplace.visualstudio.com/items?itemName=GitSocia...

There's no way to tell, and after about 30 minutes, the release process on VS Code Marketplace failed with a cryptic message: "Repository signing for extension file failed.". And there's no way to restart/resume it.

almosthere6mo ago

Reports of Azure and AWS down on the same day? Infrastructure terrorism?

reaperducer6mo ago

Reports of Azure and AWS down on the same day? Infrastructure terrorism?

> We have confirmed that an inadvertent configuration change as the trigger event for this issue.

Save the speculation for Reddit. HN is better than that.

12_throw_away6mo ago

> Infrastructure terrorism?

Unless that's a euphemism for "vibe coding", no.

improbableinf6mo ago

According to downtector.com - both AWS and GCP are down as well. Interesting

jasonjmcghee6mo ago

Don't visit this address.

j / k navigate · click thread line to collapse