1) malware is often very aggressive and fast-spreading, and once it's on a user's computer it's hard to get off, therefore...
2) the system to detect it and stop access to the site has to be automated, not a human-in-the-loop system that might take hours or days to shut off access to a site which is infecting many users per minute, and...
3) the more clarity there is on how exactly that automated system works, the more certain we can be that malware will be able to evade it; it's much like how spam detection or search page rankings are opaque, because the incentives to game the system are very great
I'm not saying Google's system is perfect, but I am saying it's a very hard problem to solve in a way that doesn't give us an even worse time stopping malware spread than we already have. So while it is hard to feel sorry for a company as wealthy and powerful as Google, I think the issue is not as clear-cut as some comments on this thread seem to suggest.
But that being said, by far the biggest problem is just the lack of recourse and communication. Compare this to email spam prevention and the like, which solves a very similar problem, but if you accidentally get blacklisted you can just talk to the Spamhaus people or whatnot and get the problem sorted.
It's not hard to imagine how Google could improve here: send better notifications when something is blacklisted, provide a reason why, and offer a better procedure to get your problem fixed.
Yes, this will cost time and money due to the large scale of things. But if you have the ability to block parts of the internet for much of the population then you also have some responsibility here; you can quite literally kill companies with this. Email spam prevention services usually step up to this responsibility. Google ... not so much.
Mistakes will still happen, and that's okay. I appreciate the hard job they're doing, which does provide a lot of value. It's how you deal with those mistakes that matters, and Google deals with them terribly across all of their products.
> One common criticism is that there was no way to contact SPEWS. According to the SPEWS FAQ: "Q41: How does one contact SPEWS? A41: One does not..."
https://en.wikipedia.org/wiki/Spam_Prevention_Early_Warning_...
It also doesn't seem like sites like Facebook, Reddit, YouTube, Google Photos, etc. run into this problem, even though they allow user-uploaded content, so there seems to be some kind of bias against smaller companies.
PS. Twitter still does not allow me to share links to OP's website.
I was surprised this wasn't part of the lessons learned. The monitoring basically failed, yet that wasn't listed as a lesson.
I feel like the majority of uptime monitors fall into this same trap. It's one of the reasons why, for my monitoring service [1], I chose to do full page load monitoring via Chrome instead of just an HTTP request via curl or whatever. The main reason: people care whether their webpage loads, and how long it takes to load. Having a website respond in 200ms is great, but if it takes 8000ms for all the JS to load and process, your website is still slow. I get why sites just do curl requests, because it's way cheaper, but then you're monitoring one part of the stack while really caring about all of it. If your website starts producing JavaScript errors you want to know, etc.
[1] https://www.ootliers.com (The landing page and everything are terrible and I'm working on improving that)
For example:
- a frequent/simple check dealing directly (on the internal network) with the webserver ("does it work well yes/no, what's the raw response time, etc..."). Here is where I would definitely use "curl".
- another less frequent test involving as well the DNS and the external network.
- another end-to-end test (e.g. once every 10 minutes?) involving as well one or more real browsers (this would test as well for example revoked SSL certs).
=> all this information and these metrics should be quite helpful for identifying problems, or at least for shrinking the potential area that is causing them.
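The first two layers above can be sketched in a few lines of Python (hostnames and thresholds are placeholders; the end-to-end browser layer would need a headless Chrome driver such as Selenium or Playwright, which is omitted here):

```python
import socket
import time
import urllib.request

def check_dns(host):
    """DNS layer: can the hostname be resolved, and how fast?"""
    start = time.monotonic()
    try:
        socket.getaddrinfo(host, None)
        return True, time.monotonic() - start
    except socket.gaierror:
        return False, time.monotonic() - start

def check_http(url, timeout=5):
    """Raw HTTP layer (the "curl" check): status code and response
    time only; says nothing about whether the page's JS actually runs."""
    start = time.monotonic()
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.status, time.monotonic() - start
```

Running the cheap checks frequently and the expensive browser check every 10 minutes keeps costs down while still letting you narrow down which layer is failing when something breaks.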
The main offer is order monitoring, but I am in the middle of creating a page-load-only monitoring offer for others, since I think that service by itself is super useful.
But the key thing for me is the "goal" monitoring. For ecommerce it's orders, but for other systems it's different. That's the thing I really want to monitor: if other things break they'll affect those goals, so you can detect lots of failures. The only issue is finding out what the cause is. But first I'll improve the anomaly detection a bunch before looking into root cause detection.
To be fair, I've not had this happen yet, so I am going to try and find a site that Chrome won't let me visit and see what happens when I visit it programmatically.
Why would they flag that as a phishing site?
I’m just having a difficult time determining how this situation is not the fault of the site/app; we don’t even know that any of this is true and it looks more scripted than an offended rant.
They could use Google's safe browsing api to check if they're on that list as well as curl.
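A sketch of that check against the Safe Browsing v4 Lookup API (the `clientId` is a placeholder and `api_key` must be your own key; the body format follows Google's published v4 `threatMatches:find` schema):

```python
import json
import urllib.request

LOOKUP_ENDPOINT = "https://safebrowsing.googleapis.com/v4/threatMatches:find"

def build_lookup_request(urls):
    """Build the JSON body for a Safe Browsing v4 threatMatches:find call."""
    return {
        "client": {"clientId": "example-monitor", "clientVersion": "1.0"},  # placeholder ID
        "threatInfo": {
            "threatTypes": ["MALWARE", "SOCIAL_ENGINEERING", "UNWANTED_SOFTWARE"],
            "platformTypes": ["ANY_PLATFORM"],
            "threatEntryTypes": ["URL"],
            "threatEntries": [{"url": u} for u in urls],
        },
    }

def check_urls(urls, api_key):
    """POST the lookup; an empty "matches" list means none of the
    URLs are currently on the Safe Browsing blocklists."""
    body = json.dumps(build_lookup_request(urls)).encode()
    req = urllib.request.Request(
        f"{LOOKUP_ENDPOINT}?key={api_key}",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read() or b"{}").get("matches", [])
```

Polling your own domain this way would at least tell you that you've been flagged before your customers do.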
On the other hand it seems, based on this and many other posts, that there isn't much communication from Google to its "clients" to 1) explain what's wrong and 2) quickly/directly ask for a reevaluation (e.g. after the problem has been fixed, to question the validity of the problem, etc.)?
I understand that there might be bad actors around doing everything on purpose on their website/app and that therefore #1 (basically telling the bad people why they got detected) would be a bit of a gray zone, but at least #2 should be a no-brainer (e.g. in the case of the previous ".ass"-files-case anybody in any support desk could have immediately whitelisted that "problem")?
Then that's a problem with the rules, which need to be clarified to better encode the "spirit" thereof. Hiding the rules entirely is a poor substitute for that.
What are you saying?
Google should have been regulated years ago, instead, they have been allowed to snap up every smaller company to solidify their position in the market and ensure they and only they are allowed positions of power, control and authority.
If Google dislikes you (or their baseless algorithms that are detached from reality) then you are toast. How long before Google's algorithm results in an actual human death? It doesn't seem far-fetched at all; it's entirely plausible.
Yet you let this happen. Or rather, it seems this isn't concerning enough to warrant a massive protest; after all, Big Tech controls protest online and can just shut it down. Amazon seems to have been mightily effective at stopping any "union" movement, so we know the censorship machines are fine-tuned and ready to fire at any moment.
We need to be talking about this daily, it needs to be front and center for weeks and weeks, and we need to demand accountability. We are ruled and governed not by elected officials but by faceless, nameless, non-human machines. They do not Think. They do not Talk. They do not care.
Yet this thread will disappear in a few short hours, and this will be just another episode of the weekly "Google's systems are out of control and one developer got caught out, too bad I hope they are okay".
This is undoubtedly happening to thousands of others who don't make Hacker News or don't have the resources/energy to fix it.
We should demand better.
Of course they know. Everybody knows, it's just a series of tubes.
But that's not the point. The people in charge also know:
> If Google dislikes you (or their baseless algorithms that are detached from reality) then you are toast.
Replace Google here with FAANG, and see how whole countries are completely dependent on those companies. At this point those companies can blackmail any government on earth into almost anything they want. The FAANG companies are actually richer than most countries on this planet.
I think if FAANG didn’t already control so much of our communications you might see such advocacy groups, but as it is...
Do you want to be the face of a campaign that will piss off FAANG?
We are getting stories like this on a weekly basis now.
Google is clearly causing measurable harm to your company and you. And apparently to thousands before you.
Considering how much money patent trolls manage to extract from Big Tech with considerably weaker cases, how is it that everybody is treating Google like a fragile grandmother with dementia, going out of their way not to hold them responsible in court?
This is not a rhetorical question. I really don't get it.
America is the land of getting millions in settlement when McDonald's gives you coffee that is hotter than you anticipated. How the hell is Google getting away with their behavior?
The coffee was not merely "hotter than you anticipated" (although that's at least sort of right); it was near boiling: McDonald's required franchisees to hold coffee at 180–190 °F, roughly twenty degrees hotter than what other establishments hold coffee at, and much closer to actual boiling. She had third-degree burns on six percent of her body, six rather sensitive percent. She needed an eight-day hospital stay just for skin grafts. I once dug up the photos, by the way; they're rather unpleasant, and I say that as someone who has attended autopsies.
Of course, the temperature differences may not seem like much, but a ten-degree drop at that point changes the time to "skin graft city" from three seconds to perhaps four or five times that.
Final verdict, before settlement, was $640,000, not "millions." The parties settled out of court for an undisclosed final amount less than $600,000.
1. publishers want to be able to put content on the Web without undergoing background checks
2. everyone wants to be able to discover content with as little friction as possible
3. consumers don’t want to drown in unwanted crap
The incomprehensible Algorithm is the result of trying to square that circle. Give up any of those requirements, and the arms race would end:
Give up #1, and it’ll be possible to do all of the rules enforcement reactively, with no algorithms and no inhumane call centers, because when someone is banned, they’ll stay banned. The ban will be tied to a legal name and anyone caught ban-dodging can be sued.
Give up #2, and it won’t matter how much spam you make available on the web because nobody will fall victim to it. The web becomes less like a publishing platform and more P2P, because you basically only find content on there through your in-person social contacts.
Give up #3, and you don’t need Safe Browsing any more. Good luck selling that to everyone, though.
In order to sue them, you need to come up with something that they should’ve done but didn’t. Having a human review every web page that’s ever published is obviously dumb, so they’re going to have to go with the algorithmic approach.
This doesn't actually work because the people doing bad stuff are criminals with no qualms about committing crimes, like identity theft. Some large fraction of spam is sent from compromised but otherwise legitimate mail servers.
> Give up #2, and it won’t matter how much spam you make available on the web because nobody will fall victim to it. The web becomes less like a publishing platform and more P2P, because you basically only find content on there through your in-person social contacts.
This is the one you can actually fix because it's a spectrum rather than binary. It's also something that doesn't need to be a monopoly, and not being a monopoly would significantly reduce the consequences of mistakes.
Discovery is also fundamentally a search issue. Not putting something you suspect of being spam in the first page of your search results is a world away from shutting down some guilty until proven innocent third party's DNS or hosting.
How so? If you sue for damages, you only have to prove you were harmed by Google's actions, no? And actively misrepresenting your website as dangerous and deceptive to your customers is sort of libelous and clearly damaging.
Currently they have all the benefits of their monopoly with none of the responsibility which is exactly the way they like it.
They have enough money to influence the USA government if anything changing the situation were to be introduced.
It's unlikely that any claimant would be able to show a contractual provision that enables them to claim for damages against Google (thus allowing them to sue in contract), so a cause of action for tort would be the usual way to sue Google - except unless Google makes you suffer some form of physical harm or damages your property, you're unlikely to be able to recover any damages for your website suffering these consequences, in the UK at least. I understand US law may be quite different.
There's a testable argument to be made about the requirement for "damage" to your property (the website) being inflicted by the certificate warning, but policy arguments on the matter of "ripple effect" liability make it seem likely the courts would hold that Google isn't liable.
Also Google is probably far better placed to weather lawsuits than most ordinary people; they can probably afford to induce the other party to settle out of court, and presumably the relevant monopoly and abuse of market position laws only allow a regulator to take legal action (the ordinary consumer being restricted to contract and tort lawsuits).
I'm probably misunderstanding your argument here, but if, say, Google steals your bike that would be purely economic damage. Surely the UK legal system would still punish that...!?
Yeah, it's a really good question. We got all these fully staffed insanely rich companies causing measurable harm to people. They just insist there's nothing they can do to stop it. Why does everyone believe them?
Users won't sue as long as there's no meaningful harm to the users. And there's essentially no meaningful harm to the users from dropping a single site. As a user, I don't care if any particular site hosts malware and gets blocked - that's what I want. If that site comes back slowly, I don't care either. That's the website owner's loss.
Government doesn't have standing to sue as long as there's no discriminatory effect - and as long as the selection criteria are fair (malware/phishing) and they are not negligent in fixing false positives, the government will have a hard time finding a leg to stand on.
It comes down to this: as a society, safe browsing APIs are critically important, and they have been working reasonably well. You'd have to show they are mismanaging things, or malicious, or doing damage to users. There's no evidence for any of those.
Regulatory Capture.
The dividing line between big tech and big gov is far thinner than most people consider.
McDonald's burned off a woman's labia, after burning the flesh of several people with coffee tens of degrees hotter than is safe, and then refused to simply pay her medical bills, prompting a lawsuit.
Has Google burned your labia?
They interfered with the contract OP has with their customers.
That’s some dumb shit right there.
Help HN: Google just blocked my site as deceptive site - https://news.ycombinator.com/item?id=26326528 - March 2021 (20 comments)
I just looked over the site a little more. The business idea seems to be to have a widget to add to your site that can be used to upload arbitrary files to it. The real advantage looks to be that they have a bunch of integrations set up with Facebook, GDrive, Dropbox, Instagram, etc so that all just works without you having to set up and manage developer accounts with 10 different services. Plus built-in image resizing and such things that all just works. Pretty cool, and I might use it if I built a site that needed to do that.
One way you can frame the point of this business is that they worry about the details of integrating with these other services so that you don't have to. As they found out, hosting external content is inherently dangerous, and it pays to have someone responsible for that who knows the risks and has experience in mitigating them. If a site owner wasn't using this service, they would have to take that responsibility on for themselves and re-learn these same lessons. So that's just another advantage of using this service - "we have experience in mitigating the risk of hostile users abusing upload services to serve malware, so you don't have to worry about it".
The site owner has not confirmed they screened all uploaded content for malware - this is a major issue these days, and Google and others will flag you if you host viruses and pump out malware.
And no - you cannot sue Google to force them to allow users to be infected.
It’s not clear that all customer content is hosted on a separate domain, with each customer on a separate subdomain. Your reputation will be trashed pretty quickly if you blindly host content on your main domain.
It’s not clear that all uploaded content is protected from being linked to or downloaded. Google admins and other antivirus vendors can set up screening on downloads.
Anyway - I see plenty of shady/scammy and incompetent website owners hosting malware - not much sympathy in most cases.
This is actually a big issue sometimes for folks who use Google Drive: malware infects their files, the files then get synced to Google, and then they're blocked from ever downloading them again!
That leads to support requests like this:
https://support.google.com/a/thread/60528209?hl=en
Even if you pay the ransomware fee, Google WILL NOT let you access your own files again. So years' worth of files - GONE.
They do use different origins for these services. Google DOES actively ban users from their services (everything: YouTube, Drive, and email), even users using Google services (vs. a third-party upload service). I.e., if you are going to run a phishing scam, host the image on a service like this one, not Google, or your Drive account plus a lot of other stuff is at risk. I've even seen Google follow links to other accounts your Google account is an admin on, so it can have business impacts and more.
I don't know where the idea comes from that Google is very hands-off on this stuff. They run a major spam-fighting op that blocks lots of even potentially legit email, they do tons of scans through Chrome, and they do advanced stuff for opt-in domains on their paid platforms (even more intrusive, but it lets them pick stuff up behind password-locked pages etc.).
This last one can really confuse site owners, when NON PUBLIC content results in site bans.
Disclaimer: I'm a Google SRE. But never supported anything reachable from the outside.
I would have thought this would be an excellent way to not host malware.
Has anyone tried? Genuinely curious how this would turn out.
You are warned: the above link is a phishing website that, when used, will spam your whole Facebook friends list with the same link via message. Google Chrome, still today, shows it as a normal website: https://imgur.com/a/1bsFutr
I was trying to buy a school bus to make a skoolie out of. The Craigslist ad directed me to a seemingly innocuous eBay Motors link that looked pretty close to the real thing. I was busy and clicked, totally intending to drop $5k. I got distracted and had to come back to it later; when I did, credit card in hand, the page showed the red screen with a huge warning. A closer look revealed the bad URL.
Saved by google? Oh god, I think I need a shower now.
I just ran ads with headlines like "Nextjs + TailwindCSS Landing Pages".
Apparently somehow I ran afoul of their Circumventing Systems policy. I don't know how this qualifies and when I appealed they came back saying the same thing.
> (from a screenshot) I work at Google [...] so I escalated your issue [...]
> I believe the HN thread getting on the homepage tremendously helped me and somebody from Google saw it and expedited the review after all
So, once more, an issue with FAANG could only be fixed because somebody knew somebody else who went out of their way to get this in front of the right eyes.
This could easily have gone another way and OP would have received no help whatsoever and would have waited for days or weeks to get this issue cleared and lost his business.
Maybe it's only me, but I find it unbearable that you usually can't reach any real person at all for issues like these, and that it's pure luck what happens to you.
I reached a real person at Google Domains and managed to get things escalated to "Specialist". Their response: "we can't help you, post about it on the community forums" (which I had already done 20 days prior).
This account "owns" digital goods, thousands of songs, and many domain names. Google is actively stealing these things, but they don't care and, "can't help".
Funnily enough, that password leaked and someone managed to take over my account (I wonder how they managed to bypass the security questions; sounds like a security vulnerability on Apple's side), and they're using it to register (I assume) stolen devices and install software. I get email notifications every time these people do something.
I reached Apple support but they're unwilling to help, they even refuse to nuke the account as a last resort.
I wonder how many people lost their account like me because of these overzealous security measures.
I long for the day that they cross the wrong person with means to take them to court over their negligence.
It's always the same story. Some guy gets on Twitter or HN who happens to get noticed, then FAANG releases a statement saying they made a "mistake". Mistakes in the aggregate that affect millions of people aren't "mistakes." That's deliberate malfeasance at scale.
Funny they never ask you that design question in interviews. "Design a system which will harm at most 5% of your users while scaling up to billions of people." Maybe if more people understood the sobering dark side of scale, they would stop gleefully promoting runaway scale-at-any-cost engineering.
Just kidding. Profit is God.
I'm also reminded of the dystopian movie Brazil. You're always in danger of getting eaten by the bureaucratic machine today, with only the most absurd recourse available. Just read the passive indifference of the email that Google sent this guy: "Google has received and processed...", "Google systems indicate...". This is one shit dystopia we are living in.
If you know important people at the emperor's court, you have a chance to get your problem solved.
Even with HN it's a complete lottery what contents reaches the front page, so getting issues like these resolved is a matter of extreme luck for a common person.
If enough companies contribute to this, it might put some pressure on FAANG to take things seriously.
They kind of do
> So after a lot of brainstorming and ideas from HNers I finally figured out the culprit(s).
> We have a live demo on our home where people can upload a test file. [...]
> We also give all users a 20MB test storage. [...]
> I believe that somebody signed up for our service (it’s free to sign up) and then uploaded a malicious file on our test storage and abused this feature.
If that is correct, Google was completely in the right to flag the domain as malicious and warn visitors.
edit: just noticed, their comp.lang.c archives are back up now
NodeBB does host a demo instance to allow people to kick the tires. I don't believe we allow people to upload images, but it is worth double checking just in case.
https://github.com/publicsuffix/list/blob/master/public_suff...
https://symantec-enterprise-blogs.security.com/blogs/feature...
Way less nowadays due to all the employee shaming.
I have gotten warnings from Google multiple times about hosting NSFW images (that is not the purpose of the site) on pages that have ads. This isn't Google disliking NSFW content - it's Google not liking NSFW content and ads together. Due to multiple warnings, and worried about bans, I now actually manually review each image. This is easier than it sounds: I wrote myself a batch script and review in chunks before I allow Google to view any images.
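The chunked-review workflow is easy to replicate. A rough Python equivalent of such a script (the directory layout, extensions, and batch size are my assumptions, not the commenter's actual setup):

```python
from pathlib import Path

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".gif", ".webp"}

def review_batches(directory, batch_size=20):
    """Yield newly uploaded images in fixed-size chunks so a human
    can eyeball each batch before the files go public."""
    images = sorted(
        p for p in Path(directory).iterdir()
        if p.suffix.lower() in IMAGE_EXTS
    )
    for i in range(0, len(images), batch_size):
        yield images[i:i + batch_size]
```

Approved batches can then be moved to the publicly served directory, keeping unreviewed uploads invisible to crawlers in the meantime.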
"Potential phishing attempt. This web page tries to trick visitors to submit sensitive personal information such as login data or credit card numbers."
Is this somehow related to the Google situation?
The question I have is: what can anyone do to really change things? If we all agree this is a major issue, why can't we find a reasonable solution to it?
Especially if you can show damages/customers cancelling service, I think this would be a hill to die on. Google et al. have too much power, even over people and orgs that aren't even customers. It's high time we rein their powers in, hold them strongly culpable for what they do (and what they change and then refuse to do), and consider breaking these monster companies up when they show they are against the public interest.
Were you, uploaderwin, given notice (say to abuse@uploader.win, admin@uploader.win or other appropriate addresses) prior to being effectively banned WRT Google? I'd go out on a limb and say you didn't. No, you have to be aware of the right page at Google, register yourself as an admin of the site, and hope they share what they consider abuse.
And frankly, you were lucky you got the social media escalation. You should have never had this happen... But here we are.
The website owner's theory is that someone used their demo to upload a genuinely malicious file, and presumably then shared it to others. Adding the site to their blocklist immediately is a reasonable action taken in defense of Google's users. It's certainly not libel for them to truthfully say the website is hosting malicious content. Well, not in the US; other jurisdictions don't necessarily have truth as a defense. (Tortious interference is complicated, but typically requires that the person interfering knows about the business relationship they're obstructing, and is taking the action for the purpose of obstructing it. It seems like a stretch here.)
As always with Google, the real issue here is their awful communication and slow responses to people who can't find a way to go outside the normal channels.
EDIT: and the article has some useful suggestions for practices to follow if you need to let people upload files as a demo. I hadn't really considered the purpose of a separate domain for such things from this angle before.
Even if it is correct, we can't assume it will always be correct.
> As always with Google, the real issue here is their awful communication and slow responses to people who can't find a way to go outside the normal channels.
The real problem is that their slow response can kill businesses (or maybe people). If they are wielding this much power, there must at least be some paid support service. I guess it is time all governments looked into this and regulated FAANG.
1. The poster was hosting malicious content from their domain (user uploaded no doubt, but still on the domain they control).
2. On one hand, it is desirable that people who are not malicious be given enough information as fast as possible to rectify their sites.
3. On the other hand, this same sort of information can make it easier for malicious users to evade detection.
That is, it seems to me like there is an inherent tension between #2 and #3 that make a simple solution difficult.
Seems to me that:
1. As the poster discovered, user content should always be hosted on a separate domain. Google should recommend this as a standard good practice.
2. Perhaps I'm missing something, but when Google blocks an entire domain, I don't see the harm in telling the site owner which subdomain is causing the flag, which could let good users identify the problem faster.
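The separate-domain practice from point 1 can be sketched like this (the sandbox domain name is a made-up placeholder; real services typically also register such a domain on the Public Suffix List so browsers treat each customer subdomain as an isolated site):

```python
import secrets

# Placeholder sandbox domain, deliberately unrelated to the main site.
USER_CONTENT_DOMAIN = "example-usercontent.net"

def user_content_url(customer_id, filename):
    """Build a URL for an uploaded file on the isolated domain.
    One subdomain per customer keeps a single bad upload from
    tainting the main site or other customers' content."""
    token = secrets.token_urlsafe(8)  # unguessable path segment
    return f"https://{customer_id}.{USER_CONTENT_DOMAIN}/{token}/{filename}"
```

If the sandbox domain does get flagged, the main site's reputation (and the other customers' subdomains, if the domain is on the Public Suffix List) stays intact while you clean up.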
I never bought that excuse. That sounds like saying we should be secretive about legal charges brought against a person, lest that information help criminals evade detection.
For SVG, just paste the markup into your HTML. Browser support is excellent and it will weigh less than using a base64-encoded string.
You will be able to style it using CSS as if it was regular HTML, use JS, etc.
https://news.ycombinator.com/threads?id=daave
What the?
Really a disgusting company.
that's quite a strong word. For the average Joe, Google has immeasurably improved their internet experience. The vast majority of people are perfectly happy with Google and love it for Gmail, YouTube, etc. Even though Google is good at destroying some people's lives, most users don't really care at all. You might find it hard to recruit supporters just because Google is horrible sometimes.