Google Drive flags file only containing “1” for copyright infringement

(twitter.com)

1117 points thanatosmin 4y ago 449 comments

I'm curious if it was related to the file name. I created a few 1-byte files with just "1" in them, with different names, including "output04.txt". No problems so far. Also uploaded variations with "\n" and "\r\n" after the "1". And enabled sharing to anyone with the link. No issues so far.

Google drive does support metadata like a description and comments. I wonder if someone posted some copyrighted text in a comment?

Update: Recreated it. Most of them are now flagged; it took about an hour for that to happen. So far the flagged ones are all the files containing just the single byte "1", plus the one that contains "1\n".

The one with "1\r\n" hasn't been flagged. The file names of the flagged files: "one.txt", "onev2.txt", "output04.txt" and "output05.txt".

Screenshots of the email and Google drive: https://imgur.com/a/RHnEJcj (note the little flags on the Google drive view, and the file sizes)

Just added some files with "0" and "0\n", we'll see if "0" is copyrighted :)
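A minimal sketch of how the test files described above could be generated locally before uploading. The file names are hypothetical placeholders (the comment does not say which name held which content), and the upload step itself is left out.

```python
from pathlib import Path
import tempfile

# Content variants from the experiment above; the file names are
# hypothetical placeholders, not the names used in the original test.
VARIANTS = {
    "one_bare.txt": b"1",
    "one_lf.txt": b"1\n",
    "one_crlf.txt": b"1\r\n",
    "zero_bare.txt": b"0",
    "zero_lf.txt": b"0\n",
}


def write_variants(directory: Path) -> dict[str, int]:
    """Write each variant in binary mode and return its size in bytes."""
    sizes = {}
    for name, payload in VARIANTS.items():
        path = directory / name
        # Binary mode keeps the line endings exactly as specified.
        path.write_bytes(payload)
        sizes[name] = path.stat().st_size
    return sizes


if __name__ == "__main__":
    out_dir = Path(tempfile.mkdtemp(prefix="drive_test_"))
    for name, size in write_variants(out_dir).items():
        print(f"{out_dir / name}: {size} byte(s)")
```

Writing bytes rather than text matters here, since a text-mode write on some platforms would silently turn "\n" into "\r\n" and blur the very difference being tested.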

OneLeggedCat4y ago

Just tested this myself. Got the same results as you while using different file names.

Difficult to believe that Google has become so internally dysfunctional that it hasn't fixed this technical embarrassment twenty hours after first being seen, and five hours after being reported on the front page of Hacker News.

Barrin924y ago

At the scale at which Google operates, even if they fixed it within a minute it probably takes some time to test and roll these things out.

Sometimes I wonder if people think that infrastructure for billions of people is some sort of magic where you push the fix button and magically everything's solved across the globe.

rezonant4y ago

Sure it'll take time.

Google cannot have it both ways though. They have a habit of completely restricting all ways of appealing or contesting these issues ostensibly because they think their tech is so good that it isn't required (at least in the vast majority of cases), and yet we have people permanently locked out of their personal and work GMails for months on end (or indefinitely) and now being accused of copyright infringement for the digit "1", with a bold assertion at the end of the email saying "This restriction cannot be reviewed".

Google's become so big that collapse/disruption is inevitable.

OneLeggedCat4y ago

I'd have more sympathy if it weren't "A review cannot be requested for this restriction." It's designed to be as brutal and draconian as possible. They chose this. It is guilty until proven innocent, with no recourse.

Sometimes I wonder if people will excuse anything Google does that hurts people and assume it is some sort of magic to not choose to do that.

fragmede4y ago

The faster the fix can roll out, the faster a breakage can also roll out. Move slower and break fewer things.

dhosek4y ago

I remember interviewing at Amazon (this was 2009) and being told that any engineer could deploy to the main site at any time. The thought terrified me.

cardosof4y ago

A product manager with an MBA will argue this affects only a handful of customers, mostly non-paying, so fixing it should sit at the end of the backlog.

jodrellblank4y ago

Is that wrong? How many outraged people here are going to stop using paid-for Google products because of this? (people who didn't stop using paid-for Google products for all the times this kind of thing happened before). It's approximately none, isn't it?

mianos4y ago

I assume 99% of googlers are juggling production issues. Maybe the rest are supporting the ad platforms, as the public facing applications have been fairly static for years, aside from moving the comments on youtube to the top and bottom and back up to the top again.

I wish they would finish the thumbs down functionality in youtube music or look at WearOS once in a while (150,000 one-star reviews LOL).

Some may say 'you get what you pay for', but after a few paid google gapps accounts getting locked out lately for no good reason, I'll be moving and not paying anymore after drawing a blank from support.

judge20204y ago

It only affects sharing so it isn't a disaster if this happens, probably P3 medium if you went by their bug bounty priorities.

tyingq4y ago

That is what I saw, I can still access the file. I do wonder, though, if there's some threshold of flagged files where they suddenly enact some other punishment.

LightG4y ago

Alternatively, just the next brick laid upon the pyramid of "Google sux".

tyingq4y ago

Update: It has now flagged the file with "0\n" in it.

So it's official: both 0 and 1 are copyrighted :)

https://imgur.com/a/xMgh6Xn

teachrdan4y ago

Relevant Onion article here: https://www.theonion.com/microsoft-patents-ones-zeroes-18195...

PixelOfDeath4y ago

Relevant real Microsoft patent on the IS NOT OPERATOR:

https://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2...

labster4y ago

Why is satire so much better at predicting our dumb reality than sci-fi? This is definitely the darkest timeline.

ImprovedSilence4y ago

Haha wow. In another testament to the times, the length of yesterday’s satire seems as long as today’s “long form”, and much more developed than today’s two-line satire “articles”. /end soapbox

rgoulter4y ago

Relevant SMBC comic: "Mister President! India is suing for half of our data!" https://www.smbc-comics.com/comic/2012-08-29

jfoster4y ago

Perhaps this issue came about because Google's AI read this article.

kregasaurusrex4y ago

From 1998 no less!

steelframe4y ago

Careful there. My law firm in east Texas happens to own the patent rights to a system and method of concatenating a plurality of 1s and 0s for the purpose of generating an encoding of a creative work, and as such you are in peril of infringing on one of my claims with your concatenation of 1s and 0s!

politician4y ago

General Mills owns the copyright on 0s, sir. (Cheerios)

meshaneian4y ago

I believe 0 and 1 belong in the public domain, at least under US copyright law - there appears to be a significant amount of prior art.

judge20204y ago

FYI while Drive and YouTube's Content ID systems operate based on copyright, they are not entirely bound to it since they only actually have to comply with DMCA requests (barring any lawsuits that claim Google has the resources to create automated copyright detection systems[0]). Even when claiming copyright, they can remove and block access to content for any reason they want legally.

0: https://www.nytimes.com/2014/03/19/business/media/viacom-and...

lostcolony4y ago

Not sure if prior art is relevant, per se (usually quoted in the context of patent law but IANAL), but, certainly, any copyright has expired by this point.

blibble4y ago

I hope you did that from a burner account!

gtirloni4y ago

Exactly my thoughts. I'd not want to risk angering the Google gods with this experiment, considering your account can be disabled by yet another automated system with no way to contact a human.

rezonant4y ago

Remember, "there is no review for this restriction"

Crosseye_Jack4y ago

Well that MP3 the RIAA flagged was FULL of ones and zeros, so it only makes sense they would get flagged ;-)

kingcharles4y ago

Silence is copyrighted too:

https://en.wikipedia.org/wiki/4%E2%80%B233%E2%80%B3

a9h74j4y ago

Does this mean every binary file is now a derivative work?

gcanyon4y ago

That’s it, kids, the internet is now illegal — until we switch to a quaternary system using only 2’s and 3’s <taps forehead smartly>

a9h74j4y ago

Suspicious -- just when Google is announcing success with qubits and quantum computing.

kelnos4y ago

Wouldn't that still be binary, just using different symbols?

bl4ckneon4y ago

Does that mean someone copyrighted binary? :p I guess that means that this comment and everything digital is copyrighted.

jrockway4y ago

Technically any creative work you author is copyrighted, so your comment was copyrighted anyway.

Whether or not large tech companies will punish people for infringing upon your copyright is yet to be determined.

zapdrive4y ago

Maybe it's because you have "one" in the file name: "zeronewline".

knodi1234y ago

"A review cannot be requested". What a soulless suck machine.

JorgeGT4y ago

"In order to review, please reach the top of HN"

hlbjhblbljib4y ago

You do not have sufficient social credit to request a review, please become a better person and try again.

CJefferson4y ago

Turns out Google also hates 500, 174, 833, 285 and 302 (from generating files from -1000 to 1000).
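For reference, a test set like the one described (one file per integer from -1000 to 1000) could be produced with a few lines of Python. The `<n>.txt` naming and the absence of a trailing newline are assumptions; the comment does not specify either.

```python
from pathlib import Path
import tempfile


def write_number_files(directory: Path, low: int = -1000, high: int = 1000) -> int:
    """Create one file per integer in [low, high]; return how many were written."""
    count = 0
    for n in range(low, high + 1):
        # Assumed convention: file "<n>.txt" contains the decimal digits of n,
        # with no trailing newline.
        (directory / f"{n}.txt").write_bytes(str(n).encode("ascii"))
        count += 1
    return count


if __name__ == "__main__":
    out_dir = Path(tempfile.mkdtemp(prefix="numbers_"))
    print(write_number_files(out_dir), "files written to", out_dir)
```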

apocalyptic0n34y ago

This is relatively new then. As part of some testing last summer for how to handle a migration from one Drive to another, I generated a folder that had I think 10k files in it. They were all named 1.txt, 75.txt, 8492.txt and their contents matched the filename (i.e. 1, 75, 8492). They never got flagged for me. That was for a Google Workspace account, though, and they were deleted a few days later once I was done with the tests.

tyingq4y ago

I did notice it flagged "1", "1\n", but not "1\r\n". Then, it didn't flag "0", but did flag "0\n". So line endings seem to matter as well, but not in some consistent way.

CJefferson4y ago

Another bunch of numbers got flagged (186, 451, 336, 173, 266). I deleted the experiment, just in case I got my account deleted for too many naughty numbers.

Mathnerd3144y ago

That probably looks even more suspicious.

tyingq4y ago

That's interesting. 500 and 302 popped into my head as common HTTP status codes. The others don't seem notable to me.

duxup4y ago

“A review cannot be requested for this restriction.”

Absolute madness cannot be reviewed…

Reason cannot be applied.

kingcharles4y ago

Things are getting out of control. Reason has left the building.

Here for instance is a clunker I've had from Cash App support (I'm disputing a package I never received which was returned to the merchant and the merchant won't answer any communications):

"M.J. from Cash Support once more. To be clear, I am not misunderstanding the location of the package at this point, and understand that it has been returned to the sender now.

...

As I mentioned in my previous message, should FedEx update their delivery information to indicate a non-delivered status, we can then process a dispute on your behalf. You can reply back to me here directly should you file the missing package claim with FedEx, they conduct their investigation, and update the package status to reflect non-delivery. At that point we can then move forward on getting a dispute processed for you."

LOL. They know the package has been delivered back to the merchant but they still want me to file a missing package claim for it. The absurdity of the situation is lost on them. All these large companies hire now is soulless drones with no mind of their own.

"Follow the script on the screen, never deviate, even in the face of absurdity. Forever onwards, loyal drone."

rtsil4y ago

It's terrifying, to be honest.

melissalobos4y ago

It may take some time for it to index and check the file. YouTube, for example, takes ages to process video, and I would imagine Google Drive is lower priority. Check back in a week or two and see if it changes.

folbec4y ago

My bet is someone did a DMCA claim on a file containing 1, either by mistake or as a joke.

Then Automated Stupidity took over.

TillE4y ago

I'd guess some giant folder of mostly-copyrighted material, which happened to include those files for some reason.

a_f4y ago

metadata match on an album track number?

ImprovedSilence4y ago

Na, it’s gotta be something dumb on the big G’s part. I bet their unit tests for their DMCA search have test cases with simple files like 1 and 0, and those cases are just put into the “copyright” database.

rapnie4y ago

It's AI. Automated Ineptitude.

DebtDeflation4y ago

"A review can not be requested for this restriction."

avisser4y ago

I wonder if this is an incredible hash collision. hash("1") == hash(disney_movie.mp4)

xigoi4y ago

Not the case, since it also happens with other numbers.

rezonant4y ago

Well, as it turns out, hash("0") == hash(Pirates of the Caribbean) as well.

Not sure how Disney is finding these.

p1mrx4y ago

Assuming it's a cryptographic hash function, that sort of collision just never happens.
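To make the exact-hash idea concrete: under a cryptographic hash such as SHA-256 (chosen here only for illustration; nothing in the thread establishes what Drive actually uses), the three byte sequences tested earlier have unrelated digests, so an exact-match blocklist would need a separate entry for each one.

```python
import hashlib

# The three contents tried earlier in the thread.
VARIANTS = [b"1", b"1\n", b"1\r\n"]


def digests(payloads):
    """Return the SHA-256 hex digest of each payload."""
    return [hashlib.sha256(p).hexdigest() for p in payloads]


if __name__ == "__main__":
    for payload, digest in zip(VARIANTS, digests(VARIANTS)):
        print(f"{payload!r:>10} -> {digest}")
```

Under that assumption, "1" and "1\n" both being flagged while "1\r\n" is not would look more like several independent blocklist entries than like one unlucky collision.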

askvictor4y ago

Well, it has to happen at some point. Exceedingly unlikely, but never say never.

account424y ago

What makes you think that this would use a cryptographic hash instead of a perceptual hash?

hderms4y ago

Birthday paradox could play a role if there's enough content out there
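A rough birthday-bound estimate shows how "never happens" and "has to happen at some point" relate. For n uniformly random values of a b-bit hash, the chance of at least one collision is approximately 1 - exp(-n(n-1)/2^(b+1)). The item count below is an illustrative assumption, not a figure about Google's corpus, and the uniformity assumption would not hold for a perceptual hash.

```python
import math


def collision_probability(n_items: int, hash_bits: int) -> float:
    """Birthday-bound approximation: P(at least one collision) among n_items
    uniformly random hash values of hash_bits bits."""
    exponent = -n_items * (n_items - 1) / 2 ** (hash_bits + 1)
    # -expm1(x) computes 1 - exp(x) without losing precision for tiny x.
    return -math.expm1(exponent)


if __name__ == "__main__":
    for bits in (32, 64, 256):
        # 10**12 items is an illustrative assumption.
        print(bits, collision_probability(10**12, bits))
```

With these assumed numbers a 32-bit or 64-bit hash would almost certainly collide somewhere, while a 256-bit hash leaves a probability so small that "never" is a fair summary.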

kgwxd4y ago

Did you do that with a throwaway account? You'd be playing with fire getting a bunch of files flagged on your primary account.

tyingq4y ago

Nope, living dangerously :)

ezoe4y ago

Think of it this way. If Google is this dumb, any account can be flagged for no reason so it doesn't increase the risk even further than it currently is.

discordance4y ago

Make sure to encrypt your 0’s and 1’s

melenaboija4y ago

I really want to think there are few people left out there leaving their 0’s and 1’s unencrypted in the cloud. I hope this post gets enough hype to solve the problem.

vaillancourtmax4y ago

> I really want to think there are few people left out there leaving their 0’s and 1’s unencrypted in the cloud

This describes most everybody.

mr_toad4y ago

Just encrypt the 1s. The 0s are nothing.

mooman2194y ago

It looks like those files are marked as public. Does it trigger for you on files not being shared?

tyingq4y ago

I didn't try that. I marked them public because I assumed that's what triggers the scan.

reincarnate0x144y ago

AFAIK the copyright scan only happens on files shared from certain types of Drive accounts, the assumption presumably being that while Google doesn't know or care if you have rights to possess data, they can be reasonably sure you wouldn't be using Drive links as the means of distributing many well-known files (video, music, software, mostly) if you had rights to do so.

How far Artificial Stupidity runs into the weeds from that basically sound beginning appears to be "still farther."

FYI Drive also does scans (configurable on business accounts) for data such as likely accidental PII disclosures on shared data, and anti-malware checks on at least Windows and Mac executables, installers, and dmgs.

obmelvin4y ago

Does it just prevent you from sharing the files or does it prevent you from accessing the file as well?

can16358p4y ago

Does it flag comments for copyright infringement?

What if I comment on some file with some copyrighted content in the text, just implying something about that IP, with the copyrighted text in my comment? How can this be infringement?

tyingq4y ago

No idea. I was just trying to guess at why it might flag a single byte file as a copyright infringement.

pyuser5834y ago

Clearly a violation of U2’s copyright.

rst4y ago

The filename is "output04.txt".

hulitu4y ago

They changed their algorithm. Now it will flag files full of 0s.

iameli4y ago

All software has bugs; I'm not mad at all that this silly test case was flagged incorrectly. The truly infuriating part is "A review cannot be requested for this restriction."

Translation: "We have no idea if you actually own this content or not, but it would be _way too expensive_ for us to find out for sure! So you're out of luck, but don't worry — it's all worth it so we can make sure children can't stream Marvel movies from Google Drive! Thank you for your contributions to Disney+'s bottom line."

PostOnce4y ago

Disney robbed us, our children, their children, and possibly generations beyond that with their more-than-a-century copyright terms.

I thought about posting this comment the other day and decided not to, but your mention of Disney+ stirred the idea in me again.

We have so much modern media about Dracula, Sherlock, Cthulhu, etc, a thousand flowers bloom... new movies, new games, new art of all kinds.

Disney & friends stole that from us. We won't have a million new takes on (for example) The Hobbit for decades because of them.

We have copyright terms of up to 120 years... stuff like Pong was made before I was born and won't be public domain until long after I'm dead.

Disney kills culture by ensuring that by the time the copyright expires, no one cares anymore, because no one was exposed to it in the many decades after its initial-release profitability (think abandonware, not-in-print books, etc). I think this is true for 99.999% of all works, not the outliers that the corporation milked for a century or more.

josho4y ago

Really great points.

It also manifests in other strange ways that I'd summarize as killing our culture. Think of nursery songs as an example. Recall Little Miss Muffet sitting on a tuffet eating her curds and whey. I mean what is a tuffet, who eats curds and whey? The reason we don't have nursery rhymes in present day is that any of those songs are wrapped up in copyright and can't legally be shared.

The only saving grace about your Pong example is that game mechanics aren't copyrightable, so we thankfully have countless clones of Pong to play. Interestingly the same doesn't apply to Tetris, for some reason EA seems to have been able to succeed in largely removing Tetris clones.

https://www.gamedesigning.org/gaming/copyright/

Sebguer4y ago

Here's a case on why Tetris is not copied as easily: https://publicknowledge.org/tetris-copyright-decision-shows-...

Also, game mechanics can't be copyrighted but they can be patented: https://www.gamesindustry.biz/articles/2021-02-08-warner-bro...

oehpr4y ago

There was an incident with Games Workshop and fan animation. Games Workshop has set up their own animation studio and decided to go hard against the fan community that grew around the setting.

The whole incident made me mad, sure. But what it really made me feel was disenfranchised under the regime we live, as it is.

We... are not allowed to LIKE things, unless we do so under a capricious megacorp's terms. And whatever release valves our society has are guarded behind financial, logistical, and legal barriers that are just inaccessible to all but the largest fish with the most to lose. The fact that you like something is and can only be, as far as our culture is concerned, an asset to some corp's bottom line. That's it. You're chattel and chattel only.

Sit there

Consume passively

Do not do anything.

Every time I see a thing that I like, this thought just stews in the back of my mind. Fandoms are cattle pens. Liking things is a mistake.

MugaSofer4y ago

Kind of funny that you mentioned the Hobbit, given that half the fantasy genre is takes on LotR with the serial numbers filed off. So people have found some partial workarounds.

op00to4y ago

Man, what a bummer. People have to come up with new ideas rather than rehashing old ones. How will we ever stay entertained?

mahogany4y ago

Except... Disney heavily relies on rehashing old ideas from the public domain. That same public domain that they fight against. If the Brothers Grimm were still under copyright, would Disney even have been started?

For example, you may want to take a look at: https://en.wikipedia.org/wiki/List_of_Disney_animated_films_...

andybak4y ago

That's not how culture works. Nothing is truly new and most creative works have been variations on those that precede them. The obsession with originality is remarkably recent and doesn't withstand much scrutiny.

rexreed4y ago

No new idea emerges in a vacuum. Every idea builds upon other ideas. Ever heard of the phrase "on the shoulders of giants"?

heavyset_go4y ago

It gets boring re-inventing the wheel over and over again.

JamesBarney4y ago

It'd be nice if the law had an escrow appeal process. Alleged violator now posts a $100 escrow, now accuser has to do the same. Then Google reviews it, makes a decision, and loser has to pay for it.

amne4y ago

It should be the other way around. The claimant must "bet" $100 that he owns the copyright. Then the defendant can call the bluff and say "I raise $100000 that you do not in fact own this output04.txt file with a 1 in it". If the claimant still thinks he can win he can call the $100k and prove ownership. Otherwise the defendant just made $100. How cool would that be?

jiggawatts4y ago

An observation made by shrewd businesspeople throughout history is that you can only trust money. No amount of words, documents, statements, etc... matter unless someone is willing to put up real cash. If there are no consequences, then by definition misdeeds aren't punished and will be effectively incentivised.

E.g.: You can trust a legally enforced warranty with full refunds guaranteed by the government, because it costs real money to the manufacturer. You can't trust a "Best Quality!" sticker. It basically costs nothing. It's just words.

Copyright protection laws are the same kind of thing. While the marginal cost of enforcement is zero, there is similarly zero incentive to do it correctly and respectfully of the law.

If there were enforced financial penalties for each screw-up, then errors like this would be ironed out very quickly.

No penalty? No bug fixing!

kofejnik4y ago

Bloody brilliant, and of course will never happen

Wicher4y ago

> Then Google reviews it, makes a decision, and loser has to pay for it.

I'm afraid they'll have incentives to automate that review, and then simply repeat that you can't appeal. Now you still can't access your file AND you're out $100 :-/

withinboredom4y ago

Sounds like some shenanigans you’d see on a blockchain. Though, if a blockchain did something like that to reverse a transaction, that’d be amazing.

vidarh4y ago

A legal requirement to provide an appeals process for automated decisions would be a good step.

Many places have restrictions like that for limited things like loan decisions, but it's about time to start forcing companies to provide a manual appeals process for other types of decisions that can significantly affect people.

raxxorrax4y ago

A legal requirement that disallows removing content until the claim has been proved would be sensible.

eterm4y ago

Not just own, but you can't even license the use of copyrighted works because even if you were somehow licensed the automatons will take over and you'll get flagged off the internet anyway.

We've gone from copyright as a mechanism for sharing works and licensing others to a situation where there are the in-group, the big media corporations who are allowed to license and remix content, and a sub-class who essentially are not.

kaetemi4y ago

So Google Drive is not an option for safely storing documents that you don't want to lose. And by extension, Google Docs is equally dangerous.

jfoster4y ago

Yeah, based on this, any GSuite presentation containing a logo seems like it might meet Google's criteria for getting blocked.

yeetaccount44y ago

Fuck that, you’re the big game in town, you get the big bitches. Fix your shit.

ChicagoBoy114y ago

I experienced something similar building an internal tool on GSuite. I had a large file with sequences of 9 digit numbers specific to our use-case, all tied to names of people (employees). Whelp, at one point the tool I was working on stopped working, and it was flagged as apparently containing social security numbers (which I suppose matched the character length).

Whelp, on the admin panel, you can get a report of those files, and then mark it as a false positive. Which I did. But then nothing happened, and nothing changed. It was no use.

The hilarious bit: It did, of course, allow me to make a copy of the file in question, and then just point the resource I was building to the new file, which was exactly the same. Weeks later... so far, so good.
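The false positive described here is easy to reproduce with a naive detector. The sketch below is not Google's DLP logic (the thread gives no details of how that works); it simply assumes a rule that treats any standalone 9-digit run, with or without the usual 3-2-4 dashes, as a possible SSN, and the sample records are made up.

```python
import re

# Assumed rule: exactly nine digits, optionally dashed in the 3-2-4 SSN
# layout, bounded by non-word characters on both sides.
NAIVE_SSN = re.compile(r"\b\d{3}-?\d{2}-?\d{4}\b")


def flag_possible_ssns(text: str) -> list[str]:
    """Return every substring the naive rule would flag."""
    return NAIVE_SSN.findall(text)


if __name__ == "__main__":
    # Made-up internal IDs that merely happen to be nine digits long.
    rows = "Alice,100200300\nBob,123-45-6789\nCarol,42"
    print(flag_possible_ssns(rows))
```

A length-only rule like this cannot tell an internal 9-digit identifier from an SSN, which is consistent with the guess above that the match was based on character length.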

spicybright4y ago

That's ridiculous. Storing social security numbers is necessary for lots of businesses.

Imagine your filing cabinet not letting you file employment forms with a SS# on them.

It's the sticky note password problem all over again...

015a4y ago

This isn't some default thing on G-Suite; it's the DLP setting which enterprises can elect to toggle on, if they pay for the most expensive G-Suite plan. It also, afaik, only applies to shared files.

It does not work, in such a myriad of ways that I'd be blown away if it wasn't just some summer intern's project three years ago and it hasn't been touched since. But it does check boxes for audits. And enterprises don't care about actual security, they just want to check boxes for audits.

spicybright4y ago

That actually changes the entire context of this. Sorry, I don't think I read closely enough to the source article.

driverdan4y ago

> That's ridiculous. Storing social security numbers is necessary for lots of businesses.

They should absolutely not be stored in a GSuite document. SSNs should be treated more securely than credit card numbers.

spicybright4y ago

Should, yes, but in practice not really. I'm talking more about employee information.

You need it for tax forms, background checks, citizenship queries, sometimes bank information, etc.

So your options are:

1. Store them locally on a computer. Typically on some old windows 7 machine in the corner that hasn't been updated in some time.

2. Store documents physically. These will then be scanned onto random computers belonging to whoever needs them, to be sent through probably insecure mail servers.

Or worse, your boss taking a picture of your form and sending it to people that way, leaving the form on their phone.

3. Some other online storage like whatever M$ is offering

4. Use google and somehow store SS#'s somewhere less secure, or obfuscate them in a way no one but a few people will understand and hope they don't block any other files you upload.

Businesses have been deciding how to manage these things in whatever way works best for them since the start. Having Google force you into procedures that might not work for your use case is annoying at best. And they obviously don't know best if they have issues like in OP's post.

It's like they take away your gun so you can't shoot yourself in the foot, then fire it at things they think are problems, hoping not to hit your feet.

And what's a few toes to a company the size of google?

madaxe_again4y ago

Why? You can figure out someone’s SSN if you know where they’re from and what their birthday is. They’re highly predictable - particularly given that you’ll often see the last four digits as the “obfuscated” version in lists etc, and the first three are state, and the next two are based on when they were born. Put the two together, and job done. They’re effectively public data. The fact that they’re used as a magic security number/identifier is frankly mind-boggling.

Citation: https://www.pnas.org/content/106/27/10975

bonzini4y ago

Which is a problem of its own, since they're effectively usernames.

Secure usernames that have no corresponding password are already an oxymoron; that's what credit card numbers used to be, hence the introduction of the CVV, 3DSecure and so on; but at least a credit card can be blocked with relative ease. But SSNs are secure unchangeable usernames, which makes even less sense.

Do any countries other than the US have such an abomination, where you can figure out the SSN of someone and ruin their life?

jsymolon4y ago

> SSNs should be treated ...

As someone who dealt with identity theft, SSN should only be collected if contact with the SSA is needed. I.E. payment of social security benefits.

Any and ALL other "ID", nope. Use some other number.

nathanaldensr4y ago

Thanks to Equifax, SSNs are effectively public anyway.

bob10294y ago

> SSNs should be treated more securely than credit card numbers.

I disagree. Both SSNs and credit card numbers should be treated with equal consideration.

Establishing arbitrary classes of PII protection based upon perceived severity of compromise is a bad strategy. In the market we work in, you either get this 100% right or you don't get it right at all. There is no happy middle-ground when you are selling software to banks or other such organizations. No one is interested in "mostly" correct when it comes to PII they are responsible for protecting.

vidarh4y ago

Not just that, but official US government websites publish plenty of data with SSNs of dead people in them. For my genealogy research I have plenty of SSNs sitting around for relatives who emigrated to the US because they've happened to show up in searches.

E.g. here[1] is one of Ancestry's many catalogs based on US government data dumps that even allows you to explicitly search by SSN.

So even if one were to argue that SSNs of living people shouldn't be in GSuite, and there may be many good arguments for that, there are vast quantities of SSNs out there that are explicitly and openly shared by the government. If Google starts blocking me from accessing any of my files with notes about family history, I'll be pissed off.

I guess it's time to move off Google Docs too (I've largely left Gmail)

[1] https://www.ancestry.com/search/collections/60901/

slig4y ago

Google is working very hard to make everyone drop their shitty services.

kwhitefoot4y ago

Is this a tool that you pay money for? Sounds like it fails the 'fitness for purpose' test.

Kim_Bruning4y ago

This is why running automated filters on data is never a good idea, and should never be accepted (at least for enterprise applications, but really for anything).

Arnavion4y ago

This is another case for only pushing encrypted files to storage hosts, unless it's against Google Drive's TOS or something. Has anyone tried it? Did Google complain?

aspenmayer4y ago

If you don’t know about this feature of rclone, you’re in for a treat! Combined with unionfs or mergerfs, or the builtin union function, you can have cached local decrypted mounts of locally encrypted remote mounted directories on gdrive, mirrored to a local folder.

https://rclone.org/crypt/

https://rclone.org/union/

Edit: there is also the (sorta abandoned, or finished/complete depending on your pov) plexdrive project, which does a bit of Plex server specific opinionated stuff and mounts gdrive read only, but which may help with reducing API quota usage according to some reports. I’ve never had any issues with the quotas that I’m aware of, but I did have to tune my settings a fair bit to get it dialed in on a somewhat memory constrained vps.

https://github.com/plexdrive/plexdrive

anon90014y ago

Tarsnap is entirely encrypted and the service provider can't see the keys. They run it on EC2 and S3: https://www.tarsnap.com/infrastructure.html

xcjs4y ago

I have an Enterprise Gsuite account with 50 TB of encrypted data for around 5 years (this was of course prior to the rebranding). I've had no complaints from Google so far.

aspenmayer4y ago

You use rclone, or another stack? I’m always trying new tools in this space.

CPAhem4y ago

Syncdocs is also an easy end-to-end encryption/sync app for Google Drive

https://syncdocs.com

Gigachad4y ago

Problem is it makes the apps / integrations useless. I use google drive primarily from my ipad and encrypted files would mean I can't just "save to files" and have it drop in to google drive.

stevens374y ago

yes

openssl enc -e -aes-128-cbc -pbkdf2 -iter 123456 -in "${1}" -out "${1}.cr" -k <password>

version_five4y ago

"A review cannot be requested for this restriction"

ML enforcing rules is bad enough, but not allowing false positives to be corrected is ridiculous. This is why I would never consider g-suite for any business application.

Otoh, I think there is a legitimate business to be made helping small businesses and individuals secure themselves against arbitrary behavior from big tech. This kind of thing can have serious consequences (imagine if it was something of real substance that got restricted without recourse) and people need to consider hardening their activities against google et al

ehnto4y ago

Reminds me of that 70s IBM presentation quote that surfaced recently, "A computer can never be held accountable, therefore a computer must never make a management decision".

It's one thing to have a computer flag issues, another to make it responsible for taking action and in this case, making a decision final. Google continues to set poor examples with irresponsible implementations of machine learning. With no accountability, no recourse, no humans to talk to.

sb0574y ago

I think that quote should be flipped.

"A computer can never be held accountable, therefore a computer must always make management decisions."

sandworm1014y ago

Or..

"A computer can never be held accountable for decisions, therefore all computer decisions are management decisions."

queuebert4y ago

That misunderstanding could explain the last several decades of corporate management.

beebeepka4y ago

I thought the same as that's our current reality. "It wasn't us, it was the computer!" Things are only going to get worse

LudwigNagasena4y ago

A program can never be held accountable, but the person (or the whole company, or both) who decided that it should make management decisions can.

ehnto4y ago

It's possible to read the original quote as a warning of exactly that. We should all be very wary of letting our code do the walking because the fault should realistically lie with us, but it's as if the software industry feels invincible, and it's not hard to see why.

bobm_kite94y ago

Yes! This is one thing humans are still much better at than AI: taking blame.

giaour4y ago

The best of both worlds is when you have a human supervise a fully automated process. You can pay them peanuts and still use them as liability sponges.

(The term "liability sponge" is shamelessly stolen from https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2757236 .)

dylan6044y ago

Computer: "It's not my fault". The sunspots made me do it.

Sunspots are computers' devil.

nefitty4y ago

AI-aided management decisions can't come soon enough. Decision Model and Notation made big headway on this, but I don't see it discussed. They're fancy decision trees but can handle complex factors and are designed to be intuitive to reason about.

patrick4514y ago

Now, imagine letting a computer decide how to drive a 15 ton truck down the freeway at 75 mph.

jrockway4y ago

How long until cloud providers are forced to scan block devices, so you can't even self-host a file erroneously flagged by the ML? (Actually, how long until this is integrated with the "security engine" in CPUs, so you can't even hide with encryption? Everyone is talking about on-device voice-recognition and CSAM recognition, but no doubt the copyright lobby is going to push hard for this as well.)

matheusmoreira4y ago

It hurts me to even think about this. We're headed towards such a bleak future. The only solution I can think of is to kill copyright in order to save computing freedom. Otherwise our computers will eventually no longer be ours.

belter4y ago

We are in the middle of the hostilities, and we can always appreciate if more young idealists, as well as practical and hands-on militias want to join the revolution:

"The Coming War on General Computation":

http://opentranscripts.org/transcript/coming-war-general-com...

version_five4y ago

I think there needs to be more work on open source hardware (and the associated fab capabilities to make what people want) in order to preserve computing freedom. I have no idea how to actually do this, but it feels like it should be prioritized

josephcsible4y ago

Do we need to kill copyright? Couldn't we just kill vigilante enforcement of it? Maybe we should make it the law that if you see a copyright violation, your only legal options are (1) nicely ask the infringer to stop voluntarily, or (2) sue.

bostik4y ago

I don't think we need to kill copyright, but we sure as hell need to control how it's used. To start with, we need to spay and neuter the out of control copyright enforcers, and especially every form of copyfraud.

My previous take on the subject: https://news.ycombinator.com/item?id=22488496

landemva4y ago

The surface is larger than just copyright and includes anything the government can enforce in the name of protecting or preventing ... [children, terrorists, money laundering].

Lobbyists are probably wondering why they shouldn't slip this into an appropriations bill and make MS put it in the desktop.

kbrannigan4y ago

Load up on usb drives. I mean really when did we suddenly become dependent on cloud services? It's a phenomenon that's less than 15 years old.

narrator4y ago

Gab.com is how to run a service that gets kicked off all cloud providers. They had to build their own bare metal infrastructure. That's right! They use zero cloud infrastructure. No AWS, etc. They even have release announcements about how they've upgraded or bought more servers so things should go faster and they can enable more features and so forth.

aasasd4y ago

Wow, who would've thought that you can run stuff on servers.

withinboredom4y ago

I have an email from myself to myself sent in 2007 in GMail. I can't open the attachment because "it is malicious." It isn't, it's just a schematic, but there's no way to tell GMail to let me download it anyway.

vidarh4y ago

Have you tried downloading it via IMAP?

rsync4y ago

"How long until cloud providers are forced to scan block devices, so you can't even self-host a file erroneously flagged by the ML?"

If you are using a cloud provider you are not self-hosting.

Self hosting means you own the machine.

Johnny5554y ago

If Amazon didn't offer EBS block device encryption (with a key that's ostensibly only accessible to the customer), then customers would just use full disk encryption instead.

blibble4y ago

client side rot13 to the rescue

(... I have actually used this in the past to work around my employer's moronic proxy)
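For anyone who hasn't met it: rot13 just rotates each letter 13 places, so it can slip past naive pattern matching but it is not encryption. A minimal Python sketch using the standard library's `rot_13` codec (the helper name `rot13` and the sample string are only illustrations):

```python
import codecs


def rot13(text: str) -> str:
    """Rotate ASCII letters by 13 places; other characters pass through."""
    return codecs.encode(text, "rot_13")


message = "Hello, proxy"
scrambled = rot13(message)
print(scrambled)         # Uryyb, cebkl
print(rot13(scrambled))  # applying it twice restores: Hello, proxy
```

Because the transform is its own inverse, the same function both scrambles and unscrambles, which is also why it offers no real secrecy.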

tempodox4y ago

Given that going viral on Twitter is the only functional help desk for such situations, “Tweet storm as a service” would be a promising proposition.

ThrustVectoring4y ago

Companies can say that they won't have an actual human look into your issues, but there's a person that handles letters that might turn into lawsuits and other legal issues, and you can always ensure that they get contacted instead. Certified mail is pretty quick to write and costs like $3 to send, and you can look up the company's agent for service of process.

It might not do anything - writing a letter that sufficiently implies that you are actually collecting documentation and preparing for a lawsuit is an actual skill, and your demands may be unreasonable for them to handle - but you will get an actual factual human being to at least start reading what you've written. And as a bonus, their KPI is in terms of "number of incidents per year" rather than "number of resolved tickets per day".

kingcharles4y ago

I've not had much luck with that option, you need a large megaphone for it in most circumstances.

I've had better luck lately just using contact services to find the personal cellphone and email address of high-ranking employees and contacting them to get escalation.

klyrs4y ago

You don't think that would automatically get shut down for spam-like activity?

kadoban4y ago

Most of the spam bots don't, they'd have a good shot if they don't put all their eggs in one basket.

friedman234y ago

"The trial" by Kafka needs to be required reading for everyone involved in implementing systems like this.

capableweb4y ago

Most likely the people actually implementing this system did raise potential issues to their supervisors. It's not the implementers who need more education, it's the people who direct the implementers who need better education in how to design systems.

But since it probably doesn't affect the bottom-line, it's unlikely to actually happen.

miohtama4y ago

There probably was a risk discussion at some point during feature implementation. Lawyers got involved and the discussion likely went like this:

1) DMCA Safe Harbor gives us, Google, unlimited protection

2) What are the users going to do in any case? Sue us? Migrate to Microsoft Office 365? The minuscule amount of issues caused for 0.1% of users is not going to hit our bottom line and the users cannot damage us.

3) We offer key account manager services for organizations large enough that could cause stir (Spotify size)

kwhitefoot4y ago

And RMS' Right to Read: https://www.gnu.org/philosophy/right-to-read.en.html

newsbinator4y ago

Or "Catch-22" by Heller: https://en.wikipedia.org/wiki/Catch-22

robocat4y ago

A wonderful comedy. I presumed it was American literature, so I had avoided reading it because that isn’t a category that interests me by default, but decided to read it and I had a good laugh. Easy to pick up cheaply second hand.

I don’t really think the book is relevant to this discussion though.

tinus_hn4y ago

Really? They don’t need any more ideas!

pdonis4y ago

> people need to consider hardening their activities against google et al

Isn't your own solution--that you would never consider g-suite for any business application--the obvious simplest way to "harden" against Google?

pokot04y ago

I agree with you and in general the move from owning software to accessing a service has been detrimental for the end user (more costs, more problems, less control).

But the restriction here is most likely just the inability to share (I would guess publicly). I don't believe it prevents you from accessing your file.

gopher_space4y ago

A public share might be the entire point of using a service.

dane-pgp4y ago

> ML enforcing rules is bad enough, not allowing false positives to be corrected is ridiculous.

And potentially illegal. According to Article 22 of the GDPR:

"The data subject shall have the right not to be subject to a decision based solely on automated processing, including profiling, which produces legal effects concerning him or her or similarly significantly affects him or her."

I think that being accused of copyright infringement (and having your free speech rights curtailed) should count as "similarly significant" to a legal effect on someone.

bryanrasmussen4y ago

Free speech does not exist in the same way in the EU, where the GDPR exists, as it does in the U.S.

on edit: clarification as I seem to have offended some people, there is no right to free speech in Europe similar to the right in the U.S. Bill of Rights, the closest is the following https://fra.europa.eu/en/eu-charter/article/11-freedom-expre...

hope this clarifies it for people.

on edit 2: here you can get a country by country overview https://en.wikipedia.org/wiki/Freedom_of_speech_by_country#E...

robbedpeter4y ago

Automated tools that overwhelm these triggers should be deployed. Get everything flagged and force them to put a human in the loop and fix the system. Make it technically impossible to appease the entertainment mafia and batshit regulations.

hetspookjee4y ago

I think the most likely outcome is much like the services they offer to help with settling airplane ticket compensation. I presume most of them are actually in service of the airplane operators.

Imagine a service that does indeed offer to help small-time companies resolve their issues with big tech. The business case will eventually be to just buy them up and slowly drain their effectiveness, while upping the rate tremendously and being able to limit the compensation, if any, that big tech dishes out. Pardon my cynicism, but I sometimes look at the airline industry as the equilibrium of a race to the bottom. With big tech we’re not there yet.

Volker_W4y ago

> I presume most of them are actually in service of the airplane operators.

why?

karsinkk4y ago

I've noticed the same behavior with the recommendation engine from Reddit and Instagram as well. While they don't totally block providing feedback on recommended posts that I'm actually not interested in, the UI flow to submit the feedback is difficult/confusing enough that I've just stopped doing it altogether. Could this be because retraining whatever model that they've built is difficult/expensive with new feedback?

wolpoli4y ago

A benign explanation would be that the feedback feature is really designed as a relief valve for the most frustrated users to feel better. Moderately frustrated users will find it takes too many clicks to provide feedback and just scroll to the next item.

jjcon4y ago

I really don't think their copyright checks are run by anything in the ML domain.

blunte4y ago

Ironically, it may end up being one of these "tiny" scenarios which finally does Google in.

When trying to illustrate a problem or bug, one of the typically time consuming challenges is reducing the scenario to the minimal case which illustrates the problem. So thank you, @emilyldolson!

Aside from an empty file, you cannot reduce this any further. It brings to light, in simple terms that non-techies can understand, how absurd the "ML to solve everything" promise is -- and even more so how wilfully negligent companies are in providing NO human intervention or support when the machines break down.

quickthrower24y ago

It's a real motivator to leave Google and the MAG cloud in general. It has at least reminded me again to do regular Google takeouts.

trhway4y ago

> "tiny" scenarios which finally does Google in

Danger of tiny scenarios - I expect that Google like any BigCo will try files containing only 2, 3, 7 - no bug, ok, and then push the fix like this

if (l = read(file) ; l == "1") ... else

cgrealy4y ago

Given it's almost certainly a ruleset generated by an ml agent, it's more likely to be a change in the training data.

bonzini4y ago

Or just don't flag any file below 100 bytes.
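Nobody outside Google knows how the matcher actually works, so the following is purely a hypothetical sketch. It assumes flagging is a lookup of a content hash in a blocklist, and that a bad entry for the one-byte file "1" found its way into that list. Under those assumptions, the 100-byte floor suggested above would hide the symptom. `BLOCKLIST`, `MIN_SIZE`, `digest`, and `is_flagged` are made-up names, not anything from a real system.

```python
import hashlib

MIN_SIZE = 100  # hypothetical floor, taken from the "100 bytes" suggestion


def digest(data: bytes) -> str:
    """Hex SHA-256 of the raw file contents."""
    return hashlib.sha256(data).hexdigest()


# Hypothetical blocklist that wrongly contains the hash of a one-byte file,
# alongside one "legitimate" entry for a larger file.
BLOCKLIST = {digest(b"1"), digest(b"x" * 500)}


def is_flagged(data: bytes, min_size: int = 0) -> bool:
    """Flag content whose hash is blocklisted, skipping files under min_size."""
    if len(data) < min_size:
        return False
    return digest(data) in BLOCKLIST


print(is_flagged(b"1"))                           # True: the false positive
print(is_flagged(b"1", min_size=MIN_SIZE))        # False: masked by the floor
print(is_flagged(b"x" * 500, min_size=MIN_SIZE))  # True: larger files still match
```

Note that the floor only masks the bad entry; the blocklist itself is still wrong, which is roughly the worry about special-casing "1" raised a few comments up.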

choward4y ago

The fact that Google is scanning your files for "copyright infringement" is bad enough. They have no way of knowing that you don't legitimately own something. Then pair that with this example and if that isn't enough of a deal breaker for using Google drive I don't know what is.

russellbeattie4y ago

I had no idea that Drive isn't just a disk drive in the cloud. I've always treated it as such.

Do they all do this? OneDrive, Drop, etc.?

andrewxdiamond4y ago

I believe this is only for accounts with sharing on. Otherwise, there is no infringement

chickenmonkey4y ago

Exactly, I was surprised by the lack of outrage at this.

leokennis4y ago

15 years ago the first word that came to mind when thinking of Google was “magic”.

10 years ago “useful”.

These days it’s just “dread”.

bxparks4y ago

Maybe not "dread", but distrust with a tinge of sadness. It is surprising to see how much their executives are willing to destroy Google's reputation, in search of... I don't know... their next bonus and promotion? Seven to eight years ago, I was using probably 30-40 Google products and services. I used to recommend Google products to my friends and family without reservation. But over time, I have whittled down my dependency on Google to maybe 7-9 products/services. With the killing of Legacy GSuite, I'll be migrating to FastMail or something similar, and I'll be down to only a handful of Google products in a few months. I never thought that I would stop using Google Search and Chrome, but DDG and Firefox have been perfectly fine for me for several years now.

Gigachad4y ago

That's what happens when a product gets big and you have to start worrying about terrorists, pedophiles, and mass copyright crime on the platform. Hosts like Google have been held legally responsible for the content they host and have acted accordingly.

notyourwork4y ago

I think dread is a bit over dramatic. I still find utility in Google Maps and GMail. Some products in the Google domain are heading that way but I don't dread Google I treat them as a business with respect to my coupling to them on a case by case basis.

I used to be Google-first; now I look at all options and decide if it's worth coupling something else in my life to Google. In many cases it's not worth it or even required.

matheweis4y ago

> I still find utility in Google Maps and GMail.

Give it time... There are a lot of people feeling that dread about “gmail” now. Have you seen the recent threads about GSuite Legacy? Small businesses and families suddenly need to cough up hundreds of dollars per year or figure out how to migrate away from a product that originally marketed itself as free forever.

notyourwork4y ago

Sure, maybe they will and maybe they won't. Personally, I'll be happy to pay for it and have considered over the last few months a migration plan away to a paid subscription. It's just not on top of my todo list. If Google started asking me to pay, I'd pay because they have provided this service and it's pretty darn good if you ask me. How much I'd pay is an open question.

aendruk4y ago

Google Maps may not have progressed to “dread” yet, but it’s solidly at frustration/wariness as a result of its incessant attempts to manipulate me. It used to be just a tool, but it’s become an adversary.

fsflover4y ago

> I still find utility in Google Maps and GMail.

Concerning the maps, try https://openstreetmap.org

Concerning the GMail, see this: https://news.ycombinator.com/item?id=30051054

notyourwork4y ago

Thanks for sharing but I wasn't really trying to debate whether I do or do not find utility in these. I do and I'm fine using them. I'm also aware of the alternatives, I use Apple maps from time to time, I use DDG on occasion. Neither is of consequence to the premise in my original post.

accelbred4y ago

The Gmail app on Android also replaces all the links in your emails with Google tracking links which is not okay.

TT-3924y ago

"Thanks for helping google keep the web safe"

Interesting thing to add in there, how on earth does copyright stuff have anything to do with safety?

tyingq4y ago

Not that I agree with it, but here's the FBI view:

"Not only can the violation of intellectual property rights damage the economy, it also poses serious health and safety risks to consumers, and often times, it fuels global organized crime."

https://www.fbi.gov/investigate/white-collar-crime/piracy-ip...

No helpful detail on why it's not safe.

zerocrates4y ago

Since they went wide with "intellectual property rights" there, the references to health and safety are probably more in the realm of trademark and maybe patent... think counterfeit drugs.

You can probably gin up a copyright example from, I dunno, the DRM system on some medical device or something, though that's obviously not the real focus of their copyright enforcement work.

coliveira4y ago

But drug safety is not an issue of copyright, but of physical control of medications. You don't need to break the copyright of a drug to create and distribute fake medication.

EdwardDiego4y ago

Because the FBI might ask your government to enforce US copyright law and your overly enthusiastic leader sends helicopters filled with heavily armed Police officers to raid your house at dawn, perhaps?

https://www.nzherald.co.nz/nz/dotcom-wins-settlement-from-po...

thomond4y ago

That's the FBI's general view on IP rights infringement and covers more physical products. You can apply of that to file sharing.

mminer2374y ago

I assume that message is used for both trojans and copyrighted content and that there's quite the overlap there.

userbinator4y ago

That phrase is so disturbingly dystopian that I wonder if anyone at Google ever had that thought upon seeing it. Clearly not its author, but then again, I wonder if the people working there care about anything other than their own luxurious working conditions and the $$$...

hedora4y ago

Second offense, corporate security breaks your knuckles to prevent a third strike?

j0ba4y ago

Because they're doing God's work, and how dare you question their motives?

lhorie4y ago

I have a pet theory that all of these recent Google bloopers could be explained easily if you start from the assumption that Google internal incentives promote efforts to cut costs such as storage.

"Garbage" docs, inactive email accounts, less search results etc can all be reasonably explained by a desire to not spend money on storage for "low value" data (i.e. data that is unlikely to be accessed in a way that translates to profit for Google). Users, having been trained to rely on free services and the magic of search to summon stuff, have zero incentives to clean up their digital "pollution", and at some point, something's gotta give.

munificent4y ago

I don't think there's anything specific to Google. I think the chain of events was basically:

1. Storage is expensive, so software designers and users build tools and habits that are parsimonious with it. Programs would store files in carefully designed binary formats to save space because space was expensive. Users would periodically go through directories on our computers and manually delete old stuff. Apps required users to explicitly choose what to save.

2. Storage gets much cheaper.

3. Seeing that, companies like Google and others offer "unlimited storage" by projecting the observed user behavior from (1) onto the storage costs of (2).

4. But now users and app developers change their behavior since the incentive environment is now (3). Camera apps automatically save every photo you take. Phones record higher and higher resolution video. Users stop deleting anything and rely on search to wade through their sea of bits.

5. Companies now have to adapt to the reality of (4).

I don't think there was anything particularly nefarious or shitty on the part of any participant. It's just the nature of big complex iterated systems with emergent properties.

lhorie4y ago

> I don't think there was anything particularly nefarious or shitty on the part of any participant. It's just the nature of big complex iterated systems with emergent properties.

I find it interesting that one can draw some parallels to physical consumerism and its impact on ecology. We don't generally consider buying day-to-day stuff as nefarious either, until we pay attention to the aggregate impact of the entire supply chain machine, and then it's "80% fewer pollinators" this, and "donated clothes landfills" that. The big difference is that Google as an organization can make the call to - and follow through on - telling users directly to back off if users' consumption patterns themselves become a significant enough liability on the sheets.

Gigachad4y ago

They don't delete flagged content. I have had stuff be flagged and locked and then a while later it gets unblocked. There are much easier ways to prune inactive accounts than to pretend to copyright flag them.

withinboredom4y ago

> explained by a desire to not spend money on storage for "low value" data

I think this "just happens" when you reach a certain scale. For example, I was looking at our reporting at my job the other day and realized for 10k people a day, something wasn't working with our emails. We decided it was a "low priority" because it's "only" 10k people when we send over 30 million emails a day. I'm sure those 10,000 people (a small city's worth!) don't feel that way.
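To put rough numbers on that: a quick Python calculation, assuming (my simplification, not stated above) one email per affected person so the two figures can be compared directly.

```python
affected_per_day = 10_000       # people with a broken email, from the comment
sent_per_day = 30_000_000       # total emails sent per day, from the comment

rate = affected_per_day / sent_per_day
one_in = sent_per_day // affected_per_day

print(f"failure rate: {rate:.4%}")  # failure rate: 0.0333%
print(f"about 1 in {one_in:,}")     # about 1 in 3,000
```

A 0.03% failure rate looks like noise on a dashboard, yet the absolute count is still ten thousand people every day.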

onion2k4y ago

If Google has the copyright on "1", they only need to get "0" as well and they'll have everything.

tyingq4y ago

Already there, I made some test files. The ones with "1", "1\n", and "0\n" are all now flagged. https://news.ycombinator.com/item?id=30063319

So, "someone" has them copyrighted.
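If you want to reproduce this, the trailing newline matters, and text editors like to add one silently. Here is a small Python sketch that writes the byte-exact variants mentioned in this thread. The file names and the `write_cases` helper are arbitrary choices of mine, it writes to a temporary directory, and uploading is left to you.

```python
import tempfile
from pathlib import Path

# Byte-exact contents for each test case; the names are arbitrary.
CASES = {
    "one.txt": b"1",            # 1 byte
    "one_lf.txt": b"1\n",       # 2 bytes
    "one_crlf.txt": b"1\r\n",   # 3 bytes
    "zero.txt": b"0",           # 1 byte
    "zero_lf.txt": b"0\n",      # 2 bytes
}


def write_cases(directory: Path) -> dict[str, int]:
    """Write each case in binary mode and return the resulting file sizes."""
    sizes = {}
    for name, payload in CASES.items():
        path = directory / name
        path.write_bytes(payload)  # binary write: no newline translation
        sizes[name] = path.stat().st_size
    return sizes


with tempfile.TemporaryDirectory() as tmp:
    for name, size in write_cases(Path(tmp)).items():
        print(f"{name}: {size} byte(s)")
```

Checking the sizes afterwards is the easy way to confirm that nothing appended or converted a line ending behind your back.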

misnome4y ago

All copyright-infringing files are ~50% 1's, therefore there is a 50% chance of every file with a 1 in being copyright-infringing!

Statistics don't lie, which is presumably why google employs so many of them, to calculate these efficiencies.

denton-scratch4y ago

> Statistics don't lie

However statistics can be used to confuse.

If a file is 50% 1's, then a 1-digit file has a 50% chance of infringing. More than that, the chance of infringement grows pretty fast.

Also, if "1" is copyright, then a file with a "1" in it is infringing; there's no chance there. It's certainty.

gvb4y ago

0 = 1 - 1 so they have everything already!

smnrchrds4y ago

Only if they have patented -

wlesieutre4y ago

Not - as an abstract concept, but perhaps "a method and apparatus for subtracting numbers"

gvb4y ago

Point of order: you've changed the method of restriction from copyright to patent.

version_five4y ago

Or XOR

vmception4y ago

Here is a joke that relies on the assumption that the platform is suggesting they own the copyright, instead of someone else that isn't the user.

kingcharles4y ago

God, I'm glad I stored all my files using quantum superpositions.

bluecheese334y ago

https://www.smbc-comics.com/comic/2012-08-29

nocturnial4y ago

Just call google support... oh... wait... right...

I wonder how many ads we need to watch before google implements something even remotely similar to user support? How many billions are enough before we get support?

I know I'm overreacting but I'm getting tired of these articles. We all know that google is messed up (to put it lightly). Some people here don't think that's the case and that's fine. Other people, including me, don't find it surprising at all.

Post something about google killing cute kittens.

I wouldn't be surprised but I would be interested in that story.

verytrivial4y ago

A quick note to anyone working to reproduce this: the automated stupidity that caused this is of the same variety that will CANCEL YOUR GOOGLE ACCOUNT without recourse if your stats lean a certain way. Tread carefully.

mastazi4y ago

I remember when it was announced that this was going to be possible and people here on HN were defending Google's decision with comments along the lines of "this is fine, they're not reading your private files, they're just going to stop people that use Google Docs for distributing pirated content"

https://news.ycombinator.com/item?id=27858032

dmitrygr4y ago

"A review cannot be requested for this restriction"

I always did say that Franz Kafka never died. He is semi-retired working in google’s PM org, occasionally consulting for the UX teams as well.

jacquesm4y ago

Pretty weird that Google would be scanning files for copyright infringement in the first place, it's supposed to be a Drive not the enforcement arm of the copyright mafia.

Gigachad4y ago

It's not clear from this tweet but usually google drive only cares about shared files. Lots of people have copyrighted content in their private drives but until you make it public, it doesn't get flagged.

everyone4y ago

It's so dystopian / Kafkaesque it's like a parody.

"Thankyou for helping google keep the web safe"

followed by...

"A review cannot be requested for this restriction"

slig4y ago

Computer says no.

Qub3d4y ago

Always operate under the assumption that iCloud (Apple), Microsoft and Google will delete any/all of your data, with no notice, and for no reason.

Because they explicitly reserve the right to do so in their TOSes.

Not your computer, not your data etc.

(https://www.quentb.com/posts/diy-cloud-backup/)

quickthrower24y ago

Cloud drives are a cache. (Actually treat all storage as a cache, i.e. the data will be lost, it's just a matter of when.)

Qub3d4y ago

Good advice. This is why I encourage anyone with sensitive digital data (photos, important receipts, etc) to set up a 3-2-1 backup:

3 copies of the data,

2 of which are on different, local mediums (i.e. hard drive and a thumb drive)

1 of which is offsite (cloud storage, safe deposit box, salt mine, etc)
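The rule above is simple enough to check mechanically. A minimal Python sketch, reading it literally as: at least 3 copies, at least 2 local copies on different media, at least 1 offsite. The `Copy` class and `satisfies_321` function are hypothetical names for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Copy:
    medium: str    # e.g. "hard-drive", "thumb-drive", "cloud"
    offsite: bool  # True if stored away from the primary location


def satisfies_321(copies: list[Copy]) -> bool:
    """Check: 3+ copies, 2+ distinct local media, 1+ offsite copy."""
    local_media = {c.medium for c in copies if not c.offsite}
    has_offsite = any(c.offsite for c in copies)
    return len(copies) >= 3 and len(local_media) >= 2 and has_offsite


plan = [
    Copy("hard-drive", offsite=False),
    Copy("thumb-drive", offsite=False),
    Copy("cloud", offsite=True),
]
print(satisfies_321(plan))                           # True
print(satisfies_321([Copy("cloud", offsite=True)]))  # False: cloud-only
```

The cloud-only case failing is the point of the thread: a single provider, however large, counts as one copy.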

PragmaticPulp4y ago

To clarify: Google isn't deleting these files. They're just "restricted", which as far as I can tell means they can't be shared publicly. It's triggering the mechanism to prevent people from using Google Drive as a file sharing service for copyrighted works. I think owners can still access their own files just fine.

That said, I agree 100% that you shouldn't rely on a single point of failure for any backup. Data must be in at least two places.

Qub3d4y ago

In this case, that is correct. From my (and the project TOS;DR's [0]) interpretation the TOS is still in general worded in such a way that Google can ultimately say, "we feel this shouldn't be here" and remove it.

For the record, that is fine, legally speaking. It just is something that I think we don't keep in mind, until our Gmail login gets locked[1] and our last backup was 6+ months ago.

[0]:https://tosdr.org/en/service/217

[1]:https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

Gigachad4y ago

I had one of my google docs files restricted which was just a school group project. No one was able to access it.

panarky4y ago

> they explicitly reserve the right to do so in their TOSes

Do you have an example of Google explicitly reserving the right to delete any/all data with no notice and for no reason?

Qub3d4y ago

https://edit.tosdr.org/points/14762

Note the wording "we reasonably believe". There is nothing objective about this.

Additionally: https://policies.google.com/terms#toc-removing

> Removing your content

> If any of your content (1) breaches these terms, service-specific additional terms or policies, (2) violates applicable law, or (3) could harm our users, third parties, or Google, then we reserve the right to take down some or all of that content in accordance with applicable law.

Finally, buried 3 links deep in the Terms[0], we can find a list of some of what Google considers valid under this clause. A lot of it makes sense (clearly illegal things) some of it is subjective ("misleading content"). If you feel comfortable all of your data could never be interpreted by anyone as falling under any of these categories, be my guest, use GDrive.

[0]: https://support.google.com/docs/answer/148505

panarky4y ago

> If any of your content (1) breaches these terms, service-specific additional terms or policies, (2) violates applicable law, or (3) could harm our users, third parties, or Google

These are definitely reasons, are they not?

For the record, would you please retract your claim that Google reserves the right to delete all data with no reason?

unclekev4y ago

Meanwhile my Mom uses Google Drive to share pirated movies with family members (despite my protests) and is yet to have a single file flagged.

Just need to name your file something like "Output04.S01E01.NumberOne.1080p.HEVC.x265-MeGusta" and you'll be fine /s

How can they get things so wrong?

Animats4y ago

File a DMCA counter-notice, of course.[1]

You may have to do this the hard way, via Google's address for service of process.[2] Use registered mail or FedEx.

There's also the option of taking Google to arbitration. Legal advice from one of those "free quick consult" services may be helpful.

[1] https://www.nolo.com/legal-encyclopedia/responding-dmca-take...

[2] https://support.google.com/faqs/answer/6151275

watusername4y ago

Can you file a counter-notice _in absence of_ a DMCA notice? The problem at hand is not DMCA.

Animats4y ago

Google claimed a copyright violation and did a takedown. That's what DMCA counter-notices are for. They phrased it in other terms, but ask a lawyer if that matters.

manquer4y ago

It matters for a counter-notice that they didn't issue a DMCA notice. You can always sue them, of course, but since the first action was not taken under the DMCA framework, you cannot issue a counter-notice to something that is not a DMCA notice.

The lawsuit is not strong either, because their ToS says they can delete all your data for any reason or no reason at all.

newhotelowner4y ago

I am a small business owner. I pay for Google One so that all my files are backed up and synced across devices. I also pay for Backblaze to back up all my files (just in case Google screws me).

Is there an alternative for encrypted backup & sync between different computers?

Filligree4y ago

Plenty. One of the easiest to use is probably Syncthing, if you’re thinking of self-hosting.

hyperdimension4y ago

You're talking about their subscription, Google [Removed for DMCA Violation]?

kbumsik4y ago

I have used Dropbox for almost 10 years and it has worked well for me so far!

Dropbox offers great file history and restoration support. One day I permanently deleted some files, and the support team kindly restored them within a day.

manquer4y ago

Depending on how technically inclined you are something like Tarsnap might be a good fit. [1]

[1] https://www.tarsnap.com/

mbrukman4y ago

Disclosure: I work at Google, but not on the Google Drive team specifically.

Sorry about the issue, folks! The Google Drive team is aware of it and is working on remediating it.

And thank you all for the many test cases! :)

Ansil8494y ago

> The Google Drive team is aware of it and is working on remediating it.

So reviews of copyright infringement claims _can_ be requested, but only if they reach the front page of HN? That is not OK.

PaulHoule4y ago

Must have infringed on Metallica.

gpderetta4y ago

Must be google's new Easter egg...

iszomer4y ago

"it's not a bug, it's a feature!"

itronitron4y ago

I guess the moral of the story is, never do business with a company that doesn't provide a mailing address to which you can mail a turd (at book rate.)

Gigachad4y ago

Or just use google takeout to grab regular dumps of your files

daneel_w4y ago

And googling "15.91/4" throws a SafeSearch alert letting us know that "some results may be explicit".

Devasta4y ago

No pity, if you are using Google products for anything important you only have yourself to blame.

1vuio0pswjnm74y ago

Maybe for those times when copyright infringers try to split an infringing file into separate files containing only one bit, represented as text, to avoid detection. No, I am not serious.

Try testing a file that contains more than a single 1 or 0, such as 01111000.

reaperducer4y ago

I've said it before, Google is trying to be the new Microsoft.

https://www.theonion.com/microsoft-patents-ones-zeroes-18195...

ahsima14y ago

I wonder if it's part of some sort of cyberattack. Someone knows that deleting a file containing a "1" or "0" from the target's gdrive will break something they want, so they filed a false DMCA claim.

dibujante4y ago

Maybe it's a low-key attempt to disrupt the use of Google Drive as a command and control source for botnets? Seems very easy to work around.

woliveirajr4y ago

Google Drive answered:

"Hi Dr. Emily Dolson, thank you for letting us know about this issue! The Drive team is very much aware of this now thanks to all of you we're working on it!"

hderms4y ago

This seems like a really great case for property-based testing and/or fuzzers. Randomly generated output should virtually never be flagged for copyright (and flags would likely be rare enough that you could manually assess whether each one was accurate). A system like this, which puts an enormous amount of leverage in the hands of automated decision making, must be robust against things like this.
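
A minimal sketch of what such a check could look like. Everything here is an assumption for illustration: `is_flagged` is a made-up stand-in with a deliberately planted bug, not Google's classifier, and `find_flagged` is a hypothetical helper name. The idea is just to feed random bytes to the classifier and collect whatever it flags for manual review.

```python
import random
from typing import Callable, List


def is_flagged(data: bytes) -> bool:
    """Hypothetical classifier with a planted bug: flags tiny all-digit files."""
    return len(data) <= 4 and data.strip().isdigit()


def random_blob(rng: random.Random, max_len: int = 64) -> bytes:
    """Return between 1 and max_len uniformly random bytes."""
    length = rng.randint(1, max_len)
    return bytes(rng.randrange(256) for _ in range(length))


def find_flagged(classifier: Callable[[bytes], bool],
                 trials: int = 10_000, seed: int = 0) -> List[bytes]:
    """Run the classifier on random inputs and collect everything it flags."""
    rng = random.Random(seed)
    samples = (random_blob(rng) for _ in range(trials))
    return [blob for blob in samples if classifier(blob)]


if __name__ == "__main__":
    hits = find_flagged(is_flagged)
    # Property: random noise should essentially never be flagged, so every
    # hit is a candidate false positive worth a manual look.
    print(f"{len(hits)} of 10000 random inputs flagged")
    for blob in hits[:10]:
        print(repr(blob))
```

Uniform random bytes rarely hit very short inputs, so a real harness would probably also want a generator biased toward tiny files, which is exactly where this thread found the problem.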

marivilla4y ago

Google Drive also offers client side encryption, which would make this scanning ineffectual: https://flowcrypt.com/blog/article/2021-06-14-google-workspa...

So as long as you have a ton of money and are a corporation your privacy should be just fine

Snuupy4y ago

You can do the same with rclone crypt or cryptomator.

mateo14y ago

This is your annual reminder that you don't own your files if they're stored on someone else's computer (also known as "the cloud"). Keep offline backups; legislation has made it very easy to export literally everything from Google.

raydev4y ago

Looks like the Google Drive team is aware of it. Wonder what happens next.

https://twitter.com/MishaBrukman/status/1485804925561057291

diogenesjunior4y ago

I feel bad for the new hire who wasn't entirely sure what he was doing. Something similar happened at reddit[0], wouldn't put it past google.

0: https://redd.it/m0rmux

jacob0194y ago

Another example of why it is time to dump google. With google you are the product, not the customer. There are decent alternatives for everything that google offers. It feels really good to do.

flykespice4y ago

Seriously, I'm upset that Google Drive can block files that you own; I feel my trust has been betrayed. We're really moving into a dystopian age where companies can control your personal data.

pmontra4y ago

Is this an unintended adversarial attack to some copyright classifier?

Grismar4y ago

I wonder: is this a technical issue, or just a practical joke by someone who has managed to convince Google Drive that they have the copyright to files containing only "1"?

shmerl4y ago

Copyright was always bizarre in the sense that any information can be expressed as numbers. So why are some numbers more copyrightable?

Also reminds me of this ("Microsoft Patents Ones, Zeroes"): https://web.archive.org/web/20100607151726/http://www.theoni...

otterley4y ago

Any novel can be expressed as a sequence of alphabetical characters. So why are these copyrightable?

shmerl4y ago

Exactly the question you should wonder about. So, translated into abstract form, different numbers have different legal application :)

Illegal numbers anyone?

ccbccccbbcccbb4y ago

No surprise here. One World Corporation is going to reserve 1 for itself. This tweet is just a test drive.

luckystarr4y ago

Could this be a rare case of a SHA-whatever collision? Or do they use MD5 to identify these files?

Ansil8494y ago

Are there any updates on this? Like any accountability or an explanation why this happened?

userbinator4y ago

I wonder how quickly this would have been fixed if it happened to a Google employee.

hedora4y ago

This is a great example of why all cloud services should be end to end encrypted.

wanderingmind4y ago

Maybe just use rclone or other tools to store encrypted files at rest in drive.

kats4y ago

May want to back up your Google account if you see a message like that.

golem144y ago

That's Number Wang!!!

jpambrun4y ago

Google's bots are crazy. Thank god they sold Boston Dynamics...

loudtieblahblah4y ago

Protondrive is a thing.

Zero knowledge storage needs to be the default everywhere.

ComradePhil4y ago

As is Mega. I don't know why one would ever use Google Drive, OneDrive or Dropbox when Mega and Protondrive are available.

loudtieblahblah4y ago

The collaborative apps aspect of Suite/Drive is a wondrous 1-2 punch

dwaite4y ago

So who is going to run out and get a tattoo a la DeCSS?

fortran774y ago

At least it wasn’t flagged for illegal porn, too.

davebailey4y ago

This could easily have been a test value.

mbfg4y ago

I remember BlackDuck flagging a one pixel (white) image as a copyright infringement in our product.

spaniard_dev4y ago

Who said that “Don’t be evil” thing?

davebailey4y ago

This might have been a test case.

olliej4y ago

1 is the loneliest IP :D

gumby4y ago

what about an empty audio file four minutes and 33 seconds long?

TedShiller4y ago

Maybe but anyone can claim that on Twitter

zydex4y ago

It's been reproduced by other users, see below comment in this topic.

https://news.ycombinator.com/item?id=30061080

TedShiller4y ago

Interesting, you’re right then.

The thing with Twitter posters is they often don’t understand what they’re doing.

superkuh4y ago

Play stupid games, win stupid prizes. Putting your data in a megacorp basket means it'll be treated primarily with consideration towards their legal liability first, other megacorps second, and you third or fourth.

xiphias24y ago

If the megacorps buy up the best competitors, there's not much chance left for people, just use them.

gruez4y ago

It's true to some extent, but in the file storage space it's not really applicable. There are hundreds of competitors, and I'm not even sure if any file storage providers got bought up by "megacorps".

xiphias24y ago

Not really. I'm using all parts of the Google suite together, as they are well integrated with every device I have and with each other. I am using Adobe cloud for photos instead of Google Photos, for example, and even there I quite often feel the difference in integration with the Android operating system.

tyingq4y ago

Google drive does support metadata like a description and comments. I wonder if someone posted some copyrighted text in a comment?

Update: Recreated it. Most of them are now flagged. Took about an hour for that to happen. So far, all that have just one byte, being a "1", and also the one that contains "1\n".

The one with "1\r\n" hasn't been flagged. The file names of the flagged files: "one.txt", "onev2.txt", "output04.txt" and "output05.txt".

Screenshots of the email and Google drive: https://imgur.com/a/RHnEJcj (note the little flags on the Google drive view, and the file sizes)

Just added some files with "0" and "0\n", we'll see if "0" is copyrighted :)

OneLeggedCat4y ago

Just tested this myself. Got the same results as you while using different file names.

Barrin924y ago

At the scale at which Google operates, even if they fixed it within a minute it probably takes some time to test and roll these things out.

Sometimes I wonder if people think that infrastructure for billions of people is some sort of magic where you push the fix button and magically everything's solved across the globe.

rezonant4y ago

Sure it'll take time.

Google's become so big that collapse/disruption is inevitable.

OneLeggedCat4y ago

Sometimes I wonder if people will excuse anything Google does that hurts people and assume it is some sort of magic to not choose to do that.

fragmede4y ago

The faster the fix can roll out, the faster a breakage can also roll out. Move slower and break fewer things.

dhosek4y ago

I remember interviewing at Amazon (this was 2009) and being told that any engineer could deploy to the main site at any time. The thought terrified me.

cardosof4y ago

A product manager with an MBA will argue this affects only a handful of customers, mostly non-paying, so fixing it should sit at the end of the backlog.

jodrellblank4y ago

mianos4y ago

I wish they would finish the thumbs down functionality in YouTube Music or look at WearOS once in a while (150000 one-star reviews LOL).

judge20204y ago

It only affects sharing so it isn't a disaster if this happens, probably P3 medium if you went by their bug bounty priorities.

tyingq4y ago

That is what I saw, I can still access the file. I do wonder, though, if there's some threshold of flagged files where they suddenly enact some other punishment.

LightG4y ago

Alternatively, just the next brick laid upon the pyramid of "Google sux".

tyingq4y ago

Update: It has now flagged the file with "0\n" in it.

So it's official, both 0 and 1 are copyrighted :)

https://imgur.com/a/xMgh6Xn

teachrdan4y ago

Relevant Onion article here: https://www.theonion.com/microsoft-patents-ones-zeroes-18195...

PixelOfDeath4y ago

Relevant real Microsoft patent on the IS NOT OPERATOR:

https://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2...

labster4y ago

Why is satire so much better at predicting our dumb reality than sci-fi? This is definitely the darkest timeline.

ImprovedSilence4y ago

rgoulter4y ago

Relevant SMBC comic: "Mister President! India is suing for half of our data!" https://www.smbc-comics.com/comic/2012-08-29

jfoster4y ago

Perhaps this issue came about because Google's AI read this article.

kregasaurusrex4y ago

From 1998 no less!

steelframe4y ago

politician4y ago

General Mills owns the copyright on 0s, sir. (Cheerios)

meshaneian4y ago

I believe 0 and 1 belong in the public domain, at least under US copyright law - there appears to be a significant amount of prior art.

judge20204y ago

0: https://www.nytimes.com/2014/03/19/business/media/viacom-and...

lostcolony4y ago

Not sure if prior art is relevant, per se (usually quoted in the context of patent law but IANAL), but, certainly, any copyright has expired by this point.

blibble4y ago

I hope you did that from a burner account!

gtirloni4y ago

Exactly my thoughts. I'd not want to risk angering the Google gods with this experiment, considering your account can be disabled by yet another automated system with no way to contact a human.

rezonant4y ago

Remember, "there is no review for this restriction"

Crosseye_Jack4y ago

Well that MP3 the RIAA flagged was FULL of ones and zeros, so it only makes sense they would get flagged ;-)

kingcharles4y ago

Silence copyrighted too:

https://en.wikipedia.org/wiki/4%E2%80%B233%E2%80%B3

a9h74j4y ago

Does this mean every binary file is now a derivative work?

gcanyon4y ago

That’s it, kids, the internet is now illegal — until we switch to a quaternary system using only 2’s and 3’s <taps forehead smartly>

a9h74j4y ago

Suspicious -- just when Google is announcing success with qbits and quantum computing.

kelnos4y ago

Wouldn't that still be binary, just using different symbols?

bl4ckneon4y ago

Does that mean someone copyrighted binary? :p I guess that means that this comment and everything digital is copyrighted.

jrockway4y ago

Technically any creative work you author is copyrighted, so your comment was copyrighted anyway.

Whether or not large tech companies will punish people for infringing upon your copyright is yet to be determined.

zapdrive4y ago

Maybe it's because you have "one" in the file name: "zeronewline".

knodi1234y ago

"A review cannot be requested". What a soulless suck machine.

JorgeGT4y ago

"In order to review, please reach the top of HN"

hlbjhblbljib4y ago

You do not have sufficient social credit to request a review, please become a better person and try again.

CJefferson4y ago

Turns out Google also hates 500, 174, 833, 285 and 302 (from generating files from -1000 to 1000).
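
For anyone wanting to repeat this, here is a rough sketch of how such a batch of files could be generated locally before uploading. The file naming scheme, the lack of a trailing newline, and the `write_number_files` helper itself are all assumptions; the comment above doesn't say how the files were actually produced.

```python
from pathlib import Path
from typing import List


def write_number_files(out_dir: Path, start: int = -1000,
                       stop: int = 1000) -> List[Path]:
    """Write one file per integer in [start, stop], containing only that number."""
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for n in range(start, stop + 1):
        path = out_dir / f"{n}.txt"               # file name is an assumption
        path.write_bytes(str(n).encode("ascii"))  # no trailing newline (assumption)
        paths.append(path)
    return paths
```

Calling `write_number_files(Path("number_files"))` would create 2001 small text files, one for each integer from -1000 to 1000 inclusive.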

apocalyptic0n34y ago

tyingq4y ago

I did notice it flagged "1", "1\n", but not "1\r\n". Then, it didn't flag "0", but did flag "0\n". So line endings seem to matter as well, but not in some consistent way.
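
One possible reading, sketched below: if matching were done on an exact content hash (purely an assumption, nothing in this thread establishes how Drive matches files), then each line-ending variant is a completely different item, and which ones get flagged would depend only on which exact byte sequences ended up in the match list.

```python
import hashlib
from typing import Dict

# The byte sequences tested in this thread, keyed by a readable label.
VARIANTS: Dict[str, bytes] = {
    "1": b"1",
    "1\\n": b"1\n",
    "1\\r\\n": b"1\r\n",
    "0": b"0",
    "0\\n": b"0\n",
}


def digests(variants: Dict[str, bytes]) -> Dict[str, str]:
    """Map each label to the SHA-256 hex digest of its bytes."""
    return {label: hashlib.sha256(data).hexdigest()
            for label, data in variants.items()}


if __name__ == "__main__":
    for label, digest in digests(VARIANTS).items():
        print(label, digest)
```

All five digests differ, so an exact-hash matcher would see no relationship between "1" and "1\r\n" at all.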

CJefferson4y ago

Another bunch of numbers got flagged (186, 451, 336, 173, 266). I deleted the experiment, just in case I got my account deleted for too many naughty numbers.

Mathnerd3144y ago

That probably looks even more suspicious.

tyingq4y ago

That's interesting. 500 and 302 popped into my head as common HTTP status codes. The others don't seem notable to me.

duxup4y ago

“A review cannot be requested for this restriction.”

Absolute madness cannot be reviewed…

Reason cannot be applied.

kingcharles4y ago

Things are getting out of control. Reason has left the building.

Here for instance is a clunker I've had from Cash App support (I'm disputing a package I never received which was returned to the merchant and the merchant won't answer any communications):

"M.J. from Cash Support once more. To be clear, I am not misunderstanding the location of the package at this point, and understand that it has been returned to the sender now.

...

"Follow the script on the screen, never deviate, even in the face of absurdity. Forever onwards, loyal drone."

rtsil4y ago

It's terrifying, to be honest.

melissalobos4y ago

folbec4y ago

My bet is someone did a DMCA claim on a file containing 1, either by mistake or as a joke.

Then Automated Stupidity took over.

TillE4y ago

I'd guess some giant folder of mostly-copyrighted material, which happened to include those files for some reason.

a_f4y ago

metadata match on an album track number?

ImprovedSilence4y ago

rapnie4y ago

It's AI. Automated Ineptitude.

DebtDeflation4y ago

"A review can not be requested for this restriction."

avisser4y ago

I wonder if this is an incredible hash collision. hash("1") == hash(disney_movie.mp4)

xigoi4y ago

Not the case, since it also happens with other numbers.

rezonant4y ago

Well, as it turns out, hash("0") == hash(Pirates of the Caribbean) as well.

Not sure how Disney is finding these.

p1mrx4y ago

Assuming it's a cryptographic hash function, that sort of collision just never happens.

askvictor4y ago

Well, it has to happen at some point. Exceedingly unlikely, but never say never.

account424y ago

What makes you think that this would use a cryptographic hash instead of a perceptual hash?

hderms4y ago

Birthday paradox could play a role if there's enough content out there
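
To put rough numbers on that, here is a sketch using the standard birthday approximation p ≈ 1 - exp(-n(n-1)/(2N)) for n items hashed into N possible values. It assumes an ideal, uniformly distributed hash and only covers accidental collisions; the figure of 10**12 files is a made-up illustration, not a known number.

```python
import math


def collision_probability(n_items: int, space: int) -> float:
    """Approximate chance that any two of n_items share a hash value.

    Birthday approximation: p ~= 1 - exp(-n(n-1) / (2N)), where N is the
    number of possible hash values. Assumes a uniformly distributed hash.
    """
    exponent = n_items * (n_items - 1) / (2 * space)
    # expm1 keeps precision when the probability is extremely small.
    return -math.expm1(-exponent)


if __name__ == "__main__":
    n_files = 10**12  # hypothetical number of stored files
    for bits in (128, 256):
        p = collision_probability(n_files, 2**bits)
        print(f"{bits}-bit hash, {n_files} files: p ~= {p:.3e}")
```

Even with a very large number of files the accidental-collision probability stays vanishingly small for 128-bit and 256-bit outputs, and since several different numbers were flagged, a single freak collision looks even less plausible.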

kgwxd4y ago

Did you do that with a throwaway account? You'd be playing with fire getting a bunch of files flagged on your primary account.

tyingq4y ago

Nope, living dangerously :)

ezoe4y ago

Think it this way. If Google is this dumb, any account can be flagged for no reason so it doesn't increase the risk even further than it currently is.

discordance4y ago

Make sure to encrypt your 0’s and 1’s

melenaboija4y ago

I really want to think there are few people left out there leaving their 0’s and 1’s unencrypted in the cloud. I hope this post gets enough hype to solve the problem

vaillancourtmax4y ago

> I really want to think there are few people left out there leaving their 0’s and 1’s unencrypted in the cloud

This describes most everybody.

mr_toad4y ago

Just encrypt the 1s. The 0s are nothing.

mooman2194y ago

It looks like those files are marked as public. Does it trigger for you on files not being shared?

tyingq4y ago

I didn't try that. I marked them public because I assumed that's what triggers the scan.

reincarnate0x144y ago

How far Artificial Stupidity runs into the weeds from that basically sound beginning appears to be "still farther."

obmelvin4y ago

Does it just prevent you from sharing the files or does it prevent you from accessing the file as well?

can16358p4y ago

Does it flag comments for copyright infringement?

What if I comment on some file with some copyrighted content in the text, just implying something about that IP, with the copyrighted text in my comment? How can this be infringement?

tyingq4y ago

No idea. I was just trying to guess at why it might flag a single byte file as a copyright infringement.

pyuser5834y ago

Clearly a violation of U2’s copyright.

rst4y ago

The filename is "output04.txt".

hulitu4y ago

They changed their algorithm. Now it will flag files full of 0s.

iameli4y ago

All software has bugs; I'm not mad at all that this silly test case was flagged incorrectly. The truly infuriating part is "A review cannot be requested for this restriction."

PostOnce4y ago

Disney robbed us, our children, their children, and possibly generations beyond that with their more-than-a-century copyright terms.

I thought about posting this comment the other day and decided not to, but your mention of Disney+ stirred the idea in me again.

We have so much modern media about Dracula, Sherlock, Cthulhu, etc, a thousand flowers bloom... new movies, new games, new art of all kinds.

Disney & friends stole that from us. We won't have a million new takes on (for example) The Hobbit for decades because of them.

We have copyright terms of up to 120 years... stuff like Pong was made before I was born and won't be public domain until long after I'm dead.

josho4y ago

Really great points.

https://www.gamedesigning.org/gaming/copyright/

Sebguer4y ago

Here's a case on why Tetris is not copied as easily: https://publicknowledge.org/tetris-copyright-decision-shows-...

Also, game mechanics can't be copyrighted but they can be patented: https://www.gamesindustry.biz/articles/2021-02-08-warner-bro...

oehpr4y ago

There was an incident with Games Workshop and fan animation. Games Workshop has set up their own animation studio and decided to go hard against the fan community that grew around the setting.

The whole incident made me mad, sure. But what it really made me feel was disenfranchised under the regime we live in, as it is.

Sit there

Consume passively

Do not do anything.

Every time I see a thing that I like, this thought just stews in the back of my mind. Fandoms are cattle pens. Liking things is a mistake.

MugaSofer4y ago

Kind of funny that you mentioned the Hobbit, given that half the fantasy genre is takes on LotR with the serial numbers filed off. So people have found some partial workarounds.

op00to4y ago

Man, what a bummer. People have to come up with new ideas rather than rehashing old ones. How will we ever stay entertained?

mahogany4y ago

For example, you may want to take a look at: https://en.wikipedia.org/wiki/List_of_Disney_animated_films_...

andybak4y ago

rexreed4y ago

No new idea emerges in a vacuum. Every idea builds upon other ideas. Ever heard of the phrase "on the shoulders of giants"?

heavyset_go4y ago

It gets boring re-inventing the wheel over and over again.

JamesBarney4y ago

It'd be nice if the law had an escrow appeal process. The alleged violator posts a $100 escrow, and the accuser then has to do the same. Then Google reviews it, makes a decision, and loser has to pay for it.

amne4y ago

jiggawatts4y ago

Copyright protection laws are the same kind of thing. While the marginal cost of enforcement is zero, there is similarly zero incentive to do it correctly and respectfully of the law.

If there were enforced financial penalties for each screw-up, then any errors like this would be ironed out very quickly.

No penalty? No bug fixing!

kofejnik4y ago

Bloody brilliant, and of course will never happen

Wicher4y ago

> Then Google reviews it, makes a decision, and loser has to pay for it.

I'm afraid they'll have incentives to automate that review, and then simply repeat that you can't appeal. Now you still can't access your file AND you're out $100 :-/

withinboredom4y ago

Sounds like some shenanigans you’d see on a blockchain. Though, if a blockchain did something like that to reverse a transaction, that’d be amazing.

vidarh4y ago

A legal requirement to provide an appeals process for automated decisions would be a good step.

raxxorrax4y ago

A legal requirement that disallows removing content until the claim has been proved would be sensible.

eterm4y ago

Not just own, but you can't even license the use of copyrighted works because even if you were somehow licensed the automatons will take over and you'll get flagged off the internet anyway.

kaetemi4y ago

So Google Drive is not an option for safely storing documents that you don't want to lose. And by extension, Google Docs is equally dangerous.

jfoster4y ago

Yeah, based on this, any GSuite presentation containing a logo seems like it might meet Google's criteria for getting blocked.

yeetaccount44y ago

Fuck that, you’re the big game in town, you get the big bitches. Fix your shit.

ChicagoBoy114y ago

Whelp, on the admin panel, you can get a report of those files, and then mark it as a false positive. Which I did. But then nothing happened, and nothing changed. It was no use.

spicybright4y ago

That's ridiculous. Storing social security numbers is necessary for lots of businesses.

Imagine your filing cabinet not letting you file employment forms with a SS# on them.

It's the sticky note password problem all over again...

015a4y ago

This isn't some default thing on G-Suite; it's the DLP setting which enterprises can elect to toggle on, if they pay for the most expensive G-Suite plan. It also, afaik, only applies to shared files.

spicybright4y ago

That actually changes the entire context of this. Sorry, I don't think I read closely enough to the source article.

driverdan4y ago

> That's ridiculous. Storing social security numbers is necessary for lots of businesses.

They should absolutely not be stored in a GSuite document. SSNs should be treated more securely than credit card numbers.

spicybright4y ago

Should, yes, but in practice not really. I'm talking more about employee information.

You need it for tax forms, background checks, citizenship queries, sometimes bank information, etc.

So your options are:

1. Store them locally on a computer. Typically on some old windows 7 machine in the corner that hasn't been updated in some time.

2. Store documents physically. Which will either be scanned onto random computers belonging to whoever needs them to be sent through probably insecure mail servers.

Or worse, your boss taking a picture of your form and sending it to people that way, leaving the form on their phone.

3. Some other online storage like whatever M$ is offering

4. Use google and somehow store SS#'s somewhere less secure, or obfuscate them in a way no one but a few people will understand and hope they don't block any other files you upload.

It's like they take away your gun so you can't shoot yourself in the foot, then fire it at things they think are problems, hoping not to hit your feet.

And what's a few toes to a company the size of google?

madaxe_again4y ago

Citation: https://www.pnas.org/content/106/27/10975

bonzini4y ago

Which is a problem of its own, since they're effectively usernames.

Do any countries other than the US have such an abomination, where you can figure out the SSN of someone and ruin their life?

jsymolon4y ago

> SSNs should be treated ...

As someone who dealt with identity theft, SSN should only be collected if contact with the SSA is needed. I.E. payment of social security benefits.

Any and ALL other "ID", nope. Use some other number.

nathanaldensr4y ago

Thanks to Equifax, SSNs are effectively public anyway.

bob10294y ago

> SSNs should be treated more securely than credit card numbers.

I disagree. Both SSNs and credit card numbers should be treated with equal consideration.

vidarh4y ago

E.g. here[1] is one of Ancestry's many catalogs based on US government data dumps that even allows you to explicitly search by SSN.

I guess it's time to move off Google Docs too (I've largely left Gmail)

[1] https://www.ancestry.com/search/collections/60901/

slig4y ago

Google is working very hard to make everyone drop their shitty services.

kwhitefoot4y ago

Is this a tool that you pay money for? Sounds like it fails the 'fitness for purpose' test.

Kim_Bruning4y ago

This is why running automated filters on data is never a good idea, and should never be accepted (at least for enterprise applications, but really for anything).

Arnavion4y ago

This is another case for only pushing encrypted files to storage hosts, unless it's against Google Drive's TOS or something. Has anyone tried it? Did Google complain?

aspenmayer4y ago

https://rclone.org/crypt/

https://rclone.org/union/

https://github.com/plexdrive/plexdrive

anon90014y ago

Tarsnap is entirely encrypted and the service provider can't see the keys. They run it on EC2 and S3: https://www.tarsnap.com/infrastructure.html

xcjs4y ago

I've had an Enterprise Gsuite account with 50 TB of encrypted data for around 5 years (this was of course prior to the rebranding). I've had no complaints from Google so far.

aspenmayer4y ago

You use rclone, or another stack? I’m always trying new tools in this space.

CPAhem4y ago

Syncdocs is also an easy end-to-end encryption/sync app for Google Drive

https://syncdocs.com

Gigachad4y ago

Problem is it makes the apps / integrations useless. I use google drive primarily from my ipad and encrypted files would mean I can't just "save to files" and have it drop in to google drive.

stevens374y ago

yes

openssl enc -e -aes-128-cbc -in "${1}" -out "${1}.cr" -iter +123456 -k <password>

version_five4y ago

"A review cannot be requested for this restriction"

ML enforcing rules is bad enough, but not allowing false positives to be corrected is ridiculous. This is why I would never consider g-suite for any business application.

ehnto4y ago

Reminds me of that 70s IBM presentation quote that surfaced recently, "A computer can never be held accountable, therefore a computer must never make a management decision".

sb0574y ago

I think that quote should be flipped.

"A computer can never be held accountable, therefore a computer must always make management decisions."

sandworm1014y ago

Or..

"A computer can never be held accountable for decisions, therefore all computer decisions are management decisions."

queuebert4y ago

That misunderstanding could explain the last several decades of corporate management.

beebeepka4y ago

I thought the same as that's our current reality. "It wasn't us, it was the computer!" Things are only going to get worse

LudwigNagasena4y ago

A program can never be held accountable, but the person (or the whole company, or both) who decided that it should make management decisions can.

ehnto4y ago

bobm_kite94y ago

Yes! This is one thing humans are still much better at than AI: taking blame.

giaour4y ago

The best of both worlds is when you have a human supervise a fully automated process. You can pay them peanuts and still use them as liability sponges.

(The term "liability sponge" is shamelessly stolen from https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2757236 .)

dylan6044y ago

Computer: "It's not my fault". The sunspots made me do it.

Sunspots are computers' devil.

nefitty4y ago

patrick4514y ago

Now, imagine letting a computer decide how to drive a 15 ton truck down the freeway at 75 mph.

jrockway4y ago

matheusmoreira4y ago

belter4y ago

We are in the middle of the hostilities, and we can always appreciate if more young idealists, as well as practical and hands-on militias want to join the revolution:

"The Coming War on General Computation":

http://opentranscripts.org/transcript/coming-war-general-com...

version_five4y ago

josephcsible4y ago

bostik4y ago

My previous take on the subject: https://news.ycombinator.com/item?id=22488496

landemva4y ago

The surface is larger than just copyright and includes anything the government can enforce in the name of protecting or preventing ... [children, terrorists, money laundering].

Lobbyists wondering why not slip this into appropriations bill and make MS put this in desktop.

kbrannigan4y ago

Load up on USB drives. I mean really, when did we suddenly become dependent on cloud services? It's a phenomenon that's less than 15 years old.

narrator4y ago

aasasd4y ago

Wow, who would've thought that you can run stuff on servers.

withinboredom4y ago

vidarh4y ago

Have you tried downloading it via IMAP?

rsync4y ago

"How long until cloud providers are forced to scan block devices, so you can't even self-host a file erroneously flagged by the ML?"

If you are using a cloud provider you are not self-hosting.

Self hosting means you own the machine.

Johnny5554y ago

If Amazon didn't offer EBS block device encryption (with a key that's ostensibly only accessible to the customer), then customers would just use full disk encryption instead.

blibble4y ago

client side rot13 to the rescue

(... I have actually used this in the past to work around my employer's moronic proxy)
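
For reference, a tiny sketch of client-side rot13 using Python's standard `codecs` module (the `rot13` wrapper name is just for illustration). This is obfuscation, not encryption: it might get past a naive content matcher, but it offers no real confidentiality.

```python
import codecs


def rot13(text: str) -> str:
    """Rotate ASCII letters by 13 places; all other characters are unchanged."""
    return codecs.encode(text, "rot_13")


if __name__ == "__main__":
    scrambled = rot13("Hello, World!")
    print(scrambled)
    # Applying rot13 twice returns the original text.
    print(rot13(scrambled))
```

Note that rot13 leaves digits alone, so a file containing just "1" would come out exactly the same.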

tempodox4y ago

Given that going viral on Twitter is the only functional help desk for such situations, “Tweet storm as a service” would be a promising proposition.

ThrustVectoring4y ago

kingcharles4y ago

I've not had much luck with that option, you need a large megaphone for it in most circumstances.

I've had better luck lately just using contact services to find the personal cellphone and email address of high-ranking employees and contacting them to get escalation.

klyrs4y ago

You don't think that would automatically get shut down for spam-like activity?

kadoban4y ago

Most of the spam bots don't, they'd have a good shot if they don't put all their eggs in one basket.

friedman234y ago

"The trial" by Kafka needs to be required reading for everyone involved in implementing systems like this.

capableweb4y ago

But since it probably doesn't affect the bottom-line, it's unlikely to actually happen.

miohtama4y ago

There was probably a risk discussion at some point during the feature's implementation. Lawyers got involved, and the discussion likely went like this:

1) DMCA Safe Harbor gives us, Google, unlimited protection

3) We offer key account manager services for organizations large enough that could cause stir (Spotify size)

kwhitefoot4y ago

And RMS' Right to Read: https://www.gnu.org/philosophy/right-to-read.en.html

newsbinator4y ago

Or "Catch-22" by Heller: https://en.wikipedia.org/wiki/Catch-22

robocat4y ago

I don’t really think the book is relevant to this discussion though.

tinus_hn4y ago

Really? They don’t need any more ideas!

pdonis4y ago

> people need to consider hardening their activities against google et al

Isn't your own solution--that you would never consider g-suite for any business application--the obvious simplest way to "harden" against Google?

pokot04y ago

I agree with you and in general the move from owning software to accessing a service has been detrimental for the end user (more costs, more problems, less control).

But the restriction here is most likely just the inability to share (I would guess publicly). I don't believe it prevents you from accessing your file.

gopher_space4y ago

A public share might be the entire point of using a service.

dane-pgp4y ago

> ML enforcing rules is bad enough, not allowing false positives to be corrected is ridiculous.

And potentially illegal. According to Article 22 of the GDPR:

I think that being accused of copyright infringement (and having your free speech rights curtailed) should count as "similarly significant" to a legal effect on someone.

bryanrasmussen4y ago

Free speech does not exist in the same way in the EU, where the GDPR exists, as it does in the U.S.

hope this clarifies it for people.

on edit 2: here you can get a country by country overview https://en.wikipedia.org/wiki/Freedom_of_speech_by_country#E...

robbedpeter4y ago

hetspookjee4y ago

I think the most likely outcome is much like the services they offer to help with settling airplane ticket compensation. I presume most of them are actually in service of the airplane operators.

Volker_W4y ago

> I presume most of them are actually in service of the airplane operators.

why?

karsinkk4y ago

wolpoli4y ago

jjcon4y ago

I really don't think their copyright checks are run by anything in the ML domain.

blunte4y ago

Ironically, it may end up being one of these "tiny" scenarios which finally does Google in.

When trying to illustrate a problem or bug, one of the typically time consuming challenges is reducing the scenario to the minimal case which illustrates the problem. So thank you, @emilyldolson!

quickthrower24y ago

It's a real motivator to leave Google and the MAG cloud in general. It has at least reminded me again to do regular Google takeouts.

trhway4y ago

> "tiny" scenarios which finally does Google in

Danger of tiny scenarios - I expect that Google like any BigCo will try files containing only 2, 3, 7 - no bug, ok, and then push the fix like this

if (l = read(file) ; l == "1") ... else

cgrealy4y ago

Given it's almost certainly a ruleset generated by an ml agent, it's more likely to be a change in the training data.

bonzini4y ago

Or just don't flag any file below 100 bytes.
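
A rough sketch of that kind of guard, assuming a hypothetical `should_scan` check that runs before any content matching. The 100-byte threshold is the one suggested above; the function and constant names are made up for illustration and say nothing about how Drive actually works.

```python
MIN_SCAN_BYTES = 100  # threshold from the suggestion above; purely illustrative

def should_scan(content: bytes, min_bytes: int = MIN_SCAN_BYTES) -> bool:
    """Return True only if the file is large enough to be worth matching."""
    return len(content) >= min_bytes

# The tiny files from this thread would all be skipped; a 100-byte file would not.
for sample in (b"1", b"1\n", b"0\n", b"x" * 100):
    print(len(sample), should_scan(sample))
```

Unlike special-casing the string "1", a size floor also covers "0", "1\n" and every other trivially short file.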

choward4y ago

russellbeattie4y ago

I had no idea that Drive isn't just a disk drive in the cloud. I've always treated it as such.

Do they all do this? OneDrive, Drop, etc.?

andrewxdiamond4y ago

I believe this is only for accounts with sharing on. Otherwise, there is no infringement

chickenmonkey4y ago

Exactly, I was surprised by the lack of outrage at this.

leokennis4y ago

15 years ago the first word that came to mind when thinking of Google was “magic”.

10 years ago “useful”.

These days it’s just “dread”.

bxparks4y ago

Gigachad4y ago

notyourwork4y ago

I used to be Google-first; now I look at all options and decide if it's worth coupling something else in my life to Google. In many cases it's not worth it, or even required.

matheweis4y ago

> I still find utility in Google Maps and GMail.

notyourwork4y ago

aendruk4y ago

fsflover4y ago

> I still find utility in Google Maps and GMail.

Concerning the maps, try https://openstreetmap.org

Concerning the GMail, see this: https://news.ycombinator.com/item?id=30051054

notyourwork4y ago

accelbred4y ago

The Gmail app on Android also replaces all the links in your emails with Google tracking links which is not okay.

TT-3924y ago

"Thanks for helping google keep the web safe"

Interesting thing to add in there, how on earth does copyright stuff have anything to do with safety?

tyingq4y ago

Not that I agree with it, but here's the FBI view:

"Not only can the violation of intellectual property rights damage the economy, it also poses serious health and safety risks to consumers, and often times, it fuels global organized crime."

https://www.fbi.gov/investigate/white-collar-crime/piracy-ip...

No helpful detail on why it's not safe.

zerocrates4y ago

Since they went wide with "intellectual property rights" there, the references to health and safety are probably more in the realm of trademark and maybe patent... think counterfeit drugs.

You can probably gin up a copyright example from, I dunno, the DRM system on some medical device or something, though that's obviously not the real focus of their copyright enforcement work.

coliveira4y ago

But drug safety is not an issue of copyright, but of physical control of medications. You don't need to break the copyright of a drug to create and distribute fake medication.

EdwardDiego4y ago

https://www.nzherald.co.nz/nz/dotcom-wins-settlement-from-po...

thomond4y ago

That's the FBI's general view on IP rights infringement and covers more physical products. You can apply of that to file sharing.

mminer2374y ago

I assume that message is used for both trojans and copyrighted content and that there's quite the overlap there.

userbinator4y ago

hedora4y ago

Second offense, corporate security breaks your knuckles to prevent a third strike?

j0ba4y ago

Because they're doing God's work, and how dare you question their motives?

lhorie4y ago

I have a pet theory that all of these recent Google bloopers could be explained easily if you start from the assumption that Google internal incentives promote efforts to cut costs such as storage.

munificent4y ago

I don't think there's anything specific to Google. I think the chain of events was basically:

2. Storage gets much cheaper.

3. Seeing that, companies like Google and others offer "unlimited storage" by projecting the observed user behavior from (1) onto the storage costs of (2).

5. Companies now have to adapt to the reality of (4).

I don't think there was anything particularly nefarious or shitty on the part of any participant. It's just the nature of big complex iterated systems with emergent properties.

lhorie4y ago

> I don't think there was anything particularly nefarious or shitty on the part of any participant. It's just the nature of big complex iterated systems with emergent properties.

Gigachad4y ago

withinboredom4y ago

> explained by a desire to not spend money on storage for "low value" data

onion2k4y ago

If Google has the copyright on "1", they only need to get "0" as well and they'll have everything.

tyingq4y ago

Already there, I made some test files. The ones with "1", "1\n", and "0\n" are all now flagged. https://news.ycombinator.com/item?id=30063319

So, "someone" has them copyrighted.

misnome4y ago

All copyright-infringing files are ~50% 1's, therefore there is a 50% chance of every file with a 1 in being copyright-infringing!

Statistics don't lie, which is presumably why google employs so many of them, to calculate these efficiencies.

denton-scratch4y ago

> Statistics don't lie

However statistics can be used to confuse.

If a file is 50% 1's, then a 1-digit file has a 50% chance of infringing. More than that, the chance of infringement grows pretty fast.

Also, if "1" is copyright, then a file with a "1" in it is infringing; there's no chance there. It's certainty.
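
To put numbers on "grows pretty fast": assuming each bit is independently 0 or 1 with probability 1/2 (an idealisation, real files aren't random), the chance that an n-bit file contains at least one 1 is 1 - 2^(-n). A quick sketch, with `p_contains_one` being a name made up for this example:

```python
def p_contains_one(n_bits: int) -> float:
    """Probability that n independent fair bits include at least one 1."""
    return 1.0 - 0.5 ** n_bits

for n in (1, 2, 8, 32):
    print(f"{n:>2} bits: {p_contains_one(n):.10f}")
```

One bit gives 0.5, two bits 0.75, and a single byte is already above 0.99.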

gvb4y ago

0 = 1 - 1 so they have everything already!

smnrchrds4y ago

Only if they have patented -

wlesieutre4y ago

Not - as an abstract concept, but perhaps "a method and apparatus for subtracting numbers"

gvb4y ago

Point of order: you've changed the method of restriction from copyright to patent.

version_five4y ago

Or XOR

vmception4y ago

Here is a joke that relies on the assumption that the platform is suggesting they own the copyright, instead of someone else that isn't the user.

kingcharles4y ago

God, I'm glad I stored all my files using quantum superpositions.

bluecheese334y ago

https://www.smbc-comics.com/comic/2012-08-29

nocturnial4y ago

Just call google support... oh... wait... right...

I wonder how many ads we need to watch before google implements something even remotely similar to user support? How many billions are enough before we get support?

Post something about google killing cute kittens.

I wouldn't be surprised but I would be interested in that story.

verytrivial4y ago

mastazi4y ago

https://news.ycombinator.com/item?id=27858032

dmitrygr4y ago

"A review cannot be requested for this restriction"

I always did say that Franz Kafka never died. He is semi-retired working in google’s PM org, occasionally consulting for the UX teams as well.

jacquesm4y ago

Pretty weird that Google would be scanning files for copyright infringement in the first place, it's supposed to be a Drive not the enforcement arm of the copyright mafia.

Gigachad4y ago

everyone4y ago

It's so dystopian / Kafkaesque it's like a parody.

"Thankyou for helping google keep the web safe"

followed by...

"A review cannot be requested for this restriction"

slig4y ago

Computer says no.

Qub3d4y ago

Always operate under the assumption that iCloud (Apple), Microsoft and Google will delete any/all of your data, with no notice, and for no reason.

Because they explicitly reserve the right to do so in their TOSes.

Not your computer, not your data etc.

(https://www.quentb.com/posts/diy-cloud-backup/)

quickthrower24y ago

Cloud drives are a cache. (Actually treat all storage as a cache, i.e. the data will be lost, it's just a matter of when.).

Qub3d4y ago

Good advice. This is why I encourage anyone with sensitive digital data (photos, important receipts, etc) to set up a 3-2-1 backup:

3 copies of the data,

2 of which are on different, local mediums (i.e. hard drive and a thumb drive)

1 of which is offsite (cloud storage, safe deposit box, salt mine, etc)
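
If you want to sanity-check a backup plan against those three rules, here is a small sketch. The `Copy` record and `satisfies_3_2_1` function are hypothetical names for this example, and it treats "different mediums" as simply different labels.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Copy:
    medium: str    # e.g. "hard drive", "thumb drive", "cloud"
    offsite: bool  # True if the copy lives away from the original location

def satisfies_3_2_1(copies: list[Copy]) -> bool:
    """3 copies total, 2 on different local media, at least 1 offsite."""
    local_media = {c.medium for c in copies if not c.offsite}
    has_offsite = any(c.offsite for c in copies)
    return len(copies) >= 3 and len(local_media) >= 2 and has_offsite

good = [Copy("hard drive", False), Copy("thumb drive", False), Copy("cloud", True)]
cloud_only = [Copy("cloud", True)]
print(satisfies_3_2_1(good))        # True
print(satisfies_3_2_1(cloud_only))  # False
```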

PragmaticPulp4y ago

That said, I agree 100% that you shouldn't rely on a single point of failure for any backup. Data must be in at least two places.

Qub3d4y ago

For the record, that is fine, legally speaking. It just is something that I think we don't keep in mind, until our Gmail login gets locked[1] and our last backup was 6+ months ago.

[0]:https://tosdr.org/en/service/217

[1]:https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...

Gigachad4y ago

I had one of my google docs files restricted which was just a school group project. No one was able to access it.

panarky4y ago

> they explicitly reserve the right to do so in their TOSes

Do you have an example of Google explicitly reserving the right to delete any/all data with no notice and for no reason?

Qub3d4y ago

https://edit.tosdr.org/points/14762

Note the wording "we reasonably believe". There is nothing objective about this.

Additionally: https://policies.google.com/terms#toc-removing

> Removing your content

[0]: https://support.google.com/docs/answer/148505

panarky4y ago

> If any of your content (1) breaches these terms, service-specific additional terms or policies, (2) violates applicable law, or (3) could harm our users, third parties, or Google

These are definitely reasons, are they not?

For the record, would you please retract your claim that Google reserves the right to delete all data with no reason?

unclekev4y ago

Meanwhile my Mom uses Google Drive to share pirated movies with family members (despite my protests) and is yet to have a single file flagged.

Just need to name your file something like "Output04.S01E01.NumberOne.1080p.HEVC.x265-MeGusta" and you'll be fine /s

How can they get things so wrong?

Animats4y ago

File a DMCA counter-notice, of course.[1]

You may have to do this the hard way, via Google's address for service of process.[2] Use registered mail or FedEx.

There's also the option of taking Google to arbitration. Legal advice from one of those "free quick consult" services may be helpful.

[1] https://www.nolo.com/legal-encyclopedia/responding-dmca-take...

[2] https://support.google.com/faqs/answer/6151275

watusername4y ago

Can you file a counter-notice _in absence of_ a DMCA notice? The problem at hand is not DMCA.

Animats4y ago

Google claimed a copyright violation and did a takedown. That's what DMCA counter-notices are for. They phrased it in other terms, but ask a lawyer if that matters.

manquer4y ago

The lawsuit is not strong either because their ToS says they can delete all your data for any and no reason at all.

newhotelowner4y ago

I am a small business owner. I pay for Google One so that all my files are backed up and synced across devices. I also pay for Backblaze to back up all my files (just in case Google screws me).

Is there an alternative for encrypted backup & sync between different computers?

Filligree4y ago

Plenty. One of the easiest to use is probably Syncthing, if you’re thinking of self-hosting.

hyperdimension4y ago

You're talking about their subscription, Google [Removed for DMCA Violation]?

kbumsik4y ago

I have used Dropbox for almost 10 years and it has worked well for me so far!

Dropbox offers great file history and restoration support. One day I deleted files permanently then the team kindly supported my case to restore the files within a day.

manquer4y ago

Depending on how technically inclined you are something like Tarsnap might be a good fit. [1]

[1] https://www.tarsnap.com/

mbrukman4y ago

Disclosure: I work at Google, but not on the Google Drive team specifically.

Sorry about the issue, folks! The Google Drive team is aware of it and is working on remediating it.

And thank you all for the many test cases! :)

Ansil8494y ago

> The Google Drive team is aware of it and is working on remediating it.

So reviews of copyright infringement claims _can_ be requested, but only if they reach the front page of HN? That is not OK.

PaulHoule4y ago

Must have infringed on Metallica.

gpderetta4y ago

Must be google's new Easter egg...

iszomer4y ago

"it's not a bug, it's a feature!"

itronitron4y ago

I guess the moral of the story is, never do business with a company that doesn't provide a mailing address to which you can mail a turd (at book rate.)

Gigachad4y ago

Or just use google takeout to grab regular dumps of your files

daneel_w4y ago

And googling "15.91/4" throws a SafeSearch alert letting us know that "some results may be explicit".

Devasta4y ago

No pity, if you are using Google products for anything important you only have yourself to blame.

1vuio0pswjnm74y ago

Maybe for those times when copyright infringers try to split an infringing file into separate files containing only one bit, represented as text, to avoid detection. No, I am not serious.

Try testing a file that contains more than a single 1 or 0, such as 01111000.
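
For anyone who wants to repeat the experiment, a small sketch that writes the test files discussed in this thread, plus the suggested 01111000 case, into a temporary directory for manual upload. The file names and the `write_cases` helper are invented for this example; nothing here talks to Drive.

```python
import tempfile
from pathlib import Path

# Byte-exact contents; names are arbitrary.
CASES = {
    "one.txt": b"1",
    "one_lf.txt": b"1\n",
    "one_crlf.txt": b"1\r\n",
    "zero_lf.txt": b"0\n",
    "byte_pattern.txt": b"01111000",
}

def write_cases(directory: Path) -> list[Path]:
    """Write each test case as raw bytes so no newline is added or translated."""
    paths = []
    for name, content in CASES.items():
        path = directory / name
        path.write_bytes(content)
        paths.append(path)
    return paths

out_dir = Path(tempfile.mkdtemp(prefix="drive-test-"))
for p in write_cases(out_dir):
    print(p.name, p.stat().st_size, "bytes")
```

Writing bytes rather than text matters here, since the "1\n" and "1\r\n" variants reportedly behaved differently.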

reaperducer4y ago

I've said it before, Google is trying to be the new Microsoft.

https://www.theonion.com/microsoft-patents-ones-zeroes-18195...

ahsima14y ago

I wonder if it's part of some sort of cyberattack. Someone knows that deleting a file containing a "1" or "0" from the target's gdrive will break something they want broken, so they filed a false DMCA claim.

dibujante4y ago

Maybe it's a low-key attempt to disrupt the use of Google Drive as a command and control source for botnets? Seems very easy to work around.

woliveirajr4y ago

Google Drive answered:

"Hi Dr. Emily Dolson, thank you for letting us know about this issue! The Drive team is very much aware of this now thanks to all of you we're working on it!"

hderms4y ago

marivilla4y ago

Google Drive also offers client side encryption, which would make this scanning ineffectual: https://flowcrypt.com/blog/article/2021-06-14-google-workspa...

So as long as you have a ton of money and are a corporation your privacy should be just fine

Snuupy4y ago

You can do the same with rclone crypt or cryptomator.

mateo14y ago

raydev4y ago

Looks like the Google Drive team is aware of it. Wonder what happens next.

https://twitter.com/MishaBrukman/status/1485804925561057291

diogenesjunior4y ago

I feel bad for the new hire who wasn't entirely sure what he was doing. Something similar happened at reddit[0], wouldn't put it past google.

0: https://redd.it/m0rmux

jacob0194y ago

Another example of why it is time to dump google. With google you are the product, not the customer. There are decent alternatives for everything that google offers. It feels really good to do.

flykespice4y ago

Seriously, I'm upset that Google Drive can block files that you own; I feel my trust was betrayed. We're really moving to a dystopian age where companies can control your personal data.

pmontra4y ago

Is this an unintended adversarial attack to some copyright classifier?

Grismar4y ago

I wonder: is this a technical issue, or just a practical joke by someone who has managed to convince Google Drive that they have the copyright to files containing only "1"?

shmerl4y ago

Copyright was always bizarre in the sense that any information can be expressed as numbers. So why are some numbers more copyrightable?

Also reminds me of this ("Microsoft Patents Ones, Zeroes"): https://web.archive.org/web/20100607151726/http://www.theoni...

otterley4y ago

Any novel can be expressed as a sequence of alphabetical characters. So why are these copyrightable?

shmerl4y ago

Exactly the question you should wonder about. So translating into abstract form, different numbers have different legal application :)

Illegal numbers anyone?

ccbccccbbcccbb4y ago

No surprise here. One World Corporation is going to reserve 1 for itself. This tweet is just a test drive.

luckystarr4y ago

Could this be a rare case of a SHA-whatever collision? Or do they use MD5 to identify these files?
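
For what it's worth, here is a quick sketch of what a hash-based matcher would see for the byte strings tested elsewhere in this thread, using Python's `hashlib`. Whether Drive matches by hash at all is an assumption on my part. The point is only that "1" and "1\n" have different digests, so one blocklisted hash would not match both.

```python
import hashlib

samples = [b"1", b"1\n", b"1\r\n", b"0\n"]

for data in samples:
    md5 = hashlib.md5(data).hexdigest()
    sha256 = hashlib.sha256(data).hexdigest()
    # Print the full MD5 and a short SHA-256 prefix for comparison.
    print(repr(data), md5, sha256[:16])

# Every sample hashes to a distinct value under both algorithms.
distinct_md5 = len({hashlib.md5(d).hexdigest() for d in samples})
distinct_sha = len({hashlib.sha256(d).hexdigest() for d in samples})
print(distinct_md5, distinct_sha)
```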

Ansil8494y ago

Are there any updates on this? Like any accountability or an explanation why this happened?

userbinator4y ago

I wonder how quickly this would have been fixed if it happened to a Google employee.

hedora4y ago

This is a great example of why all cloud services should be end to end encrypted.

wanderingmind4y ago

Maybe just use rclone or other tools to store encrypted files at rest in drive.

kats4y ago

May want to back up your Google account if you see a message like that.

golem144y ago

That's Number Wang!!!

jpambrun4y ago

Google's bots are crazy. Thank god they sold Boston Dynamics...

loudtieblahblah4y ago

Protondrive is a thing.

Zero knowledge storage needs to be the default everywhere.

ComradePhil4y ago

As is Mega. I don't know why one would ever use Google Drive, OneDrive or Dropbox when Mega and Protondrive are available.

loudtieblahblah4y ago

The collaborative apps aspect of Suite/Drive is a wondrous 1-2 punch

dwaite4y ago

So who is going to run out and get a tattoo a la DeCSS?

fortran774y ago

At least it wasn’t flagged for illegal porn, too.

davebailey4y ago

This could easily have been a test value.

mbfg4y ago

I remember BlackDuck flagging a one pixel (white) image as a copyright infringement in our product.

spaniard_dev4y ago

Who said that “Don’t be evil” thing?

davebailey4y ago

This might have been a test case.

olliej4y ago

1 is the loneliest IP :D

gumby4y ago

what about an empty audio file four minutes and 33 seconds long?

TedShiller4y ago

Maybe but anyone can claim that on Twitter

zydex4y ago

It's been reproduced by other users, see below comment in this topic.

https://news.ycombinator.com/item?id=30061080

TedShiller4y ago

Interesting, you’re right then.

The thing with Twitter posters is they often don’t understand what they’re doing.

superkuh4y ago

xiphias24y ago

If the megacorps buy up the best competitors, there's not much chance left for people, just use them.

gruez4y ago

xiphias24y ago
