Internet Archive: Security breach alert (opens in new tab)

(theverge.com)

1091 pointsewenjo1y ago607 comments

607 comments

Just in terms of privacy, it's worth noting that anyone who has uploaded something on IA already has their email address publicly viewable.

This isn't something that commonly known (even judging by comments here) but in the publicly viewable metadata of every upload it contains the uploader's IA account email address. So from a security perspective it's bad but from a privacy perspective a lot of users probably weren't aware of this detail if they've uploaded anything.

hunter2_1y ago

This raises an interesting question: should email addresses be private? Addresses of buildings aren't private, and they're somewhat analogous as with many computing concepts. (Aside: Before spam filters were quite good, it was typical to avoid scraping of addresses by mild obfuscation, but I think those days are gone, and this is distinct from privacy anyway.)

If someone wants to upload and never be found out, then they need to use a throwaway address in any case, lest they be providing their "private" address to the administrators of the service without explicitly forbidding further disclosure. If I say something to Alice without demanding that Alice keep it from Bob, then I implicitly don't mind if Alice tells Bob what I said.

tjoff1y ago

Whether the email is considered private or not is completely orthogonal to whether you are allowed / should tie an action to your email. And then again completely orthogonal whether you can/should make that connection public.

Even if your email is public information and even if what is uploaded is public information that doesn't imply that the email address behind the account that uploaded that information should be public.

2 more replies

emidoots1y ago

There is software which is intended to e.g. locate the GitHub profiles of people working at companies, then scrape all public repositories they've contributed to for their email address and the emails of their coworkers - to enable targeted advertising to those individuals. Very common in enterprise sales.

With ChatGPT, this can be extended to create emails that look very personal - as if someone has followed all of your work and is genuinely interested in what you are up to - with extremely low effort. And people are already doing this, I already get emails like this today.

Should emails be private? I don't know - I personally consider them to be public because I know for a fact mine will eventually be public whether I like it or not. But I am aware AI is out their slurping up every public communication I've ever had, and is likely trying to manipulate me in various ways already today.

4 more replies

II2II1y ago

> This raises an interesting question: should email addresses be private? Addresses of buildings aren't private, and they're somewhat analogous as with many computing concepts.

There are several ways to look at that.

The organization that I work for considers anything that ties two pieces of information about a person together as private information. That is to say that a person's name is not private and a phone number is not private, but connecting a phone number to a name is private. In one form or another, an email is frequently tied to a name (e.g. the email address is based on their name, or an account record includes both a name and an email address).

Another way is to consider how accessible the information is. There was a lot of information that was not considered as private prior to the widespread adoption of the internet. One issue that I remember popping up in the early 1990's involved property (i.e. land) records. Historically, people had to go to a government office to access them but they were publicly available. Since they were publicly available, some governments made them available online. Once they were available online, the barriers to access were removed (e.g. having to physically visit an office) and the ability to abuse that information was vastly increased. All of a sudden, people started considering something that used to be considered as public information as private information.

Springtime1y ago

An issue is for most sites/services an email has just become a standard authentication method, rather than something that can easily be more unique per account. So any usernames across sites/services that share it identify that user as being the same person (for data broker profiling, doxxing, etc), which is the privacy issue (not the email address per se, unless it perhaps contained one's real name).

For contrast truly unique email aliases for example aren't possible on common services like free Gmail*, only things like self-hosting/certain paid email hosts, which makes less feasible for many. So from a privacy perspective while in an ideal world everyone would be able to freely create entirely unique per-account creds we're mostly stuck with the email implementation.

* One could create entirely separate accounts but it's high friction and IIRC the same phone number (now a requirement) can only be used for 2-3 accounts.

2 more replies

KronisLV1y ago

> This raises an interesting question: should email addresses be private?

I sadly don't think that's viable.

What might be, in our current world, would be having a mail server/client setup where you can generate random addresses for yourself like Wf1JJUBHLu@domain.com and never re-use an e-mail address, much like with passwords, while being able to see all of the incoming mail in the same place and respond with the corresponding accounts.

Then, when your address gets traded around, it'd be fairly obvious (with some basic bookkeeping, e.g. a text field with purpose/URL for why a certain address was created) who is to blame for it and blocking incoming traffic from somewhere would be trivial as well.

I do have a self-hosted mail server and there are commands to create new accounts pretty easily, I'd just need to figure out the configuration for collecting everything in one place, as well as maybe make a web UI for automating some of the bits. I wonder if there are any off the shelf solutions for this out there.

2 more replies

squarefoot1y ago

> This raises an interesting question: should email addresses be private?

Yes and no. Both of them. As any powerful tool, email is going to be abused, like any other alternative would be when it will come one day. Those services allowing creation of dynamic email addresses do their job (until they're banned, that's why I'm not mentioning them), however using them isn't automatic and most people don't even know about their existence. What if we then did upgrade email protocols to reflect current needs wrt privacy and modified existing mail servers so that they could create dynamic addresses when asked by a simple flag? Example: I want to subscribe to a service from company XYZ, however I'm not sure how much I can trust them, therefore, when writing an email or filling a web form I can activate the option to create a new address that is tied to the recipient I'll be writing to, and will work as a dedicated proxy for my real address, that is, every mail I send to the recipient using my real address will be actually sent from the new dynamic address, then all replies to the dynamic address will be routed to my real one, but a field in its headers will always contain either a memo by me (example: "signup with XYZ") or the original recipient (example: "info@xyz_trustuswerenotspammers_yeahsure.com"). This way one can immediately spot whoever sold their address to others and blacklist them. As said, those services work well but not being built in into mail servers and clients their adoption is quite restricted. I don't see why that function shouldn't be embedded in a new upgraded email protocol as the modification would neither be that hard nor consume any serious resource. I would however expect heavy resistance against the adoption, of course.

tomjen31y ago

In a world where email costs ten cents to send (per receiver) email addresses need not be private. In our world? They kinda need to for sanity.

1 more reply

numpad01y ago

I think it just needs to be communicated. Some websites allow login only by login name and not by email, some people have identifying last name, others hardly identifying full name and whatnot. There's no universal or universally agreed answer to that, so it needs to be said whether your service _consider_ it public information or not.

makach1y ago

Pr definition the email address is considered as private information and should be protected accordingly.

figassis1y ago

It should, mainly because an email is not just an email, it's a channel to reach otu to you, your internet address. And we know how that is going in your inbox.

weinzierl1y ago

This raises an interesting question: should email addresses be private?

GDPR is clear on this and there have been significant fines for revealing email addresses against the will of their owners (e.g. using cc instead of bcc). Not saying this is the ultimate wisdom, just a data point to consider.

1 more reply

iicc1y ago

>Addresses of buildings aren't private, and they're somewhat analogous as with many computing concepts.

Buildings are analogous to domains, not email addresses.

fortyseven1y ago

> should email addresses be private?

I dunno. Should your personal phone number be private? Or your home address? Would you be okay if I knew it and shared it with a stranger? Or would you rather be asked permission to share it first?

Seems pretty cut and dry to me. Yeah, there's going to be someone out there (there always is) who doesn't care, but I'd wager the majority would be pretty ticked off if you gave those pieces of information out to a rando on the street.

6 more replies

szundi1y ago

This question could not be more academic

keybpo1y ago

It's not just uploads but any item that uses the email address as a unique user identifier (I'm not technical enough to explain this clearer but [1]).

An email address will be part of the xml in his uploads but also in his profile, which anyone can access by simply changing the url from https://archive.org/details/@foobar to https://archive.org/download/foobar. So, in essence, one just needs to have a registered account, independeltly any uploads made.

[1] https://help.archive.org/help/accounts-a-basic-guide-2/

steffanA1y ago

This is bad enough. This alone is a privacy bug/data leak.

Theoretically, someone could scrape the pages and compile a list of exposed email addresses.

1 more reply

rrwo1y ago

One solution is to use a unique email address for every website, and change the address if the site gets compromised (with the old address getting added to a spam filter).

9999000009991y ago

A pulled an old friends website down from Internet Archive.

He's moved on the next stage, but I was glad I was able to put his site back up.

It'll be a shame if IA goes down permanently, but we need a decentralized solution anyway.

Having a single mega organization in charge of our collective heritage isn't a good idea.

gabeio1y ago

I have always thought about this. It would be interesting to have users actually store small amounts of redundant info on a device connected to the internet. Very similarly to what a torrent does but with more peers (more data shards than full copies) and less seeds. And try and keep a huge database for everyone. Obviously open source and it would end up something like tor where they just assist the network with security patches but they don’t actually have any real “control” (admin dashboard control) over the network at large. We already do something smaller but like that with website static file caching, but at much smaller scale. Obviously security implications of this would be very hard but maybe not impossible to overcome. ipfs comes close but it again does more seeds then peers.

if anyone knows something like what I'm suggesting, I'd love to hear about it!

pbhjpbhj1y ago

IIRC there were a few storage based projects that popped up using alt coins to encourage people to offer excess storage space for other randos on there internet. The possibility you might be storing illegal content might have been what killed it/them.

https://en.wikipedia.org/wiki/Cooperative_storage_cloud gives a few examples, like Filecoin.

1 more reply

IAmGraydon1y ago

Are you, by any chance, named Richard Hendricks?

xyzsparetimexyz1y ago

The main issue that such hosting faces is that it's less efficient and more expensive than just regular centralized servers.

1 more reply

rottc0dd1y ago

Does https://ipfs.tech/ fit the bill?

Geezus_421y ago

This was a plot line in Silicon Valley.

Xen91y ago

I believe that it would be possible to cost effectively build and implement an architecture for a distributed IA backup—this comment entails some notes.

The system that asks volunteers about their age, sex, location, and storage format details (the model, past use etc. can be used to predict the durability of a single storage) without sharing most of this data anywhere.

The downloaders are then algorithmically allocated pieces of the archive. Exampli gratia such that there is at least limited amount of overlap between the pieces, and two people same country won't provide redunancy for each other.

When a downloader verifies that they have completed the download by giving (unique, to prevent fake-download sabotage) SHA hashes of the data, the information that these pieces have been downloaded in this or that country, plus an estimate of the reliability of the storage, is added to a public database, for the algorithm to use in the future.

Every downloader is then generated a public and private key so that they can give the hash of their download again once in a while or just verify that the piece is still there. The reliability estimates (based on storage / hardware details) would be empirically calibrated based on the data about the actual storage failures.

A public counter, estimating how well the archive is currently backed up via this scheme, could be displayed.

For copyright issues, it would be possible to encrypt some of the data, e.g. such that normally borrowable items become readable files only when X% of downloads are pieced together.

The scheme would be primarily based on existing designs and algorithms but work roughly as depicted above. I am not an expert of what compression, hashing and other algorithms should be used, and it needs lots of good work, to determine how to avoid errors in the scientific part of estimating the reliability of the downloads—and generally a situation where it would turn out that lots of data was lost when attempting to put the pieces back together again.

Remark (engineering): To empirically validate the correctness of the software of the backup architecure by testing it on grids of real hard drives in single places will probably give safety against catastrophic failure. Even better would be to obtain large amount of old hard drives and SSDs kept in a single place for a long time, to validate that the software works over time.

Remark (integrity): That a downloader actually has the downloads can be verified efficiently by IA server adding small part to the piece the downloader has, hashing it again, and requesting the new hash.

Remark (redunancy): It may be possible to develop a social program that analyzes whether a volunteer in certain place can provide more redunancy by buying themselves a hard drive or by supporting the acquisition of hard drives for volunteers who have proved themselves realiable elsewhere. This is speculative and the benefit may be lower than the risks.

Finally, instead of "public database" it may be much more optimal to decide to use a blockchain of some sort. Not a cryptocurrency, but a blockchain. This is because if the idea is to distribute copies over the world to ensure continguency in case of IA main architecture collapse, then the more parts of the distributed backup architecture (which must actually not be "the backup architecture" but "a scheme", that no everyday IA decisions rely upon, and that just exists out there) are on a blockchain network run by a "decentralized" system, the more reliable it will be.

My heuristic plausibility analysis: 0. IA backup would not need to be constantly accessed or changed (this makes storage easier, cheaper and prolongs the maximun age of the storage) 1. Not all IA has to be backed up: a distrobuted backup that successfully recovers 10% of IA in a catastrophe is by all means a great success (consequently priorization of what might / should be stored should probably be part of the algorithm that decides what volunteers download; and what existing "big" archives already store that overlaps with IA should be taken into account in this analysis) 2. I recall you estimated 30-40 M USD ballparks for a single copy: a properly led open source project may be able to develop this for free, and fairly compensated one could be ~ 0.1% to 1% of the cost. 3. The Sia network https://siascan.com/ has space for 7PB; and it's for storage where one can download their own files at any time; and they have had very little publicity. 4. 2TB hard drive costs 50-100 USD and 20PB would be 10 000 humans buying one 2TB hard drive which by itself is possible. Hobbyists and organizations may be able to provide even larger capacities. 5. Most IT projects fail, but since lots of technology already exists and in this we know what we are doing and IA might be able to recruit above talent we can conservatively, give conservatively 50% chance the groundwork development to succeed, or 45% without funding. 6. If the develoment succeeds, then there may already be around ~ 100 potential volunteers. I estimated that 0.1% IA visitors may volunteer, plus 1% from Hacker News traffick were to project to be mentioned there, plus growth over first few years and traffick from elsewhere. Perhaps 75% chance to get 10% of IA backed up by volunteers, given development succeeds. 7. If that much is backed up, there is perhaps 5% of attaining 200 TB in next few decades.

Conservatively, given that open-source development starts, one gets apprx. 33% - 38% chance that 10% backup is achieved & apprx. 1-2% that 100% of what is now in the IA, could be backed up. These are of course rather meaningless numbers, but the fact seems that in the lack of funding to build a complete backup IA can best guarantee continguency by starting to build a distributed one. Perhaps this was needlessly lots of words for a simple proposal.

- X

---

Note: It's probable that at least the NSA has a private full IA backup.

max-throat1y ago

This is why BitTorrent and other P2P solutions were invented, but alas: A. The RIAA, MPAA, and ESA have given these technologies a terrible reputation. B. Nobody likes to seed. Some kind of seeding-based crypto would have been a great incentive if cryptocurrency wasn't also demonized by now.

1 more reply

aucisson_masque1y ago

It's called torrent protocol and it doesn't work, no one wants to spend money and bandwidth hosting a god forsaken movie or book that only a handful of people care about.

squarefoot1y ago

Not much money and bandwidth if you aren't on a metered connection. You can share tens of gigabytes or more on a cheap read only flash plugged into into a $25 single board computer that draws way less than a full PC and can be left sitting there near the router. Just limit its bandwidth on the torrent client and you won't even notice it during online gaming. The client can be as small as the Transmission daemon running headless on one of the many Debian based embedded distros: all control through either the web interface or from its client: no monitor, mouse, keyboard etc. just a small cheap box.

https://www.friendlyelec.com/index.php?route=product/product...

(just an example, as it's way overkill for the task)

https://transmissionbt.com/

https://github.com/transmission-remote-gui/transgui

oxygen_crisis1y ago

I see 24 seeders for the entire 72-episode run of the 1991 sitcom "Herman's Head" which was so poorly rated that it's never seen a home media or streaming release, your premise doesn't hold any water at all.

1 more reply

0x1ch1y ago

It does work, when you don't notice it. We need sane limits and permanent seeders. This is why so many regular people get hit with ISP notices, they don't know they've seeded Captain America for the last six months every time they started their PC.

1 more reply

Timber-65391y ago

If the whole world has bandwidth available for TikTok, it can make the same available for sharing torrent files.

homebrewer1y ago

I've been seeding some unpopular torrents for ten years (would have done for even longer if I did not change the torrent client a decade ago). "No one" is too strong a word, as usual with these absolutist things.

1 more reply

trinix9121y ago

In addition to the costs, I'd say it's also that no one wants to risk getting sued like the IA is getting.

EamonnMR1y ago

I keep wanting to do this for old sites, make like a personal mini IA. Besides just using wget or curl, any tips for pulling down useable complete websites from IA?

account421y ago

Agreed, especially an organziation that has already shown to not always be impartial.

Simran-B1y ago

A decentralized solution, doesn't that scream internet archive on blockchain? What could go wrong.

brundolf1y ago

This is one of the very few real use-cases I can think of for the blockchain

micromacrofoot1y ago

torrents maybe

steffanA1y ago

More details here about the data breach. Stolen database contains 31 million records.

https://www.bleepingcomputer.com/news/security/internet-arch...

ano-ther1y ago

> the Have I Been Pwned data breach notification service created by Troy Hunt, with whom threat actors commonly share stolen data to be added to the service

Do they? Why?

Maxious1y ago

Proves they really did hack something. There's other sites where hackers register defacements etc.

richbell1y ago

If Troy authenticates the data, they can use that as an 'endorsement' when trying to sell it.

3 more replies

xproot1y ago

Anyone who buys it or finds it in the wild can also upload it.

mkl1y ago

> The data will soon be added to HIBP

My unique-to-archive.org email address is not there yet.

nikisweeting1y ago

I just checked and my unique-to-archive.org email is showing up in the breach as of 2024-08-09.

2 more replies

paulnpace1y ago

Many hackers will remove addresses that are obviously unique, including tags, to keep silent which database has been hacked, but it seems inconsistent.

I have checked and known my address was in a hack and it isn't there, while other times it is there. I also wonder if they start filtering out by domain, as they see a domain across multiple databases with unique addresses in each database exactly one time.

mobeigi1y ago

Out of curiosity, do you use a unique email address for every single service?

2 more replies

ranger_danger1y ago

How do they get a hold of all these leaks so fast?

1 more reply

maltris1y ago

My question is: How did Scott Helme end up with a password hash that features his own name?

jgrahamc1y ago

He didn't. If you break down that field you see:

    $2a$
    10$
    Bho2e2ptPnFRJyJKIn5Bie
    hIDiEwhjfMZFVRM9fRCarKXkemA3Pxu
    ScottHelme

2a = bcrypt, 10 = 2^10 rounds, Bho2e2ptPnFRJyJKIn5Bie is the 22 character salt, hIDiEwhjfMZFVRM9fRCarKXkemA3Pxu is the 31 character hash value, and then there's ScottHelme. Best guess is that the archive.org folks just appended the user name to the stored hash. Maybe once upon a time they didn't have a username column in their table and this was a creative way of adding it.

Funes-1y ago

Friendly reminder to generate a unique password for every account you create so database leaks like this one don't bother you (besides on the site they're used).

AStonesThrow1y ago

https://xkcd.com/2176/

2 more replies

JohnMakin1y ago

MFA

1 more reply

haha1121y ago

I use login with google, idk if it is safe

ewenjoOP1y ago

Just noticed the site now alerts this:

> Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP!

mewpmewp21y ago

Jokes on them... I'm already on HIBP countless of times...

jsheard1y ago

It's all good, as long as you're not in that recent AI Girlfriend breach which exposed a ton of users who were trying to coax it into generating CSAM images.

https://x.com/troyhunt/status/1843788319785939422

1 more reply

to-too-two1y ago

I'm also on HIBP over 10x. What are we supposed to do? Create a new email address for every service we sign up for?

I don't know what the best practice is for keeping our personal data safe anymore.

6 more replies

nxobject1y ago

And my SSN's probably available for purchase with 9 types of crypto, too.

mendym1y ago

I assume that if this is a bad actor, then account email/name will be leaked?

uticus1y ago

Is it a genuine alert, or hacking artifact?

Sometimes with friendly / attempt-at-humorous error messages it’s difficult to tell

jrochkind11y ago

I feel like it's safe to assume the official Internet Archive would not write a "friendly"/attempt-at-humurous/unprofessional/confusing/delivered-by-popup message advertising a devastating security breach. Oh also while announcing that nowhere else.

Obv an attackers ability to insert a message does imply a breach beyond a DoS. But I am pretty confident that message was not from the IA.

n_i_k_h_i_l1y ago

It's a literal window.alert()

1 more reply

EKSolutions1y ago

It looks like someone has compromised one of their subdomains for Polyfill

Update: Subdomain seems to be returning normal responses again now.

Aachen1y ago

You mean the IA included some JS polyfill from a subdomain and that's what's compromised / where the alert is coming from?

mendym1y ago

Yup.

https://news.ycombinator.com/item?id=41792651

qnsc1y ago

yes, "https://polyfill.archive.org/v3/polyfill.min.js?features=fet..." is the URL with the malicious code

1 more reply

EKSolutions1y ago

Correct. The source subdomain of the popup seems to be hxxps[:]//polyfill[.]archive[.]org

jrochkind11y ago

That would perhaps explain how they managed to inject the JS alert popup, right?

TZubiri1y ago

Yeah, but the leak has been confirmed by HIBP, I found my address in there.

1 more reply

EasyMark1y ago

One of those instances when you really wish curses worked on whoever was pulling this stunt “may you and your descendants suffer the bites of 10000 fleas for 10000 nights as punishment for your misdeeds”

PenguinRevolver1y ago

Probably not the best time to say this, but it's surprisingly easy to go through a collection with items and grab every email along with the usernames.

https://archive.org/metadata/naturally_a_girl/metadata

One way or another, there was going to be someone who would take loads of emails with a username attached to it. A bit intrigued by how the hacker compromised the database and got the passwords.

fewgrehrehre1y ago

Damn, I had no idea about this. Definitely would've changed some things had I known that emails were public.

This honestly seems like a bit of a design flaw.

Gingeas1y ago

Yeah, they have ignored everyone's concerns about the email thing. https://github.com/internetarchive/iaux/issues/892

Nathans2201y ago

Why go for the Internet Archive go for something else not the fucking archive!

mewpmewp21y ago

We all need our easily accessible decentralized archive of some sort...

Nathans2201y ago

yes

pityJuke1y ago

This thread is looking like it'll be one of the first places this incident will be documented (seems to be on the top of Google).

Already there are two new users just for this.

mendym1y ago

i see more than 2

ewenjoOP1y ago

Yeah, I was looking around, but saw no mention of it anywhere until I realized it just happened.

iamtedd1y ago

I have had an IA account for a number of years, with a gmail address. Nine months ago, I changed the email address to a masked address using my own domain. Now I find that my gmail address was still stored, and was involved in the breach. Why? I get that they might store change history, but why?

BTW, for the current account details, I changed the password to another random string generated by my password manager, and also deleted the masked email address and generated another one, so going forward this sort of thing isn't that much of an issue for me.

keybpo1y ago

I have a similar situation, where I signed up with my main account and later changed IA's email to a more private address. It was the first email I checked on HaveIBeenPwned and it doesn't show up in this leak. The other couple IA accounts I have, whose emails and passwords are exclusive to them, they all show in this leak alright. I have no explanation to your situation but this was also my immediate though and I also wanted to give the opposite perspective.

account421y ago

It's also possible that the breach was earlier or going on for longer than reported.

marviel1y ago

https://www.reddit.com/r/DataHoarder/comments/h02jl4/lets_sa...

I found this reddit thread from /r/DataHoarder about backing up the internet archive particularly interesting, given the circumstances

numpad01y ago

50 PB * $0.014/GB = $0.7M. $0.014/GB is from[1], bare drive cost without chassis, power, or redundancy.

1: https://www.backblaze.com/blog/hard-drive-cost-per-gigabyte/

Aachen1y ago

How long does an average hard drive last? You'd have to spend that 700k every that many years (plus the extra bits you mentioned). Quite an operation actually

7 more replies

PostOnce1y ago

IA stores lots of redundant stuff in 5 file formats and none of them are particularly well-compressed, I think. There are (big) savings to be had, but maybe figuring that out (software dev and compute time) isn't worth it?

ks20481y ago

Interesting to compare their stated drive $/GB to their B2 offering: $6/TB/mo for "pay-as-you-go",

hard-drive price: $0.014/GB

B2 price (12*6/1024): $0.070/GB/year

1 more reply

nikisweeting1y ago

It's been tried several times, but it's hard because it's such a massive quantity of data. The IPFS backup never really got off the ground.

They have their own backups which I think is good enough for now unless someone plans on donating a few hundred million.

vincentpants1y ago

Oh no! I didn't know their IPFS initiative didn't pan out. What happened to it? I am surprised how hard it is to google. I remember interviewing for a role on that team at the archive to help move it to filecoin. Was so happy to hear that the effort was underway to decentralize their datastore. We need this more than ever.

1 more reply

pbhjpbhj1y ago

Perhaps you can persuade Elon that it owns the libs?

1 more reply

creer1y ago

Backup / duplication is not an easy project for sure. But meanwhile for now IA is a single organization operating under one legal system. And one technical setup, would be relevant today. That's a major weakness.

EamonnMR1y ago

Suppose we each backed up sites we cared about rather than trying to mirror the whole thing...

Aachen1y ago

A few minutes ago (22:48 UTC), I got three emails from HIBP about accounts of mine breached on the Internet Archive. Troy is quick! And I'm surprised the author of that alert() actually had the data as well as followed through

Bit of a shame the emails contain an ad for a password manager, saying there's two easy steps to become more secure: Step 1: use our password manager (fair enough), "Step 2: Enable 2 factor authentication and store the codes inside your [password manager]" ehh now it's back to 1 factor or am I missing something?

Edit: according to https://www.bleepingcomputer.com/news/security/internet-arch... (via https://news.ycombinator.com/item?id=41793669), Troy Hunt / HIBP already received and verified this "three days ago" as of yesterday 6pm AoE

almyk1y ago

I think it is safer to have 2FA in your password manager than not using 2FA at all. Because even if they got your password, if they don't have access to your password manager they can't login.

If you protect your password manager with a yubikey or any other hardware key, then your 2FA inside your password manager is quite secure and convenient. But this is very individual, what your threat model is and how secure you want/need to be.

Aachen1y ago

See also the considerations mentioned in the sibling thread btw: https://news.ycombinator.com/item?id=41793846

> even if they got your password, if they don't have access to your password manager they can't login.

Wouldn't the same argument go for a non-2fa password? What's the difference between a randomly generated 2fa secret and a randomly generated password here?

1 more reply

nixosbestos1y ago

I was going to disagree with you (and I sort of do about password managers and storing 2FA in them, but I also unlock my password manager with a yubikey).

But, doesn't a DB compromise mean that the attacker would have the TOTP seed as well? It can only increase your account security elsewhere, but also not re-using password prevents the IA leak from hurting you elsewhere as well?

Aachen1y ago

> I was going to disagree with you (and I sort of do about password managers and storing 2FA in them

Note I'm quoting HIBP's advice from the email they've sent me! I'm absolutely not recommending to store one's 2FA secrets in the same place as the password!

Even if one uses 2FA for the password manager, it stops proving "something you have" in addition to something you know and you're one unlock away from malware vacuuming it all up. The point of 2FA is to be on a separate device you need to have on hand

Of course, the same logic goes for a password manager in the first place, but password reuse is a big enough problem that (for most people's threat model) it seems to be a net positive. 2FA tokens don't have that reuse issue

EasyMark1y ago

They use bcrypt and I always use a really long password so I’m not gonna freak out over this one for once.

bjourne1y ago

Are bcrypt password hashes difficult to crack? I signed up for IA over 10 years ago with a much weaker password than those I use today.

Tepix1y ago

The difficulty is configurable. You can play around with it at https://bcrypt-generator.com/

I found this, not sure if it's still up-to-date:

◉ PHP's default implementation of bcrypt uses 10 rounds.

◉ Python's bcrypt library uses 12 rounds by default.

◉ Node.js's bcrypt library uses 10 rounds by default.

Jach1y ago

Besides being slow, there's also an implicit salt, so rainbow tables to quickly check every account for "password" don't exist. Still, if you just used a simple dictionary word present in e.g. /usr/share/dict/words (my system has 234,937 entries), you don't have as much time. I have a Ryzen 9 5900X, 12 cores; using a random Go implementation of bcrypt I found with default work factor of 10 and going through that dictionary with 24 threads, it takes my machine about 18 minutes to get through every entry. A thousand years if I wanted to go through 31 million accounts and each one was a worst-case at-the-end value. But there are quite a few more than a thousand of my CPU or better out there, some surely part of botnets which routinely number in the thousands of devices, and probably faster bcrypt implementations. Earlier this year, the FBI dismantled a botnet with 19 million infected devices globally and over 600,000 US IP addresses. Surely some of those were weak IoT devices, but still, there's a lot of compute available to bad actors such that you shouldn't necessarily rely on bcrypt et al. to protect a very weak password. (They are rather good at protecting normally weak and mid passwords, though, and there's opportunity cost for all that compute.)

nicce1y ago

If you don't reuse that password anymore, does it matter tho. Some services might use older hashing for older passwords without updating the hash algorithm. But I don't know what is the case here.

brypt passwords are very slow to crack.

1 more reply

tkgally1y ago

As of 01:09 GMT on October 10, the Internet Archive is back up.

In fact, the Wayback Machine and the book archives are responding more quickly than they did for me a week ago, when I showed the Archive to the students in an online class I teach. I gave the students a homework assignment that involves accessing some old books at the Archive. That assignment is due in about 12 hours, and I was just getting ready to e-mail the students about the outage when I saw that the site is working again.

divbzero1y ago

As of 08:34 GMT on October 10, the Internet Archive is down again.

tkgally1y ago

Thanks. I e-mailed my students to let them know.

lordfrito1y ago

Confused about this breach... I received a notification from HIBP about this hack, but I don't recall ever creating an account on archive.org (was creating an account there even a thing?).

What info does archive.org have on people? Is this info scraped from other websites and stored in the archive.org database? Or is this info related to personal archive.org accounts (as I said I don't recall making an account)?

floam1y ago

They are actual archive.org accounts. Maybe you made an account to upload something, or to check out a digitized book from their library?

lordfrito1y ago

Thank you.. was worried at first as I didn't understand the true scope of the breach. For such a vital website, the info gleaned seems relatively harmless (for those of us who don't reuse passwords that is)

1 more reply

AdmiralAsshat1y ago

Well this should be fun.

Now I'll have to dig through my IA account and remember if I donated to them directly via credit card (and if they stored it), or if it was through PayPal.

paxys1y ago

Even if you paid by credit card, there's zero chance they processed the payment themselves.

zelse1y ago

HaveIbeenpwnd says it was just passwords/usernames/emails, so seemingly not. (My company just got an email from them about the breach and I confirmed I'm in there with a quick search on their website.)

bigiain1y ago

That's what Troy got sent. It's not necessarily all the attacker took.

gaudystead1y ago

Good point and thank you for the reminder. Time to go check my email archives...

KerrAvon1y ago

they use Stripe

1 more reply

account421y ago

If they stored your email from your donation the IA would have already used it to spam you themselves, no attackers needed.

pentagrama1y ago

The reported alert on the site states:

> Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP!

But is this an official message from the company? It sounds odd and unprofessional, especially the "See 31 million of you on HIBP!" part, which jokingly refers to a huge privacy issue for users. Could it also be that the site was hacked, with hackers posting that message in addition to the data breach and DDoS attack?

andrelaszlo1y ago

Troy Hunt's tweet mentions the IA getting breached, defaced AND DDoSed. Here it is, in case you don't want to use that site:

>>>

Let me share more on the chronology of this:

30 Sep: Someone sends me the breach, but I'm travelling and didn't realise the significance

5 Oct: I get a chance to look at it - whoa!

6 Oct: I get in contact with someone at IA and send the data, advising it's our goal to load within 72 hours

7 Oct: They confirm and I ask for a disclosure notice

8 Oct: I follow up on the disclosure notice and advise we'll load tomorrow

9 Oct: They get defaced and DDoS'd, right as the data is loading into HIBP

The timing on the last point seems to be entirely coincidental. It may also be multiple parties involved and when we're talking breach + defacement + DDoS, it's clearly not just one attack.

<<<

3np1y ago

> The timing on the last point seems to be entirely coincidental. It may also be multiple parties involved and when we're talking breach + defacement + DDoS, it's clearly not just one attack.

It could also be that the attacker has compromised IA communication channels and timed it for maximum dramatic effect and confusion.

1 more reply

gtirloni1y ago

It's a thankless job to be always begging for donations to keep something working when the Internet at large doesn't value it as much as it should. And now getting targeted like that? I wouldn't judge them if this is an official communication coming from exhausted and frustrated staff.

appendix-rock1y ago

Just a reminder that AI tried pivoting to much more clear-cut legitimate piracy, presumably because they got bored or something, and certainly put ‘donations’ toward that effort.

IA is an incredibly valuable resource, but let’s not put them on a pedestal.

2 more replies

nostromo1y ago

The hackers wrote that.

https://www.bleepingcomputer.com/news/security/internet-arch...

internetter1y ago

The alert is gone now. It appears the attacker compromised their front end deployment

Uptrenda1y ago

The funny thing is the internet archive is more connected to hacker culture than cracking a website will ever be. I hate posers more than anything. Hopefully the internet archive comes back stronger than ever.

TZubiri1y ago

Yeah, this is hacker news, not hacking news

Mr-Hyde1y ago

https://x.com/Sn_darkmeta/status/1844080692772401399?t=j3xDz...

Annoying

Aeolun1y ago

What are they looking for here? Negative karma?

navigate83101y ago

Probably want it wants to purge incriminating documents against a nation state?

driver8_1y ago

That sucks, I was reading my email in the morn and saw the news from haveibeenpwned.com, and I'm indeed effected by it.

Consolation is that I used a randomly generated unique password, tried to reset my credentials and see of any 2FA options but the site is overloaded throwing 504s.

left-struck1y ago

I’ve been mentioning this a lot lately but it’s also a good idea to use email forwarding services like Firefox relay, icloud/apple “hide my email”, duckduckgo has a free one, simplelogin you can host yourself… In an email breach you can confirm who was breached if you used a unique email, and it also means your actual email remains at least as secure as those services I mentioned

Aachen1y ago

Should we be linking to the site that is very likely to be breached? Could start to host any type of malware until the access can be definitively revoked

btown1y ago

This - dang/mods is there a policy for this?

abracadaniel1y ago

Verge article as possible replacement: https://www.theverge.com/2024/10/9/24266419/internet-archive...

1 more reply

RGamma1y ago

Let's hope it was someone dumb enough to be extraditable.

popcalc1y ago

No one gets extradited when the attack aligns with US interests abroad.

bawolff1y ago

What weird conspiracy is this? US interests dont involve taking down archive.org

3 more replies

odo12421y ago

Fun fact: this is the first time using a password manager (Bitwarden) protyected me from a security breach! Now I only have to update my archive.org password instead of all of them lol

adfm1y ago

They're hiring, if you're looking for a job.

https://www.indeed.com/viewjob?jk=3bb8222ccd9a88ea

Aachen1y ago

> Software Engineer, Archiving & Data Services (Remote) [...] Preliminary duties of the role will primarily focus on developing Archive-It

That is. Paying over 100k at the lower end of the range for 3y experience as software engineer

jjice1y ago

It's a non profit. You're probably not choosing to work for the IA for high compensation.

1 more reply

adfm1y ago

Not even in the 10th % for the area per https://www.levels.fyi/heatmap/

2 more replies

bawolff1y ago

Reporting on security issues is always so terrible. Is it a data breach or is it a DDoS? (Or both). Those are opposite things. One is trying to release secret information one is trying to make the site inaccessible.

odo12421y ago

It is both. They got attacked by a DDOS after the security breach.

treesknees1y ago

Which is pretty common. While the org is running around dealing with the DDoS, they're not doing anything to fix their systems. In this case, I can't even get to my account page on IA to change my password.

Aachen1y ago

That's like complaining the reporting on the weather forecast channel is so often wrong. This news broke about an hour ago and the IA is down, what witchcraft do you expect news media to practice! Nobody yet has the answers you're looking for, give it some time and log files will be audited and the reporting becomes useful :)

bawolff1y ago

Actually figure out what is happening, or at least say how confident they are in what they know.

They aren't predicting the future, they are reporting on an ongoing event.

1 more reply

meindnoch1y ago

How much of the archive is affected? Could be a targeted effort to tamper with historical records.

EamonnMR1y ago

If they wanted to do that they'd probably not try to draw this much attention.

jl61y ago

Does the IA publish hashes of its data to a 3rd party, so we could (in principle) verify that nothing has been tampered with?

markus_zhang1y ago

Wouldn't be surprised if the service was purchased by some publishing empires. This kind of things usually costs some $$$.

xyst1y ago

One of the many benefits of owning my own email server:

- I have a catch all setup to forward all emails to specific user on mail server

- able to setup adhoc email addresses for each online service (ie, iarch@example.com)

- able to claim example.com in haveibeenpwned

Now I get breach emails from hibp for the whole domain. Unfortunately, I was exposed in this IA breach

lolinder1y ago

In case anyone would like these benefits but doesn't want to actually run an email server: All you actually need to accomplish this is a domain name and a decent provider. Fastmail is what I use and it's been great for me.

halJordan1y ago

To be even easier, you can just have Apple or Google hold your domain and provide mail.

4 more replies

srhngpr1y ago

You can do this easily (and for free) via Cloudflare [1]. Works great, I've been using it across several domains for quite some time. Migrated from Google.

[1] https://www.cloudflare.com/en-ca/developer-platform/email-ro...

xyst1y ago

yea, but now i rely on cloudflare which is no-go for me.

1 more reply

lunatuna1y ago

I used to do this, now I use icloud and the 'hide my email' tool and it works without any hassle. Even asks me when signing up for something if I want to hide my email. It is easier than adding it to my old setup. Even easier than when I was using my free Google for Business setup.

The rest of apple's email landscape sucks. It is pretty poor at managing spam, the client is terrible, it doesn't sync rules between the desktop app, icloud email, and iphone.

I hate email in general. It is getting to be 1 in a 100 type scenario of anything of value and likely worse if I knew all the emails that were deleted before I saw them.

f17428d275841y ago

I recently ran into an issue where Toyota’s app/site was detecting and refusing Apple iCloud hide-my-email addresses when trying to sign up.

The error message was very clear: hide-my-email was not permitted.

I was just trying to check for available service appointments near me and didn’t want the spam. But I guess sending spam is very very important to Toyota.

EricE1y ago

https://c-command.com/spamsieve/

Worth every penny.

yonixw1y ago

Google workspace lets you do it if they mange emails for your domain (and it will cost ~5-10$/month if you are the only user)

https://support.google.com/a/answer/12943537?hl=en

xyst1y ago

it “works”, but handing over this control to Google is a no-go for me.

nostromo1y ago

The only drawback being that all of your outgoing email is sent directly to the receiver’s spam folder..?

floren1y ago

Memes are fun and all but this one is both untrue and just serves to entrench the big bastards, who don't need any more help.

atrettel1y ago

I often use custom domains for email and haven't encountered this. From what I know, the best practice is to use a domain that you have had for a while and to use nameservers or MX records from an established service (basically). I don't run my own server but I am sure there are tricks to getting it to work that way too.

homebrewer1y ago

Use a commercial service then, they're cheap and provide every benefit mentioned by GP. The thing that you really need is not your own server, but your own domain.

nikisweeting1y ago

I've never had this issue, been running my own email server for almost 10 years.

CobaltFire1y ago

I do the same thing. Absolutely worth the small hassle.

core-utility1y ago

You don't need to deal with the hassle of your own email server for this. Just buy a domain and use Fastmail, Protonmail, or any other service you trust.

alwayslikethis1y ago

Simplelogin can do the first two. The third matters little anyways if you don't reuse passwords.

wackget1y ago

Great until you need to give someone an email address in real life and awkwardness ensues.

  Cashier: "What's your email?"
  Me:      "walmart@somedomain.com"
  Cashier: "No I meant YOUR email address."
  Me:      "Yeah walmart@somedomain.com"
  Cashier: "Oh do you work for Walmart???"
  Me:      "No see I set up my email so... oh nevermind, 420BLAZEIT@GMAIL.COM"

bunabhucan1y ago

I do this. I just say "this will sound strange but my email is ..." and then spell it.

I think if you are at the level of catch-alls and your own domain(s) then you tell the cashier "no thanks!"

shwouchk1y ago

i have a similar setup for the past 20 years or so. I rarely get a raised eyebrow at giving X.yourcompany@mydomain.com, and if i do i state it upfront “this is for categorization” and never had to explain it again.

guiambros1y ago

Zero problem. I have used this exact setup with my domain for over 23 years. First, it's rare that I had to give my email over the phone or something. And in the couple of times someone raised an eyebrow, it was an opportunity to educate the person that yes, "donotspamYOURCOMPANY@" is indeed a valid address (not exactly what I use, but similar).

The advantages are numerous: tracking who leaked my data (many times before the company even noticed it), easier to spot spam (20 years ago spam filters were a lot less sophisticated), minimize credential stuffing (before Pwd Managers became the norm), etc.

1 more reply

irobeth1y ago

I have this same setup and this conversation happens often, you get used to it happening and navigating it.

ON only one occasion in ~20 years, someone refused to do business with me because they thought I was impersonating them and told me I was being disrespectful by using their brand as my email, and even after explaining how it works they weren't happy.

worstspotgain1y ago

almartway@somedomain.com

xyst1y ago

Meh, it’s not that bad. I have a short domain and usually use an abbreviated version for user part. If it’s a big corp, just the stock ticker will suffice and nobody bats an eye. Some boomers raise an eye if it’s not @gmail.com or one of the big providers, but otherwise nobody cares.

But better than giving them an iCloud “hide my email” generated addy ;)

1 more reply

appendix-rock1y ago

All things that aren’t remotely unique to running your own mail server.

account421y ago

Good. Maybe this will get them to reconsider their website changes that make the IA unusable without javascript.

1 more reply

honeybadger11y ago

Lets attack one of the bastions of information freedom...in the name of Palestine, sigh. Ass-hat hackers.

xproot1y ago

I've made a timeline of events: https://gist.github.com/xproot/b574dc868a9db012bbe07252a1f7f...

Fun fact! Troy actually got this database back in Sep. 30th.

tomrod1y ago

That's a shame.

We need not one but many internet archives. Just one and we will repeat the outcome of the Library of Alexandria.

kiba1y ago

The Library of Alexandria wasn't that significant and likely wasn't destroyed in one cataclysmic event, but rather centuries of neglect.

eikenberry1y ago

The metaphor takes precedence over the fact.

1 more reply

tdeck1y ago

Here is a great video on the subject in case folks want to learn more: https://m.youtube.com/watch?v=M4WU8gqrgsQ

mrguyorama1y ago

Then you have to write legislation in multiple countries to do so, including large carveouts in DMCA and copyright law.

"Goodwill and donations" will never be robust against an entire industry that makes profit off of artificial digital scarcity.

jacooper1y ago

More like the library of Baghdad.

hammock1y ago

https://archive.today/ is another one

1 more reply

19h001y ago

They reported a DDOS attack yesterday, wonder if this is their alert as they manage the fallout?

n3uman1y ago

https://blog.archive.org/2021/02/04/thank-you-ubuntu-and-lin... "The Internet Archive is wholly dependent on Ubuntu and the Linux communities that create a reliable, free (as in beer), free (as in speech), rapidly evolving operating system. It is hard to overestimate how important that is to creating services such as the Internet Archive." Maybe CUPS?

Wowfunhappy1y ago

Archive.org is now down. Could anyone explain what it used to show?

Mr-Hyde1y ago

A pop-up that said,

"Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP!"

ks20481y ago

I had to look it up, but I guess HIBP refers to https://haveibeenpwned.com/

1 more reply

1024core1y ago

Why should an Archive need accounts anyways? This is like a public library: you don't need to authenticate yourself to enter a public library, do you?

r7211y ago

I created an account there because https://web.archive.org/save requires an account to set "Save outlinks" checkbox on.

ileonichwiesz1y ago

Don’t you? That’s what a library card is.

nevster1y ago

Anyone who contributes by uploading material needs an account

ct01y ago

How do you think they keep track of late fees?

acherion1y ago

To enter? No. To borrow? Yes.

1024core1y ago

What are you "borrowing" from the Archive?

1 more reply

nioj1y ago

Related submission: https://news.ycombinator.com/item?id=41792614

msephton1y ago

I just got a Discord "breaking news" notification about this from a server I am, said it may not show on Have I Been Pwned as it is so new.

TZubiri1y ago

shows now

crispair1y ago

I wonder how they got access the their database? I read in this thread that they likely used a supply chain attack by replacing some polyfill scripts. So they could've injected malicious code (XSS) that logged email and password to a remote server which they could have gone through. With a bit of luck they couldve gotten access to an admin account or whatever…

TZubiri1y ago

That much is not clear yet. It's possible the polyfill is an unrelated red herring, but it's also possible they somehow managed to elevate permissions. Seems the polyfill use was self hosted as well.

Maybe they managed to convince some critical service like an SSL cert provider that they were the owners of the subdomain? I don't know still wouldn't explain access to user and password database.

Nathans2201y ago

Strange I just received this message when going to the archive.org website I thought I might have misspelled the url

alkonaut1y ago

Does IA have much information on users? I’ve been in dozens of these HIBP leaks (including this one) but still none have concerned me, since they were mostly just email/password and nothing else.

Does IA store anything sensitive for any users?p physical addresses, credit cards, etc?

pastureofplenty1y ago

Maybe this will make Google reconsider relying on them for cached versions of webpages.

1970-01-011y ago

Archive.org is completely down

consumer4511y ago

Yeah, the fact that it's still down is a bit depressing.

I hope that this event makes some forward-thinking benevolent rich folks step up, or alternative solution.

pmontra1y ago

Does anybody know the details of the attack via the JS library? Was that the exploit of a bug that could affect every site or a chain of supply attack targeted at the Internet Archive?

meow_catrix1y ago

Bet it’s just a stored XSS alert from a poisoned cache.

TZubiri1y ago

Troy Hunt received the leak, tested it and confirmed it. You can find emails on HIBP now

arresin1y ago

The recent news on IA has made me worried about it. It seems to be a fragile thing and if it goes it'll be something we'll all regret.

Nathans2201y ago

After this error 504 Gateway Time-out Now 503 Service Unavailable No server is available to handle this request. Not looking good

silexia1y ago

Why does this link to the verge (garbage clickbait site) and not to the original source of the internet archive?

daveoc641y ago

That was an intentional choice:

https://news.ycombinator.com/item?id=41792698

Apocryphon1y ago

Hachette Book Group or Hack-it Boot Group?

midnight_shaman1y ago

I hope it will be back again soon

godshatter1y ago

The conspiracy theorist in me wonders what was accidentally copied into the archive that powerful interests want removed and if this is all smoke and mirrors while they make that happen.

carloslfu1y ago

"You are all cooked" vibes from that message hahaha

Levitating1y ago

I just received my haveibeenpwned.com email...

sirolimus1y ago

Truly unnecessary

max_1y ago

Is Internet Archive teh same as Archive.is?

stephen_g1y ago

No. It’s not clear who runs Archive.is (there are domains registered by a ‘Denis Petrov’ with an address in Prague), but the Internet Archive (archive.org) is run by a non-profit foundation.

el_jay1y ago

And only weeks before a US election.

yreg1y ago

What's the connection?

tap-snap-or-nap1y ago

Any information on SN_Blackmeta?

excalibur1y ago

The overall state of cybersecurity in 2024 depends to an astonishing degree on Troy Hunt's schedule.

anigbrowl1y ago

They have a Telegram channel and there's some blurb about it being pushback on US support of Israel, but it reads as bullshit. Probably a script kiddie.

themingus1y ago

I was disappointed to discover that https://haveibeenpwned.com does not report an email as pwned if it is subaddressed/plus addressed. myemail@gmail.com is reported as still safe, but myemail+archive@gmail.com is pwned. I wonder if my email has been leaked by any other websites without me knowing.

TonyTrapp1y ago

I don't think they can do that, because they do not store plaintext addresses in their database, merely hashes. It certainly reduces the impact of someone hacking HIBP.

firen7771y ago

Considering the hacker's motive: https://x.com/Sn_darkmeta/status/1844358501952618976

Is it safe to assume the hacker want to erase the evidence?

Forcing the service offline also means they want to prevent people from archiving evidence in the next how-ever-long hours. Combining with the spoken language they used in that video, are they planning some online disinformation campaign?

----

Edit: some more info about this group: https://old.reddit.com/r/technology/comments/1g0kupb/hacktiv...

----

This group claims to be pro palestinian and it's entirely based on Russia.

[https://therecord.media/middle-east-financial-institution-6-...

>SN\_BLACKMETA has operated its Telegram channel since November 2023, boasting of DDoS incidents and cyberattacks on infrastructure in Israel, the Palestinian Territories and elsewhere. While all of the group’s messages focus on the Palestinian Territories and perceived opponents to Palestine, many of its posts are written in Russian.

>The group’s account on X also shows that it was created by someone in Staraya, a town in Novgorod Oblast, Russia. The account’s initial language was also set to Russian.

>The researchers added that analysis of timestamps and activity patterns showed possible evidence that the actors within the group are operating in a timezone “close to Moscow Standard Time (MSK, UTC+3) or other Middle Eastern or Eastern European time zones (UTC+2 to UTC+4).”

~~Attacks include pro palestine sites and groups, so~~ take that "pro palestine" with a grain of salt.

EDIT: edited for clarity on what is actually in the article and not in outside anonymous sources. If you want to read more, [there's a clearer report on one of their attacks and their usual targets.](https://www.radware.com/security/threat-advisories-and-attac...)

TZubiri1y ago

Possible false flag?

How is someone stupid enough to post this? Warrant for the account's IP is probably already issued. I don't know how many proxies the guy is behind, but it's playing with fire.

Also at some point the account of a malicious hacker has to be banned right?

1 more reply

anon1151y ago

I wouldn't be surprised if it has something to do Israel

lionkor1y ago

... Why? How so?

boffinAudio1y ago

There is/was plenty of anti-Zionist material available in the IA.

1 more reply

Krasnol1y ago

This is why humanity can't have nice things.

worstspotgain1y ago

In unrelated news, apparently most world leaders in the Internet era, from Thatcher to GHWB to Mitterand to Rabin, expressed great admiration for Vladimir Putin.

Ekaros1y ago

So now the data also has off-site third-party archive. Isn't this along the goals of organization. It is less likely now to be destroyed in many eventualities.

lloydatkinson1y ago

Deeply disappointing. The only reason I have a IA account is to upload correct book covers to obviously wrong or poor quality books on the Library.

joshchernoff1y ago

What an asshole, honestly this is a good public service they offer.

accrual1y ago

Yeah, I can't understand why anyone would attack IA. The service is a gift to the whole internet.

rnd01y ago

Because in the main, people are vicious, blind, narcissistic brutes.

haha1121y ago

Damn I get the notice too

EchoReflection1y ago

shouldn't info about this breach be ON the IA landing page??

haha1121y ago

Where to see dump data?

Nurbek-F1y ago

solution: MFA

dt3ft1y ago

Imagine if we could get rid of passwords. Entirely. Forever.

cbg01y ago

You don't need to daydream, just use a password manager.

dt3ft1y ago

I use several, but I dream about a world with no passwords. Managers or not, passwords are always at risk and it is only a matter of time before one of the 300 sites leaks your data.

indus1y ago

I mistakenly read HIBP as Half Price Books..wait what?

mendym1y ago

Now it shows a 'Temporarily Offline' message

haha1121y ago

I saw it too

phplovesong1y ago

WHY would you attack IA? Whats the point?

testfrequency1y ago

I’m feeling extremely conflicted on all of this with IA right now.

On one hand, I love IA

On the other hand…I’m in a long thread with their support right now on removing old snapshots of a social media account I have. Creeps are actively using the old snapshots to dox me and send me death threats using my PII.

It’s incredibly frustrating and IA keeps insisting they cannot do anything about it.

A small part of me hoped IA didn’t recover from today because I knew my info would be finally deleted :/

boomboomsubban1y ago

Pretty sure you own the copyright of your social media postings, so DMCA claim them.

echelon1y ago

That's why I'm told ezboard as a whole was removed from the index (sadly).

You probably can do this, OP.

hackernewds1y ago

Isn't the point of IA to retain information? How can you, without hypocrisy, love IA if you don't agree with it happening to you, that you benefit from happening to others. There's a conflict here.

Sucks to hear you are getting doxxed still

bryant1y ago

It's an uncommon opinion for someone to be in favor of IA to retain all information, and it's also not their stated purpose.

It's a perfectly reasonable opinion to wish for retention of old sources of knowledge without retaining pages containing personal information of non-public people, or sensitive non-newsworthy information about anyone at all.

johnsonIV1y ago

Here in Australia we've had so many large data leaks I just assume all my PII is accessible to anyone motivated to find it. I'd guess folks from many other countries are in the same boat.

Not downplaying or excusing; just adding context that IA aren't the only ones and it's difficult to prevent (since the cause can be well outside of the individual's control).

cortesoft1y ago

Once you have been doxed, isn’t the cat kinda out of the bag at that point? Creeps already have the snapshots now, deleting them from IA is just closing the barn door after the livestock has already escaped.

ocdtrekkie1y ago

Bear in mind that is the doxxing and doxxers that have happened now. There are plenty of future opportunities to be doxxed and plenty of other potential victims.

Not that I'd cheer for the loss of IA, but it'd probably be nice if they took down PII on request.

hackernewds1y ago

Still worth deleting future instances. What's your point?

arresin1y ago

Can I ask why they're trying to dox you? I have literally never inspired this kind of passion on the internet--and I'm usually pretty blunt. I'm genuinely curious what it takes.

jfengel1y ago

Attacks like that tend to have little to do with bluntness. They occur when you've touched something they consider to be theirs, and you are not entitled to. Usually that's some matter of group identity, where they feel the need to show off for each other just how angry they are at you.

It has less to do with what you say or how you say it, but with who you are.

1 more reply

kleiba1y ago

What kind of asshole attacks the Internet Archive of all places on the web??

sunaookami1y ago

Pro-palestine activists: https://x.com/Sn_darkmeta/status/1844080692772401399 & https://x.com/Sn_darkmeta/status/1844104165192253945

boffinAudio1y ago

Or, equally valid, pro-zionist activists who want something that is normally easily accessible in the IA to be censored.

mcpar-land1y ago

>They are under attack because the archive belongs to the USA, and as we all know, this horrendous and hypocritical government supports the genocide that is being carried out by the terrorist state of “Israel”.

Ah yes, known arm of the US military-industrial complex, The Internet Archive

debit-freak1y ago

...or someone attempting to blame palestinian activists. This smells a lot more like someone trying to ape activist language.

2 more replies

GaryNumanVevo1y ago

Both tweets have received a community note disproving this.

3 more replies

sschueller1y ago

RIAA, MPAA, etc...

dewey1y ago

I don't think they'd post cringe messages on Twitter though.

1 more reply

Onavo1y ago

Probably funded by some bored executive at a publishing house.

1 more reply

wasabinator1y ago

Some people on this planet add such negative value. What does this clown hope to gain, apart from costing us all an incredibly useful shared resource?

squarefoot1y ago

What if the clown is actually someone hired by one of the many enemies that IA made during the years?

tinktank1y ago

He or she is still a clown. What difference does it make who hired him or her? At an individual level one can always disagree to do things that only destroy value.

2 more replies

ErikAugust1y ago

“According to their twitter, they’re doing it just to do it. Just because they can. No statement, no idea, no demands.”

A special place in Hell…

Aachen1y ago

That's a strange thing to read on Hacker news. Isn't that description the definition of hack value? As in http://www.catb.org/jargon/html/H/hack-value.html

Now, it depends what the "it" is referring to here, but so far all I've heard is about an alert() message saying the usernames will be sent to a breach alerting site. If they're doing it just for the heck of it, it's still costing a lot of people a lot of time that they could have spent doing better things, but I'd reserve special places in hell for the people who do plan this out carefully and make malicious demands

jonahx1y ago

There is a big difference between doing something for pure curiosity, love, or exploration and doing something directly harmful to other people for the same reasons. One is art; the other is sadism.

2 more replies

zymhan1y ago

It isn't "breaking into things" hackers.

It's "whipping something together" hackers.

Breaking into the Internet Archive's servers is like breaking into your public library. There's no honor to be had.

1 more reply

ttepasse1y ago

https://www.ccc.de/en/hackerethik

> Make public data available, protect private data.

1 more reply

klntsky1y ago

True hackers probably have a special place in hell, but, in a good sense.

1 more reply

skeaker1y ago

Accessing the data is one (hackery) thing, haphazardly publishing it and not responsibly disclosing it is another (criminal) thing.

Apocryphon1y ago

This isn't Cracker News.

NelsonMinar1y ago

Did you miss the part about the DDOS attack?

1 more reply

edm0nd1y ago

Its being done by pro Palestine Islamic hacktivists.

They stated on twitter because IA is controlled by "the US" and is "pro Israel".

could also just be RU larping under another flag. They have done this in the past with groups like Anonymous Sudan.

89l89l8l1y ago

100% the result of boredom. Visit website, notice its design is old and crusty and you start to dig deeper. That's all it takes. Funny how we just expect hackers to have a manifesto now.

edm0nd1y ago

nah. its politically motivated hacktivists that are pro Palestinian.

See their Twitter https://x.com/Sn_darkmeta

could also just be RU larping under another flag.

1 more reply

yard20101y ago

It's like the wild west in which a group of outlaws could just start a mess in a bar denying everyone from having fun there.

This is why we can't have nice things.

hexage18141y ago

>No statement, no idea, no demands. A special place in Hell…

I mean... would it be better if the hackers had asked for money or did it to protest global warming or something?

manquer1y ago

Yes? For society in general, for professionals in criminal justice system and also to some extent even victim as well, it is lot harder when there is no motive.

Perpetrators without motive can not be negotiated with, punishment may not a strong deterrent, rehabilitation is lot harder. Economic crimes or crimes of passion or ones as a result of addiction can have a path to rehabilitation and recidivism can be solved by tackling the underlying issue like poverty, addition etc. Even solving crimes without motive can be harder as there is less assumptions we can make about the perpetrator.

kibwen1y ago

"Say what you will about the tenets of National Socialism, but at last it's an ethos."

1 more reply

xyst1y ago

“For the lulz”

mynameyeff1y ago

huh i thought everyone already knew this

muppetman1y ago

Great. Bunch of pricks. Refuse to remove any of my data they scraped.

msephton1y ago

They seem to roll out the we're being DDOS'd every time there's some other thing happening.

msephton1y ago

So, it seems there are multiple things potentially including DDOS.

j / k navigate · click thread line to collapse

607 comments

Springtime1y ago

Just in terms of privacy, it's worth noting that anyone who has uploaded something on IA already has their email address publicly viewable.

hunter2_1y ago

tjoff1y ago

2 more replies

emidoots1y ago

4 more replies

II2II1y ago

> This raises an interesting question: should email addresses be private? Addresses of buildings aren't private, and they're somewhat analogous as with many computing concepts.

There are several ways to look at that.

Springtime1y ago

* One could create entirely separate accounts but it's high friction and IIRC the same phone number (now a requirement) can only be used for 2-3 accounts.

2 more replies

KronisLV1y ago

> This raises an interesting question: should email addresses be private?

I sadly don't think that's viable.

2 more replies

squarefoot1y ago

> This raises an interesting question: should email addresses be private?

tomjen31y ago

In a world where email costs ten cents to send (per receiver) email addresses need not be private. In our world? They kinda need to for sanity.

1 more reply

numpad01y ago

makach1y ago

Pr definition the email address is considered as private information and should be protected accordingly.

figassis1y ago

It should, mainly because an email is not just an email, it's a channel to reach otu to you, your internet address. And we know how that is going in your inbox.

weinzierl1y ago

This raises an interesting question: should email addresses be private?

1 more reply

iicc1y ago

>Addresses of buildings aren't private, and they're somewhat analogous as with many computing concepts.

Buildings are analogous to domains, not email addresses.

fortyseven1y ago

> should email addresses be private?

I dunno. Should your personal phone number be private? Or your home address? Would you be okay if I knew it and shared it with a stranger? Or would you rather be asked permission to share it first?

6 more replies

szundi1y ago

This question could not be more academic

keybpo1y ago

It's not just uploads but any item that uses the email address as a unique user identifier (I'm not technical enough to explain this clearer but [1]).

[1] https://help.archive.org/help/accounts-a-basic-guide-2/

steffanA1y ago

This is bad enough. This alone is a privacy bug/data leak.

Theoretically, someone could scrape the pages and compile a list of exposed email addresses.

1 more reply

rrwo1y ago

One solution is to use a unique email address for every website, and change the address if the site gets compromised (with the old address getting added to a spam filter).

9999000009991y ago

A pulled an old friends website down from Internet Archive.

He's moved on the next stage, but I was glad I was able to put his site back up.

It'll be a shame if IA goes down permanently, but we need a decentralized solution anyway.

Having a single mega organization in charge of our collective heritage isn't a good idea.

gabeio1y ago

if anyone knows something like what I'm suggesting, I'd love to hear about it!

pbhjpbhj1y ago

https://en.wikipedia.org/wiki/Cooperative_storage_cloud gives a few examples, like Filecoin.

1 more reply

IAmGraydon1y ago

Are you, by any chance, named Richard Hendricks?

xyzsparetimexyz1y ago

The main issue that such hosting faces is that it's less efficient and more expensive than just regular centralized servers.

1 more reply

rottc0dd1y ago

Does https://ipfs.tech/ fit the bill?

Geezus_421y ago

This was a plot line in Silicon Valley.

Xen91y ago

I believe that it would be possible to cost effectively build and implement an architecture for a distributed IA backup—this comment entails some notes.

A public counter, estimating how well the archive is currently backed up via this scheme, could be displayed.

For copyright issues, it would be possible to encrypt some of the data, e.g. such that normally borrowable items become readable files only when X% of downloads are pieced together.

- X

---

Note: It's probable that at least the NSA has a private full IA backup.

max-throat1y ago

1 more reply

aucisson_masque1y ago

It's called torrent protocol and it doesn't work, no one wants to spend money and bandwidth hosting a god forsaken movie or book that only a handful of people care about.

squarefoot1y ago

https://www.friendlyelec.com/index.php?route=product/product...

(just an example, as it's way overkill for the task)

https://transmissionbt.com/

https://github.com/transmission-remote-gui/transgui

oxygen_crisis1y ago

1 more reply

0x1ch1y ago

1 more reply

Timber-65391y ago

If the whole world has bandwidth available for TikTok, it can make the same available for sharing torrent files.

homebrewer1y ago

1 more reply

trinix9121y ago

In addition to the costs, I'd say it's also that no one wants to risk getting sued like the IA is getting.

EamonnMR1y ago

I keep wanting to do this for old sites, make like a personal mini IA. Besides just using wget or curl, any tips for pulling down useable complete websites from IA?

account421y ago

Agreed, especially an organziation that has already shown to not always be impartial.

Simran-B1y ago

A decentralized solution, doesn't that scream internet archive on blockchain? What could go wrong.

brundolf1y ago

This is one of the very few real use-cases I can think of for the blockchain

micromacrofoot1y ago

torrents maybe

steffanA1y ago

More details here about the data breach. Stolen database contains 31 million records.

https://www.bleepingcomputer.com/news/security/internet-arch...

ano-ther1y ago

> the Have I Been Pwned data breach notification service created by Troy Hunt, with whom threat actors commonly share stolen data to be added to the service

Do they? Why?

Maxious1y ago

Proves they really did hack something. There's other sites where hackers register defacements etc.

richbell1y ago

If Troy authenticates the data, they can use that as an 'endorsement' when trying to sell it.

3 more replies

xproot1y ago

Anyone who buys it or finds it in the wild can also upload it.

mkl1y ago

> The data will soon be added to HIBP

My unique-to-archive.org email address is not there yet.

nikisweeting1y ago

I just checked and my unique-to-archive.org email is showing up in the breach as of 2024-08-09.

2 more replies

paulnpace1y ago

Many hackers will remove addresses that are obviously unique, including tags, to keep silent which database has been hacked, but it seems inconsistent.

mobeigi1y ago

Out of curiosity, do you use a unique email address for every single service?

2 more replies

ranger_danger1y ago

How do they get a hold of all these leaks so fast?

1 more reply

maltris1y ago

My question is: How did Scott Helme end up with a password hash that features his own name?

jgrahamc1y ago

He didn't. If you break down that field you see:

    $2a$
    10$
    Bho2e2ptPnFRJyJKIn5Bie
    hIDiEwhjfMZFVRM9fRCarKXkemA3Pxu
    ScottHelme

Funes-1y ago

Friendly reminder to generate a unique password for every account you create so database leaks like this one don't bother you (besides on the site they're used).

AStonesThrow1y ago

https://xkcd.com/2176/

2 more replies

JohnMakin1y ago

MFA

1 more reply

haha1121y ago

I use login with google, idk if it is safe

ewenjoOP1y ago

Just noticed the site now alerts this:

> Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP!

mewpmewp21y ago

Jokes on them... I'm already on HIBP countless of times...

jsheard1y ago

It's all good, as long as you're not in that recent AI Girlfriend breach which exposed a ton of users who were trying to coax it into generating CSAM images.

https://x.com/troyhunt/status/1843788319785939422

1 more reply

to-too-two1y ago

I'm also on HIBP over 10x. What are we supposed to do? Create a new email address for every service we sign up for?

I don't know what the best practice is for keeping our personal data safe anymore.

6 more replies

nxobject1y ago

And my SSN's probably available for purchase with 9 types of crypto, too.

mendym1y ago

I assume that if this is a bad actor, then account email/name will be leaked?

uticus1y ago

Is it a genuine alert, or hacking artifact?

Sometimes with friendly / attempt-at-humorous error messages it’s difficult to tell

jrochkind11y ago

Obv an attackers ability to insert a message does imply a breach beyond a DoS. But I am pretty confident that message was not from the IA.

n_i_k_h_i_l1y ago

It's a literal window.alert()

1 more reply

EKSolutions1y ago

It looks like someone has compromised one of their subdomains for Polyfill

Update: Subdomain seems to be returning normal responses again now.

Aachen1y ago

You mean the IA included some JS polyfill from a subdomain and that's what's compromised / where the alert is coming from?

mendym1y ago

Yup.

https://news.ycombinator.com/item?id=41792651

qnsc1y ago

yes, "https://polyfill.archive.org/v3/polyfill.min.js?features=fet..." is the URL with the malicious code

1 more reply

EKSolutions1y ago

Correct. The source subdomain of the popup seems to be hxxps[:]//polyfill[.]archive[.]org

jrochkind11y ago

That would perhaps explain how they managed to inject the JS alert popup, right?

TZubiri1y ago

Yeah, but the leak has been confirmed by HIBP, I found my address in there.

1 more reply

EasyMark1y ago

PenguinRevolver1y ago

Probably not the best time to say this, but it's surprisingly easy to go through a collection with items and grab every email along with the usernames.

https://archive.org/metadata/naturally_a_girl/metadata

One way or another, there was going to be someone who would take loads of emails with a username attached to it. A bit intrigued by how the hacker compromised the database and got the passwords.

fewgrehrehre1y ago

Damn, I had no idea about this. Definitely would've changed some things had I known that emails were public.

This honestly seems like a bit of a design flaw.

Gingeas1y ago

Yeah, they have ignored everyone's concerns about the email thing. https://github.com/internetarchive/iaux/issues/892

Nathans2201y ago

Why go for the Internet Archive go for something else not the fucking archive!

mewpmewp21y ago

We all need our easily accessible decentralized archive of some sort...

Nathans2201y ago

yes

pityJuke1y ago

This thread is looking like it'll be one of the first places this incident will be documented (seems to be on the top of Google).

Already there are two new users just for this.

mendym1y ago

i see more than 2

ewenjoOP1y ago

Yeah, I was looking around, but saw no mention of it anywhere until I realized it just happened.

iamtedd1y ago

keybpo1y ago

account421y ago

It's also possible that the breach was earlier or going on for longer than reported.

marviel1y ago

https://www.reddit.com/r/DataHoarder/comments/h02jl4/lets_sa...

I found this reddit thread from /r/DataHoarder about backing up the internet archive particularly interesting, given the circumstances

numpad01y ago

50 PB * $0.014/GB = $0.7M. $0.014/GB is from[1], bare drive cost without chassis, power, or redundancy.

1: https://www.backblaze.com/blog/hard-drive-cost-per-gigabyte/

Aachen1y ago

How long does an average hard drive last? You'd have to spend that 700k every that many years (plus the extra bits you mentioned). Quite an operation actually

7 more replies

PostOnce1y ago

ks20481y ago

Interesting to compare their stated drive $/GB to their B2 offering: $6/TB/mo for "pay-as-you-go",

hard-drive price: $0.014/GB

B2 price (12*6/1024): $0.070/GB/year

1 more reply

nikisweeting1y ago

It's been tried several times, but it's hard because it's such a massive quantity of data. The IPFS backup never really got off the ground.

They have their own backups which I think is good enough for now unless someone plans on donating a few hundred million.

vincentpants1y ago

1 more reply

pbhjpbhj1y ago

Perhaps you can persuade Elon that it owns the libs?

1 more reply

creer1y ago

EamonnMR1y ago

Suppose we each backed up sites we cared about rather than trying to mirror the whole thing...

Aachen1y ago

almyk1y ago

I think it is safer to have 2FA in your password manager than not using 2FA at all. Because even if they got your password, if they don't have access to your password manager they can't login.

Aachen1y ago

See also the considerations mentioned in the sibling thread btw: https://news.ycombinator.com/item?id=41793846

> even if they got your password, if they don't have access to your password manager they can't login.

Wouldn't the same argument go for a non-2fa password? What's the difference between a randomly generated 2fa secret and a randomly generated password here?

1 more reply

nixosbestos1y ago

I was going to disagree with you (and I sort of do about password managers and storing 2FA in them, but I also unlock my password manager with a yubikey).

Aachen1y ago

> I was going to disagree with you (and I sort of do about password managers and storing 2FA in them

Note I'm quoting HIBP's advice from the email they've sent me! I'm absolutely not recommending to store one's 2FA secrets in the same place as the password!

EasyMark1y ago

They use bcrypt and I always use a really long password so I’m not gonna freak out over this one for once.

bjourne1y ago

Are bcrypt password hashes difficult to crack? I signed up for IA over 10 years ago with a much weaker password than those I use today.

Tepix1y ago

The difficulty is configurable. You can play around with it at https://bcrypt-generator.com/

I found this, not sure if it's still up-to-date:

◉ PHP's default implementation of bcrypt uses 10 rounds.

◉ Python's bcrypt library uses 12 rounds by default.

◉ Node.js's bcrypt library uses 10 rounds by default.

Jach1y ago

nicce1y ago

If you don't reuse that password anymore, does it matter tho. Some services might use older hashing for older passwords without updating the hash algorithm. But I don't know what is the case here.

brypt passwords are very slow to crack.

1 more reply

tkgally1y ago

As of 01:09 GMT on October 10, the Internet Archive is back up.

divbzero1y ago

As of 08:34 GMT on October 10, the Internet Archive is down again.

tkgally1y ago

Thanks. I e-mailed my students to let them know.

lordfrito1y ago

Confused about this breach... I received a notification from HIBP about this hack, but I don't recall ever creating an account on archive.org (was creating an account there even a thing?).

floam1y ago

They are actual archive.org accounts. Maybe you made an account to upload something, or to check out a digitized book from their library?

lordfrito1y ago

1 more reply

AdmiralAsshat1y ago

Well this should be fun.

Now I'll have to dig through my IA account and remember if I donated to them directly via credit card (and if they stored it), or if it was through PayPal.

paxys1y ago

Even if you paid by credit card, there's zero chance they processed the payment themselves.

zelse1y ago

bigiain1y ago

That's what Troy got sent. It's not necessarily all the attacker took.

gaudystead1y ago

Good point and thank you for the reminder. Time to go check my email archives...

KerrAvon1y ago

they use Stripe

1 more reply

account421y ago

If they stored your email from your donation the IA would have already used it to spam you themselves, no attackers needed.

pentagrama1y ago

The reported alert on the site states:

> Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP!

andrelaszlo1y ago

Troy Hunt's tweet mentions the IA getting breached, defaced AND DDoSed. Here it is, in case you don't want to use that site:

>>>

Let me share more on the chronology of this:

30 Sep: Someone sends me the breach, but I'm travelling and didn't realise the significance

5 Oct: I get a chance to look at it - whoa!

6 Oct: I get in contact with someone at IA and send the data, advising it's our goal to load within 72 hours

7 Oct: They confirm and I ask for a disclosure notice

8 Oct: I follow up on the disclosure notice and advise we'll load tomorrow

9 Oct: They get defaced and DDoS'd, right as the data is loading into HIBP

The timing on the last point seems to be entirely coincidental. It may also be multiple parties involved and when we're talking breach + defacement + DDoS, it's clearly not just one attack.

<<<

3np1y ago

> The timing on the last point seems to be entirely coincidental. It may also be multiple parties involved and when we're talking breach + defacement + DDoS, it's clearly not just one attack.

It could also be that the attacker has compromised IA communication channels and timed it for maximum dramatic effect and confusion.

1 more reply

gtirloni1y ago

appendix-rock1y ago

Just a reminder that AI tried pivoting to much more clear-cut legitimate piracy, presumably because they got bored or something, and certainly put ‘donations’ toward that effort.

IA is an incredibly valuable resource, but let’s not put them on a pedestal.

2 more replies

nostromo1y ago

The hackers wrote that.

https://www.bleepingcomputer.com/news/security/internet-arch...

internetter1y ago

The alert is gone now. It appears the attacker compromised their front end deployment

Uptrenda1y ago

TZubiri1y ago

Yeah, this is hacker news, not hacking news

Mr-Hyde1y ago

https://x.com/Sn_darkmeta/status/1844080692772401399?t=j3xDz...

Annoying

Aeolun1y ago

What are they looking for here? Negative karma?

navigate83101y ago

Probably want it wants to purge incriminating documents against a nation state?

driver8_1y ago

That sucks, I was reading my email in the morn and saw the news from haveibeenpwned.com, and I'm indeed effected by it.

Consolation is that I used a randomly generated unique password, tried to reset my credentials and see of any 2FA options but the site is overloaded throwing 504s.

left-struck1y ago

Aachen1y ago

Should we be linking to the site that is very likely to be breached? Could start to host any type of malware until the access can be definitively revoked

btown1y ago

This - dang/mods is there a policy for this?

abracadaniel1y ago

Verge article as possible replacement: https://www.theverge.com/2024/10/9/24266419/internet-archive...

1 more reply

RGamma1y ago

Let's hope it was someone dumb enough to be extraditable.

popcalc1y ago

No one gets extradited when the attack aligns with US interests abroad.

bawolff1y ago

What weird conspiracy is this? US interests dont involve taking down archive.org

3 more replies

odo12421y ago

Fun fact: this is the first time using a password manager (Bitwarden) protyected me from a security breach! Now I only have to update my archive.org password instead of all of them lol

adfm1y ago

They're hiring, if you're looking for a job.

https://www.indeed.com/viewjob?jk=3bb8222ccd9a88ea

Aachen1y ago

> Software Engineer, Archiving & Data Services (Remote) [...] Preliminary duties of the role will primarily focus on developing Archive-It

That is. Paying over 100k at the lower end of the range for 3y experience as software engineer

jjice1y ago

It's a non profit. You're probably not choosing to work for the IA for high compensation.

1 more reply

adfm1y ago

Not even in the 10th % for the area per https://www.levels.fyi/heatmap/

2 more replies

bawolff1y ago

odo12421y ago

It is both. They got attacked by a DDOS after the security breach.

treesknees1y ago

Aachen1y ago

bawolff1y ago

Actually figure out what is happening, or at least say how confident they are in what they know.

They aren't predicting the future, they are reporting on an ongoing event.

1 more reply

meindnoch1y ago

How much of the archive is affected? Could be a targeted effort to tamper with historical records.

EamonnMR1y ago

If they wanted to do that they'd probably not try to draw this much attention.

jl61y ago

Does the IA publish hashes of its data to a 3rd party, so we could (in principle) verify that nothing has been tampered with?

markus_zhang1y ago

Wouldn't be surprised if the service was purchased by some publishing empires. This kind of things usually costs some $$$.

xyst1y ago

One of the many benefits of owning my own email server:

- I have a catch all setup to forward all emails to specific user on mail server

- able to setup adhoc email addresses for each online service (ie, iarch@example.com)

- able to claim example.com in haveibeenpwned

Now I get breach emails from hibp for the whole domain. Unfortunately, I was exposed in this IA breach

lolinder1y ago

halJordan1y ago

To be even easier, you can just have Apple or Google hold your domain and provide mail.

4 more replies

srhngpr1y ago

You can do this easily (and for free) via Cloudflare [1]. Works great, I've been using it across several domains for quite some time. Migrated from Google.

[1] https://www.cloudflare.com/en-ca/developer-platform/email-ro...

xyst1y ago

yea, but now i rely on cloudflare which is no-go for me.

1 more reply

lunatuna1y ago

The rest of apple's email landscape sucks. It is pretty poor at managing spam, the client is terrible, it doesn't sync rules between the desktop app, icloud email, and iphone.

I hate email in general. It is getting to be 1 in a 100 type scenario of anything of value and likely worse if I knew all the emails that were deleted before I saw them.

f17428d275841y ago

I recently ran into an issue where Toyota’s app/site was detecting and refusing Apple iCloud hide-my-email addresses when trying to sign up.

The error message was very clear: hide-my-email was not permitted.

I was just trying to check for available service appointments near me and didn’t want the spam. But I guess sending spam is very very important to Toyota.

EricE1y ago

https://c-command.com/spamsieve/

Worth every penny.

yonixw1y ago

Google workspace lets you do it if they mange emails for your domain (and it will cost ~5-10$/month if you are the only user)

https://support.google.com/a/answer/12943537?hl=en

xyst1y ago

it “works”, but handing over this control to Google is a no-go for me.

nostromo1y ago

The only drawback being that all of your outgoing email is sent directly to the receiver’s spam folder..?

floren1y ago

Memes are fun and all but this one is both untrue and just serves to entrench the big bastards, who don't need any more help.

atrettel1y ago

homebrewer1y ago

Use a commercial service then, they're cheap and provide every benefit mentioned by GP. The thing that you really need is not your own server, but your own domain.

nikisweeting1y ago

I've never had this issue, been running my own email server for almost 10 years.

CobaltFire1y ago

I do the same thing. Absolutely worth the small hassle.

core-utility1y ago

You don't need to deal with the hassle of your own email server for this. Just buy a domain and use Fastmail, Protonmail, or any other service you trust.

alwayslikethis1y ago

Simplelogin can do the first two. The third matters little anyways if you don't reuse passwords.

wackget1y ago

Great until you need to give someone an email address in real life and awkwardness ensues.

  Cashier: "What's your email?"
  Me:      "walmart@somedomain.com"
  Cashier: "No I meant YOUR email address."
  Me:      "Yeah walmart@somedomain.com"
  Cashier: "Oh do you work for Walmart???"
  Me:      "No see I set up my email so... oh nevermind, 420BLAZEIT@GMAIL.COM"

bunabhucan1y ago

I do this. I just say "this will sound strange but my email is ..." and then spell it.

I think if you are at the level of catch-alls and your own domain(s) then you tell the cashier "no thanks!"

shwouchk1y ago

guiambros1y ago

1 more reply

irobeth1y ago

I have this same setup and this conversation happens often, you get used to it happening and navigating it.

worstspotgain1y ago

almartway@somedomain.com

xyst1y ago

But better than giving them an iCloud “hide my email” generated addy ;)

1 more reply

appendix-rock1y ago

All things that aren’t remotely unique to running your own mail server.

account421y ago

Good. Maybe this will get them to reconsider their website changes that make the IA unusable without javascript.

1 more reply

honeybadger11y ago

Lets attack one of the bastions of information freedom...in the name of Palestine, sigh. Ass-hat hackers.

xproot1y ago

I've made a timeline of events: https://gist.github.com/xproot/b574dc868a9db012bbe07252a1f7f...

Fun fact! Troy actually got this database back in Sep. 30th.

tomrod1y ago

That's a shame.

We need not one but many internet archives. Just one and we will repeat the outcome of the Library of Alexandria.

kiba1y ago

The Library of Alexandria wasn't that significant and likely wasn't destroyed in one cataclysmic event, but rather centuries of neglect.

eikenberry1y ago

The metaphor takes precedence over the fact.

1 more reply

tdeck1y ago

Here is a great video on the subject in case folks want to learn more: https://m.youtube.com/watch?v=M4WU8gqrgsQ

mrguyorama1y ago

Then you have to write legislation in multiple countries to do so, including large carveouts in DMCA and copyright law.

"Goodwill and donations" will never be robust against an entire industry that makes profit off of artificial digital scarcity.

jacooper1y ago

More like the library of Baghdad.

hammock1y ago

https://archive.today/ is another one

1 more reply

19h001y ago

They reported a DDOS attack yesterday, wonder if this is their alert as they manage the fallout?

n3uman1y ago

Wowfunhappy1y ago

Archive.org is now down. Could anyone explain what it used to show?

Mr-Hyde1y ago

A pop-up that said,

"Have you ever felt like the Internet Archive runs on sticks and is constantly on the verge of suffering a catastrophic security breach? It just happened. See 31 million of you on HIBP!"

ks20481y ago

I had to look it up, but I guess HIBP refers to https://haveibeenpwned.com/

1 more reply

1024core1y ago

Why should an Archive need accounts anyways? This is like a public library: you don't need to authenticate yourself to enter a public library, do you?

r7211y ago

I created an account there because https://web.archive.org/save requires an account to set "Save outlinks" checkbox on.

ileonichwiesz1y ago

Don’t you? That’s what a library card is.

nevster1y ago

Anyone who contributes by uploading material needs an account

ct01y ago

How do you think they keep track of late fees?

acherion1y ago

To enter? No. To borrow? Yes.

1024core1y ago

What are you "borrowing" from the Archive?

1 more reply

nioj1y ago

Related submission: https://news.ycombinator.com/item?id=41792614

msephton1y ago

I just got a Discord "breaking news" notification about this from a server I am, said it may not show on Have I Been Pwned as it is so new.

TZubiri1y ago

shows now

crispair1y ago

TZubiri1y ago

That much is not clear yet. It's possible the polyfill is an unrelated red herring, but it's also possible they somehow managed to elevate permissions. Seems the polyfill use was self hosted as well.

Maybe they managed to convince some critical service like an SSL cert provider that they were the owners of the subdomain? I don't know still wouldn't explain access to user and password database.

Nathans2201y ago

Strange I just received this message when going to the archive.org website I thought I might have misspelled the url

alkonaut1y ago

Does IA have much information on users? I’ve been in dozens of these HIBP leaks (including this one) but still none have concerned me, since they were mostly just email/password and nothing else.

Does IA store anything sensitive for any users?p physical addresses, credit cards, etc?

pastureofplenty1y ago

Maybe this will make Google reconsider relying on them for cached versions of webpages.

1970-01-011y ago

Archive.org is completely down

consumer4511y ago

Yeah, the fact that it's still down is a bit depressing.

I hope that this event makes some forward-thinking benevolent rich folks step up, or alternative solution.

pmontra1y ago

Does anybody know the details of the attack via the JS library? Was that the exploit of a bug that could affect every site or a chain of supply attack targeted at the Internet Archive?

meow_catrix1y ago

Bet it’s just a stored XSS alert from a poisoned cache.

TZubiri1y ago

Troy Hunt received the leak, tested it and confirmed it. You can find emails on HIBP now

arresin1y ago

The recent news on IA has made me worried about it. It seems to be a fragile thing and if it goes it'll be something we'll all regret.

Nathans2201y ago

After this error 504 Gateway Time-out Now 503 Service Unavailable No server is available to handle this request. Not looking good

silexia1y ago

Why does this link to the verge (garbage clickbait site) and not to the original source of the internet archive?

daveoc641y ago

That was an intentional choice:

https://news.ycombinator.com/item?id=41792698

Apocryphon1y ago

Hachette Book Group or Hack-it Boot Group?

midnight_shaman1y ago

I hope it will be back again soon

godshatter1y ago

The conspiracy theorist in me wonders what was accidentally copied into the archive that powerful interests want removed and if this is all smoke and mirrors while they make that happen.

carloslfu1y ago

"You are all cooked" vibes from that message hahaha

Levitating1y ago

I just received my haveibeenpwned.com email...

sirolimus1y ago

Truly unnecessary

max_1y ago

Is Internet Archive teh same as Archive.is?

stephen_g1y ago

No. It’s not clear who runs Archive.is (there are domains registered by a ‘Denis Petrov’ with an address in Prague), but the Internet Archive (archive.org) is run by a non-profit foundation.

el_jay1y ago

And only weeks before a US election.

yreg1y ago

What's the connection?

tap-snap-or-nap1y ago

Any information on SN_Blackmeta?

excalibur1y ago

The overall state of cybersecurity in 2024 depends to an astonishing degree on Troy Hunt's schedule.

anigbrowl1y ago

They have a Telegram channel and there's some blurb about it being pushback on US support of Israel, but it reads as bullshit. Probably a script kiddie.

themingus1y ago

TonyTrapp1y ago

I don't think they can do that, because they do not store plaintext addresses in their database, merely hashes. It certainly reduces the impact of someone hacking HIBP.

firen7771y ago

Considering the hacker's motive: https://x.com/Sn_darkmeta/status/1844358501952618976

Is it safe to assume the hacker want to erase the evidence?

----

Edit: some more info about this group: https://old.reddit.com/r/technology/comments/1g0kupb/hacktiv...

----

This group claims to be pro palestinian and it's entirely based on Russia.

[https://therecord.media/middle-east-financial-institution-6-...

>The group’s account on X also shows that it was created by someone in Staraya, a town in Novgorod Oblast, Russia. The account’s initial language was also set to Russian.

~~Attacks include pro palestine sites and groups, so~~ take that "pro palestine" with a grain of salt.

TZubiri1y ago

Possible false flag?

How is someone stupid enough to post this? Warrant for the account's IP is probably already issued. I don't know how many proxies the guy is behind, but it's playing with fire.

Also at some point the account of a malicious hacker has to be banned right?

1 more reply

anon1151y ago

I wouldn't be surprised if it has something to do Israel

lionkor1y ago

... Why? How so?

boffinAudio1y ago

There is/was plenty of anti-Zionist material available in the IA.

1 more reply

Krasnol1y ago

This is why humanity can't have nice things.

worstspotgain1y ago

In unrelated news, apparently most world leaders in the Internet era, from Thatcher to GHWB to Mitterand to Rabin, expressed great admiration for Vladimir Putin.

Ekaros1y ago

So now the data also has off-site third-party archive. Isn't this along the goals of organization. It is less likely now to be destroyed in many eventualities.

lloydatkinson1y ago

Deeply disappointing. The only reason I have a IA account is to upload correct book covers to obviously wrong or poor quality books on the Library.

joshchernoff1y ago

What an asshole, honestly this is a good public service they offer.

accrual1y ago

Yeah, I can't understand why anyone would attack IA. The service is a gift to the whole internet.

rnd01y ago

Because in the main, people are vicious, blind, narcissistic brutes.

haha1121y ago

Damn I get the notice too

EchoReflection1y ago

shouldn't info about this breach be ON the IA landing page??

haha1121y ago

Where to see dump data?

Nurbek-F1y ago

solution: MFA

dt3ft1y ago

Imagine if we could get rid of passwords. Entirely. Forever.

cbg01y ago

You don't need to daydream, just use a password manager.

dt3ft1y ago

I use several, but I dream about a world with no passwords. Managers or not, passwords are always at risk and it is only a matter of time before one of the 300 sites leaks your data.

indus1y ago

I mistakenly read HIBP as Half Price Books..wait what?

mendym1y ago

Now it shows a 'Temporarily Offline' message

haha1121y ago

I saw it too

phplovesong1y ago

WHY would you attack IA? Whats the point?

testfrequency1y ago

I’m feeling extremely conflicted on all of this with IA right now.

On one hand, I love IA

It’s incredibly frustrating and IA keeps insisting they cannot do anything about it.

A small part of me hoped IA didn’t recover from today because I knew my info would be finally deleted :/

boomboomsubban1y ago

Pretty sure you own the copyright of your social media postings, so DMCA claim them.

echelon1y ago

That's why I'm told ezboard as a whole was removed from the index (sadly).

You probably can do this, OP.

hackernewds1y ago

Isn't the point of IA to retain information? How can you, without hypocrisy, love IA if you don't agree with it happening to you, that you benefit from happening to others. There's a conflict here.

Sucks to hear you are getting doxxed still

bryant1y ago

It's an uncommon opinion for someone to be in favor of IA to retain all information, and it's also not their stated purpose.

johnsonIV1y ago

Here in Australia we've had so many large data leaks I just assume all my PII is accessible to anyone motivated to find it. I'd guess folks from many other countries are in the same boat.

Not downplaying or excusing; just adding context that IA aren't the only ones and it's difficult to prevent (since the cause can be well outside of the individual's control).

cortesoft1y ago

ocdtrekkie1y ago

Bear in mind that is the doxxing and doxxers that have happened now. There are plenty of future opportunities to be doxxed and plenty of other potential victims.

Not that I'd cheer for the loss of IA, but it'd probably be nice if they took down PII on request.

hackernewds1y ago

Still worth deleting future instances. What's your point?

arresin1y ago

Can I ask why they're trying to dox you? I have literally never inspired this kind of passion on the internet--and I'm usually pretty blunt. I'm genuinely curious what it takes.

jfengel1y ago

It has less to do with what you say or how you say it, but with who you are.

1 more reply

kleiba1y ago

What kind of asshole attacks the Internet Archive of all places on the web??

sunaookami1y ago

Pro-palestine activists: https://x.com/Sn_darkmeta/status/1844080692772401399 & https://x.com/Sn_darkmeta/status/1844104165192253945

boffinAudio1y ago

Or, equally valid, pro-zionist activists who want something that is normally easily accessible in the IA to be censored.

mcpar-land1y ago

Ah yes, known arm of the US military-industrial complex, The Internet Archive

debit-freak1y ago

...or someone attempting to blame palestinian activists. This smells a lot more like someone trying to ape activist language.

2 more replies

GaryNumanVevo1y ago

Both tweets have received a community note disproving this.

3 more replies

sschueller1y ago

RIAA, MPAA, etc...

dewey1y ago

I don't think they'd post cringe messages on Twitter though.

1 more reply

Onavo1y ago

Probably funded by some bored executive at a publishing house.

1 more reply

wasabinator1y ago

Some people on this planet add such negative value. What does this clown hope to gain, apart from costing us all an incredibly useful shared resource?

squarefoot1y ago

What if the clown is actually someone hired by one of the many enemies that IA made during the years?

tinktank1y ago

He or she is still a clown. What difference does it make who hired him or her? At an individual level one can always disagree to do things that only destroy value.

2 more replies

ErikAugust1y ago

“According to their twitter, they’re doing it just to do it. Just because they can. No statement, no idea, no demands.”

A special place in Hell…

Aachen1y ago

That's a strange thing to read on Hacker news. Isn't that description the definition of hack value? As in http://www.catb.org/jargon/html/H/hack-value.html

jonahx1y ago

There is a big difference between doing something for pure curiosity, love, or exploration and doing something directly harmful to other people for the same reasons. One is art; the other is sadism.

2 more replies

zymhan1y ago

It isn't "breaking into things" hackers.

It's "whipping something together" hackers.

Breaking into the Internet Archive's servers is like breaking into your public library. There's no honor to be had.

1 more reply

ttepasse1y ago

https://www.ccc.de/en/hackerethik

> Make public data available, protect private data.

1 more reply

klntsky1y ago

True hackers probably have a special place in hell, but, in a good sense.

1 more reply

skeaker1y ago

Accessing the data is one (hackery) thing, haphazardly publishing it and not responsibly disclosing it is another (criminal) thing.

Apocryphon1y ago

This isn't Cracker News.

NelsonMinar1y ago

Did you miss the part about the DDOS attack?

1 more reply

edm0nd1y ago

Its being done by pro Palestine Islamic hacktivists.

They stated on twitter because IA is controlled by "the US" and is "pro Israel".

could also just be RU larping under another flag. They have done this in the past with groups like Anonymous Sudan.

89l89l8l1y ago

100% the result of boredom. Visit website, notice its design is old and crusty and you start to dig deeper. That's all it takes. Funny how we just expect hackers to have a manifesto now.

edm0nd1y ago

nah. its politically motivated hacktivists that are pro Palestinian.

See their Twitter https://x.com/Sn_darkmeta

could also just be RU larping under another flag.

1 more reply

yard20101y ago

It's like the wild west in which a group of outlaws could just start a mess in a bar denying everyone from having fun there.

This is why we can't have nice things.

hexage18141y ago

>No statement, no idea, no demands. A special place in Hell…

I mean... would it be better if the hackers had asked for money or did it to protest global warming or something?

manquer1y ago

Yes? For society in general, for professionals in criminal justice system and also to some extent even victim as well, it is lot harder when there is no motive.

kibwen1y ago

"Say what you will about the tenets of National Socialism, but at last it's an ethos."

1 more reply

xyst1y ago

“For the lulz”

mynameyeff1y ago

huh i thought everyone already knew this

muppetman1y ago

Great. Bunch of pricks. Refuse to remove any of my data they scraped.

msephton1y ago

They seem to roll out the we're being DDOS'd every time there's some other thing happening.

msephton1y ago

So, it seems there are multiple things potentially including DDOS.

j / k navigate · click thread line to collapse