Then on Saturday, two weeks ago, I learnt that Imgur was going to delete all anonymous & NSFW files on the 15th of May. It was pointed out that this would mean broken links in communities that had relied on Imgur. By the Sunday I had decided that I would build another DAM, initially with the intent of avoiding Imgur link rot.
It was challenging to find time to spend on this, the project was put together over about 8 evenings. It still has rough patches, this is an early MVP (a Michael Seibel "brick").
I have many ideas of where to take this project, but for now it only does one thing: backup Imgur files and produce new links that are easy to swap out for old soon-to-be-deleted Imgur links.
Until this site succumbs to the challenges of being a free image host?
FWIW ArchiveTeam is grabbing as much as possible so it could eventually show up in the wayback machine. Though what they have now, 47TB, is a drop to the petabytes imgur probably has now. There was an estimated 376 TiB in 2015.
For this to work, I think you need to find revenue sources right away and run it as cheaply as possible, otherwise you'll run into problems, creating more link rot in the process.
Anyway, I hope you manage to make it work.
How closely did you read this? Since I see you used an TOC generator (which was nice for me learn about), I'm just curious how closely you read some of the conditions.
I just may be reading the Prohibited[1] section too closely as well as the User Generated Content [2] section, but it seems like your stated goal about protecting the NSFW content from Imgur, as I guess such content is potentially obscene, lewd, lascivious, filthy, violent, harassing, libellous, slanderous, or otherwise objectionable (as determined by [you and your team]).
I'm not here to trash your service or trying to "gotcha", just since you're responding in the thread, I'm only curious how closely you've read the TOC and if you stand by all the elements of it.
0 - https://haasie.com/terms_and_conditions
1 - https://haasie.com/terms_and_conditions#prohibited
2- https://haasie.com/terms_and_conditions#ugc
edit: fixed link formatting
Imgur atm still allows for NSFW, although presumably this will change. But as it stands the contents of these TOCs are not dissimilar from Imgur's or any other file hosts that allow for NSFW content.
To clarify Haasie is not intended to be a specifically NSFW host - like RedGIF someone here mentioned - but rather to host any legal content, which includes NSFW.
Emphasis here is on legal. The intention behind the TOCs is to specifically prohibit the use of the service for anything illegal or otherwise prohibited by the hosting company. And to state that Haasie will act on any content that is found to be in violation of the TOCs.
good luck on the project!
The age old question is how you afford to run this, and how if you get popular do you not become an imgur yourself and have their same pressures to delete, moderate and make money.
I don't believe it should be government funded, but I wish there was a protocol that would let my ISP allocate a few GB of resources on my behalf that stay online even when my PC is powered down, and then I could reference them anywhere, and there's be a succession protocol in case I move to a different ISP so links don't break, etc.
So now you need caps. And enforcement. And exceptions. And suddenly, you have a cloud service provider. And they've invested a ton into making the place nice, and there are teams drawing salaries keeping this running, and "succession protocols" aren't making them any money.
What you're asking for is infrastructure. Humanity to date has not found a way to finance long-term working infrastructure that doesn't involve a government, because that's the only body that invests in the commons. (Yeah, sometimes we pretend we can privatize, and then you get PG&E, and everybody suffers)
I don't think there ever was any free method of mass distributing information. Or storing it if someone didn't find it interesting. If you need to share something, you should burden the cost. Just because some idiots thought they had a model that didn't work doesn't mean there isn't real costs involved. And those should be paid. In free market they should approach margin.
Also I'm already paying my ISP for a connection, and so is everyone that views the things I host, so personally I hope for advances in P2P to leverage that existing already-paid-for bandwidth for distribution instead of having to toss more money on top.
If only there was some way to gather and organize this rich fraction of users' contributions. Maybe we could create some kind of assembly where we discuss and vote on how to do so.
>Why is "the government should pay for it" always a comment in every thread on HN? Is there something I haven't understood about this community.
Because the purpose of government is to organize and manage society and that requires funds to do so? I'm not sure how to answer such a simple question with such a self-evident answer. How old are you?
And yes, you do need to convince me: If it's "publicly funded" then I (and everyone else) am/are paying for it. Personally, I would sooner pay for another US navy aircraft carrier.
In terms of a government funded imgur backup: nah not convinced!
Don't you already pay for it?
In the future I could also add a subscription account for additional features, access to an API, bigger upload limits, transformations, greater ability to organise, etc.
As a sidenote, reddit is (was?) the one of main user of imgur, however https://www.google.com/search?q=%22haasie.com%22+site%3Aredd... yields no results - maybe it is worth to promote it there also.
One is to turn off any new uploads, and just serve the existing content, presumably the bandwidth on old content would be much lower - until some sustainable plan can be devised.
Last resort is to approach the ArchiveTeam mentioned here to see if and we could to setup a maintenance service.
To your side note, yes, I want to promote it on Reddit, as soon as I've added a few more features - like donations.
Except you haven't, because anyone that wants to view these images where they are linked needs to know to change the URL to Haasie instead of Imgur. So all the links are still broken.
And on top of that, even if the users know about Haasie, it only works if someone went to this website before the original image was deleted, and enter the URL on a form. Non-popular images will be long gone before people do this manually. Perhaps accepting any URL and transparently fetching it from Imgur would improve this a little. From my manual testing, this doesn't seem to be the case, just gives an S3 error from bucket ID haasie01, and nothing shows up there even if I wait and refresh.
All of this is assuming this website won't go down or go bad. Probably the Archive Team, and Archive.org can handle this task better.
One potentially problematic thing about the new Imgur policy is that it will break machine learning projects that depend on Imgur for NSFW data. For example, here's a project relying on Imgur links in order to train a classifier that detects adult content:
I have a question for all of you. My approach is that we are all understand that there is no such thing as a free service, and that for something like this to be long term sustainable donations (or some source of revenue) is crucial. If people get value from the service then it only makes sense to give back to the service to sustain it.
With this in mind, I'm toying with an idea that I would like to get a temperature check from HN on.
How would you react if, when donations are too low, Haasie serves a single image that requests donations to sustain the service in place of the original anonymously uploaded content, until the required donation level is reached? Once there are enough donations normal service is resumed.
Let forum admins compile a list of linked imgur posts that should be archived. Storage and bandwidth are cheap compared to what they used to be, so some may be able to self-host images now.