* rsyncs the directories containing the files you want to back up
* mysqldumps/pg_dumps your databases
* zips/gzips everything up into a dated archive file
* deletes the oldest backup (the one with X days ago's date)
Put this program on a VPS at a different provider, on a spare computer in your house, or both. Create a cron job that runs it every night. Run it manually once or twice, then actually restore your backups somewhere to ensure you've made them correctly.
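A minimal sketch of a nightly script covering those four steps, assuming the backup box can reach the web server over SSH (hostnames, paths, credentials, and the 7-day retention window are all placeholders):

#!/bin/sh
# Hypothetical nightly pull backup: rsync files, dump the DB, archive, rotate
DATE=$(date +%Y-%m-%d)
OLD=$(date -d "7 days ago" +%Y-%m-%d)
BACKUP_DIR=/srv/backups

rsync -a --delete web01:/var/www/ ${BACKUP_DIR}/files/
ssh web01 "mysqldump -u backup -pSECRET mydb" > ${BACKUP_DIR}/files/mydb.sql
tar -czf ${BACKUP_DIR}/backup-${DATE}.tar.gz -C ${BACKUP_DIR} files
rm -f ${BACKUP_DIR}/backup-${OLD}.tar.gz

Deleting exactly "the archive from 7 days ago" mirrors the last bullet; a find with -mtime +7 -delete is a more forgiving variant if the job ever skips a night.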
I don't delete and/or gzip my oldest uploads though.
#!/bin/sh
DATE=$(date +%d-%m-%Y@%H:%M:%S.%3N)
DB_USER="qux"
DB_PASS="foo"
DB_NAME="bar"
DROPBOX_TOKEN="baz"
/usr/bin/mysqldump -u${DB_USER} -p${DB_PASS} ${DB_NAME} > /tmp/${DATE}.sql
/usr/bin/curl -H "Authorization: Bearer ${DROPBOX_TOKEN}" https://api-content.dropbox.com/1/files_put/backup/ -T /tmp/${DATE}.sql

One alternative is to put these backups into S3 using pre-signed requests rather than Dropbox. An S3 pre-signed request gives permission only to upload files, perhaps only to a certain location in a certain bucket.
It's a bit harder to set up, but the shell script will look almost the same.
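To sketch what the pre-signed variant looks like on the box itself: the URL is generated out of band (for example with the AWS SDK's put_object pre-signing) and only allows uploading that one key before it expires; the bucket and key here are made up.

PRESIGNED_URL="https://my-backup-bucket.s3.amazonaws.com/backup/${DATE}.sql?<signature-params>"
/usr/bin/curl -T /tmp/${DATE}.sql "${PRESIGNED_URL}"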
And whatever you do, check that you can actually recover from these backups every once in a while.
That is the _only_ reason I have for looking at something else.
Then on the local Linux box I had a separate script that would take snapshots of that directory to a completely different place on the filesystem.
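The comment doesn't say how those snapshots were made; one common approach is rsync with --link-dest, which hard-links files that haven't changed so each snapshot costs almost nothing. A sketch with made-up paths:

SRC=/srv/backups/mirror
DEST=/srv/snapshots
STAMP=$(date +%Y-%m-%d)

# Files identical to the previous snapshot become hard links, not copies
rsync -a --link-dest="${DEST}/latest" "${SRC}/" "${DEST}/${STAMP}/"
ln -sfn "${DEST}/${STAMP}" "${DEST}/latest"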
I've used it for many, many years. Setup is a bit of a pain, especially if it's your first time, but it's a totally reliable backup system and gives you something much better than just a pile of zip archives.
All of our servers get BackupPC'd (rsync-over-ssh, pulled) twice a day to an in-house server that's totally unreachable from the internet. I get emails from BackupPC when something goes wrong, which is pretty much never. Backups aren't a thing I have to worry about much anymore.
We basically create a backup folder (our assets and a MySQL dump), then rsync it to rsync.net. Our source code is already in git, so it's effectively backed up on GitHub and on every developer's computer.
On top of that, rsync has very clear and simple documentation, so it's quick to set up on any Linux distribution.
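In practice that can be as small as this (user, host, and paths are placeholders):

# Dump the database into the backup folder, then push the folder to rsync.net over SSH
mysqldump -u backup -pSECRET mydb > /srv/backup/mydb.sql
rsync -az -e ssh /srv/backup/ user@hostname.rsync.net:backup/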
I hope that you know that your account, like all accounts at rsync.net, is on a ZFS filesystem.
This is important because it means that inside your account, in the .zfs directory, are 7 daily "snapshots" of your entire rsync.net account, free of charge.
Just browse right in and see your entire account as it existed on those days in the past. No configuration or setup necessary. Also, they are immutable/readonly so even if an attacker gains access to your rsync.net account and uses your credentials to delete your data, the snapshots will still be there.
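For anyone who hasn't seen it, the snapshots are just directories you can list remotely; something like this (snapshot names are illustrative):

ssh user@hostname.rsync.net ls .zfs/snapshot
ssh user@hostname.rsync.net ls .zfs/snapshot/daily_2016-01-15/backup/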
Not sure I'd agree there, but it's not inscrutable. I use rsync for almost all file transfers, backups included, so I'm used to it. But there are oddities here and yon.
I do believe you can take images and snapshots and download them, so using the API a user could probably rig up a script to make it redundant if it was mission critical.
I use tarsnap, as many others in this thread have shared. I also have the Digital Ocean backups option enabled, but I don't necessarily trust it. For the handful of servers I run, the small cost is worth it. Tarsnap is incredibly cheap if most of your data doesn't change from day to day.
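A hedged sketch of the sort of nightly tarsnap job this implies (archive names and paths are placeholders):

# Create a dated archive; tarsnap dedupes against everything already stored,
# so unchanged data costs almost nothing
tarsnap -c -f "web01-$(date +%Y-%m-%d)" /etc /var/www /srv/db-dumps

# Later: see what exists, or restore an archive somewhere safe to test it
tarsnap --list-archives
tarsnap -x -f "web01-2016-01-15" -C /tmp/restore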
[1] info@rsync.net
The only annoying thing is that duplicity uses an old version of the boto s3 library that errors out if your signatures tar file is greater than 5gb unless you add `DUPL_PARAMS="$DUPL_PARAMS --s3-use-multiprocessing "` to your duply `conf` file. Took me days to figure that out.
It took a little time to set up, but it is conceptually simple, very inexpensive (especially if you set up S3 to automatically send older files to Glacier, and/or remove old backups every now and then)... and I like that the backups are off-site and stored by a different company than the web hosts.
Here are the relevant docs: http://docs.aws.amazon.com/AmazonS3/latest/dev/object-lifecy...
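As a rough sketch, a rule along those lines can be attached to the bucket with the AWS CLI; the bucket, prefix, and day counts below are placeholders:

aws s3api put-bucket-lifecycle-configuration \
  --bucket my-backup-bucket \
  --lifecycle-configuration '{
    "Rules": [{
      "ID": "archive-then-expire",
      "Filter": {"Prefix": "backups/"},
      "Status": "Enabled",
      "Transitions": [{"Days": 30, "StorageClass": "GLACIER"}],
      "Expiration": {"Days": 365}
    }]
  }'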
My main site runs a complex series of workers, CGI scripts, and daemons. I can deploy them from scratch onto a remote node via fabric & ansible.
That means I don't need to back up the whole server "/" (although I do!). Since I can set up a new instance immediately, the only data that needs to be backed up is the contents of some databases, and to do that I run an offsite backup once an hour (crontab sketch below).
Github takes care of code and config.
AWS S3 takes care of uploaded static files.
But Tarsnap takes care of my database backups.
The only thing to be aware of is that restore times can be very slow.
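The "once an hour" part mentioned above is just a crontab entry; something like this, with a hypothetical script name:

# m h dom mon dow  command
0 * * * * /usr/local/bin/offsite-db-backup.sh >> /var/log/db-backup.log 2>&1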
For a database-driven dynamic site, or a site with content uploads, you can also use your version control system via a cron job to capture that content. Have the database journal out the tables you need to back up before syncing to your DVCS host of choice.
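A rough sketch of that, assuming a git repo dedicated to content and a Postgres database (table, repo, and remote names are made up):

#!/bin/sh
# Run from cron: dump the content tables, commit, push
cd /srv/site-content || exit 1
pg_dump -t posts -t uploads mydb > db/content-tables.sql
git add -A
git commit -m "content backup $(date +%Y-%m-%d)" || exit 0   # nothing changed
git push origin master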
If you're looking for a backup service to manage multiple servers with reporting, encryption, deduplication, etc., I'd love your feedback on our server product: https://www.jungledisk.com/products/server (starts at $5 per month).
Lots of people only do a full test of their backup solution when first installing it. Without constant validation of the backup->restore pipeline, it is easy to get into a bad situation and not realize it until it is too late.
OVH has a backup-by-FTP premium service, but the FTP server is accessible only from the VPS it backs up. Pretty useless, because in my experience when an OVH VPS fails, technical support has never been able to bring it back online.
[1] http://duplicity.nongnu.org/
http://mindfsck.net/incremental-backups-amazon-s3-centos-usi...
For the database, I use a second VPS running as a read-only slave. A script runs daily to create database backups on that VPS.
Make sure you check the status of backups; I send journald and syslog stuff to papertrail [0] and have email alerts on failures.
I manually verify the back-ups at least once a year, typically on World Back-up Day [1]
[0] https://papertrailapp.com/ [1] http://www.worldbackupday.com/en/
Stupid simple and stupid cheap. Install, select directories you want backed up, set it and forget it.
All for $7.00 a month.
Collect your files, rsync/scp/sftp them over.
Read only snapshots on the rsync.net side means even an attacker can't just delete all your previous backups.
I just use a simple scheduled AWS lambda to PUT to the redeploy webhook URL.
I use an IAM role with put-only permissions to a certain bucket. Then, if your box is compromised, the backups cannot be deleted or read. S3 can also be set up to automatically remove files older than X days, which is also very useful.
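A sketch of that kind of put-only policy attached to the instance's role; the role, policy, and bucket names are placeholders:

aws iam put-role-policy \
  --role-name backup-writer \
  --policy-name put-only-backups \
  --policy-document '{
    "Version": "2012-10-17",
    "Statement": [{
      "Effect": "Allow",
      "Action": "s3:PutObject",
      "Resource": "arn:aws:s3:::my-backup-bucket/*"
    }]
  }'

Without s3:GetObject, s3:DeleteObject, or list permissions, a compromised box can only add new objects.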
Then the script sends it to S3 using aws s3 sync. If versioning is enabled on the bucket, you get versioning applied for free, and you can ship your actual data and webdocs-type stuff up extremely fast; it's browsable via the console or tools. Set a retention policy however you like. The industry's best durability, and nearly the cheapest too.
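Something like this (bucket and paths are placeholders):

# Versioning on the bucket keeps old copies even though sync overwrites
aws s3 sync /srv/backup/ s3://my-backup-bucket/web01/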
I can't praise restic enough. It's fast, secure, easy to use and set up (golang) and the developer(s) are awesome!
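For anyone curious, the basic restic workflow is short; a sketch against an SFTP repository, with host and paths made up:

export RESTIC_PASSWORD="change-me"            # or use --password-file
restic -r sftp:user@backuphost:/srv/restic-repo init
restic -r sftp:user@backuphost:/srv/restic-repo backup /etc /var/www
restic -r sftp:user@backuphost:/srv/restic-repo snapshots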
Use pg_dump and tar, then just s3cp.
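A sketch of that pipeline, using aws s3 cp in place of whichever s3 copy tool you prefer (bucket and paths are placeholders):

STAMP=$(date +%Y-%m-%d)
pg_dump mydb | gzip > /tmp/mydb-${STAMP}.sql.gz
tar -czf /tmp/files-${STAMP}.tar.gz /var/www
aws s3 cp /tmp/mydb-${STAMP}.sql.gz s3://my-backup-bucket/db/
aws s3 cp /tmp/files-${STAMP}.tar.gz s3://my-backup-bucket/files/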
All the databases and other data are backed up to s3. For mysql, we use the python mysql-to-s3 backup scripts.
But the machines themselves are "backed up" by virtue of being able to be rebuilt with saltstack. We verify through nightly builds that we can bring a fresh instance up, with the latest dataset restored from s3, from scratch.
This makes it simple for us to switch providers, and lets us run our "production" instances locally on virtual machines running the exact same version of CentOS or FreeBSD we use in production.
If you're not using a modern Unix variant with ZFS... well, there isn't a good reason not to be.
You can also use https://r1softstorage.com/ and receive storage + an R1Soft license (block-based incremental backups) -- or just purchase the $5/month license from them and use whatever storage you want.