This is leading to rapid progress in clustered/distributed filesystems; OrangeFS [1] is even built into the Linux kernel now. There are also commercial companies like Avere [2] that make filers running on object storage, with sophisticated caching to provide a fast, networked, yet durable filesystem.
Kubernetes is also changing the game with container-native storage. This seems to be the most promising model for the future: K8S takes care of orchestrating all the complexities of replicas and stateful containers, while storage is just another container-based service using whatever volumes are available to the nodes underneath. Portworx [3] is the leading commercial option today, with Rook and OpenEBS [4] catching up quickly.
https://aws.amazon.com/about-aws/whats-new/2017/09/amazon-ec...
Twenty years ago, software was hosted on fragile single-node servers with fragile physical hard disks. Programmers would read and write files directly to and from the disk, and learn the hard way that this left their systems susceptible to corruption if things crashed in the middle of a write. So behold! People began to use relational databases, which offered ACID guarantees and were designed from the ground up to solve that problem.
Now we have a resource (spot instances) whose unreliability is a featured design constraint and OP's advice is to just mount the block storage over the network and everything will be fine?
Here's hoping OP is taking frequent snapshots of their volumes because it sure sounds like data corruption is practically a statistical guarantee if you take OP's advice without considering exactly how state is being saved on that EBS volume.
A spot instance interruption isn't a system crash, it's a shutdown signal. Storing your important spot instance data on EBS is recommended by AWS. If your application can't handle a normal system shutdown without losing data, your application is at fault, not your system setup.
>exactly how state is being saved on that EBS volume
Files are written to a filesystem, which is cleanly unmounted at shutdown when an interruption happens.
Unless something in the system shutdown fails to give the application what it needs (for instance, time) to shut down cleanly. Which is entirely possible, considering that Amazon sells you the spot instance on the explicit understanding that it can hand the hardware to somebody willing to pay more at any time. Nowhere in its documentation does Amazon guarantee the time needed for a clean shutdown (only that a two-minute warning will be available via its proprietary mechanism, if you architect your application to monitor for it), and you would be ill-advised not to architect for that.
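That two-minute warning surfaces through the instance metadata service, so a shutdown agent has to poll for it. A minimal shell sketch, assuming the documented `spot/instance-action` metadata path (the poweroff action in the comment is just an example, not a prescribed handler):

```shell
# Poll the EC2 instance metadata service for a spot interruption
# notice. The endpoint returns 404 until AWS issues the two-minute
# warning, then 200 with a small JSON body describing the action.
METADATA_URL="http://169.254.169.254/latest/meta-data/spot/instance-action"

# Returns 0 (true) when an interruption notice is present.
interruption_pending() {
  status=$(curl -s -o /dev/null -w '%{http_code}' --max-time 2 "$METADATA_URL")
  [ "$status" = "200" ]
}

# Example driver: check every 5 seconds, then shut down cleanly.
# while ! interruption_pending; do sleep 5; done
# systemctl poweroff
```

Whether two minutes is enough for your daemons to flush and unmount is exactly the question the parent raises.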
> Storing your important spot instance data on EBS is recommended by AWS
Because EBS itself is reasonably reliable. If you have configuration data (e.g. in /etc) for a legacy application that isn't managed, it's reasonable to keep that data on EBS, since it's rarely written to and writes are generally human-initiated and human-monitored (with operations policy possibly mandating a snapshot before any changes are made).
That's still very different from daemon writes to /var. Take, for instance, the PostgreSQL documentation, which warns that a snapshot must include the WAL files in order to be recoverable, and that restoring is quite difficult if your WAL files live on a different mount: https://www.postgresql.org/docs/10/static/backup-file.html
You need to understand precisely how your application is treating your storage and act accordingly. Thinking that all applications interact with storage the same way is dangerous and liable to cause data corruption and loss. That's all.
You're assuming that people are saving their state in databases to begin with. If you're saving state to a database in production, typically you're communicating with that database over a network connection, and not running the database on the same machine as your application. Containerizing databases is a whole separate issue.
OP's specific example is saving /var/opt/gitlab to an EBS volume and expecting to be able to move it from one spot instance to another without corruption. That strikes me as insane.
- Yes, you get a notification, but it's a proprietary notification scheme that your application must be designed to poll for. Why can't Amazon use standard signals like SIGPWR to indicate imminent shutdown?
- Just because it isn't smart for non-spot instances doesn't suddenly make it smart for spot instances ;)
https://aws.amazon.com/about-aws/whats-new/2017/09/amazon-ec...
ec2-spotter classic uses this, but you can also make a pivoting AMI of your favourite Linux distribution.
One thing to watch out for is keeping the OS's automatic kernel updates working. AMIs are rarely updated, and you're going to have a "damn vulnerable Linux" if you don't apply updates just after booting a new image.
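One way to paper over a stale AMI is a boot-time patch step. A sketch as a user-data fragment (Debian/Ubuntu flavoured; the reboot-required marker file is Ubuntu-specific, and unattended-upgrades is the longer-term fix):

```shell
#!/bin/bash
# user-data sketch: bring a stale AMI up to date right after first boot.
apt-get update
DEBIAN_FRONTEND=noninteractive apt-get -y upgrade
# Ubuntu drops this marker when an update (e.g. a new kernel) needs a reboot.
[ -f /var/run/reboot-required ] && reboot
```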
1) https://aws.amazon.com/about-aws/whats-new/2017/09/amazon-ec...
I suppose it's a decent solution if you don't want to deal with prefixes.
* https://github.com/sevagh/goat (my own)
* https://github.com/UKHomeOffice/smilodon
This solution looks good, yet only applies to single-instance scenarios. I presume this kind of thinking might move forward with EFS + chroot for an actually scalable solution that cannot be run on Elastic Beanstalk.
http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/spot-inte...
Learn something new every day. :)
https://aws.amazon.com/blogs/aws/new-ec2-spot-instance-termi...
Personally, because my needs aren't constant. I might need two cores for two months followed by 100 cores for a week.
I would look at providers like OVH, or even cheaper ones (Treudler, TransIP, RamNode, etc.). For example, an SSD VPS with 2 vCPUs, 8 GB RAM and 40 GB SSD is $13.49 per month from OVH.
(PS: Don’t use DigitalOcean; they tend to steal your credit if they feel like it. Lost 100 bucks of "promotional credit" that way, with only a few days' notice.)
With a few on-boot scripts to attach-volumes / start-containers, it should be fairly easy to get going as well.
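For the attach-volumes part, such a boot script can look roughly like this (the volume ID, device name, and container name are all hypothetical; assumes the AWS CLI is installed and the instance role allows AttachVolume):

```shell
#!/bin/bash
# Boot sketch: attach a known EBS volume, mount it, start the container
# that uses it. All identifiers below are placeholders.
VOLUME_ID="vol-0123456789abcdef0"   # hypothetical
DEVICE="/dev/xvdf"                  # hypothetical
MOUNTPOINT="/data"

INSTANCE_ID=$(curl -s http://169.254.169.254/latest/meta-data/instance-id)

aws ec2 attach-volume --volume-id "$VOLUME_ID" \
  --instance-id "$INSTANCE_ID" --device "$DEVICE"
# Block until the volume is actually attached before mounting.
aws ec2 wait volume-in-use --volume-ids "$VOLUME_ID"

mkdir -p "$MOUNTPOINT"
mount "$DEVICE" "$MOUNTPOINT"
docker start my-stateful-service    # hypothetical container name
```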
[1] https://engineering.semantics3.com/the-instance-is-dead-long...
Edit: or use AWS EFS
And since it's shared, you don't need to replicate data across multiple nodes... so if 10 compute nodes needs access to the data set, they can all just read it from the same EFS filesystem, no need to download it 10 times to each compute node.
So EFS can still be very cost effective compared to EBS.
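Mounting the same EFS filesystem from every node is just an NFSv4.1 mount. A sketch with a hypothetical filesystem ID and region, using the mount options AWS's documentation suggests:

```shell
# Run on each compute node; fs-12345678 and us-east-1 are placeholders.
sudo mkdir -p /mnt/efs
sudo mount -t nfs4 \
  -o nfsvers=4.1,rsize=1048576,wsize=1048576,hard,timeo=600,retrans=2 \
  fs-12345678.efs.us-east-1.amazonaws.com:/ /mnt/efs
```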
A positive thing with EFS is that it can be shared across AZ while EBS needs to be snapshotted and then imported to the other AZ.
Attaching and detaching volumes is a good idea, but I wouldn't use that to keep state.
You will get a lot of benefit out of it, but may lose some performance, which is fine in 99% of cases.