If you look at the graphs, they corroborate this. The "native" disk never really exceeds 500 to 600 MB/s, which is about as fast as my SSD goes. The hypervisors, however, are reaching multiple GB/s. It must be RAM.
Also, re: "I'm not sure what was actually benchmarked": the benchmarking method is covered at the bottom of the post. I realize it isn't extremely detailed, so if you have any questions, I'd be happy to answer.
A typical thing performance benchmarks do to negate guest OS caching is to process significantly more data than the guest's available RAM. For example, if the guest OS RAM is set to 512 MB, process 10 GB of random data. Of course, then the question is how to get the random data, as you don't want to end up testing the random generator ;) or your host OS caching.
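A rough sketch of that approach in Python (file name, chunk size, and the 20x multiplier are my own choices; 20 times a 512 MB guest gives the 10 GB above). The key point is that the random data is generated once, before the timed pass:

    import os, time

    PATH = "random.bin"                  # hypothetical scratch file
    GUEST_RAM = 512 * 1024 * 1024        # guest capped at 512 MB
    SIZE = 20 * GUEST_RAM                # ~10 GB, far more than fits in any cache
    CHUNK = 8 * 1024 * 1024

    # Generate the random data once, outside the timed run, so the
    # benchmark measures the disk and not the random generator.
    with open(PATH, "wb") as f:
        written = 0
        while written < SIZE:
            f.write(os.urandom(CHUNK))
            written += CHUNK

    # Timed pass: sequential read of the whole file.
    start = time.perf_counter()
    with open(PATH, "rb") as f:
        while f.read(CHUNK):
            pass
    elapsed = time.perf_counter() - start
    print(f"read {SIZE / 2**30:.1f} GiB in {elapsed:.1f} s")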
Another way to make sure you are testing data committed to disk is to include a "shut down guest OS" step and measure the total time until the guest has fully shut down.
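In Vagrant terms that could be as simple as timing `vagrant halt` right after the write benchmark (a sketch; assumes the benchmark just finished inside the guest):

    import subprocess, time

    # Any writes still sitting in guest or hypervisor caches have to be
    # flushed before the VM is fully down, so the shutdown time captures them.
    start = time.perf_counter()
    subprocess.run(["vagrant", "halt"], check=True)
    print(f"guest fully down after {time.perf_counter() - start:.1f} s")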
I know that at least VMware has the ability to turn off disk caching (in Fusion: Settings, Advanced, "Hard Disk Buffering" <- disable). I am not aware of a similar feature in VirtualBox; it might exist, though.
Even though you tested the same guest OS, we don't know whether the virtual hard disk adapters were using the same drivers. Performance differs between IDE/SATA/SCSI drivers: SCSI drivers have queue depths, IDE drivers do not.
Due to page cache usage, it's hard to say what this benchmark is comparing. The I/O pattern seen by the actual disk, shared folder, or NFS may differ between benchmark runs; it all depends on the amount of RAM available, the state of the cache, readahead, write-behind, etc.
Please rerun the benchmark with -I (direct I/O) to get an apples-to-apples comparison.
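For context, direct I/O means opening the file with O_DIRECT so reads bypass the page cache. A minimal Python illustration of what that implies (Linux-only; "testfile" is a placeholder, and O_DIRECT needs block-aligned buffers, hence the anonymous mmap, which is page-aligned):

    import mmap
    import os

    BLOCK = 4096
    fd = os.open("testfile", os.O_RDONLY | os.O_DIRECT)  # Linux-specific flag
    buf = mmap.mmap(-1, BLOCK)   # page-aligned buffer for direct I/O

    total = 0
    while True:
        n = os.readv(fd, [buf])  # each read really goes to the device
        if n == 0:
            break
        total += n
    os.close(fd)
    print(f"read {total} bytes without touching the page cache")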
...
Spoke too soon. A Google search shows there are some methods [1], but their use cases are different.
Just think of the Linux kernel and Linux distributions.
Perhaps this support can be added in a later version of Vagrant.
So I suppose the host system caches the reads. Also, how could it possibly be true that native writes are slower than virtual writes?
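One way that happens: write() returns as soon as the data is sitting in a cache (the guest's, the hypervisor's, or the host's), and only fsync() waits for the device. A quick sketch showing the gap (scratch file name and size are arbitrary):

    import os
    import time

    data = os.urandom(64 * 1024 * 1024)   # 64 MiB, generated up front

    fd = os.open("scratch.bin", os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    start = time.perf_counter()
    os.write(fd, data)                    # returns once data is in the page cache
    cached = time.perf_counter() - start
    os.fsync(fd)                          # blocks until it actually hits the disk
    synced = time.perf_counter() - start
    os.close(fd)

    print(f"write() returned after {cached:.3f} s, fsync() after {synced:.3f} s")

A hypervisor that acknowledges the guest's flushes before the data reaches the physical disk will look faster than bare metal on exactly this kind of test.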
It is interesting that sometimes the native filesystem within the virtual machine outperforms the native filesystem on the host machine. This test uses raw read system calls with zero user-space buffering. It is very likely that the hypervisors do buffering for reads from their virtual machines, so they’re seeing better performance from not context switching to the native kernel as much. This theory is further supported by looking at the raw result data for fread benchmarks. In those tests, the native filesystem beats the virtual filesystems every time.
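To make the read()-vs-fread() distinction concrete, here is a small comparison (in Python rather than C; "bigfile" is a placeholder, and the 1 MiB buffer size is arbitrary). Run each pass against a cold cache for a fair comparison:

    import os
    import time

    PATH = "bigfile"   # placeholder for the benchmark's test file
    CHUNK = 4096

    # Raw read(2): one syscall per chunk, no user-space buffering.
    start = time.perf_counter()
    fd = os.open(PATH, os.O_RDONLY)
    while os.read(fd, CHUNK):
        pass
    os.close(fd)
    raw = time.perf_counter() - start

    # Buffered reads (the fread() analogue): the 1 MiB buffer turns
    # many small requests into a few large syscalls.
    start = time.perf_counter()
    with open(PATH, "rb", buffering=2**20) as f:
        while f.read(CHUNK):
            pass
    buffered = time.perf_counter() - start

    print(f"raw read(2): {raw:.3f} s, buffered: {buffered:.3f} s")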
We ended up using the synchronization feature in PyCharm to continually rsync files from the native FS into the VirtualBox instance. Huge perf improvement, but a little more cumbersome for the developers. So far it has been working well; PyCharm's sync feature does what it is supposed to.
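A bare-bones stand-in for that sync loop, if you're not using PyCharm (port 2222 is Vagrant's default forwarded SSH port; the paths and interval here are made up):

    import subprocess
    import time

    # Push local changes into the guest every few seconds, mirroring
    # deletes so the two trees stay identical.
    while True:
        subprocess.run([
            "rsync", "-az", "--delete",
            "-e", "ssh -p 2222",
            "./src/", "vagrant@127.0.0.1:/home/vagrant/src/",
        ], check=True)
        time.sleep(5)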
Thought it might be of interest on HN as well.