OpenZFS 2.0

(github.com)

364 points · ascom · 5y ago · 148 comments

boston_sre875y ago

Has there been any progress on the zfs on linux Linus disagreement front since this article?

https://arstechnica.com/gadgets/2020/01/linus-torvalds-zfs-s...

ed25519FUUU5y ago

zfs on linux available as root partition since 20.04. Working quite well I might add!

3np5y ago

That’s Ubuntu-specific where they provide their own kernel bundled with ZFS. It was working fine before 20.04 as well in the same way it does for other distros. Has nothing to do with the comment you’re replying to.

boomboomsubban5y ago

There's not much of a difference between building the kernel with the module included and building an independent kernel module. I can't figure out what question is being asked, but I don't see how Ubuntu would matter.

yjftsjthsd-h5y ago

It always worked, the question is how much work they have to do to work around a kernel that dislikes them.

gilrain5y ago

It's still a big pain if you like to keep your kernel relatively up to date. I switched to btrfs; it just working is worth the few extra warts over ZFS.

yjftsjthsd-h5y ago

> It's still a big pain if you like to keep your kernel relatively up to date.

Replacing a core system component with an out-of-repo version is always going to hurt, yes.

> I switched to btrfs; it just working is worth the few extra warts over ZFS.

I'm not sure I'd call "catastrophic failure and data loss" a "wart". In all my years of distro hopping, I've had 3 root filesystems become unbootable: 1 F2FS system early on, which I actually did manage to fsck out of, and 2 on an openSUSE Tumbleweed system using BTRFS as root.

xenophonf5y ago

I've had zero problems with kernel updates on Ubuntu 20.04 with ZFS on a natively encrypted root. I followed the instructions in the wiki, lightly modified for my hardware and workload:

https://gist.github.com/xenophonf/76fd44ae24772e457cb63d00c0...

`apt-get update && apt-get dist-upgrade -y` works as expected. I plan to switch to a similar config on my Lenovo laptop when I upgrade it to the next Ubuntu LTS release.

teddyfrozevelt5y ago

Adding on, I've been using ZFS as my root partition on Arch with the latest kernel and zfs-dkms and have never had a problem.

tpetry5y ago

Zstd compression with configurable levels is really interesting: you could first write every block at a level comparable to lz4 for very fast performance. Then, if a block has not been rewritten for some time, you recompress it at a higher level that gives more compression with comparable decompression performance.

So cold data (cold write, cold/hot read) will take less and less space over time while still having the same read performance.
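A minimal sketch of that idea, purely as an illustration: it uses Python's zlib as a stand-in for lz4/zstd, and the function names and levels are hypothetical. ZFS itself does not recompress blocks in place like this.

```python
import zlib

FAST_LEVEL = 1  # stand-in for an lz4-like "write it quickly" level
COLD_LEVEL = 9  # stand-in for a high zstd level used on cold data


def write_block(data: bytes) -> bytes:
    """Compress a freshly written block with the fast level."""
    return zlib.compress(data, FAST_LEVEL)


def recompress_if_smaller(stored: bytes) -> bytes:
    """Recompress a cold block at the high level; keep the old copy if that does not help."""
    data = zlib.decompress(stored)
    candidate = zlib.compress(data, COLD_LEVEL)
    return candidate if len(candidate) < len(stored) else stored


def read_block(stored: bytes) -> bytes:
    """Reads use the same decompressor regardless of the level used to write."""
    return zlib.decompress(stored)


if __name__ == "__main__":
    payload = b"openzfs 2.0 release notes\n" * 2000
    hot = write_block(payload)
    cold = recompress_if_smaller(hot)
    print(len(payload), len(hot), len(cold))
```

The reader never needs to know which level a block was written with, which is why the read path stays the same as data goes cold.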

KMag5y ago

That would be an even more interesting feature for NILFS2, as I understand it, its ring buffer structure requires moving the oldest unmodified blocks as the ring buffer write frontier approaches. Any blocks that are forced to be copied are by definition old and unmodified, and need to be moved anyway, so why not recompress? AFAIK, there are no plans for compression in NILFS, but I think it's an interesting idea.

rcthompson5y ago

My understanding is that for ZFS, things like this would require a mythical feature called "block pointer rewrite", the same feature required to implement out-of-band deduplication.

rincebrain5y ago

You are correct - ZFS very deeply hardcodes the assumption that data's location on disk will never change once written, and offline dedup or data migration of any sort would require changing that.

(It would also be a performance nightmare - you'd have a permanent indirection table you'd need to use for _everything_, and if you've ever seen how ZFS dedup performs with its indirection table not on dedicated SSDs, you can understand why this is terrible.)

tpetry5y ago

The block could still be rewritten from the view of zfs as long as it does not update the last-written timestamp (does zfs have this?). I was just describing what it would look like from a bird's-eye view.

rdc125y ago

Directly no, but if you moved the data to a new dataset, with a command that preserves the timestamp that would work (rsync -a or zfs send/recv), which could be run from a cronjob.

Compression settings are set at a per dataset level, so applying this to only some files in a dataset isn't practical.

throw0101a5y ago

Sadly dRAID (parity Declustered RAIDz) just missed the cut-off for 2.0, but it looks like it will be in 2.1:

* https://openzfs.github.io/openzfs-docs/Basic%20Concepts/dRAI...

* https://www.youtube.com/watch?v=jdXOtEF6Fh0

Nican5y ago

dRAID looks really fascinating, but the presentation is pretty abstract. Would it allow adding/removing drives from a pool, and allow ZFS to rebalance itself?

Would be great for home use, where I have a lot of drives that I collected over the years that are not the same size.

EDIT: The more I read into this, it still seems to assume that all drives must be of the same size.

diegocg5y ago

I don't think so. The essence of draid is that, instead of keeping a spare drive unused in case one of the working drives fails, it incorporates the spare drive into the array and uses it, but one drive's worth of free space is reserved randomly across the entire array.

That way, if one disk fails, the reserved space is used to write the data necessary to keep the array consistent. Because the free space is distributed randomly across the array, the write performance of a single drive doesn't become a bottleneck.

This is unrelated to the ability to remove drives from a pool (which is difficult to support in ZFS due to design constraints)
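A toy model of the distributed-spare idea described above may help. Everything here is an assumption for illustration: the random per-stripe layout follows the comment's description rather than dRAID's actual layout tables, and the function and parameter names are made up.

```python
import random
from collections import Counter


def rebuild_writes(n_drives, stripe_width, n_stripes, failed, seed=0):
    """Count where rebuild writes land after drive `failed` dies.

    Returns (dedicated, distributed):
      dedicated   - writes if every rebuilt chunk goes to one hot spare
      distributed - writes if each stripe has its own spare slot on a
                    pseudo-randomly chosen drive outside that stripe
    """
    rng = random.Random(seed)
    dedicated = Counter()
    distributed = Counter()
    for _ in range(n_stripes):
        # Pick the drives holding this stripe's data/parity, plus one more
        # drive that holds the stripe's reserved spare slot.
        layout = rng.sample(range(n_drives), stripe_width + 1)
        members, spare_slot = layout[:-1], layout[-1]
        if failed in members:
            dedicated["hot-spare"] += 1   # single drive absorbs everything
            distributed[spare_slot] += 1  # load is spread over survivors
    return dedicated, distributed


if __name__ == "__main__":
    ded, dist = rebuild_writes(n_drives=8, stripe_width=5, n_stripes=1000, failed=0)
    print("dedicated spare:", dict(ded))
    print("distributed spare:", dict(sorted(dist.items())))
```

In this model the same number of chunks is rebuilt either way; the difference is that no single surviving drive has to take all of the writes.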

hardwaresofton5y ago

Maybe this presentation by Mark will help?

dRAID, Finally![0]

[0]: https://www.youtube.com/watch?v=jdXOtEF6Fh0

ecnahc5155y ago

This sounds like synology hybrid raid, which uses lvm and mdadm together for something similar if I recall.

pantalaimon5y ago

You can already do that with btrfs.

Nican5y ago

I currently use btrfs with RAID1 at home, and it works great. But btrfs also does not have the track record for being the most stable filesystem as compared to ZFS.

[1] https://lore.kernel.org/linux-btrfs/20200627032414.GX10769@h... [2] https://lore.kernel.org/linux-btrfs/20200627030614.GW10769@h... [3] https://lore.kernel.org/linux-btrfs/20200520013255.GD10769@h...

rewtraw5y ago

You can do that with ZFS too, at least for mirrored sets (i.e. RAID10). It's possible to remove a vdev, and the pool will migrate the data to the remaining vdevs.

codetrotter5y ago

This is huge! And very exciting :D

One thing I am wondering about is this:

> Redacted zfs send/receive - Redacted streams allow users to send subsets of their data to a target system. This allows users to save space by not replicating unimportant data within a given dataset or to selectively exclude sensitive information. #7958

Let’s say I have a dataset tank/music-video-project-2020-12 or something and it is like 40 GB and I want to send a snapshot of it to a remote machine on an unreliable connection. Can I use the redacted send/recv functionality to send the dataset in chunks at a time and then at the end have perfect copy of it that I can then send incremental snapshots to?

kogir5y ago

zfs send supports a resume token (-t) to resume interrupted streams received with (-s). Just use normal send/receive until you have the full stream sent.
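A rough sketch of the resume flow, written as Python helpers that only build the command lines (nothing is executed). The target dataset name and the token value are placeholders; the `receive_resume_token` property is what the receiving side exposes after an interrupted `zfs receive -s`.

```python
import shlex


def initial_send(snapshot):
    """Full send of a snapshot; pipe this into resumable_receive() on the target."""
    return ["zfs", "send", snapshot]


def resumable_receive(target):
    """-s keeps partially received state so the stream can be resumed later."""
    return ["zfs", "receive", "-s", target]


def query_resume_token(target):
    """Run on the receiving side after an interruption to fetch the token."""
    return ["zfs", "get", "-H", "-o", "value", "receive_resume_token", target]


def resume_send(token):
    """Restart the stream from where the receiver left off."""
    return ["zfs", "send", "-t", token]


if __name__ == "__main__":
    src = "tank/music-video-project-2020-12@snap1"   # snapshot name is hypothetical
    dst = "backup/music-video-project-2020-12"       # hypothetical target dataset
    for cmd in (initial_send(src), resumable_receive(dst),
                query_resume_token(dst), resume_send("<token>")):
        print(shlex.join(cmd))
```

Once the full stream has landed, later incremental snapshots can be sent on top of it in the usual way.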

0xCMP5y ago

I think it's more if you want to not send scratch or cached files you can have it automatically remove it from the snapshot being sent

> Redacted send/receive is a three-stage process. First, a clone (or clones) is made of the snapshot to be sent to the target. In this clone (or clones), all unnecessary or unwanted data is removed or modified. This clone is then snapshotted to create the "redaction snapshot" (or snapshots).

Think of it like a selective sync in Dropbox or SyncThing at the FS level.
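To make the three stages concrete, here is a hedged sketch that only assembles the commands as lists. It assumes the `zfs redact` and `zfs send --redact` forms as I understand them from the man pages; all dataset, clone, and bookmark names are hypothetical.

```python
def redacted_send_plan(snapshot, clone, redaction_snap_name, bookmark):
    """Return the zfs commands for a redacted send, in order.

    Stage 2 (deleting or modifying unwanted files inside the mounted clone)
    happens with ordinary file tools and is therefore not a zfs command.
    """
    redaction_snapshot = f"{clone}@{redaction_snap_name}"
    return [
        # Stage 1: clone the snapshot that will be sent.
        ["zfs", "clone", snapshot, clone],
        # Stage 3: after editing the clone, snapshot it as the redaction snapshot.
        ["zfs", "snapshot", redaction_snapshot],
        # Record which blocks differ as a redaction bookmark on the source.
        ["zfs", "redact", snapshot, bookmark, redaction_snapshot],
        # Send the original snapshot minus the redacted blocks.
        ["zfs", "send", "--redact", bookmark, snapshot],
    ]


if __name__ == "__main__":
    for cmd in redacted_send_plan("tank/home@monday", "tank/home-redact",
                                  "trimmed", "monday-redacted"):
        print(" ".join(cmd))
```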

vorpalhex5y ago

That's a protocol problem, use a protocol such as rsync. You don't need to use redacted sends/recvs.

rleigh5y ago

rsync doesn't scale like zfs send/recv. It requires scanning of every file at both the source and destination to compute the delta to send. zfs snapshots and send/recv don't need to do that. The delta is already fully described by the snapshots themselves. zfs is also working with immutable snapshots. It guarantees the source and destination copies are identical; rsync can't do much about the source and destination being modified while it is running since it's reliant upon other users of the system not touching the data being synced.

That's not to say rsync doesn't work. It does. But it doesn't scale well, and the data integrity guarantees aren't there.
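A toy illustration of why the snapshot delta is cheap to find: in a copy-on-write tree every block records the transaction group (txg) in which it was written, and rewriting a leaf also rewrites its ancestors. The `Block` class and the tiny tree below are made up for the example and are not ZFS's real on-disk structures.

```python
from dataclasses import dataclass, field


@dataclass
class Block:
    name: str
    birth_txg: int
    children: list = field(default_factory=list)


def changed_since(block, from_txg, stats):
    """Collect leaf blocks written after from_txg, skipping unchanged subtrees."""
    stats["visited"] += 1
    if block.birth_txg <= from_txg:
        return []  # nothing below here changed since the older snapshot
    if not block.children:
        return [block.name]
    changed = []
    for child in block.children:
        changed.extend(changed_since(child, from_txg, stats))
    return changed


if __name__ == "__main__":
    tree = Block("root", 30, [
        Block("A", 10, [Block("a1", 10), Block("a2", 10)]),
        Block("B", 30, [Block("b1", 20), Block("b2", 30)]),
    ])
    stats = {"visited": 0}
    print(changed_since(tree, 20, stats), stats)
```

Subtree A is never descended into, whereas a file-level tool has to look at every file on both sides to reach the same conclusion.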

XorNot5y ago

rsync has its own issues if the connection has high latency though - zfs send was originally developed by a Sun engineer who wanted to speed up large transfers to servers in China, if I recall correctly.

nix235y ago

+1 for rsync, but with check-summing turned on, i think that's acceptable for 40GB.

RantyDave5y ago

It's not really enough for ZFS (unfortunately). It won't move snapshots, bookmarks etc.

anderspitman5y ago

I'd love to get rid of my FreeNAS VM and run ZFS directly on my Linux desktop, but having to mess with the kernel has kept me from attempting it so far. Maybe I'm worrying about nothing.

btrfs seems like the main alternative if you want native kernel support, but when I checked a couple years ago there seemed to be a lot of concerns about the stability. Is that still the case?

briskstorm5y ago

To mirror other comments, btrfs is pretty stable now and I run it on my server. The main problems now are with RAID5/6 profiles, the implementation has lots of issues still that can cause data loss[1]. Seems most of the core developers don't use those profiles so it hasn't been getting better. If you want to use RAID it would be safer to stick to RAID1/10[2]

[1] https://lore.kernel.org/linux-btrfs/20200627032414.GX10769@h...

[2] https://www.man7.org/linux/man-pages/man8/mkfs.btrfs.8.html#...

curt155y ago

On the btrfs mailing list [1], there are still sporadic reports of unrecoverable FS corruption for whatever reason. See [2], [3] for some recent examples.

[1] https://lore.kernel.org/linux-btrfs/

[2] https://lore.kernel.org/linux-btrfs/CAD7Y51i=mTDnEWEJtSnUsq=...

[3] https://lore.kernel.org/linux-btrfs/CAMXR++KUj2L7qpR7QZeiM2T...

viraptor5y ago

[3] is interesting, thanks for linking that. [2] has so many moving parts, I wouldn't expect it to be related to btrfs without more information. I mean, there's both the fs and the cache layer being resized down, with unknown method.

ariabuckles5y ago

Both openSUSE and [as of very recently] Fedora use btrfs by default, so btrfs support seems pretty stable these days.

(But as others have pointed out, there are options for using zfs on linux, too)

bmn__5y ago

Attempting to use zfs for the root partition is a huge headache because the software lives in the supplementary `filesystems` repo. https://build.opensuse.org/package/show/filesystems/zfs

1. It often happens that the main repo offers a new kernel, but the corresponding module is not ready on obs yet. This means upgrading to the latest rolling release cannot just happen at any time, but requires careful planning. This is a big inconvenience.

2. In the past dracut sometimes just failed to pick up the module for the initrd, causing a boot failure at the next system start. I could not figure out why, however this never happened with the first class supported ext/xfs.

3. The distro's boot/rescue media do not contain the driver. This means a third-party boot medium is required to go into a broken system, and repairing it when chroot is involved is now much more complicated because of the different distro.

ed25519FUUU5y ago

btrfs was a really underutilized filesystem. It still has some superior features to zfs (such as offline deduplication), but the momentum now is clearly with zfs.

Ericson23145y ago

ZFS is no extra work with NixOS! You just declare the filesystem type like any other in the config and it takes care of kernel modules and what-not.

pbronez5y ago

Sure - after you figure out NixOS lol

takeda5y ago

Actually no, NixOS is probably easier to use than other Linuxes. It gets more difficult when you need to package something new that it doesn't have, then you have to know the Nix language and how nixpkgs work.

brnt5y ago

Is there a guide for this? Sounds interesting!

jfb5y ago

For instance, I have:

     fileSystems."/zfs/media" =
        { device = "tank/media";
          fsType = "zfs";
        };

in my hardware-configuration.nix. tank/media is defined as using a legacy mount-point or whatever the ZFS terminology is. Done.

ETA: I mean, I had to do all the gruntwork to get the pool built, yeah. But once it was defined, getting it mounted and all the kernel bits and bobs set was trivial like that.

linsomniac5y ago

I've been running encrypted ZFS on 20.04 on my main workstation since it came out and it's worked great. Wrote up details here, it's a slight hack for encryption, no hack if you don't want crypto. https://linsomniac.gitlab.io/post/2020-04-09-ubuntu-2004-enc...

A friend did a video based on my blog: https://www.youtube.com/watch?v=PILrUcXYwmc

koolba5y ago

Why ZFS encryption vs unencrypted ZFS atop LUKS?

necheffa5y ago

Native ZFS encryption makes pool management easier. Less bookkeeping to import/export a pool. And the file system can make better decisions about performance and recovery.

phil215y ago

I wanted to say "as someone who tends to follow the unix philosophy" and realized the irony of saying that regarding ZFS...

That said, I generally agree with you in that do one thing and do it well is a laudable design goal. However, I also am very excited about encrypted ZFS for one main reason: backups.

Okay two. Snapshots and backups!

ZFS is absolutely amazing to use as a home NAS that does daily (or more) snapshots and then nightly differential syncs to a second location. In the past I had to run all my own infrastructure to do this, as the data was in the clear.

Now my ZFS nerd friend and I can simply swap backup space and have "zero knowledge" of the others' files, while retaining the amazing features of ZFS snapshots+zfs send/receive.

This also tickles the "create an encrypted ZFS backups as a service" service itch for me, but then I realize I'd be creating it for all 13 potential users of the service. That said, I'm sure rsync.net will offer this functionality shortly - which would make them a viable backup target for me.

brnt5y ago

Once you put a fs on top of something, don't you lose any guarantees of finding broken sectors when scrubbing?

This is why I really wish btrfs would get native encryption, but maybe my info is out of date.

yjftsjthsd-h5y ago

Why add another separate layer?

johnisgood5y ago

Does it offer plausible deniability like dm-crypt?

kbumsik5y ago

I personally use ZFS on Arch Linux. The DKMS package works almost out of the box and I haven't had any troubles. It takes a long time (but not too much) to compile though.

Or you can use the latest Ubuntu that is shipped with ZFS.

nvllsvm5y ago

Be aware of possible incompatibilities with the regular Arch kernel. I switched my NAS to linux-lts when a kernel point-release proved incompatible with openzfs.

kbumsik5y ago

Right, I forgot to mention I always use the LTS kernel, since I had a getrandom() boot problem last year. https://lwn.net/Articles/800509/

ed25519FUUU5y ago

Reporting in from Ubuntu 20.04 with ZFS on root. It works great. No issues so far, other than docker requires "zfs" as the storage-driver and doesn't support overlay2. See: https://bugs.launchpad.net/ubuntu/+source/zfs-linux/+bug/171...

kbumsik5y ago

Interesting. I thought overlay2 can be agnostic to the base filesystems.

heavyset_go5y ago

I use btrfs on both servers and laptops without a problem these days. This wasn't the case almost a decade ago, though, when I got bit by its then-instability.

paulsmal5y ago

You know Ubuntu has supported ZFS since 20.04. Experimental, but quite stable for me. Just select the file system during the installation process.

sliken5y ago

Or say apt install zfsutils-linux, that's it, not even a reboot.

paulsmal5y ago

Oh yeah, right. Installation method if one needs ZFS on root.

rodgerd5y ago

> but having to mess with the kernel has kept me from attempting it so far. Maybe I'm worrying about nothing.

For the most part, yes. Occasionally a kernel developer who seems to be bitter about a company that doesn't exist any more tries to break compat with ZFS, but it's generally smooth sailing on Fedora, Debian, and CentOS, with dkms handling the building of modules seamlessly.

plmu5y ago

Just use something else for your root file system, and zfs for the rest. I've been running ZoL for 10 years (on Arch) and had to recover a few times, but it was never difficult because of the totally standard setup except for the data disks.

weitzj5y ago

The easiest way is using the Proxmox installer, which has ZFS as a filesystem. Underneath it is a Debian 10 installation. Last time I tried, you could not enable ZFS encryption. I don’t know what the case is with OpenZFS 2.0.

Do we have encryption yet?

vetinari5y ago

Proxmox uses their own kernels, so they build the zfs modules as well. They are bundled in the pve-kernel package.

paulsmal5y ago

Ubuntu 20.10 has an option in installer to use ZFS encryption for root partition.

remram5y ago

Oh sweet, I don't think encryption was an option in 20.04. Is it still under an "experimental" option in the installer?

Jnr5y ago

I'm using btrfs and my system still works. :)

muxator5y ago

Me too. I have been running full btrfs for at least 5 years (single and RAID1 disks), with no data loss whatsoever. Now that there is support for swapfiles, I do not even need to create a dedicated partition for them.

nix235y ago

>ZFS directly on my Linux desktop

Use BTRFS, trust me, it's stable now... well, the commands are terrible compared to ZFS. All my servers run FreeBSD, but on the laptop and on one workstation I have had openSUSE Tumbleweed for about 2 years and it works great.

raegis5y ago

I've used btrfs for years with no problems ever. But, I see weekly reports on the btrfs reddit forum of the type "I was doing btrfs RAIDxyz, and I can't mount read/write" etc., so there do exist people who have issues with it today. If you do RAID on steroids, you might do some research before trying btrfs.

gpanders5y ago

> well the commands are terrible compared to ZFS

Really? I don’t think so, I find btrfs usage extremely straightforward and easy to grok. ZFS on the other hand has all that confusing lingo about vdevs, etc...

I get that this is subjective but I disagree.

rleigh5y ago

The Btrfs commands are very poor compared with what ZFS offers. The ZFS commands are organised around the end user: the system administrator. The Btrfs commands are not.

As an example, you're running low on space and need to find out which datasets (subvolumes) are using the most space. How do you do that? With ZFS it's a single command which runs in a few milliseconds. With Btrfs...

nix235y ago

Hey everyone has a different taste, but vdevs, datasets, and pool are for me much more logical than lv's and lg's (pun was NOT intended).

neolog5y ago

> the commands are terrible

what does that mean?

chungy5y ago

The "btrfs" tool has a lot of leaky abstractions, confusing intended usage, and gotchas all over the place. If you aren't a btrfs developer, it is difficult to know what exactly you want to do and how to accomplish it.

ZFS on the other hand has just two commands for common administration tasks: zpool and zfs. zpool controls pool-level operations, mainly ones that deal with the storage layer; zfs controls the logical file systems and volumes that are contained within a pool. The zpool and zfs commands have been meticulously crafted to not expose much of the underlying software architecture and focus only on what administrators want, and all of it is clearly documented.

There are actually a few other commands that come with ZFS if you really want or need to deal with low-level and difficult details, commands like zdb, zinject, zstreamdump. You almost never need any of them.

vetinari5y ago

For zfs specific features, there are 'zfs' and 'zpool' commands (well, binaries, and the first parameter is a command). For btrfs, there is 'btrfs'.

So I guess that the GP considers /usr/sbin/{zfs,zpool} more intuitive than /usr/sbin/btrfs.

nix235y ago

https://btrfs.wiki.kernel.org/index.php/Manpage/btrfs

>what does that mean?

Not functional but logical (for me)

arwineap5y ago

debian has zfs in the contrib repo since stretch; no manual hacking required just have to enable contrib

I switched my freebsd box over to debian about two years ago. No complaints so far :)

qalmakka5y ago

Finally, this means we have a way to share "real" filesystems on both FreeBSD and Linux. The only other filesystems you could open without issues on both are FAT and NTFS (through NTFS-3G), both of which are less than ideal for data you care about.

justinclift5y ago

Slightly off topic, but it seems like GitHub can't/won't display the user profile page for one of the OpenZFS developers:

https://github.com/behlendorf

For me, that gives a unicorn 100% of the time (tried across several minutes), instead of showing the developer profile.

Anyone else seeing that?

jclulow5y ago

It does, indeed, report that "This page is taking too long to load."!

justinclift5y ago

Yeah, it's still unicorning for me, about a day later. :(

rincebrain5y ago

Loaded in under 5 seconds flat for me, perhaps it's something strange with whatever edge server you're hitting?

justinclift5y ago

Could be, but if so it's persistent. It's about a day later now, and the page still won't load.

justinclift5y ago

Testing now, the page is finally loading. Page load time of ~2 days... that's different. ;)

bromonkey5y ago

It loaded for me earlier today, I think github is just having issues.

rodgerd5y ago

Congratulations - it's great to see the code unification on the two key ZFS platforms, and continuing to add useful features, especially around at-rest encryption.

Many thanks to the various OpenZFS contributors.

KMag5y ago

How's the memory consumption of ZFS without deduplication these days? I've got a couple of 4 TB drives connected to a single board ARM computer with 2 GB of RAM. I used to use btrfs, but switched to XFS after I accidentally filled up a drive and was unable to recover.

rincebrain5y ago

ZFS without dedup will just run slower with less RAM available for caching, up to a point (I think the lowest ARC size I've seen someone configure in recent memory is 128 MB? I believe 32 MB or so is the minimum, below which OpenZFS will just ignore you if you try to tell it to use less...)

I've seen people use it as a rootfs on RPis, and have personally run it on Pis for brief occasions without encountering any RAM problems.

mholt5y ago

I'm looking at setting up my first ZFS pool ('zpool'?) in a few weeks, on Linux. Will I be using OpenZFS or something else? Ubuntu 20.04.

(Sorry if noise; I'm just trying to get an idea of how relevant this 2.0 release is to me.)

iotku5y ago

> The ZFS on Linux project has been renamed OpenZFS! Both Linux and FreeBSD are now supported from the same repository making all of the OpenZFS features available on both platforms.

Previously it was called ZFS on Linux, but now ZFS development is unified on the "OpenZFS" codebase shared both between Linux and FreeBSD as much of the development effort for ZFS in general ended up there.

mholt5y ago

Ah, I was wondering what happened since I stopped hearing about "ZFS on Linux" so now I know what to search for. Thanks!

mlex5y ago

Just built a FreeNAS system over the past couple weeks and finished doing burn-in tests of my hard drives, wonder if I should wait and see how to install OpenZFS 2.0.0 before I create my storage config.

1over1375y ago

FreeNAS 12 (now named TrueNAS) is already using OpenZFS 2.0, or very nearly.

nraynaud5y ago

Does it support NFS 4.2? (fallocate, sparse files and server-side copy)

ed25519FUUU5y ago

Aren't ZFS upgrades to existing vdevs really simple? I don't see any reason why you need to wait.

mlex5y ago

That’s the idea I’ve gotten when looking around online. I figured I was in the uncommon situation of having a completely blank and ready system, so I could afford to just wait a few days.

1over1375y ago

Yes, ZFS upgrades are really simple, but they are one-way, you can't downgrade after.

rodgerd5y ago

They certainly seem to be within OpenZFS over the past few years.

voltagex_5y ago

Anyone know what version of Ubuntu Server this will land in?

cogman105y ago

Likely 21.04. I doubt they'll pull it into 20.10 or 20.04.

GlitchMr5y ago

Probably 21.04. 22.04 if you want an LTS release.

jstrong5y ago

hooray for zstd compression!

ed25519FUUU5y ago

Side note, they really should have in big-bold letters "DO NOT ENABLE DEDUPLICATION UNLESS YOU HAVE A TON OF RAM!" on their readme. That was a huge mistake on my part. The ram requirements are VERY high for good performance.

I realized how bad the performance was when it took about 2 hours to delete 1000 files.

freddie_mercury5y ago

It does already say that. This is what it says:

Deduplication is the process for removing redundant data at the block level, reducing the total amount of data stored. If a file system has the dedup property enabled, duplicate data blocks are removed synchronously. The result is that only unique data is stored and common components are shared among files.

Deduplicating data is a very resource-intensive operation. It is generally recommended that you have at least 1.25 GiB of RAM per 1 TiB of storage when you enable deduplication. Calculating the exact requirement depends heavily on the type of data stored in the pool.

Enabling deduplication on an improperly-designed system can result in performance issues (slow IO and administrative operations). It can potentially lead to problems importing a pool due to memory exhaustion. Deduplication can consume significant processing power (CPU) and memory as well as generate additional disk IO.
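As a quick back-of-the-envelope check of the rule of thumb quoted above (at least 1.25 GiB of RAM per 1 TiB of storage), here is a small sketch. The function name is made up, and the result is only a lower bound since the real requirement depends on the data.

```python
GIB_RAM_PER_TIB = 1.25  # rule of thumb from the quoted documentation


def dedup_ram_gib(pool_tib: float, gib_per_tib: float = GIB_RAM_PER_TIB) -> float:
    """Rough minimum RAM (GiB) suggested for dedup on a pool of pool_tib TiB."""
    if pool_tib < 0:
        raise ValueError("pool size must be non-negative")
    return pool_tib * gib_per_tib


if __name__ == "__main__":
    for size in (1, 8, 40):
        print(f"{size} TiB pool -> at least {dedup_ram_gib(size)} GiB RAM")
```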

1over1375y ago

That's not new with 2.0 though. It's forever been the case with ZFS. Everything that discusses dedupe basically says: 'don't use it'.

Mashimo5y ago

Most guides I read tell you that you should not enable DEDUP unless you know what you are doing, and that it will use a lot of RAM.

zmix5y ago

To me this sounds more like you didn't RTFM ;-)

hlandau5y ago

Will OpenZFS on Linux ever be integrated with the Linux page cache?

keeperofdakeys5y ago

Probably never. ZFS isn't just a filesystem, it was developed to be an entire storage system that's vertically integrated, so ARC is a fundamental part of the filesystem design.

ZFS also has a huge legacy. Right now the license (probably) prevents you from legally shipping a compiled zfs module with the linux kernel, just solving that seems insurmountable. It's also supported on Illumos and FreeBSD, trying to refactor it to use the linux page cache would have a chance of introducing bugs to these platforms.

RantyDave5y ago

ZFS isn't really designed for local 'temporary' file systems (IMHO). You don't really need to nest checksums, create snapshots or volume manage when you're slugging pages between ram and nvme.

nix235y ago

No, they have ARC and L2ARC. If you want the traditional thing, go to NILFS2 or BTRFS or, in the future, XFS (when they have full check-summing).

curt155y ago

>in the future XFS (when they have full check-summing).

Is this actually planned?

nix235y ago

YES! Step by step and keep XFS as stable as it is (the most trustworthy linux FS of them all)

kzrdude5y ago

OpenZFS is in fact a more prestigious name and it already sounds better than ZFS on Linux.

solarengineer5y ago

If you get on the calls, you’ll find zero hostility across the operating system devs. The focus is on OpenZFS, with the Linux branch gradually becoming the baseline for the FreeBSD work as well. Illumos (where OpenZFS originated after Illumos was formed following the OpenSolaris shutdown) hasn’t moved to this baseline yet due to the significant OS-level differences; instead, code is pulled between the “branches” as needed. The collaboration happens via email and regular calls.

j / k navigate · click thread line to collapse

148 comments

boston_sre875y ago

Has there been any progress on the zfs on linux Linus disagreement front since this article?

https://arstechnica.com/gadgets/2020/01/linus-torvalds-zfs-s...

ed25519FUUU5y ago

zfs on linux available as root partition since 20.04. Working quite well I might add!

3np5y ago

boomboomsubban5y ago

yjftsjthsd-h5y ago

It always worked, the question is how much work they have to do to work around a kernel that dislikes them.

gilrain5y ago

It's still a big pain if you like to keep your kernel relatively up to date. I switched to btrfs; it just working is worth the few extra warts over ZFS.

yjftsjthsd-h5y ago

> It's still a big pain if you like to keep your kernel relatively up to date.

Replacing a core system component with an out-of-repo version is always going to hurt, yes.

> I switched to btrfs; it just working is worth the few extra warts over ZFS.

2 more replies

xenophonf5y ago

I've had zero problems with kernel updates on Ubuntu 20.04 with ZFS on a natively encrypted root. I followed the instructions in the wiki, lightly modified for my hardware and workload:

https://gist.github.com/xenophonf/76fd44ae24772e457cb63d00c0...

`apt-get update && apt-get dist-upgrade -y` works as expected. I plan to switch to a similar config on my Lenovo laptop when I upgrade it to the next Ubuntu LTS release.

1 more reply

teddyfrozevelt5y ago

Adding on, I've been using ZFS as my root partition on Arch with the latest kernel and zfs-dkms and have never had a problem.

tpetry5y ago

So cold data (cold write, cold/hot read) will take less and less space over time while still having the same read performance.

KMag5y ago

rcthompson5y ago

My understanding is that for ZFS, things like this would require a mythical feature called "block pointer rewrite", the same feature required to implement out-of-band deduplication.

rincebrain5y ago

You are correct - ZFS hardcodes the assumption that data's location on disk will never change once written very deeply, and offline dedup/data migrating of any sort would require that.

tpetry5y ago

rdc125y ago

Directly no, but if you moved the data to a new dataset, with a command that preserves the timestamp that would work (rsync -a or zfs send/recv), which could be run from a cronjob.

Compression settings are set at a per dataset level, so applying this to only some files in a dataset isn't practical.

throw0101a5y ago

Sadly dRAID (parity Declustered RAIDz) just missed the cut-off for 2.0, but it looks like it will be in 2.1:

* https://openzfs.github.io/openzfs-docs/Basic%20Concepts/dRAI...

* https://www.youtube.com/watch?v=jdXOtEF6Fh0

Nican5y ago

dRAID looks really fascinating, but presentation is pretty abstract. Would it allow to add/remove drives from a pool, and allow ZFS to rebalance itself?

Would be great for home use, where I have a lot of drives that I collected over the years that are not the same size.

EDIT: The more I read into this, it still seems assume that all drives must be of the same size.

diegocg5y ago

This is unrelated to the ability to remove drives from a pool (which is difficult to support in ZFS due to design constraints)

hardwaresofton5y ago

Maybe this presentation by Mark will help?

dRAID, Finally![0]

[0]: https://www.youtube.com/watch?v=jdXOtEF6Fh0

ecnahc5155y ago

This sounds like synology hybrid raid, which uses lvm and mdadm together for something similar if I recall.

pantalaimon5y ago

You can already do that with btrfs.

Nican5y ago

I currently use btrfs with RAID1 at home, and it works great. But btrfs does not have the same track record for stability as ZFS.

rewtraw5y ago

You can do that with ZFS too, at least for mirrored sets (i.e. RAID10). It's possible to remove a vdev, and the pool will migrate the data to the remaining vdevs.

codetrotter5y ago

This is huge! And very exciting :D

One thing I am wondering about is this:

kogir5y ago

zfs send supports a resume token (-t) to resume interrupted streams received with (-s). Just use normal send/receive until you have the full stream sent.
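
A rough sketch of the resume flow mentioned above, again as Python that only builds argument lists. The function names are made up for illustration; the flags (`zfs receive -s`, `zfs send -t`) and the `receive_resume_token` property are the ones referred to in the comment, and the token value itself would come from the receiving side.

```python
def resumable_receive(dataset):
    """Receive with -s so an interrupted stream leaves a resume token behind."""
    return ["zfs", "receive", "-s", dataset]


def resume_token_query(dataset):
    """Command that prints the saved resume token of a partially received dataset."""
    return ["zfs", "get", "-H", "-o", "value", "receive_resume_token", dataset]


def resume_send(token):
    """Restart the send from where the receiver stopped, using the token."""
    # `zfs get` prints "-" when the property is unset, i.e. nothing to resume.
    if not token or token == "-":
        raise ValueError("no resume token; start a normal send instead")
    return ["zfs", "send", "-t", token]
```

The token is queried on the receiving host and passed back to the sender, so in practice the three commands run on two different machines.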

0xCMP5y ago

I think it's more for when you don't want to send scratch or cached files: you can have them automatically removed from the snapshot being sent.

Think of it like a selective sync in Dropbox or SyncThing at the FS level.

vorpalhex5y ago

That's a protocol problem, use a protocol such as rsync. You don't need to use redacted sends/recvs.

rleigh5y ago

That's not to say rsync doesn't work. It does. But it doesn't scale well, and the data integrity guarantees aren't there.

XorNot5y ago

nix235y ago

+1 for rsync, but with checksumming turned on; I think that's acceptable for 40GB.

RantyDave5y ago

It's not really enough for ZFS (unfortunately). It won't move snapshots, bookmarks etc.

anderspitman5y ago

I'd love to get rid of my FreeNAS VM and run ZFS directly on my Linux desktop, but having to mess with the kernel has kept me from attempting it so far. Maybe I'm worrying about nothing.

btrfs seems like the main alternative if you want native kernel support, but when I checked a couple years ago there seemed to be a lot of concerns about the stability. Is that still the case?

briskstorm5y ago

[1] https://lore.kernel.org/linux-btrfs/20200627032414.GX10769@h...

[2] https://www.man7.org/linux/man-pages/man8/mkfs.btrfs.8.html#...

curt155y ago

On the btrfs mailing list [1], there are still sporadic reports of unrecoverable FS corruption for whatever reason. See [2], [3] for some recent examples.

[1] https://lore.kernel.org/linux-btrfs/

[2] https://lore.kernel.org/linux-btrfs/CAD7Y51i=mTDnEWEJtSnUsq=...

[3] https://lore.kernel.org/linux-btrfs/CAMXR++KUj2L7qpR7QZeiM2T...

viraptor5y ago

ariabuckles5y ago

Both openSUSE and [as of very recently] Fedora use btrfs by default, so btrfs support seems pretty stable these days.

(But as others have pointed out, there are options for using zfs on linux, too)

bmn__5y ago

Attempting to use zfs for the root partition is a huge headache because the software lives in the supplementary `filesystems` repo. https://build.opensuse.org/package/show/filesystems/zfs

ed25519FUUU5y ago

btrfs was a really underutilized filesystem. It still has some superior features to zfs (such as offline deduplication), but the momentum now is clearly with zfs.

Ericson23145y ago

ZFS is no extra work with NixOS! You just declare the filesystem type like any other in the config and it takes care of kernel modules and what-not.

pbronez5y ago

Sure - after you figure out NixOS lol

takeda5y ago

brnt5y ago

Is there a guide for this? Sounds interesting!

jfb5y ago

For instance, I have:

     fileSystems."/zfs/media" =
        { device = "tank/media";
          fsType = "zfs";
        };

in my hardware-configuration.nix. tank/media is defined as using a legacy mount-point or whatever the ZFS terminology is. Done.

ETA: I mean, I had to do all the gruntwork to get the pool built, yeah. But once it was defined, getting it mounted and all the kernel bits and bobs set was trivial like that.

linsomniac5y ago

A friend did a video based on my blog: https://www.youtube.com/watch?v=PILrUcXYwmc

koolba5y ago

Why ZFS encryption vs unencrypted ZFS atop LUKS?

necheffa5y ago

Native ZFS encryption makes pool management easier. Less bookkeeping to import/export a pool. And the file system can make better decisions about performance and recovery.

phil215y ago

I wanted to say "as someone who tends to follow the unix philosophy" and realized the irony of saying that regarding ZFS...

That said, I generally agree with you in that do one thing and do it well is a laudable design goal. However, I also am very excited about encrypted ZFS for one main reason: backups.

Okay two. Snapshots and backups!

Now my ZFS nerd friend and I can simply swap backup space and have "zero knowledge" of the others' files, while retaining the amazing features of ZFS snapshots+zfs send/receive.
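
A small sketch of the "zero knowledge" backup swap described above, assuming a raw send (`zfs send -w`) so the blocks stay encrypted in transit and on the friend's pool. The snapshot, host, and dataset names are hypothetical, and the code only renders the pipeline as text rather than running it.

```python
import shlex


def raw_backup_pipeline(snapshot, remote_host, remote_dataset):
    """Render a raw (still encrypted) send piped to zfs receive on a remote host."""
    # -w (--raw) sends blocks exactly as stored, so the receiver never needs the key.
    send = ["zfs", "send", "-w", snapshot]
    recv = ["ssh", remote_host, shlex.join(["zfs", "receive", remote_dataset])]
    return f"{shlex.join(send)} | {shlex.join(recv)}"


if __name__ == "__main__":
    # Hypothetical names, printed for review rather than executed.
    print(raw_backup_pipeline("tank/home@weekly", "friend.example", "friendpool/backups/me"))
```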

brnt5y ago

Once you put a fs on top of something, don't you lose any guarantee of finding broken sectors when you scrub?

This is why I really wish btrfs would get native encryption, but maybe my info is out of date.

yjftsjthsd-h5y ago

Why add another separate layer?

johnisgood5y ago

Does it offer plausible deniability like dm-crypt?

kbumsik5y ago

I personally use ZFS on Arch Linux. The DKMS package works almost out of the box and I haven't had any troubles. It takes a long time (but not too much) to compile though.

Or you can use the latest Ubuntu that is shipped with ZFS.

nvllsvm5y ago

Be aware of possible incompatibilities with the regular Arch kernel. I switched my NAS to linux-lts when a kernel point-release proved incompatible with openzfs.

kbumsik5y ago

Right, I forgot to mention I always use the LTS kernel, since I had a getrandom() boot problem last year. https://lwn.net/Articles/800509/

ed25519FUUU5y ago

kbumsik5y ago

Interesting. I thought overlay2 can be agnostic to the base filesystems.

heavyset_go5y ago

I use btrfs on both servers and laptops without a problem these days. This wasn't the case almost a decade ago, though, when I got bit by its then-instability.

paulsmal5y ago

You know Ubuntu has supported ZFS since 20.04. Experimental, but quite stable for me. Just select the file system during the installation process.

sliken5y ago

Or say apt install zfsutils-linux, that's it, not even a reboot.

paulsmal5y ago

Oh yeah, right. Installation method if one needs ZFS on root.

rodgerd5y ago

> but having to mess with the kernel has kept me from attempting it so far. Maybe I'm worrying about nothing.

plmu5y ago

weitzj5y ago

Do we have encryption yet?

vetinari5y ago

Proxmox uses their own kernels, so they build the zfs modules as well. They are bundled in the pve-kernel package.

paulsmal5y ago

Ubuntu 20.10 has an option in installer to use ZFS encryption for root partition.

remram5y ago

Oh sweet, I don't think encryption was an option in 20.04. Is it still under an "experimental" option in the installer?

Jnr5y ago

I'm using btrfs and my system still works. :)

muxator5y ago

nix235y ago

>ZFS directly on my Linux desktop

raegis5y ago

gpanders5y ago

> well the commands are terrible compared to ZFS

Really? I don’t think so, I find btrfs usage extremely straightforward and easy to grok. ZFS on the other hand has all that confusing lingo about vdevs, etc...

I get that this is subjective but I disagree.

rleigh5y ago

The Btrfs commands are very poor compared with what ZFS offers. The ZFS commands are organised around the end user: the system administrator. The Btrfs commands are not.

nix235y ago

Hey everyone has a different taste, but vdevs, datasets, and pool are for me much more logical than lv's and lg's (pun was NOT intended).

neolog5y ago

> the commands are terrible

what does that mean?

chungy5y ago

vetinari5y ago

For zfs specific features, there are 'zfs' and 'zpool' commands (well, binaries, and the first parameter is a command). For btrfs, there is 'btrfs'.

So I guess that the GP considers /usr/sbin/{zfs,zpool} more intuitive than /usr/sbin/btrfs.

nix235y ago

https://btrfs.wiki.kernel.org/index.php/Manpage/btrfs

>what does that mean?

Not functional but logical (for me)

arwineap5y ago

Debian has had ZFS in the contrib repo since stretch; no manual hacking required, you just have to enable contrib.

I switched my freebsd box over to debian about two years ago. No complaints so far :)

qalmakka5y ago

justinclift5y ago

Slightly off topic, but it seems like GitHub can't/won't display the user profile page for one of the OpenZFS developers:

https://github.com/behlendorf

For me, that gives a unicorn 100% of the time (tried across several minutes), instead of showing the developer profile.

Anyone else seeing that?

jclulow5y ago

It does, indeed, report that "This page is taking too long to load."!

justinclift5y ago

Yeah, it's still unicorning for me, about a day later. :(

rincebrain5y ago

Loaded in under 5 seconds flat for me, perhaps it's something strange with whatever edge server you're hitting?

justinclift5y ago

Could be, but if so it's persistent. It's about a day later now, and the page still won't load.

justinclift5y ago

Testing now, the page is finally loading. Page load time of ~2 days... that's different. ;)

bromonkey5y ago

It loaded for me earlier today, I think GitHub is just having issues.

rodgerd5y ago

Congratulations - it's great to see the code unification on the two key ZFS platforms, and continuing to add useful features, especially around at-rest encryption.

Many thanks to the various OpenZFS contributors.

KMag5y ago

rincebrain5y ago

I've seen people use it as a rootfs on RPis, and have personally run it on Pis for brief occasions without encountering any RAM problems.

mholt5y ago

I'm looking at setting up my first ZFS pool ('zpool'?) in a few weeks, on Linux. Will I be using OpenZFS or something else? Ubuntu 20.04.

(Sorry if noise; I'm just trying to get an idea of how relevant this 2.0 release is to me.)

iotku5y ago

> The ZFS on Linux project has been renamed OpenZFS! Both Linux and FreeBSD are now supported from the same repository making all of the OpenZFS features available on both platforms.

mholt5y ago

Ah, I was wondering what happened since I stopped hearing about "ZFS on Linux" so now I know what to search for. Thanks!

mlex5y ago

1over1375y ago

FreeNAS 12 (now named TrueNAS) is already using OpenZFS 2.0, or very nearly.

nraynaud5y ago

Does it support NFS 4.2? (fallocate, sparse files and server-side copy)

ed25519FUUU5y ago

Aren't ZFS upgrades to existing vdevs really simple? I don't see any reason why you need to wait.

mlex5y ago

That’s the idea I’ve gotten when looking around online. I figured I was in the uncommon situation of having a completely blank and ready system, so I could afford to just wait a few days.

1over1375y ago

Yes, ZFS upgrades are really simple, but they are one-way, you can't downgrade after.

rodgerd5y ago

They certainly seem to be within OpenZFS over the past few years.

voltagex_5y ago

Anyone know what version of Ubuntu Server this will land in?

cogman105y ago

Likely 21.04. I doubt they'll pull it into 20.10 or 20.04.

GlitchMr5y ago

Probably 21.04. 22.04 if you want an LTS release.

jstrong5y ago

hooray for zstd compression!

ed25519FUUU5y ago

I realized how bad the performance was when it took about 2 hours to delete 1000 files.

freddie_mercury5y ago

It does already say that. This is what it says:

1over1375y ago

That's not new with 2.0 though. It's forever been the case with ZFS. Everything that discusses dedupe basically says: 'don't use it'.

Mashimo5y ago

Most guides I read tell you that you should not enable dedup unless you know what you are doing, and that it will use a lot of RAM.
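
To put a rough number on "a lot of RAM", here is a back-of-the-envelope estimate. It assumes the commonly quoted rule of thumb of about 320 bytes of dedup table per unique block and the default 128 KiB recordsize; both figures are assumptions for illustration, and real pools with smaller blocks need more.

```python
def ddt_ram_estimate(pool_bytes, recordsize=128 * 1024, bytes_per_entry=320):
    """Rough in-memory dedup table size, assuming one entry per unique block."""
    blocks = pool_bytes // recordsize
    return blocks * bytes_per_entry


if __name__ == "__main__":
    tib = 1024 ** 4
    # Worst case: every block in 1 TiB of data is unique.
    print(ddt_ram_estimate(tib) / 1024 ** 3, "GiB per TiB")
```

Under these assumptions the table costs about 2.5 GiB of RAM per TiB of unique data, which is where the usual "don't enable it casually" advice comes from.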

zmix5y ago

To me this sounds more like you didn't RTFM ;-)

hlandau5y ago

Will OpenZFS on Linux ever be integrated with the Linux page cache?

keeperofdakeys5y ago

Probably never. ZFS isn't just a filesystem, it was developed to be an entire storage system that's vertically integrated, so ARC is a fundamental part of the filesystem design.

RantyDave5y ago

ZFS isn't really designed for local 'temporary' file systems (IMHO). You don't really need to nest checksums, create snapshots or volume manage when you're slugging pages between ram and nvme.

nix235y ago

No, they have ARC and L2ARC. If you want the traditional thing, go to NILFS2 or BTRFS, or in the future XFS (when they have full checksumming).

curt155y ago

>in the future XFS (when they have full check-summing).

Is this actually planned?

nix235y ago

YES! Step by step and keep XFS as stable as it is (the most trustworthy linux FS of them all)

kzrdude5y ago

OpenZFS is in fact a more prestigious name and it already sounds better than ZFS on Linux.

solarengineer5y ago
