One case where this might go wrong: if you accidentally delete a file, the deletion replicates and you're left without a backup. Replication is nice, but it is not a replacement for backup.
I feel like a filesystem with native snapshot support (ZFS, btrfs), replicated once (replication is also native in ZFS), obsoletes conventional 3-2-1 backup systems. It’s technically only 2 copies, but you’re protected from all the same failure modes.
To clarify, all data transferred from Linux ZFS to FreeBSD-based systems, or vice versa, is copied using restic or rsync.
I only use ZFS replication for Linux-to-Linux transfers, and in those cases both machines run the exact same operating system and version of OpenZFS.
Eh, especially if you're just using Linux and FreeBSD (doubly so now that they're both using OpenZFS), it's easy enough to keep pool features compatible. Obviously you need to either pin to a compatible feature level or avoid upgrading the pool, but I don't think it's terribly hard.
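For example, OpenZFS 2.1 and later have a pool-level "compatibility" property for exactly this. A minimal sketch (the feature-set name is just one of the files shipped in /usr/share/zfs/compatibility.d; pick whichever matches your older system):

    # create a new pool pinned to a feature set both hosts understand
    zpool create -o compatibility=openzfs-2.0-linux tank mirror /dev/sda /dev/sdb

    # or restrict an existing pool, so future 'zpool upgrade' runs
    # only enable features from that set
    zpool set compatibility=openzfs-2.0-linux tank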
This was before FreeBSD used OpenZFS; maybe it's easier now. But my point still stands: you're off the beaten track at a time when you want to minimize risk.
Heh, you must have really foul luck. I've been using it on FreeBSD since 2007ish and have never had any issues. Same pool, though no longer the same disks, as they have all been replaced since. The hardware around it was replaced too, and FreeBSD 8 was upgraded through every release since, currently running 14.0.
Never lost any data. There isn't much software that could claim that.
Meanwhile I lost (I had backups) all my data twice on btrfs. I know it is more stable now, but I certainly won't ever use it again. Even HAMMER1 (I would love to use HAMMER2, but until my server dies I will stay with FreeBSD) lost it only once, and even in that case, after a debugging IRC session with Matt Dillon, I was able to recover most files.
The only thing that pisses me off is that Kubuntu doesn't support it through the installer (yes, I could do it manually, but I've been there with Fedora and I am sick of tracking whether the initramfs was updated or ending up in an unbootable situation, easily solvable but annoying), and I am now forced to use Ubuntu with KDE on my workstation. But that is not ZFS's fault.
> The only thing that pisses me off is that Kubuntu doesn't support it through the installer ...
For my personal workstation I've started experimenting with using Proxmox (it's a Debian 12 variant) as the OS, because its installer supports multi-drive ZFS installation (mirrors/RAID10, RAIDZ1/2, etc) out of the box. So the boot setup is currently mirrored SSDs, with /home on mirrored NVMe drives.
Apt installing the standard desktop stuff afterwards (Nvidia drivers, KDE desktop, etc) has worked well, and it all seems happy.
That being said, I'm only 4 or 5 days into this test setup. So far so good though. :)
It's quite possible to get ZFS into weird states without too much effort when you're screwing around with the underlying devices (adding, deleting, changing things).
This seems to crop up at really inconvenient times too, like when you're trying to do something during a scheduled outage. :(
That kind of thing aside though, it's been pretty solid in my use for actual data storage.
Just don't use ZFS's native encryption + ZFS snapshots + send/recv.
Reportedly that combination is a cause of data corruption.
Hmmm, from what I understand zrepl can copy from source to backup in either push or pull mode. I would say push mode is still a very fragile way to do backups.
Imagine the source machine is compromised and the attacker decides to delete/encrypt your data, then sees there is a backup mechanism connecting to the backup machine. What prevents them from deleting/encrypting the backups as well?
You'd definitely want the backup machine to pull the snapshots, and to have no way to connect to it from the source machine with a user that has access to the data, or with an admin account. That means no ssh keys on the source machine, and no password kept in a password manager that gets loaded on the source machine either.
Another strong method would involve 3 machines:
source --push--> replica1 <--pull-- replica2
Source and replica1 would have ZFS filesystems and snapshots, while replica2 uses a different filesystem (LVM + ext4) and its own snapshots, to guard against a replicated bug making the data unavailable. The ZFS snapshots could be saved as individual files on that filesystem.
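As a rough sketch of the pull side (hostnames, dataset names, and the backup user are all hypothetical), replica2 could fetch an incremental stream from replica1 over ssh and keep it as an ordinary file on its ext4 filesystem, so replica1 never initiates a connection to it:

    #!/bin/sh
    # Runs on replica2. Assumes the initial full stream was already saved.
    PREV=tank/data@2024-06-01
    CURR=tank/data@2024-06-02
    ssh backup@replica1 "zfs send -i $PREV $CURR" \
        > /backups/tank-data_2024-06-02.zfsstream

Restoring later means replaying the stream chain into a real pool with 'zfs receive'.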
> ... what prevents them from deleting/encrypting the backups as well?
With ZFS snapshots the older snapshots would still be present on the target server, in their unencrypted form.
> That means no ssh keys on the source machine ...
Typically for non-interactive logins (e.g. script access and similar) you take the extra step of configuring the receiving ssh to only allow a specific command for a given key.
It's a configurable ssh thing, where you add extra info to the .ssh/authorized_keys file on the destination server. With that approach, it doesn't allow general user logins while still allowing the source machine to send the data.
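A minimal sketch of what such an entry can look like (the key, user, and dataset names are invented); the forced command is the only thing this key is allowed to run, regardless of what the client asks for:

    # ~backup/.ssh/authorized_keys on the destination server
    command="zfs receive -u tank/backups/source",no-pty,no-port-forwarding,no-agent-forwarding,no-X11-forwarding ssh-ed25519 AAAA... source-replication-key

The source machine then pipes its stream through 'ssh backup@destination', and sshd substitutes the forced receive command on its end.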
> With ZFS snapshots the older snapshots would still be present on the target server, in their unencrypted form.
If and only if you apply the best practices mentioned in the second point of your post above.
Anyway, I'd rather protect my backups as much as I can and not allow the source machine any direct access to an account on the backup server, because of possible security issues. Security is hard sometimes, and you never know when some bug might widen the attack surface. I like my backup servers to not have any open ports. The caveat is that this is only possible if your primary backup server is locally and physically accessible. If you are travelling, you might want to be able to access it without being physically present. An option might be to keep ssh disabled when you are at home and enable the service when you know you will be away long enough that needing to restore data would otherwise be a problem.
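If the backup host happens to be a systemd-based Linux box (just an assumption about the setup), that toggle is a one-liner each way:

    # before leaving home: allow remote access for restores
    sudo systemctl enable --now ssh     # the unit is 'sshd' on some distros

    # back home: stop the daemon and close the port again
    sudo systemctl disable --now ssh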
That doesn't make any sense. Native snapshots are fantastic, but they are merely an effective way to do backups, one where you shift some complexity from the backup software to the filesystem.
(only talking about backups; snapshots of course have utility beyond that use case as well)
Well... the issue with encrypted ZFS + raw send is that a pool encrypted with one common key for all volumes becomes an individual key per volume on the receiving side, while a non-raw send means your target can read your files. If you use a keyfile this is a non-issue. If you type your key, well, you import all the old volumes, create a new pool, and send them over again, re-encrypting them under a common key. Very crude, but doable for home-scale setups.
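A rough sketch of that re-encryption step (pool and dataset names are invented): a non-raw send into a pool that is encrypted at the top lets the received dataset inherit the new pool's single key:

    # new pool, encrypted once at the root
    zpool create -O encryption=on -O keyformat=passphrase newpool mirror /dev/sdc /dev/sdd

    # non-raw send: data is decrypted on the way out of the old pool and
    # re-encrypted under newpool's key on receive
    zfs send oldpool/data@migrate | zfs receive newpool/data

Sending dataset by dataset (rather than a full -R stream with properties) avoids dragging the old per-volume encryption settings along.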
You don't actually need a dedicated ZFS backup program. A simple cron script will handle incremental backups just fine. If anyone is interested, the script we use to back up our multi-TB PostgreSQL database can be found here: https://lackofimagination.org/2022/04/our-experience-with-po...
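Not that script, but a minimal illustration of the pattern (dataset, host, and snapshot naming are all invented), assuming an initial full send has already been done:

    #!/bin/sh
    # Incremental ZFS backup, intended to be run from cron.
    set -eu
    DATASET=tank/pgdata
    REMOTE=backup@backuphost

    # most recent existing snapshot becomes the incremental base
    LAST=$(zfs list -H -t snapshot -o name -s creation -d 1 "$DATASET" | tail -n 1)
    NEW="$DATASET@$(date +%Y-%m-%d_%H%M)"

    zfs snapshot "$NEW"
    zfs send -i "$LAST" "$NEW" | ssh "$REMOTE" zfs receive -u tank/backups/pgdata

Pruning old snapshots on both ends is the part where a tool like zrepl, or a slightly longer script, earns its keep.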
Funny story -- when I was working on the xkcd Machine comic, I actually used the ZFS snapshots to rescue data. I accidentally blew away some early physics prototype code and fished it out of /.zfs/snapshot.
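For anyone who hasn't tried it: every ZFS filesystem exposes a hidden .zfs/snapshot directory under its mountpoint, so a rescue like that is just a copy (the snapshot and path names below are made up):

    ls /.zfs/snapshot/                                        # one directory per snapshot
    cp /.zfs/snapshot/auto-2024-03-01/home/user/prototype.py ~/restored/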
My facepalm moment of this year was accidentally restoring a ZFS snapshot of my root pool from a week ago, except it was actually from a year and a week ago. Didn't lose any of my data, but suddenly I had applications offering to format their version-mismatched databases.
If that's the case, then doing so with replicated ZFS snapshots is probably not a good idea.
That specific scenario (ZFS encryption -> replication of encrypted snapshots) is a known cause of ZFS corruption. :(
https://www.phoronix.com/news/OpenZFS-Encrypt-Corrupt
Unfortunately it doesn't seem to be widely known about, though there is an open suggestion to document it officially:
https://github.com/openzfs/openzfs-docs/issues/494