I have backups on a backup hard drive and also synced to B2, but I am thinking about backing up to some format to put in the cupboard.
The issue I see is that if I don’t have a catastrophic failure and instead just accidentally delete some files one day while organising and don’t realise, at some point the oldest backup state is removed and the files are gone.
The other thing is if I get hit by a bus and no one can work out how to decrypt a backup or whatever.
So I’m thinking of a plain old unencrypted copy of photos etc that anyone could find and use. Bonus points if I can just do a new CD or whatever each year with additions.
I have about 700GB of photos and videos which is the main content I’m concerned about. Do people use DVDs for this or is there something bigger? I am adding 60GB or more each year, would be nice to do one annual addition or something like that.
I usually use a dehydrator for ~3 days on my drives to make them shelf stable. So far I haven’t had any issues.
I have not used them myself, but M-DISC sounds like what you’re looking for. There are a few other alternatives listed on that Wikipedia article, too.
Thanks, I think the risk here is that there may not be hardware to read it.
From the suggestions here I’m thinking a hard drive with USB connection would be best. It won’t last 50 years but instead I’d replace it every 5 years or so. I’d use an error resistant file system and plug it in each year to add the new files.
This way I also get the chance to move it to newer technology in future instead of a new hard drive. It would then only need to survive for some period of time after I last replaced it, so there’s a good chance of it remaining readable for most of my life.
Thanks, I think the risk here is that there may not be hardware to read it.
M-Disc DVDs are readable by ordinary DVD drives. So you could simply put a USB DVD drive alongside those backup M-Discs on the shelf.
Yeah that’s an idea. It does seem like I’d need a lot of disks though. And I don’t actually have a disk reader or writer at all at the moment.
Just buy a spinning drive and call it a day. Follow 3-2-1 protocol and you’ll never lose anything . Check your longterm “cold storage” HDD’s 1-2 times a year. They should last 5-10 with little fuss. Some will go longer with no issues.
If you follow 3-2-1 you’ll never lose anything and you only have to spend a few hours a year thinking about backups.
I have 3 2 1 but I want the equivalent of a suitcase of photos in the cupboard. No family member is gonna be cleaning out my house as they move me to a rest home and stumble upon my Borg backup in B2 object storage. And if they do they won’t have the key. I want something a bit closer to physical.
I think an extra drive for cold storage is a good idea. My main backups are automated, this one I can add any new files done in the last year once a year, then back in the cupboard. I just need to make sure I’m rotating the drives so I don’t have the same one in storage for 50 years, and instead buy new ones every 5 years or so.
I’m using blu-ray disks for the 3rd copy, but I’m not backing up nearly as much data as you are.
The only problem with optical media is that you should only expect it to be readable for a couple of years, best case, at this point and probably not even that as the tier 1 guys all stop making it and you’re left with the dregs.
You almost certainly want some sort of tape option, assuming you want long retention periods and are only likely to add incremental changes to a large dataset.
Edit: I know there’s longer-life archival optical media, but for what that costs, uh, you want tape if at all possible.
Same.
Bought a Blu-ray burner and “archive grade” disks for third location backups.
I made a list of files that is just a text document (3MB!) that sits on the root of the Blu-ray. There’s probably a better way of doing that, but it works for me.
Hmm I am keen for something that could be left in the cupboard for 50 years and still works when brought out.
What does it take me to do home tape storage? Do the tapes needs to be stored with climate control or are they pretty stable? Is it feasible for the average person to load the contents?
I’m thinking of pulling a suitcase out of the cupboard of all the baby photos, but digital files or photo and video.
Tape isn’t readable by normal people even if they found it tomorrow with a drive already configured to be used.
In 50 years good luck finding a working drive compatible with LTO4 when LTO32 is out (it’s backwards compatible only with previous gen).
Unless you write on the box “here there are the keys for 100k bitcoins” they’ll just trash the tape
Yeah from some other comments I think my initial plan (that I’ll research some more) will be:
- buy a new HDD, format with ZFS or btrfs for error correction
- copy data onto drive
- store in cupboard with sata-> USB cable and instructions about what it is, how to access .
- every year, load the previous year’s data onto the drive
- about every 5 years, replace the drive by copying onto a brand new one (timeframe will likely depend on when my other HDD drives die)
This way I should get a chance to update storage medium as technology changes as well.
So, 50 years isn’t a reasonable goal unless you have a pretty big budget for this. Essentially no media is likely to survive that long and be readable unless they’re stored in a vault, under perfect climate controlled conditions. And even if the media is fine, finding an ancient drive to read a format that no longer exists is not a guaranteed proposition.
You frankly should be expecting to have to replace everything every couple of years, and maybe more often if your routine tests of the media show it’s started rotting.
Long term archival storage really isn’t just a dump it to some media and lock it up and never look at ever again.
Alternately, you could just make someone else pay for all of this, and shove all of this to something like Glacier and make the media Amazon’s problem. (Assuming Amazon is around that long and that nothing catches fire.)
Hmm damn. I don’t really think cloud is the right answer for what I’m trying to do.
I disagree that formats like JPEG won’t be readable in 50 years. I feel like there would be big demand for being able to read the format even if it’s been superceded, on account of all the JPEGs that still living people have.
Maybe I get a big drive. Each year I copy over files from the last year. Every X years I swap the hard drive for a new one, copy all data.
How can I tell if individual files get corrupted? Like the hard drive failed in that section, then I copy the corrupted file to the new drive, and I’d never know. Can I test in bulk? 50k+ photos and videos so far.
The format is the tape in the drive, or the disk or whatever.
Tape existed 50 years ago: nothing modern and in production can read those tapes.
The problem is, given a big enough time window, the literal drives to read it will simply no longer exist, and you won’t be able to access even non-rotted media because of that.
As for data integrity, there’s a lot of options: you can make a md5 sum of each file, and then do it again and see if anything is different.
The only caveat here is you have to make sure whatever you’re using to make the checksums gets stored somewhere that’s not JUST on the drive because if the drive DOES corrupt itself, and your only record of the “good” hashes is on the drive, well, you can’t necessarily trust those hashes either.
Ah good thinking. I am thinking a spare drive that I update once a year with new content and replace every few years with a new drive is a good idea.
I believe M-Disc to be the best consumer grade, optical solution out there. If you want to go commercial grade you are looking for LTO tapes, but your costs begin rising exponentially. If M-Disc claims are to be believed, they should last well longer than your requirement and be able to handle your data footprint using multiple, but not an unreasonable amount of discs.
No matter which solution you choose, if you are targeting multiple decades, you must save not only the media, but ideally the drive, computer and software used to archive. There is no guarantee that any of the existing technology will be relevant or backward compatible across several decades.
Thanks! I think this is probably a big risk of not being able to find the hardware to play it.
Through other conversations I think the answer is to instead get a normal drive, USB connection, and every few years replace the drive and copy the data to the new drive, using an error resistant file system and something like rsync that validates that the files arrived correctly.
As technology changes, I’d move the files as needed onto the more modern media.
USB hard drive? If we’re talking about a cold backup that’s easy to access a USB drive is reliable and easy.
Yeah based on the suggestions so far this seems like the best option. Just need to make sure I have a way to verify no files are corrupted (and if they are, which ones), and remember to swap it out for a new one every 5 years or so or each time I need a new drive.
Tape. Amazon glacier if you’re okay with that.
And regular test restores. An untested backup is not a backup.
But when considering what I need to back up, I usually overestimate how much I or other people will care if it’s lost. Family photos are great, but what are the odds of someone saying “damn I wish we still had two dozen photos of that one barbecue?”
I use tape but haven’t been happy with my drive for a while, where do you get your drives? (Also OP I wouldn’t recommend tape until you cross the 10TB mark personally)
Yeah after looking at the price of a drive, I agree it doesn’t seem necessary at the level of data I have.
I don’t personally use tape, but I get most of my stuff from eBay. Tape drives are surprising expensive, even LTO-6 is going to run you a few hundred. But you still can’t beat the density and longevity.
I have cloud with B2, I’m looking for cupboard storage that a random family member can pull out and browse through after I get put in a resthome (only half joking).
Is home tape storage feasible (and good for this use-case)?
In terms of what to backup, I’m running on the assumption that technology will be able to autofilter the good stuff at some point, no need to put much effort in now haha.
For your amount, just an external hard drive attached to a NAS or something is fine, or a 2-bay synology would be more than enough. Drives are coming in 20-24TB models now, that’d keep you going for a long time.
That’s effectively what I have now. However I seem to kill a drive every couple of years, so I am keen for something that can be stored for many years (preferably decades).
The other thing is if I get hit by a bus and no one can work out how to decrypt a backup or whatever.
Documentation, documentation, documentation. No matter what system you have, make sure your loved ones have a detailed, image-heavy, easy to follow guide on how restorations work - at the file level, at the VM level, at whatever level you are using.
That being said, DVDs actually have quite a short shelf life, all things considered. I’d be more inclined to use a pair of archival strength USB NVME drive, updated and tested routinely(quarterly, yearly, whatever makes sense). Or even an LTO tape, if you want to purchase the drive and some tapes.
You can put your backups in something like VeraCrypt. Set an insanely long password, encoded in a QR code, printed on paper. Store it in the same secured location you store your USB drives (or elsewhere, if you have a security posture).
You may also consider, if money is not a concern, a cloud VPS or other online file storage, similarly encrypted. This can provide an easy URL to access for the less tech-savvy, along with secured credentials for recovery efforts. Depending on what your successors might need to access, this could be a very straightforward way to log into a website and download what they need in an emergency.
archival strength USB NVME drive,
Does such a thing exist? Ordinary flash storage is pretty bad at keeping its content when powered off for a long time, due to how flash memory works. I’d be curious about such drives.
As much as I’m worried about family not being able to do it, I’m just as worried that I will do something dumb and lose the encryption key, losing everything. I am keen on the digital equivalent of a suitcase full of photos that could be stumbled upon.
I also already have borg backup set up to a backup drive and synced to the cloud (Backblaze B2).
For tape drives, is many thousands of dollars a normal price? Not sure I’m that keen.
If you buy your LTO drive new, then yes they rip you a new one, for sure! Buy it used…but it still will cost you a few hundred. Like I said, if money is not a concern. If losing the encryption key is a concern, then USB is still your best bet. Make two, keep them simple and unencrypted, stick em in two different safes, update them regularly. And print the documentation with pictures!
I have had terrible experience with USBs failing, including losing a bunch of photos beyond recovery (some 15 years ago, but it still hurts). Plus it’s quite aot of data.
I’m thinking a hard drive + USB SATA cable might be my best option. Add the new content each year. Work out some way of verifying it’s not corrupted. Replace drive every 5 years or something, it can be swapped in when I get a failure and the new one can be the new backup.
This is why I do my first-level of backups with
rsnapshot
. It backs up to the plain filesystem using rsync and uses hard-links to de-dup between backups. No special filesystem, no encryption, restore is just an ‘rsync’ away.Yes my issue is that I seem to be replacing a drive somewhere every couple of years. I am keen for something that can be stored in a cupboard for years, preferably a decent chance at lasting decades.
Ah - I gotcha. That’s some terrible luck with drives.
I mean, there are a lot of drives. Two laptops with a drive each. A desktop/server with three drives, and a spare laptop used for Kodi is the current setup. I’m not counting but I think it’s three drives, one laptop, and one mobo since I started self-hosting perhaps 5 or 6 years ago.
The drives themselves, one was still under warranty, one was probably 3 or 4 years old, and the last was probably 6 or 8 and was in an old laptop and well used.
I think some of the drives have had a hard live while I messed around self-hosting, especially during my phase of trying out photo solutions.
ZFS with automatic snapshots and scrubbing. This will keep as many and as old snapshots as your like. It’ll ensure the files don’t rot. It’ll ensure the media doesn’t die, so long as you have enough redundancy and you replace disks as they die. This is what I’d trust for long term storage because I think I understand how and why it works. It should last as long as I feed it disks. If I delete something, I should be able to restore it from a snapshot. The hardware doesn’t need to be anything fancy. Just a Pi 4/5 with a couple of WD Elements would be fine. Could add more disks for more redudnancy. I’m running 2-disk residency.
You don’t have to touch the software if it’s not exposed to the Internet. Whatever works today on it will work 20 years from now, so long as the hardware works. A couple of spare Pis, SD cards and power supplies should let it last for decades.
I’d rather cold storage but am thinking of looking along these lines, ZFS or btrfs on a standard HDD that I add files on to once a year and replace the disk every few years.
I have a standard backup setup I just want something that is more point in time and not connected to all the automation, in case I automatically delete everything.
Yup, turn it on, let it do a scrub, then turn it off. I’d still use redudnancy though. Not merely to cover the case of the drive failing, but also to cover the bit rot use case. It’s exceedingly unlikely bits to rot at the exact same spot on two or more disks. When ZFS finds a checksum mismatch during a scrub (which indicates bit rot), it’ll be able to trivially recover the data from the drive where the checksum matches. It’ll then rewrite the rotten part.
Would that be two disks under a type of RAID or does ZFS have something?
With 2 disks that would be type mirror in ZFS-speak, completely built-in. Equivalent to RAID1 in terms of hardware fault tolerance.
You could do a 3-disk mirror or n-disk mirror really. The RAID5/6 rough equivalents are called RAIDzN where N is the number of disk failures they tolerate. E g. RAIDz1, RAIDz2, etc. You probably want a mirror unless you need more space than a single disk provides.
Ah thanks, that gives me something to research.
In your scenario, I’d be looking at ZFS or BTRFS for your live data, especially when taking photos into account. They’ll self-repair files that may run into decay issues, which I’ve seen a lot of with photos in all formats. Since you already keep off-site backups, I’d then just keep an extra drive around that you snapshot to from time to time.
So my offsites are an incremental backup, but at some point the oldest version is gone. I am keen for a completely separate, long term snapshot of what I had that could be thrown in a cupboard, and any random family member clearing my house out as I get moved into a rest home at 108 can go through the photos and find a good one to put on my headstone.
I am also keen for protection against doing something dumb and losing everything (like losing my hard drive and finding out for some reason I can’t access my backups because I lost the encryption key because I put it in bitwarden and they shut down years ago and I never moved the key over because I forgot it was stored there).
ZFS and BTRFS both provide that functionality. Have a look into the features.
So the drive doesn’t need to be hot, I can just plug in once a year and it auto-repairs?
No, the “live” filesystems will repair themselves when they detect problems. They keep revisions of your data, and run checksums constantly. When they find a file has inadvertently changed without access, it will restore said files. Think of it like Mac “Time Machine”, but it’s just the filesystem . You can restore stuff from points in time when needed.
Just read up on it.
Yeah I will read up on it, thanks for the tip!