LWN: Comments on "The casync filesystem image distribution tool" https://lwn.net/Articles/726005/ This is a special feed containing comments posted to the individual LWN article titled "The casync filesystem image distribution tool". en-us Thu, 02 Oct 2025 02:58:42 +0000 Thu, 02 Oct 2025 02:58:42 +0000 https://www.rssboard.org/rss-specification lwn@lwn.net The casync filesystem image distribution tool https://lwn.net/Articles/726426/ https://lwn.net/Articles/726426/ abo <div class="FormattedComment"> As far as I can tell, SHA-1 is not used.<br> </div> Mon, 26 Jun 2017 10:02:47 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726336/ https://lwn.net/Articles/726336/ nix <div class="FormattedComment"> Are relatively unlikely to collide, then. :) (and git already checks for that collision mechanism. One wonders how a future SHA-256 migration would work with existing bup repos, though... something to look at.)<br> </div> Fri, 23 Jun 2017 13:51:27 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726335/ https://lwn.net/Articles/726335/ nix <div class="FormattedComment"> I don't think it wrong that backups are encrypted -- mine are -- I think that the *backup program* is usually the wrong place for encryption, because frankly encryption is a horribly difficult problem best worked on by people who specialize in it, not people who are doing it on the side of writing a backup program.<br> <p> So I'd go with an unencrypted backup atop an encrypted filesystem or encrypted block device layer. This has the advantage that you can use whatever ridiculously contrived means you like to acquire the passphrase, which is relatively rarely possible with backup-level encryption (e.g. mine currently comes from a shell script that does a challenge-response on a yubikey -- and getting that requires an ssh to the machine with the key plugged in...)<br> </div> Fri, 23 Jun 2017 13:49:24 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726304/ https://lwn.net/Articles/726304/ mgedmin <div class="FormattedComment"> <font class="QuotedText">&gt; represented as SHA-1 hashes so can't collide</font><br> <p> *cough* <a href="https://shattered.io">https://shattered.io</a><br> </div> Fri, 23 Jun 2017 11:37:30 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726301/ https://lwn.net/Articles/726301/ Sesse <div class="FormattedComment"> Why is backup the wrong place for encryption? I'm very happy that my backup host can't compromise my servers—all it can do is get encrypted backups out of it. Less trust is good.<br> <p> (Likewise, my servers don't have SSH access to my backup host.)<br> </div> Fri, 23 Jun 2017 10:20:59 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726131/ https://lwn.net/Articles/726131/ drag <div class="FormattedComment"> <font class="QuotedText">&gt; I'm not entirely sure what the point of turning it into a backup tool is</font><br> <p> Simplify the user experience with these sorts of tools, I expect. <br> <p> If the operations involving the creating, pushing, distributing, tracking, sharing, pulling, backing up, and restoration of file systems images have a lot of overlap then it makes sense to have the same basic tools and protocols for all those things. That way it's easier to build high-level solutions on top this tool. <br> <p> So right now if I want to have, in my environment, the ability to use containers, virtual machines, sync my workstation home directory and backup my workstation and state-full applications/databases then these all require lots of different tools and services to configure. I have to, maybe, setup cinder for virtual machine images, docker registry for containers images, rsync for backing up, etc etc. <br> <p> Wouldn't it be nice to have a single thing that could provide all that? <br> <p> Provided, of course, the actual operations for these different needs have a lot of overlap and similarities. Don't want to end up with some massive mess of conflicting and mismatched functionality in a single service...<br> </div> Wed, 21 Jun 2017 17:26:26 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726090/ https://lwn.net/Articles/726090/ rkeene <div class="FormattedComment"> For a slightly different, but similar, take on a similar problem take a look at AppFS ( <a href="http://appfs.rkeene.org/">http://appfs.rkeene.org/</a> )<br> </div> Wed, 21 Jun 2017 14:33:36 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726073/ https://lwn.net/Articles/726073/ nix <div class="FormattedComment"> So does bup if you ignore encryption (which you should IMHO: the backup is the wrong place for it, we have perfectly good encryption at lower levels, be that cryptfs or LUKS). (It's gaining remote restore, too, as we speak, though this is fairly redundant because you can use *any* other method to ship the backup contents over for a restore run anyway: I just use NFS. So this is only a convenience, really.)<br> </div> Wed, 21 Jun 2017 11:29:24 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726063/ https://lwn.net/Articles/726063/ Sesse <div class="FormattedComment"> duplicity is painfully slow, though.<br> </div> Wed, 21 Jun 2017 07:42:35 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726053/ https://lwn.net/Articles/726053/ walex <div class="FormattedComment"> Not really sure what's the point of this, considering the existence of Jigdo or of the content-based Arvados Keep distributed filesystem, which match two different application areas quite well.<br> </div> Wed, 21 Jun 2017 06:43:53 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726027/ https://lwn.net/Articles/726027/ aggelos <p>Seems the main contribution is the definition of the chunk store. Then you'd just need a tracker for the set of mirrors that could have any related data set (oh <a href="https://ipfs.io">wait</a> :-)).</p> <p>What I didn't find in the blog post is a strategy for pruning chunks - specifically one that would work well for CDNs and mirror operators. Reasonable strategies could be devised without any extra effort of course (e.g. ctime/atime based), but perhaps the format could accommodate more elaborate algorithms?</p> Tue, 20 Jun 2017 22:10:02 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726013/ https://lwn.net/Articles/726013/ jhoblitt <div class="FormattedComment"> Doesn't duplicity already nicely handle chunked backups, with encryption, and support for remote push?<br> </div> Tue, 20 Jun 2017 20:23:03 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726012/ https://lwn.net/Articles/726012/ compenguy <div class="FormattedComment"> <font class="QuotedText">&gt; I'm not entirely sure what the point of turning it into a backup tool is [...]</font><br> <p> Perhaps the point isn't about backup, but rather about restore. It seems like it might enable fast, network-efficient snapshot-type restore operations.<br> <p> Not really sure, though... I'm still trying to wrap my head around it.<br> </div> Tue, 20 Jun 2017 20:12:07 +0000 The casync filesystem image distribution tool https://lwn.net/Articles/726008/ https://lwn.net/Articles/726008/ nix <div class="FormattedComment"> Very nice. It appears to be a network-synchronization version of bup, almost (at least with regard to the method used to deduplicate, though its use of a better version of tar feels nicer than the bup approach of representing metadata by stuffing it into a "file" in each directory's representation on the backup, though this hack does work well enough given that all real on-disk file chunks are represented as SHA-1 hashes so can't collide, and has the major advantage that bup can just use git for its underlying storage, which is redundant for casync since it wants its "blobs" to be independent operating-system files).<br> <p> I'm not entirely sure what the point of turning it into a backup tool is, since as a backup tool it feels like everything it does, bup would do better... but as a distribution tool, particularly over CDNs, it seems likely to be without equal. Ah well, the more options the merrier, for backup!<br> <p> </div> Tue, 20 Jun 2017 19:36:37 +0000