The casync filesystem image distribution tool
casync takes inspiration from the popular rsync file synchronization tool as well as the probably even more popular git revision control system. It combines the idea of the rsync algorithm with the idea of git-style content-addressable file systems, and creates a new system for efficiently storing and delivering file system images, optimized for high-frequency update cycles over the Internet. Its current focus is on delivering IoT, container, VM, application, portable service or OS images, but I hope to extend it later in a generic fashion to become useful for backups and home directory synchronization as well".
Posted Jun 20, 2017 19:36 UTC (Tue)
by nix (subscriber, #2304)
[Link] (11 responses)
I'm not entirely sure what the point of turning it into a backup tool is, since as a backup tool it feels like everything it does, bup would do better... but as a distribution tool, particularly over CDNs, it seems likely to be without equal. Ah well, the more options the merrier, for backup!
Posted Jun 20, 2017 20:12 UTC (Tue)
by compenguy (guest, #25359)
[Link] (5 responses)
Perhaps the point isn't about backup, but rather about restore. It seems like it might enable fast, network-efficient snapshot-type restore operations.
Not really sure, though... I'm still trying to wrap my head around it.
Posted Jun 20, 2017 20:23 UTC (Tue)
by jhoblitt (subscriber, #77733)
[Link] (4 responses)
Posted Jun 21, 2017 7:42 UTC (Wed)
by Sesse (subscriber, #53779)
[Link]
Posted Jun 21, 2017 11:29 UTC (Wed)
by nix (subscriber, #2304)
[Link] (2 responses)
Posted Jun 23, 2017 10:20 UTC (Fri)
by Sesse (subscriber, #53779)
[Link] (1 responses)
(Likewise, my servers don't have SSH access to my backup host.)
Posted Jun 23, 2017 13:49 UTC (Fri)
by nix (subscriber, #2304)
[Link]
So I'd go with an unencrypted backup atop an encrypted filesystem or encrypted block device layer. This has the advantage that you can use whatever ridiculously contrived means you like to acquire the passphrase, which is relatively rarely possible with backup-level encryption (e.g. mine currently comes from a shell script that does a challenge-response on a yubikey -- and getting that requires an ssh to the machine with the key plugged in...)
Posted Jun 21, 2017 6:43 UTC (Wed)
by walex (guest, #69836)
[Link]
Posted Jun 21, 2017 17:26 UTC (Wed)
by drag (guest, #31333)
[Link]
Simplify the user experience with these sorts of tools, I expect.
If the operations involving the creating, pushing, distributing, tracking, sharing, pulling, backing up, and restoration of file systems images have a lot of overlap then it makes sense to have the same basic tools and protocols for all those things. That way it's easier to build high-level solutions on top this tool.
So right now if I want to have, in my environment, the ability to use containers, virtual machines, sync my workstation home directory and backup my workstation and state-full applications/databases then these all require lots of different tools and services to configure. I have to, maybe, setup cinder for virtual machine images, docker registry for containers images, rsync for backing up, etc etc.
Wouldn't it be nice to have a single thing that could provide all that?
Provided, of course, the actual operations for these different needs have a lot of overlap and similarities. Don't want to end up with some massive mess of conflicting and mismatched functionality in a single service...
Posted Jun 23, 2017 11:37 UTC (Fri)
by mgedmin (subscriber, #34497)
[Link] (1 responses)
*cough* https://shattered.io
Posted Jun 23, 2017 13:51 UTC (Fri)
by nix (subscriber, #2304)
[Link]
Posted Jun 26, 2017 10:02 UTC (Mon)
by abo (subscriber, #77288)
[Link]
Posted Jun 20, 2017 22:10 UTC (Tue)
by aggelos (subscriber, #41752)
[Link]
Seems the main contribution is the definition of the chunk store. Then you'd just need a tracker for the set of mirrors that could have any related data set (oh wait :-)). What I didn't find in the blog post is a strategy for pruning chunks - specifically one that would work well for CDNs and mirror operators. Reasonable strategies could be devised without any extra effort of course (e.g. ctime/atime based), but perhaps the format could accommodate more elaborate algorithms?
Posted Jun 21, 2017 14:33 UTC (Wed)
by rkeene (guest, #88031)
[Link]
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool
The casync filesystem image distribution tool