Costa: Designing a Userspace Disk I/O Scheduler for Modern Datastores: the Scylla example (Part 1)
Costa: Designing a Userspace Disk I/O Scheduler for Modern Datastores: the Scylla example (Part 1)
Posted Apr 19, 2016 15:17 UTC (Tue) by Wol (subscriber, #4433)In reply to: Costa: Designing a Userspace Disk I/O Scheduler for Modern Datastores: the Scylla example (Part 1) by nye
Parent article: Costa: Designing a Userspace Disk I/O Scheduler for Modern Datastores: the Scylla example (Part 1)
As I understand it (and no way am I an expert in filesystems :-) it comes over that the more the top (user interface) knows about the bottom (the disk interface) the easier it is to make sensible decisions that don't have exponential performance as the load rises ...
> It's not that a layered stack is the problem per se, but more that allowing arbitrary selections of layers chosen from a generous smorgasbord and combined in arbitrary orders dramatically expands the scope of the problem.
:-)
If ZFS is split into two tightly-coupled components, that sounds in line with my understanding. Could you put a ZFS filesystem layer over a linux lvm layer? Or a linux ext4 over the ZFS volume layer? I guess the answer's "no", so ZFS as a whole has control over the whole stack :-) and hence should achieve far better performance.
Cheers,
Wol
Posted Apr 19, 2016 16:41 UTC (Tue)
by zdzichu (subscriber, #17118)
[Link] (1 responses)
Posted Apr 20, 2016 11:07 UTC (Wed)
by nye (subscriber, #51576)
[Link]
I've seriously considered it for storing my email backups where more than 50% of the space usage is waste due to internal fragmentation, where ZFS (the filesystem layer) suffers fairly badly compared to ext4. Ultimately, losing low double digit gigabytes isn't a pressing concern these days so I've not bothered, but I can well imagine there would be circumstances where ext4 or some other filesystem would be a sensible choice.
Costa: Designing a Userspace Disk I/O Scheduler for Modern Datastores: the Scylla example (Part 1)
You CAN put ext4 on ZVOLs. Or swap. Or export it via iSCSI.
Costa: Designing a Userspace Disk I/O Scheduler for Modern Datastores: the Scylla example (Part 1)