Integration into file formats.
Posted Jan 14, 2026 22:21 UTC (Wed) by himi (subscriber, #340)
Parent article: Format-specific compression with OpenZL
This seems like it would be entirely compatible with the OpenZL approach, though I think you'd need additional tooling to support this kind of use case. You'd also want to make sure there was plenty of guidance on how to design file formats to suit this model, particularly the trade-offs between different data layouts. That should probably also cover archival versus live data formats (with the archival format designed for maximum compression efficiency and the live format optimised for whatever IO patterns your active use case requires), streaming versus random access, and probably a bunch of other considerations I haven't thought of. In fact, the world in general could benefit quite a bit from a readily available knowledge base about designing good file formats, particularly if it was supported by high-quality tooling and libraries.
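To make the data-layout trade-off concrete, here is a minimal sketch comparing a row-interleaved layout with a columnar one under a general-purpose compressor. It uses Python's standard zlib rather than OpenZL, and the record shape (a noisy 32-bit reading plus a constant status word) and all function names are made up for illustration, so treat it as a toy rather than a benchmark.

```python
import random
import struct
import zlib

# Hypothetical record: (reading, status), both little-endian uint32.
RECORD = struct.Struct("<II")
STATUS_OK = 0xCAFEF00D  # arbitrary constant chosen for this sketch


def make_records(n, seed=0):
    """Build n synthetic records: a random 32-bit reading and a constant status."""
    rng = random.Random(seed)
    return [(rng.getrandbits(32), STATUS_OK) for _ in range(n)]


def row_layout(records):
    """Interleave the fields record by record (array of structs)."""
    return b"".join(RECORD.pack(reading, status) for reading, status in records)


def columnar_layout(records):
    """Store each field contiguously (struct of arrays)."""
    n = len(records)
    readings = struct.pack(f"<{n}I", *(r for r, _ in records))
    statuses = struct.pack(f"<{n}I", *(s for _, s in records))
    return readings + statuses


def decode_columnar(blob):
    """Invert columnar_layout, returning the list of (reading, status) tuples."""
    n = len(blob) // RECORD.size
    values = struct.unpack(f"<{2 * n}I", blob)
    return list(zip(values[:n], values[n:]))


def compressed_sizes(records):
    """Compress both layouts with zlib and report the resulting sizes in bytes."""
    return {
        "row": len(zlib.compress(row_layout(records), 9)),
        "columnar": len(zlib.compress(columnar_layout(records), 9)),
    }
```

Calling `compressed_sizes(make_records(10_000))` returns the two sizes for the same 80,000 bytes of input. On this synthetic data the columnar layout comes out smaller, since the constant column collapses into long runs instead of being interleaved with incompressible bytes. Real data can easily behave differently, and a columnar layout makes it more expensive to read a single record, which is exactly the archival versus live trade-off.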
Of course you'd still need to support the generalised use cases, and the current OpenZL model of special-casing with a fallback to the general case also makes lots of sense (there's a lot of uncompressed data already out there, after all). But building good support for compression into the file formats themselves seems like a reasonable next step, and supporting the development of better file formats in general would be a pretty good end goal.
