LWN: Comments on "DAX on BTT" https://lwn.net/Articles/686150/ This is a special feed containing comments posted to the individual LWN article titled "DAX on BTT". en-us Sun, 19 Oct 2025 16:48:11 +0000 Sun, 19 Oct 2025 16:48:11 +0000 https://www.rssboard.org/rss-specification lwn@lwn.net DAX on BTT https://lwn.net/Articles/686653/ https://lwn.net/Articles/686653/ robbe <div class="FormattedComment"> Is there any basic device that guarantees no errors, ever? (Sure, you can build stacks of redundancy that make them less and less probable.)<br> <p> FWIW, ECC does not guarantee detection of errors. I don’t know what the distance of the code used for your disk is (are these values universal?), so I can’t tell what the probability of an undetected error is.<br> </div> Sun, 08 May 2016 13:39:02 +0000 DAX on BTT https://lwn.net/Articles/686544/ https://lwn.net/Articles/686544/ phro <div class="FormattedComment"> <font class="QuotedText">&gt;&gt; Traditional storage, however, never guaranteed sector atomicity</font><br> <p> <font class="QuotedText">&gt; citation needed.</font><br> <p> I suppose "never" is a strong word. What I meant to say was that the SCSI and ATA standards did not say anything about power-fail write atomicity of a single sector. Because they did not standardize it, you cannot rely on it.<br> </div> Fri, 06 May 2016 14:50:08 +0000 DAX on BTT https://lwn.net/Articles/686543/ https://lwn.net/Articles/686543/ phro <div class="FormattedComment"> I think breaking basic assumptions of applications is bad, yes. So, pursuing DAX+BTT is interesting in that respsect. I forgot about the limitation you described, however. That does put a rather serious monkeywrench in the works, but I think that it could be worked around. The question is whether anyone is willing to put the work in, and at this stage it's not clear whether it would be worth the effort.<br> </div> Fri, 06 May 2016 14:48:21 +0000 DAX on BTT https://lwn.net/Articles/686507/ https://lwn.net/Articles/686507/ andresfreund <div class="FormattedComment"> I'd argue that a read error is an atomicity problem.<br> </div> Fri, 06 May 2016 00:15:24 +0000 DAX on BTT https://lwn.net/Articles/686504/ https://lwn.net/Articles/686504/ neilbrown <div class="FormattedComment"> <font class="QuotedText">&gt; Traditional storage, however, never guaranteed sector atomicity</font><br> <p> citation needed.<br> <p> My model of traditional storage includes a ECC for each block. So the options for a read after an aborted write are:<br> - old data<br> - new data<br> - read error (ECC reports an uncorrectable error)<br> <p> How can you get a torn sector?<br> <p> </div> Fri, 06 May 2016 00:00:40 +0000 DAX on BTT https://lwn.net/Articles/686495/ https://lwn.net/Articles/686495/ stellarhopper <div class="FormattedComment"> Are you suggesting that we should indeed pursue DAX+BTT? One of the cons we also realized was that even if we do support this hybrid model, it will preclude DAX mappings of larger than a page (i.e. 2MB and 1GB) mappings, and the lost performance there is probably not worth the minor gains in convenience from the hybrid mode.<br> </div> Thu, 05 May 2016 21:11:30 +0000 DAX on BTT https://lwn.net/Articles/686491/ https://lwn.net/Articles/686491/ phro <div class="FormattedComment"> Actually, what I said was that sector tearing doesn't usually happen on SSDs due to the nature of the FTL. Traditional storage, however, never guaranteed sector atomicity, but it usually does provide it. When you switch over to a block driver on top of pmem, it's possible that there will be increased risk of tripping over torn sectors (since it's just an interrupted memcpy). I don't have any numbers to back that up, presently. If you're wondering whether applications do rely on atomic sector updates, wonder no more! There is research that shows that some applications do, in fact, expect it [1].<br> <p> The real issue is that DAX and the BTT are incompatible. If you want to use DAX, you have to give up sector atomicity. If applications truly depend on that, then they can't run unmodified on a pmem device mounted with -o dax. That means that you would have to separate out your pmem mount points into those that will support legacy applications and those that will only support DAX. By combining the two, you get the best of both worlds.<br> <p> I got the overwhelming impression that the room was not convinced that applications should rely on atomic sector updates. Such applications are broken and should be fixed. Thus, there is little impetus to support the mixed DAX+BTT mode that was proposed.<br> <p> [1] <a href="http://research.cs.wisc.edu/wind/Publications/alice-osdi14.pdf">http://research.cs.wisc.edu/wind/Publications/alice-osdi1...</a><br> </div> Thu, 05 May 2016 20:38:01 +0000