LWN: Comments on "That massive filesystem thread" https://lwn.net/Articles/326471/ This is a special feed containing comments posted to the individual LWN article titled "That massive filesystem thread". en-us Sun, 14 Sep 2025 16:32:25 +0000 Sun, 14 Sep 2025 16:32:25 +0000 https://www.rssboard.org/rss-specification lwn@lwn.net per-inode noatime https://lwn.net/Articles/336848/ https://lwn.net/Articles/336848/ pjm <div class="FormattedComment"> One can more or less achieve that by applying chattr +A to the whole filesystem and then chattr -A on specific files. Note that ‘chattr +A’ on a directory will be inherited by any files created in that directory (excluding moving an existing file into that directory).<br> </div> Wed, 10 Jun 2009 10:06:04 +0000 fsync() and disk flushes https://lwn.net/Articles/330294/ https://lwn.net/Articles/330294/ nix <div class="FormattedComment"> You don't need a UPS. A battery-backed disk controller is just as good <br> (and perhaps better because the battery failing doesn't take your machine <br> down if the power is otherwise OK, while the UPS failing *does*).<br> </div> Mon, 27 Apr 2009 10:43:21 +0000 fsync() and disk flushes https://lwn.net/Articles/330275/ https://lwn.net/Articles/330275/ bersl2 <blockquote><p>It's hard to believe there are disk drives out there (not counting an occasional broken one) that write trash over random areas as they power down. Disk drives I have seen have a special circuit to disconnect and park the head the moment voltage begins to drop. It has to park the head because you can't let the head land on good recording surface, and it has to cut off the write current because otherwise it's dragging a writing head all the way across the disk, pretty much guaranteeing the disk will never come back. I believe it's a simple circuit that doesn't involve any controller intelligence.</p> <p>There is a related failure mode where the drive's <i>client</i> loses power and in its death throes ends up instructing the drive to trash itself while the drive still has enough power to operate normally. I've heard that's not unusual, and it's the best argument I know for a UPS that powers a system long enough for it to shut down cleanly.</p></blockquote> <p>One of these happened to me. $DEITY as my witness, I will never run an important system without an UPS again.</p> <p>Bonus: The drive was a Maxtor. Serves me right.<br> Double bonus: That still wasn't traumatic enough to compel me to make backups.</p> Mon, 27 Apr 2009 06:24:18 +0000 Put hardware in between RAM and Disk features to work in between these https://lwn.net/Articles/328783/ https://lwn.net/Articles/328783/ hozelda <div class="FormattedComment"> Is there a reason why solid state drives aren't use on PCs as a cache level ahead of the main disk drives? These can even be made replaceable (it's just a cache anyway). Wouldn't these improve the performance/correctness trade-offs enough for many use cases?<br> <p> </div> Fri, 17 Apr 2009 00:05:15 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327757/ https://lwn.net/Articles/327757/ jschrod <div class="FormattedComment"> Well, I just decided to give you feedback, from someone who is subscribed to LWN quite a bit longer than you and who did not participate in this topic after you took over all its discussion threads: You showed that LWN really needs a KILL file feature where one can put a poster in it; you, in particular. Others have succinctly explained why, no need to repeat this.<br> <p> But your self-rightousness doesn't allow to understand this, obviously. Luckily, there are still some discussion threads where you don't try to take over. I hope the likes of you will remain few on LWN in the future, this is not Slashdot, after all.<br> </div> Wed, 08 Apr 2009 00:05:25 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327417/ https://lwn.net/Articles/327417/ jospoortvliet <div class="FormattedComment"> Maybe. Probably for this one response. However, as I pointed out, there are wider repercussions to be expected from such behavior, and it is worth to show, as a community, that we disapprove of such ways of communicating. Even if that is done in a rather unfriendly manner.<br> <p> On re-reading the thread, I think you are right in that ajross was more impolite than bojan, which often leads to a downward spiral and isn't helpful... bojan's post wasn't that far off from the normal tone on this site.<br> <p> Anyway. This is went pretty far off-topic, and I think we mostly agree. For as far as we don't, we at least agree on that ;-)<br> </div> Sun, 05 Apr 2009 17:11:39 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327412/ https://lwn.net/Articles/327412/ GreyWizard <div class="FormattedComment"> It's better to change nothing than to make the situation worse.<br> </div> Sun, 05 Apr 2009 16:27:22 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327411/ https://lwn.net/Articles/327411/ jospoortvliet <div class="FormattedComment"> True, if you meant to point out that 'be polite YOU JERK' isn't very effective, I agree. I do however think that it's better than nothing. Nothing changes nothing.<br> </div> Sun, 05 Apr 2009 16:20:45 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327408/ https://lwn.net/Articles/327408/ GreyWizard <div class="FormattedComment"> Both of your arguments could reasonably be applied to a comment that says "please be polite" but both fail for "be polite you jerk." Someone who is accidentally rude is much more likely to respond to the "you jerk" part than the "be polite" part. The contradiction immediately destroys the credibility of the person posting because he or she is not willing to be held to the standard set for others.<br> <p> A truly polite request for more courtesy might help but it's difficult to be sure because such things are quite rare. Giving in to the temptation to scold even just a little makes the comment worse than useless. Unless you are absolutely certain you can do it right it's better to focus on substantive issues and avoid appointing yourself a courtesy cop.<br> </div> Sun, 05 Apr 2009 15:42:52 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327401/ https://lwn.net/Articles/327401/ jospoortvliet <div class="FormattedComment"> I disagree on your argument that saying 'be polite you jerk' merely drags things down, for two reasons.<br> <p> First of all, some people don't notice their behavior is unnecessarily impolite. Pointing it out can help them (if they are willing to be reasonable in the first place). Never pointing out somebodies failures will make them fail forever.<br> <p> Second, it shows you care about being polite. If others show they care too, a culture of 'you should be polite' can be maintained. As you might have noticed from the differences between FOSS communities, culture is important and heavily influential. And it can be changed.<br> <p> Some things to note:<br> - people DO care about what others think of them. No matter how much they scream 'no I don't', they do. It is our nature.<br> - people should know their arguments are not supported by being mean - it is the other way around.<br> - I agree that a 'be polite you yerk' might not always be the best way to correct someone. A personal mail can do more. However, it won't show up in public (unless an apology is made), thus it does not much to influence others who might think it is acceptable behavior because the guy got away with it. Of course, giving a good example is better than anything else.<br> - Of course discussing without end whether somebody was polite enough or not muddies the discussion and lowers the SNR.<br> </div> Sun, 05 Apr 2009 12:43:36 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327366/ https://lwn.net/Articles/327366/ GreyWizard <div class="FormattedComment"> Pardon me for saying so but you have not understood what I wrote. I did not urge anyone to "say whatever [their] brain produces" or anything equivalent. Nor did I issue a rallying cry against decency. Elevating the level of discourse would be a wonderful thing and if you've got an effective suggestion for how to do so I would be glad to read it.<br> <p> But saying "be polite you jerk" merely drags things even further down into the muck.<br> </div> Sun, 05 Apr 2009 03:34:43 +0000 The end of LWN comment dialog? https://lwn.net/Articles/327280/ https://lwn.net/Articles/327280/ jospoortvliet <div class="FormattedComment"> Hmmm. Just say whatever your brain produces, and if somebody has a problem<br> with what comes out, it's on their own plate.<br> <p> Living in a country where that mode of thinking is the norm, I can tell<br> you it also has disadvantages... If only because the resulted hurt<br> feelings can muddy the discussion more than you might think. Besides, it<br> chases people away who would otherwise have contributed constructively -<br> it's not acceptable behavior in all cultures. Ever wondered why the FOSS<br> community is still predominantly western, despite many smart developers in<br> countries like India?<br> <p> A little decency now and then doesn't hurt. I know people who, knowing how<br> blunt they can be, ask someone else to read certain emails before sending<br> them. After all, reality is that people DO have feelings.<br> </div> Sat, 04 Apr 2009 09:05:52 +0000 That massive filesystem thread https://lwn.net/Articles/327272/ https://lwn.net/Articles/327272/ dirtyepic <div class="FormattedComment"> nevermind, you said cache didn't you.<br> </div> Sat, 04 Apr 2009 07:44:04 +0000 That massive filesystem thread https://lwn.net/Articles/327271/ https://lwn.net/Articles/327271/ dirtyepic <div class="FormattedComment"> wouldn't mutt keep telling you there's new messages an hour after you've read them then?<br> </div> Sat, 04 Apr 2009 07:42:21 +0000 fsync() and disk flushes https://lwn.net/Articles/327235/ https://lwn.net/Articles/327235/ giraffedata The answer is RAID and UPS, but not that way. The RAID goes over the UPS; e.g. a mirror of two disk drives, each with its own UPS. <p> Such redundancy also makes it possible to test the UPS regularly and avoid the problem of two dead batteries when the external power fails. <p> The UPS doesn't count if you don't test, measure, and/or replace the its battery regularly. Sat, 04 Apr 2009 00:01:25 +0000 fsync() and disk flushes https://lwn.net/Articles/327230/ https://lwn.net/Articles/327230/ giraffedata <p> It's hard to believe there are disk drives out there (not counting an occasional broken one) that write trash over random areas as they power down. Disk drives I have seen have a special circuit to disconnect and park the head the moment voltage begins to drop. It has to park the head because you can't let the head land on good recording surface, and it has to cut off the write current because otherwise it's dragging a writing head all the way across the disk, pretty much guaranteeing the disk will never come back. I believe it's a simple circuit that doesn't involve any controller intelligence. <p> There is a related failure mode where the drive's <em>client</em> loses power and in its death throes ends up instructing the drive to trash itself while the drive still has enough power to operate normally. I've heard that's not unusual, and it's the best argument I know for a UPS that powers a system long enough for it to shut down cleanly. Fri, 03 Apr 2009 23:49:58 +0000 fsync() and disk flushes https://lwn.net/Articles/327037/ https://lwn.net/Articles/327037/ nix <div class="FormattedComment"> RAID doesn't really solve sudden-power-loss situations: in fact RAID-5 in <br> particular can make it much worse (turning small-range corruption into <br> apparent scattershot corruption).<br> <p> A UPS, or battery-backing, is the answer (well, moves the failure point: <br> if it's a UPS, the UPS must fail before you lose: if it's battery-backed, <br> you often have to lose the battery first, then power, which is likely to <br> happen because you often have no idea the battery has failed until it's <br> too late).<br> <p> In conclusion: we all suck, our data is doomed, the Second Law shall <br> triumph and Sod and Murphy shall dance above our mangled filesystems.<br> <p> </div> Fri, 03 Apr 2009 00:02:55 +0000 sticks & stones https://lwn.net/Articles/327036/ https://lwn.net/Articles/327036/ bojan <div class="FormattedComment"> Nothing to forgive. All is perfectly fine. I enjoy a robust discussion.<br> </div> Thu, 02 Apr 2009 23:56:35 +0000 fsync() and disk flushes https://lwn.net/Articles/327034/ https://lwn.net/Articles/327034/ xoddam <div class="FormattedComment"> I haven't crashed many disks but my limited experience is similar. If you get to the point where data loss is caused by the hardware, it is likely to be trashing a whole lot more than the contents of the cache at shutdown. The solution to this problem is RAID; journals solve a different problem altogether.<br> </div> Thu, 02 Apr 2009 23:31:45 +0000 fsync() and disk flushes https://lwn.net/Articles/327032/ https://lwn.net/Articles/327032/ anton <blockquote>And if you suddenly lose power, in his experience, the drive is actually much more likely to wipe out some arbitrary track of data from the disk than it is to have anything in the write cache and lose it.</blockquote> While I have experienced drives that damage sectors or tracks on power loss, I consider these drives faulty; and with such drives the problem does not seem to be limited to drives that are trying to write something at the time. However, most drives don't wipe out arbitrary data in my experience. <p>But I have tested two drives with a <a href="http://www.complang.tuwien.ac.at/anton/hdtest/">test program for out-of-order writing</a>, and found that they both wrote data several seconds out of order with a certain access sequence. If we don't see more frequent problems from this, that's probably because the disks don't optimize accesses as aggressively as some people imagine. Thu, 02 Apr 2009 23:28:33 +0000 sticks & stones https://lwn.net/Articles/327031/ https://lwn.net/Articles/327031/ xoddam <div class="FormattedComment"> <font class="QuotedText">&gt; In a post not so long ago, someone accused me of hiding behind Ted's authority</font><br> <p> I plead guilty and I apologise. That was immediately after replying to someone else's post the gist of which was "Ted wrote ext2 and ext3 in the first place, he is therefore above criticism." It concluded with the words "Know your place", which got me riled.<br> <p> [proverb: in the midst of great anger, never answer anyone's letter]<br> <p> Your words were not so condescending but they had much the same emphasis: all ur filesystems are belong to POSIX (not users) 'cos POSIX is the law, and by the way Ted's interpretation is the only correct one because he's the primary implementor.<br> <p> I hope you understand where I was coming from. Forgive me.<br> <p> </div> Thu, 02 Apr 2009 23:17:21 +0000 That massive filesystem thread https://lwn.net/Articles/327030/ https://lwn.net/Articles/327030/ anton <blockquote>The problem with what one might call the fsync() RANDOMLY_LOSE option is that it is something which must be used by everyman to avoid data loss, which if you get it wrong there is no sign unless you lose power at exactly the right time, and which nearly all programs you might clap eyes on other than Emacs have historically got wrong</blockquote> <a href="http://www.complang.tuwien.ac.at/anton/sync-metadata-updates.html">s/other than/including/</a>. However, I don't agree that this application behaviour is wrong; if the application wants to jump through hoops to get a little bit of extra safety on low-quality file systems, that's ok, but if it doesn't, that's also ok. It's up to the users to chose which applications they run and on which file system. Thu, 02 Apr 2009 23:16:34 +0000 That massive filesystem thread https://lwn.net/Articles/327001/ https://lwn.net/Articles/327001/ iabervon <p>Linus actually overstated git's use of fsync(). There are three relevant cases:</p> <ul> <li>git writes to a brand new filename, and then updates an existing file to contain the new filename instead of an old filename. It will optionally do a fsync() between these two operations.</li> <li>git writes all of the data from several existing files to a single new file, and then removes the existing files. It will always do a fsync() between these two operations.</li> <li>git updates an existing file by writing to a temporary file and renaming over the existing file. It will never do a fsync() between these two operations.</li> </ul> <p>That is, git relies on the assumption that a rename() is atomic with respect to the disk and dependent on all operations previously issued on the inode that is being renamed. It uses fsync() only to make sure that operations to different files happen in the order that it wants.</p> <p>Now, obviously, if you want to be really sure to keep some data, write it once and never replace it at all. That'll do a good job of protecting against everything, including bugs where you do something like "open(), fsync(), close(), rename()" but forget or mess up the "write()". Obviously, this isn't an option for a lot of situations, however, but it's what git does for the most important data.</p> Thu, 02 Apr 2009 20:23:21 +0000 fsync() and disk flushes https://lwn.net/Articles/326992/ https://lwn.net/Articles/326992/ iabervon <div class="FormattedComment"> Linus pointed out that, in his experience, the only way you can possibly lose data that has gotten to a drive's write cache is to suddenly lose power. And if you suddenly lose power, in his experience, the drive is actually much more likely to wipe out some arbitrary track of data from the disk than it is to have anything in the write cache and lose it. That is, if you want to be really certain that you don't lose data, you need to have a couple seconds of power for your disk drive after its input goes idle, or you might lose some data completely regardless of anything the application that writes it could possibly do.<br> <p> </div> Thu, 02 Apr 2009 19:15:59 +0000 Rename undo https://lwn.net/Articles/326977/ https://lwn.net/Articles/326977/ butlerm <div class="FormattedComment"> You are right, if you want guaranteed preservation of in-order semantics <br> there are no alternatives other than journalling the data or delaying all <br> journal commits until the corresponding data has been written. Both <br> options are available (e.g. data=journal, and data=ordered), and both have <br> serious performance problems. Of course, if that is really what you need, <br> than the price is worth paying.<br> <p> "data=writeback" is the current alternative which doesn't make any pretense <br> to the preserving in-order semantics of data and meta-data after a crash. <br> You get a snapshot of your meta-data at a certain point of time, but the <br> data may be trashed.<br> <p> Rename undo is a much less severe compromise to in-order semantics after a <br> crash. It is not point in time recovery, it is consistent version recovery. <br> That can have some unexpected side effects, but none remotely as severe as <br> losing the data completely. <br> <p> In the case you mention, if you write a new version, rename it over the old <br> one, change the security permissions on the replacement, and then the <br> system crashes, you are not going to get the new (unwritten) data, the new <br> inode, and the new permissions, you are going to get the old inode, the old <br> data, and the old permissions. The permissions go with the inode (and the <br> data), not the directory entry. That is what you want. The old inode (and <br> the old data) has to be kept around until the data for the new inode is <br> completely on disk. Otherwise you cannot undo the rename replacement after <br> the fact.<br> </div> Thu, 02 Apr 2009 18:13:21 +0000 Totally superfluous. https://lwn.net/Articles/326904/ https://lwn.net/Articles/326904/ rvfh <div class="FormattedComment"> <font class="QuotedText">&gt; The application developer who has the choice of using these options doesn't know whether the data is expendable or not</font><br> How then are they expected to know when to flush?<br> <p> And anyway, do we really not know which files are important and which are not?<br> Examples:<br> * pid file, browser cache: don't care<br> * conf file, document, code: care<br> * database file: care a lot<br> <p> But I do thank you for challenging this idea ;-) Please feel free to give counter-examples and -arguments.<br> </div> Thu, 02 Apr 2009 14:30:38 +0000 Totally superfluous. https://lwn.net/Articles/326901/ https://lwn.net/Articles/326901/ xoddam <div class="FormattedComment"> The application developer who has the choice of using these options doesn't know whether the data is expendable or not, and the end-user doesn't care about the implementation details.<br> <p> Things go strangely pear-shaped when the most irrelevant, trivial data (eg. GNOME configs when we're only using GNOME because it's a default someone else chose) goes missing or gets corrupted.<br> <p> I most definitely don't care if GNOME forgets where I put a window or two. But I do care if it fails to start.<br> <p> What we end-users want (I wear a developer hat much of the time but I'm *always* a user) is not to be annoyed by the things we don't care about. O_EXPENDABLE and its ilk are an invitation for corner-cases to bite end-users. End-users don't deserve such treatment.<br> </div> Thu, 02 Apr 2009 14:14:49 +0000 Rename undo https://lwn.net/Articles/326898/ https://lwn.net/Articles/326898/ xoddam <div class="FormattedComment"> <font class="QuotedText">&gt; If one wants high performance rename replacements, rename undo is much more practical.</font><br> <p> I'm intrigued, but not satisfied. Telling the journal that a metadata change is 'committed' means that the post-crash-recovery state will reflect the change (journal replay).<br> <p> Surely the only satisfactory way to commit data before committing the metadata change is to delay *all* journal commits in-order until after the relevant file data is written in place, or to journal the data itself.<br> <p> For performance reasons it's probably much saner not to journal most data, especially for random access within large files, but I'm thinking that if it makes sense to allocate-on-commit to preserve the in-order semantics of atomic rename, it might also make good sense to special-case data journalling for newly-written (created or truncated) files when they are renamed (perhaps only for small files, and allocate-on-commit larger ones as users will likey expect a delay).<br> <p> Having the ability to unwind a specific kind of metadata change seems very confusing. I fear that winding back a rename could well result in a violation of expected in-order semantics w.r.t. metadata after crash recovery. Or might it be possible to wind back an entire 'transaction', all other metadata changes since the rename included?<br> <p> </div> Thu, 02 Apr 2009 14:01:34 +0000 That massive filesystem thread https://lwn.net/Articles/326857/ https://lwn.net/Articles/326857/ butlerm <div class="FormattedComment"> 'softupdates' is proof of the complexity of keeping two fully generalized <br> versions of the meta data around at all times - not only that but <br> converting back and forth between the two versions on demand.<br> <p> You can't really "queue" a rename, without doing something comparable to <br> what softupdates does, because the rename has to take immediate affect from <br> the application perspective. To do that, somewhere there must be a layer <br> to keep track of the difference between what the user visible meta data is, <br> and what the committed meta data is. If the differences are sufficiently <br> general, that is a major problem. If one wants high performance rename <br> replacements, rename undo is much more practical.<br> <p> It would be practical to update atimes on a low priority basis, with the <br> caveat that a lot of memory may be consumed holding metadata blocks around <br> until the atime updates are complete. In addition, on a system under <br> sufficient load, moving I/O to a low priority thread doesn't really help <br> anyway. <br> </div> Thu, 02 Apr 2009 06:40:34 +0000 Recursive linking https://lwn.net/Articles/326860/ https://lwn.net/Articles/326860/ man_ls Sorry, it was a stupid attempt from a foreigner at an April Fools' prank :D I was hoping that the recursive link would give it away, but maybe it was too plausible altogether. <p> Will try to do better next time :D) Thu, 02 Apr 2009 06:21:51 +0000 That massive filesystem thread https://lwn.net/Articles/326814/ https://lwn.net/Articles/326814/ davecb <div class="FormattedComment"> How do filesystems other than ext3/4 do it? <br> <p> Well, the Unix v6 filesystem implemented <br> in-order writes, as did 4.x BSD and the<br> other pre-journaled filesystems. POSIX allows<br> reordering to make coalesence easy, as<br> a lot of research was being done at that<br> time to get better performance.<br> <p> A colleague at ICL (hi, Ian!) did his <br> masters at UofT on that, and found you<br> could get a performance improvement <br> and still preserve correctness by using<br> what I'd characterize as tsort(1), which<br> worked better than BSD/Solaris soft updates.<br> <p> --dave<br> </div> Thu, 02 Apr 2009 00:37:20 +0000 That massive filesystem thread https://lwn.net/Articles/326793/ https://lwn.net/Articles/326793/ bojan <div class="FormattedComment"> Just a few points, so please don't get offended. I apologise in advance to all sensitive LWN readers for any injury caused by this post.<br> <p> <font class="QuotedText">&gt; Why invent a new system call which cannot (by necessity) be honored by ext2, or ext4 without a journal?</font><br> <p> Even if there was some kind of magical law that said that you could not order commits on the non-journaled file system this way, it can always be trivially implemented through - wait for it - fsync(), which has acceptable performance characteristics on such file systems.<br> <p> <font class="QuotedText">&gt; Everything is working now fine in ext3</font><br> <p> Sure. Except fsync(), which locks the whole system for a few seconds. Hopefully, this will get fixed (or at least its effect reduced) as a result of the hoopla.<br> <p> <font class="QuotedText">&gt; Well, now that Ts'o's commit rights have been officially revoked I think that the whole discussion is moot.</font><br> <p> Now you are really making a fool of yourself.<br> </div> Wed, 01 Apr 2009 22:38:42 +0000 ext4 trees https://lwn.net/Articles/326788/ https://lwn.net/Articles/326788/ corbet I'm confused. The article said that Ted's trees had not been pulled <i>yet</i>. In fact, that happened today; <a rel="nofollow" href="http://lwn.net/Articles/326708/">a bunch of ext4 work</a> went into the mainline, including a number of patches which increase robustness for applications which don't use <tt>fsync()</tt>. I dunno what you were trying to link to, but it didn't work. I've not seen anything about revocation of commit rights. (It's hard to "revoke commit rights" in a distributed system in any case; at worst you can refuse to pull from somebody else's repository.) <p> Maybe it's an April 1 post that went over my head? Wed, 01 Apr 2009 21:46:53 +0000 That massive filesystem thread https://lwn.net/Articles/326786/ https://lwn.net/Articles/326786/ man_ls It is a worthless effort. Each filesystem must keep its house clean. Why invent a new system call which cannot (by necessity) be honored by ext2, or ext4 without a journal? Everything is working now fine in ext3, and if it doesn't work right in ext4 people will just look for a different filesystem. <p> After reading that Linus is not pulling from Mr Tso's trees made me suspect. Well, now that <a href="http://lwn.net/Articles/326786/rss">Ts'o's commit rights have been officially revoked</a> I think that the whole discussion is moot. I wonder if the next ext4 head maintainer will learn from this painful experience and just do the right thing. Wed, 01 Apr 2009 21:37:29 +0000 That massive filesystem thread https://lwn.net/Articles/326781/ https://lwn.net/Articles/326781/ bojan <div class="FormattedComment"> I will just answer this one comment, so that nobody gets "offended".<br> <p> Quite the opposite. I'm all for fixing bugs and giving application programmers the _right_ tools for the job. If some Linux developers took a second to lift their noses out of the specifics of Linux and actually looked around, this could be fixed for _everyone_, not just for some Linux specific file systems. That is my point, in case you didn't get it by now.<br> </div> Wed, 01 Apr 2009 20:55:12 +0000 That massive filesystem thread https://lwn.net/Articles/326763/ https://lwn.net/Articles/326763/ Steve_Baker <div class="FormattedComment"> Perhaps instead they could just invert the meaning of the 'A' file <br> attribute when a filesystem has been mounted with the noatime or relatime <br> option to force strict atime updates for files so marked. That way you <br> can mount your filesystem(s) noatime and only put the A attribute on your <br> mailboxes and you're done.<br> <p> </div> Wed, 01 Apr 2009 19:41:09 +0000 I/O barriers and LVM https://lwn.net/Articles/326759/ https://lwn.net/Articles/326759/ quotemstr <div class="FormattedComment"> No, they don't do this. LVM's lack of barrier support has resulted in my not using LVM in some circumstances. If kernel.org kernels grow LVM barrier support and are otherwise stable, it might be worth using them until distribution kernels are updated to include the functionality.<br> </div> Wed, 01 Apr 2009 19:08:51 +0000 I/O barriers and LVM https://lwn.net/Articles/326757/ https://lwn.net/Articles/326757/ sbergman27 <div class="FormattedComment"> """<br> This change alone is making me consider using a kernel.org kernel, something I haven't done in years.<br> """<br> <p> Are you saying that distro kernels already do this? If so, I'm not suggesting that you are wrong. Just interested in more info. <br> </div> Wed, 01 Apr 2009 19:03:54 +0000 That massive filesystem thread https://lwn.net/Articles/326732/ https://lwn.net/Articles/326732/ quotemstr By your logic, we should never fix bugs. Remember the <a href="http://www.theinquirer.net/inquirer/news/136/1043136/bsd-unix-finally-fixes-old">25 year old readdir bug</a>? Don't you agree it was good to fix that? What if a program, somewhere, depended on that behavior? In reality, programs use rename for atomic replacement. POSIX doesn't say anything about guarantees after a hard system crash, and it's just disingenuous to think that by punishing application authors by giving them as little robustness as possible, you're doing them some kind of portability favor. Wed, 01 Apr 2009 18:09:36 +0000 Yup. It's the beginning of the end. https://lwn.net/Articles/326691/ https://lwn.net/Articles/326691/ foom <div class="FormattedComment"> This is starting to get very repetitive...all these points have been made already at least once in one <br> of the other article's threads. I'd like to suggest that it might be in everyone's interest to move on to <br> more useful pass-times than rehashing the same arguments over and over again every time there's <br> an update on the subject.<br> </div> Wed, 01 Apr 2009 16:10:07 +0000