LWN: Comments on "Con Kolivas returns with a new scheduler" https://lwn.net/Articles/350100/ This is a special feed containing comments posted to the individual LWN article titled "Con Kolivas returns with a new scheduler". en-us Thu, 11 Sep 2025 09:25:05 +0000 Thu, 11 Sep 2025 09:25:05 +0000 https://www.rssboard.org/rss-specification lwn@lwn.net Con Kolivas returns with a new scheduler https://lwn.net/Articles/353181/ https://lwn.net/Articles/353181/ daenzer <div class="FormattedComment"> Do you have any form of kernel preemption enabled? IME that helps a lot for interactivity vs. swap. But of course, there's only so much it can do in the face of a runaway process which eats memory like there's no tomorrow.<br> </div> Thu, 17 Sep 2009 10:05:27 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/352168/ https://lwn.net/Articles/352168/ jbh <div class="FormattedComment"> Good question! The desired property here is mainly "not CFQ", since the <br> default CFQ scheduler seems to behave badly (wrt IO latency) with cheap <br> SSDs. See for example <a href="http://forum.eeeuser.com/viewtopic.php?id=23580">http://forum.eeeuser.com/viewtopic.php?id=23580</a> .<br> <p> The noop scheduler is also sometimes recommended, but the linked thread <br> suggests it may suffer from IO starvation since it doesn't do any balancing <br> between processes.<br> <p> [ it also recommends the following, which I haven't tried:<br> echo 1 &gt; /sys/block/sda/queue/iosched/fifo_batch<br> ]<br> </div> Fri, 11 Sep 2009 15:45:40 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/352158/ https://lwn.net/Articles/352158/ SEMW <div class="FormattedComment"> <font class="QuotedText">&gt; - barrier=0 mount option</font><br> <font class="QuotedText">&gt; - elevator=deadline boot option</font><br> <font class="QuotedText">&gt;- tune2fs -o journal_data_writeback</font><br> <p> Regarding the Deadline i/o scheduler, according to Wikipedia, "The kernel docs suggest this is the preferred scheduler for database systems, especially if you have TCQ aware disks, or any system with high disk performance.". Since an ssd on an eeepc is neither a database system, nor TCQ aware (TCQ being meaningless for SSDs), nor has high disk performance, what property makes it good for that workload?<br> </div> Fri, 11 Sep 2009 14:54:59 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351742/ https://lwn.net/Articles/351742/ realnc <div class="FormattedComment"> Ingo posted a new version of latt.c that when used, reports better values than BFS.<br> <p> Normally I'd say this smells fishy since it comes from Ingo, but I'm sure other devs would have objected if he would modify latt.c in any malicious way.<br> </div> Wed, 09 Sep 2009 20:28:06 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351637/ https://lwn.net/Articles/351637/ kmike I'm kind of late to the party, but I was just reading the linked thread at LKML. Just check out these numbers (<a rel="nofollow" href="http://article.gmane.org/gmane.linux.kernel/887354">post by Jens Axboe</a>): <pre>BFS210 runs on the laptop (dual core intel core duo). With make -j4 running, I clock the following latt -c8 'sleep 10' latencies: -rc9 Max 17895 usec Avg 8028 usec Stdev 5948 usec Stdev mean 405 usec Max 17896 usec Avg 4951 usec Stdev 6278 usec Stdev mean 427 usec Max 17885 usec Avg 5526 usec Stdev 6819 usec Stdev mean 464 usec -rc9 + mike Max 6061 usec Avg 3797 usec Stdev 1726 usec Stdev mean 117 usec Max 5122 usec Avg 3958 usec Stdev 1697 usec Stdev mean 115 usec Max 6691 usec Avg 2130 usec Stdev 2165 usec Stdev mean 147 usec -rc9 + bfs210 Max 92 usec Avg 27 usec Stdev 19 usec Stdev mean 1 usec Max 80 usec Avg 23 usec Stdev 15 usec Stdev mean 1 usec Max 97 usec Avg 27 usec Stdev 21 usec Stdev mean 1 usec One thing I also noticed is that when I have logged in, I run xmodmap manually to load some keymappings (I always tell myself to add this to the log in scripts, but I suspend/resume this laptop for weeks at the time and forget before the next boot). With the stock kernel, xmodmap will halt X updates and take forever to run. With BFS, it returned instantly. As I would expect. So the BFS design may be lacking in the scalability end (which is obviously true, if you look at the code), but I can understand the appeal of the scheduler for "normal" desktop people. </pre> Amazing difference. I'm tempted to try this thing out. Wed, 09 Sep 2009 15:19:49 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351590/ https://lwn.net/Articles/351590/ jbh <div class="FormattedComment"> Ah, then we're in full agreement. An SSD will in fact *cause* the problems <br> described in this thread.<br> <p> ;-)<br> </div> Wed, 09 Sep 2009 08:47:35 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351588/ https://lwn.net/Articles/351588/ realnc <div class="FormattedComment"> This was related to the original comment by kragil you replied to. An SSD won't fix the problems described in this thread.<br> </div> Wed, 09 Sep 2009 08:41:16 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351580/ https://lwn.net/Articles/351580/ jbh <div class="FormattedComment"> Huh? I'm not sure what you mean, I already have an SSD. That's the problem: it's very slow, random writes &lt;100kB/s. And I strongly doubt that BFS can magically fix that.<br> <p> What CAN fix it is a combination of: (i) write less, (ii) don't wait for writes. That's the point of the tuning. Nothing to do with process scheduling.<br> </div> Wed, 09 Sep 2009 07:43:15 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351518/ https://lwn.net/Articles/351518/ realnc <div class="FormattedComment"> You don't need an SSD. You need BFS. Cheaper and works wonders.<br> </div> Tue, 08 Sep 2009 20:11:13 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351137/ https://lwn.net/Articles/351137/ jlokier <div class="FormattedComment"> The real reason we used to use "make -j MORE_THAN_NR_CPUS" was to get useful work done when a process waits on I/O. Con's scheduler doesn't help with that: if gcc foo.c needs to read foo.c from disk, you must have more jobs than CPUs, or you have an idle CPU.<br> <p> What's changed is nowadays most of us have enough RAM to keep the whole kernel source and object files in cache, so there's no I/O delay after the first compile of the day.<br> <p> I'm surprised Con's scheduler compiles more quickly than CFS, and that has to be worth looking into.<br> </div> Mon, 07 Sep 2009 10:09:59 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351117/ https://lwn.net/Articles/351117/ jbh <div class="FormattedComment"> Right! You probably have most things covered. Make sure you use the fast <br> ssd for the system of course, and save the 8GB for data. Three more things:<br> - barrier=0 mount option<br> - elevator=deadline boot option<br> - tune2fs -o journal_data_writeback<br> <p> The first one (mount -o barrier=0) is subjectively the one that makes the <br> most real difference. Ext4 has barriers enabled by default, as opposed to <br> ext3.<br> <p> I have a slow-only 16GB 900. With your setup, it might be effective to put <br> all journals on the fast disk (external journal), but I guess that's if you <br> actually enjoy tweaking :)<br> <p> Finally it's possible to replace that slow SSD with a faster one from <br> RunCore for example. But that doesn't explain why windows does better <br> (which surprises me a bit as I've heard others complain that windows is <br> very bad at least on the 16BG --- but maybe it's ok if it's installed on <br> the fast 4GB disk).<br> </div> Mon, 07 Sep 2009 08:56:39 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351038/ https://lwn.net/Articles/351038/ kragil <div class="FormattedComment"> Thank you.<br> <p> <p> It is a 901 with a fast 4GB SSD drive and a slow 8GB SSD. I have done some tweaking (Firefox ramdisk, ext4, noatime,noload) but maybe you have the right answers for my problems (I am using KDE4 on Fedora 11)<br> <p> I will try 2.6.31 as soon as it is ready. I guess it might also help.<br> </div> Sun, 06 Sep 2009 21:43:30 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351025/ https://lwn.net/Articles/351025/ jbh <div class="FormattedComment"> If your eeepc has an SSD, skips/freezes are more likely to come IO stalls <br> than from the CPU scheduler. Particularly if it's the 16GB variant of <br> 900/900a, those are known to have very poor performance. In that case there <br> are some mount options and other tunables that can make a large difference <br> (disabling barriers and so on, let me know if you need a full list).<br> <p> Or is this a conventional hard drive model?<br> </div> Sun, 06 Sep 2009 17:14:12 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351007/ https://lwn.net/Articles/351007/ trasz <div class="FormattedComment"> z/OS keeps full compatibility only for applications that don't mess with system internals. If your program does something strange (although supported by IBM), it's possible that it will cease working in few releases from now. Of course there is a whole process of phasing out features, so it won't be a nasty surprise to you or your customers - but still, they are definitely not keeping 30 years of backward compatibility for that.<br> <p> </div> Sun, 06 Sep 2009 11:46:27 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/351003/ https://lwn.net/Articles/351003/ kragil <div class="FormattedComment"> I have to agree completely!<br> <p> On my EeePC Linux (Fedora) hangs, skips and freezes. Really annoying. XP and OSX86 don't. I really wish some Netbook distro would integrate this scheduler ASAP or Linus would come to his senses and make the scheduler pluggable. I don't need fair behavior or 4096 CPU support on my netbook. I want it to react to my commands and play my media like it was supposed to be played, but I guess servers don't need that and so most of the kernel devs just don't give a shit.<br> </div> Sun, 06 Sep 2009 10:50:13 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350943/ https://lwn.net/Articles/350943/ realnc <p>"BIG NUMA machines will probably suck a lot with this because it pays no deference to locality of the NUMA nodes when deciding what cpu to use. It just keeps them all busy. The so-called "light NUMA" that constitutes commodity hardware these days seems to really like BFS."</p> <p>The point is that big NUMA machines aren't used for Desktops ;)</p> Sat, 05 Sep 2009 03:28:34 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350820/ https://lwn.net/Articles/350820/ realnc <p>I've just installed a bfs (204) patched kernel on my Core 2 Duo. I fired up <a rel="nofollow" href="http://lmms.sourceforge.net">LMMS</a> and started a CPU hungry project and there's not a single sound skip/pop/crackle. Smooth. It was giving lots of sound artifacts (due to under-runs) before. At the same time I started playing notes in the Fluidsynth synthesizer (utilizing a 1.8GB sound font!), and there's virtually no lag between the key press and when the note actually plays anymore.</p> <p>Then I started a kernel build (-j2), fired up mplayer on a 1080p HD video, and continued working in LMMS.</p> <p>Still smooth. That was totally killing LMMS before (too much lag). I felt like I was using the MS Windows version of LMMS. I now realize that the current scheduler totally sucks for Desktops.</p> <p>IMHO, what Desktop Linux needs desperately right now are more people like Kolivas. Dude, if you're reading this: thank you. I hope people will contribute to BFS. But I guess the kernel devs will not even take a look at it since it doesn't "scale". (The sound cracks/pops and lag in my music authoring tool won't "scale" either, but it seems they don't care very much; you see, LMMS is not MySQL. A 1% regression there and they start running to fix it. But who cares about me making music on Linux, right?)</p> Fri, 04 Sep 2009 11:10:23 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350801/ https://lwn.net/Articles/350801/ DavidG <div class="FormattedComment"> FWIW: for AMD, only Opterons have NUMA and AFAIK Intel i7 don't have NUMA either.<br> So you're save to use BFS.<br> </div> Fri, 04 Sep 2009 09:31:35 +0000 Input pauses https://lwn.net/Articles/350794/ https://lwn.net/Articles/350794/ Cato <div class="FormattedComment"> I've had these pauses for several seconds sometimes in Firefox 3.5.2, but they seem to be associated with large writes e.g. un-tarring a huge file at the same time - so I suspect it's the well known "Firefox 3.0 + ext3 + fsync" issue, reported in <a href="http://lwn.net/Articles/284126/">http://lwn.net/Articles/284126/</a> and elsewhere.<br> </div> Fri, 04 Sep 2009 08:17:40 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350788/ https://lwn.net/Articles/350788/ shane <div class="FormattedComment"> From the linked-to text:<br> <p> GIT repository?<br> <p> Sorry, it's not the right tool for me so it's not worth me investing the time<br> in setting one up.<br> <p> </div> Fri, 04 Sep 2009 07:49:15 +0000 Input pauses https://lwn.net/Articles/350771/ https://lwn.net/Articles/350771/ jzbiciak <div class="FormattedComment"> I know Firefox's stalls are Firefox's fault, not the Linux scheduler's. I was begging Con to do interactivity work on Firefox. ;-) <br> <p> (As in, contribute to the Mozilla team.)<br> </div> Fri, 04 Sep 2009 04:32:56 +0000 Input pauses https://lwn.net/Articles/350767/ https://lwn.net/Articles/350767/ ncm <div class="FormattedComment"> I see input-event stalls for up to six seconds in Firefox (Debian Iceweasel, actually), routinely. To get it I just have to leave it running for a week or so, with a few dozen pages, some of which self-update.<br> <p> I doubt the scheduler would help; during those six seconds, one CPU is pegged at 100%. This is with nothing swapped. I suspect it's doing a garbage collection scan during each pause. I'd welcome suggestions for how to discover what's going on. <br> <p> Google Chrome is coming along too slowly.<br> </div> Fri, 04 Sep 2009 04:09:25 +0000 Stuck keys. https://lwn.net/Articles/350766/ https://lwn.net/Articles/350766/ ncm <div class="FormattedComment"> I've seen on 2.6.29. Unplugging and re-plugging the (USB) keyboard didn't fix it. Only switching to console and back did it. I was inclined to blame the new X input system. It hasn't happened since I upgraded X and also went to 2.6.30, so who knows?<br> </div> Fri, 04 Sep 2009 02:47:57 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350765/ https://lwn.net/Articles/350765/ Tronic <div class="FormattedComment"> Except that it is not really possible to prevent OOM with ulimits. The <br> limits are per-process and thus a larger number of processes can easily eat <br> all your RAM. Setting nproc and memory limits so that running out of RAM is <br> impossible is otherwise unfeasible.<br> <p> This scenario is very real - just last week I managed to crash our master <br> server by accidentally running about 100 Valgrind-debugged web services at <br> the same time. The machine went completely unresponsive for a long time, so <br> finally we went to the server room to reset it the hard way (even the <br> console would not respond at all). Linux still doesn't handle OOM situations <br> properly, it seems.<br> <p> </div> Fri, 04 Sep 2009 02:38:44 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350690/ https://lwn.net/Articles/350690/ jzbiciak <div class="FormattedComment"> That sounds more like a dropped event somewhere such as dropped interrupt or the like. I haven't experienced that on any Linux box so far.<br> </div> Thu, 03 Sep 2009 18:14:15 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350679/ https://lwn.net/Articles/350679/ charris <div class="FormattedComment"> I also see "stuck" keys where a letter will repeat until the buffer is full. I see this on various hardware with various keyboards, so it isn't an actual stuck key. Do you see that also?<br> </div> Thu, 03 Sep 2009 17:31:34 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350675/ https://lwn.net/Articles/350675/ smadu2 <div class="FormattedComment"> OK. <br> </div> Thu, 03 Sep 2009 17:02:16 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350603/ https://lwn.net/Articles/350603/ jzbiciak <div class="FormattedComment"> I've only gotten multi-second pauses like that when I had a hard drive going bad, or I was swapping rather furiously.<br> <p> With that much RAM, I wonder if VM housekeeping itself could cause such lags. 12Gb would have 3 million 4K pages. If you are running at 3GHz and did something that averaged 3000 cycles/page across all 3 million pages, that's 3 seconds.<br> </div> Thu, 03 Sep 2009 13:39:56 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350601/ https://lwn.net/Articles/350601/ zlynx <div class="FormattedComment"> I was just commenting on how they do it. I am not saying anything bad about it.<br> </div> Thu, 03 Sep 2009 13:28:08 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350598/ https://lwn.net/Articles/350598/ nix <div class="FormattedComment"> It's certainly not the bluetooth driver or X, because I see this on the console of a one-die four-core Nehalem (Core i7) with no X running at all and a PS/2 keyboard. It's lightly loaded by these standards (load average 8.1, which as it's hyperthreaded is pretty much equivalent to 'everything loaded but not much'), and surely isn't swapping (12Gb RAM, 6Gb *free*, not even used for cache). Yet I see three-second pauses in keyboard activity.<br> <p> I'll turn on latencytop and see what it says, but the pauses are fairly rare so it might be interesting to interpret.<br> <p> </div> Thu, 03 Sep 2009 13:15:54 +0000 Desktop Debian = Ubuntu https://lwn.net/Articles/350596/ https://lwn.net/Articles/350596/ nix <div class="FormattedComment"> Former DPLs need not apply ;}<br> <p> (of course I'm running Debian on the desktop too, and adminning it remotely for my mum, who's in the same boat, and all she wants is something that Just Works and isn't virally infestable...)<br> <p> </div> Thu, 03 Sep 2009 13:10:09 +0000 Welcome To The Jungle https://lwn.net/Articles/350595/ https://lwn.net/Articles/350595/ gouyou <p>It reminds me of this blog post from the Adobe/Linux/Flash guy: <a href="http://blogs.adobe.com/penguin.swf/2007/05/welcome_to_the_jungle.html">Welcome To The Jungle</a>. Thu, 03 Sep 2009 13:09:14 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350569/ https://lwn.net/Articles/350569/ hppnq From what I've read about your problems with filesystem X eating your data on platform Y, you are likely to be a victim on any combination of X and Y. <p> Just make proper backups and do not assume that any filesystem is going to keep your data, especially if you stress it enough. Thu, 03 Sep 2009 11:53:31 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350553/ https://lwn.net/Articles/350553/ Cato <div class="FormattedComment"> I don't agree the filesystems are currently 100% fine - I've lost a lot of data on ext3, most likely due to the default use of write caching in the drive (not a filesystem issue but the default is dangerous for use with most filesystems) combined with a lack of journal checksumming (added in ext4). Also, until btrfs is stable, we won't have disk block level checksums, as in ZFS, which is the one thing that really makes me want to use OpenSolaris.<br> <p> See <a href="http://lwn.net/Articles/350072/">http://lwn.net/Articles/350072/</a> and in particular the link to the University of Wisconsin paper linked within the thread - they looked at reiserfs, ext3 and JFS, and how they cope with disk failures.<br> </div> Thu, 03 Sep 2009 10:17:40 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350511/ https://lwn.net/Articles/350511/ k8to <div class="FormattedComment"> Please don't complain about that which you don't understand.<br> <p> The Sound and Graphics have arguable unpleasantries (though all platforms do). <br> <p> The scheduler isn't optimal for some workloads, but it's also not bad. Compare it to the windows or Mac scheduler both of which regularly do really unpleasant things for both server and workstation workloads.<br> <p> The filesytems are 100% fine.<br> </div> Thu, 03 Sep 2009 05:02:57 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350509/ https://lwn.net/Articles/350509/ k8to <div class="FormattedComment"> Isn't that kind of aside from the point? I mean how they do it is pretty much their choice. They do it.<br> </div> Thu, 03 Sep 2009 04:56:04 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350487/ https://lwn.net/Articles/350487/ charris <div class="FormattedComment"> I find that gnome terminals will ignore my bluetooth keyboard for up to several seconds. I don't know if this is because of the bluetooth driver, X, or the scheduler, but it sure is annoying. And this on a 3GHz quad core system.<br> </div> Thu, 03 Sep 2009 02:37:58 +0000 Con Kolivas returns with a new scheduler https://lwn.net/Articles/350459/ https://lwn.net/Articles/350459/ smadu2 <div class="FormattedComment"> New addition to the list of things that are not so good in Linux kernel/OS, 1. Sound 2. Graphics 3. Filesystem and now 4. Scheduler.<br> </div> Thu, 03 Sep 2009 00:07:59 +0000 Desktop Debian = Ubuntu https://lwn.net/Articles/350424/ https://lwn.net/Articles/350424/ alex <div class="FormattedComment"> Hey I suggested my other half install Debian on her old laptop as she wanted experiment with free software. I was slightly worried she would have issues with the wireless but it all installed and ran fine. I think Debian makes a fantastic desktop OS if your not concerned about the latest whizz bang graphics effects or media players. It's stable and lightweight and very developer friendly.<br> </div> Wed, 02 Sep 2009 21:01:19 +0000 Desktop Debian = Ubuntu https://lwn.net/Articles/350420/ https://lwn.net/Articles/350420/ zlynx <div class="FormattedComment"> But do I know you?<br> <p> I mean, when I said it, I meant people I actually know in person and I know what they're running.<br> <p> Now, as I don't make a habit of going around asking people if they're running Ubuntu or Debian or what, my sample size is about 5 people.<br> <p> One of those I know used to run a Debian laptop and I *think* he is running Ubuntu now but possibly not. I know the other 4 are running Ubuntu.<br> </div> Wed, 02 Sep 2009 20:47:15 +0000