Shuttleworth: Losing graciously [LWN.net]

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 14:31 UTC (Fri) by juliank (guest, #45896) [Link] (22 responses)

Nelson says Ha Ha!

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:38 UTC (Fri) by mebrown (subscriber, #7960) [Link] (4 responses)

There is also such a thing as winning graciously.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:45 UTC (Fri) by juliank (guest, #45896) [Link] (3 responses)

Well, it's just Nelson saying this. Nelson has no personal interest in things. He just likes saying Ha Ha. It's just a well fitting quote from the Simpsons you can use whenever someone loses.

And it's not about the losing per se, it's more the fact that Mark called it losing. He did not do anything, Canonical was officially not involved in the whole process, and he previously said he was committed to upstart, so how can he call it losing?

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 16:00 UTC (Fri) by mpr22 (subscriber, #60784) [Link] (2 responses)

It is impossible to use the Nelson quote as an unaccompanied one-liner without making it reasonable for people to infer that you are experiencing schadenfreude.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 17:47 UTC (Fri) by kragil (guest, #34373) [Link] (1 responses)

"Schadenfreude ist die beste Freude"

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 18:41 UTC (Sat) by danielf2 (guest, #92398) [Link]

Schadenfreude ist die _einzige_ Freude! :)

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 19:12 UTC (Fri) by jspaleta (subscriber, #50639) [Link] (16 responses)

Uncalled for and unhelpful.

Some of us who are publicly critical of Canonical do so because we genuinely believe they've made poor management decisions and are trying to find a way to encourage them to make better ones going forward. Rubbing there nose in poor decision making, after they decide to change totally defeats the point. No matter how long and drawn out the discussion on a particular error in judgement, its actively destructive to gloat when they decide to change course. A decision to wind down upstart development now, as implied by Mark's blog post, is more of a win for Canonical and Ubuntu than anyone else. Nelsoning that decision makes it that much harder for Canonical and Ubuntu to move forward cleanly and fully embrace the change without hard feelings.

Or to give everyone something that will fit into a tweet:
Juliank, Stop being a dick.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 20:55 UTC (Fri) by juliank (guest, #45896) [Link]

Sorry for that. It was not really meant that way.

But nice word "nelsoning".

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 23:41 UTC (Fri) by HelloWorld (guest, #56129) [Link] (14 responses)

Have you forgotten about the "open source tea party" already? Have you already forgotten about Martin Gräßlin almost quitting his Open Source work due to mobbing by Canonical? https://plus.google.com/app/basic/stream/z13hi1tbptzigbmn...
Not to mention that they're *still* actively fracturing the Linux Community with Mir. So I'm sorry, but Mark had it coming. And when he gives up on Mir, I'll be laughing at him too.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 0:18 UTC (Sat) by jspaleta (subscriber, #50639) [Link] (1 responses)

Mir is a different kettle of fish entirely....

upstart was created as a legitimate effort to solve real deficiencies in sysinit at a time when no other competing option for service management was seeing any serious development or adoption momentum at all. Mir, on the other hand, is just a wasteful distraction that doesn't really solve any technical problems wayland wasn't already trying to solve.

So credit where credit is due. Unlike Mir, Upstart was intended to solve some long standing and commonly experienced deficiencies and had the potential to be a wildly used replacement for sysvinit. Upstart's development stagnation and its inability to attract a development community is a cautionary tale to learn from...not to deride because it failed.

And for me at least, seeing Canonical drop upstart isn't really a win at all. The ultimate win, is to get Canonical to drop that blasted CLA requirement on all their gpl licensed projects. Canonical has not really internalized the fact that their CLA requirements hurt their ability to sustain innovation and to build development communities. Irony abounds, in that Canonical can do such a reasonably good job of community building in every aspect of the Ubuntu distribution _except_ development of Canonical led projects...which is the only area with a legal hurdle like the CLA. I can only imagine that one of the early conversations around community management went like this: "I'm bored burning ants with my magnifying glass. bored bored bored. Hey you know what would be fun?! Let's build a rock star community development team inside the corporate fenceline and then throw up insurmountable hurdles in their way." That's a profoundly sadistic management philosophy.

There's still more work to do to convince them that the CLA gets in the way of building strong developer communities. And sadly if they don't learn this lesson soon, their Unity stack push into mobile is going to get bitten by the same problem that bit upstart. Canonical does not have the manpower to sustain technology development on their own and the CLA is _hostile_ to external developer interests.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 11:14 UTC (Sat) by rsidd (subscriber, #2582) [Link]

+1. Shuttleworth has not learned the right lesson at all. Its the CLA that felled upstart, otherwise (according to SJR, Upstart's creator) systemd may not have existed and upstart may have ended looking much like systemd. I support their decision to work on Mir, but it will never catch on elsewhere either as long as the CLA is in place.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 0:32 UTC (Sat) by Tov (subscriber, #61080) [Link] (11 responses)

I can perfectly see, what MS meant by "open source tea party". The forum here and elsewhere seems to be full of people polarizing the debate, holding grudge and trying to dig the trenches deeper. No matter the preceding heated discussion (technical and otherwise) I really salute MS for trying to bury the hatchet and and bowing for the decision by the TC.

Although personally having enjoyed the benefits of Upstart for several years, I really look forward to the new benefits promised by systemd. Also the (friendly?) competition between Mir and Wayland may really propel the Linux desktop/phone forward. What happened to "Freedom of choice"? MS and Canonical are free to make their choices, and you and me are free to make our choices.

Please try to be polite, respectful, and informative.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 3:55 UTC (Sat) by alison (subscriber, #63752) [Link] (10 responses)

>I can perfectly see, what MS meant by "open source tea party".

I agree. We do have a mob mentality sometimes in open source. We all know of examples and reciting them is unnecessary, assuredly.

The big question is, will ChomeOS now abandon Upstart? Notably Upstart creator Resnick works at Google. Unlike Canonical, Google assuredly has plenty of money to maintain a fork indefinitely, as they have already demonstrated with the kernel, codecs, Android and webkit. I don't see splintering going away at all -- the parties to it are just morphing.

I do hope that the distros will stop fighting each and focus on the real enemy, closed-source. I'm with Zach Pfeffer: let's kill those binary blobs dead, and stop wasting energy wrestling each other.

systemd in ChromeOS

Posted Feb 15, 2014 12:01 UTC (Sat) by alex (subscriber, #1355) [Link]

If they do decide to change to systemd is should be a lot easier for them given there is only one version of ChromeOS and they can dictate when the change gets rolled out.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 13:23 UTC (Sat) by anselm (subscriber, #2796) [Link] (8 responses)

Scott James Remnant, the original Upstart creator, works at Google but has nothing to do with Upstart development any longer – mostly because he would have had to sign the Canonical CLA.

My guess would be that Google will at some point move ChromeOS over to systemd. It's not as if the init system in ChromeOS was a user-serviceable part, or that there'd many tricky services in ChromeOS running as Upstart-native jobs, so they might as well go with the flow.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 16:50 UTC (Sat) by alison (subscriber, #63752) [Link] (2 responses)

Ah, "Remnant," not Resnick -- thanks for the correction. Presumably Google has a policy forbidding their employees from signing CLAs like Canonicals. Android doesn't have a CLA, but that's because AFAICT they don't accept contributions. Is it fair to criticize Canonical's CLA more than projects that don't take patches at all?

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 17:06 UTC (Sat) by juliank (guest, #45896) [Link]

AOSP does accept patches, see http://source.android.com/source/submit-patches.html

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 18:20 UTC (Sat) by mdeslaur (subscriber, #55004) [Link]

> Android doesn't have a CLA

Sure it does:

http://source.android.com/source/licenses.html#contributo...

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 18:18 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (4 responses)

> mostly because he would have had to sign the Canonical CLA

Since there are Google submitted patches in the current Upstart tree, I would believe the CLA has already been signed.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 18:23 UTC (Sat) by anselm (subscriber, #2796) [Link] (3 responses)

Scott James Remnant said that he hadn't signed the CLA. Whether that was due to his own preference or Google policy wasn't specified.

It is also worth pointing out that according to what he said he doesn't do anything Upstart-related at Google in the first place; if there are patches from Google in Upstart then they were presumably submitted by somebody else.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 18:27 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (2 responses)

Well, they are from "Scott James Remnant <keybuk@google.com>".

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 20:15 UTC (Sat) by Wol (subscriber, #4433) [Link] (1 responses)

What's the possibility that *Google* has signed the CLA?

Bearing in mind, as an employee SJR presumably doesn't own the copyright to work he writes on Google's dime, there's no point in him signing the CLA because it's not his copyright to assign?

Cheers,
Wol

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 21:06 UTC (Sat) by mdeslaur (subscriber, #55004) [Link]

Yes, that is quite possible.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 14:37 UTC (Fri) by patrick_g (subscriber, #44470) [Link] (4 responses)

That was quick. Kudos to Mark for this decision, it'll help the Linux ecosystem as a whole.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:11 UTC (Fri) by jospoortvliet (guest, #33164) [Link] (1 responses)

I bet they saw it coming and thought about it. Obviously, there are benefits to collaboration and convergence and I'm glad they decided to go for that. A diverse ecosystem is good, no doubt about it, but there's a point where a better solution is just that - better.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 17:52 UTC (Fri) by drag (guest, #31333) [Link]

Canonical has their strong points and their weak points.

Hopefully this will free up resources for their strong points; which is making Linux usable for a wider audience.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 16:29 UTC (Fri) by rvfh (guest, #31018) [Link]

I guess this may change the game quite a bit. Debian only selected systemd for Jessie, but if the main supporter of Upstart moves to it, then it kind of makes systemd the default for all future Debian releases I guess...

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 16:41 UTC (Fri) by mathstuf (subscriber, #69389) [Link]

A fitting day for the announcement as well.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 14:39 UTC (Fri) by gwolf (subscriber, #14632) [Link]

What an interesting turn of events, and what a gentelmany(?) attitude by Mark. Thanks!

Were Mark not to be accepting things this way, a long and bitter GR would be much more likely (and the outcome, I believe, would still be the same). If Canonical agrees in homogeneizing their system with the ample majority of the Linux distributions, more strong enough players will be able to steer the project and ensure it does not follow (as many have feared) a single (or handful-of) corporation's agenda.

I have yet to read Mark's announcement, but as far as the LWN sum-up shows... I'm really happy and thankful for his personal touch of sensibility, and Canonical's defusing of what could be even worse than what we have seen so far.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:10 UTC (Fri) by SEJeff (guest, #51588) [Link]

This is actually a pretty classy way to lose a debate. Well played sir!

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:17 UTC (Fri) by pranith (subscriber, #53092) [Link] (7 responses)

Kudos in accepting defeat gracefully and not splintering the linux landscape!

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 16:40 UTC (Fri) by mbt (guest, #81044) [Link] (5 responses)

Splintering the landscape? The landscape is already splintered, and it will continue to be.

Systemd was available as an option in Debian, and the vote was to make it now the default init system. I think it is *completely* and *abolutely* inconceivable that systemd will now become the only available init system in Debian. They are going to have to continue to support other init systems, especially since they have tons of clients who manage high-traffic production servers and can't afford to make the sudden shift of init systems.

The Shuttleworth article does not even state unequivocally that they are discontinuing Upstart development.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 17:28 UTC (Fri) by ovitters (guest, #27950) [Link] (3 responses)

Debian has a high attention to detail. If you have some really important production server, then why trust a distribution upgrade, but not the init system change?!? I assume you'd test in any case.

I love that Debian will have systemd as default because they focus on stability so much. You'll know pretty much all the smallest corner cases will be covered.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 18:22 UTC (Fri) by drag (guest, #31333) [Link] (2 responses)

It's taken Redhat a long time to migrate over. Changing the init means a lot of changes in lots of other places.

It's perfectly reasonable to expect that Jessie will just use mostly systemd's compatibility features before Debian makes the full change over.

However from a stability standpoint.. having Debian fracture their resources to support multiple inits is certainly a step in the wrong direction.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 20:44 UTC (Fri) by vonbrand (subscriber, #4458) [Link] (1 responses)

Red Hat didn't take a long time to migrate, it just took a long time for the next release to roll along. Fedora took somewhat more than six months to migrate, mostly because of missing documentation the first time around.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 11:10 UTC (Sat) by drag (guest, #31333) [Link]

> it just took a long time for the next release to roll along.

This is how Redhat did it's migrating. Put it in Fedora and eventually let that become the init for the next major release.

The upside to all of this is that Debian can benefit from the maturity of the software now.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 18:57 UTC (Fri) by smurf (subscriber, #17840) [Link]

Let's face it, with upstart's years-old serious bugs that require a major rewrite to fix properly, Upstart development is already discontinued.

So this was the only sensible decision. Nevertheless, I have to admit that I didn't expect it. At all. Much less that quickly.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 23:43 UTC (Fri) by HelloWorld (guest, #56129) [Link]

They still are splintering the Linux landscape with Mir.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:24 UTC (Fri) by Otus (subscriber, #67685) [Link] (8 responses)

I wonder if they'll avoid a GR now.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:31 UTC (Fri) by mathstuf (subscriber, #69389) [Link]

It depends on how many people are actually supportive of sysvinit yet. I agree with others that the sysvinit supporters (more specifically, the anything-but-something-new supporters) are the ones to watch for a GR.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:32 UTC (Fri) by tau (subscriber, #79651) [Link] (6 responses)

Now all they need to do is ditch Mir and adopt Wayland. I think even SteamOS quickly shifted back to a Debian base from Ubuntu in no small part due to that decision.

Differentiation should be user-visible. If the Ubuntu developers were to throw all of their technical efforts into creating a cohesive and polished experience around their Unity shell and its associated user experience then I wish them the very best, even if I'm not terribly keen on the integrated adware in that system.

differentiating

Posted Feb 14, 2014 15:53 UTC (Fri) by louie (guest, #3285) [Link] (2 responses)

"Differentiation should be user-visible."

To give Mark some credit, I think when they embarked on Upstart and Wayland they felt that these would provide user-visible differentiation - for example, in the upstart case, faster startup and better control over systems for admins; long list for Mir over plain X. Unfortunately, you have to have a strong engineering and design/PM culture to deliver on that. If you don't, other people will see the same set of problems and solve them better, which is what happened with Wayland, systemd, and I think perhaps eventually GNOME 3. And once that happens you're stuck with high maintenance costs and no useful/saleable differentiation.

So far, the evidence is that Canonical doesn't have that strong engineering/PM culture that can deliver when Mark wants to create a differentiated product - that's really what he needs to look at next, I think (and is a very, very hard problem to solve).

differentiating

Posted Feb 14, 2014 18:22 UTC (Fri) by bronson (subscriber, #4806) [Link] (1 responses)

But it wasn't Mir vs. plain X. Canonical was already working on Wayland when they started Mir.

Mir vs. Wayland seems like a pretty short and user-invisible list.

differentiating

Posted Feb 15, 2014 3:49 UTC (Sat) by alison (subscriber, #63752) [Link]

>Mir vs. Wayland seems like a pretty short and user-invisible list.

Don't forget the "shopping lenses".

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 18:36 UTC (Fri) by tjc (guest, #137) [Link] (2 responses)

> Differentiation should be user-visible.

I'm not sure that I share that assumption. The first obvious question is: why?

For example, should we all use the same file system because the differences are, for the most part, not immediately visible to the user? There are differences, of course, but most people can use a system without being aware of the file system that they are using.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 2:32 UTC (Sat) by rgmoore (✭ supporter ✭, #75) [Link] (1 responses)

For example, should we all use the same file system because the differences are, for the most part, not immediately visible to the user?

I think it depends on who your users are. For a distribution targeting desktop users, there are several filesystems that are pretty close in terms of overall performance, so it may make more sense for the distribution to go with only one or two sensible choices to minimize support problems. Just because your kernel supports FAT32 and cramfs doesn't make it smart to make them options in the installer.

For a distribution targeting servers, the workload will vary much more from system to system and the users will be admins and developers who are much more sophisticated and performance sensitive than typical desktop users. Because of that, it makes sense to give them a lot more control over their system and support a wider range of options. It still doesn't mean you need to make it easy to use filesystems that are unlikely to be helpful for typical server workloads.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 11:30 UTC (Sat) by khim (subscriber, #9252) [Link]

I not so many words: if you targets don't notice difference between btrfs, ext4 and/or xfs then yes such targets must use one fs (see Android), if your target are sophisticated sysadmins (see RHEL) then for them fs choice is a user-visible feature thus choice of an fs is a sensible choice.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:29 UTC (Fri) by jezuch (subscriber, #52988) [Link] (5 responses)

What others said - this is a welcome move by Shuttleworth.

But I wanted to say that this reinforces my impression that as the events turned out, it turned out that upstart was basically a prototype of systemd. It was a trailblazer, kind of :) They showed what works, what doesn't and what to expect, and informed (arguably) better decisions by the developers of systemd. (Which, incidentally, is thus in a position to suffer from second system syndrome. Just sayin'!)

In summary - it was not all in vain.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:45 UTC (Fri) by kjp (guest, #39639) [Link] (4 responses)

I'm pretty sure launchd was the trail blazer. Does upstart offer _anything_ beyond that?

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 16:34 UTC (Fri) by mathstuf (subscriber, #69389) [Link]

Works on Linux?

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 22:20 UTC (Fri) by gidoca (subscriber, #62438) [Link] (1 responses)

Does not use XML?

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 15:21 UTC (Sat) by oshepherd (guest, #90163) [Link]

launchd uses plists. There are multiple plist formats, only one of them XML

(And if you're on OS X, you should probably just double click on the plist to open it in the graphical and nice to use plist editor anyway)

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 15:22 UTC (Sat) by oshepherd (guest, #90163) [Link]

systemd and launchd bear a lot more similarities than launchd and upstart. For a start, both systemd and launchd do on-demand socket activation.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 15:32 UTC (Fri) by josh (subscriber, #17465) [Link] (2 responses)

This is highly unexpected, especially after previous communications.

I very much hope to see all members of the Debian Technical Committee taking this into account.

Shuttleworth: Losing gracefully

Posted Feb 14, 2014 18:48 UTC (Fri) by lordsutch (guest, #53) [Link] (1 responses)

Why? The tech committee's job is done on this matter.

Shuttleworth: Losing gracefully

Posted Feb 15, 2014 0:59 UTC (Sat) by josh (subscriber, #17465) [Link]

It *should* be, yes. Some members of the technical committee don't seem to agree with that.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 15:42 UTC (Fri) by Siosm (subscriber, #86882) [Link] (39 responses)

> It will no doubt take time to achieve the stability and coverage that we enjoy today and in 14.04 LTS with Upstart

Looks like they'll keep upstart for 14.04. Thus the next Ubuntu LTS is already doomed, before being released, as it will have completely unsupported software (logind without systemd).

Shuttleworth: Losing graciously

Posted Feb 14, 2014 16:34 UTC (Fri) by rvfh (guest, #31018) [Link] (6 responses)

> Thus the next Ubuntu LTS is already doomed, before being released, as it will have completely unsupported software.

You lost me. LTS releases are supported by Ubuntu for five years.
See https://wiki.ubuntu.com/LTS for details.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 16:53 UTC (Fri) by Siosm (subscriber, #86882) [Link] (5 responses)

They'll use logind without systemd, which not supported upstream, and they'll mostly be the only distribution using it. For five years. This is quite the challenge, like what they're already doing for compiz which is not maintained anymore.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 17:01 UTC (Fri) by palmer_eldritch (guest, #95160) [Link]

Seeing the debates on the debian-ctte mailing list, it seems likely that logind without systemd will also be used in Jessie to support alternative init systems.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 18:49 UTC (Fri) by ewan (guest, #5533) [Link] (3 responses)

As Mark points out, upstart is used as the init in RHEL6, which will be supported by Red Hat until 2020, over a year after Ubuntu 14.04 'LTS' is expected to be EOLed by Canonical.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 22:34 UTC (Fri) by Siosm (subscriber, #86882) [Link] (2 responses)

> As Mark points out, upstart is used as the init in RHEL6

As much as Mark likes to point out, I'm afraid upstart is not "used" in RHEL 6 as much as he'd like it to be. Sure, it is the default init, but NONE of it's core features beyond SysVinit support are used in RHEL 6. So really, the belief that upstart is "used" by RHEL 6 is just wishful thinking.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:01 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (1 responses)

There are a few native jobs in RHEL6. What core features aren't being used?

Shuttleworth: Losing graciously

Posted Feb 16, 2014 3:48 UTC (Sun) by Siosm (subscriber, #86882) [Link]

> There are a few native jobs in RHEL6. What core features aren't being used?

You nailed it: there are almost no native upstart jobs in RHEL6 (tty, graphical login manager, control-alt-delete handler and that's mostly it). Thus everything is just plain init scripts. Therefore the dependency feature is not used, nor is the event feature, nor is the job lifecycle feature...

Upstart is just an intermediary to launch a massive SysVinit-like script which does all the work: /etc/rc.d/rc.sysinit.

So again, no, upstart is not "used" in RHEL 6.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 16:38 UTC (Fri) by mathstuf (subscriber, #69389) [Link] (4 responses)

I don't think I'd trust an init replacement within 2 month of a planned release for an LTS. Maybe if they pushed it to 14.06…

Shuttleworth: Losing graciously

Posted Feb 14, 2014 18:43 UTC (Fri) by tjc (guest, #137) [Link] (1 responses)

I don't think a two month delay would be enough. In any case, there's always *something* that's not as up-to-date as one would like. Delaying releases is a slippery slope.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 19:05 UTC (Fri) by mathstuf (subscriber, #69389) [Link]

I wouldn't expect it'd be either, but I'd trust it more than twice as much as the two months remaining…

I also agree that delaying releases is a slippery slope (though I sort of watch Fedora's from the sidelines since I usually surf on Rawhide and Alphas, so slips are…less important to me).

Shuttleworth: Losing graciously

Posted Feb 14, 2014 20:34 UTC (Fri) by Thue (guest, #14277) [Link]

Keeping the well-tested upstart for the LTS release, and then having 3 whole non-LTS (basically beta) releases to switch to systemd, is actually perfect. Just before an LTS release is not the time to integrate new software!

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:09 UTC (Fri) by smurf (subscriber, #17840) [Link]

Nah, they won't do that. There's no rush. I assume they'll simply take the Debian systemd when they next pull from sid.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 17:29 UTC (Fri) by ovitters (guest, #27950) [Link]

They would support it themselves. Just like any other software in that LTS.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 19:35 UTC (Fri) by hkario (subscriber, #94864) [Link] (1 responses)

RHEL 6 is also using it, as such it is far from being unsupported

it may be dead, but it is not forgotten

Shuttleworth: Losing graciously

Posted Feb 14, 2014 22:35 UTC (Fri) by Siosm (subscriber, #86882) [Link]

See my comment at: http://lwn.net/Articles/586439/

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:03 UTC (Fri) by vonbrand (subscriber, #4458) [Link] (23 responses)

I believe systemd is already a seasoned experimental package in Ubuntu, so a switch (while risky) isn't totally out of the question.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:19 UTC (Sat) by HelloWorld (guest, #56129) [Link]

Mark's Blog post is unambiguous about that:
> It will no doubt take time to achieve the stability and coverage that we enjoy today and in 14.04 LTS with Upstart,

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:47 UTC (Sat) by jspaleta (subscriber, #50639) [Link] (21 responses)

In hindsight... its really unfortunate for Ubuntu that the default init decision in Debian in November. A decision 3 months ago, before the vote was complicated by the loose/tight coupling options... would have given Ubuntu 3 more months to work on those integration issues.. if there was an understanding inside the Canonical fenceline at the time that the debian decision was going to play a critical role in their own support of upstart moving forward.

Now Ubuntu release management is faced with the horrible choice of rushing systemd integration into their LTS release. Or sticking with upstart for LTS even though its effectively going to be unmaintained by Canonical.

The timing is really unfortunate for Ubuntu as a derivative. Hopefully there's enough time for them to switch default to systemd for 14.04 for early boot and shutdown at least and use sysvinit compat for installable services.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:48 UTC (Sat) by jspaleta (subscriber, #50639) [Link]

err sorry, I meant to say in the first sentence it was unfortunate that the vote wasn't taken back in November. -ENEEDMORECOFFEE

Shuttleworth: Losing graciously

Posted Feb 15, 2014 21:16 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (18 responses)

> would have given Ubuntu 3 more months to work on those integration issues

3 months is nowhere near enough time to switch init systems in an LTS release. It's likely going to take multiple cycles before all the issues are resolved enough for that.

> Now Ubuntu release management is faced with the horrible choice of rushing systemd integration into their LTS release.

There's no way that can be done in time, and it would be a pretty bad thing to do for an LTS release.

> Or sticking with upstart for LTS even though its effectively going to be unmaintained by Canonical.

It will be maintained throughout the life of the LTS, and any subsequent releases it ships in.

> Hopefully there's enough time for them to switch default to systemd for 14.04 for early boot and shutdown at least and use sysvinit compat for installable services.

No, that wouldn't be appropriate for our users. Switching init systems in a way that works with all the different combinations of how people are using Ubuntu is less than trivial. It took multiple cycles before all the different race conditions and integration issues were ironed out when we switched to Upstart. There are issues with systemd in Debian at the moment that still need to be ironed out, but even when it will be rock solid in Debian, there will likely need to be adjustments for the differences in how Ubuntu boots. This isn't going to happen overnight.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 21:25 UTC (Sat) by jspaleta (subscriber, #50639) [Link] (17 responses)

I would dare say... all the race conditions in upstart are still not ironed out....cough. Avoided...but not ironed out. Did ubuntu ever ship a native apache httpd job file in any Ubuntu release?

You should consider following the packaging choices the systemd maintainers have done in debian and making it as easy as possible to switch to systemd as an alternative init for this lts cycle as a post-install local policy choice....so users who want to take the risk...can help work out those ubuntu specific integration issues for their workloads..ahead of making systemd the default in ubuntu. Help your server users, help your release team get ahead of the systemd integration issues for service workloads in Ubuntu, that never went upstart native and are still relying on sysvinit.

-jef

Shuttleworth: Losing graciously

Posted Feb 15, 2014 21:36 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (16 responses)

> You should consider following the packaging choices the systemd maintainers have done in debian and making it as easy as possible to switch to systemd as an alternative init for this lts cycle

We can't do that, unlike Debian, we're already running a GNOME version that requires logind, etc. Feature freeze is this week, it's much too late to think systemd will be installable, let alone functional, in this release.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 22:03 UTC (Sat) by jspaleta (subscriber, #50639) [Link] (15 responses)

which makes the decision to fork systemd prior to this moment also quite unfortunate...as that one decision tightly constraints your options as a distribution. Nothing like painting yourself in a corner.

The idea that ubuntu users in the lts will have an easier time installing and using minit as a replacement to upstart than using systemd, simply because of historic distro release management decisions which effectively neutered systemd packaging in Ubuntu satiates my dark need for tragic irony.

Well played.

Anyways.... ppas are still a viable option for Ubuntu devs to interface with lts users who want to get a jump on integrating systemd into their workloads. Don't sit on your hands and waste the LTS support lifetime to get systemd integration out into the wild into the hands of users who are willing to help you get ahead of the problems. You guys found a way to do openstack enablement drops to keep apace with its development even for lts users... you can do something similar for systemd as alternative init.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 22:23 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (14 responses)

You're being overly dramatic, as usual. :)

Anyone can stick systemd in a PPA. It's not the package that is the problem, it's everything else around it. There's a massive amount of integration work required to make it work, and none of that is suitable for an LTS release. Our users expect the LTS releases to be rock solid.

Just wait until the next release or so, it will likely be available in the archive as a preview with some of the integration work done.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 22:49 UTC (Sat) by jspaleta (subscriber, #50639) [Link] (13 responses)

Didn't have to be this way.
systemd could be installable and user selectable just like in Debian right now...way ahead of the Ubuntu freeze.

https://launchpad.net/ubuntu/trusty/+source/systemd/204-5...
That changelog for dec 2013 is fantastic. Again...if that Debian vote had happenend in November, could the Ubuntu packaging patchset have been modified in time to get systemd packaging back to what it looks like in Debian and have systemd be installable as an alternative init side-by-side with upstart? I think there would be. I mean hell man, we are talking about cramming in cgmanager into the LTS and it didn't even exist as a codebase publicly until after the fall vUDS discussion. C'mon. If there's enough time to shoehorn the "unstable and undocumented" cgmanager into the "rock solid LTS release" then there's time to undo the craptastic Ubuntu patches for the systmd packaging that disfigure the work done by the Debian maintainers.

The only thing that is keeping systemd from being a viable non-default alternative init for Ubuntu users to _choose_ to use are Ubuntu specific policies and patched applied on top of the merged debian package that neuter systemd as an installable alt init. The Debian maintainers did the work to make it possible to install systemd safely and then choose to enable it and use it. Ubuntu as a derivative undid that work with a set of patches. Didn't have to be this way. In fact that's incredibly short sighted given the possible future where Debian adopts systemd as a default.

And I might add that Ubuntu specifically targetted systemd to neuter as an alterative init. minit sits happily in universe and can be installed without disrupting the system.

And I am not being overly dramatic. This is the unfortunate reality where Ubuntu's past short-sighted and myopic release managing decisions have created problems for itself by throwing away the benefits of the work Debian is doing to provide a sane approach to alternative init packaging. And its just well... ironic...exactly like "... a free ride when you've already paid..." in fact. Just like "... good advice that you just can't take...". Indeed "...who would have thought that it figured?" Oh wait.. me. I thought that.

-jef

Shuttleworth: Losing graciously

Posted Feb 16, 2014 0:17 UTC (Sun) by mdeslaur (subscriber, #55004) [Link] (12 responses)

Dude, you seriously are the master troll :)

> systemd could be installable and user selectable just like in Debian right now...way ahead of the Ubuntu freeze.

Sure, if someone did all the integration work necessary to make it installable and user selectable. So you're saying we should have had engineers working on this, just _in case_ we switched to systemd at some point? As I've said before, having it packaged is just a small part of the issue, and having a package in universe that basically renders your system unbootable upon installation wouldn't have been a good idea.

> if that Debian vote had happenend in November, could the Ubuntu packaging patchset have been modified in time to get systemd packaging back to what it looks like in Debian and have systemd be installable as an alternative init side-by-side with upstart?

No. The packaging is trivial. Making the system boot is not. Ubuntu boot relies mostly on Upstart jobs, there are no sysv init scripts there to boot off of unlike Debian.

> time to undo the craptastic Ubuntu patches for the systmd packaging that disfigure the work done by the Debian maintainers

This is just insulting. The patches exist to make GNOME usable with Upstart since it depends on logind. Debian simply forces the user to install systemd if they want to use GNOME.

> The only thing that is keeping systemd from being a viable non-default alternative init for Ubuntu users to _choose_ to use are Ubuntu specific policies and patched applied on top of the merged debian package that neuter systemd as an installable alt init.

No, you don't understand the issues at all.

> And I might add that Ubuntu specifically targetted systemd to neuter as an alterative init. minit sits happily in universe and can be installed without disrupting the system.

Have you actually tried booting Ubuntu with minit instead of Upstart? :)

> And I am not being overly dramatic.

Yes, you are. Systemd will be available in a next Ubuntu release. Patience.

> This is the unfortunate reality where Ubuntu's past short-sighted and myopic release managing decisions have created problems for itself by throwing away the benefits of the work Debian is doing to provide a sane approach to alternative init packaging.

Once again, this is insulting. Debian didn't provide a sane approach that we could use, they simply forced systemd as a dependency to run GNOME.

Seriously, at least research stuff _a bit_ before you post uninformed opinions.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 0:41 UTC (Sun) by Siosm (subscriber, #86882) [Link]

> Making the system boot is not. Ubuntu boot relies mostly on Upstart jobs, there are no sysv init scripts there to boot off of unlike Debian.

How could an Ubuntu boot be so different from everybody else boot that it would require work not already done by other maintainers in other distributions?

The tedious work of porting all the init scripts to systemd service units has already been done. I really don't understand where work remains apart from testing that it actually work.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 1:12 UTC (Sun) by jspaleta (subscriber, #50639) [Link] (6 responses)

I'm saying that the Ubuntu patch to the systemd source package which explicitly disables providing the systemd binary on disk in a binary package named systemd could be undone without impacting LTS as a deliverable right now. Could have been done months ago. Doesn't change the issue with relying on systemd explicitly post v205 and the cgroups change at all.

If Ubuntu didn't want to provide the systemd-sysvinit package which makes /sbin/init (and related) a symlink(s) to systemd's binary... that's reasonable.

But choosing explicitly to not give users the ability to even install the systemd binary to even play with it as an alternative using the kernel commandline arguments to choose the init... that's clearly a punitive decision meant to keep Ubuntu users from having the option. Having the systemd binary on disk in no way impacts the ability to provide a rock solid default experience at all...for any release...whether it be an lts or not.

However, splitting out the systemd-services package as a new binary package in Ubuntu instead of providing the systemd package and then deliberately leaving out several binaries under /lib/systemd/ was unwarrented by any technical reason whatsoever. When Ubuntu introduced systemd-shim in Ubuntu they should have figured out how to use the packaging system correctly for shim and systemd to be parallel installable or have the packages conflict instead of preferentially maiming the systemd implementation as packaged to force a preference for shim. I mean,holy crap man, there is a whole alternative system already in place to handle the complexity of this sort of parallel installable replacement if a single packaging Conflict is deemed unworkable. Delibrately ripping out bits of competing implementations from inside well constrained directory namespaces inside packaging is a horrible horrible hack and terrible packaging management policy. Terrible. Just yuck. If -shim as a fork of the code base needs to be installable.. then fracking namespace it as a fork and treat it as a fork and make the fracking competing implementations conflict at the packaging level.

But now that Ubuntu maintainers have _finally_ deemed it worthwhile to get shim into Debian, the Debian maintainer collaboration will sort out how best to support -shim as an alternative without horribly maiming systemd packaging in the process of making a space for shim. You know, the hard integration work Ubuntu punted on actually doing when introducing -shim. And Ubuntu will _eventually_ get to benefit from that much better packaging work. It's really a shame that Canonical doesn't make it a policy to push changes into Debian first and benefit from clean merges as a way to minimize exact this sort of thing from happening as was seen with -shim.

And I have to say the accusation that I'm not doing research rankles a bit. I've probably done more research than anyone in this discussion (other than the systemd developers who I'm sure have a much better understanding of init systems outside of linux than I do currently). But there's always more research to do. Until I get greenlighted for the Google neural implant, I expect there will be gaps in my knowledge. Please, if you would be so kind as to provide me some pointers to material to read up on that would give me a better understanding of the technical issues in the decision which necessitated explicitly removing /lib/systemd/systemd and /bin/systemd from the deb packaging in Ubuntu as inherited from Debian when Ubuntu decided to disembowel the Debian packaging for systemd.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 22:37 UTC (Sun) by rgmoore (✭ supporter ✭, #75) [Link] (5 responses)

But choosing explicitly to not give users the ability to even install the systemd binary to even play with it as an alternative using the kernel commandline arguments to choose the init... that's clearly a punitive decision meant to keep Ubuntu users from having the option.

That seems like an excessively strong claim. It's clearly an attempt to prevent users from using systemd at all, but it can equally easily be explained as an attempt to avoid support problems. One of the roles of distributions is to make choices about the system they're going to provide and support, and Ubuntu has frequently come down on the side of restricting user choice to make the system more coherent and easier to support. Refusing to provide a choice of init systems- a choice that is largely invisible to the user but has the potential to make the system harder to support- seems like a perfectly legitimate move, especially for a LTS release.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 22:46 UTC (Sun) by jspaleta (subscriber, #50639) [Link] (4 responses)

if Ubuntu had a policy with regard to explicitly disallowing alt inits... then minit would not be available as an Ubuntu package either.

Ubuntu does not have a policy with regard to limiting choice in inits. Ubuntu made a set of technical decisions to explicitly cripple systemd packaging by undoing integration work Debian maintainers had already done and by delibrately choosing to keep the systemd init binary out of the packaging payload.

They did not so neuter minit packaging by removing functioning binaries from that package. Users can install minit as packaged, and then reconfigure their system to use minit as an alternative init system. The choices made for the systemd packaging very delibrately make it impossible for users to experiment with systemd on Ubuntu using provided packaging. It is easier to get Ubuntu booting up using minit than systemd at present only because of the packaging choices ubuntu maintainers made to cripple systemd entirely as an option.

Shuttleworth: Losing graciously

Posted Feb 17, 2014 20:37 UTC (Mon) by foom (subscriber, #14868) [Link]

But does minit actually *work*? The answer is almost certainly "no way", from the response mdeslaur wrote...I mean, yea, sure, you probably can get it to work with enough work, but "reconfigure" is gonna be putting it rather mildly.

Minit is only present in Ubuntu ("universe") because it was brought over by virtue of being in Debian; ubuntu copies packages from debian by default, when there's not an explicit reason to change them. I'll bet that nobody has actually ever used minit on Ubuntu.

On the other hand, parts of systemd are necessary for a functional desktop on Ubuntu, and it needed to be modified. Given that, why SHOULD they include an untested, known-to-not-work init system along for the ride with the pieces they needed?

There is no hope of systemd pid1 actually working on Ubuntu as is (due to the native upstart jobs taking over from sysvinit scripts for some critical services); given that, it really doesn't matter if the systemd binary package is easily available or not...If some user is technically inclined enough to be able to patch the system boot process to work with systemd init in Ubuntu, recompiling systemd while you're at it is certainly not a real impediment.

Shuttleworth: Losing graciously

Posted Feb 24, 2014 21:31 UTC (Mon) by cas (guest, #52554) [Link] (2 responses)

Ubuntu does not have a policy with regard to limiting choice in inits. Ubuntu made a set of technical decisions to explicitly cripple systemd packaging by undoing integration work Debian maintainers had already done and by delibrately choosing to keep the systemd init binary out of the packaging payload.

more likely, the ubuntu devs had a similar reaction to what I had when I realised that a routine apt-get dist-upgrade in debian sid a few months back was not only going to upgrade gnome and everything else, it was going to install systemd.

my reaction was "fuck that, apt-get purge gnome".

i had switched to XFCE long ago (after giving up on gnome3) and was only keeping gnome installed so that I could check it out every so often - see if gnome 3 had improved to the point of usability yet. i won't be doing that any more.

upgrading a desktop environment or a window manager or whatever should not force a major change in the machine's init - a switch in init systems should only happen after a deliberate, conscious choice by the system admin....not as a side-effect of some other upgrade.

Shuttleworth: Losing graciously

Posted Feb 24, 2014 22:05 UTC (Mon) by jspaleta (subscriber, #50639) [Link]

Yawn.

Unfortunately Ubuntu's knee-jerk, irrational approach to forking and maiming the systemd packaging was not and is not a sustainable technical solution to the problem. It was and is a terrible packaging hack.

The better approach, is to stand up the alternative implementation of logind which will work without systemd as PID=1 (even if its a fork of systemd originally) as a new package in the repository, with payloads designed not to interfere or overwrite systemd namespaced files.

I'm sure Debian will figure out how to adequately package multiple implementations of the logind service provider in a way that lets users use the implementation that works best for the init system they are running. As soon as multiple logind implementation exists....that is. So far there's isn't a second implementation to package, as the existing Ubuntu "solution" cannot be adopted by Debian, without significant rework.

Shuttleworth: Losing graciously

Posted Feb 24, 2014 22:30 UTC (Mon) by nybble41 (subscriber, #55106) [Link]

It's an understandable mistake, but installing the systemd package doesn't cause any change in your system init. For that you'd need the systemd-sysv package (which diverts /sbin/init from sysvinit to systemd), or a kernel command-line option to override the init path. The gnome packages probably just wanted the binaries available to manage the user's login session in the event systemd was configured as the system init.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 1:34 UTC (Sun) by jspaleta (subscriber, #50639) [Link] (3 responses)

ubuntu relies mostly on upstart jobs? Really? Are you saying that right now, given a random real world server workload out in the cloud that those servers are running more upstart units than sysvinit scripts?

My understanding is the exact opposite. Most of the installable services in Ubuntu are still controlled by sysvinit scripts and have not "gone native" and picked up a unit file in their packaging. What has gone upstart native is early boot for required services up to mounting and a very narrow set of services that make up the very narrowly defined desktop boot configuration for the Ubuntu desktop default (and not even everything one could consider desktop relevant either).

But please, prove me wrong. Can you provide me with a full list of packages in the Ubuntu repository which drop files into /etc/init/ as compared to the full list of packages which drop files into /etc/init.d/ so we can make an accurate judgement as to where upstart native unit file adoption actually stands.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 3:29 UTC (Sun) by cesarb (subscriber, #6266) [Link] (1 responses)

> Can you provide me with a full list of packages in the Ubuntu repository which drop files into /etc/init/ as compared to the full list of packages which drop files into /etc/init.d/ so we can make an accurate judgement as to where upstart native unit file adoption actually stands.

That's actually quite easy.

http://archive.ubuntu.com/ubuntu/dists/precise/Contents-a...

That's the full list for the latest 64-bit Ubuntu LTS, in plain text. Just search for ^etc/init within it. Similar lists can be found for 32-bit and other Ubuntu releases.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 13:48 UTC (Sun) by Darkmere (subscriber, #53695) [Link]

Thanks for the hint, running the numbers:
143 unique packages install into /etc/init/
1103 install into /etc/init.d/
124 packages install into both /etc/init and /etc/init.d

Shuttleworth: Losing graciously

Posted Feb 21, 2014 15:35 UTC (Fri) by cdmiller (guest, #2813) [Link]

Here are listings from the /etc/init.d/ of a production Ubuntu 12.04 server running nginx, ftp, and nfs, first stuff symlinked to /etc/init:

acpid -> /lib/init/upstart-job
atd -> /lib/init/upstart-job
console-setup -> /lib/init/upstart-job
cron -> /lib/init/upstart-job
dmesg -> /lib/init/upstart-job
hostname -> /lib/init/upstart-job
hwclock -> /lib/init/upstart-job
hwclock-save -> /lib/init/upstart-job
module-init-tools -> /lib/init/upstart-job
network-interface -> /lib/init/upstart-job
network-interface-container -> /lib/init/upstart-job
network-interface-security -> /lib/init/upstart-job
passwd -> /lib/init/upstart-job
plymouth -> /lib/init/upstart-job
plymouth-log -> /lib/init/upstart-job
plymouth-ready -> /lib/init/upstart-job
plymouth-splash -> /lib/init/upstart-job
plymouth-stop -> /lib/init/upstart-job
plymouth-upstart-bridge -> /lib/init/upstart-job
procps -> /lib/init/upstart-job
resolvconf -> /lib/init/upstart-job
rsyslog -> /lib/init/upstart-job
setvtrgb -> /lib/init/upstart-job
udev -> /lib/init/upstart-job
udev-fallback-graphics -> /lib/init/upstart-job
udev-finish -> /lib/init/upstart-job
udevmonitor -> /lib/init/upstart-job
udevtrigger -> /lib/init/upstart-job
vsftpd -> /lib/init/upstart-job
xinetd -> /lib/init/upstart-job

Now stuff not linked to /etc/init:

bootlogd
grub-common
halt
killprocs
libnss-ldap
networking
nginx
ntp
ondemand
puppet
rc
rc.local
rcS
README
reboot
rsync
sendsigs
single
skeleton
ssh
stop-bootlogd
stop-bootlogd-single
sudo
umountfs
umountnfs.sh
umountroot
urandom
x11-common

Other server appear follow suit, so in the current LTS (12.04) more central system stuff with Upstart support than application daemons (apache, nginx, ssh, samba, openldap, ...). The same can be said of the older LTS 10.04.

Shuttleworth: Losing graciously

Posted Feb 21, 2014 16:07 UTC (Fri) by cdmiller (guest, #2813) [Link]

RHEL5 has extended life cycle support until 2020 and RHEL6 until 2023. Ubuntu 12.04 server LTS expires in 2017 and 14.04 server LTS will expire in 2019. There will be several years of supporting sysvinit and or Upstart in the Linux server space no matter when or how often Debian changes it's default init.

Best news of the day!

Posted Feb 14, 2014 15:48 UTC (Fri) by proski (subscriber, #104) [Link] (1 responses)

In my opinion, both upstart and systemd are complex and hard to understand. But it's clear that simple solutions don't cut it anymore. So I'm glad that systemd won't be a one man show anymore. It would get more attention, it would get more documentation and hopefully more utilities to support it. A good GUI program can do a great job explaining the concepts of the underlying software.

I'm using Fedora at home and Ubuntu at work, and both aspects of me are happy :)

Best news of the day!

Posted Feb 14, 2014 16:03 UTC (Fri) by jackb (guest, #41909) [Link]

So I'm glad that systemd won't be a one man show anymore.

I'm having trouble parsing this statement. What definition of "anymore" are you using?

Shuttleworth: Losing graciously

Posted Feb 14, 2014 16:37 UTC (Fri) by Thue (guest, #14277) [Link] (7 responses)

EnUnLugarDeLaMancha posted an interesting post on reddit:

Just FYI, this is what Mark had to say about systemd in October, while defending Mir:
By contrast, those same outraged individuals have NIH’d just about every important piece of the stack they can get their hands on… most notably SystemD, which is hugely invasive and hardly justified.

And in April 2012:
Rumours and allegations of a move from Upstart to systemd are unfounded: Upstart has a huge battery of tests, the competition has virtually none. Upstart knows everything it wants to be, the competition wants to be everything. Quality comes from focus and clarity of purpose, it comes from careful design and rigorous practices. After a review by the Ubuntu Foundations team our course is clear: we’re committed to Upstart, it’s the better choice for a modern init, innit. For our future on cloud and client, Upstart is crisp, clean and correct. It will be a pleasure to share all the Upstart-enablement patches we carry with other family friends as soon as their release is ready and they can take a breath, so to speak.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 17:28 UTC (Fri) by gwolf (subscriber, #14632) [Link]

Right. I would not expect either Mark, Canonical or the Ubuntu guys to be happy to switch. They have invested quite a bit into Upstart (and Mir), and an argument that was presented repeatedly in the debate was the higher code quality (and clarity) WRT SystemD.

Of course, Ubuntu has pushed reasonably for Debian to follow their way and use their tools — After all, they depend on Debian. But given Debian's (hard!) decision, it makes quite a bit of sense for them to acknowledge it will be better for the system stability and common tool usage in the Linux landscape to reduce the divergence — Even at the high cost of ditching an important part of their infrastructure.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 18:27 UTC (Fri) by bronson (subscriber, #4806) [Link] (4 responses)

It's just marketingspeak, no need to read too closely.

> For our future on cloud and client, Upstart is crisp, clean and correct.

Actually pretty good as marketingspeak goes!

Shuttleworth: Losing graciously

Posted Feb 14, 2014 19:02 UTC (Fri) by joyuh (guest, #95216) [Link]

It looks more like he was a bit drunk when writing that.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 10:46 UTC (Sat) by KSteffensen (guest, #68295) [Link] (2 responses)

> It's just marketingspeak, no need to read too closely.

So basically we should ignore everything Shuttleworth says as it's 'just marketingspeak'?

Shuttleworth: Losing graciously

Posted Feb 15, 2014 16:31 UTC (Sat) by micha (guest, #42747) [Link]

> [...] everything Shuttleworth says [is] 'just marketingspeak'?

I would rather say everything Shuttleworth says is just markspeak... :D

SCNR
micha

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:16 UTC (Sat) by HelloWorld (guest, #56129) [Link]

To a first approximation, yes.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 20:48 UTC (Fri) by acollins (guest, #94471) [Link]

In regards to his second comment, note that it's perfectly valid to have a change of opinion after two years, especially in the case of something evolving rapidly as systemd.

I would have agreed with his statements about systemd at that time, since then it has improved *dramatically*. It's gotten more community support in a shorter amount of time than I think anyone anticipated, and as a result it really has grown to solve a lot of real world problems, and provide a ton of utility to administrators. It's stable, fast, and convenient. I'm excited to see it show up in all the major distributions over the next year or so.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 16:38 UTC (Fri) by hadrons123 (guest, #72126) [Link] (3 responses)

Didn't expect to see Upstart's demise this soon!
Its better sooner than later!

One week ago upstart was an viable alternative among every Trolls in the internet and systemd was evil software created by evil people. Now, nowhere to to run and hide!

How long will the openRC resistance will last?

plonk

Posted Feb 14, 2014 18:25 UTC (Fri) by oldtomas (guest, #72579) [Link]

> upstart was an viable alternative among every Trolls in the internet

Look: I do care as much for Upstart as for systemd (not much). But this is too much.

*plonk*

Shuttleworth: Losing graciously

Posted Feb 14, 2014 18:29 UTC (Fri) by bronson (subscriber, #4806) [Link]

Yet another example of winning gracelessly. :(

Shuttleworth: Losing graciously

Posted Feb 14, 2014 18:37 UTC (Fri) by drag (guest, #31333) [Link]

> How long will the openRC resistance will last?

OpenRC has a different problem domain then Upstart or Systemd. There are a number of other application management daemons that have init-like features. Like daemontools or supervisord.

I really see no reason why it has to go away or that it even competes with Systemd much.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 19:22 UTC (Fri) by jspaleta (subscriber, #50639) [Link] (123 responses)

Unanswered questions:

1) What happens to the new effort to port libnih and upstart to support BSD? Is Canonical staffing that effort with paid manpower currently or is it entirely volunteer work hours right now? I think the Debian porters need to have a clear statement of intent so they aren't pegging their hopes on something that won't materalize now.

2) What happens to the new cgmanager effort to implement an alternative to systemd's cgroup management API? Is that development staffed by Canonical paid manpower currently? And if Ubuntu does transition to systemd for init and picks up systemd's cgroup management API as part of that transition will Canonical/Ubuntu need cgmanager?

2.1) if Canonical provided cgmanager development manpower winds down, will the work to provide a cgmanager backed logind implementation that works for sysvinit continue to be worked on as anticipated and implicitly promised in the scope of the ongoing Debian discussion? And if its continuing with volunteer manpower only now, will that work continue apace in time for the jessie release?

Shuttleworth: Losing graciously

Posted Feb 14, 2014 20:42 UTC (Fri) by Thue (guest, #14277) [Link]

> 1) What happens to the new effort to port libnih and upstart to support BSD? Is Canonical staffing that effort with paid manpower currently or is it entirely volunteer work hours right now? I think the Debian porters need to have a clear statement of intent so they aren't pegging their hopes on something that won't materalize now.

My impression is that Debian GNU/KFreeBSD only wanted upstart if Debian GNU/Linux used it. Otherwise KFreeBSD wanted to go with OpenRC or sysvinit. So there is really no reason to port it now that nobody wants it.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:09 UTC (Fri) by vonbrand (subscriber, #4458) [Link]

If upstart (and its ancilliary packages) isn't the path forward for Ubuntu on Linux, I see absolutely no reason to continue paying for porting it to a platform they don't support at all. Perhaps give their people some leeway to work on it part-time as "personal projects at work," but not much more.

(Just as uninformed onlooker, mind you).

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:33 UTC (Fri) by jspaleta (subscriber, #50639) [Link] (2 responses)

Unanswered questions specifically for Ubuntu.....

1) kdbus integration: Ubuntu's sandboxing model for their convergent mobile push relies explicitly on a SELINUX/AppArmor integration feature of the current dbus-daemon that is an optional part of the D-Bus specification (codified in a _may_ option..not even the strength of a _should_ option).

The kdbus implementation does not and will not expose this feature. Can Ubuntu adequately transition to systemd's dbus support or will Ubuntu have to keep the legacy daemon implementation running to service legacy dbus support (superceding systemd's baked-in support).

2) upstart as user session manager will need to be migrated as well. Not sure what the impacts here are.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 18:37 UTC (Sat) by mdeslaur (subscriber, #55004) [Link] (1 responses)

We will definitely be transitioning to kdbus. We will be able to integrate kdbus in our sandboxing model just fine.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 18:47 UTC (Sat) by jspaleta (subscriber, #50639) [Link]

That's fantastic.

Is there any public discussion on how your sandboxing going to work with kdbus? So far everything I've read, pointed to the need of the click app sandboxing to make use of the existing dbus-daemon's ability to use selinux/apparmor policy hooks to just leave bus messages undelivered to clients.

-jef

Shuttleworth: Losing graciously

Posted Feb 15, 2014 14:57 UTC (Sat) by stgraber (subscriber, #57367) [Link] (117 responses)

Answering for 2.1) as one of the cgmanager developer and the likely candidate to do the logind integration.

While Ubuntu will be switching to systemd, the exact timeline isn't known at this point and we're sure it's not going to be for 14.04 LTS.
All the work we put into cgmanager was done so that we'd have perfect LXC support in 14.04 LTS so I see no reason why we'd re-prioritize any of this.

cgmanager is about to get promoted to main in trusty and as a result will be supported until at least April 2019, LXC is likely going to mandate cgmanager in the near future and we should also soon have it working on Android (for the Android port of LXC).

In the future, unless Lennart changes his view on cgroup management, I still expect cgmanager to be relevant to us, if only as the way to get proper cgroup delegation into containers, sub-containers and unprivileged containers, none of which is possible with a standard DBus interface on the host's system bus.

It's however quite possible and very likely that cgmanager will be updated to support using systemd as the cgroup writer rather than changing the cgroup filesystem itself. However this wouldn't be a switch, this would be added functionality as we'd still need the cgroupfs writer for other distros and for Android.

As for the technical implementation of the logind support, I unfortunately haven't had much time to spend on this yet, but our current plan is to have logind in 14.04 upgraded to something recent, then have our systemd-shim export the systemd cgroup API in a way that's at least usable by logind. That shim package should be perfectly reusable by other distros and should make it possible to have logind use cgmanager on a non-systemd system without actually having to patch it.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 16:49 UTC (Sat) by mezcalero (subscriber, #45103) [Link] (2 responses)

Here's one thing you keep repeating over and over again, which is however a bit misleading. The "single-writer" request by Tejun does not necessarily imply that all cgroup handling is implemented by the same process. It can happen with closely cooperating processes, and this is in fact what we do with systemd, where different systemd instances are responsible for different parts of the tree. So it's only systemd that manages the tree but from separate processes.

Lennart

Shuttleworth: Losing graciously

Posted Feb 15, 2014 17:33 UTC (Sat) by mbunkus (subscriber, #87248) [Link] (1 responses)

A question, if I may: when you say »from separate processes« do you mean multiple instances of /sbin/systemd running in various virtual machines in which each instance is managing the tree for its own virtual machine? Or is this still only one single host, no virtual machines involved, and you mean several processes of the systemd package that cooperate this way?

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:01 UTC (Sat) by mezcalero (subscriber, #45103) [Link]

The term "virtual machine" is usually reserved for kvm and suchlike, i.e. virtualization that emulates hardware. In such vm setups systemd on vms knows nothing of systemd on the host.

If your are refering to container virtualization (i.e. multiple userspaces on the same kernel) then yes, the idea is that the systemd in the container cooperates friendly with the host's systemd when it is interested in cgroups.

Lennart

Shuttleworth: Losing graciously

Posted Feb 15, 2014 18:41 UTC (Sat) by jspaleta (subscriber, #50639) [Link]

Can you please point me to any upstream documentation as to the D-Bus API the cgmanager exposes?

The Ubuntu package itself for Trusty comes with absolutely nothing with regard to API outside the manpage...and it only has examples of the undocumented D-Bus API. And even that manpage is itself an Ubuntu vendor specific patch and not in the cgmanager source tarball or in the upstream git repo afaict.

If cgmanager is expected to be a generally useful single writer, how in the hell can _any_ 3rd party application developers expect to actually work with it if the API is entirely undocumented?

This is crazy. Crazy crazy crazy! Promoting a system-wide cgroups service provider into your main repository that has an entirely undocumented and unstable D-BUS API is putting the cart well before the horse ahead of the alternative, which does have a stable API. Crazy. Setting yourself for having to push updates with API bumps inside your 14.04 release.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 21:30 UTC (Sat) by jspaleta (subscriber, #50639) [Link] (111 responses)

so help me out....
Am I understanding your future scenario correctly?
in the future....Ubuntu host system running systemd as init by default:
1) runs systemd PID=1 and is the cgroups manager for the host system exposing the documented cgroup management API.
2) runs a cgmanager process which exposes its own API, but internally will (in the future) communicate cgroup management requests back to systemd using systemd's native API and will not be touching cgroupfs directly.
3) guest lxc containers run something that talks to the cgmanager process talking the cgmanager API.

I that the future scenario you expect to see once ubuntu switched to systemd?

Assuming yes, here's what's baking my noodle. If in the future, cgmanager is just going to end up talking to systemd using systemd's API.... how does this configuration provide any functionality or service any use cases above and beyond what systemd's API already exposes? Serious question. cgmanager's API doesn't appear to be an abstraction, it appears to be mired in the details of what cgroupsfs exposes. So I don't get how a future, where cgmanager just talks to systemd via systemd's API is more capable or can cover additional use cases than systemd can directly. Well not without patching the bejesus out of systemd on the host..which isn't what you seem to be proposing. It just looks like cgmanager is going to be wedged in between for no benefit at all.

In contrast, I'm far less confused about how libvirt's future roadmap is going to work. Libvirt exposes an abstracted API, that doesn't go into cgroupfs minutia. So I get how libvirt can expose a stable abstracted API for containers to make use of that.. and can internally can then talk to systemd abstracted cgroup API and it will all work out. Libvirt's API doesn't propose to expose capability or usage cases thought to be unsupported by systemd's API.

-jef

Shuttleworth: Losing graciously

Posted Feb 15, 2014 22:34 UTC (Sat) by stgraber (subscriber, #57367) [Link] (110 responses)

The main use of cgmanager is when doing nested LXC containers where the LXC containers run unprivileged as users.

In those cases you'll get requests coming from sub-containers where the emitter of those requests is root in their own namespace but not on the host. The whole cgmanager/cgproxy API is designed so that we can safely check what process the requester actually owns and then allow it to mess with those and those only.

So we basically track the various pid namespaces and user namespaces, deal with the uid and pid translation and then do ACL checks on the host.

Shuttleworth: Losing graciously

Posted Feb 17, 2014 21:03 UTC (Mon) by fandingo (guest, #67019) [Link] (109 responses)

The systemd developers (including Lennart in this very thread) have stated that they intend to allow this sort of behavior where systemd in a container will have access to the proper subtree outside the container. What's the rationale of developing this capability in cgmanager and not doing the work directly in systemd?

Shuttleworth: Losing graciously

Posted Feb 17, 2014 21:28 UTC (Mon) by stgraber (subscriber, #57367) [Link] (108 responses)

While it's great that eventually we may get an API from systemd that may cover the needs of LXC, this isn't the case now and I suspect won't be in the near future.

What Lennart refers to is running systemd in a container managed by systemd (with nspawn) within the host's user namespace and probably without running a full distro inside it (though that last bit doesn't matter that much).

As I already stated before, LXC also supports distros that do not have systemd, including Android. cgmanager was designed to be generic enough to work on any of those and will itself talk to the systemd API or any other similar API instead of cgroupfs if they offer an API that's low level enough for us.

Now if you want an example of complex setup which I need to support with LXC (due to actual user demand, not because I want to find a far fetched example), consider this:

Host runs Ubuntu 14.04 with a 3.13 kernel (upstart).
-> User x with uid 1000 runs an unprivileged container running Debian Testing (using sysvinit)
-> Root in this container (uid 100000 on the host) runs a Plamo Linux system container (some custom init)
-> User nobody with uid 65534 (uid 165534 on the host) runs an unprivileged Ubuntu 12.04 container (upstart)

This all works today with LXC 1.0 and cgmanager, the cgmanager host socket gets passed from one level of container into the next. If the container cares about cgroups (all of the above except the last one), they need to spawn a cgproxy process that'll do SCM calls over DBus to pass user credentials and PIDs in a way that gets translated by the kernel when crossing namespace boundaries.

The main difficulty in the above is when uid 0 in the leaf container with a mapped uid of 200000 (depending on the configured mappings) is requesting for PID 50 to be moved into cgroup "a".

That's because:
- uid 0 is actually uid 200000
- pid 50 is actually pid 123123
- cgroup "a" is actually cgroup "lxc/c1/c2/c3/a"

So that's why we have cgmanager, why we use ucreds to get translated uids and pids and why we need complex logic (using namespace attach and such) to check whether uid 200000 on the host is indeed uid 0 in its namespace and whether pid 123123 is in the pid namespace that's linked with its user namespace and finally whether it's actually supposed to be able to write to lxc/c1/c2/c3/a.

That example is actually a fairly simple and common example of what cgmanager does, we have way trickier cases but those usually need me an hour or so to properly express (mostly happen on older kernels or when a sub-sub-container wants to add a pid to a cgroup which is owned by a user in that namespace. The PID ownership logic becomes pretty tricky pretty quickly.)

Shuttleworth: Losing graciously

Posted Feb 18, 2014 1:45 UTC (Tue) by fandingo (guest, #67019) [Link] (107 responses)

Thanks for the detailed reply.

> As I already stated before, LXC also supports distros that do not have systemd, including Android. cgmanager was designed to be generic enough to work on any of those and will itself talk to the systemd API or any other similar API instead of cgroupfs if they offer an API that's low level enough for us.

What's the reason for not adopting the systemd DBus API, especially when it preceded CGManager? That clearly complicates the situation for application developers, or now we've added another mandatory abstraction layer, CGManager, that should not be needed on systems that already have a cgroups manager.

I guess I don't see what the future possibly holds for CGManager. Even the present is dicey beyond the cgroupfs driver. There's not even *one* page of documentation or explanation on how to use CGManager. (This appears to be the official project page: http://cgmanager.linuxcontainers.org/.) I was perplexed and spent far too long on Google before coming to the conclusion that CGManager's only mention is on a few mailing list threads. I can't find any definition of the DBus API for CGManager. I was under the impression that CGManager was ready for use.

Over the next year and a half, it is extremely likely that new GNU/Linux installations will overwhelming use systemd. During that time, it is hard to envision the actual kernel cgroupfs driver disappearing.

Combine the longevity that the kernel cgroupfs will have with the simplicity of developing the missing features of systemd (system bus delegation and policies), it's not clear that CGManager has much purpose or will become the generic cgroup manager as it was initially advertised.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 2:18 UTC (Tue) by stgraber (subscriber, #57367) [Link] (7 responses)

Documentation wasn't our first priority due to the tight schedule caused by the expected 1.0 release of LXC, we expect more documentation to be published once we're done with the LXC release.

In the mean time, Serge published some notes on github:
https://github.com/cgmanager/cgmanager

As for application developers, our biggest user is LXC and LXC certainly knows why and how to use it (as it's the same group of people who developed both and cgmanager was mostly built from LXC's old cgroup management code). For the others, the choice is relatively straightforward, if you want something very simple that works everywhere, just use cgroupfs directly. If you care about namespaces, uid/pid translation and nesting, use cgmanager. If you prefer to use a standard DBus API and only care about systemd-based distros, use systemd.
At the end of the day, all of those configure the exact same thing. It's not ideal when accesses aren't centralized but we've lived with that for years without any major problem.

I believe my earlier comment explains why the systemd API isn't sufficient for LXC's needs and for the other group of people involved with cgmanager.

As for all distros moving to systemd, I personally think this would be a pretty sad day, diversity is very important and is the main source of improvements. Anyway, we don't expect Android to start using systemd in the near future and that's one of the reasons why LXC will be using cgmanager.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 5:28 UTC (Tue) by fandingo (guest, #67019) [Link] (6 responses)

> Documentation wasn't our first priority due to the tight schedule caused by the expected 1.0 release of LXC, we expect more documentation to be published once we're done with the LXC release.

That's a real shame. How can you be confident that it is tested or used outside the LXC use-cases, or that the API is stable for a 1.0 release?

> I believe my earlier comment explains why the systemd API isn't sufficient for LXC's needs and for the other group of people involved with cgmanager.

Besides the delegation and policy components, what's inadequate with systemd's API (not implementation)?

I feel like CGManager was advertised as the cgroup manager for everyone not using systemd. After talking to you, it seems like a LXC utility only that isn't likely to see much additional use. The use cases that you have outlined strongly indicate that the cgroupfs API is likely to be the only thing that is used.

> diversity is very important and is the main source of improvements.

This is a truism that is oft repeated, but I don't see objective evidence that it's actually true. In fact, the Linux kernel is a perfect counter example. There hasn't been useful competition to Linux for years now, and kernel developers have not had trouble innovating.

> Anyway, we don't expect Android to start using systemd in the near future and that's one of the reasons why LXC will be using cgmanager.

Is Android going to switch to CGManager? If not, how useful is it for testing or use when official Android uses the kernel cgroupfs, not the cgroupfs provided by CGManager?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 7:57 UTC (Tue) by mbunkus (subscriber, #87248) [Link] (1 responses)

> This is a truism that is oft repeated, but I don't see objective evidence that it's actually true. In fact, the Linux kernel is a perfect counter example. There hasn't been useful competition to Linux for years now, and kernel developers have not had trouble innovating.

But the Linux kernel does have competition. It's called Windows, Mac OS, the BSDs and the commercial Unices.

And monocultures are bad for innovation. Just look at the regulated telecommunication industries before they were split up (e.g. in the US) or the governmentally-mandated restrictions lifted (e.g. Germany). The Deutsche Bundespost (predecessor to what today is Deutsche Post AG, the postal service; Deutsche Telekom AG with its offspring T-Online; Deutsche Postbank AG, a bank) was known for bad service, obscene prices, a snail-like pace of innovation, complete lack of flexibility.

However, systemd works in totally different environment. There are no regulatory authorities here; the only thing preventing yet another init system to come along and take its place is technical excellence which translates into people seeing the need for it and then following through with a proper implementation. Therefore I'm not worried about a perceived lack of diversity regarding systemd, especially if the alternatives are so far behind in terms of functionality.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 16:43 UTC (Tue) by fandingo (guest, #67019) [Link]

> Therefore I'm not worried about a perceived lack of diversity regarding systemd, especially if the alternatives are so far behind in terms of functionality.

It's not clear that *anyone* is interested in an alternative and modern init system. If 14.04 weren't the LTS release of Ubuntu, Canonical would be done with Upstart at this point, and Mark Shuttleworth has said that they will switch to systemd as soon as Debian makes the switch. That's the last major holdout from GNU/Linux distributions. The only other system, which is not GNU/Linux, is Android, and I'm sure they'll continue to do their own quasi-proprietary thing.

I guess the bigger question is if something were to come along and try to fulfill the features that systemd does: why wouldn't that init system bring its own cgroup manager?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 10:43 UTC (Tue) by hummassa (subscriber, #307) [Link]

> This is a truism that is oft repeated, but I don't see objective evidence that it's actually true. In fact, the Linux kernel is a perfect counter example. There hasn't been useful competition to Linux for years now, and kernel developers have not had trouble innovating.

This is a silly oversimplification. There are lots of competition in kernel-space: Windows (DOS-based 98 and VMS-based NT), at least five BSDs, commercial unices (I worked with SunOS/Solaris, HP/UX, AIX, ULTRIX, the infamous Microsft/SCO Xenix, amongst a dozen others).

IIRC, once upon a time Linux got inspired by VMS/WinNT for its asynchronous IO, AIX via Sequent for its RCU synchronization, FreeBSD was faster in the same hardware, NetBSD supported more hardware architectures, the BSDs got plug-and-play hardware first/better, firewalls, USB, etc.

Shuttleworth: Losing graciously

Posted Feb 20, 2014 19:50 UTC (Thu) by lsl (subscriber, #86508) [Link] (2 responses)

> There hasn't been useful competition to Linux for years now, and kernel developers have not had trouble innovating.

If there's any ongoing innovation left at all in the OS kernel space it isn't happening in Linux. Not that that's (necessarily) a bad thing: Linux is (and is supposed to be) a 'production' OS with users relying on it for day-to-day work. While it has some cool new stuff that wasn't there in Unix back then I still sometimes wish the attention given to new OS research was a bit greater.

Well, that particular train probably left the station more than a decade ago. It seems that what we have is 'good enough' for people to consider putting up with the pain of transitioning to something new and unknown.

Then again, they seem to gladly endure the torture of gigantic 'programming frameworks' aimed at making up for weaknesses in the operating system interface. ;-)

Shuttleworth: Losing graciously

Posted Feb 20, 2014 23:54 UTC (Thu) by vonbrand (subscriber, #4458) [Link]

I wonder what the current systemd brouhaha is all about then. Also the recent article here on file-owned locks...

Shuttleworth: Losing graciously

Posted Feb 21, 2014 4:00 UTC (Fri) by mathstuf (subscriber, #69389) [Link]

I'd bet that there is lots of research code being written on top of Linux. And, really, if a team wants their research to ship, they'd build it on Linux (capsicum for FreeBSD is one exception that comes to mind though). Have we seen much change from anything in Singularity yet? What about other prototype research kernels? It may take a few releases to actually ship, but if that were my field, I'd base it on Linux (provided it would make sense; testing something like a completely new ABI would not make sense).

Shuttleworth: Losing graciously

Posted Feb 18, 2014 12:38 UTC (Tue) by rleigh (guest, #14622) [Link] (98 responses)

> What's the reason for not adopting the systemd DBus API, especially when it preceded CGManager?

As a systems programmer, I find the use of DBUS APIs as opposed to properly designed and implemented system calls and filesystem interfaces abhorrent. Mandating the use of DBUS for fundamental system functions is wrong on many levels.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 12:43 UTC (Tue) by HelloWorld (guest, #56129) [Link] (26 responses)

Why would anybody care if you like it? Maybe if you gave some reasons for your opinion it would be interesting; like this it's just meaningless trolling.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 12:46 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (25 responses)

I gave the reasons multiple times:
1) Auditing.
2) Security.
3) Transparency.
4) Delegation.

It _all_ works for good Linux filesystem-based interfaces, like /proc or /sys. But somehow not for cgroups.

WTF?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:07 UTC (Tue) by fandingo (guest, #67019) [Link] (24 responses)

1) It's easy to see which policies are in effect on a system. Since policies are XML files, they can also be checked into version control. I'm not sure how that happens with run-time directory permission modifications.

2) It's not possible to implement a complex policy with cgroupfs. The cgroup filesystem does not support ACLs, and consequently, you're left with the limited UGO permissions.

3) I don't know what this is supposed to mean. DBus methods are more transparent to the caller since the call returns with a meaningful response (even if empty). In fact, it seems that `echo` is the primary way that people write to special file systems. From the cgroups.txt documentation:

> bash's builtin 'echo' command does not check calls to write() against errors. If you use it in the cgroup file system, you won't be able to tell whether a command succeeded or failed.

4) Delegation is currently missing, but the systemd developers have affirmatively stated that they intend to add it.

Lastly, it's pretty clear that the kernel developers don't like /sys that much. I wouldn't be surprised to see it gradually moved to DBus over the next few years either.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:46 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (23 responses)

> 1) It's easy to see which policies are in effect on a system. Since policies are XML files, they can also be checked into version control. I'm not sure how that happens with run-time directory permission modifications.
How? Can you point me out a command-line utility that can show who has access to a given group? Do I have to parse XML?

> 2) It's not possible to implement a complex policy with cgroupfs. The cgroup filesystem does not support ACLs, and consequently, you're left with the limited UGO permissions.
So let's add SELinux policies and ACLs to cgroupfs. It's going to be useful in other situations, like /sys delegation. For me, UGO permissions are plenty enough.

> 3) I don't know what this is supposed to mean. DBus methods are more transparent to the caller since the call returns with a meaningful response (even if empty).
How do I check which cgroups are writable by me, for example? I have tons of tools for that for the classic filesystem interfaces.

> In fact, it seems that `echo` is the primary way that people write to special file systems. From the cgroups.txt documentation
Sure, and it's convenient. I can write to cgroups from a pure Java program - can I do the same with DBUS?

> 4) Delegation is currently missing, but the systemd developers have affirmatively stated that they intend to add it.
Only for other systemd containers. There are no plans to support cgmanager or my own incompatible manager that I'm just going to write out of spite.

> Lastly, it's pretty clear that the kernel developers don't like /sys that much. I wouldn't be surprised to see it gradually moved to DBus over the next few years either.
Doesn't matter. /sys and /proc virtualization and delegation are here to stay, forever. And also, [citation needed]

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:57 UTC (Tue) by jspaleta (subscriber, #50639) [Link] (18 responses)

General rule of thumb, if you want to interop with other software, document and version your stable APIs. It's a bit difficult to support cgmanager or your yet to be coded spitemanager if they use undocumented D-Bus APIs.

For example, cgmanager's draft readme, containing a draft design spec for its D-BUS API showed up in the source tree only like 5 days ago.
And even from this, draft, its unclear to me if cgmanager's D-Bus API can be considered stable at present. Since there doesn't appear to be any versioning on the API internally, I'd have to assume its prudent to still consider it unstable and subject to change. As of right now cgmanger's API should be considered an lxc private API, and not suitable to be relied on by external projects, until such time that the API is versioned and marked as stable by its developers.

I bet once cgmanager's API is deemed stable, libvirt developers will look at supporting it.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:04 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (17 responses)

> General rule of thumb, if you want to interop with other software, document and version your stable APIs. It's a bit difficult to support cgmanager or your yet to be coded spitemanager if they use undocumented D-Bus APIs.
And by the time you write this interface, it's going to be so indistinguishable from a filesystem interface that people are going to start asking WTF it was all for.

> I bet once cgmanager's API is deemed stable, libvirt developers will look at supporting it.
Does it have AppArmor support? It works fine for delegated cgroups. How about using fanotify to screen for malicious attacks (yes, I can haz an antivirus on Linux)?

And how about delegation to Android userspace which does not use DBUS at all?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:05 UTC (Tue) by jspaleta (subscriber, #50639) [Link] (5 responses)

Does what have Apparmor support? Pronouns kill. Were you refering to libvirt or cgmanager in your question?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:17 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (4 responses)

Sorry, I was not clear. The current cgroupfs interface supports AppArmor.

Or to be precise, AppArmor simply treats it as usual file operations and can apply all the regular policies.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:24 UTC (Tue) by fandingo (guest, #67019) [Link] (1 responses)

KDBus is LSM-aware[1], so it can be secured with any of those providers. Whether the AppArmor developers actually expose that in their policy language is another matter. (The AppArmor project is sorely lacking in documentation, and I can't even figure out if it currently supports the DBus1 system bus.)

[1] - http://lwn.net/Articles/551969/

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:34 UTC (Tue) by jspaleta (subscriber, #50639) [Link]

The legacy dbus-deamon has security hooks originally written for SELinux support and extended for AppArmor support (at least in Ubuntu and I'd assume on any distro that supports apparmor by default). My understanding is the reference daemon can choose not to send messages based on which ever security policy is being used on the host system. This particular feature of the reference daemon is tersely noted as an optional implementation feature in the D-BUS specification document. , I've not checked if AppArmor extension is in the mainline sources but there's no reason to think its not)

What's not clear to me is how kdbus's support for LSM will practically differ from how the reference userspace daemon's hooks worked. As in will it be more expressive or less expressive in terms of how you can lock down how applications use the bus. Still trying to wrap my head around that.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:25 UTC (Tue) by jspaleta (subscriber, #50639) [Link] (1 responses)

I don't understand how the question is relevant to question as to whether systemd will work with cgmanager and you hypothetic spitemanager.

If you code your spitemanager and you want it to interop with the other managers, then you'll have to expose an API for them to work with.

I recognize that you think any manager construct is sub-optimal to the cgroupfs construct. Noted. But your original question was about whether systemd would support cgmanager and your hypothetical spitemanager... not whether any specific manager would support the same thing that the cgroupfs construct does. My point stands.. the alternative managers to systemd's manager have to expose a stable API interoperate with. Demanding systemd to support an alternative manager that doesn't have a stable API is putting the cart before the horse.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 19:53 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link]

> I don't understand how the question is relevant to question as to whether systemd will work with cgmanager and you hypothetic spitemanager.
This is relevant. Right now the kernel interface is manager-agnostic - it can be used by anything.

With the brain-dead change to single-writer you'd have to reinvent the whole filesystem in DBUS to replicate the functionality. Look, we've already reinvented security policies, delegation (bind-mounts) and almost reinvented containers!

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:14 UTC (Tue) by fandingo (guest, #67019) [Link] (10 responses)

> And how about delegation to Android userspace which does not use DBUS at all?

This is going to be required no matter what. Your choices are either systemd or CGManager for your cgroups manager. Both use systemd. All containers will need support for interfacing with a DBus cgroup manager. CGManager just provides two APIs for its users: traditional cgroupfs style and DBus. The cgroupfs interface cannot be passed through a container.

Arguing against DBus as the principal API for any cgroup manager is a losing cause.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:18 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (9 responses)

Right now I _can_ do this with a simple --bind mount. It works perfectly fine.

> Arguing against DBus as the principal API for any cgroup manager is a losing cause.
Only because of idiotic kernel developers.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:29 UTC (Tue) by fandingo (guest, #67019) [Link] (8 responses)

>> Arguing against DBus as the principal API for any cgroup manager is a losing cause.
> Only because of idiotic kernel developers.

Huh? The kernel cgroups are exposed by system calls to a single writer. The only two writers that exist today (or have even been announced) both principally expose cgroups using a DBus API. CGManager also supports the cgroupfs API.

The inclusion of kDBus (when that happens later this year) is orthogonal to how the kernel exposes cgroups to the manager.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 19:54 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (7 responses)

>Huh? The kernel cgroups are exposed by system calls to a single writer.
Incorrect. Right now cgroups can be manipulated by any number of processes.

I repeat, IT WORKS ALREADY.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 20:44 UTC (Tue) by fandingo (guest, #67019) [Link] (6 responses)

The subject has been thoroughly covered: cgroupsfs as provided by the kernel will eventually go away. There's no ambiguity to the situation. It is being left on for the short-term, so user space is not broken immediately.

You're complaining about a deprecated feature will be removed. You have four options:

1) Start using the systemd or CGManager DBus APIs.

2) Use CGManager with its cgroupfs provider.

3) Implement spitemanager for whatever API (presumably cgroupfs) you desire.

4) Fork the kernel or stop using new versions.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:04 UTC (Tue) by jspaleta (subscriber, #50639) [Link]

I should point out that libvirt is also planning on providing transitional cgroupfs support for hosts not using systemd as manager yet, for libvirt based container users. That's not to say that libvirt provides universal use case coverage. Just pointing out that libvirt was yet another cgroupfs writer and that it has a transition plan in place to work in the single writer world right now and the transition plan is documented on the libvirt project site. They have no plans for cgmanager support yet, but as the api for cgmanager hasn't really been communicated as stable yet, can't really expect them to be able to support that api.

Though it is interesting to see if Ubuntu patches libvirt as shipped in Trusty to talk to cgmanager. Right now it it appears that libvirt as packaged in Trusty isn't patched for that yet and will be relying on the cgroupfs in 14.04. So that's an interesting little wrinkle. Will a trusty host running cgmanager be able to work with trusty libvirt based containers?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:34 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (4 responses)

Basically, all the solutions are sub-optimal compared to the only real solution: use filesystem-based interface.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:57 UTC (Tue) by mathstuf (subscriber, #69389) [Link] (3 responses)

What about another system call where a process in a new PID namespace asks to be the manager, the kernel sees that it's in a namespace and asks the parent cgroup manager whether it should be allowed. If it is, the kernel allows delegation of that subtree to only that manager (the parent can't touch inside of it anymore). If the parent denies access, the kernel gives an error to the container manager that the subtree is already managed (I assume there is such an error condition already). Would that be sufficient for the New World Order[1]?

[1]Assuming that you don't somehow convince kernel developers to forego the single-writer changes (which seems very unlikely at this point).

Shuttleworth: Losing graciously

Posted Feb 18, 2014 22:20 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (2 responses)

Yes, I have something like this in mind:

0) Regular permissions apply.

1) Add 'pid-lock' file at each level of cgroups tree. Everyone can modify cgroups tree if this file is empty.

2) Once you write a pid into this file only this process can make modifications to this tree level and deeper.

3) The pid-lock process can modify pid-lock files in its subtree, either clearing them completely or by writing another pid. It doesn't lose access as long as it's still alive.

4) Subtree moves must respect pid-locks and permissions.

That's basically it. It still allows to lock the tree against accidental modifications and also gives a clear path for delegation. DBUS connoisseurs can still use fully DBUS-based delegation and access control and everybody else can use normal filesystem-based API.

It's also possible to add a bitmask of delegated controllers. For example, so that the parent controller can limit the delegated controllers to cpu manager but not costly memcg.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 14:24 UTC (Wed) by HelloWorld (guest, #56129) [Link] (1 responses)

Well, fair enough. Are you going to implement it? I think it's hardly a secret that kernel developers are usually not interested in designs that aren't accompanied by working code.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 14:28 UTC (Wed) by mathstuf (subscriber, #69389) [Link]

It'd be worth it to ask whether it would even be considered first at least.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:05 UTC (Tue) by fandingo (guest, #67019) [Link] (3 responses)

>> 1) It's easy to see which policies are in effect on a system. Since policies are XML files, they can also be checked into version control. I'm not sure how that happens with run-time directory permission modifications.

> How? Can you point me out a command-line utility that can show who has access to a given group? Do I have to parse XML?

The cgroup delegation is not finished. Systemd generally allows read access on most things, and I doubt this would be different for cgroups. Therefore, it should be as simple as running a dbus-send command.

>> 2) It's not possible to implement a complex policy with cgroupfs. The cgroup filesystem does not support ACLs, and consequently, you're left with the limited UGO permissions.

> So let's add SELinux policies and ACLs to cgroupfs. It's going to be useful in other situations, like /sys delegation. For me, UGO permissions are plenty enough.

You've clearly decided to go it alone on this (or just continually complaining). Step on up and show us the progress you've made.

>> 3) I don't know what this is supposed to mean. DBus methods are more transparent to the caller since the call returns with a meaningful response (even if empty).

> How do I check which cgroups are writable by me, for example? I have tons of tools for that for the classic filesystem interfaces.

See #1.

>> 4) Delegation is currently missing, but the systemd developers have affirmatively stated that they intend to add it.

> Only for other systemd containers. There are no plans to support cgmanager or my own incompatible manager that I'm just going to write out of spite.

Except systemd has a good record of keeping API stability. A container will be free to connect to DBus and talk to systemd. The container's cgroup manager can expose whatever interface it wants inside. (This is exactly the same approach that CGManager is taking. There will always be a requirement that the cgroup manager inside the container knows how to talk to the manager outside the container.)

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:11 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link]

> The cgroup delegation is not finished. Systemd generally allows read access on most things, and I doubt this would be different for cgroups. Therefore, it should be as simple as running a dbus-send command.
Presumably, systemd uses the magical powerz of DBUS for access controls. So all the tools should be already there.

Where are they?

I suspect that all the people pontificating about 'just connect to DBUS' do not even understand how it works. For example, what happens if a container starts its own DBUS daemon that knows nothing about the external daemon? How is authorization of connections handled?

> Except systemd has a good record of keeping API stability. A container will be free to connect to DBus and talk to systemd. The container's cgroup manager can expose whatever interface it wants inside.
Wrong. I can't expose cgroups filesystem interface, for example. Or re-delegate to a manager that can only act as the root manager.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 1:48 UTC (Wed) by dlang (guest, #313) [Link] (1 responses)

>Except systemd has a good record of keeping API stability.

I'm not sure how you can say that.

Systemd hasn't really been around very long, so they are almost entirely stil on their first system, they haven't had much reason to modify much.

But their attitude that they are the only thing that matters, and willingness to take over and replace existing APIs with their different replacement doesn't give _me_ much confidence that they will maintain the old APIs long term.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 12:34 UTC (Wed) by pizza (subscriber, #46) [Link]

> But their attitude that they are the only thing that matters, and willingness to take over and replace existing APIs with their different replacement doesn't give _me_ much confidence that they will maintain the old APIs long term.

So, your argument against their public commitment to (and track record of) API stability is to say "I don't believe them."

You then try to justify that attitude by saying that they're still doing new things, and those new things may require new APIs. Well... duh. It's rather pointless to do something new if you don't create a way of managing it.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 13:01 UTC (Tue) by andresfreund (subscriber, #69562) [Link]

I don't disagree, but that's not really systemd's or cgmanager's fault, they didn't decide that way.
And there actually is a good reason to delegate responsibility in cgmanger's case, delegating part of the responsibility to higher privileges makes sense.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 16:53 UTC (Tue) by fandingo (guest, #67019) [Link] (69 responses)

You do realize that the DBus is moving into the kernel this year, right? It's going to become one of the core IPC mechanisms on Linux (if you didn't already consider it one).

What's actually good about filesystem interfaces besides familiarity? There's nothing advantageous about them, and in fact, there is a great difficulty in sending information back to the caller.

System calls are only good when you're talking to the kernel. Without a more full-fledged IPC mechanism (like DBUs perhaps), you don't want the kernel acting as the arbiter translating calls to other programs.

Lastly, I don't understand the dislike of DBus. What's bad about reliability, easy type support, extensive policy support, and multiple messaging paradigms? I'm also a programmer, and I don't understand why anyone wouldn't want that. Perhaps you could elaborate.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 16:59 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (7 responses)

>What's actually good about filesystem interfaces besides familiarity?
How do I delegate '/sys/fs/cgroup/some/cgroup/container/path' or whatever its counterpart in DBUS is going to be to user 'root' inside a namespaced container?

I have no idea. People usually make some noises about PolKit so I checked it. Its configuration looks something like this: http://cgit.freedesktop.org/polkit/tree/data/org.freedesk...

Yeehaw! An XML config with lots of strange options. Documentation is also quite impenetrable.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:12 UTC (Tue) by michich (guest, #17902) [Link] (1 responses)

That's not polkit's configuration. That's its DBus policy description, which is parsed by dbus-daemon. I.e. it's one of the things that's going away with the move to kdbus.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:30 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link]

And how? A lot of people are saying that it's easy, can anyone point me to a tutorial that describes it?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 19:44 UTC (Tue) by mathstuf (subscriber, #69389) [Link] (4 responses)

Not that this is going to make you happier, but PolicyKit *rules* are written in JavaScript[1][2]. The XML stuff is for the declaration of *actions* which can take place.

[1]https://wiki.archlinux.org/index.php/Polkit#Structure
[2]http://blog.christophersmart.com/2014/01/06/policykit-jav...

Shuttleworth: Losing graciously

Posted Feb 18, 2014 19:57 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (3 responses)

Are you joking??? It just keeps getting better and better.

Now rules are not only opaque, they are not analyzable even in principle! Never mind dirty little tricks of JavaScript like using floats instead of ints for numbers.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 20:28 UTC (Tue) by mathstuf (subscriber, #69389) [Link] (2 responses)

Well, as long as you keep the JS to a subset which is not Turing complete (probably doable; avoiding loops accomplishes that). Other than that, I think the justification was that some policy decisions need more complex logic and JS was the easiest language to embed. I don't know how seriously languages like Perl, Tcl, and Lua was considered (I'm 100% OK with shell not being the language).

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:38 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (1 responses)

I don't see how this can be done automatically, without writing something like SMACK for JavaScript.

Sigh... It looks like DBUS developers have gone off the rails completely and systemd+kernel people are happy to join the bandwagon.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:46 UTC (Tue) by mathstuf (subscriber, #69389) [Link]

These aren't DBus developers; these are PolicyKit developers. There was a decently large thread on fedora-devel about it when it came out. I'm pretty sure that if you want to write a separate access mechanism than PolicyKit, DBus doesn't itself care. I don't know what KDBus will do here since calling back out to PolicyKit for every call seems to work at cross-purposes to some of the goals of it (the number of context switches mainly).

Shuttleworth: Losing graciously

Posted Feb 18, 2014 16:59 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (60 responses)

> System calls are only good when you're talking to the kernel. Without a more full-fledged IPC mechanism (like DBUs perhaps), you don't want the kernel acting as the arbiter translating calls to other programs.

Uhm. Cgroups is a kernel interface. It's not an interface to some userspace program.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:08 UTC (Tue) by fandingo (guest, #67019) [Link] (59 responses)

The kernel cgroupfs interface is depreciated and will disappear in due time.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:30 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (58 responses)

No, it's not. Only delegation and multiple-writers are deprecated.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:57 UTC (Tue) by smurf (subscriber, #17840) [Link] (57 responses)

Exactly. Now what is a poor little init in a nonprivileged namespace to do if it wants to partition its slice of the universe into cgroups, if it can no longer write to cgroupfs?

Right -- it needs to talk to whatever process does the actual cgroups work. Presumably, DBus is a reasonable way to do that -- you can implement more complex permissions, send structured data in, get structured replies out, and have an altogether more high-level interface for what you actually want to do instead of doing baby steps in cgroupfs.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 17:59 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (56 responses)

> you can implement more complex permissions, send structured data in, get structured replies out, and have an altogether more high-level interface for what you actually want to do instead of doing baby steps in cgroupfs.
A simple question: "How?"

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:05 UTC (Tue) by fandingo (guest, #67019) [Link] (1 responses)

PolicyKit.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:12 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link]

Again, show me the code.

To make cgroupfs subtree accessible to a user 'vasja' I simply need to do 'chown -R vasja /sys/fs/cgroup/some/subtree' and that's it. How do I do the same with DBUS?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 18:57 UTC (Tue) by smurf (subscriber, #17840) [Link] (53 responses)

Dbus gives you the mechanisms to do all of that. A file system interface does not.

I'm not involved with the details of which dbus call does, or will do, exactly what, and I like it that way. So, sorry but you'll have to get the actual implementation details from somebody else.

No, the kernel people are not "idiotic" when they want to impose a one-writer-only policy on the cgroups subsystem. It makes perfect sense to have one process arbitrate access instead of adding ACL support to cgroupfs and dealing with multiple processes stepping onto each other's toes.

In any case, unless you can actually convince them to not enforce a single-writer policy after all, demanding that a multi-writer cgroupfs should continue to be available is … somewhat futile. Especially here; this is not a kernel mailing list.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 20:00 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (52 responses)

> I'm not involved with the details of which dbus call does, or will do, exactly what, and I like it that way. So, sorry but you'll have to get the actual implementation details from somebody else.
So I conclude that NOBODY knows how to do it. Does it not ring any alarm bells?

>No, the kernel people are not "idiotic" when they want to impose a one-writer-only policy on the cgroups subsystem.
Yes, they are. They are total idiots in this regard.

>It makes perfect sense to have one process arbitrate access instead of adding ACL support to cgroupfs and dealing with multiple processes stepping onto each other's toes.
Why does it make a perfect sense? What are the reasons? Can you point out a design document with them?

> In any case, unless you can actually convince them to not enforce a single-writer policy after all, demanding that a multi-writer cgroupfs should continue to be available is … somewhat futile. Especially here; this is not a kernel mailing list.
LKML is a dump. Asking there is an almost certain guarantee for a message to be lost.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 20:51 UTC (Tue) by fandingo (guest, #67019) [Link] (51 responses)

> So I conclude that NOBODY knows how to do it. Does it not ring any alarm bells?

It's because these features are currently being developed. They're not finished. None of this work will be completed until KDBus is merged.

I suggest that if you have such pressing concerns and questions about how all of this works, it's more appropriate to take your complaints to the developers directly.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:32 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (50 responses)

> It's because these features are currently being developed. They're not finished. None of this work will be completed until KDBus is merged.
Incorrect. Single-writer mode is already there and KDBus is going to be an optional dependency for a long time even after that.

All the policy mechanisms are already there. Yet nobody here can tell me how to do the simplest thing possible - delegate a DBUS subtree to a user.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:41 UTC (Tue) by mathstuf (subscriber, #69389) [Link] (49 responses)

You don't delegate subtrees of DBus APIs. I imagine that requests for changes to cgroup would come with a parameter for which subtree to apply the changes to. By default, everyone passes '/' as the subtree to apply things to, but if you're delegating a subtree, pass '/machine/vm0' as the subtree. You then authenticate the caller against who is allowed to manage the '/machine/vm0' subtree. Or you attach to the 'org.freedesktop.systemd.cgroupdelegate1' interface at the '/machine/vm0' path and call methods there (everyone else calls the 'org.freedesktop.systemd.cgroup1' interface methods).

I have a feeling that your emotions here are getting in the way of seeing potential solutions and mixing up pieces of information. May I suggest putting your concerns into a wiki page of some sort so that responses to them aren't spread across unpteen LWN articles and subthreads?

Shuttleworth: Losing graciously

Posted Feb 18, 2014 21:51 UTC (Tue) by fandingo (guest, #67019) [Link]

This is correct, but just to clarify:

Telling systemd that user X (even if that's a user that needs to be resolved several times from namespaces) is allowed to perform actions A,B,C on a subtree does not exist presently. That's why no one can tell Cyberax what API calls are needed.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 22:09 UTC (Tue) by Cyberax (✭ supporter ✭, #52523) [Link] (1 responses)

I'm now reading the kernel and systemd code. I still don't understand how authentication is going to work. Also, what happens next?

Suppose that I have systemd managing the root host and I create a cgmanager-based container. Suppose that they interoperate, so somehow cgmanager should connect to the host? How do I pass credentials for it? Or does systemd simply trust the first connection?

Then the cgmanager connects to its local DBUS running inside the container and starts serving its local clients. KDBus doesn't really change this, their namespaces would be separate.

Then the next question, would the cgmanager-based partitions be visible in the global manager? Probably yes, since there's no delegation. However, access rights would definitely be lost because cgmanager is probably going to implement its own policies. So there's not going to be any way to check what users and/or containers are using subtrees.

Still a mess. I guess to dive into it headfirst and try to make sense of it.

Shuttleworth: Losing graciously

Posted Feb 18, 2014 22:46 UTC (Tue) by fandingo (guest, #67019) [Link]

Users inside a namespace are mapped to UIDs outside the namespace. There are more details here http://lwn.net/Articles/532593/, but it seems that some privileged inner UID needs to be mapped.

Here's my understanding on how it would work. Besides some terminology, the model seems common between systemd and CGManager.

1) Bind mount the DBus socket dir into where the container will run.
2) Create cgroup and apply controllers as needed.
3) User namespace is created.
4) Root has an outer UID mapped. If the cgroup manager runs as a different user, it is also mapped.
5) PolicyKit is updated to allow access to the mapped user on the specific cgroup subtree.
6) The container OS boots.
7) The cgroup manager inside the container connects to the DBus socket. (This does not serve as the system bus inside the container. That is separate.)
8) The cgroup manager inside the container attaches to its system bus.

It's expected that the container software takes care of at least 3-8 and possibly the first two as well.

Operation:

A process inside the container wants to make a cgroup modification.

1) It connects to DBus (or cgroupfs if desired and CGManager is running inside the container) and sends the request.
2) PolicyKit (or file system permissions if using cgroupfs via CGManager) inside the container authorizes the action.
3) The cgroup manager inside the container accepts the command, and sends the command over the DBus socket to the outer cgroup manager.

4) The outer cgroup manager receives the message.
5) The outer cgroup manager translate the inner cgroup path to its relative position outside and consults PolicyKit for authorization. If authorized, the cgroup action is completed. A return message (with properly sanitized path) is sent across the socket to the inner cgroup manager, which forwards the message to the process that initiated the call.

=====

Let's say that some process inside the container wants delegation of a part of the container's subtree. That authorization doesn't take place in the outer PolicyKit. It happens inside.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 15:32 UTC (Wed) by HelloWorld (guest, #56129) [Link] (45 responses)

I have to say that I tend to sympathize with Cyberax here. cgroups are a hierarchy of objects, and we have an API to manipulate those: the file system. Of course you can build what essentially amounts to a copy of the file system API with D-Bus, but you'll loose all the tool support along the way, and people won't know how to use it, so why bother?

By the way, I feel similarly with regard to cgroups as a whole. We already have a process hierarchy, why do we need another one? Of course the problem with the traditional process hierarchy was that processes could escape by double-forking, but that was fixed with with prctl(PR_SET_CHILD_SUBREAPER). So why do we need cgroups at all?

Oh well, by now it's probably too late to change any of this.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 16:22 UTC (Wed) by fandingo (guest, #67019) [Link] (17 responses)

I'm wondering what tool support actually exists. It seems more likely that there's a mash of shell scripts that mkdir, chmod, chown, and echo their way through using cgroups. That's pretty lousy. I've already posted the warnings from cgroups.txt on using echo. If you're using something beyond shell script, you'll end up with a better program by interfacing with DBus if only due to better return information and error handling.

> Of course the problem with the traditional process hierarchy was that processes could escape by double-forking, but that was fixed with with prctl(PR_SET_CHILD_SUBREAPER). So why do we need cgroups at all?

Doesn't that require one of the following three situations?

* Well behaved main process of the service that sets PR_SET_CHILD_SUBREAPER, so none of its descendents escape. If this process ever dies, is killed without cleaning up (e.g. sigkill), or fails to set itself as the subreaper processes can escape the hierarchy.

* The init system has to maintain a process for each service that sets itself as the subreaper. It's not responsible for anything besides executing/stopping/killing the service, and cleaning up PIDs. That certainly adds a lot of overhead and complexity. Without dedicating a process to each service, you just end up with everything having subreaper set to PID 1 (or whatever a modular service manager runs as); these service hierarchies would all point to the same parent, making it impossible to distinguish between them.

The first option does not seem appealing because it requires a significant amount of trust in the service, there are reliability concerns, and the developers of each service need to do work to explicitly support this init model.

The second option mainly suffers from complexity. Init has far more processes running as part of its service management. Some IPC mechanism would be needed to track these processes and allow start/stop/restart/kill/etc. commands from the user to the service manager to the hierarchy manager to work.

Subreaper is designed to be used for proper process cleanup, not for tracking process hierarchies.

Lastly, the ability to set resource limits is limited. It would be possible to use something like setrlimit or prlimit, but those are both process-specific, and don't allow the flexibility of group-based resource limits.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 17:16 UTC (Wed) by HelloWorld (guest, #56129) [Link] (4 responses)

> * The init system has to maintain a process for each service that sets itself as the subreaper. It's not responsible for anything besides executing/stopping/killing the service, and cleaning up PIDs. That certainly adds a lot of overhead and complexity.
I don't think so. Processes are cheap on Linux, and I don't think that reaping child processes is likely to ever be a bottleneck for any realistic program.

> Some IPC mechanism would be needed to track these processes and allow start/stop/restart/kill/etc. commands from the user to the service manager to the hierarchy manager to work.
Where “Some IPC mechanism” would obviously be D-Bus, which makes this sort of thing very easy.

> Lastly, the ability to set resource limits is limited. It would be possible to use something like setrlimit or prlimit, but those are both process-specific, and don't allow the flexibility of group-based resource limits.
That's not a fundamental limitation. Just allow processes to specify that an rlimit is supposed to apply to them as well as their descendants, and PR_SET_CHILD_SUBREAPER ensures that your descendants can't reparent themselves to init. So I think this whole thing could be made to work fine if somebody bothered to do the work. Otoh, I'm not sure if anything would actually be gained by doing that instead of cgroups.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 17:43 UTC (Wed) by fandingo (guest, #67019) [Link] (2 responses)

> Just allow processes to specify that an rlimit is supposed to apply to them as well as their descendants, and PR_SET_CHILD_SUBREAPER ensures that your descendants can't reparent themselves to init.

That's true if and only if that ancestor reaper never dies or is killed. The security implications complicate things. It should be possible to overcome them possibly, but the warts add up.

> That's not a fundamental limitation. Just allow processes to specify that an rlimit is supposed to apply to them as well as their descendants[...] Otoh, I'm not sure if anything would actually be gained by doing that instead of cgroups.

I totally agree, but it would require a change to those functions (or new recursive versions).

Shuttleworth: Losing graciously

Posted Feb 19, 2014 17:59 UTC (Wed) by HelloWorld (guest, #56129) [Link] (1 responses)

> That's true if and only if that ancestor reaper never dies or is killed.
So what? systemd is already required to never die.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 18:29 UTC (Wed) by fandingo (guest, #67019) [Link]

If the reaper dies, then processes in that service escaped their "container" (because PPID is now 1). The service manager can no longer track them, and has lost all reliable control of the service.

This becomes a major problem with privileged services. Many services maintain a parent process that runs as root. Any compromise of this privileged service process (or malfeasance by it) allows it to kill it's hierarchy manager process and escape all control.

On the other hand, cgroups in a single-writer environment should be immune to this.

With systemd cgroup manager, PolKit would not authorize a process move outside all cgroups or to another cgroup (outside specific definitions like system.slice/sshd.service/ --> /user.slice/session.scope/).

The major benefit to systemd's cgroup manager is that it is not attackable via this style. It cannot be intentionally killed (it ignores all signals, even sigkill since it is PID 1), and if it were somehow forced to crash, the system would panic. Since PID 1 is the cgroup manager, there is no way to gain control of the kernel interface either.

There's no meaningful way to protect a reaper, unless you mandate that nothing in a hierarchy can run with enough privileges to kill the reaper. That would require a substantial change in many services, or requires additional sandboxing mechanisms in the kernel. (The kernel would need to perform a check that a caller of kill(2) is not trying to kill its reaper.)

Shuttleworth: Losing graciously

Posted Feb 19, 2014 18:58 UTC (Wed) by smurf (subscriber, #17840) [Link]

Umm … did you ever happen to run across the idea that just maybe process hierarchies and cgroups are a VERY bad fit? I can think of a couple of use cases where that wouldn't work at all well.

What if I want to fork off a bundle of programs which need to share the same memory limit (i.e. 200MBytes for all of them in sum, not individually … like for instance all the processes in James' sessions … and what if James logs in with X *and* with ssh)?

What if I realize, after starting my disk copy program, that it eats too much memory / disk bandwidth, and I want to retroactively park it in a more limiting cgroup? Does my shell suddenly need to know about that stuff?

Sorry -- won't work.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 17:42 UTC (Wed) by HelloWorld (guest, #56129) [Link] (11 responses)

> I'm wondering what tool support actually exists. It seems more likely that there's a mash of shell scripts that mkdir, chmod, chown, and echo their way through using cgroups. That's pretty lousy.
It's not lousy, there's nothing wrong with that! Those are tools that every admin knows and uses, and that's a Good Thing. The reason devices are exposed as “files” in /dev is precisely that one can do things like access control just as if they were proper files. Do you want to replace that too? It's certainly possible to give udev a D-Bus interface and use fd passing to open device files!

> I've already posted the warnings from cgroups.txt on using echo. If you're using something beyond shell script, you'll end up with a better program by interfacing with DBus if only due to better return information and error handling.
“We can't use a file system based interface because bash's echo builtin is broken” is about as lame an excuse as it gets. The answer to that is to fix bash or to use printf.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 18:48 UTC (Wed) by smurf (subscriber, #17840) [Link] (10 responses)

> The reason devices are exposed as “files” in /dev is precisely that
> one can do things like access control just as if they were proper files.

"It behaves like a plain file" doesn't work for quite a few device nodes, and most Linux subsystems are not controlled by echo: You don't emit sound by "cat rhapsody.wav >/dev/snd" these days, and you don't resize a LVM partition by "echo 10TB >/sys/devices/virtual/block/volgroup/master/varlog/size".

This is Linux. This is not Plan 9 where you can open a TCP connection with mkdir. cgroupfs is fine for introspection, but control? that always seemed a bit unnatural to me.

Besides, pragmatically, a sensible "cgroupctl"-style program will have a --help option and a manpage. To me that seems a lot more useful than traipsing around in cgroupfs and wondering which magic mkdir+echo+mv combo I need to evoke to limit my disk copy program's memory usage.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 20:22 UTC (Wed) by HelloWorld (guest, #56129) [Link] (9 responses)

> "It behaves like a plain file" doesn't work for quite a few device nodes,
So what? Just because you can't read(2) or write(2) to some device nodes doesn't mean you need to use another interface for things like poll(2) or chmod(2). Stop thinking about the “file system” and start thinking about a general hierarchical namespace for all kinds of objects. This is where we are today with files, sockets, fifos, devices files etc.. It's only natural to extend that further.

> This is Linux. This is not Plan 9 where you can open a TCP connection with mkdir.
Uh, I know this is Linux and not Plan 9. How is that supposed to be an argument? We should learn from Plan 9 instead of taking that kind of “us vs. them” stance.

> cgroupfs is fine for introspection, but control? that always seemed a bit unnatural to me.
And to me it seems unnatural that access control for cgroups is supposed to be done through a completely different mechanism than access control to files or devices. Though I agree with you that the current cgroups API isn't ideal. For one thing, I think the natural thing is to use
ln /proc/42 /sys/fs/cgroup/yaddah/cgroup.procs
and not
echo 42 > /sys/fs/cgroup/yaddah/cgroup.procs
to add processes to a cgroup.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 21:17 UTC (Wed) by smurf (subscriber, #17840) [Link] (8 responses)

> And to me it seems unnatural that access control for cgroups
> is supposed to be done through a completely different mechanism
> than access control to files or devices

I strongly suspect that the main reason for that is because you're used to it.

> ln /proc/42 /sys/fs/cgroup/yaddah/cgroup.procs

Linking.
Across file systems.
Yeah, right.

Sorry, but this is the point where I stop responding to you.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 21:44 UTC (Wed) by HelloWorld (guest, #56129) [Link] (7 responses)

> Linking.
> Across file systems.
> Yeah, right.
So what? It's not allowed for conventional file systems because it doesn't make sense there. It does make sense for this case, so there's no reason for it not to be allowed.

> Sorry, but this is the point where I stop responding to you.
You're doing as if I had somehow offended you. I haven't.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:06 UTC (Wed) by mathstuf (subscriber, #69389) [Link] (6 responses)

Not only are you linking across filesystems (how would one find out that it is hardlinked elsewhere?), but you're hardlinking a directory. When process 42 ends, does the "hardlink" disappear? If not (as one might expect of hardlinks), does a new process with PID 42 get put there? The /sys and /proc filesystems are already pretty magical, but those are only around read and write (AFAIK), not how many other syscalls as well. Really, even echoing the PID to a file is racy. I'd much rather have something like a procfd to use here.

These behaviors you're asking for are quite different than the usual semantics these tools imply. Sure, filesystems and cgroups are both hierarchical, but there is such a thing as stretching a metaphor too far. To make a meta-metaphor: Should we abandon databases and just use spreadsheets instead? Vice versa? They're both "just" grids of data cells.

Shuttleworth: Losing graciously

Posted Feb 20, 2014 0:13 UTC (Thu) by HelloWorld (guest, #56129) [Link] (5 responses)

Alright, you have a point. Using link(2) is probably not a good idea.

Shuttleworth: Losing graciously

Posted Feb 20, 2014 2:08 UTC (Thu) by MrWim (subscriber, #47432) [Link] (4 responses)

rename() might be though. AFAIU all pids have to appear in the cgroup tree so to put a pid in a cgroup you have to remove it from another. You would need permissions for both cgroups and it happens atomically.

Shuttleworth: Losing graciously

Posted Feb 20, 2014 15:20 UTC (Thu) by HelloWorld (guest, #56129) [Link] (3 responses)

rename() was my first thought. But that would remove the process from the /proc directory, and that doesn't really make sense, does it?

Shuttleworth: Losing graciously

Posted Feb 20, 2014 15:23 UTC (Thu) by mathstuf (subscriber, #69389) [Link] (2 responses)

I think the suggestion was to move the pid from one cgroup directory to another, not from /proc.

Shuttleworth: Losing graciously

Posted Feb 20, 2014 17:34 UTC (Thu) by HelloWorld (guest, #56129) [Link] (1 responses)

Well, that would work, but then how do you move a process that isn't a member of any cgroup into one?

Shuttleworth: Losing graciously

Posted Feb 20, 2014 18:15 UTC (Thu) by MrWim (subscriber, #47432) [Link]

That's what I meant by "all pids have to appear in the cgroup tree so to put a pid in a cgroup you have to remove it from another". My assumption is that there cannot be a process which isn't a member of any cgroup. If init starts in a cgroup and it's children end up in the same cgroup and there's no way of unlink()ing pids from the cgroup tree then you're guaranteed that every process is in the tree.

In that setup you can't steal other users processes and put them in your subtree, you can only move pids around in the trees you own. You can then use whichever cgroup manager that you desire in your subtree. Containers work while still only co-operating with the kernel, rather than having to communicate with other user-space programs running outside.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 16:29 UTC (Wed) by paulj (subscriber, #341) [Link] (26 responses)

+1 on the parent. Cyberax makes good points.

Further, no one has been able to explain why this is better implemented in user-space via DBus. There only seem to be assertions from Tejun on mailing lists that there are problems, but pretty much no detail on what those problems are. Worse, nothing, not even in the abstract, on why these problems would be any easier to tackle in user-space.

If the argument is that it is difficult to get multi-writer access to filesystems right, or multi-writer setting of permissions, then the kernel surely has bigger problems.

If the issue is ABI stability, those problems also do not get magically become easier in user-space. Except perhaps that it is easier to circumvent Linus' determination to keep the kernel:user-space ABI stable (?) by simply not having one.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 17:31 UTC (Wed) by fandingo (guest, #67019) [Link] (25 responses)

There are plenty of concerns with the status quo. http://www.linux.com/news/featured-blogs/200-libby-clark/... and https://lwn.net/Articles/574317/ identify many of the issues.

From my perspective, the two most notable deficiencies are:

* Security. Delegating direct access to kernel interfaces is dangerous. The kernel is not running anything resembling a full security policy and shouldn't be expected to. Commands which have the ability to drastically affect a running system need to be vetted by something, and it only makes sense to do that in user space. Traditional UNIX ownership and permissions are insufficient for building comprehensive policies. While this may not be a concern to some, it's a major weakness. There's no excuse for providing an insecure interface to the kernel.

* Exposing too much interior detail. If the cgroups API had been more abstract from the start, it probably would have been possible to fix at least some of the other deficiencies. Unfortunately, cgroupfs exposes too much internal information, making it, in Tejun's opinion, infeasible to fix without major changes.

> If the argument is that it is difficult to get multi-writer access to filesystems right, or multi-writer setting of permissions, then the kernel surely has bigger problems.

Not all kernel developers work on every part of the kernel. The people who maintain and develop cgroups have decided that this is the highest priority undertaking for them.

> If the issue is ABI stability, those problems also do not get magically become easier in user-space. Except perhaps that it is easier to circumvent Linus' determination to keep the kernel:user-space ABI stable (?) by simply not having one.

The kernel will still have an ABI for user space. I'm not sure why people keep saying otherwise. It's only usable by one process, but it's certainly still there. And for user space, it actually does become substantially easier. Just look at CGManager, which has decided to support two APIs simultaneously.

If there is a need to modify the cgroups ABI in the future, it's so much easier. Rather than having hundreds of thousands of users (everything from systemd to shell scripts) that will be impacted, it's just a handful of cgroup managers.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 18:10 UTC (Wed) by paulj (subscriber, #341) [Link] (20 responses)

On security: If delegating direct access to kernel interfaces is inherently dangerous (richness/flexibility of security models excepted), but user-space mediation is not, then we need to ban all direct filesystem access, and make all code access files and data via IPC to a management daemon.

I simply don't think that's the case. If we can't do this securely in the kernel, the user-space mediator won't be any better (certainly not when it's coded in a similar style), security wise (with that exception). Otherwise, please explain how user-space is more secure?

With regard to the Unix ownership/permissions model:

1. This may suffice for many. There's little evidence, in the history of operating systems, of complex security models getting wide-spread end-user use.

2. However, it is incorrect to say the kernel to userspace delegation API is limited to Unix owner/group permissions. It also allows for ACLs - which an fs based cgroups API (not necessarily the current version) could implement.

3. Even if Unix perms and/or ACLs *were* still insufficient for all users, that is *NOT* a reason to not offer the FS API. If a user-space daemon wants to offer some other security model on top, the existence of the FS API does not stop that. They can live together. Why does it mean the fs interface has to be removed?

On exposing too much detail: Then that's a problem of the current cgroupfs API. Fix it with a new one. Why is it better for that new API to live in user-space?

One of the problem's Tejun has is that the fs API *allows delegation*, and hence allows an admin to give resources to non-privileged processes that might affect other users/processes. But isn't that perhaps inherent to an over-committed resource sharing system like normal Unix/Linux? Further, if the problem is inherently to do with the delegation, how will a user-space API that allows delegation fix things? The answer, if delegation really is a problem, must be that that delegation has to be removed. There is no reason this can be done in a kernel API, surely? Or would you argue it is easier to remove things in userspace APIs? (That argument would scare me).

Why not just try and get the kernel API right? Why will it be any easier to get things right if the thousands of shell scripts are calling dbus-send instead of writing to a virtual fs? How will having this in user-space make it any easier to deal with all the thousands of users, just cause they talk to a manager instead of the kernel?

It very much sounds like the kernel cgroups people simply don't want to have to work out the details of what is needed, and so want to punt it to user-space. It sounds almost a social problem, more than a technical one.

Lastly, on the "not all kernel developers are familiar with implementing a virtual fs" issue - they can ask for help, surely. :) Eventually viro, or someone similar, will get annoyed enough to do further extend VFS and library support to further ease implementation of virtual fses, if needed. As (IIRC) happened yonks ago when he got fed up with procfs and others. ;)

Shuttleworth: Losing graciously

Posted Feb 19, 2014 18:55 UTC (Wed) by fandingo (guest, #67019) [Link] (9 responses)

I think the primary reason why it was moved out of the kernel is that the policies authorizing access are not simple and may not follow traditional methods. Here are a couple of situations where non-traditional policies may be desired

* Only allow PID X to manage cgroup subtree /A/B/C.
* Processes in subtree /A/B/C/ may move their processes in /D/E/F but should not have any further control.

Traditional file permissions and even ACLs are incapable of dealing with either situation. In the first, case there's no way to grant that access without allowing all processing owned by that same user/group from controlling /A/B/C. In the second situation, there's no way to allow a one-way move.

Yeah, we can start adding more files to the cgroups to attempt finer control, but now things get messy, and it's likely the who permission issue goes out the window. That's just two random policies that a system may want to enforce. To do #1 you would likely have to give one of U, G, or O +w to the subtree for that user, but that's a broken definition because hidden kernel policy will revoke writes by other processes, even though they have the proper U or G or fall into O. In the end, a cgroup would spout many files to cover all sorts of policy combinations, and the kernel developers will be left with a monstrosity that can't be fixed because next time the same cries about API will arise.

A filesystem hierarchy is not suited to this complexity. It will have to be contorted into all kinds ways where the permissions scheme (even with ACLs) doesn't match traditional behavior.

It's far preferable to take the LSM approach. The kernel provides the primitives throughout the kernel subsystems and talks to another module to establish and enforce policy. It's true that LSM modules are kernel modules, but they remain separate from the LSM code. (I wouldn't have a problem with a cgroup manager living as a kernel module, but I don't see any inherent benefit either.)

Shuttleworth: Losing graciously

Posted Feb 19, 2014 20:36 UTC (Wed) by HelloWorld (guest, #56129) [Link] (3 responses)

> * Only allow PID X to manage cgroup subtree /A/B/C.
> * Processes in subtree /A/B/C/ may move their processes in /D/E/F but should not have any further control.
>
> Traditional file permissions and even ACLs are incapable of dealing with either situation.
But similar restrictions might also make sense for other kinds of hierarchically organised objects. So why not generalise the existing access control mechanisms to allow for things like that instead of inventing something new?

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:28 UTC (Wed) by fandingo (guest, #67019) [Link] (1 responses)

The only existing mechanism that could possibly be used would be an LSM. That probably wouldn't be a bad approach.

I don't think that there's any way that traditional permissions, even with ACLs, could be massaged into giving the necessary flexibility and clean interfaces.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:46 UTC (Wed) by dlang (guest, #313) [Link]

the thing is, existing LSMs know how to deal with permissions to filesystem objects. SELinux and AppArmor work on the existing cgroups interfaces today (as Cyberax has noted).

Plus there is the entire extended ACL structure thats available (but very seldom used because it's not needed)

On Linux, permissions for filesystem objects have not been limited to the unix wrx bits for a long time.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:49 UTC (Wed) by vonbrand (subscriber, #4458) [Link]

So the solution to the problem that the API isn't well known/standard is to create another totally new, in practice untested, "general hierarchical security model" to be applied across the board to anything with a hierarchical structure. That sounds much, much harder to do right to me.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:00 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link] (4 responses)

>* Only allow PID X to manage cgroup subtree /A/B/C.
>* Processes in subtree /A/B/C/ may move their processes in /D/E/F but should not have any further control.
Then they should talk to a some kind of privileged program that can do this. Traditional UNIX used suid programs for that, and it totally makes sense to use something like cgmanager/systemd for this.

However, such situations are not really normal. In particular, changing levels in cgroups hierarchy is not a trivial operation - new subtree might have limits that the subtree which is being moved already exceeds.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:32 UTC (Wed) by fandingo (guest, #67019) [Link] (3 responses)

> However, such situations are not really normal. In particular, changing levels in cgroups hierarchy is not a trivial operation - new subtree might have limits that the subtree which is being moved already exceeds.

That's really a question of policy, though. If the policy says that some processes need to move to /D/E/F, then they need to go there, regardless of what the resource controllers say. (I'd argue that the process should be moved first, and then the resource controller terminates processes to get back into proper configuration. I don't think that it is acceptable to leave a process in the wrong the subtree.)

It's worth noting that systemd-login, which is not PID 1, does the second action today on user login.

> Then they should talk to a some kind of privileged program that can do this. Traditional UNIX used suid programs for that, and it totally makes sense to use something like cgmanager/systemd for this.

If there are acknowledged shortcomings of cgroupfs, shouldn't the API be changed to support all reasonable actions? Why should the kernel keep interfaces that clearly have shortcomings that cannot be resolved without massive API incompatibilities?

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:40 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link]

> That's really a question of policy, though. If the policy says that some processes need to move to /D/E/F, then they need to go there, regardless of what the resource controllers say.
You policy might say that a process can use 100G of RAM, but that's not going to help you if you only have 500Mb. Right now if you try to do this trick the kernel simply kills the over-limit processes.

> If there are acknowledged shortcomings of cgroupfs, shouldn't the API be changed to support all reasonable actions? Why should the kernel keep interfaces that clearly have shortcomings that cannot be resolved without massive API incompatibilities?
Like filesystem interface? Perhaps we should switch to DBUS instead of using open()?

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:43 UTC (Wed) by dlang (guest, #313) [Link] (1 responses)

> If there are acknowledged shortcomings of cgroupfs, shouldn't the API be changed to support all reasonable actions? Why should the kernel keep interfaces that clearly have shortcomings that cannot be resolved without massive API incompatibilities?

backwards compatibility would be a reason for keeping poor interfaces, but if you are going to break them, then you need to do so once, not multiple times.

And the new systemd API is just that, a systemd API, by definition it doesn't deal with use cases that don't use systemd.

and the currently proposed single-writer API in known to not support all use cases, so why should it replace the existing API?

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:59 UTC (Wed) by fandingo (guest, #67019) [Link]

> and the currently proposed single-writer API in known to not support all use cases, so why should it replace the existing API?

You are talking about the Google multi-hierarchy complaint, right? The cgroups maintainers are on record as saying that this is not reasonable, and they intend to eliminate it.

> backwards compatibility would be a reason for keeping poor interfaces

The cgroups developers believe them to be broken, not poor. Plus, they get in the way of fixing cgroups since cgroupfs leaks too many implementation details.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 19:04 UTC (Wed) by jspaleta (subscriber, #50639) [Link] (2 responses)

I believe Tejun directly speaks a lot of this in 2012 discussion.

Right now... its a mess. He even comments on the fact that the gentlemen's agreement in the form of PaxControlGroup shows exactly how problematic trying to support multiple writers actually is right now in the multi-heiarchy cgroups because they aren't all multi-heiarchy aware. You can't actually use all the controllers in a multiple writer fashion without them step on each other. Caveats abound. He even correctly notes that the way it works quite well right now for hand crafted setups... where one human has crafted all the cgroup interactions via scripting and its effectively "single writer" in a sense. But once you tack on automation or applications which want to make use of cgroups side-by-side with other applications or other automation or even hand-craft scripts... you run into problems. PAXControlGroups details the in and outs of those problems.

So as part of making room to clean up all the problems the single userspace writer will be mandated in the middle term of the kernel side work to make a flat hierarchy api. I'm not even sure the plan is to require that single writer model forever. I believe the plan right now is to stop pretending that cgroups and all the controllers work with the multiple writer model while that flat hierarchy is being developed and controllers are all reworked to correctly support that new model.

I also think you need to in mind that Tejun also mentions a pie-in-the-sky goal of merging cgroups into the process hierarchy.

Look I think the real issue here is that until the new work (both kernel and userspace) has progressed further along, there are going to be existing use cases that only served by the deprecated API. The kernel developers have always recognized this, from the start of the discussion.
I think the discussion here is only filling in the details of what Tejun knew were local admin policy scripting centric use cases that would be impacted in the short to mid term. Tejun clearly states the choices on how to proceed involved a trade-off that would impact some existing use cases.

The real question is, will the linux distribution vendors continue to provide userspace solution which support the old API as an option? I have no idea on that. I know that libvirt for example has a transition plan in place to support the older cgroupfs if the systemd D-Bus API is not available on the host. But I have no idea if any distribution vendors are going to expose a configuration option to pick which cgroups API to use when mounting cgroups. So Cyberax should probably start making the case to his distribution vendors about supporting his use case by making it possible to choose to run the deprecated cgroupfs API in the future, so long as the API is available and not pulled from the kernel the vendor ships.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:56 UTC (Wed) by dlang (guest, #313) [Link]

Everyone is in agreement that the old multiple hierarchy approach is a problem and that all controllers need to use the same hierarchy.

But using that as justification for a single-writer model doesn't compute, what does one have to do with the other?

> I'm not even sure the plan is to require that single writer model forever.

I agree with you that the kernel developers have said this, I just can't find a quote easily to back this up.

But if this is the case, having systemd take complete control and then defining a complex DBUS interface to be used for delegation strikes me as a very bad thing to do 'temporarily'

> The real question is, will the linux distribution vendors continue to provide userspace solution which support the old API as an option?

is systemd going to even allow this? or is systemd going to say that it's broken if the new interface isn't there?

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:12 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link]

Sure, multiple hierarchies writers will have to be rewritten. That's totally OK because the old cgroups API definitely needs to be fixed with a jackhammer. Nobody argues that. It's also clear that the unified tree will make some use-cases impossible, at least for now.

And that's OK - there _are_ valid technical reasons why the current multiple hierarchies model is broken. These reasons are clearly spelled out in scores of mailing list messages.

For example, there are problems with memory accounting and blkio and that's why blkio can't account for buffered writes right now.

The switch to the single-writer model, however, has no such rationale. There are literally NO arguments at all for it that I can find. One would think that unfixable security problems deserves at least a mailing list message, but there's literally _nothing_ there.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 19:13 UTC (Wed) by smurf (subscriber, #17840) [Link] (2 responses)

> It very much sounds like the kernel cgroups people simply don't
> want to have to work out the details of what is needed, and so
> want to punt it to user-space. It sounds almost a social problem,
> more than a technical one.

Surprise: You are exactly right. The kernel people do not WANT to set policy there because the requirements are unknown / too diverse / we don't have much experience what we actually need in complex real-world scenarios / take your pick.

We do know that a user/group scheme will not work: you can nest namespaces, and the process which sets up the outer namespace's access rights does not know which user IDs will eventually end up being mapped to the processes inside these (sub)containers. This is (probably) why cgroupfs does not have ACLs: they'd be insufficient anyway.

It's not the kernel's job to set policy. It's the kernel's job to facilitate a stable ABI, and leave policy to user space where it belongs.

Besides, access rights are insufficient for another reason. Suppose you want to limit users' memory usage to 100MB each; if they want to have 1GB of main memory they can -- but only two at the same time, and only for 10 minutes.

This is not a particularly exotic requirement for a multiuser system. If somebody adds code for that kind of thing to their favorite cgroupsmanager program, no problem whatsoever. In the kernel? don't even think of doing that.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:06 UTC (Wed) by dlang (guest, #313) [Link]

> Besides, access rights are insufficient for another reason. Suppose you want to limit users' memory usage to 100MB each; if they want to have 1GB of main memory they can -- but only two at the same time, and only for 10 minutes.

> This is not a particularly exotic requirement for a multiuser system. If somebody adds code for that kind of thing to their favorite cgroupsmanager program, no problem whatsoever. In the kernel? don't even think of doing that.

what multiuser system supports these sorts of limits today? If you are claiming that they are not unusual, that must mean that something common supports them.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:22 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link]

> Besides, access rights are insufficient for another reason. Suppose you want to limit users' memory usage to 100MB each; if they want to have 1GB of main memory they can -- but only two at the same time, and only for 10 minutes.
Sure. And the current delegation-based interface works perfectly fine in this scenario.

First, you create this hierarchy: root/user1/delegate, root/user2/delegate. Then you set memory limits on them and start a daemon that does the balancing act. This daemon should have permissions to change 'user1' and 'user2' hierarchies.

But nobody stops you from making 'delegate' directories writable for the users! They won't be able to affect the settings in the parent levels of cgroups, and they'll limited by them.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:55 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link] (3 responses)

Except that delegation to untrusted processes is not inherently dangerous. Cgroups (by design!) limit what the children processes can do by setting limits on their parents. Well, except for the broken blkio controller that is being fixed anyway.

I'm pretty familiar with the current cgroups interface and it seems that an untrusted process _at_ _most_ can cause high load on the kernel and perhaps significantly slow down other processes.

There's also a small problem with several controllers which use weights to distribute resources, so it's possible for a cgroup to affect its siblings. But again, that's trivially worked around by using an intermediary tree level if one wants to delegate a subtree to untrusted processes.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:15 UTC (Wed) by fandingo (guest, #67019) [Link] (2 responses)

> it seems that an untrusted process _at_ _most_ can cause high load on the kernel and perhaps significantly slow down other processes.

Only if that process is unprivileged. If you have a service that runs a privileged process (like the parent PID of Apache or OpenSSH), it can modify any part of the cgroup hierarchy.

A single-writer model (especially if the writer is PID 1) with policy enforcement precludes this behavior. Even a privileged user would not be able to gain authorization to perform cgroup changes outside what the policy allows (like managing its subtree). Furthermore, a privileged user couldn't even connect to the kernel cgroup API directly, because a writer is already registered, and if it's PID 1, cannot be crashed in order to register a malicious writer.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:20 UTC (Wed) by dlang (guest, #313) [Link]

existing LSMs can block access to cgroups by even root processes today.

or you can play games with the PID namespace so that those processes are only root within their limited context, not for the whole systems.

But if you are concerned about a malicious root process, the fact that it can change cgroups settings seems like a pretty minor thing to worry about.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:25 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link]

> Only if that process is unprivileged. If you have a service that runs a privileged process (like the parent PID of Apache or OpenSSH), it can modify any part of the cgroup hierarchy.
It might as well simply do 'chmod -R a+r+w+x /' to the same effect.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:48 UTC (Wed) by dlang (guest, #313) [Link] (3 responses)

by the way, the new interface apparent;y isn't actually limited to being accessed by a single process, a group of coorperating processes can be used instead.

this sounds like an even bigger problem to me if the group of coordinating processes don't coordinate well, nothing in ther kernel can know that they aren't and big problems can result.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 22:53 UTC (Wed) by jspaleta (subscriber, #50639) [Link] (2 responses)

Are you sure about that? Can you provide me instructions on how to do that when the cgroups is mounted with the new API?

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:09 UTC (Wed) by dlang (guest, #313) [Link] (1 responses)

I don't know how to do it, but I saw something posted in the last week or so (I think on lwn related to systemd) that stated that this was the case.

Shuttleworth: Losing graciously

Posted Feb 19, 2014 23:14 UTC (Wed) by Cyberax (✭ supporter ✭, #52523) [Link]

Lennart (who is known as 'mezcalero' here) meant that systemd can give other processes access to a subtree, through systemd's interface.

That doesn't change the fact that only one process can do direct cgroups manipulations.

Shuttleworth: Losing graciously

Posted Feb 20, 2014 18:50 UTC (Thu) by jspaleta (subscriber, #50639) [Link]

Just so I'm clear.

Your plan is to have logind updated to use cgmanager in the 14.04 release?

Are you going to be making use of the feature freeze exception to land that new logind code?

Is thre any in-development new logind code public in any version control branch yet?

Shuttleworth: Losing graciously

Posted Feb 14, 2014 19:53 UTC (Fri) by cox (guest, #93721) [Link]

Kudos to Mark for this decision.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 19:58 UTC (Fri) by kugel (subscriber, #70540) [Link] (1 responses)

Wow, I didn't see THAT coming.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 23:59 UTC (Fri) by cesarb (subscriber, #6266) [Link]

Me neither. The "soap opera" designation was spot on.

I guess we already know what one of the front page slots of the next weekly LWN will be about.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:02 UTC (Fri) by mgb (guest, #3226) [Link] (6 responses)

A brilliant move by Shuttleworth.

He gets the Debian volunteers to spend tens of thousands of person hours changing Debian's plumbing layer to match Red Hat's. And then Ubuntu makes money by moving into RHEL's markets.

And while Debian volunteers are painting bike sheds for systemd, the Debian distro loses ground on every other front.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:11 UTC (Fri) by jubal (subscriber, #67202) [Link] (5 responses)

oh man, you do live in a world of cabals…

Shuttleworth: Losing graciously

Posted Feb 14, 2014 21:29 UTC (Fri) by mgb (guest, #3226) [Link] (4 responses)

Logic is the opposite of unsubstantiated conspiracy theories.

Your cabal hypothesis might suppose that Poettering had planned for this. But there's no need for cabals and conspiracy theories.

Business logic says that Shuttleworth will graciously profit from the gift of tens of thousands of hours of bike-shedding by Debian volunteers which the TC just donated.

Shuttleworth: Losing graciously

Posted Feb 14, 2014 23:05 UTC (Fri) by rahvin (guest, #16953) [Link] (3 responses)

Ubuntu benefits from the work of Debian in everything they do. Without Debian there is no Ubuntu because Ubuntu themselves do very little of the work that actually makes up the bulk of the Ubuntu distribution, work that is done by the Debian community.

Though I know better than to reply to such trollish comments so I will leave it at that.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 9:02 UTC (Sat) by misc (subscriber, #73730) [Link] (2 responses)

The part about "Debian will do the work on the plumbing so Ubuntu doesn't" is quite credible, the part about "Debian is losing ground due to the debate" is not.

I am pretty sure that Ubuntu will not switch until Debian do it and Upstart will be kept for at least 3 Ubuntu releases (ie, maybe systemd would be default on 16.04, if Canonical still exist by that time)

Shuttleworth: Losing graciously

Posted Feb 15, 2014 10:56 UTC (Sat) by smurf (subscriber, #17840) [Link]

Kept, maybe, but I don't think it'll be the default after Debian's jessie is released. i.e. after Ubuntu 13.10.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 12:22 UTC (Sun) by Thue (guest, #14277) [Link]

> I am pretty sure that Ubuntu will not switch until Debian do it and Upstart will be kept for at least 3 Ubuntu releases (ie, maybe systemd would be default on 16.04, if Canonical still exist by that time)

The main point of the non-LTS Ubuntu releases seems to be to act as betas for the LTS. So it will be extremely important to get at least one non-LTS Ubuntu version (and hopefully more) to use systemd, so that the systemd support in 16.04 has been well tested.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 3:26 UTC (Sat) by djzort (guest, #57189) [Link] (2 responses)

Redhat needs to build off of Debian rather than faildora.

Debian also needs to get themselves more sponsors.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 4:13 UTC (Sat) by mathstuf (subscriber, #69389) [Link]

Out of curiosity, why do you call it "faildora"?

Shuttleworth: Losing graciously

Posted Feb 15, 2014 20:30 UTC (Sat) by Wol (subscriber, #4433) [Link]

Hmmm...

Then what are those of us who don't like Debian supposed to do? For the record, EVERY time I've tried to use Debian or a derivative, it's been a total nightmare.

Mind you, if Red Hat did change it wouldn't make any difference to me - I don't go near that either.

Cheers,
Wol

Shuttleworth: Losing graciously

Posted Feb 15, 2014 7:10 UTC (Sat) by thomas.poulsen (subscriber, #22480) [Link] (6 responses)

Wow!
This is great leadership indeed.
I hope Mark gets the credit he deserves from this.
He obviously does not have a NIH-problem.

I guess it also bears well on the Mir/Wayland issue.
Upstart was needed at the time to break new grounds.
Systemd was able to learn from upstart and do even better.
This is competition and evolution at work.

With Mir and Wayland the order of events is reversed.
Mir can learn from Wayland and perhaps become even better.

How lucky we are to live such interesting times.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 10:15 UTC (Sat) by hunger (subscriber, #36242) [Link]

I doubt that this has any relevance for Wayland/Mir.

The Wayland/Mir situation is altogether very different from the upstart/systems one.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 15:49 UTC (Sat) by HelloWorld (guest, #56129) [Link] (4 responses)

> Wow!
> This is great leadership indeed.
> I hope Mark gets the credit he deserves from this.
> He obviously does not have a NIH-problem.
That's like saying that somebody doesn't have an alcohol problem because he only has one bottle of booze for breakfast instead of two.

> I guess it also bears well on the Mir/Wayland issue.
> Upstart was needed at the time to break new grounds.
> Systemd was able to learn from upstart and do even better.
> This is competition and evolution at work.
Not really. AIUI, the main inspirations for systemd were drawn from SMF and launchd. Lennart Poettering had this to say about Upstart: “Upstart fails hard on that, there's nothing to learn from that, there's no inspiration to get from that”.

> With Mir and Wayland the order of events is reversed.
> Mir can learn from Wayland and perhaps become even better.
There has been *no* meaningful technical criticism against Wayland from the Mir developers. The only thing Mir does is make life harder for Toolkit and application developers. That's not competition or evolution, that's NIH.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 18:17 UTC (Sat) by thomas.poulsen (subscriber, #22480) [Link] (3 responses)

>> He obviously does not have a NIH-problem.
>That's like saying that somebody doesn't have an alcohol problem because he only has one bottle of booze for breakfast instead of two.

I find it strange that you and others here express so strong negative feelings for Mark.
As I see it, he is leading a great distribution, that is bringing Linux to a lot of people.

>There has been *no* meaningful technical criticism against Wayland from the Mir developers. The only thing Mir does is make life harder for Toolkit and application developers. That's not competition or evolution, that's NIH.

As I remember history, he wanted to use Wayland:
http://www.markshuttleworth.com/archives/551

I can understand if he wants something, where he can control the pace of progress. Alternative displayservers are not yet a commodity - as alternative init-systems were not at the time of Upstart. And it is pretty central to what Canonical is doing right now on the phones.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 18:58 UTC (Sat) by tsmithe (guest, #57598) [Link]

In particular, he says at that link that

>> we evaluated the cost of building a new display manager, informed by the
>> lessons learned in Wayland. We came to the conclusion that any such
>> effort would only create a hard split in the world which wasn’t worth
>> the cost of having done it. There are issues with Wayland, but they seem
>> to be solvable, we’d rather be part of solving them than chasing a
>> better alternative. So Wayland it is.

Very interesting. I really would love to know what changed.

Shuttleworth: Losing graciously

Posted Feb 15, 2014 19:48 UTC (Sat) by HelloWorld (guest, #56129) [Link] (1 responses)

> As I remember history, he wanted to use Wayland:
> http://www.markshuttleworth.com/archives/551
Yes, he made some friendly noises at the beginning. And then he didn't do *anything* to contribute to Wayland at all. Canonical developers didn't engage in protocol design or requirements engineering, graphics driver development, toolkit porting or anything else that would be remotely helpful.
Instead, he announced Mir, which doesn't solve any problems Wayland doesn't solve, has a CLA and contributes to the fragmentation that is already holding Linux back. In total, I think Canonical does more harm than good to the Linux ecosystem.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 1:11 UTC (Sun) by mathstuf (subscriber, #69389) [Link]

> In total, I think Canonical does more harm than good to the Linux ecosystem.

Maybe the development side, but certainly not the user side. How many people have been exposed to Linux only through Canonical? Who would have done something similar in a similar timeframe? My guess: no one.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 8:56 UTC (Sun) by smorovic (guest, #52892) [Link] (3 responses)

Logical move. Debian switch was the end of upstart. Although they still have to decide what to do with the other kernels. I guess optional support for other init systems is a way to go, with systemd API compatible equivalent(s) emerging after a while. I see no other long term option as even UIs start to increasingly depend on systemd, without much thought for other inits (GNOME is anyway strongly affected by what Red Hat goals are, their employees are not bound to spend time on supporting non-Linux). Long term mantaining patches against a moving target can be better spent in adding things like cgroups to these kernels.

Shuttleworth: Losing graciously

Posted Feb 16, 2014 12:18 UTC (Sun) by Thue (guest, #14277) [Link] (2 responses)

> with systemd API compatible equivalent(s) emerging after a while

kfreebsd and hurd has less than 1/1000 of the userbase of Linux. Why do you think they will have the development resources to create an API-compatible equivalent?

Shuttleworth: Losing graciously

Posted Feb 16, 2014 19:55 UTC (Sun) by Wol (subscriber, #4433) [Link] (1 responses)

Did you see the post about sysvinit?

How sysvinit is not portable?

If they can create a compatibility layer such that sysvinit can run, it's probably LESS work (once you take the failings of sysvinit into account) to create a systemd compatibility layer than to maintain sysvinit.

Anyways, I gather BSD has its own init ...

Cheers,
Wol

Shuttleworth: Losing graciously

Posted Feb 16, 2014 22:16 UTC (Sun) by Thue (guest, #14277) [Link]

Without actually knowing any of the code, it does sound like systemd uses more Linux-specific interfaces than sysvinit does. kdbus, for starters. Adding support for systemd to kfreebsd would therefore be more difficult than it was for sysvinit.