Topics in live kernel patching

By Jonathan Corbet
November 14, 2016

Getting live-patching capabilities into the mainline kernel has been a multi-year process. Basic patching support was merged for the 4.0 release, but further work has been stalled over disagreements on how the consistency model — the code ensuring that a patch is safe to apply to a running kernel — should work. The addition of kernel stack validation has addressed the biggest of the objections, so, arguably, it is time to move forward. At the 2016 Linux Plumbers Conference, developers working on live patching got together to discuss current challenges and future directions.

This article is not an attempt at a comprehensive summary for a half-day of fast-moving discussion; instead, the goal is to cover some of the more interesting topics as a way of showing that challenges that the live-patching developers must overcome and how they plan to get there.

Unhelpful optimizations

A smart optimizing compiler is necessary for anybody who wants to get reasonable performance from their code, but problems arise if the compiler gets too smart. Developers working with concurrency in the kernel have had to worry about aggressive optimizations for some time; according to Miroslav Benes, live-patching developers have to worry as well. Compiler optimizations can change how code is compiled in subtle ways that can lead to mayhem when a patch is applied.

Starting with the easiest problems before moving on to the trickier ones, Benes noted that the automatic inlining of functions can be a problem if an inlined function must be patched. In that case, the solution is relatively easy; all callers of the function must be changed in the resulting live patch. The -fpartial-inlining option can complicate things by only inlining portions of functions, but it doesn't change the basic nature of the problem.

The -fipa-src option is a bit more subtle, in that it can lead to the removal of unused function parameters or change the way in which parameters are passed into a function. In other words, it changes the ABI of the function in response to its observations on how the function works. A live patch to that function could change how this optimization operates, leading to a surprising change in the ABI. The good news here is that, when this happens, GCC will change the name of the compiled function, so the broken ABI is immediately obvious. But this can prevent the direct patching of a buggy function; callers must be patched as well.

Code compiled with -fipa-pure-const may change in response to how a function operates; if a function is seen as not accessing memory, the compiler will make assumptions about the state of memory before and after calling it. If a patch changes the function's behavior, those assumptions may no longer hold; once again, it will be necessary to patch callers when this happens.

An "even crazier" option is -fipa-icf, which performs identical code folding. It can cause a function to be entirely replaced by an equivalent found elsewhere in the code, and it can be hard to detect that this change has happened. Code folding is also a problem for the kernel's stack unwinder. Other types of code elimination can happen if GCC thinks that a specific global variable won't change over a given function call. If the function is patched to now change the global variable, the calling code may well be incorrect. This sort of change, too, is hard to detect; it would be nice, he said, to have a GCC option to ask it to create a log of the optimizations it has done.

Perhaps the scariest option is -fipa-ra, which tracks the registers used by called functions and avoids saving those that will not be changed. A patch to the called function could easily cause it to use a new register, leading to data corruption in the calling functions and a likely significant reduction in the continuous uptime that live-patching users were hoping to enjoy. This optimization is hard to detect; it can be thought of as an ABI change for the called function, but no name changes are made. This one, he said, is "not good news." For now, this optimization is disabled by GCC when -pg is turned on, and the Ftrace subsystem, needed for live patching, needs -pg. But there is no inherent reason why those two options need to be incompatible, so this behavior could change at any time.

This list, Miroslav said, is only a small subset of the optimizations that can create problems for live patches. As compiler developers pursue increasingly aggressive optimizations, this problem is only going to get worse.

Patch building

The kernel has a standard way to apply a live patch, but there is not, yet, any sort of mainlined mechanism for the creation of live patches. Josh Poimboeuf gave a brief summary of the patch-creation tools out there with an eye toward picking one for upstream.

The first of these is kpatch-build. It works by building the kernel both with and without the patch applied, then does a binary diff to see which functions changed. All of the changed functions are then extracted and packaged up into a "Frankenstein kernel module" that is shipped with the live patch. It is a powerful system, he said, with a number of advantages, including the fact that it automatically deals with most of the optimization issues mentioned in the previous talk.

On the other hand, kpatch-build is quite complex. It has to know about all of the special sections used by the kernel, and it has problems with certain kinds of changes. It only works on the x86_64 architecture at the moment; all of those special sections differ across architectures, so turning it into a multi-architecture tool will not be easy. And, he said, kpatch-build is brittle and a maintenance nightmare.

An alternative is to just use the regular kernel build system and its module-building infrastructure. The changed function is copied and pasted into a new module, some boilerplate is added to register the function with the live-patching API, and the job is done. It's easy, but has its own problems; in particular, this module is unable to access non-exported symbols, which the patched function may need to do. This problem can be worked around by using kallsyms_lookup_name(), but this solution is error-prone, slow, and "yucky."

The third alternative is new; indeed, he posted the proposal the week before the conference. This alternative uses the copy-and-paste approach, but adds an API and a postprocessing tool that allows the generated module to gain access to non-exported symbols. The code works now, though there are a number of possible improvements, including automating the process of attaching to non-exported symbols and detecting interference from compiler optimizations.

In the brief discussion at the end of the talk, it became clear that there were not a lot of concerns about the new tooling, so that is the direction things seem likely to go.

Module dependencies

Live patches can make changes to loadable modules, which leads to an interesting question: what happens if the module isn't present in the system when a patch is applied, but is loaded afterward? The live-patching code currently has some complicated infrastructure designed to detect this case and apply patches to modules as they are loaded. Jessica Yu, who has just taken over as the maintainer of the loadable module subsystem in the kernel, talked briefly about changing this mechanism to require that alle affected modules be loaded before a live patch is applied.

Live patches are, themselves, loadable modules. Allowing a patch module to be loaded before any modules it affects requires carrying a fair amount of information and complex infrastructure, and it circumvents the normal module dependency mechanism. As a result, there is a fair amount of code duplication, including a reimplementation of much of the module loader in the live-patching code.

There are a couple of ways that things could be changed. One would be to simply require that all modules being patched be loaded before the patch itself is loaded. That would work, but it forces the loading of code into the kernel that is unneeded and may never be used on any given site. The alternative would be to split the live-patch module into multiple pieces, each of which applies a patch to a single kernel module. Then, only the pieces that are relevant to any given running system need to be loaded.

Making this change would simplify the live-patching code and reduce code duplication, but there's a problem: there isn't an easy way to force a necessary patch module to be loaded when a module needing patching is loaded. The depmod tool just doesn't recognize that sort of dependency. FreeBSD has a nice MODULE_DEPEND() macro, but Linux has never needed that infrastructure.

Splitting the patch module, it turns out, could be problematic for any sort of wide-ranging change. CVE-2016-7097 was mentioned as an example; it included a virtual filesystem layer API change that had to be propagated to all filesystems. If it were to be split apart, the result would be a long list of modules that would need to be loaded to apply the patch.

There was a lively discussion on whether the rules concerning live patches for modules should be changed, much of it focused on a question asked by Steve Rostedt: if a module isn't present in the kernel, why not just fix it on disk rather than lurking in the kernel, waiting to patch it should it ever be loaded? Jiri Kosina replied that replacing on-disk modules would be hard from a distributor's point of view; it would introduce modules that no longer belong to the kernel package. Live patches can also be disabled; in that case, modified modules would have to be somehow restored. Some consistency models can also create trouble; it is possible to have both the pre-patch and post-patch code live and running in the kernel at the same time. So it's not obvious that fixing things on-disk is a workable solution, though Rostedt was adamant that it should be considered.

As the discussion wound down, it became fairly clear that the consensus was against changing how module dependencies work in live patching. The mechanism that the kernel has now, in the end, works well enough; it looks like it will not be going away anytime soon.

Index entries for this article
Kernel	Live patching
Conference	Linux Plumbers Conference/2016

Topics in live kernel patching

Posted Nov 15, 2016 9:05 UTC (Tue) by mjthayer (guest, #39183) [Link] (2 responses)

It would seem to me to make sense to try to modularise the innards of the kernel somewhat into larger blocks designed to be replaced as wholes, rather than trying to fiddle around gcc optimisations. I am sure it would not be easy either of course.

Regarding replacing modules on disk, why not just have separate directories for override modules from live patches? That has been possible for a long time, possibly even from the beginning of loadable kernel modules.

Topics in live kernel patching

Posted Nov 17, 2016 18:32 UTC (Thu) by intgr (subscriber, #39733) [Link] (1 responses)

> larger blocks designed to be replaced as wholes, rather than trying to fiddle around gcc optimisations

As the article mentions, the patch build system can safely cope with with "most of the optimization issues". If optimisations applied to a function propagate to callees or elsewhere, then those call sites will be picked up by the diff as well.

Are there actually any optimisations that break with this approach?

Topics in live kernel patching

Posted Nov 19, 2016 18:06 UTC (Sat) by mjthayer (guest, #39183) [Link]

> As the article mentions, the patch build system can safely cope with with "most of the optimization issues". If optimisations applied to a function propagate to callees or elsewhere, then those call sites will be picked up by the diff as well.

Not quite sure what you are saying there. Do you mean that the current approach is already good enough, or are you questioning whether it is? It sounds rather fragile to me, in the category "it works fine until it doesn't".

Architectures

Posted Nov 15, 2016 15:53 UTC (Tue) by fratti (guest, #105722) [Link] (5 responses)

I am surprised there is a port in the works for arm64, but not SPARC. I'd have guessed users running SPARC were more interested in livepatching, considering the sorts of applications SPARC is usually found in.

Architectures

Posted Nov 16, 2016 7:40 UTC (Wed) by k8to (guest, #15413) [Link] (4 responses)

Perhaps you didn't notice how Oracle has been fairly effectively killing the popularity of the former Sun's hardware? I mean maybe in the space you operate, it's still clinging on, but in the *many* IT spaces I've touched in that timeframe, Solaris & Sparc have become purely legacy.

Meanwhile, arm is growing. You can get cloud services instances running on arm these days.

Architectures

Posted Nov 17, 2016 14:11 UTC (Thu) by RCL (guest, #63264) [Link] (3 responses)

Where? Serious question since I'd like to give ARM a try for servers, but the support among cloud providers seems to be non-existent. A few small startups get oversubscribed apparently and just put you in the line to be informed "when hardware is available".

Architectures

Posted Nov 19, 2016 11:43 UTC (Sat) by nyfle (guest, #72967) [Link]

In the absence of a reply, I thought I'd add my 2p worth:

Scaleway - https://www.scaleway.com

Architectures

Posted Nov 19, 2016 21:59 UTC (Sat) by mmendez (subscriber, #81435) [Link] (1 responses)

Also checkout packet.net's just released Type-2A servers (2x48 core Cavium ThunderX processors) https://www.packet.net/bare-metal/servers/type-2a. Hard to get one right now as they are being scooped up very quickly, but we are going to be bringing in more online.

Architectures

Posted Nov 20, 2016 18:48 UTC (Sun) by jem (subscriber, #24231) [Link]

This reminded me of the LWN article "Creating a kernel build farm" from Oct 5 (https://lwn.net/Articles/702375/). Does anyone have insight into the economics of using this solution instead of "going small"?

Packet.net advertises a price of USD 0.5 per hour. You'll get quite a lot of hours for the price of four MiQi boards (16 A17 cores total) plus all the extra necessary gear (switch, power supply, cabling, etc).

Topics in live kernel patching

Posted Nov 16, 2016 3:37 UTC (Wed) by fandingo (guest, #67019) [Link]

Who is interested in this feature? Seriously. It's an awful can of worms, and we all know that's it's never going to deliver the compatibility or reliability requirements of people who would be interested.

> changing this mechanism to require that all affected modules be loaded before a live patch is applied.

*Tries not to laugh hysterically.*

> Splitting the patch module, it turns out, could be problematic for any sort of wide-ranging change. CVE-2016-7097 was mentioned as an example; it included a virtual filesystem layer API change that had to be propagated to all filesystems. If it were to be split apart, the result would be a long list of modules that would need to be loaded to apply the patch.

It's almost like the kernel uses subsystems that share core functionality and build complexity on top of each other.

> There was a lively discussion on whether the rules concerning live patches for modules should be changed, much of it focused on a question asked by Steve Rostedt: if a module isn't present in the kernel, why not just fix it on disk rather than lurking in the kernel, waiting to patch it should it ever be loaded? Jiri Kosina replied that replacing on-disk modules would be hard from a distributor's point of view; it would introduce modules that no longer belong to the kernel package.

I don't understand this issue at all. The distro's package management tool will invoke the live patching tool, no? Whatever package controls that module should be able to handle this without issue. The only remaining issue is locking out module loading during the update process. Afterwards, the updated module should be loadable from disk if ever needed.

> The alternative would be to split the live-patch module into multiple pieces, each of which applies a patch to a single kernel module. Then, only the pieces that are relevant to any given running system need to be loaded.

So how would one use `modprobe -r` to remove module X after live patching? Do I remove the patch module first and then the module, or vice versa? How does that affect system stability? Presumably systems using live patching have really long uptimes, so what happens if there's a dozen patch modules on top of the original module?

> He did raise a few larger questions, though. One of those is expanding live patching to user-space code as well; there are evidently, users who are interested in that capability.

The whole endeavor seems academic without reliable mechanisms for patching the complete software stack. We've already seen the difficulties package managers have of reliably applying system updates in https://lwn.net/Articles/702629/.

> He asked: what are the benefits of using live patching rather than performing a live cluster update? If a cluster can be taken down and upgraded one machine at a time, there is no real need for a live-patching infrastructure. We don't all run clusters, but users whose uptime needs make them consider live patching maybe should be using clusters.

He's pointing to a deeper problem of attesting live patches and more fundamentally the trustworthiness of a mutable kernel. Sure, loadable kernel modules inherently undermine the attestation of a system, but live patching substantially complicates security.

Bingo.

"Your scientists were so preoccupied with whether or not they could, they didn’t stop to think if they should."

Topics in live kernel patching

Posted Nov 18, 2016 15:49 UTC (Fri) by unixbhaskar (guest, #44758) [Link]

Whoa! lots of thing going on . But I believe it should see the daylight sooner than later.

Topics in live kernel patching

Posted Nov 21, 2016 7:48 UTC (Mon) by eduard.munteanu (guest, #66641) [Link]

There is one alternative which does not seem to be mentioned: asking the one true authoritative source. That is *drumrolls*... the compiler. It is practically the only thing which could possibly have a clear view of code semantics, so the creation of binary patches would best be handled there. It's probably a large project especially for GCC, but it's the right thing to do in the long run.

Topics in live kernel patching

Posted Nov 30, 2016 17:45 UTC (Wed) by nix (subscriber, #2304) [Link]

> This sort of change, too, is hard to detect; it would be nice, he said, to have a GCC option to ask it to create a log of the optimizations it has done.

This is... ah, impractical, because there is really no boundary between 'optimizations' and 'compilation': there are just a long series of transformations that convert the source into the output. Some of these transformations are optional, but you surely don't want to log all of those (and in how much detail? many optimizations change their output depending on which optimizations have run before them...). As a first step, the code added for LTO which records the state of the compiler (incl compilation flags) into the (LTO streamer) output might be a good start: combine that with -frandom-seed= and you can pretty much guarantee reproducible output between runs, which is probably the best you can hope for here.

Topics in live kernel patching

Unhelpful optimizations

Patch building

Module dependencies

Other topics

Topics in live kernel patching

Topics in live kernel patching

Topics in live kernel patching

Architectures

Architectures

Architectures

Architectures

Architectures

Architectures

Topics in live kernel patching

Topics in live kernel patching

Topics in live kernel patching

Topics in live kernel patching