|| ||Ingo Molnar <mingo-AT-elte.hu> |
|| ||Thomas Gleixner <tglx-AT-linutronix.de> |
|| ||Re: [PATCH 3/5] v2 seccomp_filters: Enable ftrace-based system call
|| ||Wed, 25 May 2011 17:01:53 +0200|
|| ||Peter Zijlstra <peterz-AT-infradead.org>,
Will Drewry <wad-AT-chromium.org>,
Steven Rostedt <rostedt-AT-goodmis.org>,
Frederic Weisbecker <fweisbec-AT-gmail.com>,
James Morris <jmorris-AT-namei.org>, linux-kernel-AT-vger.kernel.org,
Eric Paris <eparis-AT-redhat.com>, kees.cook-AT-canonical.com,
agl-AT-chromium.org, "Serge E. Hallyn" <serge-AT-hallyn.com>,
Ingo Molnar <mingo-AT-redhat.com>,
Andrew Morton <akpm-AT-linux-foundation.org>,
Tejun Heo <tj-AT-kernel.org>, Michal Marek <mmarek-AT-suse.cz>,
Oleg Nesterov <oleg-AT-redhat.com>, Jiri Slaby <jslaby-AT-suse.cz>,
David Howells <dhowells-AT-redhat.com>,
Russell King <linux-AT-arm.linux.org.uk>,
Michal Simek <monstr-AT-monstr.eu>,
Ralf Baechle <ralf-AT-linux-mips.org>,
Benjamin Herrenschmidt <benh-AT-kernel.crashing.org>,
Paul Mackerras <paulus-AT-samba.org>,
|| ||Article, Thread
* Thomas Gleixner <email@example.com> wrote:
> On Tue, 24 May 2011, Ingo Molnar wrote:
> > * Peter Zijlstra <firstname.lastname@example.org> wrote:
> > > On Tue, 2011-05-24 at 10:59 -0500, Will Drewry wrote:
> > > > include/linux/ftrace_event.h | 4 +-
> > > > include/linux/perf_event.h | 10 +++++---
> > > > kernel/perf_event.c | 49 +++++++++++++++++++++++++++++++++++++---
> > > > kernel/seccomp.c | 8 ++++++
> > > > kernel/trace/trace_syscalls.c | 27 +++++++++++++++++-----
> > > > 5 files changed, 82 insertions(+), 16 deletions(-)
> > >
> > > I strongly oppose to the perf core being mixed with any sekurity voodoo
> > > (or any other active role for that matter).
> > I'd object to invisible side-effects as well, and vehemently so. But note how
> > intelligently it's used here: it's explicit in the code, it's used explicitly
> > in kernel/seccomp.c and the event generation place in
> > kernel/trace/trace_syscalls.c.
> > So this is a really flexible solution IMO and does not extend events with some
> > invisible 'active' role. It extends the *call site* with an open-coded active
> > role - which active role btw. already pre-existed.
> We do _NOT_ make any decision based on the trace point so what's the
> "pre-existing" active role in the syscall entry code?
The seccomp code we are discussing in this thread.
> I'm all for code reuse and reuse of interfaces, but this is completely
> wrong. Instrumentation and security decisions are two fundamentally
> different things and we want them kept separate. Instrumentation is
> not meant to make decisions. Just because we can does not mean that it
> is a good idea.
Instrumentation does not 'make decisions': the calling site, which is
already emitting both the event and wants to do decisions based on
the data that also generates the event wants to do decisions.
Those decisions *will be made* and you cannot prevent that, the only
open question is can it reuse code intelligently, which code it is
btw. already calling for observation reasons?
( Note that pure observers wont be affected and note that pure
observation call sites are not affected either. )
> So what the current approach does is:
> - abuse the existing ftrace syscall hook by adding a return value to
> the tracepoint.
> So we need to propagate that for every tracepoint just because we
> have a single user.
This is a technical criticism i share with you and i think it can be
fixed - i outlined it to Will yesterday.
And no, if done cleanly it's not 'abuse' to reuse code. Could we wait
for the first clean iteration of this patch instead of rushing
> - abuse the perf per task mechanism
> Just because we have per task context in perf does not mean that we
> pull everything and the world which requires per task context into
> perf. The security folks have per task context already so security
> related stuff wants to go there.
We do not pull 'everything and the world' in, but code that wants to
process events in places that already emit events surely sounds
related to me :-)
> - abuse the perf/ftrace interfaces
> One of the arguments was that perf and ftrace have permission which
> are not available from the existing security interfaces. That's not
> at all a good reason to abuse these interfaces. Let the security
> folks sort out the problem on their end and do not impose any
> expectations on perf/ftrace which we have to carry around forever.
> Yes, it can be made working with a relatively small patch, but it has
> a very nasty side effect:
> You add another user space visible ABI to the existing perf/ftrace
> mess which needs to be supported forever.
What mess? I'm not aware of a mess - other than the ftrace API which
is not used by this patch.
> Brilliant, we have already two ABIs (perf/ftrace) to support and at
> the same time we urgently need to solve the problem of better
> integration of those two. So adding a third completely unrelated
> component with a guaranteed ABI is just making this even more complex.
So your solution is to add yet another ABI for seccomp and to keep
seccomp a limited hack forever, just because you are not interested
I think we want fewer ABIs and more flexible/reusable facilities.
> We can factor out the filtering code and let the security dudes
> reuse it for their own purposes. That makes them to have their own
> interfaces and does not impose any restrictions upon the
> tracing/perf ones. And really security stuff wants to be integrated
> into the existing security frameworks and not duct taped into
> perf/trace just because it's a conveniant hack around limitiations
> of the existing security stuff.
You are missing what i tried to point out in earlier discussions:
from a security design POV this isnt just about the system call
boundary. If this seccomp variant is based on events then it could
grow proper security checks in other places as well, in places where
we have some sort of object observation event anyway.
So this is opens up possibilities to reuse and unify code on a very
> You really should stop to see everything as a nail just because the
> only tool you have handy is the perf hammer. perf is about
> instrumentation and we don't want to violate the oldest principle
> of unix to have simple tools which do one thing and do it good.
That is one of the most misunderstood principles of Unix.
The original Unix tool landscape was highly *integrated* and
*unified*, into a very tight codebase that was maintained together.
Yes, there were different, atomic, simple commands but it was all
done together and it was a coherent whole and pleasant to use!
People misunderstood this as a license to fragment the heck out
functionality and build 'simple and stupid' tools that fit nowhere
really and use different, incompatible principles not synced with
each other. That is wrong and harmful.
So yes, over-integration is obviously wrong - but so is needless
Anyway, i still reserve judgement on this seccomp bit but the patches
so far looked very promising with a very surprisingly small diffstat.
If anything then that should tell you something that events and
seccomp are not just casually related ...
> Even swiss army knifes have the restriction that you can use only
> one tool at a time unless you want to stick the corkscrew through
> your palm when you try to cut bread.
I'm not sure what you are arguing here - do you pine for the DOS days
where you could only use one command at a time?
to post comments)