Progress in wrangling the Python C API

Posted Nov 9, 2023 2:10 UTC (Thu) by NYKevin (subscriber, #129325)
In reply to: Progress in wrangling the Python C API by 5fdb1f
Parent article: Progress in wrangling the Python C API

This is an unfortunate situation where Python is offering semantics that (safe) Rust is unable to model, mostly because Python's semantics are little more than "here's a pointer, here's a flag telling you whether or not you can write to it, also by the way I'm going to hand out the same pointer and flag to anyone else who asks."

(I mean, yes, it is a little more complicated than that. Python actually tells you all sorts of metadata about how the buffer is laid out in memory, whether it is multi-dimensional, etc., and can even hand out non-contiguous buffers. But in terms of what you are allowed to do with it, it's pretty much just "a pointer and a writable flag.")

In general, Python's attitude is that, if you want to fiddle with the buffer protocol, you should be doing it as an implementation detail of some object that you entirely control, or else you should be using some kind of explicit locking which is not managed by the interpreter. But Python is also totally fine with a library writing "lol this is not thread-safe, don't even try" in their documentation and calling it a day. And frankly, that insouciance cannot be made compatible with "let's have a sound Rust wrapper type that can safely interact with arbitrary Python buffer protocol objects." Python does not provide the guarantees that Rust wants, nor does Python provide a standardized mechanism for Rust to ask for such guarantees.

What I think needs to happen is one (or more) of the following:

* Rust code may only use the buffer protocol to interact with objects that it owns or mutably borrows (in Rust terms) and that have a (Python) refcount of 1 (or which have not returned from their Python constructors yet). Additionally, the object must not be compatible with (Python) weak references (so that the refcount is correct). This combination of requirements is very limiting. I'm not sure how easy it is to prove all of these requirements are satisfied at compile time, but you could dynamically check them, so this could be used to build a safe primitive out of unsafe code.
* Python starts offering flags that indicate a buffer is const or exclusive. const is a stronger condition than read-only, because it means the object won't change of its own accord (or by some means other than the buffer protocol). But const can probably only be offered for objects that Python considers immutable, and exclusive raises questions of how the locking is supposed to work. What happens if a legacy client asks for a (writable) buffer when the new API considers the request unacceptable? Does it just get a BufferError? That probably breaks something, but there's no obvious alternative behavior.
* Python requires only one client to have mutable access to a given buffer at a given time. It is technically possible to do this without breaking ABI compatibility, because buffer accessors are already required to call PyBuffer_Release when they are done with the buffer (so you can keep track of who is reading or writing the buffer at any given time). However, this would be semantically incompatible with the existing behavior and would likely result in problems anyway. For example, there's currently nothing wrong with a client eagerly acquiring all of its buffers up-front, figuring out which ones alias, and then managing those aliases with a separate set of locks on the C side, without informing Python of the locks at all. If you added a lock to the buffer protocol, then that client would break because it would be unable to acquire the same buffer more than once. It is, frankly, unreasonable to break that client just to make things easier for Rust - they used the API as advertised, they're not doing anything particularly sketchy or unusual, and there's currently no way to inform Python of the locks even if you want to (nor does the current version of Python have the slightest idea what to do with that information).
* Python offers an advisory locking mechanism for the buffer protocol. This doesn't really solve anything because clients can ignore it, but maybe you deprecate non-locked access and try to force everybody to take locks in a future version. Single-threaded clients will probably be unimpressed by that.
* Rust treats Python buffers as equivalent to raw pointers, and stops trying to build safe wrappers for the protocol as a whole. Instead, build wrappers specific to each type you want to interact with. Unfortunately, that's probably not a whole lot easier than trying to wrap the whole protocol, but at least you can somewhat depend on the semantics of the specific C extension, which might be able to offer stronger guarantees than Python does.

Progress in wrangling the Python C API

Posted Nov 9, 2023 2:32 UTC (Thu) by DemiMarie (subscriber, #164188) [Link] (2 responses)

Another option is to tell Python programmers “don’t do that”. Python already allows accessing arbitrary memory via ctypes, so there is nothing new here.

Rust could also view this as an array of AtomicU8, which is safe if not very useful.

Progress in wrangling the Python C API

Posted Nov 9, 2023 7:38 UTC (Thu) by NYKevin (subscriber, #129325) [Link] (1 responses)

> Another option is to tell Python programmers “don’t do that”. Python already allows accessing arbitrary memory via ctypes, so there is nothing new here.

That's... completely irrelevant to everything.

ctypes enables Python code to access memory owned by a C object (or, in principle, any sort of data structure that can be describe in C-like terms), as well as to call C functions (and functions with C bindings). The buffer protocol enables C extensions to access memory owned by another C extension (or a builtin CPython type), and it also incidentally has a very small Python API[1] that enables Python to act as glue code for this process.[2] Neither is a substitute for the other, and telling Python programmers not to write glue code will not change much of anything (they will ignore you because "writing glue code" is Python's main use case).

[1]: https://docs.python.org/3/library/stdtypes.html#memoryview
[2]: Python can read from these buffers, but the whole point of the buffer protocol is to enable direct C-to-C memory access (i.e. without holding the GIL or executing Python bytecode). The object that gave you the buffer probably already exposes __getitem__ etc. to Python anyway.

Progress in wrangling the Python C API

Posted Nov 11, 2023 9:38 UTC (Sat) by DemiMarie (subscriber, #164188) [Link]

The reason I mentioned ctypes is to show that one can already corrupt memory from within Python. “Don’t do that” = “if you change memory that you should not change and something breaks, you get to keep both pieces.”

Progress in wrangling the Python C API

Posted Nov 10, 2023 7:17 UTC (Fri) by himi (subscriber, #340) [Link]

It seems to me that the Python C API would probably benefit from formalising the ownership and mutability rules to match Rust's - not just because it makes interfacing Python with Rust easier, but because it would improve the lot of /everything/ interfacing with Python. Rust's rules are simple, fairly easy to reason about, and powerful, even if you don't have the whole language enforcing them - it seems like it'd be a good model for /anything/ that wants to manage access to shared memory, particularly across a natural boundary like an extension API.

If I've understood things from previous discussions about this properly (and option 3 in your comment supports this) the API already kind of supports some of this, though not really very well - from memory it might be doable with some new flags that plugins supporting the new model would set and check, while old plugins would ignore/not set and get the same behaviour they do now. Then it's just a matter of making sure the Python side maintained its end of the new model, and off you go. Of course, mixing new and old would probably Not Work(tm) in interesting ways, but mixing and matching ABIs is rarely a good idea . . .

You probably wouldn't want to try and enforce the rules with locking and so forth (though in the no-gil world maybe it'd be necessary?) - trusting that the plugin didn't ignore the flags that were set, or lie about the flags /it/ set, would seem like the best approach. After all, it's at least making explicit promises, as it stands there are no promises at all, let alone actual guard rails anywhere.

It may also be the case that changes on the Python side to support the new model would break some users - but isn't that kind of the situation anyone using the current C API has to deal with across major version changes? And if the whole C API is being reworked (hopefully also properly formalised, stabilised and documented) then isn't that a great time to implement this kind of breaking change?