Sounds awfully similar to what Xpra does. (http://xpra.org/) It makes use of X-server running dummy frame buffer and connects into it as compositor.
I don't see why the client must speak the same language as the remote location's native windowing system does. Poeple don't complain the Unix commandline doesn't do session management on its own ether, but instead use dutifully screen. They still have to have putty installed in Windows machines which ususally isn't, how does an xpra client (or a Wayland equivalent) make any big difference? Ether don't need admistrator privileges to run.
By the way -xpra approach makes rootlessness the default behavior too.