Nope -- every other GPU implementation I've seen so far requires the shader compiler, and even the general translation from the OpenGL ES API to native formats and command queues, to run on the host side. Including a full general-purpose processor in the GPU block is unique to Broadcom, to the best of my knowledge (though it is similar to how a lot of codec blocks work -- a pretty powerful, relatively general-purpose CPU or DSP combined with dedicated hardware acceleration units for the heavy lifting).