Naive question: how much did userspace malloc's learn from the various kernel allocators?
I remember reading on LWN that part of Firefox 3's performance improvement came from adopting jemalloc and avoiding memory fragmentation on Windows: http://lwn.net/Articles/273001/