Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Posted Sep 20, 2025 0:45 UTC (Sat) by stephen.pollei (subscriber, #125364)In reply to: Neat: but isn't this a type-1 hypervisor? by quotemstr
Parent article: Multiple kernels on a single system
Posted Sep 20, 2025 15:29 UTC (Sat)
by ballombe (subscriber, #9523)
[Link] (9 responses)
Posted Sep 20, 2025 17:10 UTC (Sat)
by quotemstr (subscriber, #45331)
[Link] (8 responses)
Posted Sep 20, 2025 20:22 UTC (Sat)
by roc (subscriber, #30627)
[Link] (7 responses)
If the Linux kernel had been unable to scale well beyond 16 cores then this cluster idea might have been a viable path forward. But Linux did and any potential competitor that doesn't is simply not viable for these workloads.
Posted Sep 21, 2025 8:07 UTC (Sun)
by quotemstr (subscriber, #45331)
[Link] (6 responses)
Yes, and those programs can keep running. Suppose I'm developing a brand-new system and a cluster on which to run it. My workload is bigger than any single machine no matter how beefy, so I'm going to have to distribute it *anyway*, with all the concomitant complexity. If I can carve up my cluster such that each NUMA domain is a "machine", I can reuse my inter-box work distribution stuff for intra-box distribution too.
Not every workload is like this, but some are, and life can be simpler this way.
Posted Sep 21, 2025 9:17 UTC (Sun)
by ballombe (subscriber, #9523)
[Link] (5 responses)
But more seriously, when using message passing, you still want to be share your working set across all cores in the same node to preserve memory.
Posted Sep 21, 2025 10:15 UTC (Sun)
by willy (subscriber, #9762)
[Link] (4 responses)
Posted Sep 21, 2025 12:42 UTC (Sun)
by ballombe (subscriber, #9523)
[Link] (3 responses)
Posted Sep 21, 2025 20:19 UTC (Sun)
by willy (subscriber, #9762)
[Link] (2 responses)
Posted Sep 21, 2025 20:35 UTC (Sun)
by quotemstr (subscriber, #45331)
[Link] (1 responses)
Posted Sep 21, 2025 20:39 UTC (Sun)
by willy (subscriber, #9762)
[Link]
The patches are to do this automatically without library involvement. I think the latest round were called something awful like "Copy On NUMA".
Posted Sep 20, 2025 18:59 UTC (Sat)
by willy (subscriber, #9762)
[Link] (2 responses)
But Solaris didn't have RCU. I would argue that RCU has enabled Linux to scale further than Solaris without falling off "the locking cliff". We also have lockdep to prevent us from creating deadlocks (I believe Solaris eventually had an equivalent, but that was after Larry left Sun). Linux also distinguishes between spinlocks and mutexes, while I believe Solaris only has spinaphores. Whether that's terribly helpful or not for scaling, I'm not sure.
Posted Sep 20, 2025 21:31 UTC (Sat)
by stephen.pollei (subscriber, #125364)
[Link] (1 responses)
I don't know enough to have an opinion on how Linux kernel was able to scale as successfully as it has. There were certainly doubts in the past. If I recall correctly, RCU was being used in other kernels before it was introduced in Linux, but I don't recall which ones.
Posted Sep 21, 2025 6:04 UTC (Sun)
by willy (subscriber, #9762)
[Link]
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
<https://en.wikipedia.org/wiki/Single_system_image>
... or HPE will sell you NUMAlink systems with coherent memory across 32 sockets.
Replacing a 128 cores system by 8 16-cores system will require 8 copies of the working set.
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?
Neat: but isn't this a type-1 hypervisor?