>>105572471 (OP)Because they're peripherals.
It's in the name: Peripheral Component Interconnect. There is a pseudo-direct connection-method - GPU-to-GPU - achievable through FPGA (Field-Programmable Gate Array) boards which OpenAI and Microshit use in their Mariner and Gridcoin servers (part of Azure and OpenAI Compute). They GOUs are connected to each other through the FPGAs, with MMF, using either LC or MOO-DAC connectors.
Bus management and responsibility includes frequency matching, impedance matching, access and backoff algorithms, and IO sheduling, among other activities. Traditionally, the GP CPU ingests these requirements as simple opcodes and interrupts along a memory-management route with attendant syscalls in kernel libraries or stacks, but GPUs COULD do all of that themselves.
Things is, at that point, you'd be reinventing the motherboard, simply with the GPU as the underlying ULSI foundation instead of the CPU.
Believe it or not, some AI organizations are already doing that, like the above-mentioned EvilCorps.