>>103668705
Memory bandwidth is the big difference, and it matters a lot because LLM inference spends most of its time moving model weights through memory. We're talking roughly a 10x gap versus typical system RAM, which is why GPUs run LLMs so much faster. It's also why the really high-end datacenter GPUs use HBM, and why 3090s are still prized.
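
Rough napkin math, if it helps. This is only a sketch: it assumes a dense model where every weight gets read once per generated token, so tokens/s can't exceed bandwidth divided by model size. The bandwidth figures are approximate spec-sheet numbers, and the 7B fp16 model is just a hypothetical example. Real throughput will be lower.

```python
# Upper bound on decode speed when memory bandwidth is the bottleneck.
# Assumption: each generated token requires reading all weights once.

def max_tokens_per_second(bandwidth_gb_s, params_billions, bytes_per_param):
    """Bandwidth-bound ceiling on tokens/s for a dense model."""
    model_gb = params_billions * bytes_per_param  # weight size in GB
    return bandwidth_gb_s / model_gb

# Approximate peak bandwidths in GB/s (spec-sheet values, not measured).
BANDWIDTH_GB_S = {
    "dual-channel DDR5-5600": 89.6,
    "RTX 3090 (GDDR6X)": 936.0,
    "H100 SXM (HBM3)": 3350.0,
}

# Hypothetical model: 7B parameters stored as fp16 (2 bytes each).
for name, bw in BANDWIDTH_GB_S.items():
    ceiling = max_tokens_per_second(bw, params_billions=7, bytes_per_param=2)
    print(f"{name}: at most ~{ceiling:.1f} tokens/s")
```

The 3090 vs DDR5 ratio comes out a bit over 10x, which lines up with the gap above. Compute, caches, and batching all change the real numbers, so treat this as a ceiling and not a benchmark.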