>>106395902
>>106395938
Nvidia didn't miss performance improvements for AI, even if they did for gaming. You can see it most clearly with the Qwen benchmarks where it/s is basically 2x. I feel like FP4 models are going to start coming out especially with MXFP4 on GPT-OSS being super fast and everyone likely to copy OpenAI.