>>106329655
>>106329758
Wait, didn't read. The dense layers of full deepseek R1 should fit into 24gb VRAM though. All the layers that go on GPU in pic rel is kept at Q8 (unquanted) and it still fits 32k context in 24gb VRAM.