>>106149915
It does happen for deepseek but deepseek at Q1 is still the best model you can run in that amount of memory. A model with fewer parameters at a larger quant will be worse.