>try ik_llama.cpp for GLM-4.5 with the ubergarm quants
>it's slower than llama.cpp with the unsloth ones, even if you take into account that the unsloth q4 is bigger
I had hopes for it because they were better for DeepSeek, but I guess I'm back on mainline