GLM 4.5 air iq4_xs on a 3090... Should the prompt processing be this slow?