>>106208554
Well fuck me sideways that made an enormous difference, 22 to 145 t/s on glm 358b

I swear I upped my batch sizes on qwen 235b and it made a negligable difference so I hadn't even bothered on glm, now this cunt PP's faster than it does.

Time to rejig my -ts and squeeze the last of my memory, since I had to change my -ot to fit the higher batch size.