Search Results

Found 1 results for "40a5e09fc023ac3088bd25bb0b2683a2" across all boards searching md5.

8/6/2025, 5:22:01 AM

>Dialing in my performance/args for the big GLM4.5
>6.11 t/s token gen
Huh, I can live with that, just barely
>22.16 t/s prompt processing
KILL ME.

Also after some dicking around, the -ncmoe arg is less efficient than just doing a manual -ot with *exps.=CPU, but not by a whole lot.

Go to Thread

Page 1