>>106196144
tried with that: text gen is now as fast as llama.cpp, but prompt processing is 5x slower
./llama-server --model ~/TND/AI/glmq3kxl/GLM-4.5-Air-UD-Q3_K_XL-00001-of-00002.gguf -ot ffn_up_shexp=CUDA0 -ot exps=CPU -ngl 100 -t 6 -c 16384 --no-mmap -fa -ub 2048 -b 2048 -fmoe -amb 512 -rtr
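For readability, here is the same invocation broken across lines with each flag group annotated. This is a sketch: the flag notes are my reading of the command, and -fmoe, -amb, and -rtr exist only in the ik_llama.cpp fork, not mainline llama.cpp.

```shell
# ik_llama.cpp server launch (annotations are assumptions, not official docs)
./llama-server \
  --model ~/TND/AI/glmq3kxl/GLM-4.5-Air-UD-Q3_K_XL-00001-of-00002.gguf \
  -ot ffn_up_shexp=CUDA0 -ot exps=CPU \
  -ngl 100 -t 6 -c 16384 --no-mmap \
  -fa -ub 2048 -b 2048 \
  -fmoe -amb 512 -rtr
# -ot PATTERN=DEVICE  override tensor placement by regex: shared-expert FFN
#                     up-projections pinned to GPU 0, routed experts kept in CPU RAM
# -ngl 100            offload up to 100 layers to the GPU
# -t 6 / -c 16384     6 CPU threads, 16k context; --no-mmap loads weights into RAM
# -fa                 flash attention; -ub/-b set micro-batch/batch size,
#                     which mainly affects prompt-processing speed
# -fmoe               fused MoE kernels (ik_llama.cpp only)
# -amb 512            cap attention compute buffers at ~512 MiB (ik_llama.cpp only)
# -rtr                run-time repack of quantized weights for CPU (ik_llama.cpp only)
```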