>>107133416
bro?
its using 10gb vram, 4gig model and rest is ctx prob
250t/s prompt processing, not tg
tg is more like 7-9t/s
i think i have benchmarks saved somewhere, gimme a minute