← Home ← Back to /g/

Thread 106187009

13 posts 4 images /g/
Anonymous No.106187009 >>106187242 >>106188127 >>106189055 >>106190320 >>106192534
i like gpt-oss:20b, pretty decent for it's parameters
Anonymous No.106187242
>>106187009 (OP)
>135tok/s
Yeah, it's pretty nice.
Anonymous No.106187502
hart
Anonymous No.106187533 >>106187552
none of this shit from hugging face is up to scratch. it's a total waste of a time compared to the premium stuff.
Anonymous No.106187552 >>106188175
>>106187533
The VRAM moat is real.
Anonymous No.106188122
>mfw the 2020 dodge charger is a four door
Anonymous No.106188127
>>106187009 (OP)
>>>/vt/ go back to your containment board
Anonymous No.106188175
>>106187552
part of it is VRAM, but a large part of it is the extra, non-model things they don't talk about that actually QA the output before sending it to you.
Anonymous No.106188302
whats that child doing under my desk?
Anonymous No.106189055
>>106187009 (OP)
It's literally just a shitty reasoning finetune on top of 1.5 generation old weights. Llama-2-13B-Chat mogs on it.
Anonymous No.106190320 >>106190724
>>106187009 (OP)
>i like the thing
enough to make an entire thread? wow
Anonymous No.106190724
>>106190320
It's a slide thread
>Lust inducing picture
>time wasting question
Anonymous No.106192534
>>106187009 (OP)
The reasoning is much efficient than Qwen3. They take for-fucking-ever.