Search Results

Found 5 results for "57202dda8454fcf7998cf7f9334f153f" across all boards searching md5.

Anonymous /g/105745833#105746677
6/29/2025, 9:29:18 PM
>>105746664
>There shouldn't be any difference because Q8 GGUF is 8 bit floating point, just different format.
another retarded take from debo
Anonymous /g/105741878#105744111
6/29/2025, 4:56:19 PM
>>105744098
>switching between gguf and fp8
what gguf though? only Q8 is better than fp8
Anonymous /g/105730102#105731065
6/28/2025, 8:24:55 AM
>>105731041
>what would be the best way to run it with minimal (preferably no) quality loss?
Q8 is really close to bf16 and it's only eating ~15gb of memory during inference
Anonymous /g/105634008#105635953
6/19/2025, 1:42:05 AM
>>105635932
>Because Q8 is negligible quality difference to the full weights while allowing me to generate 8 images at a time easily, speeding up the generation by granting a free 1 image for each 8 time wise
what? Q8 and bf16 have the same speed though?
Anonymous /g/105603632#105605505
6/16/2025, 1:09:26 AM
>>105605495
>>105605498
why are retards always so confident of themselves?