Anonymous
6/22/2025, 10:16:48 PM
No.105674325
>>105674298
I think ktransformers let's you speed things up by having a copy of the model in each NUMA node.
So effectively you use half your total ram but double the theoughtput?
Something like that.
I think ktransformers let's you speed things up by having a copy of the model in each NUMA node.
So effectively you use half your total ram but double the theoughtput?
Something like that.