>>105883646
training llama4-behemoth (2T total / 288B active), which they used to distill all the data they trained maverick/scout on