>>105783390
the latent is larger -> diffusion model is working with a larger input -> slower generation
sdxl: 3x1024x1024 image -> 4x128x128 latent = 65,536 values
flux: 3x1024x1024 image -> 16x128x128 latent = 262,144 values