The netalumina models are pretty nice. Fast enough that they're actually usable as well.
I'll have to test out the Qwen models as well.