>>535045548
MS-Schisandra-22B-v0.3, on the other hand, maintains the style subtly, but well enough, is smart and cohesive, and can write in languages other than English surprisingly well. A 5 or 4 bit quant will fit on 24 GB of VRAM with room to spare for 8K or 16K of context.
If anyone out there uses anything local better that is not a Mistral Small or Mistral Nemo fine-tune, I'd be happy to know.