New gemma3 LLM model is only 300MB.
I wonder if it's good enough to put into your game for NPCs now. Maybe with some finetuning every NPC or archetype can have their own finetuned dialogue.