so let's say i have rtx 6000 pro that fell on my lap
what's the best set up to do the following:
use silly tavern with llm model X (via llama.cpp or whatever) to do lewd RP with, with a TTS model Y to read said lewd RP in my oneitis voice, and gen image/video with model Z of said oneitis?