Sorry if this is the wrong thread, but does anyone use TTS models?
I've been fucking around with LLMs, image gen and video gen for awhile and there's plenty of ressources and discussion on these, but nobody seems to give a shit about text to speech
I can't even find a good library of voice samples.