>>106022276 (OP)
I use OpenAudio S1 Mini for local text-to-speech and Seed-VC for voice conversion. OpenAudio S1 Mini eats up 5GB of VRAM and a fine-tuned Seed-VC model eats up 2GB.
Input audio of Sports Commentator sample file from the DMOSpeech 2 demo GitHub page
https://vocaroo.com/1kKJm37RDYFI
OpenAudio S1 Mini Output file
https://vocaroo.com/1bQiLbbhKfZO
Input audio of Doc Hammer (Carl's Jr Ad announcer)
https://vocaroo.com/19XnClS0JbMi
OpenAudio S1 Mini Output file
https://vocaroo.com/13JjY0eyTLon
Input audio of unknown Carl's Jr Ad announcer
https://vocaroo.com/1ffVrk1VNmz4
OpenAudio S1 Mini Output file
https://vocaroo.com/185jN6BLxiuo
Input audio of Grace Randolph
https://vocaroo.com/1isQM48G8nA9
OpenAudio S1 Mini Output file
https://vocaroo.com/14fY7AYlTNvH
OpenAudio S1 Mini Output file fed to a fine-tuned Seed-VC model. Sample rate is 22050 Hz.
https://vocaroo.com/1ayPfkoXu95U
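If you want to confirm the sample rate of a converted file yourself, here is a rough sketch using only Python's standard `wave` module. It assumes the Seed-VC output was saved as an uncompressed PCM WAV; `output.wav` is a placeholder name, and `wav_info` / `write_silence` are helper names made up for this example, not part of either tool.

```python
import wave


def wav_info(path):
    """Return (sample_rate_hz, channels, duration_seconds) for a PCM WAV file."""
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()
        channels = wf.getnchannels()
        frames = wf.getnframes()
    return rate, channels, frames / rate


def write_silence(path, rate=22050, seconds=1.0):
    """Write a mono 16-bit silent WAV, only so the check can be tried without real audio."""
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wf.setframerate(rate)
        wf.writeframes(b"\x00\x00" * int(rate * seconds))


if __name__ == "__main__":
    write_silence("output.wav")  # placeholder; point wav_info at your own file instead
    rate, channels, duration = wav_info("output.wav")
    print(f"{rate} Hz, {channels} channel(s), {duration:.2f} s")
```

Anything downloaded from Vocaroo will likely be compressed, so this only works on the local WAV before uploading.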