Anonymous
7/30/2025, 2:22:28 PM No.5936456
Previous thread >>5909267
Dedicated Suno/Udio/Sonauto thread >>5915471
Post AI generated stuff. Song covers, animations, whatever.
Bonus for OC. Please keep short AI video clips without sound to a minimum.
Open source resources are linked below, but feel free to post anything, open or proprietary.
> Speech-to-Speech and Singing-Voice-Conversion
https://github.com/Mangio621/Mangio-RVC-Fork
https://github.com/Vali-98/XTTS-RVC-UI
https://github.com/voicepaw/so-vits-svc-fork
https://github.com/open-mmlab/Amphion/tree/main/models/svc/vevosing
https://github.com/Plachtaa/seed-vc
> Text-to-Speech
https://github.com/FunAudioLLM/CosyVoice
https://github.com/resemble-ai/chatterbox
https://github.com/index-tts/index-tts
https://github.com/Zyphra/Zonos/
https://github.com/open-mmlab/Amphion/tree/main/models/tts/maskgct
https://github.com/SparkAudio/Spark-TTS
https://huggingface.co/spaces/mrfakename/E2-F5-TTS/tree/main
https://github.com/BoltzmannEntropy/xtts2-ui
> Text to Music
https://github.com/joeljuvel/YuE-UI/
> Text-to-Video, Image-to-Video
https://github.com/kijai/ComfyUI-WanVideoWrapper
https://github.com/Wan-Video/Wan2.1
> Lipsync, Deepfake, etc.
https://openart.ai/workflows/datou/multitalk-wanvideo-21/lFeTPTwebmGMloHM3QGO + https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors
https://github.com/Hillobar/Rope
https://github.com/Mozer/wav2lip
> Audio Cleanup
UVR Walkthrough: https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit#heading=h.n8ac32fhltgg
https://github.com/Anjok07/ultimatevocalremovergui
https://github.com/Rikorose/DeepFilterNet
https://github.com/nomadkaraoke/python-audio-separator
https://github.com/resemble-ai/resemble-enhance
> Transcription
https://github.com/linto-ai/whisper-timestamped
https://github.com/openai/whisper
Dedicated Suno/Udio/Sonauto thread >>5915471
Post AI generated stuff. Song covers, animations, whatever.
Bonus for OC. Please keep short AI video clips without sound to a minimum.
Open source resources are linked below, but feel free to post anything, open or proprietary.
> Speech-to-Speech and Singing-Voice-Conversion
https://github.com/Mangio621/Mangio-RVC-Fork
https://github.com/Vali-98/XTTS-RVC-UI
https://github.com/voicepaw/so-vits-svc-fork
https://github.com/open-mmlab/Amphion/tree/main/models/svc/vevosing
https://github.com/Plachtaa/seed-vc
> Text-to-Speech
https://github.com/FunAudioLLM/CosyVoice
https://github.com/resemble-ai/chatterbox
https://github.com/index-tts/index-tts
https://github.com/Zyphra/Zonos/
https://github.com/open-mmlab/Amphion/tree/main/models/tts/maskgct
https://github.com/SparkAudio/Spark-TTS
https://huggingface.co/spaces/mrfakename/E2-F5-TTS/tree/main
https://github.com/BoltzmannEntropy/xtts2-ui
> Text to Music
https://github.com/joeljuvel/YuE-UI/
> Text-to-Video, Image-to-Video
https://github.com/kijai/ComfyUI-WanVideoWrapper
https://github.com/Wan-Video/Wan2.1
> Lipsync, Deepfake, etc.
https://openart.ai/workflows/datou/multitalk-wanvideo-21/lFeTPTwebmGMloHM3QGO + https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors
https://github.com/Hillobar/Rope
https://github.com/Mozer/wav2lip
> Audio Cleanup
UVR Walkthrough: https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit#heading=h.n8ac32fhltgg
https://github.com/Anjok07/ultimatevocalremovergui
https://github.com/Rikorose/DeepFilterNet
https://github.com/nomadkaraoke/python-audio-separator
https://github.com/resemble-ai/resemble-enhance
> Transcription
https://github.com/linto-ai/whisper-timestamped
https://github.com/openai/whisper
Replies: