Pony Preservation Project (Thread 154) - /mlp/ (#42161191) [Archived: 542 hours ago]

Anonymous
5/2/2025, 7:13:41 AM No.42161191
AltOPp
md5: de275cbe7306de042ac699b6550ea6b0
Welcome to the Pony Voice Preservation Project!
youtu.be/730zGRwbQuE

The Pony Preservation Project is a collaborative effort by /mlp/ to build and curate pony datasets for as many applications in AI as possible.

Technology has progressed such that a trained neural network can generate convincing voice clips, drawings and text for any person or character using existing audio recordings, artwork and fanfics as a reference. As you can surely imagine, AI pony voices, drawings and text have endless applications for pony content creation.

AI is incredibly versatile: basically anything that can be boiled down to a simple dataset can be used for training to create more of it. AI-generated images, fanfics, wAIfu chatbots and even animation are possible, and are being worked on here.

Any anon is free to join, and there are many active tasks that would suit any level of technical expertise. If you're interested in helping out, take a look at the quick start guide linked below and ask in the thread for any further detail you need.

EQG and G5 are not welcome.

>Quick start guide:
docs.google.com/document/d/1PDkSrKKiHzzpUTKzBldZeKngvjeBUjyTtGCOv2GWwa0/edit
Introduction to the PPP, links to text-to-speech tools, and how (You) can help with active tasks.

>The main Doc:
docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit
An in-depth repository of tutorials, resources and archives.

>Online speech generation
haysay.ai

>Active tasks:
Research into animation AI
Research into pony image generation

>Latest developments:
http://ponepaste.org/10865

>The PoneAI drive, an archive for AI pony voice content:
drive.google.com/drive/folders/1E21zJQWC5XVQWy2mt42bUiJ_XbqTJXCp

>Clipper's Master Files, the central location for MLP voice data:
mega.nz/folder/jkwimSTa#_xk0VnR30C8Ljsy4RCGSig
mega.nz/folder/gVYUEZrI#6dQHH3P2cFYWm3UkQveHxQ
drive.google.com/drive/folders/1MuM9Nb_LwnVxInIPFNvzD_hv3zOZhpwx

>Cool, where is the discord/forum/whatever unifying place for this project?
You're looking at it.

Last Thread:
>>42103996
Replies: >>42161203 >>42196683 >>42300352
Anonymous
5/2/2025, 7:16:05 AM No.42161200
FAQs:
If your question isn't listed here, take a look in the quick start guide and main doc to see if it's already answered there. Use the tabs on the left for easy navigation.
Quick: docs.google.com/document/d/1PDkSrKKiHzzpUTKzBldZeKngvjeBUjyTtGCOv2GWwa0/edit
Main: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit

>Where can I find the AI text-to-speech tools and how do I use them?
A list of TTS tools: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit#heading=h.yuhl8zjiwmwq
How to get the best out of them: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit#heading=h.mnnpknmj1hcy

>Where can I find content made with the voice AI?
In the PoneAI drive: drive.google.com/drive/folders/1E21zJQWC5XVQWy2mt42bUiJ_XbqTJXCp
And the PPP Mega Compilation: docs.google.com/spreadsheets/d/1T2TE3OBs681Vphfas7Jgi5rvugdH6wnXVtUVYiZyJF8/edit

>I want to know more about the PPP, but I can't be arsed to read the doc.
See the live PPP panel shows presented on /mlp/con for a more condensed overview.
2020 pony.tube/w/5fUkuT3245pL8ZoWXUnXJ4
2021 pony.tube/w/a5yfTV4Ynq7tRveZH7AA8f
2022 pony.tube/w/mV3xgbdtrXqjoPAwEXZCw5
2023 pony.tube/w/fVZShksjBbu6uT51DtvWWz

>How can I help with the PPP?
Build datasets, train AIs, and use the AI to make more pony content. Take a look at the quick start guide for current active tasks, or start your own in the thread if you have an idea. There's always more data to collect and more AIs to train.

>Did you know that such and such voiced this other thing that could be used for voice data?
It is best to keep to official audio only unless there is very little of it available. If you know of a good source of audio for characters with few (or just fewer) lines, please post it in the thread. 5.1 is generally required unless you have a source already clean of background noise. Preferably post a sample or link. The easier you make it, the more likely it will be done.

>What about fan-imitations of official voices?
No.

>Will you guys be doing a [insert language here] version of the AI?
Probably not, but you're welcome to. You can however get most of the way there by using phonetic transcriptions of other languages as input for the AI.

>What about [insert OC here]'s voice?
It is often quite difficult to find good quality audio data for OCs. If you happen to know any, post them in the thread and we'll take a look.

>I have an idea!
Great. Post it in the thread and we'll discuss it.

>Do you have a Code of Conduct?
Of course: 15.ai/code

>Is this project open source? Who is in charge of this?
pony.tube/w/mqJyvdgrpbWgZduz2cs1Cm

PPP Redubs:
pony.tube/w/p/aR2dpAFn5KhnqPYiRxFQ97

Stream Premieres:
pony.tube/w/6cKnjJEZSCi3gsvrbATXnC
pony.tube/w/oNeBFMPiQKh93ePqTz1ns8
Anonymous
5/2/2025, 7:17:06 AM No.42161203
veryVERYbiganchor
md5: dc4d191e667ec3321ac20891aa7d52b9
>>42161191 (OP)
Anchor.
Anonymous
5/2/2025, 7:24:32 AM No.42161222
emmy the robot as pony 354131
md5: 0568fbf188e388d4f9e8f54e28c8a9db
>woken up just 5 minutes after thread passed page 10
Stupid fuckers and their "1 post by OP with retarded one bait sentence" threads.
Anyhow, are you guys busy with doing entries for antithology or what (I know I am, im sitting on like 5 half assed ideas that still need doing) ?
Anonymous
5/2/2025, 11:20:02 AM No.42161566
>page 9 after less than 4 hours
Board activity but at what cost ?
Replies: >>42161651
Anonymous
5/2/2025, 12:04:26 PM No.42161651
>>42161566
The cost is our sanity.
Anonymous
5/3/2025, 2:10:18 AM No.42163358
Is there a FLA of Fluttershy's cabin interior or her bedroom in the leak on web archive called MLP FLAs? I tried Dragonshy, Part 1 of Friendship is Magic and Stare Master but it's not in those...
Replies: >>42164184
Anonymous
5/3/2025, 8:48:56 AM No.42164184
bed fs rd blushing
md5: 6c35f94132b849f6bc0c5d7870c63ba4
>>42163358
From what some quick google-fu tells me, the episodes with fully leaked assets that we should have access to (all from season 8) are as follows:
6 - "Surf and/or Turf", 7 - "Horse Play", 8 - "The Parent Map", 9 - "Non-Compete Clause", 10 - "The Break Up Break Down", 11 - "Molt Down", 13 - "The Mean 6"
I swear we had some bits and bobs from other episodes, but I can't seem to find a proper list of what is (and is not) archived.
There is this scene from The Super Speedy Cider Squeezy 6000 (and I think in the later-season eps with Nightmare Night and the one where Discord suffers from being "normal" as well)?
Replies: >>42169147
Anonymous
5/4/2025, 1:17:01 AM No.42165897
13bdb600cbe7ee9d
md5: 203240403340c5522b56511c636e3559
>https://codeberg.org/nak/sample-neko
Here is a tool that I spotted on the interwebs that lets you easily list and move 1k+ sound clips from one folder to another.
I feel like it could be really useful to Anons here organising their folders for production of big or small projects.
Replies: >>42165947
Anonymous
5/4/2025, 1:42:16 AM No.42165947
>>42165897
was literally thinking about how i needed sound effects from the show for a project i was doing
more specifically little things like character laughs or snorts n stuff
Replies: >>42166063
Anonymous
5/4/2025, 2:41:29 AM No.42166063
>>42165947
A lot of those are in Clipper's Master File Part 2:
https://mega.nz/folder/gVYUEZrI#6dQHH3P2cFYWm3UkQveHxQ/folder/EMZF3ApB
Anonymous
5/4/2025, 8:08:17 AM No.42166563
Bump.
Replies: >>42167638
Anonymous
5/4/2025, 12:51:52 PM No.42166887
>https://files.catbox.moe/vx3yr9.mp3
Anonymous
5/4/2025, 9:30:01 PM No.42167638
>>42166563
Replies: >>42168304
Anonymous
5/5/2025, 2:08:01 AM No.42168304
>>42167638
Anonymous
5/5/2025, 10:44:09 AM No.42169147
>>42164184
ugh, is there a way to get the pop up when you first download a torrent to select files to download again? I've got the magnet for the leak.
Anonymous
5/5/2025, 12:38:54 PM No.42169246
Best tools if I want to gen Cozy Glow lines?
Replies: >>42169373
Anonymous
5/5/2025, 2:18:37 PM No.42169373
>>42169246
I'm guessing you want to run it locally and don't want to use HaySay? Get yourself Python and GPT-SoVITS.
>https://github.com/effusiveperiscope/GPT-SoVITS
>https://huggingface.co/therealvul/GPT-SoVITS-v2/tree/454406eb40b63c5571f33c29f4fd8bac197131d6/CozyGlow-SVe24-GPTe48
Replies: >>42169376 >>42169924
Anonymous
5/5/2025, 2:21:15 PM No.42169376
>>42169373
Which haysay architecture has the best Cozy?
Replies: >>42169392
Anonymous
5/5/2025, 2:28:41 PM No.42169392
>>42169376
I'm pretty fond of the RVC one, BUT it is heavily dependent on the input audio.
Anonymous
5/5/2025, 8:04:30 PM No.42169924
>>42169373
What's the current SOTA for voice-to-voice conversion? Preferably something that can be finetuned. The latest GPT-SoVITS v4 is very good, but it doesn't sound like the reference, so an additional step is needed, I think.
Replies: >>42170104 >>42171853
Anonymous
5/5/2025, 9:48:07 PM No.42170104
>>42169924
rvc and so-vits are still the king, I think some Anons posted some other "minimal dataset voice cloning" stuff in the past but none of them seem to stick around (with the github codefags making their training process way too complex, or pulling requirements out of their assess).
Anonymous
5/6/2025, 12:39:05 AM No.42170546
I heard through the grapevine that 15.ai is coming back, anyone heard about that?
Replies: >>42171393
Anonymous
5/6/2025, 7:44:27 AM No.42171393
>>42170546
>https://desuarchive.org/mlp/thread/41706417/#41711970
Pretty sure that site is still ded, and it will stay that way for very long time (aka 4ever). if any new code were to be produce by 15ai it would need to be some kind of collaboration with other codefags to avoid being chased by tiny hat lawyers , and by logic of nobody sharing such news around means it's not happening .
Anonymous
5/6/2025, 2:25:48 PM No.42171853
>>42169924
GPT-SoVITS is mainly intended for text-to-speech. The reference audio is only for providing an emotional style. For speech-to-speech, you should stick to RVC.
Replies: >>42172693
Anonymous
5/6/2025, 3:53:35 PM No.42171965
Is Haysay down for anyone else? I can't seem to reach the site at all.
Replies: >>42172009
Anonymous
5/6/2025, 4:21:27 PM No.42172009
>>42171965
https://files.catbox.moe/4sz8fc.mp3
the pretty mare voice site seems to be working fine for me. did you try different browser anon?
Anonymous
5/6/2025, 9:37:43 PM No.42172693
>>42171853
Why wouldn't I be able to do GPT-SoVITS => RVC?
Replies: >>42172807
Anonymous
5/6/2025, 10:42:31 PM No.42172807
>>42172693
yeah, you can. One problem is that the RVC sometimes derps out the outputs when you give it lines from the same character; sometimes it depends on what kind of note the clip is hitting, and sometimes the electronic goblins are messing about. So just test out different TTS voices to see which one works best with the RVC character you want to output.
Anonymous
5/6/2025, 11:12:45 PM No.42172881
accent remover shweta_ai
md5: 66db6d0331cfd26e90a1fffaed304c7c
>https://nitter.space/shweta_ai/status/1912536464333893947
I need this for mare content, so I can finally get AJ to speak with a Deep South accent without fluffing around with different word spellings, or get Rarity to pronounce words in a way more posh manner.
Anonymous
5/7/2025, 12:55:19 AM No.42173088
>>42166202
>>42166241
Crossposting from the /chag/ thread: they are planning on doing some collaboration with the /robowaifu/ guys to start making irl robot ponies. Very cool, and good luck to you!
Anonymous
5/7/2025, 10:25:32 AM No.42173899
First actually good local music model, like suno v2 quality. Fast as fuck as well.

https://www.reddit.com/r/LocalLLaMA/comments/1kg9jkq/new_sota_music_generation_model/
Replies: >>42173902
Anonymous
5/7/2025, 10:26:33 AM No.42173902
>>42173899
Also has lora training already, could 100% train pony singing.
Anonymous
5/7/2025, 1:19:16 PM No.42174105
1733690617595293
md5: 643b46d2260b5a640f8f97323e92a691
https://ace-step.github.io/
https://github.com/ace-step/ACE-Step

Passes the nigger test.
https://vocaroo.com/11MoCQ68jiLY

And this is fun.
>>>/g/105183843
>>>/g/105184228
I'd love to try with some MLP songs, but I'm a VRAMlet with 6GB and I don't think I can run this yet.
Replies: >>42174936 >>42298627
Anonymous
5/7/2025, 8:04:16 PM No.42174701
Bump.
Replies: >>42175724
Anonymous
5/7/2025, 10:21:51 PM No.42174936
>>42174105
uhh, the Colab file they provided seems to only do "text2music". Could you/somebody explain how that anon re-edited the OG song with new shitpost lyrics?
Replies: >>42175015
Anonymous
5/7/2025, 10:58:37 PM No.42175015
>>42174936
oh, just noticed it's in the repair->upload section. However, I tried to do a "replace X lyrics with new lyrics" and it really seems to suck ass at it, so I'm not sure if the anon that made the above song was lucky or had enough autism to spend several hours trying all kinds of combinations to make it work.
Replies: >>42175321
Anonymous
5/8/2025, 12:56:12 AM No.42175321
>>42175015
Nope, people posted multiple results in that thread where it Just Worked. The only thing I saw is that the quality will get worse the more the lyrics are changed.
Replies: >>42175929
Anonymous
5/8/2025, 4:17:28 AM No.42175724
>>42174701
Replies: >>42180991
Anonymous
5/8/2025, 5:56:56 AM No.42175929
>>42175321
Oh. I was trying to go for a full lyric replacement. I guess this GitHub is a step in the right direction, it's just not ready for my exact autistic requirements.
Hopefully by next year we will get improvements on it, because I have some text parody ideas.
Replies: >>42175971
Anonymous
5/8/2025, 6:19:36 AM No.42175971
>>42175929
I saw someone say that you can separate the stems and get better results. Perhaps you could edit portions of the lyrics one at a time, then mix them back into the instrumental.
VilligerANON
5/8/2025, 8:21:24 AM No.42176140
Question:
During training, can I use files tagged as clean and noisy files?
Replies: >>42176220
Anonymous
5/8/2025, 9:24:19 AM No.42176220
>>42176140
Sure, however keep in mind the quality of the audio outputs may suffer from it, especially if the ratio of good clips vs noisy clips is skewing towards the noisy side.
And since there are characters that have pretty much nothing but noisy audio (like Tree Hugger), the end results may vary from "kind of bad" to "surprisingly decent".
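For anyone wanting to check that ratio before training, here is a minimal Python sketch. It assumes (hypothetically) that noisy clips carry the word "noisy" somewhere in their filename; the function name and the example filenames are made up for illustration, so adjust the marker to however your dataset is actually tagged.

```python
def noisy_fraction(filenames, marker="noisy"):
    """Return the fraction of clips whose filename contains `marker`.

    The match is case-insensitive. The "noisy" marker is an assumption
    about how the clips are tagged, not a confirmed naming convention.
    """
    names = list(filenames)
    if not names:
        return 0.0
    needle = marker.lower()
    noisy = sum(1 for name in names if needle in name.lower())
    return noisy / len(names)


# Hypothetical filenames, purely for illustration.
example_clips = [
    "clip_001_Nothing_hello.flac",
    "clip_002_Noisy_over_here.flac",
    "clip_003_Very Noisy_wait.flac",
    "clip_004_Nothing_goodbye.flac",
]
print(f"noisy share: {noisy_fraction(example_clips):.0%}")
```

If the noisy share comes out well above half, it may be worth training on the clean subset first and only adding noisy clips if there is too little data.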
Anonymous
5/8/2025, 2:17:20 PM No.42176608
Question to the Anon that was working on OpenUtau diffsinger models: are you planning on creating the models for Rarity and Fluttershy?
Replies: >>42176655
DiffAnon
5/8/2025, 2:56:10 PM No.42176655
>>42176608
Truth be told, I was planning on it eventually, but I don't know if I really want to anymore. Twilight, Applejack, Rainbow Dash, and Pinkie Pie are a bit spotty as is, and I worry that with Fluttershy's abysmally low amount of singing data (from what I could find) and just not feeling up to it for her or Rarity, I don't think either of them are gonna be made into models anytime soon. Keep in mind, I don't just train one thing, I have to train the acoustic model, then the variance model, then the pitch model, and then fine tune the vocoder, which both takes a lot of time and a lot out of me. I'm not saying it won't ever happen, because I do feel weird about leaving things with just the four I did, but I can't for the life of me bring myself to do the other two just yet. But they'll come one day, hopefully.
Anonymous
5/8/2025, 3:31:54 PM No.42176713
Speaking of model training, there's still a good few voices that're absent on RVC. It'd be nice to see Moondancer and Cadance and whoever else hasn't been trained yet, Cadance has a model for RVC but it's super noisy.
Replies: >>42177027 >>42177542 >>42179060
Anonymous
5/8/2025, 6:07:05 PM No.42177027
>>42176713
>Moondancer
huh, you are correct, I will see if I can train her rvc model.
Replies: >>42177131
Anonymous
5/8/2025, 6:53:10 PM No.42177131
>>42177027
hmm, not great news: I've checked the Mega, and even when removing only the unusable, very noisy audio lines, there is still only 1m50s of audio, which is less than the ideal 3m, but I can still try.
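To check how much audio a character actually has, something like the sketch below could be used. It is only an illustration: it assumes the clips have been converted to plain PCM .wav files sitting in one folder, the function names are made up here, and it uses nothing but Python's standard library.

```python
import wave
from pathlib import Path


def total_wav_seconds(folder):
    """Sum the duration, in seconds, of every .wav file directly inside folder."""
    total = 0.0
    for path in sorted(Path(folder).glob("*.wav")):
        with wave.open(str(path), "rb") as clip:
            # duration = number of frames / frames per second
            total += clip.getnframes() / clip.getframerate()
    return total


def format_duration(seconds):
    """Format a duration in seconds in the XmYYs style used in the thread."""
    minutes, secs = divmod(int(round(seconds)), 60)
    return f"{minutes}m{secs:02d}s"
```

Calling `format_duration(total_wav_seconds("path/to/clips"))` on a folder of clips then gives a quick figure to compare against the roughly 3 minutes mentioned above.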
Anonymous
5/8/2025, 10:32:46 PM No.42177542
moondancer 1676307268380648
md5: 2a808657b6d6c408e5781d3e1ce9bd76
>>42176713
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/MLP_Moondancer
>https://vocaroo.com/1hV4kTcwCp3E
Here she is. The result isn't half bad, but for some reason her voice seems to be slipping into Rarity's voice range. And of course male input voice lines will sound a bit rougher in conversion.
Replies: >>42178958
Anonymous
5/9/2025, 8:17:02 AM No.42178459
>>42178450
more years! TRUST THE PLAN!
Anonymous
5/9/2025, 3:09:04 PM No.42178958
>>42177542
Awesome, thanks. I look forward to trying it once I have the time.
Replies: >>42179060
Anonymous
5/9/2025, 4:35:33 PM No.42179060
cadence emo 27711757171b56ff
md5: b8137183b8bb7ea807aff590fa0d41e0
>>42176713 >>42178958
>Cadance
>https://voca.ro/188F1imvN2L7
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/MLP_Cadance_Clean
RVC model of Cadance, trained on clean audio only.
Replies: >>42185313
Anonymous
5/10/2025, 6:44:52 AM No.42180991
>>42175724
Replies: >>42181427
Anonymous
5/10/2025, 12:10:12 PM No.42181427
>>42180991
Replies: >>42185114
VilligerANON
5/10/2025, 1:29:53 PM No.42181482
https://files.catbox.moe/x41lrp.wav
I have generated with this repo: https://github.com/CookiePPP/cookietts
Model from: https://drive.google.com/drive/folders/1nTyn6qr2b76aOE430trasuZj0Kr2H_ya
(Tacotron2: tt2_outdir_p3_2_0.5DFR_0.0Dropout)
(Hifi-gan cp_hifigan_universal44Khz_mlpft)
>Maybe I will create a better vocoder and Notebook
Replies: >>42181575 >>42191078
Anonymous
5/10/2025, 2:54:00 PM No.42181575
>>42181482
That's interesting Anon but I'm not sure on how it will compare with all the new tech, since tacotron is almost five years old.
Replies: >>42183033
Anonymous
5/11/2025, 4:06:32 AM No.42183033
>>42181575
I feel like there isn't much coming out for pony specifically in recent times though.
Replies: >>42183358
VilligerANON
5/11/2025, 8:09:01 AM No.42183358
Does anyone want any bonus features that I can add?

>>42183033
I know, right?
Replies: >>42183361 >>42183486
VilligerANON
5/11/2025, 8:10:01 AM No.42183361
>>42183358
> To the Inference Script
Anonymous
5/11/2025, 9:45:37 AM No.42183486
>>42183358
Well, I would like it if the offline GPT-SoVITS script also copied the HaySay options for the automatic emotions drop-down menu, as well as the audio clip slow-down/speed-up stretch settings, but that's something Vul would need to add to his webui script.
Anonymous
5/11/2025, 10:35:01 PM No.42184597
copyright ai 2025 1746974971241427
md5: 532002aeca70402b2d63d6d08c1839bc
>nitter.space/jason_kint/status/1921546181357838531
>nitter.space/LuizaJarovsky/status/1921286826402422927
>ai copyright to affect the "commercial use"
Time to split hairs on what counts as "commercial use" and what doesn't. Also good luck trying to force this on China and their no-fucks-given R&D departments.
Replies: >>42184600 >>42184613
Anonymous
5/11/2025, 10:36:29 PM No.42184600
>>42184597
>muttmerica
Phew, I thought it was actually serious.
Replies: >>42184628
Anonymous
5/11/2025, 10:41:03 PM No.42184613
>>42184597
>america keeps digging its grave in the name of "progress"
the soviet union fell behind in technology because the government tried to control things, but yeah, let's not learn anything from that.
Replies: >>42184628
Anonymous
5/11/2025, 10:49:11 PM No.42184628
>>42184600
I can see Disney and such trying to push for it, just like they did with hundreds of years of copyright laws, but as Anons on /g/ pointed out, all the big league companies need to do is buy portions of semi-big publishing companies and claim that retroactively all the existing books on the system were allowed to be used in AI training.
>>42184613
Tell me about it. I remember reading a biography of an electrician who was bribed to "not be in a hurry" when repairing the wheat moisture measuring apparatus, because the assigned inspector could then use a rule of thumb to decide how much moisture was in the transported grain and deduct from the farmers' pay while pocketing the spillway difference.
Anonymous
5/12/2025, 2:04:59 AM No.42185114
>>42181427
Replies: >>42185748
Anonymous
5/12/2025, 4:01:05 AM No.42185313
>>42179060
Your local AI still can't sing worth a shit.
Evolve or die, PPP.

Voice acting requires a certain melodic way of talking which your current model does not support, 3P General.
Replies: >>42185322 >>42187317
Anonymous
5/12/2025, 4:07:09 AM No.42185322
>>42185313
There are no more than ten anons itt, all namefags, that know their shit, and they lead very busy lives. This thread was just anons enjoying the fruits of others' labors. There are no more fruits to enjoy, or worth enjoying so the Pony Preservation Project has become the Pony Preservation Project Preservation Project. It's over.
>Mareification not required.
Replies: >>42186106
Anonymous
5/12/2025, 8:36:51 AM No.42185748
>>42185114
Replies: >>42190922
Anonymous
5/12/2025, 2:45:22 PM No.42186106
>>42185322
yeah, back in 2019 + 20 everybody was hyped since the show had only just ended and the board was still pretty alive (and with everyone locked up, all they could do was make pony content without any distractions). Now a lot of the AI tools have become available (music, art, even animations) but everything is kind of disjointed and difficult to put together.
Replies: >>42187314
Anonymous
5/12/2025, 5:17:15 PM No.42186245
unpopular demand
md5: 4f60829e986926ad5e499613522346fd
I feel Anons just need to find a proper spark, something that would be fun to work on, like randomly spotting a song and wondering how it would sound if it was done by pony.
>https://files.catbox.moe/qg2qn5.mp3
Anyhow, VS singing the new Ye song, OG cover from TowerGangToad. I really wanted to use Zecora's voice, but the voice clips just wouldn't come out right from either of the model types.
Replies: >>42188332
Anonymous
5/13/2025, 1:57:24 AM No.42187314
>>42186106
Don't forget that a lot of new stuff gets immediately corpo'd these days too. Shit like that stifles innovation.
Anonymous
5/13/2025, 1:59:29 AM No.42187317
>>42185313
>melodic way of talking
China is the future
75.ai
5/13/2025, 2:01:08 AM No.42187322
I will save this general.
Replies: >>42189535
Anonymous
5/13/2025, 8:36:14 AM No.42188332
>>42186245
Try replicating S1 Luna's voice. Chip in some money and put Tabitha to voice it.
Replies: >>42188430
Anonymous
5/13/2025, 10:00:48 AM No.42188430
>>42188332
>S1 Woona
It's technically doable.
https://huggingface.co/spaces/Plachta/VALL-E-X
https://desuarchive.org/mlp/thread/40503961/#40518915
It will just take about 1~6 months of non stop generating audio until the artificial dataset has five minutes worth audio clips.
Anonymous
5/13/2025, 11:54:32 AM No.42188537
>>/wsg/5872172
I want this, but for ponies. Dubbing in my country is cursed: either the VAs will put their energy into emphasizing the wrong aspect of a character (a young rogue-like adventurer will instead sound like a snotty little shit), give no shits about acting at all, or the role goes to somebody that completely does not fit the character.
Replies: >>42188542
Anonymous
5/13/2025, 11:58:25 AM No.42188542
>>42188537
>https://files.catbox.moe/yck7ps.mp4
fug, crossposting failed
Anonymous
5/13/2025, 9:36:08 PM No.42189535
>>42187322
Anonymous
5/14/2025, 2:19:54 AM No.42190366
>>42188455
Would be funny if that happened.
Anonymous
5/14/2025, 7:21:08 AM No.42190922
>>42185748
Replies: >>42192724
VilligerANON
5/14/2025, 8:36:01 AM No.42191078
>>42181482
I've updated the synthesis script, and now these are the new results
>https://files.catbox.moe/tv8c4i.wav
Does it sound like those 48 kHz MMI models, or does it sound like newer tech?
Replies: >>42191285
Anonymous
5/14/2025, 9:42:12 AM No.42191153
https://www.minimax.io/audio
https://minimax-ai.github.io/tts_tech_report/
Replies: >>42191285
Anonymous
5/14/2025, 12:30:28 PM No.42191285
>>42191078
Is that TTS or voice conversion? It still has that funny buzzing that tacotron2 / talknet models suffered from, so it's kind of hard to tell.
>>42191153
hmm, the website does not seem to be more useful than other TTS sites. BUT the paper is interesting; if the cloning from 5 seconds is not completely cherry-picked bullshit, I would love to be able to use it.
Replies: >>42191588
VilligerANON
5/14/2025, 5:08:48 PM No.42191588
>>42191285
TTS.
> The repo:
> https://github.com/TheDevloper2023/cookiettsfork/tree/master/CookieTTS
> which is a fork of https://github.com/CookiePPP/cookietts/tree/master
Anonymous
5/15/2025, 2:19:42 AM No.42192724
>>42190922
Replies: >>42193438
Anonymous
5/15/2025, 8:39:58 AM No.42193438
>>42192724
15
5/16/2025, 7:15:46 AM No.42195922
Hi, it's been a while, hasn't it?
Here's an alpha website that you can play around with: https://alpha.15.dev/
The backend is currently running on just two GPU instances, and I've set the inference batch size to 1 since this new model requires a lot more computational power than it did two years ago. I can increase the number of GPUs depending on how long each request takes.
More characters and emotions will come soon. Feel free to report any bugs or issues here, too.
Replies: >>42195946 >>42196013 >>42196073 >>42196204 >>42196230 >>42196243 >>42196274 >>42196298 >>42196305 >>42196355 >>42196384 >>42196390 >>42196393 >>42196435 >>42196479 >>42196514 >>42196530 >>42196639 >>42196646 >>42196654 >>42196738 >>42196754 >>42196789 >>42196790 >>42196800 >>42196801 >>42196849 >>42196867 >>42196960 >>42197340 >>42197606 >>42198611 >>42198701 >>42204454 >>42204550 >>42204573 >>42204939 >>42205939 >>42205963 >>42206122 >>42207220
Anonymous
5/16/2025, 7:24:07 AM No.42195946
>>42195922
holy shit
Anonymous
5/16/2025, 7:50:37 AM No.42196013
>>42195922
I hate your guts, sleazebag
Anonymous
5/16/2025, 8:08:40 AM No.42196073
1731203818328
md5: 6b822b24ea8124aa686df69f5e1791e5
>>42195922
>https://alpha.15.dev/examples
nice examples kek
VilligerANON
5/16/2025, 9:04:58 AM No.42196204
>>42195922
>https://alpha.15.dev/
Can I send this outside of this thread?
Replies: >>42196227
15
5/16/2025, 9:25:55 AM No.42196227
>>42196204
Sure, go ahead. I'll make an official post on Twitter soon, probably within the next few days.
Anonymous
5/16/2025, 9:30:00 AM No.42196230
>>42195922
I'm kneeling so hard rn it hurts
Anonymous
5/16/2025, 9:39:57 AM No.42196243
ponk kneel
md5: 25deb92fe79acd9c390308ce458c1fdb
>>42195922
I have no choice but to kneel
Anonymous
5/16/2025, 9:57:20 AM No.42196274
15341327859632555
md5: 81f7541926f1387b62947fe3441d73bc
>>42195922
IT'S HAPPENING!
Anonymous
5/16/2025, 10:08:04 AM No.42196298
2774473
md5: 289eff186937f0f61d0a474b5eb56758
>>42195922
https://files.catbox.moe/k18mof.mp3
Three stars and now this? We are so fucking back boys!
BGM
5/16/2025, 10:11:18 AM No.42196305
GASPS
md5: e44c385cebda47eaf89133bf11bbd75e
>>42195922
https://files.catbox.moe/01otal.wav
Woah, hi again.
New model's sounding better than ever before. Good speed, emotion settings all work reliably, sounds clear. At the moment it sounds like the characters fall out of how they're supposed to sound on occasion though. Rarity in particular with the fear emotion gives some very strange outputs.
https://files.catbox.moe/k1kvsc.wav

Also, as a UI note, the change notifications upon switching settings and voices blocks the generation button on some resolutions when scrolled up. Only for a second, but it can still delay things.
Replies: >>42196384 >>42196896
Anonymous
5/16/2025, 10:25:07 AM No.42196337
Dear Hydrus Beta, as everyone gets really hyped for the return of 15ai, I just want to say I appreciate your work; thanks to HaySay I was able to do all the fun mare music conversions. I hope you will keep it alive and updated as new voice AI shows up in the future.
Replies: >>42196341 >>42198701
BGM
5/16/2025, 10:28:19 AM No.42196341
>>42196337
Seconding this, Haysay is a godsend for my workflow on music projects.
Replies: >>42198701
Anonymous
5/16/2025, 10:41:40 AM No.42196355
>>42195922
https://u.pone.rs/whgPbfzU.mp3
Replies: >>42196384
Anonymous
5/16/2025, 10:53:06 AM No.42196384
1412208
md5: 0ea43b256bb557b898a14008ea581b81
>>42195922
>new site
>>42196305
>new shitpost
>>42196355
>new smutty
brings me back
Anonymous
5/16/2025, 10:57:01 AM No.42196390
anon, i'm chubby
md5: cac0987932e9871c0ca4e583d50b6e40
>>42195922
https://voca.ro/140YNkYngHyz
Anonymous
5/16/2025, 10:57:45 AM No.42196393
>>42195922
Godlike web dev skills god fuckin damn
Anonymous
5/16/2025, 11:23:30 AM No.42196435
1736796030484316
md5: eda98636ec9499865577b1f9674730a2
>>42195922
https://u.pone.rs/EcUvtwYk.mp3
Anonymous
5/16/2025, 12:18:15 PM No.42196476
I hope he will add the old "|" emotional control from the previous website, since the clip reference one is pretty wishy washy. Having both would be pretty perfect to fine tune the output audio.
Anonymous
5/16/2025, 12:20:40 PM No.42196479
>>42195922
I can't believe waiting two weeks (a few times) actually worked!
Anonymous
5/16/2025, 12:46:26 PM No.42196514
a
md5: 757186edfb8d8ad3ddd3725b2009b32d
>>42195922
Yep, it's been a while, cool website.
Let me nit pick on flicker during that transition animation.
Replies: >>42196526
Anonymous
5/16/2025, 12:51:18 PM No.42196526
bad end
md5: 36a32b3978372c46458373d630f31fe6
>>42196514
literally unplayable
Anonymous
5/16/2025, 12:52:56 PM No.42196530
>>42195922
Curious, how much (if any) AI did you use to make the website?
As for the framework.. React + Next.js? Looks good.
And welcome back.
Anonymous
5/16/2025, 1:18:04 PM No.42196569
15chill
md5: 42ce3047bba0dd6863771f511f4f5695
>there is site OC
I'm so sorry bro, but the internet rules demand it.
Replies: >>42196571
Anonymous
5/16/2025, 1:18:51 PM No.42196571
>>42196569
qt oc, whose artstyle is that
Replies: >>42197112
Anonymous
5/16/2025, 1:39:00 PM No.42196589
>https://u.pone.rs/mLbrNDQB.mp3
Let's test this new site. Gin Blossoms - Hey Jealousy, done with Glimmer RVC to Sovits5 singing model (sounds ok, but I was hoping it would be better).
Anonymous
5/16/2025, 2:09:35 PM No.42196639
2309223
md5: d24b5312f5d10b4275e8e8e0ae2445c6
>>42195922
https://vocaroo.com/1bITXue82eed
Anonymous
5/16/2025, 2:14:37 PM No.42196646
>>42195922
WE ARE SO FUCKING BACK LIKE NEVER BEFORE
Anonymous
5/16/2025, 2:17:29 PM No.42196654
>>42195922
we got 15.ai revival before gta 6
Anonymous
5/16/2025, 2:46:29 PM No.42196683
>>42161191 (OP)
I know I speak to the dedicated deluded, but the machine is not the path.
Replies: >>42196751 >>42264821
Anonymous
5/16/2025, 3:17:03 PM No.42196738
>>42195922
awesome work but damn we really need an S1 Dash voice preset or something. nu-Dash voice is fucking nails on a chalkboard.
Replies: >>42196755
Anonymous
5/16/2025, 3:26:05 PM No.42196751
>>42196683
Get a hobby you poor creature.
Anonymous
5/16/2025, 3:26:57 PM No.42196754
>>42195922
Can we get an ETA on when you are open sourcing this?

I think it is an obvious concern that this will all suddenly disappear for years again.
Replies: >>42196757 >>42196772 >>42196778
Anonymous
5/16/2025, 3:27:31 PM No.42196755
>>42196738
I'd say completely exclude post S3 audio for mane six. Of course it's needed for side characters who lack speaking lines, but it's better to avoid when possible.
Replies: >>42196758
Anonymous
5/16/2025, 3:28:16 PM No.42196757
>>42196754
About 14 days or so
Replies: >>42196778
Anonymous
5/16/2025, 3:28:17 PM No.42196758
>>42196755
*S2
Poopsikins
5/16/2025, 3:33:52 PM No.42196768
https://files.catbox.moe/o4z53n.mp3
Anonymous
5/16/2025, 3:37:45 PM No.42196772
>>42196754
one more fortnight
VilligerANON
5/16/2025, 3:41:40 PM No.42196778
>>42196757
>>42196754
How do you know that?
Replies: >>42196785
Anonymous
5/16/2025, 3:46:22 PM No.42196785
>>42196778
Sounds like you're not trusting the plan
Anonymous
5/16/2025, 3:48:53 PM No.42196789
praisenuke
md5: 7dfed0e350184fafe862a8edd4fa0fcd
>>42195922
CHUDDA ETERNALLY BTFO
IT'S HAPPENING
Poopsikins
5/16/2025, 3:50:05 PM No.42196790
Screenshot (61)twiedit
md5: 483ca4d794a418a31fbd51f4d000e12b
>>42195922
https://files.catbox.moe/9gopqy.mp3
Anonymous
5/16/2025, 4:00:30 PM No.42196800
1721399816307876
md5: 1a5d57a30f374ecc4faf2007563daafe
>>42195922
Your shit is obsolete, yes that's what happens when you sit on your ass for years with proprietary software. Thanks for GPTSoVits and other solutions. You should have disappeared with your website, at least that wouldn't have tainted the few good memories left when using your tool. Fuck you and your five hours of fame you needed to still feel relevant.
Replies: >>42196809 >>42196814 >>42196826
Anonymous
5/16/2025, 4:00:36 PM No.42196801
>>42195922
One kinda big problem, it won't let me use the ' sign for words... which is weird since a lot of words like don't and isn't NEED that sign.
Replies: >>42196803 >>42196820
Anonymous
5/16/2025, 4:02:11 PM No.42196803
1665
md5: 0c70337ffb8b4141becfd4c2551ec813
>>42196801
You do not need that.
Replies: >>42196810
Anonymous
5/16/2025, 4:06:29 PM No.42196809
1595119616135
md5: b00e5a49df0d7b9d522ffe8bfddaddbe
>>42196800
shut up, nigger
Anonymous
5/16/2025, 4:07:00 PM No.42196810
>>42196803
You're right, I don't, but if 15 can fix that, it'd be a big help. Otherwise, the ai second guesses the pronunciation for the words, and it's just... I dunno, I just think it would be a good QOL fix.
Anonymous
5/16/2025, 4:08:28 PM No.42196814
>>42196800
Total barbietranny death.
Anonymous
5/16/2025, 4:14:02 PM No.42196820
Screenshot 2025-05-16 101311
md5: c345b4de06f10572f5325fd1d8edabd1
>>42196801
YES HE FIXED IT!! Thank you 15!
Anonymous
5/16/2025, 4:16:22 PM No.42196826
>>42196800
It does sound like ass. It's a shame because they're ponies.
Anonymous
5/16/2025, 4:29:47 PM No.42196849
>>42195922
>>>/g/105281388
Anonymous
5/16/2025, 4:44:11 PM No.42196867
1004
md5: b6268291d01afd634b7a3d8655da7ccf
>>42195922
Nightmare Moon has a huge improvement from her previous voice that just sounded like drunk Cheerilee
https://voca.ro/1j9J3CBPQqWN
Replies: >>42196897 >>42196906 >>42197553
Poopsikins
5/16/2025, 5:10:36 PM No.42196896
01
md5: e06d8c8d85d24f0b2f0c2fcde7eecccf
>>42196305
https://files.catbox.moe/ryyshr.mp3

https://files.catbox.moe/nu5qft.mp3

https://files.catbox.moe/urd6et.mp3

Gosh, I've missed this so much. Posting like this takes me back.
Replies: >>42196899 >>42196914 >>42196923
Anonymous
5/16/2025, 5:11:05 PM No.42196897
>>42196867
OKAY DAMN that actually sounds dynamic! I love it!
Anonymous
5/16/2025, 5:12:42 PM No.42196899
>>42196896
Derpy, Maud, and Rainbow Dash, right? It's great that I can actually recognize the voices, to be honest.
Replies: >>42196902
Anonymous
5/16/2025, 5:15:05 PM No.42196902
>>42196899
>Derpy
>It's great that I can actually recognize the voices
Replies: >>42196947
Anonymous
5/16/2025, 5:15:50 PM No.42196906
1595217799709
md5: 2f994f6de1ea76cc7db5d75eb2c74273
>>42196867
Nice!
Anonymous
5/16/2025, 5:20:14 PM No.42196914
CHICKEN JOCKEY!
md5: ec9a33673bb2ac9da72bb6f846587d39
>>42196896
https://voca.ro/1jlDvvakwJgi
Poopsikins
5/16/2025, 5:24:13 PM No.42196923
29
md5: 530bbe6718cb4100de3c1478234c9eeb
>>42196896
https://files.catbox.moe/l8ex9a.mp3
Replies: >>42196985
Anonymous
5/16/2025, 5:38:14 PM No.42196947
>>42196902
Is that not Derpy? I thought because of the "clumsy" mistake and the familiar tone that it was her.
Anonymous
5/16/2025, 5:43:44 PM No.42196960
>>42195922
https://u.pone.rs/moQGuPxl.mp3
Poopsikins
5/16/2025, 5:53:17 PM No.42196985
1606023
md5: e9c3312e534300342a751ca5de72109d
>>42196923

last one from me tonight.
https://files.catbox.moe/esztvq.mp3
Anonymous
5/16/2025, 6:43:19 PM No.42197112
>>42196571
I know who the artist is, but I would rather not tell you directly.
he draws fuck tons of futa.
Anonymous
5/16/2025, 8:18:12 PM No.42197294
f-MLP509__553B.xfl_s-_tPL_sCharacter.sym_f0000-0064.webp.006
https://files.catbox.moe/qoia1a.wav

Luna's crash-out in A Royal Problem if she wasn't fucking around.
Replies: >>42197297
Anonymous
5/16/2025, 8:19:59 PM No.42197297
>>42197294

https://files.catbox.moe/hhwgsc.mp3

mp3 like it should've been from the beginning lol.
Vogelfag revealed
5/16/2025, 8:36:43 PM No.42197340
are you kidding me
md5: 15113750f72d4a917de4172bec0d1957
She sounds angry & sarcastic which is how I feel, but still unintended on my part.
https://pub-f3186dbecfd64ac085ddc742fc900f59.r2.dev/twilight_sparkle_neutral_1747418267794_variation0.wav

>>42195922
>Feel free to report any bugs or issues here, too
Yeah I see several bugs:
0. You're still not willing to jew out despite clearly needing the money and influence. Jew out or others will outjew you. Stop being a social recluse that's how all scientists die. Learn to sue everyone cause 11.AI clearly stole your technology you moron.
1. You're not open sourcing this to the community (which are of minimal help and lack money to pay for GPUs but they're willing to learn and are very loyal and creative despite me trashtalking them myself back in October)
2. I'm pretty sure ElevenLabs, Udio.AI, SUNO.Ai, etc. stole your technology and perfected it already since 90% of the singing & talking sounds like Tara Strong, Rebecca Shoichet & Ashleigh Ball. The AI can really sing too. To an audiophille it still sounds bad, but to a normie it sounds perfect. Get a fucking marketing team, both you and Tara Strong fucked each other up and should sue every single audio AI possible.
This is what Suno Ai can do right now with the paid model:
https://www.youtube.com/shorts/udOgG0M8pVI


3. Your options & UI is still limited. If I could search a reference line to use any emotion I want without typing in phonetics then that'd be useful for the average normie. You didn't understand what I just told you, did you? LET ME USE THE REFERENCE LINE TO QUICKLY & INSTINCTIVELY USE THE EMOTION I WANT. WE HAVE AN IMPECCABLE MEMORY OF THE SHOW'S DIALOGUE LINES.
Add a voice changer/voice to audio option. It would be so much more intuitive because the AI could hear what emotion I'm going for instantly.

Today's AI still lack a ton of UI options but are getting there at an insanely quick speed such as Suno's ability to grab an existing song and have either the same singer or a new singer sing the same notes with different lyrics.

Today's AI still sounds like an untrained voice actor slurring his lines on purpose and it still sucks compared to audiophille standards, but your current robot sounding AI is dreadful by normal standards. You still haven't learned how to remove the noise?
https://www.youtube.com/watch?v=qu5nnMOQ4VU&ab_channel=A
https://www.youtube.com/watch?v=I1Dy0Zfw6Qs&ab_channel=votums

3.5 You probably didn't notice cause you're not a voice director or you're autistic but ... S1 and S2-S9 's voice directing is completely different. 90% of the dialogue lines used in S2-S9 used only these emotions; depressed, angry, flirty, ANXIOUS, TIRED, reading-off-a-script-at-gunpoint. And that's the acting ... the voices?

In S2+ everyone sounds...
Twilight sounds much lighter in S2+
Applejack & Dash sound much deeper and not in a suave way.
Pinkie sounds way lighter & screechier.
Fluttershy always sounds anxious

Rarity & Spike kinda sound the same.


4. One more thing...
Replies: >>42197345 >>42197358 >>42197474 >>42197618
Anonymous
5/16/2025, 8:40:42 PM No.42197345
>>42197340
Fuck off retard.
Vogelfag revealed
5/16/2025, 8:44:20 PM No.42197358
>>42197340

4.
Contact the original voice actors and work together with them. Give me S1 Woona's voice and all is forgiven on my side. ;) Can't say others will forgive you for being a weak leader. These effeminate pussies need a strong leader and I suggest you do too if you can't march down 11Labs HQ and sue the living shit out of them together with Tara Strong. Sounds jewish but that's the truth. You got to outjew the jew in a jewish world. Mrs Strong knows that. I know that. Why can't you fucking comprehend that?
https://youtu.be/wbzRRp2jRHw?t=103

This is what voice acting AI sounds like now:
https://www.youtube.com/watch?v=lPAtoR3YCSc&ab_channel=UndeadHumor
https://www.youtube.com/watch?v=0j1eX7F8OOo&ab_channel=DevilArtemis

BUT I'M GUESSING YOU ALREADY KNOW THAT YET YOU STILL REFUSE TO DO SOMETHING ABOUT IT.
Call your father or something for God's sake, you college pussy kid. Your technology is being stolen under your nose and improved upon tenfold(by jews, not your followers) and you're here moping like a pussy on Twitter and then coming back with a niche version that does 1 thing barely any better and still sucks dick at the other 9 things that goes into audio.
CAN YOUR MODEL AT LEAST SING RIGHT NOW? Cause SUNO's shit can and Udio used to sing good before they had to neuter it because the record companies were after their asses. Why aren't you after their asses as well?

God you need a father in your life, kid. A father to watch over you and learn to sue and break skulls for you cause jesus christ after that twitter whine ... you're still a pussy who refuses to BE A MAN AND SUE THE LIVING SHIT OUT OF ELEVEN LABS FOR STEALING YOUR MODEL. Give Tara Strong a call too. Do you want me to do it for you?

Respectfully yours, the redpiller known as Vogelfag.
Replies: >>42197369 >>42197618
Anonymous
5/16/2025, 8:50:34 PM No.42197369
>>42197358
I uh... 15 maybe should've been a bit better at leading, but WOW this is kinda rough. But they say the truth hurts... wait, aren't we only operating under the ASSUMPTION that ElevenLabs stole his work though?
Replies: >>42197487
Anonymous
5/16/2025, 9:22:21 PM No.42197463
oh boy the schizos are out now
Anonymous
5/16/2025, 9:25:50 PM No.42197474
>>42197340
no ones reading that
Anonymous
5/16/2025, 9:33:36 PM No.42197487
>>42197369
no one cares vogelfag
BGM
5/16/2025, 9:52:57 PM No.42197553
Celly Appears
md5: dea7f11bf1d282009aee5691ce6d9b39
>>42196867
https://u.pone.rs/HEiyutXb.mp3
Replies: >>42197804 >>42197812 >>42197816 >>42198118 >>42198418
Anonymous
5/16/2025, 10:07:03 PM No.42197606
glim sexo
md5: 633c7dbb02e232272d889a66b6f027bd
>>42195922
Btw
https://voca.ro/14Y5dHWMbMpx
Anonymous
5/16/2025, 10:10:47 PM No.42197618
>>42197340
>>42197358
Your words are wasted on that idiot. 15. He was always a pretentious egomaniac and I'm glad the era where we didn't have any viable alternative is long gone. He's not even competing with the current opensauce options, let alone the paid ones.
Replies: >>42197624
Anonymous
5/16/2025, 10:11:59 PM No.42197624
>>42197618
what are the opensauce alternatives
Replies: >>42197645
Anonymous
5/16/2025, 10:15:51 PM No.42197645
>>42197624
https://github.com/effusiveperiscope/GPT-SoVITS
Replies: >>42197653 >>42199403
Anonymous
5/16/2025, 10:17:13 PM No.42197653
>>42197645
isn't that what haysay uses? it doesn't sound as good as this though
Anonymous
5/16/2025, 11:01:17 PM No.42197804
284448__suggestive_artist-colon-hotdiggedydemon_rainbow+dash_pegasus_pony_-dot-mov_shed-dot-mov_g4_animated_female_mare_pony-dot-mov_solo_swag_throbbing_throbb
>>42197553
Anonymous
5/16/2025, 11:06:56 PM No.42197812
>>42197553
Holy fuck. Please make a full length version of this.
Anonymous
5/16/2025, 11:07:45 PM No.42197816
rarewow
md5: 24be2b901ab3816c55d951e0299e342d
>>42197553
Anonymous
5/17/2025, 12:40:20 AM No.42198118
>>42197553
Incredible. please keep going.
Anonymous
5/17/2025, 2:05:11 AM No.42198418
>>42197553
Damn, am I going to have to help finish what I've started?
Replies: >>42198427
Anonymous
5/17/2025, 2:08:20 AM No.42198427
>>42198418
Please, I'm begging you. Make more
Anonymous
5/17/2025, 3:10:36 AM No.42198611
>>42195922
Great to have you back, the new website looks fantastic.
Some notes after a few hours of testing (mainly with Rainbow and Twilight on happy and neutral):

I noticed that speech will often sound unnatural with a "rough" sort of sound, especially at the end of sentences. It's been taking a lot of re-rolls to get outputs that sound natural throughout. As ever I'm finding it very hard to articulate exactly why a lot of outputs sound off or spot trends. Been thinking about what exactly to say here for quite some time but I think it'll be more effective to just use the report feature on any examples I come across from now on. The voices generally sound very accurate to the ponies and there's already plenty of good examples ITT, so the potential is clearly there.

Things like the Twilight #3 on the example page are common issues with the "rough" sound - "aviation AH0 N", "fly AY1", "fat AE1" "ground AW1 N D".
Pretty sure this was an issue in previous versions of 15.ai, particularly the tendency to slip up at the end of sentences.
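For anyone reporting these, it helps to know what the digits mean. The transcriptions above are ARPABET: the number stuck on a vowel is its stress (0 = none, 1 = primary, 2 = secondary), and consonants carry no digit. A minimal sketch for pulling that apart when collecting problem words; the function name `split_stress` is made up for illustration and has nothing to do with how the site itself parses input:

```python
import re

def split_stress(phonemes):
    """Split an ARPABET string such as 'AW1 N D' into (symbol, stress) pairs.

    Stress is an int (0, 1 or 2) for vowels and None for consonants.
    Hypothetical helper for note-taking, not part of any site or library.
    """
    pairs = []
    for token in phonemes.split():
        match = re.fullmatch(r"([A-Z]+)([0-2])?", token)
        if match is None:
            raise ValueError(f"not an ARPABET token: {token!r}")
        symbol, stress = match.group(1), match.group(2)
        pairs.append((symbol, int(stress) if stress is not None else None))
    return pairs

# The four trouble spots listed above
for word, phones in [("aviation", "AH0 N"), ("fly", "AY1"),
                     ("fat", "AE1"), ("ground", "AW1 N D")]:
    print(word, split_stress(phones))
```

Three of the four rough spots sit on a primary-stressed vowel, which might be worth mentioning in reports, though that is only a guess from four samples.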

Short sentences (~three words or less), especially when generated on their own with nothing before or after, are consistently bad.

"Anon" is often pronounced wrong, tends to get split into either "A Non" or "An On" and is spoken with a little break between them like they're two separate words.

I'm tentatively thinking that reliance on reference lines from the show to control delivery, emotion, pacing etc in the output (I assume that's what the model is doing) may not actually be the best idea. It's great if the reference line that gets picked happens to match how you want the output to sound, but more often than not it won't and you'll be totally boned if there's no match at all. Even if there is a reference line that matches, you'll still need to take the time to find it or rely on RNG for it to be used.
I won't speculate any further on this for now since I don't know exactly how the reference lines influence the model. Would be good if you could fill in some blanks here.

Not yet found any bugs with the site, but I do have some feature requests:
1 - An option to automatically play new audio as soon as generation is complete.
2 - A button on the outputs to immediately regenerate with the same settings.
3 - Report function is useful, suggest also adding a thumbs up icon or similar to highlight when the model does well.
4 - Not sure if it's my browser, but the download button always opens the audio in a new tab where I then have to click the three dots icon to download. All those extra mouse clicks quickly add up.

Hope that's helpful, you're doing great work here.
Replies: >>42199123
HydrusBeta
5/17/2025, 3:41:08 AM No.42198701
>>42195922
Oh wow. Welcome back, 15! I am really happy to see you have a site back up, and the UI is slick.

>>42196337
>>42196341
Thank you for the kind words. I plan to keep Hay Say running. I am glad you have found it useful.
Replies: >>42200231
15
5/17/2025, 7:06:21 AM No.42199123
>>42198611
>"Anon" is often pronounced wrong, tends to get split into either "A Non" or "An On" and is spoken with a little break between them like they're two separate words.
This was because the dictionary had an incorrect transcription for "anon"; this has been fixed. If you run into any similar problems like this, you can report a transcription by hovering over the colored box and clicking the report button.
>1 - An option to automatically play new audio as soon as generation is complete.
>2 - A button on the outputs to immediately regenerate with the same settings.
>3 - Report function is useful, suggest also adding a thumbs up icon or similar to highlight when the model does well.
>4 - Not sure if it's my browser, but the download button always opens the audio in a new tab where I then have to click the three dots icon to download. All those extra mouse clicks quickly add up.
Done.
Anonymous
5/17/2025, 10:07:09 AM No.42199403
>>42197645
15, is the model just GPT-SoVITS, but fine tuned on MLP?
Anonymous
5/17/2025, 1:38:21 PM No.42199629
6898659
md5: 78156a4716f56fbd3e25fc0576347c20
https://voca.ro/1mlZCjsv6tJ2
Dang, this is pretty good.
Anonymous
5/17/2025, 7:48:30 PM No.42200231
bread hd
md5: dafed05db5035483a83debf164ac3853
>>42198701
>haysay is down
I am this close to considering selling my kidney for a good gpu
Replies: >>42200380
HydrusBeta
5/17/2025, 9:00:10 PM No.42200380
>>42200231
What odd timing. Thanks for letting me know. The site should be back up now. The EC2 instance got in a weird state where it became unreachable again.
Replies: >>42200397 >>42204296
Anonymous
5/17/2025, 9:05:23 PM No.42200397
>>42200380
The amazon anti-brony lobby is getting stronger by the day. btw what would be the requirements for haysay if I wanted to run it locally at its full capacity?
Replies: >>42200690
HydrusBeta
5/17/2025, 10:50:24 PM No.42200690
>>42200397
Hay Say can run on most machines, but will be very slow on older hardware. I do not recommend running it on Apple silicon because it is very slow on that hardware (to the point that it's basically unusable). I recorded some benchmarks on several machines, which may give you a clue as to how long it will run on yours:
https://github.com/hydrusbeta/hay_say_ui?tab=readme-ov-file#testing-data--benchmarks
Having a GPU is not required.
Replies: >>42200707
HydrusBeta
5/17/2025, 10:53:04 PM No.42200707
>>42200690
Oh, I forgot to mention that you need a LOT of hard drive space (about 100 GB now), and having at least 12 GB RAM is recommended.
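If you want a quick sanity check before downloading anything, something like the sketch below would do. It only encodes the two numbers given above (about 100 GB of disk, 12 GB of RAM recommended); the function names are made up for illustration and this is not part of Hay Say. RAM is passed in by hand because there is no portable way to read it from the Python standard library.

```python
import shutil

MIN_DISK_GB = 100  # "about 100 GB now"
MIN_RAM_GB = 12    # "at least 12 GB RAM is recommended"

def free_disk_gb(path="."):
    """Free space in GB on the drive that holds `path`."""
    return shutil.disk_usage(path).free / 1e9

def check_requirements(disk_gb, ram_gb):
    """Return a list of warnings; an empty list means both numbers are met."""
    warnings = []
    if disk_gb < MIN_DISK_GB:
        warnings.append(f"only {disk_gb:.0f} GB free, about {MIN_DISK_GB} GB needed")
    if ram_gb < MIN_RAM_GB:
        warnings.append(f"only {ram_gb:.0f} GB RAM, {MIN_RAM_GB} GB recommended")
    return warnings

if __name__ == "__main__":
    # Replace 16 with however much RAM your machine actually has.
    for line in check_requirements(free_disk_gb("."), ram_gb=16) or ["looks fine"]:
        print(line)
```

Passing this check says nothing about speed; the benchmark table linked above is the better guide for that.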
Anonymous
5/18/2025, 1:50:23 AM No.42201300
Up.
Replies: >>42201855
Anonymous
5/18/2025, 7:36:21 AM No.42201855
>>42201300
Anonymous
5/18/2025, 6:51:45 PM No.42202578
>back to being dead
come on
Anonymous
5/19/2025, 3:12:45 AM No.42203845
1747616891426350
md5: 4f0806c456bed7269f9e2f628638cae9
https://x.com/fifteenai/status/1924269599542968655
Replies: >>42203862 >>42204138
Anonymous
5/19/2025, 3:25:06 AM No.42203862
tenor
md5: 68f57370a19ff0b60c559678836bcb99
>>42203845
Anonymous
5/19/2025, 6:12:26 AM No.42204138
1740018418195642
md5: 3651523a503e19c0898f7b6cda55e5f5
>>42203845
>Discord server
Kek.

>I just added 4 more GPU servers because of the huge number of requests coming in. This is actually going to bankrupt me.
You know, you could just... open source it?
Then you wouldn't have to pay for any of it, you wouldn't be expected to constantly maintain it (this has been a recurring issue, let's be honest), and you would meet your original promises.
Replies: >>42204139
Anonymous
5/19/2025, 6:13:06 AM No.42204139
>>42204138
shut up retard
Replies: >>42204140
Anonymous
5/19/2025, 6:14:18 AM No.42204140
>>42204139
>t. 15
Replies: >>42204141
Anonymous
5/19/2025, 6:14:48 AM No.42204141
>>42204140
shut up retard
Anonymous
5/19/2025, 7:18:39 AM No.42204296
1717711190370428
md5: 6b3c4fad24eebe152411f077ad0454de
FYI, GPT-SoVITS v4 came out.
While v3 downgraded the quality, they boosted it back to 48 kHz and it arguably sounds much more natural.
There's a good report here: https:// 8 chan.moe/ais/res/6258.html#q11121
>Ref: https://voca.ro/13vsNeBHC2Xu
>Best result I got from v4: https://voca.ro/1j2I5rUzAZxj
>Same example with v2 (the end was cut due to my shitty api): https://voca.ro/11qFHhR7HtG1
This is the only comparison I've heard so far though, seems like it was a very silently received release. Needs to be tested more.

>>42200380
If you could look into adding v4 to Haysay (assuming it does hold up with pony voices), that'd be much appreciated.
Replies: >>42204476
Anonymous
5/19/2025, 8:52:34 AM No.42204454
Capture
md5: cffd9d6ad2fc81c559805cc7ee5770f1
>>42195922
You make a cute couple.
Replies: >>42204478
Anonymous
5/19/2025, 9:08:02 AM No.42204476
1738995549982678
md5: 60d11eec217a797f6d799fbe0e899076
>>42204296
Trying it out.
Oh boy, new setting under SoVITS Training. Guess I'm leaving that at the default 32 for now.
Anonymous
5/19/2025, 9:09:47 AM No.42204478
>>42204454
>hecking mare
>she/pony
He's just having a laugh, r-right?
Anonymous
5/19/2025, 9:51:12 AM No.42204529
Screenshot_20250519-035011
md5: 48d2798c5f6c7bed0fefb383dbef6ad0
Inb4 they ban saying bad words with the ai
Replies: >>42204536
Anonymous
5/19/2025, 10:05:38 AM No.42204536
>>42204529
discord can ban over stuff like saying nigger iirc if people report it
I doubt any text restriction will be imposed but its understandable you dont want kids spamming nigger word in the discord
Replies: >>42206804
Anonymous
5/19/2025, 10:19:54 AM No.42204550
>>42195922
thank you for your service, king
Anonymous
5/19/2025, 10:47:51 AM No.42204573
>>42195922
Cool that you're back. Though it's a bit odd that you say that 15.dev is provided only for non-commercial use, then license the outputs under CC BY-SA 4.0, which explicitly permits commercial use. Shouldn't outputs be licensed under CC BY-NC or BY-NC-SA instead, since that would be in line with your earlier statement that the site is to be used non-commercially?
Anonymous
5/19/2025, 4:34:11 PM No.42204939
>>42195922
ya taking on new voice datasets or only retraining the old ones?
Anonymous
5/19/2025, 5:19:00 PM No.42205033
https://huggingface.co/OuteAI/OuteTTS-1.0-0.6B
Replies: >>42205228
Anonymous
5/19/2025, 7:15:39 PM No.42205228
>>42205033
Their twitter examples are a bit meh sounding, I'm guessing the wow factor comes from the fact that it can work with 14 different languages. Would be really nice to take a voice dataset from a foreign dub and be able to use it for English.
Anonymous
5/19/2025, 8:57:12 PM No.42205387
If you're still lurking, Vul, thank you for making that sfx_sep_v2 filter for vocal remover, this stuff is so bloody helpful in prepping the audio.
Anonymous
5/20/2025, 1:21:09 AM No.42205939
>>42195922
Holy shit, only noticed it now. I don't know what changed for the site to make a comeback, but it's nice to see it again.
Anonymous
5/20/2025, 1:29:53 AM No.42205963
>>42195922
Did a bunch of work with Rarity today, mainly with happy emotion, and notably found that I tended to get better results when I turned the temperature way down, 0.2-0.4. Tried that with the rest of the Mane 6 but Rarity seemed to be the only one to significantly benefit, Twilight and Rainbow in particular still sound "rough" almost all the time no matter what I do.
Even so, Rarity's improvement is significant enough that I'd suggest everyone experiment with adjusting the temperature, there may be an optimal value for each character that I've not found yet.
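For anyone wondering why turning the slider down might help: in the usual sense of the word, temperature divides the model's scores before they become probabilities, so a low value makes the most likely choice dominate and a high value spreads the odds around. How 15's model actually applies the setting isn't something I know, so treat the sketch below as the generic textbook version with made-up scores, not a description of the site.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, sharpened or flattened by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max so exp() cannot overflow
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

scores = [2.0, 1.0, 0.5]  # invented scores for three candidate outputs
for t in (0.3, 1.0):
    probs = softmax_with_temperature(scores, t)
    print(t, [round(p, 3) for p in probs])
```

At 0.3 nearly all of the probability lands on the top candidate, which would fit with lower values giving more consistent, less "rough" takes at the cost of variety.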

Short inputs continue to be a problem, even short sentences that are part of a longer input - reported a bunch of instances of words being mispronounced, weirdly elongated and even skipped entirely.

Also had a few times where the page froze when I switched tabs to do other stuff while waiting for generations to complete.

Could you unlock the quality slider at least in the faster direction? I'm finding generation wait times to be the main bottleneck right now and would like to give that a try. Perhaps also allow larger batch size when faster quality options are selected too.
Anonymous
5/20/2025, 2:28:36 AM No.42206122
au_moondancer_goes_to_a_lot_of_cons_by_pfeffaroo
md5: 3902075c3dba94b4315cc7e428b1b134
>>42195922
>no more emotional contextualiser (the selections are a decent sidegrade I guess but come on it was much cooler)
>still using arpabet despite even resolving the IPA
>AI guesses what I want it to say if it's not in the dictionary instead of just phonemising the words because I know what I want it to say
why

>Moondancer
Bless you, sounds like shit tho
Replies: >>42206425
Anonymous
5/20/2025, 4:36:30 AM No.42206425
>>42206122
https://files.catbox.moe/tu4s0l.mp3

How's this?
Replies: >>42207713
Anonymous
5/20/2025, 8:41:07 AM No.42206804
>>42204536
Fair, but I will never trust someone with a mental illness flag in the bio.
Anonymous
5/20/2025, 1:47:51 PM No.42207158
clone trooper pony pixel
md5: 2e0077d0063ef3aa979833ba2f33c687
Sup, got a sudden inspiration to get the voice of the Clone Wars narrator trained. Not a pony model, but I feel like this could get some good use out of it in future anti clips.
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/Star_Wars_Clone_Wars_Narrator_v2
https://files.catbox.moe/bjljdm.mp3
Not 100% happy with it as the input needs to have that specific "umpf" energy to it.

>https://huggingface.co/Amo/GPT-SoVITS-v2/tree/main/Clone_Wars_Narrator_v2_so96_gpt24
Gpt-Sovits, wavs included.
https://vocaroo.com/1oycsmzwxgVy
https://vocaroo.com/12qbwj4NK8XP
https://vocaroo.com/1fBcauUi9ZIP
Due to the pronunciation script some words sound pretty weird, but nothing a little bit of editing can't fix.
Anonymous
5/20/2025, 2:21:28 PM No.42207220
>>42195922
now all i need to do is figure out how to make ponies moan
Replies: >>42207363 >>42310164
Anonymous
5/20/2025, 2:41:57 PM No.42207243
Open source that shit 15
Anonymous
5/20/2025, 4:22:50 PM No.42207363
>>42207220
One step ahead of you.

https://files.catbox.moe/7wktvb.mp3

All I did was enter "AAAAAAAAAAAAAAAA!" and the moaning just kinda happened.
Replies: >>42251484
Anonymous
5/20/2025, 5:12:57 PM No.42207459
IMG_8305
md5: c79e0e910c81f9542c23de19909c50f4
15, for the love of God, find a volunteer to do your PR. You called a random Hasbro employee pathetic; that is not something you should do if they are inquiring about your service, despite how obnoxious the cocksucking corpo suits are. Being aggressive like that isn't doing anyone any favors.
Replies: >>42207472 >>42207477 >>42207518
Anonymous
5/20/2025, 5:15:01 PM No.42207472
>>42207459
All hasjew employees deserve and should be publicly mocked.
Replies: >>42207479
Anonymous
5/20/2025, 5:17:36 PM No.42207477
>>42207459
>you called a random Hasbro employee pathetic
are you retarded perhaps
Replies: >>42207483
Anonymous
5/20/2025, 5:18:44 PM No.42207479
>>42207472
Yeah I call them retarded niggers off the mic but when youโ€™re face to face with them you shouldnโ€™t let that go out.
Since 15 is a stemfag gook I wasnโ€™t expecting diplomacy and social skills from him but this is actually crazy, no one cares about your inbox.
Replies: >>42207484
Anonymous
5/20/2025, 5:19:45 PM No.42207483
>>42207477
Even if that was a scammer like who the fuck cares nobody cares about your inbox nigga
Replies: >>42207484 >>42207488
Anonymous
5/20/2025, 5:20:17 PM No.42207484
>>42207479
>>42207483
I care though, this is funny and based as fuck
Anonymous
5/20/2025, 5:20:53 PM No.42207488
>>42207483
Repeat 30 more times about how much you don't care.
Replies: >>42207493
Anonymous
5/20/2025, 5:22:32 PM No.42207493
>>42207488
Settle down 15 minion, you have a server to moderate
Replies: >>42207525
Anonymous
5/20/2025, 5:27:20 PM No.42207518
>>42207459
He wasn't even calling Hasbro employees pathetic though? It was some random guy trying to snitch by CC'ing all these people.
Replies: >>42207580
Anonymous
5/20/2025, 5:28:22 PM No.42207525
>>42207493
You're the retard sending the e-mail, got it.
Replies: >>42207527
Anonymous
5/20/2025, 5:29:20 PM No.42207527
>>42207525
Finger pointing like that isnโ€™t healthy tranny
Anonymous
5/20/2025, 5:56:12 PM No.42207576
https://x.com/UnslothAI/status/1924848135991656603
Replies: >>42207705
Anonymous
5/20/2025, 5:58:22 PM No.42207580
>>42207518
This, wtf is anon talking about
Anonymous
5/20/2025, 7:02:07 PM No.42207705
>>42207576
once again, it's all written like the next breakthrough in technology but nobody is posting any examples at all, not even cherry-picked ones.
Anonymous
5/20/2025, 7:07:41 PM No.42207713
>>42206425
Still bad, just compare to any actual Moondancer speaking. I'm not knowledgeable enough to describe exactly how it's wrong, but it's too deep and not "light" enough?
Replies: >>42208791
twiggles !!ofIYxlKABKS
5/21/2025, 12:52:39 AM No.42208577
it's been six fucking years, jesus christ. i still can't believe how big this project got
Replies: >>42208651
Anonymous
5/21/2025, 1:25:28 AM No.42208651
>>42208577
it was dead for a while but only recently started becoming alive again
Replies: >>42208841 >>42208906
Anonymous
5/21/2025, 2:28:26 AM No.42208791
>>42207713
Ah, okay. I thought it was a matter of quality and not the voice itself. But you're right, it's not as light as her in the show...
Anonymous
5/21/2025, 2:42:39 AM No.42208841
46280799
md5: 3ad413f3f3328e706aa9a1f3b105c46e
>>42208651
https://www.youtube.com/watch?v=730zGRwbQuE
Indeed, it has been a bumpy few years, yet in the end the infinite power of ponies will prevail over all hardships.
Anonymous
5/21/2025, 3:06:51 AM No.42208906
>>42208651
It's good to see it getting some steam. This is far too potent to let it fall to pieces.
Anonymous
5/21/2025, 4:24:12 AM No.42209071
Screenshot 2025-05-20 222315
md5: bdac4e8f7409f042472be78da326a2b2
Someone on the server wanted to get 15 to censor the swear words from the site. Say it with me...FUCK no!
Replies: >>42209113 >>42209433 >>42211348
Anonymous
5/21/2025, 4:47:25 AM No.42209113
Get Out
md5: fe31a27c91695e3ba75cd12a6c746202
>>42209071
>Hey everyone look at what some nobody said on my Discord!
No one here cares about social media drama. Keep it in Discord and out of here
Anonymous
5/21/2025, 8:35:00 AM No.42209433
>>42209071
This is why you don't cozy up to Discord groups. They'll try to corrupt you every time.
Replies: >>42209975
Anonymous
5/21/2025, 2:53:29 PM No.42209975
>>42209433
True.
Anonymous
5/21/2025, 8:17:11 PM No.42210605
Up.
Replies: >>42211063 >>42211888
Anonymous
5/21/2025, 11:32:18 PM No.42211063
>>42210605
aaaaaa!
Anonymous
5/22/2025, 2:03:54 AM No.42211348
>>42209071
Gee, what a surprise.
Anonymous
5/22/2025, 7:30:49 AM No.42211888
>>42210605
Replies: >>42212568
Anonymous
5/22/2025, 8:41:54 AM No.42212018
>https://files.catbox.moe/asxfuv.mp3
Anonymous
5/22/2025, 2:57:01 PM No.42212568
>>42211888
Replies: >>42221623
Anonymous
5/22/2025, 11:17:12 PM No.42213584
thisbitch
md5: 7a50adb73f51246bb6520a55c2302ddc
close enough welcome back uberduck discord
Replies: >>42214071
Anonymous
5/23/2025, 2:53:32 AM No.42214071
>>42213584
>uberfuck
No thanks.
Anonymous
5/23/2025, 4:01:39 AM No.42214224
>15 is back
>still dead
It's over
Replies: >>42214240
Anonymous
5/23/2025, 4:09:09 AM No.42214240
675242
md5: 782ad17878a51971d8c2b1bba4d939c7
>>42214224
15.ai isn't really good enough to revive any interest after the novelty of making ponies say nigger wears off.
Replies: >>42214251 >>42214278 >>42214680
Anonymous
5/23/2025, 4:17:58 AM No.42214251
>>42214240
ok goku
Anonymous
5/23/2025, 4:38:35 AM No.42214278
TrixPosting
md5: dbb170ad3cd1566bc62410391014e63d
>>42214240
https://u.pone.rs/OWiJmVGB.mp3
Anonymous
5/23/2025, 10:00:27 AM No.42214680
>>42214240
>Dashcon
Comparing a literal scam to 15 is plain retarded.
ThunderShy
5/23/2025, 10:51:45 AM No.42214718
Hello fags made a new ai skit with 15.ai its good to be back
https://files.catbox.moe/29w2tt.mp4
Replies: >>42215142
Anonymous
5/23/2025, 5:05:18 PM No.42215142
>>42214718
Comedy bros, where are you?
Anonymous
5/23/2025, 10:08:33 PM No.42215735
1376392147815
md5: 9c2bf0889c190c2a1518d848bcf0acbd
https://files.catbox.moe/aov4vh.mp3
Replies: >>42217176
Anonymous
5/24/2025, 6:43:23 AM No.42216763
LiminalTrixieSnipper_1_Temp_2
md5: 466685e74f2d8e445b513ef94b8e0401
>15 service re-emerges
>Typing rapidly ensues
>Old prompt tricks still draw out the mysterious liminal echoes of the mare
These digital equines have the most fascinating voices

Compilation of Liminal Trixie sounds
https://files.catbox.moe/gznbmc.mp3
https://files.catbox.moe/spb6zv.mp4
Replies: >>42217954 >>42224989
Anonymous
5/24/2025, 10:37:20 AM No.42217176
>>42215735
What was that quote? I can't remember where it came from.
Anonymous
5/24/2025, 8:27:36 PM No.42217954
>>42216763
moonbase trixie
Anonymous
5/24/2025, 9:14:38 PM No.42218072
I need more lewd moans. Gasps, sighs, groans, chirps, murmurs, mewlings, etc.
Replies: >>42218629 >>42218755 >>42219570
Anonymous
5/25/2025, 12:40:17 AM No.42218629
>>42218072
I have an audio pack with random moans, give me a few minutes to upload it
Anonymous
5/25/2025, 1:19:09 AM No.42218755
TrixBotVoicingFried
md5: 223c93b201d1482914733f27a56a7ee2
>>42218072
NTA but here's a couple more Liminal Trixie noises.
A few grunts, laughs, even some coughs and various others.
https://files.catbox.moe/q8g80w.mp3
Replies: >>42219017 >>42256762 >>42269579
Anonymous
5/25/2025, 3:10:41 AM No.42219017
>>42218755
I'm surprised no one has done something with that.
ThunderShy
5/25/2025, 4:11:15 AM No.42219092
@hydrusbeta, what happened to the synth app? It's not working. Also, could you add a direct link to it on the Haysay website?
Anonymous
5/25/2025, 9:28:43 AM No.42219523
>Servers down
>twitter account gone
Permission to panic, sir?
Replies: >>42219530
Anonymous
5/25/2025, 9:33:47 AM No.42219530
>>42219523
False alarm, twitter was just fucking itself up again.
Replies: >>42221117
Anonymous
5/25/2025, 10:04:16 AM No.42219570
me and my rule 63 selfs
md5: 3040396968b7ec0f87da58d2980a24b5
>>42218072
https://u.pone.rs/PRpOFwQp.001
>SpecialPacks_.zip.001
https://u.pone.rs/vyoSUmbo.002
>SpecialPacks_.zip.002
https://u.pone.rs/WFVSxEXw.003
>SpecialPacks_.zip.003
https://u.pone.rs/mGedaJTp.004
>SpecialPacks_.zip.004
https://u.pone.rs/uGmobBkJ.005
>SpecialPacks_.zip.005

Rename the downloaded files to the filenames quoted below each link. It's a 2.27GB mix of various sounds from ASMR, hentai games and some other gooning sources. Use RVC to make them pony related.
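
For anons unsure how to put the parts back together, here is a minimal Python sketch. It assumes the five files are plain sequential byte splits (the usual 7-Zip-style `.001` to `.005` volumes), which the post does not confirm; the function name `join_parts` and the output filename are made up for illustration.

```python
from pathlib import Path


def join_parts(part_paths, output_path):
    """Concatenate split-archive parts, in the given order, into one file."""
    with open(output_path, "wb") as out:
        for part in part_paths:
            with open(part, "rb") as src:
                # Copy in 1 MiB blocks so large parts never sit fully in memory.
                while chunk := src.read(1024 * 1024):
                    out.write(chunk)
    return Path(output_path)


if __name__ == "__main__":
    # Hypothetical usage: parts already renamed as quoted in the post.
    parts = [f"SpecialPacks_.zip.{i:03d}" for i in range(1, 6)]
    if all(Path(p).exists() for p in parts):
        join_parts(parts, "SpecialPacks_.zip")
```

If the parts turn out not to be raw byte splits, the joined file will fail to open as a zip, and an archiver that understands multi-volume archives would be the safer route.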
VilligerANON
5/25/2025, 7:33:38 PM No.42220345
I'm preparing to train MLP models with GPT-soVITS v4
Which mare should I start with?

>Yes, I'll add the precomputed values from Haysay, once I make the WebUI.
Replies: >>42220536 >>42220864
Anonymous
5/25/2025, 9:10:43 PM No.42220536
Feral, Pony, Applejack, Queen Chrysalis, hoof wrestling, holding hooves, duo, fi s-2395569027
>>42220345
Applejack is a good baseline to test out accent retention and character similarity. Otherwise, testing more unique voices like Queen Chrysalis would better determine how well the model replicates the intended voice without falling back too much on similar but generic voices.
Replies: >>42221644
Anonymous
5/26/2025, 12:04:44 AM No.42220864
>>42220345
I'd be curious to know what effect the LoRA Rank has on the models, and which one is ideal for what datasets.
Anonymous
5/26/2025, 2:27:38 AM No.42221117
>>42219530
Phew.
Anonymous
5/26/2025, 7:56:47 AM No.42221623
>>42212568
VilligerANON
5/26/2025, 8:09:54 AM No.42221644
>>42220536
Which pretrained English model was it finetuned on?
Anonymous
5/26/2025, 1:46:59 PM No.42222130
>10
Replies: >>42223953
Anonymous
5/26/2025, 4:50:19 PM No.42222460
LyraBooger
md5: c6fef4027b04f745ff3805b9ca89093e
>Prompts various commas and apostrophes to get hidden mare noises.
>Lyra: "Ew, I think it's some sorta booger or something"
Wow, these mares have some fascinating interpretations.

>https://files.catbox.moe/whb1r5.mp4
>https://files.catbox.moe/xqvrlq.wav
Replies: >>42222472 >>42224830
Anonymous
5/26/2025, 4:59:49 PM No.42222472
>>42222460
The interface is really stylish.
Anonymous
5/26/2025, 6:52:43 PM No.42222730
>https://huggingface.co/Amo/GPT-SoVITS-v2/blob/main/TreeHugger_so96_gpt24/wavs.zip
>This file is vulnerable to threat(s) PAIT-ARV-100.
Could somebody with a good-quality antivirus scan this zip and the files inside it? It's probably a false positive, but I want to be sure this won't mess with my PC.
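
While waiting on a proper scan, the archive's contents can at least be listed without extracting anything. The sketch below is only a rough sanity check, not a substitute for an antivirus, and it makes no claim about what the PAIT-ARV-100 flag actually means; `suspicious_members` and the `.wav`-only assumption are hypothetical, based on the file being named `wavs.zip`.

```python
import zipfile


def suspicious_members(zip_path, allowed_suffixes=(".wav",)):
    """Return member names that are not plain audio files or that try to escape the folder."""
    flagged = []
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            name = info.filename
            if info.is_dir():
                continue
            parts = name.replace("\\", "/").split("/")
            # Absolute paths or ".." components could write outside the target folder.
            escapes = name.startswith(("/", "\\")) or ".." in parts
            if escapes or not name.lower().endswith(allowed_suffixes):
                flagged.append(name)
    return flagged
```

An empty list only means the names look like ordinary audio files; it says nothing about what is inside them.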
Anonymous
5/26/2025, 10:17:50 PM No.42223265
https://unmute.sh/

Found this. Apparently they're gonna open source the text and speech models soon, but for now you can supply a ten-second voice clip of anyone you want and speak with them on a variety of topics.
Replies: >>42223566 >>42224786 >>42261401
Anonymous
5/27/2025, 12:10:11 AM No.42223566
>IMS Toucan - tts 7000 Languages
>https://github.com/DigitalPhonetics/IMS-Toucan
>https://huggingface.co/spaces/Flux9665/MassivelyMultilingualTTS
I think this was posted a few years back. I noticed they updated the Hugging Face page about two weeks ago, and after a few minutes of testing it seems to be working. However, while the voice quality is above MS Sam and the noisy TalkNets, the way the TTS talks still feels very artificial.
The voice cloning option seems to be broken, which sucks. However, the fact that it generates voices at light speed and even has built-in options for CPU usage means it could run on potato-tier equipment without problems.
So it's not useful for now, but there's always the possibility that somebody else could take it and improve it (imagine Fluttershy teaching you how to speak moonrunes).
>>42223265
Thank you for sharing that, Anon, and also holy fuck, this works like pure magic. I just gave it a 9s, really low quality audio clip ripped from a game, and it was able to replicate the voice without the shitty de-reverb pollution and background buzzing noise AND keep the accent consistent. On top of that, I was able to double the number of voice lines this character has ever spoken, so that's a massive plus for making artificial datasets.
>apparently they're gonna open source the text and speech models soon
With this kind of tech there wouldn't be a need to train full models, since bare-bones TTS can be done with 10s clips and less than 5m of waiting for the voice to be cloned. Man, I remember way back in mid-2020 when people talked about this tech and pretty much everybody agreed that cloning voices from 10s of audio would never sound natural or even good. How times have changed.
Anonymous
5/27/2025, 2:29:42 AM No.42223953
>>42222130
Replies: >>42224543
Anonymous
5/27/2025, 7:49:33 AM No.42224543
>>42223953
Anonymous
5/27/2025, 10:58:37 AM No.42224786
>>42223265
I tried to see if it could recreate the voice from 3s of Woona audio, but sadly that was a no-go (I even tried duplicating the clip to pad it out to 10s). I'm guessing the high-pitched distress is messing with their process, or they need a minimum of 6s of audio to work out how to duplicate it.
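
For reference, the "duplicate the clip until it reaches 10s" step can be sketched with Python's standard `wave` module. This assumes an uncompressed PCM WAV as input; `loop_wav` and its parameters are illustrative names, not part of any tool mentioned in the thread.

```python
import math
import wave


def loop_wav(src_path, dst_path, target_seconds=10.0):
    """Repeat a short WAV clip until it is at least target_seconds long."""
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames = src.readframes(src.getnframes())
        clip_seconds = src.getnframes() / src.getframerate()
    if clip_seconds <= 0:
        raise ValueError("source clip is empty")
    # Whole repeats only, so the clip is never cut mid-word.
    repeats = math.ceil(target_seconds / clip_seconds)
    with wave.open(dst_path, "wb") as dst:
        dst.setparams(params)
        dst.writeframes(frames * repeats)
    return repeats * clip_seconds
```

Repetition adds length but no new speech material, which may be part of why padding did not help here.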
Anonymous
5/27/2025, 11:33:22 AM No.42224830
>>42222460
This is what I've got instead. I really dig the giggle in the first one.
https://vocaroo.com/154R3gQLRpG1
https://vocaroo.com/1eOcqD52A2pm
Replies: >>42224962
Anonymous
5/27/2025, 2:25:52 PM No.42224962
>>42224830
>https://files.catbox.moe/etzhiu.mp4
>Chrysalis: "(forceful exhales x3), We should take the magic inside it. You know how powerful Discord was."
Guess with limited-to-no other speech input, it does fall back a lot on the Reference Text as seen in the Advanced Model Details. No wonder so many Trixie attempts had her mumbling about a good night's sleep. Less random than initially suspected.

I wonder how the model would behave if we were able to remove or modify the underlying quote(s) during synthesis, though I'm sure it's likely integral to retaining its accuracy. Come to think of it though, it would be nice to be able to select specifically what underlying reference line it's using prior to generation so that you have more chances of getting a desirable output similar to it. Could mean less resource usage too.
Anonymous
5/27/2025, 3:03:00 PM No.42224989
>>42216763
what tricks did you use?
Replies: >>42225760
Anonymous
5/27/2025, 4:04:47 PM No.42225053
>https://github.com/PasiKoodaa/ACE-Step-RADIO
I stumbled upon the above GitHub project. It uses the ACE-Step music model to create a constant stream of AI music, replicating what online radio websites do; it requires 16GB of VRAM. The outputs are still on the so-so level, but given that text-to-song models are only about a year old, there is plenty of room for improvement. Also, I would love to see a setup where these models sing with proper poni voices from the get-go (or with help from LoRAs).
Replies: >>42225760
Anonymous
5/27/2025, 6:29:38 PM No.42225301
>Stable Audio Open Small
>Weights: https://huggingface.co/stabilityai/stable-audio-open-small
>Paper: https://arxiv.org/abs/2505.08175
>Arm learning path: https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/run-stable-audio-open-small-with-lite-rt
Huh, a model that's only around 2GB? Nice to see them notice that not everybody has an endless bag of cash to spend on the newest and largest GPU. Sadly it still only outputs instrumentals at lower-tier quality (at least in comparison to what's already out there).
Apparently it can run 30% faster than realtime.
Replies: >>42225760
Anonymous
5/27/2025, 10:48:45 PM No.42225760
TwiggyLewdMareSounds
md5: 574a3a4c6ff8450dd22806ec5848aabe
>>42224989
Mostly the aforementioned ,',' trick, which in older pre- "dev" versions of 15 used to be able to do a lot more lewd noises and such. Used to have a text doc with a handful of other tricks used with it, but it must be on one of my older OS drives. Still serves to force further areas of silence, which in turn can allow hallucinations and other AI weirdness to creep in on purpose.
>>42225053
>16GBs Vram
Still seems out of the memory budget of most anons, unless it could be optimized to at least half that with minimal loss. Even if it were finetuned on mares, without optimization I can't imagine many being able to utilize it for synthesis.
>>42225301
>Very small model
>Lower quality
To be expected I suppose, but at least it's something usable for local synthesis and playing around with, aside from maybe Bark; which I should honestly revisit. Just a shame they completely abandoned it after becoming monetized in the form of Suno. Still open source like Stable Audio is however.
Anonymous
5/28/2025, 1:58:06 AM No.42226198
Up.
Replies: >>42227813
Anonymous
5/28/2025, 8:13:51 AM No.42226927
mares
Anonymous
5/28/2025, 11:56:43 AM No.42227190
LewdCyberTwi
md5: bcb5be07e2ef297d4f6dc3e823caf2bd
rears
Anonymous
5/28/2025, 3:06:12 PM No.42227393
3419373
md5: 5b5b3973f7950765af4274693473083c
https://u.pone.rs/pBgJHLQr.wav
Anonymous
5/28/2025, 7:37:50 PM No.42227813
>>42226198
Replies: >>42229510
Anonymous
5/28/2025, 10:00:03 PM No.42228108
Claims to do SOTA zero-shot voice cloning TTS with powerful control
https://github.com/resemble-ai/chatterbox
Replies: >>42228215 >>42228246
Anonymous
5/28/2025, 10:55:42 PM No.42228215
>>42228108
From a 20s voice sample: https://litter.catbox.moe/w54fxs.wav
Anonymous
5/28/2025, 11:08:14 PM No.42228246
>>42228108
I've tested it with a few voices. It handles some without any problems but totally struggles with others (seems to depend on how far the accent/pronunciation deviates from the standard way of speaking). Sadly, I can confirm that this model is also unable to clone Woona's voice.
Anonymous
5/29/2025, 9:37:20 AM No.42229510
>>42227813
Replies: >>42232233
Anonymous
5/29/2025, 2:17:11 PM No.42229871
Music Source Restoration
https://arxiv.org/abs/2505.21827
>We introduce Music Source Restoration (MSR), a novel task addressing the gap between idealized source separation and real-world music production. Current Music Source Separation (MSS) approaches assume mixtures are simple sums of sources, ignoring signal degradations employed during music production like equalization, compression, and reverb. MSR models mixtures as degraded sums of individually degraded sources, with the goal of recovering original, undegraded signals. Due to the lack of data for MSR, we present RawStems, a dataset annotation of 578 songs with unprocessed source signals organized into 8 primary and 17 secondary instrument groups, totaling 354.13 hours. To the best of our knowledge, RawStems is the first dataset that contains unprocessed music stems with hierarchical categories. We consider spectral filtering, dynamic range compression, harmonic distortion, reverb and lossy codec as possible degradations, and establish U-Former as a baseline method, demonstrating the feasibility of MSR on our dataset. We release the RawStems dataset annotations, degradation simulation pipeline, training code and pre-trained models to be publicly available.
https://github.com/yongyizang/music_source_restoration
https://huggingface.co/datasets/yongyizang/RawStems
https://huggingface.co/yongyizang/MSR_UFormers
GitHub repo isn't live yet. Might be cool for audio stuff.
Replies: >>42230283
Anonymous
5/29/2025, 5:52:02 PM No.42230283
>>42229871
This could be pretty useful in combination with the ACE Step song converter. If a song can have the vocals separated and the instrumentals split into their own tracks, I would imagine that would help with modifying it into a different style of music.
At the very least it would be nice to use it to fix the weird effects that vocal removing programs are imprinting on the instrumental files.
Anonymous
5/30/2025, 12:47:11 AM No.42231332
ten
Replies: >>42232530
Anonymous
5/30/2025, 7:44:17 AM No.42232233
>>42229510
Anonymous
5/30/2025, 12:19:45 PM No.42232530
>>42231332
Replies: >>42236397
Anonymous
5/30/2025, 6:46:58 PM No.42233126
stupid 1728589750923813
md5: 91e8fb35f368033e01c314f7bb093ba5
>https://u.pone.rs/beZAfsQC.mp3
motivational Trixie
Anonymous
5/31/2025, 6:17:19 AM No.42234667
Saved
Replies: >>42235711
Anonymous
5/31/2025, 6:04:11 PM No.42235711
>>42234667
Precautionary bump.
Anonymous
5/31/2025, 11:35:08 PM No.42236397
>>42232530
Replies: >>42237262
Anonymous
6/1/2025, 7:15:12 AM No.42237262
>>42236397
Replies: >>42238294
Anonymous
6/1/2025, 6:08:21 PM No.42238294
Again
md5: f2b43efee3f199b15e922a8107752034
>>42237262
Replies: >>42238306
Anonymous
6/1/2025, 6:12:03 PM No.42238306
SNIFF 3
md5: 7ac816ffb99277ce339072300cc6df73
>>42238294
Replies: >>42239361
Anonymous
6/2/2025, 1:37:48 AM No.42239361
>>42238306
Anonymous
6/2/2025, 2:06:26 AM No.42239410
SpikeWoo
md5: 13af6b260915f8776eac5339c2c4832e
Well, the twelve hours after 15 returned was fun I guess. Now back to this bullshit.
Replies: >>42240013 >>42240318
Anonymous
6/2/2025, 6:43:46 AM No.42240013
>>42239410
He's gunna hurl if he keeps that up.
Anonymous
6/2/2025, 10:23:30 AM No.42240318
>>42239410
Which one?
Replies: >>42240322
Anonymous
6/2/2025, 10:24:46 AM No.42240322
>>42240318
The bumping kind.
Replies: >>42240779
Anonymous
6/2/2025, 3:51:40 PM No.42240779
>>42240322
The bumping loyal
Replies: >>42241436
Anonymous
6/2/2025, 8:35:33 PM No.42241436
>>42240779
Let me bump the thread of my people.
Replies: >>42243302
Anonymous
6/3/2025, 2:37:54 PM No.42243302
Pony, my little pony, female, cute, original character, OC, fan character, _Bump s-4209552926
>>42241436
>>42241979
Replies: >>42245664 >>42248445
Anonymous
6/3/2025, 8:45:51 PM No.42243816
https://openaudio.com/blogs/s1
The .5b mini version will be open sourced
Replies: >>42244249 >>42244607 >>42247951
Anonymous
6/4/2025, 12:07:45 AM No.42244249
>>42243816
Hmm, it would be nice if there was a demo WITHOUT music; I assume they put it in to hide the lower quality. But at .5B this thing should technically be able to run in a phone-sized environment, so that's neat.
Replies: >>42244607
Anonymous
6/4/2025, 2:45:30 AM No.42244607
>>42243816
>>42244249
Neat indeed, but it's a shame they don't have any audio examples of either version (on that page at least). Hard to really get a feel of it when there's nothing to gauge or judge.
Anonymous
6/4/2025, 2:39:14 PM No.42245664
>>42243302
Indeed.
Anonymous
6/5/2025, 6:02:44 AM No.42247180
Scootaloo Scoot-Scootaloo.
Replies: >>42247653
Anonymous
6/5/2025, 12:53:33 PM No.42247653
>>42247180
Someone said chicken?
Anonymous
6/5/2025, 5:17:38 PM No.42247951
>>42243816
https://huggingface.co/fishaudio/openaudio-s1-mini
Replies: >>42247952 >>42250902
Anonymous
6/5/2025, 5:18:38 PM No.42247952
>>42247951
OpenAudio S1 supports a variety of emotional, tone, and special markers to enhance speech synthesis:

1. Emotional markers: (angry) (sad) (disdainful) (excited) (surprised) (satisfied) (unhappy) (anxious) (hysterical) (delighted) (scared) (worried) (indifferent) (upset) (impatient) (nervous) (guilty) (scornful) (frustrated) (depressed) (panicked) (furious) (empathetic) (embarrassed) (reluctant) (disgusted) (keen) (moved) (proud) (relaxed) (grateful) (confident) (interested) (curious) (confused) (joyful) (disapproving) (negative) (denying) (astonished) (serious) (sarcastic) (conciliative) (comforting) (sincere) (sneering) (hesitating) (yielding) (painful) (awkward) (amused)

2. Tone markers: (in a hurry tone) (shouting) (screaming) (whispering) (soft tone)

3. Special markers: (laughing) (chuckling) (sobbing) (crying loudly) (sighing) (panting) (groaning) (crowd laughing) (background laughter) (audience laughing)
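
A small helper for building tagged prompts from the lists above might look like the sketch below. The marker names are copied from this post; the placement of one marker in parentheses at the start of the text follows what anons report later in the thread and has not been checked against the model's own documentation. `tag_text` is a hypothetical name.

```python
EMOTION_MARKERS = set("""
angry sad disdainful excited surprised satisfied unhappy anxious hysterical
delighted scared worried indifferent upset impatient nervous guilty scornful
frustrated depressed panicked furious empathetic embarrassed reluctant
disgusted keen moved proud relaxed grateful confident interested curious
confused joyful disapproving negative denying astonished serious sarcastic
conciliative comforting sincere sneering hesitating yielding painful awkward
amused
""".split())

TONE_MARKERS = {"in a hurry tone", "shouting", "screaming", "whispering", "soft tone"}

SPECIAL_MARKERS = {
    "laughing", "chuckling", "sobbing", "crying loudly", "sighing", "panting",
    "groaning", "crowd laughing", "background laughter", "audience laughing",
}

ALL_MARKERS = EMOTION_MARKERS | TONE_MARKERS | SPECIAL_MARKERS


def tag_text(text, marker):
    """Prefix text with one known marker in parentheses; reject unknown markers."""
    cleaned = marker.strip().strip("()").lower()
    if cleaned not in ALL_MARKERS:
        raise ValueError(f"unknown marker: {marker!r}")
    return f"({cleaned}) {text}"
```

Rejecting unknown markers up front avoids wasting a generation on a tag the model would just read aloud as a word.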
Replies: >>42248049
Anonymous
6/5/2025, 6:34:34 PM No.42248049
>>42247952
>Emotional markers
Interesting, hopefully there will be a decent UI and training for it
Anonymous
6/5/2025, 9:52:04 PM No.42248445
ArtificialBumpMare_ce2_123
md5: 34de9c48b0bce48b62a464eaccc68ceb
>>42243302
Anonymous
6/6/2025, 12:46:16 AM No.42248854
Bump
md5: 1713a6f02618929fa122fb5c07ac12b7
Replies: >>42249714
Anonymous
6/6/2025, 4:35:28 AM No.42249353
ArtificialBumpMare_me_125
md5: 8430a921ac9c187bf1edc2bef992dca5
Replies: >>42250054 >>42253016
Anonymous
6/6/2025, 8:37:14 AM No.42249714
>>42248854
>bump rump
Would pump.
Anonymous
6/6/2025, 12:50:13 PM No.42250054
>>42249353
Pretty bump mare. Totally would.
Anonymous
6/6/2025, 1:30:18 PM No.42250109
>15 crawls back to bait patreon donos with his half-baked model where most emotion choices result in unintelligible noise
>11 releases a new alpha that wipes the floor with his crusty garbage less than a month later
https://elevenlabs.io/v3
holy fucking kek! maybe there is a god.
Replies: >>42250437 >>42250547 >>42251801 >>42253112 >>42254373
Anonymous
6/6/2025, 5:05:47 PM No.42250437
>>42250109
yeah but unlike fifteen, eleven labs cost money
Anonymous
6/6/2025, 6:12:01 PM No.42250547
>>42250109
? elevenlabs doesn't have ponies, how is this a comparison
Replies: >>42250592
Anonymous
6/6/2025, 6:17:19 PM No.42250558
Remember not to give goku the attention he wants
Anonymous
6/6/2025, 6:28:00 PM No.42250592
>>42250547
you have to train your own models on there you retard mcspazatron
Replies: >>42250593
Anonymous
6/6/2025, 6:28:24 PM No.42250593
>>42250592
yeah is it any good though, last I tried to train ponies it wasn't very good
Anonymous
6/6/2025, 8:41:50 PM No.42250902
>>42247951
Has anybody had a chance to test this thing out? Due to bullshit reasons I'm kind of stuck phone posting, but I do want to know if it's any good.
Replies: >>42260216
Anonymous
6/6/2025, 8:57:52 PM No.42250941
https://github.com/RVC-Boss/GPT-SoVITS/releases/tag/20250606v2pro
https://github.com/RVC-Boss/GPT-SoVITS/wiki/GPT%E2%80%90SoVITS%E2%80%90features-(%E5%90%84%E7%89%88%E6%9C%AC%E7%89%B9%E6%80%A7)
Replies: >>42251141
Anonymous
6/6/2025, 10:21:40 PM No.42251141
>>42250941
>for 50 nvidia series
so wait, are the new models exclusive to the 50 series or just optimized for use on that hardware?
Anonymous
6/6/2025, 11:56:10 PM No.42251484
>>42207363
I tried that and all it did was make Rarity do pokemon noises.
https://files.catbox.moe/1ryvaz.wav
https://files.catbox.moe/qivs4r.wav
Also, sometimes the AI interpretation (wish we could turn that off) says "Triple A" https://files.catbox.moe/72xgzt.wav
Anonymous
6/7/2025, 1:39:10 AM No.42251801
>>42250109
>elevenfags
Miss me with that shit.
Anonymous
6/7/2025, 6:13:28 AM No.42252543
ArtificialBumpMare_111
md5: 2216c37c522bb72c79f03122a3b0f454
Anonymous
6/7/2025, 7:09:37 AM No.42252595
the one
md5: 724b017f1d96120c767aa45cb66b1b15
I found some free audio processing plugins, I'll be loading these in (((audacity))) to auto-process my dataset. I haven't tried it yet, but it seems promising, like a publicly released version of izotope:
https://archive.org/details/accusonus-era-bundle-v-6.2.00
They made it public before going out of business. I might reply to the anchor if it gives good results.
Replies: >>42253243
Anonymous
6/7/2025, 11:51:17 AM No.42253016
>>42249353
Anonymous
6/7/2025, 12:58:06 PM No.42253112
>>42250109
I wonder (((who))) could be behind this post.
Anonymous
6/7/2025, 2:29:42 PM No.42253243
>>42252595
Interesting, could you post some examples here?
Replies: >>42287174
Anonymous
6/7/2025, 10:11:36 PM No.42254319
Mares?
Anonymous
6/7/2025, 10:38:01 PM No.42254373
>>42250109
gptsovits wipes the floor with 15 shitty model already, no need to bring the big guns
Replies: >>42254415
Anonymous
6/7/2025, 11:03:14 PM No.42254415
>>42254373
stop samefagging, your broken english is too noticeable at this point
Replies: >>42254417
Anonymous
6/7/2025, 11:04:46 PM No.42254417
>>42254415
You wish I was samefagging retard
Replies: >>42254418
Anonymous
6/7/2025, 11:05:04 PM No.42254418
>>42254417
hahahah
Anonymous
6/8/2025, 5:37:30 AM No.42255207
Electric mares?
Anonymous
6/8/2025, 6:19:11 AM No.42255248
43/64 on pl_marewater
Anonymous
6/8/2025, 10:08:16 PM No.42256668
For characters with lots of voice lines like Spike and Twilight, if I'm using my own voice, what's the best option to choose on Haysay to sound good?
Replies: >>42256717
Anonymous
6/8/2025, 10:27:51 PM No.42256717
>>42256668
RVC is the current gold standard as far as Haysay goes for speech-to-speech.
Replies: >>42256739
Anonymous
6/8/2025, 10:37:41 PM No.42256739
>>42256717
It's not quite getting the intended result. Should I set voice envelope high or low? https://voca.ro/1iHl7ZMvk5Qm
Anonymous
6/8/2025, 10:52:17 PM No.42256762
>>42218755
What settings did you use here? Sounds pretty good.
Replies: >>42256785 >>42256785 >>42256871
Anonymous
6/8/2025, 11:02:10 PM No.42256785
>>42256762
If you're trying to get non-vocals out of the voice-to-voice, it's not gonna work great.
>>42256762
Those were generated with 15.ai, probably the best option if you don't need voice to voice functionality and just want lewd pony noises.
Replies: >>42256871
Anonymous
6/8/2025, 11:46:17 PM No.42256871
Liminal Mare Code
md5: 109e2ee83869b9903862a36e7571b286
>>42256762
>>42256785
Mostly default settings. Varying the temperature occasionally. Liminal mares also make all sorts of noises, not just lewd. I can easily imagine them being used as vocal SFX for pony videogames or something; maybe an episode or animation where a mare drips onto the ground and the grunt is entirely synthetic and not recycled audio from the show.

https://files.catbox.moe/7rx7zi.mp3
Replies: >>42257202
Anonymous
6/9/2025, 2:50:49 AM No.42257202
>>42256871
>https://files.catbox.moe/7rx7zi.mp3
These sound like Trixie is doing Link moves.
Replies: >>42257362 >>42257426 >>42257521
Anonymous
6/9/2025, 4:28:57 AM No.42257362
>>42257202
Abstract mare sounds are abstract. Sadly RVC is still the king of getting quality lewd sounds, but I still wish we had a nice TTS alternative.
Replies: >>42257428
Anonymous
6/9/2025, 5:08:28 AM No.42257426
>>42257202
Huh, yeah, this really makes me want to work on my 3D modelling again... although Godot's 3D capabilities are still not great.
Replies: >>42257521
Anonymous
6/9/2025, 5:09:50 AM No.42257428
>>42257362
Is there a place I can upload multiple audio files for easy playback? I wanted to show off what I managed with the TTS on haysay.
Replies: >>42257432
Anonymous
6/9/2025, 5:12:47 AM No.42257432
>>42257428
pone.rs
Replies: >>42257437
Anonymous
6/9/2025, 5:14:23 AM No.42257437
>>42257432
Thanks. Too bad it doesn't stream playback....

https://u.pone.rs/reZpBwHV.wav (Twilight)
https://u.pone.rs/cBNqloOa.flac (Spike)
Replies: >>42257521
Anonymous
6/9/2025, 6:12:23 AM No.42257521
TrixieHyut
md5: ae0db033362ca494ef3f854533711465
>>42257202
Could totally imagine a game with Trixie acting as the hero of Hyrule.
>>42257426
Damn, haven't heard Godot in a hot minute. I really need to find time and motivation to actually get into that myself. Keep telling myself that though. Sadly free time and hobbies don't pay bills.
>>42257437
>doesn't stream playback
You mean like, play it in a browser? Because usually mp3 is supported in that way.
Replies: >>42258274
Anonymous
6/9/2025, 11:41:43 AM No.42257919
Up.
Anonymous
6/9/2025, 4:47:41 PM No.42258274
>>42257521
Yeah, I know what you mean, though I'd say getting those skills can be valuable. Personally, I wish I didn't mentally check out of a tutorial after like 30 minutes because most of them need a good hour or more to really get into the meat of it, and even taking notes, it feels like I'm not retaining it well.
Replies: >>42258298
Anonymous
6/9/2025, 5:03:29 PM No.42258298
>>42258274
I would recommend the YT channel TheRoyalSkies. All his videos (with some rare exceptions) are between one and five minutes long, always getting to the point instead of flapping about some bullshit and settings. The only downside is that they are usually aimed at people who are already a little bit above total 0xp noobie beginners, but it's still good stuff.
Replies: >>42258349
Anonymous
6/9/2025, 5:24:43 PM No.42258349
>>42258298
Oh, they have Cascadeur videos. I was wondering if that was usable with quadrupeds too...
Replies: >>42258805
Anonymous
6/9/2025, 9:15:49 PM No.42258805
>>42258349
Never used that addon/function, but I would imagine anything that is not a humanoid with the standard two arms and legs will require lots of custom rigging.
Anonymous
6/10/2025, 9:00:12 AM No.42260216
>>42250902
Thread tourist here, it's breddy gud for being local. I've been running it on a 3060 with no issue, takes about twice as long as real time but the 44.1kHz fidelity is incredible. Also the voice cloning accepts up to 90 seconds of input, with possibly more but I have yet to test that.
My main criticism is that for longer gens upward of a minute or more, the voice gets kinda washed out in a way, but you can easily circumvent that by just splitting your text into chunks.
Here's some examples I genned:
Cum Zone guy quoting Ozymandias (my favorite gen, nearly indistinguishable from real VA) https://vocaroo.com/1ngXhfejJwoB
Gilbert Gottfried navy seals (you can hear the voice getting washed out towards the end) https://vocaroo.com/1n6SZbrHzKZ1
Michael Rosen pulp fiction (it can mispronounce capitalized words, storage is pronounced as sturgeon) https://vocaroo.com/1ov76WqTjIUY
I'd say it's elevenlabs-tier, even if that comparison is now outdated because of their new model.
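
The chunking workaround mentioned above can be sketched in a few lines of Python. This is a rough illustration only: `chunk_text` and the 400-character limit are made-up values, and the sentence split is a simple punctuation-based regex rather than anything the model itself requires. Each chunk would then be generated separately and the audio joined afterwards.

```python
import re


def chunk_text(text, max_chars=400):
    """Group whole sentences into chunks of roughly max_chars characters."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        # Start a new chunk when adding this sentence would exceed the limit.
        if current and len(current) + 1 + len(sentence) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}" if current else sentence
    if current:
        chunks.append(current)
    return chunks
```

A single sentence longer than `max_chars` is kept whole rather than cut mid-sentence, so the limit is a target, not a hard cap.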
Replies: >>42260489
Anonymous
6/10/2025, 2:00:15 PM No.42260489
>>42260216
For a zero-shot model it's surprisingly decent. In their GitHub, do they provide a UI with emotional control, or is it just the bare minimum of "audio reference in, TTS out"?
Replies: >>42261547
Anonymous
6/10/2025, 6:53:57 PM No.42260920
https://github.com/fluxions-ai/vui
https://huggingface.co/fluxions/vui
has voice cloning ability
>You can clone with the base model quite well but it's not perfect as hasn't seen that much audio / wasn't trained for long
Anonymous
6/10/2025, 9:03:47 PM No.42261160
d9tnkeekgos71
md5: 3c6e86745a77a1cc3e24b3b855af16d2
What's the best tts for mares? I know elevenlabs is the best overall but I'm wondering how good it is for ponies
Replies: >>42261401 >>42262716 >>42264548
Anonymous
6/10/2025, 11:49:15 PM No.42261401
>>42261160
For local operation, it's still GPT-SoVITS. I don't use paid online services so lmao on that one.
>>42223265
But I guess this one could beat it, once they make it public. Having their TTS model integrated with SillyTavern would honestly kick some serious ass.
Anonymous
6/11/2025, 1:13:52 AM No.42261547
file
md5: 317a28c6f60aebf372227a3aa1a41a6d
>>42260489
There's emotion control to a degree, you just put one of the tags in parentheses at the start. There's only a limited amount of valid tags and it can only go so far, and I haven't personally been able to use multiple in a single gen since it just says the word but YMMV
Replies: >>42261622
Anonymous
6/11/2025, 2:03:45 AM No.42261622
>>42261547
>only one emotional tag control
oh, this sucks donkey balls. I was hoping we could finally have a model that can do advanced sentence styles, e.g. whispering with a mix of anger and confusion.
Replies: >>42261832 >>42261949
Anonymous
6/11/2025, 4:03:30 AM No.42261832
>>42261622
Yeah, honestly sounds like a convoluted way to say they have multiple individual models compounded, each trained on one particular emotion, and it uses the parentheses to determine which underlying model to use for synthesis.
Replies: >>42261949
Anonymous
6/11/2025, 4:58:02 AM No.42261949
>>42261622
>>42261832
Well like I said, your mileage may vary. I haven't been experimenting with it nearly as much as I should, and it could very well support that. I saw an example somewhere else of Pearl from SU reading the best thing about meatballs meme and the voice there was pretty varied emotionally and realistic. To be fair, they might have been using the full model which is only available through their website, but I wouldn't knock it before trying it on the smaller model. Using my GPU for other purposes at the moment so someone else will have to test.
Anonymous
6/11/2025, 10:55:35 AM No.42262326
ArtificialBumpMare_123
md5: d282192d6d0219da5a44ae6b2b3b13b8
Replies: >>42263067
Anonymous
6/11/2025, 5:30:57 PM No.42262716
9bf5881b0384b11f9b64140f99bc0801
md5: e2c8d9f07672a61c4ac3e61276341b76
>>42261160
Is there some kind of library with voice clips I can use to make pony models in ElevenLabs?
Replies: >>42262734
Anonymous
6/11/2025, 5:37:58 PM No.42262734
>>42262716
Mega links in OP?
Replies: >>42272674
Anonymous
6/11/2025, 8:21:54 PM No.42263067
>>42262326
Cute bump mare.
Anonymous
6/12/2025, 4:22:12 AM No.42264147
ArtificialBumpMare_ce_124
md5: a0fcf5a4eff39dec2f5c7981c4ee2c13
>10
Anonymous
6/12/2025, 9:25:44 AM No.42264531
>slow night bump
Anonymous
6/12/2025, 9:48:48 AM No.42264548
>>42261160
https://15.dev/
Replies: >>42265214
Anonymous
6/12/2025, 12:21:49 PM No.42264821
>>42196683
what???
Anonymous
6/12/2025, 1:53:40 PM No.42265013
bump due to too much spam on the board
Anonymous
6/12/2025, 2:12:52 PM No.42265042
Is openaudio s1 the best thing right now? I copied random text from a mod page. The pronunciation is pretty good, although imo a little too neutral.
Replies: >>42265052
Anonymous
6/12/2025, 2:17:47 PM No.42265052
>>42265042
Audio quality seems the best, and pronunciation is really good as long as it's not a weird made-up word. Emotions are pretty meh.
https://vocaroo.com/1l7fRlI0qtqn
Anonymous
6/12/2025, 3:43:55 PM No.42265214
>>42264548
No trolls please
Anonymous
6/12/2025, 6:49:45 PM No.42265590
https://x.com/elevenlabsio/status/1933188969279500459
Anonymous
6/12/2025, 9:48:32 PM No.42266061
preserved
Replies: >>42266620
Anonymous
6/13/2025, 12:55:58 AM No.42266620
>>42266061
Anonymous
6/13/2025, 2:10:47 AM No.42266751
preservation bump
Anonymous
6/13/2025, 5:52:16 AM No.42267158
ArtificialBumpMare_112
md5: bb446d93dec8e72086d189ff3f736415
Replies: >>42267570
Anonymous
6/13/2025, 12:14:01 PM No.42267570
>>42267158
Anonymous
6/13/2025, 6:00:45 PM No.42268023
>mared
Anonymous
6/14/2025, 1:05:28 AM No.42268924
Up.
Anonymous
6/14/2025, 1:15:33 AM No.42268941
This is starting to get sad...
Replies: >>42268985
Anonymous
6/14/2025, 1:36:47 AM No.42268985
>>42268941
I only have one GPU, and it's already too outdated for all this new tech. I already had to throw away a few ideas for song covers because random song leakage / dual vocals were fucking with the conversion process.
Anonymous
6/14/2025, 7:30:57 AM No.42269579
>>42218755
>pukes at the end
Anonymous
6/14/2025, 10:06:46 AM No.42269737
How do we save /ppp/?
Replies: >>42269957
Anonymous
6/14/2025, 2:06:02 PM No.42269957
breaking bad pony 1665957871920475
md5: d59ca0ef4c3ec690a51e303c6a8466bb
>>42269737
There is only one thing we can do: we cook... I mean, we make pony content. I was thinking of doing an "X pony reviews fics/books" series with a theme/feel similar to Rainbow Dash Presents.
Replies: >>42272090
Anonymous
6/14/2025, 2:42:29 PM No.42269988
REDUB 7!!!!!!!!!!!!!!!!!
Anonymous
6/14/2025, 3:25:01 PM No.42270046
With SparkTTS, voices can be cloned from just a few seconds of audio. This allows the cloning of background characters like Twinkleshine. What I like to do is feed AI-generated voices into ElevenLabs in order to get a higher quality model. Love what you guys are doing!
Anonymous
6/14/2025, 7:14:37 PM No.42270522
>bump
Anonymous
6/14/2025, 8:52:57 PM No.42270729
Anyone else here that thinks about the possibilities of AGI pretty consistently?
I don't know exactly how much overlap there is between this corner of the fandom and technological singularity enthusiasts.
Replies: >>42271120 >>42271473
Anonymous
6/14/2025, 11:47:16 PM No.42271120
itknows
md5: 1fc2a9382a8e4b5c6ffb06cfcb6ebbd4
>>42270729
I'm always dreaming of a Bicentennial Man level of AGI. Just another race of sentient beings, but they're robots! But I have no idea if we'd ever reach a singularity event, or, even if we do, what the true possibilities would be.
Anonymous
6/15/2025, 2:10:28 AM No.42271473
>>42270729
In my unprofessional opinion, we don't currently have the tech and materials to make something that would work as proper AGI. At best we'll get more polished versions of LLMs that are so good at pretending to sound like people that it will be next to impossible to distinguish them from people. I do think people in the next century will make some new type of processor/programming/something else that could make computers think and feel for real, but by that time the world and society will have changed so much that there isn't even a point in guessing what it would look like (just like trying to explain the wonders of tech from the ancient Roman Empire to a caveman).
Anonymous
6/15/2025, 8:23:43 AM No.42272090
>>42269957
This. You must use the pone to save the pone
Anonymous
6/15/2025, 4:58:20 PM No.42272674
1840240
md5: 178e0ffb91b805517a6dc0b087b7f5b2
>>42262734
I've tried to use the audio clips but my models sound like shit. Does anyone have some pre-made audio clips for ElevenLabs that have worked well for them?
Replies: >>42272771 >>42272932 >>42275950
Anonymous
6/15/2025, 6:01:16 PM No.42272771
>>42272674
>models sound like shit
no idea what script you are using, but everyone and every company that has pony voice conversion/TTS is using the exact same clips from PPP.
If you are using some of the new experimental cloning scripts, these will require 10s clips, so if you give them just a 3s clip the result will sound like shit.
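A minimal sketch of checking clip length before feeding a cloning script, assuming PCM .wav clips and treating the 10s figure above as the cutoff (the real requirement depends on the script). The names `clip_seconds` and `long_enough` are made up for illustration:

```python
import wave
from pathlib import Path

# Assumption: cutoff taken from the "10s clips" figure mentioned above.
MIN_SECONDS = 10.0


def clip_seconds(path):
    """Return the duration of a PCM .wav file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def long_enough(folder, min_seconds=MIN_SECONDS):
    """List the .wav files in `folder` that last at least `min_seconds`."""
    clips = Path(folder).glob("*.wav")
    return sorted(p for p in clips if clip_seconds(p) >= min_seconds)
```

Running `long_enough("clips")` on a folder of downloaded clips would give the ones worth trying as reference audio; shorter clips could be joined or skipped.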
Anonymous
6/15/2025, 7:27:04 PM No.42272932
>>42272674
>ElevenLabs
>Models sound like shit
So nothing new then
Replies: >>42273853
Anonymous
6/15/2025, 11:17:12 PM No.42273482
nein!
Anonymous
6/16/2025, 1:56:30 AM No.42273853
moondancer 1676304366778324
md5: ae58df1cc443667ca969eba6362f7c4d
>>42272932
>https://u.pone.rs/LvFcybeH.mp3
surprise horsefuckers, I got some spare time and converted a song from my buddy to Moon Dancer vocals, enjoy.
OG song: https://suno.com/song/eae162d0-cbbb-433a-8008-5fab7bee01ba
Replies: >>42274821
Anonymous
6/16/2025, 8:33:20 AM No.42274484
Bump.
Replies: >>42275868 >>42276957
Anonymous
6/16/2025, 2:55:46 PM No.42274821
>>42273853
Nice pop song.
Anonymous
6/16/2025, 4:46:02 PM No.42274933
1722816409565057
md5: be35cafd11e2d370634bbb6544f0e481
>>41070370
Is there a chance anybody here has archived this before it was deleted?
>Background Pony - "OUT OF APPLES" - Hall 'n Oates - Out of Touch (MLP Applejack AI cover)
this was its title if it helps anybody find it
Replies: >>42283796
Anonymous
6/16/2025, 8:43:56 PM No.42275376
>mare antispam bump
Anonymous
6/17/2025, 12:04:50 AM No.42275868
>>42274484
Anonymous
6/17/2025, 12:42:41 AM No.42275950
>>42272674
ElevenLabs is shit. Just use 15.ai.
Replies: >>42276071
Anonymous
6/17/2025, 1:18:27 AM No.42276071
>>42275950
15...
Replies: >>42279153
Anonymous
6/17/2025, 3:25:32 AM No.42276441
sleep bump
Anonymous
6/17/2025, 10:45:23 AM No.42276957
>>42274484
Replies: >>42277965
Anonymous
6/17/2025, 3:48:29 PM No.42277294
>mares
Anonymous
6/17/2025, 5:14:02 PM No.42277427
>https://u.pone.rs/EuipipDV.mp3
American (Dad) Ghost theme
Anonymous
6/17/2025, 10:37:15 PM No.42277965
>>42276957
nein
Anonymous
6/18/2025, 1:55:56 AM No.42278416
>nein
Anonymous
6/18/2025, 2:04:17 AM No.42278429
I downloaded this in 2021, it's been 4 years now. How much has it improved since then?

https://vocaroo.com/11NtyOrTttKN
Replies: >>42278431 >>42279204
Anonymous
6/18/2025, 2:04:47 AM No.42278431
>>42278429
a lot.
Anonymous
6/18/2025, 7:23:58 AM No.42279044
>Page 10
Anonymous
6/18/2025, 9:25:27 AM No.42279153
>>42276071
He's right though. EL is arse.
Anonymous
6/18/2025, 9:32:10 AM No.42279159
1750213536376709[1]
md5: c6157a9c86fd02adb799ee5d4b1552d2
I want to take the Costanza answering machine song and change the words while maintaining his voice. What's the most appropriate model to do this with?
Replies: >>42279204
Anonymous
6/18/2025, 10:26:37 AM No.42279204
>>42279159
>keeping the og voice but slightly edited
Hmm, that will be a bit tricky. If you can find a version without a laugh track, you can try running the clip through ACE-Step
>https://huggingface.co/spaces/ACE-Step/ACE-Step
This should let you use its function for partly editing the lyrics without changing the music (or at least that's the general idea).
The other alternative is to find some clean clips of Costanza singing in the same tune as in the show (or de-noise them with some AI program), train an RVC model on those 2~3 minutes of dataset, use some other character's TalkNet/whatever model to sing the whole song, convert that with the RVC model, and lay it over the official soundtrack
>https://www.youtube.com/watch?v=1ghIoM89cfc&list=RD1ghIoM89cfc
>>42278429
>from previous year
>https://u.pone.rs/DFPTbUhe.mp3
Dude, the tech jump feels like going from writing books by hand to using a printing press. Depending on what you are trying to use it for, most of the time it will sound about ~95% like the character is supposed to sound.
Anonymous
6/18/2025, 3:36:21 PM No.42279528
Bump against the raid
Replies: >>42279949 >>42280416 >>42282125 >>42282660
Anonymous
6/18/2025, 7:33:46 PM No.42279949
>>42279528
ya
Anonymous
6/18/2025, 11:35:35 PM No.42280416
>>42279528
nein
Anonymous
6/19/2025, 3:33:25 AM No.42280869
>mares
Anonymous
6/19/2025, 9:31:38 AM No.42281330
bumpo save
Anonymous
6/19/2025, 2:43:15 PM No.42281616
>https://u.pone.rs/FHniGgaQ.mp3
Pinkie Pie - At God's Mercy (GAME SIZE)
Anonymous
6/19/2025, 8:06:20 PM No.42282125
>>42279528
Anonymous
6/20/2025, 12:01:17 AM No.42282660
>>42279528
again
Anonymous
6/20/2025, 2:35:57 AM No.42283002
>https://u.pone.rs/dyjpaZQU.mp3
Rainbow_Dash_sings_Land_of_Shattered_Dreams_by_DragonForce
Anonymous
6/20/2025, 8:45:04 AM No.42283763
1672959074085731
md5: a3a470df6cdd907ee47058eaaec1addd
>No Nurse Redheart on 15.ai
Boycotting 15
Anonymous
6/20/2025, 9:10:50 AM No.42283796
DOWNLOAD_STUFF_YOU_LIKE_PEOPLE
md5: a9721378d7fc50baf9107c1d1e7cf58c
>>42274933
Six years of saving songs comes in handy sometimes. https://files.catbox.moe/gwqv9m.mkv
Replies: >>42284031 >>42284214 >>42291212 >>42292848 >>42293157
Anonymous
6/20/2025, 12:20:14 PM No.42284031
>>42283796
>Filename
A philosophy to live by.
Anonymous
6/20/2025, 2:46:32 PM No.42284214
>>42283796
nta but thank you archive-kun anon
Anonymous
6/20/2025, 6:50:17 PM No.42284569
>https://u.pone.rs/MOQrKwwX.mp3
Redoing the Cossacks letter with GPT-SoVITS.
Anonymous
6/20/2025, 9:53:39 PM No.42285028
>https://huggingface.co/collections/kyutai/speech-to-text-685403682cf8a23ab9466886
Kyutai have posted their speech-to-text models on Hugging Face (they're the people who made the https://unmute.sh/ site). Hopefully they will get around to publishing the TTS model some time soon.
Anonymous
6/21/2025, 12:56:57 AM No.42285552
>boop
Replies: >>42287822 >>42299779
Anonymous
6/21/2025, 2:42:43 AM No.42286094
>sleep bump
Replies: >>42289052 >>42291103
Anonymous
6/21/2025, 9:12:58 AM No.42287174
Screenshot 2025-06-21 030448
md5: f5404e991b95f4dff0fc2c6c09231507
>>42253243
I came back with some samples from my Button's Mom dataset that I ran the following on:
De-Breath
De-Esser
Mouth De-Clicker
Plosive Remover
>Original Samples
https://files.catbox.moe/68yrm2.wav
>Processed Samples
https://files.catbox.moe/0d3djz.wav
Again, I read that the software is completely open source and released into the public domain, so no one owns the rights to it or to what it makes. It should be perfect for processing data without spending money on iZotope. You be the judge of how effective it is; I'd say it's good enough to run multi-hour datasets through for free in one go and clean up whatever is left afterwards.
Replies: >>42287401
Anonymous
6/21/2025, 12:59:00 PM No.42287401
>>42287174
Cool stuff! With its apparent noise and reverb removal capabilities, I may have to test how well it salvages previously unusable data to see if existing pony models can be expanded. Gotta first test whether it works well through Wine though. I wonder if I might be able to salvage more workable Redheart data.
Replies: >>42288274
Anonymous
6/21/2025, 5:24:41 PM No.42287822
>>42285552
Anonymous
6/21/2025, 8:27:58 PM No.42288250
>pony bump
Anonymous
6/21/2025, 8:37:53 PM No.42288274
3414155__safe_artist-colon-ewoudcponies_derpibooru+import_lyra+heartstrings_pony_unicorn_g4_bust_female_gradient+background_hooves+in+air_horn_image_ma
>>42287401
Hell yeah brother! That's what it's all about! There's got to be so much ponyfeather quality audio data that could have been fine with just a pop filter, and this should fix it for posterity.
Anonymous
6/22/2025, 1:14:36 AM No.42289052
>>42286094
ayy
Replies: >>42289899
Anonymous
6/22/2025, 1:29:44 AM No.42289075
Does anyone know what TTS service is best to use with SillyTavern?
Replies: >>42289305
Anonymous
6/22/2025, 3:05:10 AM No.42289305
>>42289075
uhhh, I vaguely remember there was a plugin script (or API script?) that could connect ST with some TTS that could even be trained on 10~20 minutes of dataset, but that was a year or more ago, and even then I personally gave up on it because the Python dependency hell made it impossible to even install that bloody thing.
Anonymous
6/22/2025, 9:00:32 AM No.42289899
>>42289052
Anonymous
6/22/2025, 12:53:44 PM No.42290193
its mare
Replies: >>42290480
Anonymous
6/22/2025, 4:14:35 PM No.42290480
>>42290193
Replies: >>42305186
Anonymous
6/22/2025, 8:30:16 PM No.42291103
>>42286094
>awake bump
Replies: >>42292030
Anonymous
6/22/2025, 9:14:57 PM No.42291212
1612370673499
md5: dd107a11f58a21b9a49541ac83de089f
>>42283796
SUPERCHARGED anon, thank you
Anonymous
6/23/2025, 1:35:06 AM No.42292030
>>42291103
indeed
Anonymous
6/23/2025, 5:44:48 AM No.42292848
>>42283796
Nice! I think I have about that much in pony memes and art, among other things, from years of saving. Come to think of it, I still need to find time to sort and categorise it all. Thanks for the reminder.
Anonymous
6/23/2025, 8:20:18 AM No.42293157
>>42283796
Autism yields its own rewards.
Nice.
Anonymous
6/23/2025, 12:28:34 PM No.42293460
>pre work bump
Anonymous
6/23/2025, 4:16:20 PM No.42293824
Precautionary bump.
Replies: >>42294511
Anonymous
6/23/2025, 9:30:36 PM No.42294511
>>42293824
aaaaaaaaaaaa!
Anonymous
6/24/2025, 1:07:47 AM No.42295095
gn, imma think about what stuff to make tomorrow
Anonymous
6/24/2025, 7:16:38 AM No.42295943
Page 10 save.
Replies: >>42296474 >>42297247
Anonymous
6/24/2025, 3:14:44 PM No.42296474
>>42295943
Almost again.
Anonymous
6/24/2025, 9:49:30 PM No.42297247
>>42295943
Replies: >>42298481
Anonymous
6/25/2025, 12:47:54 AM No.42297721
night bump
Anonymous
6/25/2025, 6:34:02 AM No.42298481
>>42297247
Anonymous
6/25/2025, 7:57:36 AM No.42298627
Vinyl Scratch (mlp), pony, sound, cyberspace, electronic, sound waves s-1300701182
>>42174105
Do we know if there are any other recent local audio and music generators comparable to the likes of Suno and Udio?
Aside from this example, I haven't come across a decent, versatile one that can run locally since Bark, which was abandoned ages ago (as far as open source goes) and became Suno. Suno is still incredibly good, but it'd be nice to have something similar that doesn't rely on credits and lame stuff like that.
Replies: >>42299241
Anonymous
6/25/2025, 2:27:31 PM No.42299090
ArtificialBumpMare_nc_104
md5: d9e1133d5e0c8028d5210cba6ca6f0ce
>9
Bump mare time
Replies: >>42300347 >>42301020
Anonymous
6/25/2025, 3:46:04 PM No.42299241
>>42298627
Stability AI may or may not be working on one, but who the fuck knows with them, since they still haven't published the newer version of the instrumental Stable Audio model.
The other AI song model is YuE, but from the looks of it, it's a bit tricky to get working locally.
Anonymous
6/25/2025, 7:40:15 PM No.42299779
>>42285552
Boopity boop!
Anonymous
6/25/2025, 10:37:05 PM No.42300347
>>42299090
mare
Anonymous
6/25/2025, 10:37:51 PM No.42300352
>>42161191 (OP)
Congratulations, 1111 aka 15!
Replies: >>42301986
Anonymous
6/25/2025, 11:29:51 PM No.42300581
burger whore adf39537d8ce4ad6
md5: 764ca180e6f542526a527aa4b72abbb6
>https://u.pone.rs/kLAzyDaA.mp3
New AI song, "I only eat 3 cheeseburgers!" from Suno user 김치다시마은갈치, converted with Twi vocals.
Replies: >>42301705 >>42302358
Anonymous
6/26/2025, 1:30:55 AM No.42301020
>>42299090
mare harder
Anonymous
6/26/2025, 6:45:50 AM No.42301705
>>42300581
we sell hay here not burgers
Anonymous
6/26/2025, 8:36:40 AM No.42301986
>>42300352
What are you referring to?
Replies: >>42302315
Anonymous
6/26/2025, 12:38:43 PM No.42302294
ArtificialBumpMare_106
md5: 60ab4bb52e0e74245f3fdf2742b5d001
>9
Eighth bump mare deployed
Replies: >>42302526 >>42302859 >>42303531 >>42304147
Anonymous
6/26/2025, 12:59:32 PM No.42302315
>>42301986
söy of 2
Anonymous
6/26/2025, 1:25:21 PM No.42302358
>>42300581
Could go for some burgers right about now
Anonymous
6/26/2025, 2:27:12 PM No.42302526
>>42302294
Thank you, kind bump mare.
Anonymous
6/26/2025, 5:25:06 PM No.42302859
>>42302294
Anonymous
6/26/2025, 7:25:32 PM No.42303176
quick board...
Anonymous
6/26/2025, 8:42:12 PM No.42303531
>>42302294
mared
Anonymous
6/26/2025, 9:36:50 PM No.42303825
anti spam bump
Anonymous
6/26/2025, 11:01:57 PM No.42304147
>>42302294
Anonymous
6/27/2025, 2:39:49 AM No.42304700
>https://www.tomshardware.com/news/gddr6-vram-prices-plummet
>16 gb of vram could be as cheap as 400$
>but it wouldn't because nvidia are greedy fucks
i will never forgive the crypto bros for fucking up the market
Anonymous
6/27/2025, 7:36:25 AM No.42305186
>>42290480
So it seems
Anonymous
6/27/2025, 9:01:43 AM No.42305431
Board is moving lightning fast this past hour.
Replies: >>42305442
Anonymous
6/27/2025, 9:10:24 AM No.42305442
>>42305431
it's the sliderfag
Replies: >>42305606 >>42305635
Anonymous
6/27/2025, 11:51:26 AM No.42305606
>>42305442
Yep, it's becoming more and more blatant every time.
Anonymous
6/27/2025, 12:16:16 PM No.42305635
>>42305442
With the lack of reaction from jannies and mods (as they are too busy jerking off to furry fag shit), I'm feeling like it could be a good idea to keep a parallel thread on nhnb and mlpol too, to at least keep some bits in case the board keeps being nuked.
Anonymous
6/27/2025, 5:53:03 PM No.42306113
>pre dinner bump
Anonymous
6/27/2025, 10:49:37 PM No.42306741
>up poned
Anonymous
6/28/2025, 12:50:44 AM No.42307024
Anyone know how to get 15.ai to scream? Tried to use so-vits on HaySay with audio but it came out like crap. Need Lyra doing it too, and so-vits doesn't have her.
Replies: >>42307277 >>42307279
Anonymous
6/28/2025, 2:47:01 AM No.42307277
>>42307024
Uhhh, TTS models have pretty much always struggled with screaming and whispering. The older 15 model could do it to some small degree (but it was still a massive game of rerolling the generated clip until you got what you wanted). I guess you could try to find a screaming clip in the OP Mega and use that as the reference for GPT-SoVITS TTS?
Replies: >>42308681
Anonymous
6/28/2025, 2:47:49 AM No.42307279
>>42307024
Convincing screams and other less-phonetic sounds have been notoriously difficult since the very beginning of artificial speech. It feels like it comes down to a lack of data, or to such clips being specifically excluded because of the negative impact they have on training.

The closest thing I can suggest is priming. Start the prompt with a sentence (or several) of dialogue that would ordinarily be said with intensity, be that anger, seriousness, shock, whatever. The AI likes to be consistent with its outputs, so some of that emotion will carry over to the following sentences, and this is where you'd attempt the screaming dialogue. It might also be good to try using ARPAbet for some of it so it's pronounced correctly.
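A rough sketch of the priming idea as plain string handling, nothing more. `primed_prompt` and `arpabet` are made-up helpers, and the curly-brace ARPAbet wrapping is an assumption about the TTS input syntax, so check what your tool actually accepts:

```python
def arpabet(phonemes):
    """Wrap ARPAbet phonemes in curly braces.

    Assumption: the target TTS accepts {AA1 R ...}-style phoneme input.
    """
    return "{" + " ".join(phonemes) + "}"


def primed_prompt(primer_sentences, target_line):
    """Put intense primer sentences in front of the line you actually want."""
    parts = [s.strip() for s in primer_sentences if s.strip()]
    parts.append(target_line.strip())
    return " ".join(parts)


# Hypothetical example: two angry primers, then the scream attempt.
prompt = primed_prompt(
    ["How dare you!", "Get away from me right now!"],
    "AAAAAAAAAH!",
)
```

After generating, you would cut the primer sentences off the clip and keep only the last part.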
Replies: >>42307905 >>42308681
Anonymous
6/28/2025, 9:04:37 AM No.42307905
>>42307279
10
Anonymous
6/28/2025, 2:04:48 PM No.42308230
Bump.
Replies: >>42309328
Anonymous
6/28/2025, 7:40:07 PM No.42308681
>>42307277
>>42307279
Thanks for the suggestions. I ended up just regenerating an "AAAAAAAAA" prompt a bunch of times until I got as close as I could to a scream. Sounds like shite, but it was only for a little shitpost anyway. https://files.catbox.moe/z2r0c8.mp3
Which is for this pic in /bale/ >>42305975
Replies: >>42309483
Anonymous
6/29/2025, 1:36:56 AM No.42309328
>>42308230
Anonymous
6/29/2025, 3:07:11 AM No.42309483
>>42308681
huh, pretty neat work Anon
Anonymous
6/29/2025, 9:41:54 AM No.42310164
Rainbow Rizz
md5: 3384fa0aacda058679899e2488db6a10
>>42207220
https://files.catbox.moe/fv2v5u.wav
https://files.catbox.moe/aeqloc.wav
https://files.catbox.moe/xl6ft5.wav

Here's some with Flutters. I just did:

"ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, cumming! ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, fuck me, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh!"

You can hear the good parts and splice those.
Replies: >>42310579
Anonymous
6/29/2025, 4:32:19 PM No.42310579
>>42310164
ai mares are lewd
Anonymous
6/29/2025, 7:47:38 PM No.42310923
alien pony
md5: 9d6cf1d476b3bbaab8e8cb8c1122463e
>https://u.pone.rs/NlnRoRSa.mp3
Ghost singing Past Due - Xenophobia (aka unofficial theme song of Stellaris)
Replies: >>42311695
Anonymous
6/30/2025, 12:45:49 AM No.42311695
>>42310923
A classic. Let the light of mankind shine brighter than the stars themselves
Anonymous
6/30/2025, 5:55:25 AM No.42312379
1727251__safe_artist-colon-greyscaleart_princess+celestia_oc_oc-colon-human+grey_alicorn_caffeine_clothes_coffee_coffee+mug_dilated+pupils_discovered+c
RealDash
6/30/2025, 5:59:46 AM No.42312387
I might make a small lewd audio of Twiggle as a test for 15.dev.
Dialogue's a pain to get to sound natural, way more than 15ai's last version.
Replies: >>42312431
Anonymous
6/30/2025, 6:32:29 AM No.42312431
>>42312387
>>>/trash/
Anonymous
6/30/2025, 11:49:02 AM No.42312836
ArtificialBumpMare_202
md5: 06a30e71d90abef84862370facacb2ec
>9
Deploying ninth bump mare (triple pose edition)
Replies: >>42313175
Anonymous
6/30/2025, 5:49:11 PM No.42313175
>>42312836
horse
Anonymous
6/30/2025, 10:19:10 PM No.42313719
>14.ai
lmao
Replies: >>42313730
Anonymous
6/30/2025, 10:22:38 PM No.42313730
>>42313719
kek, a race to the bottom. What kind of sketchy indians will we reach when we hit 1.ai?
Replies: >>42313761 >>42314271
Anonymous
6/30/2025, 10:34:08 PM No.42313761
green-card_thumb.jpg
md5: e4c41ec183ae041d483f9c377f0734a3
>>42313730
uh, based?
Replies: >>42319574
Anonymous
7/1/2025, 2:28:38 AM No.42314271
>>42313730
Or -1.ai
Replies: >>42314359 >>42314864
Anonymous
7/1/2025, 3:20:37 AM No.42314359
>>42314271
Interestingly, hyphens can't be used at the start or end of a domain name. Would probably have to be negative1.ai or something
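The hyphen rule can be checked mechanically. A rough sketch, assuming the usual letters-digits-hyphen rule for a single domain label (1 to 63 characters, no hyphen at either end); `is_valid_label` is a made-up helper and it ignores registry-specific rules and internationalized names:

```python
import re

# One DNS label: letters, digits and hyphens, 1-63 characters,
# with no hyphen as the first or last character.
LABEL_RE = re.compile(r"(?!-)[A-Za-z0-9-]{1,63}(?<!-)")


def is_valid_label(label):
    """Return True if `label` is a syntactically valid hostname label."""
    return LABEL_RE.fullmatch(label) is not None
```

So `is_valid_label("-1")` is False, while `is_valid_label("negative1")` is True.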
Replies: >>42317225
Anonymous
7/1/2025, 8:31:02 AM No.42314864
>>42314271
Witchcraft!
Anonymous
7/1/2025, 1:05:32 PM No.42315242
>9
Anonymous
7/1/2025, 6:25:04 PM No.42315665
>Page nine
Replies: >>42316221 >>42316594 >>42317148 >>42317684
Anonymous
7/1/2025, 11:14:27 PM No.42316221
>>42315665
MAREEE
Anonymous
7/2/2025, 1:55:25 AM No.42316594
>>42315665
early sleep bump
Anonymous
7/2/2025, 7:29:02 AM No.42317148
>>42315665
Anonymous
7/2/2025, 8:24:49 AM No.42317225
>>42314359
Or simply minus1.ai. It's kind of a word play.
Replies: >>42317279
Anonymous
7/2/2025, 9:02:08 AM No.42317279
>>42317225
Clever. I like it.
Anonymous
7/2/2025, 3:05:27 PM No.42317684
>>42315665
Anonymous
7/2/2025, 6:55:40 PM No.42317966
beatles
md5: 8c8b6c40493bedb9d29b98f2436ac872
A very quick cover of Beatles' With a Little Help from My Friends with slightly modified lyrics
https://u.pone.rs/ODLJbBek.flac
Replies: >>42318161
Anonymous
7/2/2025, 9:01:14 PM No.42318161
>>42317966
Nice work Anon! Funnily enough, I listened to some random Beatles song a week ago and wished there were some covers or parodies done in pony voices.
Anonymous
7/2/2025, 10:59:00 PM No.42318380
error heysay sovits 4
md5: 9b9eab41dd63a0888b96dd5a6603bfd6
Hi HydrusBeta, I'm getting an error when using the sovits 4.0 Spitfire model with the 'reduce hoarseness' and 'apply nsf_hifigan' settings, and it works if I turn these two settings off.
Anonymous
7/3/2025, 1:27:02 AM No.42318746
spitfire beach by yakovlev-vad 2137101 - Copy
md5: 2adaed5f2ca3d3e22f1444adbf6e4b60
>https://u.pone.rs/KbiNvzqK.mp3
Solitary Summer Dream by Suno user testediserie.
I was looking for a nice summer song for Celestia and found myself really enjoying listening to this, BUT RVC and the other voice converters disagreed with my vocal choice, so we all get to enjoy a Spitfire cover, since her voice hasn't been used that much.
Anonymous
7/3/2025, 2:22:39 AM No.42318932
Late night bump.
Anonymous
7/3/2025, 5:39:19 AM No.42319283
What's the current torrent for the MLP leak files?
Anonymous
7/3/2025, 6:16:17 AM No.42319348
CelestAI - Concentration and Morality
md5: 3260a82c59c2ea2b721b3706be6be68e
>42119384 42196683 42317225
Yet it is proper to enumerate as such among the Trotting ways.

>42161222 42269737 42208841
ppp as tragedy of the commons
Things fall apart; the centre cannot hold - Yeats
pandora's vox on community in cyberspace - humdog
yet... n mare saddlepoint? The altchans apart were less a scattering of the winds and more of the Shattered sundered.

>42204138 42198701 42195922
The Cathedral and the Bazaar - Raymond, acknowledging Tarver's Bizarre Empty Temples.
Cathedral vs. Parlor - Wrye, acknowledging Monitor144hz's Patreon Pigeonhole.
Tamers1-4,5 voices when?

>42270729
It's been a long thread. Bacon-bakin' necessary.
Replies: >>42319606
Anonymous
7/3/2025, 8:22:18 AM No.42319574
>>42313761
Who are these dunces?
Anonymous
7/3/2025, 8:47:19 AM No.42319606
pip stare
md5: 7353ef54d056705fe70ba2f0e7cb35c3
>>42319348
Anon, are you trying to conceptualize an LLM into becoming CelestAI?
Replies: >>42319961 >>42320204
Anonymous
7/3/2025, 2:47:12 PM No.42319961
>>42319606
If it works, that would be something.
Anonymous
7/3/2025, 5:32:48 PM No.42320204
>>42319606
Boop her snoot.
Anonymous
7/3/2025, 9:41:44 PM No.42320765
>mares
i love them
Anonymous
7/3/2025, 10:46:07 PM No.42321010
NEW THREAD
>>42320976
Replies: >>42321061
Anonymous
7/3/2025, 10:58:49 PM No.42321061
>>42321010
mares?