Thread 42161191

545 posts 222 images /mlp/

Anonymous 5/2/2025, 7:13:41 AM No.42161191 [Report] >>42161203 >>42196683 >>42300352

Pony Preservation Project (Thread 154)

Welcome to the Pony Voice Preservation Project!
youtu.be/730zGRwbQuE

The Pony Preservation Project is a collaborative effort by /mlp/ to build and curate pony datasets for as many applications in AI as possible.

Technology has progressed such that a trained neural network can generate convincing voice clips, drawings and text for any person or character using existing audio recordings, artwork and fanfics as a reference. As you can surely imagine, AI pony voices, drawings and text have endless applications for pony content creation.

AI is incredibly versatile, basically anything that can be boiled down to a simple dataset can be used for training to create more of it. AI-generated images, fanfics, wAIfu chatbots and even animation are possible, and are being worked on here.

Any anon is free to join, and there are many active tasks that would suit any level of technical expertise. If you’re interested in helping out, take a look at the quick start guide linked below and ask in the thread for any further detail you need.

EQG and G5 are not welcome.

>Quick start guide:
docs.google.com/document/d/1PDkSrKKiHzzpUTKzBldZeKngvjeBUjyTtGCOv2GWwa0/edit
Introduction to the PPP, links to text-to-speech tools, and how (You) can help with active tasks.

>The main Doc:
docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit
An in-depth repository of tutorials, resources and archives.

>Online speech generation
haysay.ai

>Active tasks:
Research into animation AI
Research into pony image generation

>Latest developments:
http://ponepaste.org/10865

>The PoneAI drive, an archive for AI pony voice content:
drive.google.com/drive/folders/1E21zJQWC5XVQWy2mt42bUiJ_XbqTJXCp

>Clipper’s Master Files, the central location for MLP voice data:
mega.nz/folder/jkwimSTa#_xk0VnR30C8Ljsy4RCGSig
mega.nz/folder/gVYUEZrI#6dQHH3P2cFYWm3UkQveHxQ
drive.google.com/drive/folders/1MuM9Nb_LwnVxInIPFNvzD_hv3zOZhpwx

>Cool, where is the discord/forum/whatever unifying place for this project?
You're looking at it.

Last Thread:
>>42103996

Anonymous 5/2/2025, 7:16:05 AM No.42161200 [Report]

FAQs:
If your question isn’t listed here, take a look in the quick start guide and main doc to see if it’s already answered there. Use the tabs on the left for easy navigation.
Quick: docs.google.com/document/d/1PDkSrKKiHzzpUTKzBldZeKngvjeBUjyTtGCOv2GWwa0/edit
Main: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit

>Where can I find the AI text-to-speech tools and how do I use them?
A list of TTS tools: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit#heading=h.yuhl8zjiwmwq
How to get the best out of them: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit#heading=h.mnnpknmj1hcy

>Where can I find content made with the voice AI?
In the PoneAI drive: drive.google.com/drive/folders/1E21zJQWC5XVQWy2mt42bUiJ_XbqTJXCp
And the PPP Mega Compilation: docs.google.com/spreadsheets/d/1T2TE3OBs681Vphfas7Jgi5rvugdH6wnXVtUVYiZyJF8/edit

>I want to know more about the PPP, but I can’t be arsed to read the doc.
See the live PPP panel shows presented on /mlp/con for a more condensed overview.
2020 pony.tube/w/5fUkuT3245pL8ZoWXUnXJ4
2021 pony.tube/w/a5yfTV4Ynq7tRveZH7AA8f
2022 pony.tube/w/mV3xgbdtrXqjoPAwEXZCw5
2023 pony.tube/w/fVZShksjBbu6uT51DtvWWz

>How can I help with the PPP?
Build datasets, train AIs, and use the AI to make more pony content. Take a look at the quick start guide for current active tasks, or start your own in the thread if you have an idea. There’s always more data to collect and more AIs to train.

>Did you know that such and such voiced this other thing that could be used for voice data?
It is best to keep to official audio only unless there is very little of it available. If you know of a good source of audio for characters with few (or just fewer) lines, please post it in the thread. 5.1 is generally required unless you have a source already clean of background noise. Preferably post a sample or link. The easier you make it, the more likely it will be done.

>What about fan-imitations of official voices?
No.

>Will you guys be doing a [insert language here] version of the AI?
Probably not, but you're welcome to. You can however get most of the way there by using phonetic transcriptions of other languages as input for the AI.

>What about [insert OC here]'s voice?
It is often quite difficult to find good quality audio data for OCs. If you happen to know any, post them in the thread and we’ll take a look.

>I have an idea!
Great. Post it in the thread and we'll discuss it.

>Do you have a Code of Conduct?
Of course: 15.ai/code

>Is this project open source? Who is in charge of this?
pony.tube/w/mqJyvdgrpbWgZduz2cs1Cm

PPP Redubs:
pony.tube/w/p/aR2dpAFn5KhnqPYiRxFQ97

Stream Premieres:
pony.tube/w/6cKnjJEZSCi3gsvrbATXnC
pony.tube/w/oNeBFMPiQKh93ePqTz1ns8

Anonymous 5/2/2025, 7:17:06 AM No.42161203 [Report]

veryVERYbiganchor.jpg md5: dc4d191e...

>>42161191 (OP)
Anchor.

Anonymous 5/2/2025, 7:24:32 AM No.42161222 [Report]

emmy the robot as pony 354131.png md5: 0568fbf1...

>woken up just 5 minutes after thread passed page 10
Stupid fuckers and their "1 post by OP with retarded one bait sentence" threads.
Anyhow, are you guys busy with doing entries for antithology or what (I know I am, im sitting on like 5 half assed ideas that still need doing) ?

Anonymous 5/2/2025, 11:20:02 AM No.42161566 [Report] >>42161651

>page 9 after less than 4 hours
Board activity but at what cost ?

Anonymous 5/2/2025, 12:04:26 PM No.42161651 [Report]

>>42161566
The cost is our sanity.

Anonymous 5/3/2025, 2:10:18 AM No.42163358 [Report] >>42164184

Is there a FLA of Fluttershy's cabin interior or her bedroom in the leak on web archive called MLP FLAs? I tried Dragonshy, Part 1 of Friendship is Magic and Stare Master but it's not in those...

Anonymous 5/3/2025, 8:48:56 AM No.42164184 [Report] >>42169147

bed fs rd blushing.gif md5: 6c35f941...

>>42163358
From what quick googlefu tells me, the list of leaked full assets episode we should have access (from season 8 episodes) is as follows :
6 - "Surf and/or Turf", 7 - "Horse Play", 8- "The Parent Map", 9 - "Non-Compete Clause", 10 - "The Break Up Break Down", 11 - "Molt Down" - , 13 - "The Mean 6"
I swear we had some bits and bobs from other episodes but I cant seem to find a proper list of what is (and is not) archived.
There is this scene from Super Speedy Cider Squeezy 3000 ( and I think in the later season eps with Nightmare Night and one were Discord suffers from being "normal" as well)?

Anonymous 5/4/2025, 1:17:01 AM No.42165897 [Report] >>42165947

13bdb600cbe7ee9d.png md5: 20324040...

>https://codeberg.org/nak/sample-neko
Here is a tool the I spotted on interwebs, that allow to easily list and move 1k+ sound clips from one folder to another .
I feel like it could be really useful to Anons here organising their folders for production of big or small projects.

Anonymous 5/4/2025, 1:42:16 AM No.42165947 [Report] >>42166063

>>42165897
was litterally thinking about how i needed sound effects from the show for a project i was doing
more specifically little things like character laughs or snorts n stuff

Anonymous 5/4/2025, 2:41:29 AM No.42166063 [Report]

>>42165947
A lot of those are in Clipper's Master File Part 2:
https://mega.nz/folder/gVYUEZrI#6dQHH3P2cFYWm3UkQveHxQ/folder/EMZF3ApB

Anonymous 5/4/2025, 8:08:17 AM No.42166563 [Report] >>42167638

Bump.

Anonymous 5/4/2025, 12:51:52 PM No.42166887 [Report]

>https://files.catbox.moe/vx3yr9.mp3

Anonymous 5/4/2025, 9:30:01 PM No.42167638 [Report] >>42168304

>>42166563

Anonymous 5/5/2025, 2:08:01 AM No.42168304 [Report]

>>42167638

Anonymous 5/5/2025, 10:44:09 AM No.42169147 [Report]

>>42164184
ugh, is there a way to get the pop up when you first download a torrent to select files to download again? I've got the magnet for the leak.

Anonymous 5/5/2025, 12:38:54 PM No.42169246 [Report] >>42169373

Best tools if I want to gen Cozy Glow lines?

Anonymous 5/5/2025, 2:18:37 PM No.42169373 [Report] >>42169376 >>42169924

>>42169246
I'm guessing you wish to have it local and didn't want to use haysay ? Get yourself python and gpt sovits.
>https://github.com/effusiveperiscope/GPT-SoVITS
>https://huggingface.co/therealvul/GPT-SoVITS-v2/tree/454406eb40b63c5571f33c29f4fd8bac197131d6/CozyGlow-SVe24-GPTe48

Anonymous 5/5/2025, 2:21:15 PM No.42169376 [Report] >>42169392

>>42169373
Which haysay architecture has the best Cozy?

Anonymous 5/5/2025, 2:28:41 PM No.42169392 [Report]

>>42169376
I'm pretty found of rvc one BUT it heavily dependent on the input audio .

Anonymous 5/5/2025, 8:04:30 PM No.42169924 [Report] >>42170104 >>42171853

>>42169373
What's the current sota for voice2voice conversion? Preferably something that can be finetuned. The latest gptsovits v4 is very good but it doesn't sound like the reference so an additional step is needed I think

Anonymous 5/5/2025, 9:48:07 PM No.42170104 [Report]

>>42169924
rvc and so-vits are still the king, I think some Anons posted some other "minimal dataset voice cloning" stuff in the past but none of them seem to stick around (with the github codefags making their training process way too complex, or pulling requirements out of their assess).

Anonymous 5/6/2025, 12:39:05 AM No.42170546 [Report] >>42171393

I heard through the grapevine that 15.ai is coming back, anyone heard about that?

Anonymous 5/6/2025, 7:44:27 AM No.42171393 [Report]

>>42170546
>https://desuarchive.org/mlp/thread/41706417/#41711970
Pretty sure that site is still ded, and it will stay that way for very long time (aka 4ever). if any new code were to be produce by 15ai it would need to be some kind of collaboration with other codefags to avoid being chased by tiny hat lawyers , and by logic of nobody sharing such news around means it's not happening .

Anonymous 5/6/2025, 2:25:48 PM No.42171853 [Report] >>42172693

>>42169924
GPT-SoVITS is mainly intended for text-to-speech. The reference audio is only for providing an emotional style. For speech-to-speech, you should stick to RVC.

Anonymous 5/6/2025, 3:53:35 PM No.42171965 [Report] >>42172009

Is Haysay down for anyone else? I can't seem to reach the site at all.

Anonymous 5/6/2025, 4:21:27 PM No.42172009 [Report]

>>42171965
https://files.catbox.moe/4sz8fc.mp3
the pretty mare voice site seems to be working fine for me. did you try different browser anon?

Anonymous 5/6/2025, 9:37:43 PM No.42172693 [Report] >>42172807

>>42171853
Why wouldn't I be able to do GPT-SoVITS => RVC?

Anonymous 5/6/2025, 10:42:31 PM No.42172807 [Report]

>>42172693
yeah, you can, one problem is sometimes the RVC derps out the outputs when trying to give it lines of the same character, sometimes it depends on what kind of note the clip is hitting and sometimes the electronic goblins are messing about, so just test out different TTS voices to see which one works best with the RVC character you want to output.

Anonymous 5/6/2025, 11:12:45 PM No.42172881 [Report]

accent remover shweta_ai.jpg md5: 66db6d03...

>https://nitter.space/shweta_ai/status/1912536464333893947
I need this for mare content, so I can finally get AJ speak a deep south accent without fluffing around the different words spelling, or get Rarity pronounce words in way more posh manner.

Anonymous 5/7/2025, 12:55:19 AM No.42173088 [Report]

>>42166202
>>42166241
Crossposting from /chag/ thread, they are planing on doing some collaboration with /robowaifu/ guys to start making irl robot ponies. Very cool, and good luck to you !

Anonymous 5/7/2025, 10:25:32 AM No.42173899 [Report] >>42173902

First actually good local music model, like suno v2 quality. Fast as fuck as well.

https://www.reddit.com/r/LocalLLaMA/comments/1kg9jkq/new_sota_music_generation_model/

Anonymous 5/7/2025, 10:26:33 AM No.42173902 [Report]

>>42173899
Also has lora training already, could 100% train pony singing.

Anonymous 5/7/2025, 1:19:16 PM No.42174105 [Report] >>42174936 >>42298627

1733690617595293.gif md5: 643b46d2...

https://ace-step.github.io/
https://github.com/ace-step/ACE-Step

Passes the nigger test.
https://vocaroo.com/11MoCQ68jiLY

And this is fun.
>>>/g/105183843
>>>/g/105184228
I'd love to try with some MLP songs, but I'm a VRAMlet with 6GB and I don't think I can run this yet.

Anonymous 5/7/2025, 8:04:16 PM No.42174701 [Report] >>42175724

Bump.

Anonymous 5/7/2025, 10:21:51 PM No.42174936 [Report] >>42175015

>>42174105
uhh, the collab file they provided seems to only do "text2music", could you/somebody explain how that anon re-edited the OG song with new shitpost lyrics into it?

Anonymous 5/7/2025, 10:58:37 PM No.42175015 [Report] >>42175321

>>42174936
oh, just noticed its in the repair->upload section. however I tried to do a "replace X lyrics with new lyrics" and it really seem to suck ass at it, so im not sure if the anon that made the above song was lucky or had enough autism to spend several hours trying all kinds of combination in making it work.

Anonymous 5/8/2025, 12:56:12 AM No.42175321 [Report] >>42175929

>>42175015
Nope, people posted multiple results in that thread where it Just Worked. The only thing I saw is that the quality will get worse the more the lyrics are changed.

Anonymous 5/8/2025, 4:17:28 AM No.42175724 [Report] >>42180991

>>42174701

Anonymous 5/8/2025, 5:56:56 AM No.42175929 [Report] >>42175971

>>42175321
Oh. I was trying to go for a full lyric replacement, I guess this GitHub is a right step into that direction, it just nit ready for my exact autistic requirements.
Hopefully by the next year we will get improvements on it, because I have some text parody ideas .

Anonymous 5/8/2025, 6:19:36 AM No.42175971 [Report]

>>42175929
I saw someone say that you can separate the stems and get better results. Perhaps you could edit portions of the lyrics one at a time, then mix them back into the instrumental.

VilligerANON 5/8/2025, 8:21:24 AM No.42176140 [Report] >>42176220

Question:
During training, can I use files tagged as clean and noisy files?

Anonymous 5/8/2025, 9:24:19 AM No.42176220 [Report]

>>42176140
Sure, however keep in mind the quality of audio outputs may suffer from it, specially if the ratio of good clips vs noisy clips is skewing towards the noisy side.
And since there are characters that have pretty much noting but mostly noisy audio (like Tree Hugger) the end results may vary from "kind of bad" to "surprisingly decent" .

Anonymous 5/8/2025, 2:17:20 PM No.42176608 [Report] >>42176655

Question to the Anon that was working on OpenUtau diffsinger models, are you planing on creating the models for Rarity and Fluttershy?

DiffAnon 5/8/2025, 2:56:10 PM No.42176655 [Report]

>>42176608
Truth be told, I was planning on it eventually, but I don't know if I really want to anymore. Twilight, Applejack, Rainbow Dash, and Pinkie Pie are a bit spotty as is, and I worry that with Fluttershy's abysmally low amount of singing data (from what I could find) and just not feeling up to it for her or Rarity, I don't think either of them are gonna be made into models anytime soon. Keep in mind, I don't just train one thing, I have to train the acoustic model, then the variance model, then the pitch model, and then fine tune the vocoder, which both takes a lot of time and a lot out of me. I'm not saying it won't ever happen, because I do feel weird about leaving things with just the four I did, but I can't for the life of me bring myself to do the other two just yet. But they'll come one day, hopefully.

Anonymous 5/8/2025, 3:31:54 PM No.42176713 [Report] >>42177027 >>42177542 >>42179060

Speaking of model training, there's still a good few voices that're absent on RVC. It'd be nice to see Moondancer and Cadance and whoever else hasn't been trained yet, Cadance has a model for RVC but it's super noisy.

Anonymous 5/8/2025, 6:07:05 PM No.42177027 [Report] >>42177131

>>42176713
>Moondancer
huh, you are correct, I will see if I can train her rvc model.

Anonymous 5/8/2025, 6:53:10 PM No.42177131 [Report]

>>42177027
hmm, not a great news, Ive check the mega and even when removing only the unusable very noisy audio lines, there is still only 1m50s of audio, which is less than ideal 3m but I can still try.

Anonymous 5/8/2025, 10:32:46 PM No.42177542 [Report] >>42178958

moondancer 1676307268380648.jpg md5: 2a808657...

>>42176713
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/MLP_Moondancer
>https://vocaroo.com/1hV4kTcwCp3E
Here she is, the result isn't half bad but for some reason her voice seems slipping into Rarity voice range. And of course male input voice lines will sound bit rougher in conversion.

Anonymous 5/9/2025, 8:17:02 AM No.42178459 [Report]

>>42178450
more years! TRUST THE PLAN!

Anonymous 5/9/2025, 3:09:04 PM No.42178958 [Report] >>42179060

>>42177542
Awesome, thanks. I look forward to trying it once I have the time.

Anonymous 5/9/2025, 4:35:33 PM No.42179060 [Report] >>42185313

cadence emo 27711757171b56ff.jpg md5: b8137183...

>>42176713 >>42178958
>Cadance
>https://voca.ro/188F1imvN2L7
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/MLP_Cadance_Clean
RVC model of Cadance, trained on clean audio only.

Anonymous 5/10/2025, 6:44:52 AM No.42180991 [Report] >>42181427

>>42175724

Anonymous 5/10/2025, 12:10:12 PM No.42181427 [Report] >>42185114

>>42180991

VilligerANON 5/10/2025, 1:29:53 PM No.42181482 [Report] >>42181575 >>42191078

https://files.catbox.moe/x41lrp.wav
I have generated with this repo: https://github.com/CookiePPP/cookietts
Model from: https://drive.google.com/drive/folders/1nTyn6qr2b76aOE430trasuZj0Kr2H_ya
(Tacotron2: tt2_outdir_p3_2_0.5DFR_0.0Dropout)
(Hifi-gan cp_hifigan_universal44Khz_mlpft)
>Maybe I will create a better vocoder and Notebook

Anonymous 5/10/2025, 2:54:00 PM No.42181575 [Report] >>42183033

>>42181482
That's interesting Anon but I'm not sure on how it will compare with all the new tech, since tacotron is almost five years old.

Anonymous 5/11/2025, 4:06:32 AM No.42183033 [Report] >>42183358

>>42181575
I feel like there isn't much coming out for pony specificly in recent times though.

VilligerANON 5/11/2025, 8:09:01 AM No.42183358 [Report] >>42183361 >>42183486

Does anyone want any bonus features that I can add?

>>42183033
I know, right?

VilligerANON 5/11/2025, 8:10:01 AM No.42183361 [Report]

>>42183358
> To the Inference Script

Anonymous 5/11/2025, 9:45:37 AM No.42183486 [Report]

>>42183358
Well, I would like it if the offline gpt-sovits script also copy the haysay options for automatic emotions drop down menu as well as the audio clip slow/speed up stretch settings, but that's something Vul would need to add to his webui script.

Anonymous 5/11/2025, 10:35:01 PM No.42184597 [Report] >>42184600 >>42184613

>nitter.space/jason_kint/status/1921546181357838531
>nitter.space/LuizaJarovsky/status/1921286826402422927
>ai copyright to affect the "commercial use"
Time to split the hairs on what counts as "commercial use" and what doesn't. Also good luck trying to force this on china and their no-fucks-given R&D departments.

Anonymous 5/11/2025, 10:36:29 PM No.42184600 [Report] >>42184628

>>42184597
>muttmerica
Phew, I thought it was actually serious.

Anonymous 5/11/2025, 10:41:03 PM No.42184613 [Report] >>42184628

>>42184597
>america keeps digging its grave in the name of "progress"
the soviet union fell behind in technology because the government tried to control things, but yeah, let's not learn anything from that.

Anonymous 5/11/2025, 10:49:11 PM No.42184628 [Report]

>>42184600
I can see Diseny and such trying to push for it, just like they did with hundreds of years of copyright laws, but as Anons on /g/ pointed out, all the big league companies need to do is buy portions of semi big publishing companies and claim that retroactively all the existing books on the system were allowed to be used in ai training.
>>42184613
Tell me about it, I remember reading a biography of electrician that was bribed to "no be in hurry" when repairing the wheat moisture measuring apparatus, because the assigned inspector could use rule of thumb on deciding how much moisture was in the transported grain and deduce the farmers pay while pocketing the spillway difference.

Anonymous 5/12/2025, 2:04:59 AM No.42185114 [Report] >>42185748

>>42181427

Anonymous 5/12/2025, 4:01:05 AM No.42185313 [Report] >>42185322 >>42187317

>>42179060
Your local AI still can't sing worth a shit.
Evolve or die, PPP.

Voice acting requires a certain melodic way of talking which your current model does not support, 3P General.

Anonymous 5/12/2025, 4:07:09 AM No.42185322 [Report] >>42186106

>>42185313
There are no more than ten anons itt, all namefags, that know their shit, and they lead very busy lives. This thread was just anons enjoying the fruits of others' labors. There are no more fruits to enjoy, or worth enjoying so the Pony Preservation Project has become the Pony Preservation Project Preservation Project. It's over.
>Mareification not required.

Anonymous 5/12/2025, 8:36:51 AM No.42185748 [Report] >>42190922

>>42185114

Anonymous 5/12/2025, 2:45:22 PM No.42186106 [Report] >>42187314

>>42185322
yeah, back in 2019 + 20 everybody were hyped since show only just ended and board was still pretty alive (and with everyone locked up, all they could do is making pony content without any distractions). Now a lot of the ai tools have became available (music, art, even animations) but everything is kind of disjointed and difficult to put together.

Anonymous 5/12/2025, 5:17:15 PM No.42186245 [Report] >>42188332

unpopular demand.png md5: 4f60829e...

I feel Anons just need to find a proper spark, something that would be fun to work on, like randomly spotting a song and wondering how it would sound if it was done by pony.
>https://files.catbox.moe/qg2qn5.mp3
Anyhow, VS singing the Ye new song, OG cover from TowerGangToad. I really wanted to use Zecora voice but the voice clips just wouldn't come out right from neither of the model types.

Anonymous 5/13/2025, 1:57:24 AM No.42187314 [Report]

>>42186106
Don't forget that a lot of new stuff gets immediately corpo'd these days too. Shit like that stifles innovation.

Anonymous 5/13/2025, 1:59:29 AM No.42187317 [Report]

>>42185313
>melodic way of talking
China is the future

75.ai 5/13/2025, 2:01:08 AM No.42187322 [Report] >>42189535

I will save this general.

Anonymous 5/13/2025, 8:36:14 AM No.42188332 [Report] >>42188430

>>42186245
Try replicating S1 Luna's voice. Chip in some money and put Tabitha to voice it.

Anonymous 5/13/2025, 10:00:48 AM No.42188430 [Report]

>>42188332
>S1 Woona
It's technically doable.
https://huggingface.co/spaces/Plachta/VALL-E-X
https://desuarchive.org/mlp/thread/40503961/#40518915
It will just take about 1~6 months of non stop generating audio until the artificial dataset has five minutes worth audio clips.

Anonymous 5/13/2025, 11:54:32 AM No.42188537 [Report] >>42188542

>>/wsg/5872172
I want this, but for ponies, dubbing in my country is cursed, either VAs will put energy to empathize wrong aspect of character (a young rogue like adventurer will instead sound like snotty little shit), give no shits to act at all or give the role to somebody that will completely not fit the character.

Anonymous 5/13/2025, 11:58:25 AM No.42188542 [Report]

>>42188537
>https://files.catbox.moe/yck7ps.mp4
fug, crossposting failed

Anonymous 5/13/2025, 9:36:08 PM No.42189535 [Report]

>>42187322

Anonymous 5/14/2025, 2:19:54 AM No.42190366 [Report]

>>42188455
Would be funny if that happened.

Anonymous 5/14/2025, 7:21:08 AM No.42190922 [Report] >>42192724

>>42185748

VilligerANON 5/14/2025, 8:36:01 AM No.42191078 [Report] >>42191285

>>42181482
I've updated the synthesis script, and now these are the new results
>https://files.catbox.moe/tv8c4i.wav
Does it sound like those 48 kHz MMI models, or does it sound like newer tech?

Anonymous 5/14/2025, 9:42:12 AM No.42191153 [Report] >>42191285

https://www.minimax.io/audio
https://minimax-ai.github.io/tts_tech_report/

Anonymous 5/14/2025, 12:30:28 PM No.42191285 [Report] >>42191588

>>42191078
Is thats TTS or voice conversion? It still has that funny buzzing that tacotron2 / talknet models suffered from, so its kind of hard to tell if .
>>42191153
hmm, website do not seem to be more useful than other tts sites. BUT the paper is interesting, if the cloning of 5 seconds is not complete cherry picked bullshit I would love to be able to use it.

VilligerANON 5/14/2025, 5:08:48 PM No.42191588 [Report]

>>42191285
TTS.
> The repo:
> https://github.com/TheDevloper2023/cookiettsfork/tree/master/CookieTTS
> which is a fork of https://github.com/CookiePPP/cookietts/tree/master

Anonymous 5/15/2025, 2:19:42 AM No.42192724 [Report] >>42193438

>>42190922

Anonymous 5/15/2025, 8:39:58 AM No.42193438 [Report]

>>42192724

15 5/16/2025, 7:15:46 AM No.42195922 [Report] >>42195946 >>42196013 >>42196073 >>42196204 >>42196230 >>42196243 >>42196274 >>42196298 >>42196305 >>42196355 >>42196384 >>42196390 >>42196393 >>42196435 >>42196479 >>42196514 >>42196530 >>42196639 >>42196646 >>42196654 >>42196738 >>42196754 >>42196789 >>42196790 >>42196800 >>42196801 >>42196849 >>42196867 >>42196960 >>42197340 >>42197606 >>42198611 >>42198701 >>42204454 >>42204550 >>42204573 >>42204939 >>42205939 >>42205963 >>42206122 >>42207220

Hi, it's been a while, hasn't it?
Here's an alpha website that you can play around with: https://alpha.15.dev/
The backend is currently running on just two GPU instances, and I've set the inference batch size to 1 since this new model requires a lot more computational power than it did two years ago. I can increase the number of GPUs depending on how long each request takes.
More characters and emotions will come soon. Feel free to report any bugs or issues here, too.

Anonymous 5/16/2025, 7:24:07 AM No.42195946 [Report]

>>42195922
holy shit

Anonymous 5/16/2025, 7:50:37 AM No.42196013 [Report]

>>42195922
I hate your guts, sleazebag

Anonymous 5/16/2025, 8:08:40 AM No.42196073 [Report]

1731203818328.png md5: 6b822b24...

>>42195922
>https://alpha.15.dev/examples
nice examples kek

VilligerANON 5/16/2025, 9:04:58 AM No.42196204 [Report] >>42196227

>>42195922
>https://alpha.15.dev/
Can I send this outside of this thread?

15 5/16/2025, 9:25:55 AM No.42196227 [Report]

>>42196204
Sure, go ahead. I'll make an official post on Twitter soon, probably within the next few days.

Anonymous 5/16/2025, 9:30:00 AM No.42196230 [Report]

>>42195922
I'm kneeling so hard rn it hurts

Anonymous 5/16/2025, 9:39:57 AM No.42196243 [Report]

ponk kneel.png md5: 25deb92f...

>>42195922
I have no choice but to kneel

Anonymous 5/16/2025, 9:57:20 AM No.42196274 [Report]

15341327859632555.jpg md5: 81f75419...

>>42195922
IT'S HAPPENING!

Anonymous 5/16/2025, 10:08:04 AM No.42196298 [Report]

2774473.jpg md5: 289eff18...

>>42195922
https://files.catbox.moe/k18mof.mp3
Three stars and now this? We are so fucking back boys!

BGM 5/16/2025, 10:11:18 AM No.42196305 [Report] >>42196384 >>42196896

GASPS.gif md5: e44c385c...

>>42195922
https://files.catbox.moe/01otal.wav
Woah, hi again.
New model's sounding better than ever before. Good speed, emotion settings all work reliably, sounds clear. At the moment it sounds like the characters fall out of how they're supposed to sound on occasion though. Rarity in particular with the fear emotion gives some very strange outputs.
https://files.catbox.moe/k1kvsc.wav

Also, as a UI note, the change notifications upon switching settings and voices blocks the generation button on some resolutions when scrolled up. Only for a second, but it can still delay things.

Anonymous 5/16/2025, 10:25:07 AM No.42196337 [Report] >>42196341 >>42198701

Dear Hydrus Beta, as everyone will get really hyped for return of 15ai, I just want to say I appreciate your work and thanks to HaySay I was able to do all the fun mare music conversion. I hope you will keep it alive and updated as new voice ai will show up in the future.

BGM 5/16/2025, 10:28:19 AM No.42196341 [Report] >>42198701

>>42196337
Seconding this, Haysay is a godsend for my workflow on music projects.

Anonymous 5/16/2025, 10:41:40 AM No.42196355 [Report] >>42196384

>>42195922
https://u.pone.rs/whgPbfzU.mp3

Anonymous 5/16/2025, 10:53:06 AM No.42196384 [Report]

1412208.gif md5: 0ea43b25...

>>42195922
>new site
>>42196305
>new shitpost
>>42196355
>new smutty
brings me back

Anonymous 5/16/2025, 10:57:01 AM No.42196390 [Report]

anon, i'm chubby.gif md5: cac09879...

>>42195922
https://voca.ro/140YNkYngHyz

Anonymous 5/16/2025, 10:57:45 AM No.42196393 [Report]

>>42195922
Godlike web dev skills god fuckin damn

Anonymous 5/16/2025, 11:23:30 AM No.42196435 [Report]

1736796030484316.png md5: eda98636...

>>42195922
https://u.pone.rs/EcUvtwYk.mp3

Anonymous 5/16/2025, 12:18:15 PM No.42196476 [Report]

I hope he will add the old "|" emotional control from the previous website, since the clip reference one is pretty wishy washy. Having both would be pretty perfect to fine tune the output audio.

Anonymous 5/16/2025, 12:20:40 PM No.42196479 [Report]

>>42195922
I can't believe waiting two weeks (a few times) actually worked!

Anonymous 5/16/2025, 12:46:26 PM No.42196514 [Report] >>42196526

a.gif md5: 757186ed...

>>42195922
Yep, it's been a while, cool website.
Let me nit pick on flicker during that transition animation.

Anonymous 5/16/2025, 12:51:18 PM No.42196526 [Report]

bad end.png md5: 36a32b39...

>>42196514
literally unplayable

Anonymous 5/16/2025, 12:52:56 PM No.42196530 [Report]

>>42195922
Curious, how much (if any) AI did you use to make the website?
As for the framework.. React + Next.js? Looks good.
And welcome back.

Anonymous 5/16/2025, 1:18:04 PM No.42196569 [Report] >>42196571

15chill.png md5: 42ce3047...

>there is site OC
Im so sorry bro, but the internet rule demand it.

Anonymous 5/16/2025, 1:18:51 PM No.42196571 [Report] >>42197112

>>42196569
qt oc, whose artstyle is that

Anonymous 5/16/2025, 1:39:00 PM No.42196589 [Report]

>https://u.pone.rs/mLbrNDQB.mp3
Lets test this new site. Gin Blossoms - Hey Jealousy, done with Glimmer RVC to Sovits5 singing model (sounds ok, but i was hopping it would be better.

Anonymous 5/16/2025, 2:09:35 PM No.42196639 [Report]

2309223.jpg md5: d24b5312...

>>42195922
https://vocaroo.com/1bITXue82eed

Anonymous 5/16/2025, 2:14:37 PM No.42196646 [Report]

>>42195922
WE ARE SO FUCKING BACK LIKE NEVER BEFORE

Anonymous 5/16/2025, 2:17:29 PM No.42196654 [Report]

>>42195922
we got 15.ai revival before gta 6

Anonymous 5/16/2025, 2:46:29 PM No.42196683 [Report] >>42196751 >>42264821

>>42161191 (OP)
I know I speak to the dedicated deluded, but the machine is not the path.

Anonymous 5/16/2025, 3:17:03 PM No.42196738 [Report] >>42196755

>>42195922
awesome work but damn we really need an S1 Dash voice preset or something. nu-Dash voice is fucking nails on a chalkboard.

Anonymous 5/16/2025, 3:26:05 PM No.42196751 [Report]

>>42196683
Get a hobby you poor creature.

Anonymous 5/16/2025, 3:26:57 PM No.42196754 [Report] >>42196757 >>42196772 >>42196778

>>42195922
Can we get an ETA on when you are open sourcing this?

I think it is an obvious concern that this will all suddenly disappear for years again.

Anonymous 5/16/2025, 3:27:31 PM No.42196755 [Report] >>42196758

>>42196738
I'd say completely exclude post S3 audio for mane six. Of course it's needed for side characters who lack speaking lines, but it's better to avoid when possible.

Anonymous 5/16/2025, 3:28:16 PM No.42196757 [Report] >>42196778

>>42196754
About 14 days or so

Anonymous 5/16/2025, 3:28:17 PM No.42196758 [Report]

>>42196755
*S2

Poopsikins 5/16/2025, 3:33:52 PM No.42196768 [Report]

https://files.catbox.moe/o4z53n.mp3

Anonymous 5/16/2025, 3:37:45 PM No.42196772 [Report]

>>42196754
one more fortnight

VilligerANON 5/16/2025, 3:41:40 PM No.42196778 [Report] >>42196785

>>42196757
>>42196754
How do you know that?

Anonymous 5/16/2025, 3:46:22 PM No.42196785 [Report]

>>42196778
Sounds like you're not trusting the plan

Anonymous 5/16/2025, 3:48:53 PM No.42196789 [Report]

praisenuke.gif md5: 7dfed0e3...

>>42195922
CHUDDA ETERNALLY BTFO
IT'S HAPPENING

Poopsikins 5/16/2025, 3:50:05 PM No.42196790 [Report]

Screenshot (61)twiedit.png md5: 483ca4d7...

>>42195922
https://files.catbox.moe/9gopqy.mp3

Anonymous 5/16/2025, 4:00:30 PM No.42196800 [Report] >>42196809 >>42196814 >>42196826

1721399816307876.png md5: 1a5d57a3...

>>42195922
Your shit is obsolete, yes that's what happens when you sit on your ass for years with proprietary software. Thanks for GPTSoVits and other solutions. You should have disappeared with your website, at least that wouldn't have tainted the few good memories left when using your tool. Fuck you and your five hours of fame you needed to still feel relevant.

Anonymous 5/16/2025, 4:00:36 PM No.42196801 [Report] >>42196803 >>42196820

>>42195922
One kinda big problem, it won't let me use the ' sign for words... which is weird since a lot of words like don't and isn't NEED that sign.

Anonymous 5/16/2025, 4:02:11 PM No.42196803 [Report] >>42196810

1665.jpg md5: 0c70337f...

>>42196801
You do not need that.

Anonymous 5/16/2025, 4:06:29 PM No.42196809 [Report]

1595119616135.gif md5: b00e5a49...

>>42196800
shut up, nigger

Anonymous 5/16/2025, 4:07:00 PM No.42196810 [Report]

>>42196803
You're right, I don't, but if 15 can fix that, it'd be a big help. Otherwise, the ai second guesses the pronunciation for the words, and it's just... I dunno, I just think it would be a good QOL fix.

Anonymous 5/16/2025, 4:08:28 PM No.42196814 [Report]

>>42196800
Total barbietranny death.

Anonymous 5/16/2025, 4:14:02 PM No.42196820 [Report]

Screenshot 2025-05-16 101311.png md5: c345b4de...

>>42196801
YES HE FIXED IT!! Thank you 15!

Anonymous 5/16/2025, 4:16:22 PM No.42196826 [Report]

>>42196800
It does sound like ass. It's a shame because they're ponies.

Anonymous 5/16/2025, 4:29:47 PM No.42196849 [Report]

>>42195922
>>>/g/105281388

Anonymous 5/16/2025, 4:44:11 PM No.42196867 [Report] >>42196897 >>42196906 >>42197553

1004.png md5: b6268291...

>>42195922
Nightmare Moon has a huge improvement from her previous voice that just sounded like drunk Cheerilee
https://voca.ro/1j9J3CBPQqWN

Poopsikins 5/16/2025, 5:10:36 PM No.42196896 [Report] >>42196899 >>42196914 >>42196923

01.png md5: e06d8c8d...

>>42196305
https://files.catbox.moe/ryyshr.mp3

https://files.catbox.moe/nu5qft.mp3

https://files.catbox.moe/urd6et.mp3

Gosh, I've missed this so much. Posting like this takes me back.

Anonymous 5/16/2025, 5:11:05 PM No.42196897 [Report]

>>42196867
OKAY DAMN that actually sounds dynamic! I love it!

Anonymous 5/16/2025, 5:12:42 PM No.42196899 [Report] >>42196902

>>42196896
Derpy, Maud, and Rainbow Dash, right? It's great that I can actually recognize the voices, to be honest.

Anonymous 5/16/2025, 5:15:05 PM No.42196902 [Report] >>42196947

>>42196899
>Derpy
>It's great that I can actually recognize the voices

Anonymous 5/16/2025, 5:15:50 PM No.42196906 [Report]

1595217799709.gif md5: 2f994f6d...

>>42196867
Nice!

Anonymous 5/16/2025, 5:20:14 PM No.42196914 [Report]

CHICKEN JOCKEY!.jpg md5: ec9a3367...

>>42196896
https://voca.ro/1jlDvvakwJgi

Poopsikins 5/16/2025, 5:24:13 PM No.42196923 [Report] >>42196985

29.jpg md5: 530bbe67...

https://files.catbox.moe/l8ex9a.mp3 >>42196896

Anonymous 5/16/2025, 5:38:14 PM No.42196947 [Report]

>>42196902
Is that not Derpy? I thought because of the “clumsy” mistake and the familiar tone that it was her.

Anonymous 5/16/2025, 5:43:44 PM No.42196960 [Report]

>>42195922
https://u.pone.rs/moQGuPxl.mp3

Poopsikins 5/16/2025, 5:53:17 PM No.42196985 [Report]

1606023.png md5: e9c3312e...

>>42196923

last one from me tonight.
https://files.catbox.moe/esztvq.mp3

Anonymous 5/16/2025, 6:43:19 PM No.42197112 [Report]

>>42196571
I know who's the artist I would rather not tell you directly.
he draws fuck tons of futa.

Anonymous 5/16/2025, 8:18:12 PM No.42197294 [Report] >>42197297

f-MLP509__553B.xfl_s-_tPL_sCharacter.sym_f0000-0064.webp.006.png md5: ec5c6f1a...

https://files.catbox.moe/qoia1a.wav

Luna's crash-out in A Royal Problem if she wasn't fucking around.

Anonymous 5/16/2025, 8:19:59 PM No.42197297 [Report]

>>42197294

https://files.catbox.moe/hhwgsc.mp3

mp3 like it should've been from the beginning lol.

Vogelfag revealed 5/16/2025, 8:36:43 PM No.42197340 [Report] >>42197345 >>42197358 >>42197474 >>42197618

are you kidding me.png md5: 15113750...

She sounds angry & sarcastic which is how I feel, but still unintended on my part.
https://pub-f3186dbecfd64ac085ddc742fc900f59.r2.dev/twilight_sparkle_neutral_1747418267794_variation0.wav

>>42195922
>Feel free to report any bugs or issues here, too
Yeah I see several bugs:
0. You're still not willing to jew out despite clearly needing the money and influence. Jew out or others will outjew you. Stop being a social recluse that's how all scientists die. Learn to sue everyone cause 11.AI clearly stole your technology you moron.
1. You're not open sourcing this to the community (which are of minimal help and lack money to pay for GPUs but they're willing to learn and are very loyal and creative despite me trashtalking them myself back in October)
2. I'm pretty sure ElevenLabs, Udio.AI, SUNO.Ai, etc. stole your technology and perfected it already since 90% of the singing & talking sounds like Tara Strong, Rebecca Shoichet & Ashleigh Ball. The AI can really sing too. To an audiophille it still sounds bad, but to a normie it sounds perfect. Get a fucking marketing team, both you and Tara Strong fucked each other up and should sue every single audio AI possible.
This is what Suno Ai can do right now with the paid model:
https://www.youtube.com/shorts/udOgG0M8pVI

3. Your options & UI is still limited. If I could search a reference line to use any emotion I want without typing in phonetics then that'd be useful for the average normie. You didn't understand what I just told you, did you? LET ME USE THE REFERENCE LINE TO QUICKLY & INSTINCTIVELY USE THE EMOTION I WANT. WE HAVE AN IMPECCABLE MEMORY OF THE SHOW'S DIALOGUE LINES.
Add a voice changer/voice to audio option. It would be so much more intuitive because the AI could hear what emotion I'm going for instantly.

Today's AI still lack a ton of UI options but are getting there at an insanely quick speed such as Suno's ability to grab an existing song and have either the same singer or a new singer sing the same notes with different lyrics.

Today's AI still sounds like an untrained voice actor slurring his lines on purpose and it still sucks compared to audiophille standards, but your current robot sounding AI is dreadful by normal standards. You still haven't learned how to remove the noise?
https://www.youtube.com/watch?v=qu5nnMOQ4VU&ab_channel=A
https://www.youtube.com/watch?v=I1Dy0Zfw6Qs&ab_channel=votums

3.5 You probably didn't notice cause you're not a voice director or you're autistic but ... S1 and S2-S9 's voice directing is completely different. 90% of the dialogue lines used in S2-S9 used only these emotions; depressed, angry, flirty, ANXIOUS, TIRED, reading-off-a-script-at-gunpoint. And that's the acting ... the voices?

In S2+ everyone sounds...
Twilight sounds much lighter in S2+
Applejack & Dash sound much deeper and not in a suave way.
Pinkie sounds way lighter & screechier.
Fluttershy always sounds anxious

Rarity & Spike kinda sound the same.

4. One more thing...

Anonymous 5/16/2025, 8:40:42 PM No.42197345 [Report]

>>42197340
Fuck off retard.

Vogelfag revealed 5/16/2025, 8:44:20 PM No.42197358 [Report] >>42197369 >>42197618

>>42197340

4.
Contact the original voice actors and work together with them. Give me S1 Woona's voice and all is forgiven on my side. ;) Can't say others will forgive you for being a weak leader. These effeminate pussies need a strong leader and I suggest you do too if you can't march down 11Labs HQ and sue the living shit out of them together with Tara Strong. Sounds jewish but that's the truth. You got to outjew the jew in a jewish world. Mrs Strong knows that. I know that. Why can't you fucking comprehend that?
https://youtu.be/wbzRRp2jRHw?t=103

This is what voice acting AI sounds like now:
https://www.youtube.com/watch?v=lPAtoR3YCSc&ab_channel=UndeadHumor
https://www.youtube.com/watch?v=0j1eX7F8OOo&ab_channel=DevilArtemis

BUT I'M GUESSING YOU ALREADY KNOW THAT YET YOU STILL REFUSE TO DO SOMETHING ABOUT IT.
Call your father or something for God's sake, you college pussy kid. Your technology is being stolen under your nose and improved upon tenfold(by jews, not your followers) and you're here moping like a pussy on Twitter and then coming back with a niche version that does 1 thing barely any better and still sucks dick at the other 9 things that goes into audio.
CAN YOUR MODEL AT LEAST SING RIGHT NOW? Cause SUNO's shit can and Udio used to sing good before they had to neuter it because the record companies were after their asses. Why aren't you after their asses as well?

God you need a father in your life, kid. A father to watch over you and learn to sue and break skulls for you cause jesus christ after that twitter whine ... you're still a pussy who refuses to BE A MAN AND SUE THE LIVING SHIT OUT OF ELEVEN LABS FOR STEALING YOUR MODEL. Give Tara Strong a call too. Do you want me to do it for you?

Respectfully yours, the redpiller known as Vogelfag.

Anonymous 5/16/2025, 8:50:34 PM No.42197369 [Report] >>42197487

>>42197358
I uh... 15 maybe should've been a bit better at leading, but WOW this is kinda rough. But they say the truth hurts... wait, aren't we only operating under the ASSUMPTION that ElevenLabs stole his work though?

Anonymous 5/16/2025, 9:22:21 PM No.42197463 [Report]

oh boy the schizos are out now

Anonymous 5/16/2025, 9:25:50 PM No.42197474 [Report]

>>42197340
no ones reading that

Anonymous 5/16/2025, 9:33:36 PM No.42197487 [Report]

>>42197369
no one cares vogelfag

BGM 5/16/2025, 9:52:57 PM No.42197553 [Report] >>42197804 >>42197812 >>42197816 >>42198118 >>42198418

Celly Appears.jpg md5: dea7f11b...

>>42196867
https://u.pone.rs/HEiyutXb.mp3

Anonymous 5/16/2025, 10:07:03 PM No.42197606 [Report]

glim sexo.png md5: 633c7dbb...

>>42195922
Btw
https://voca.ro/14Y5dHWMbMpx

Anonymous 5/16/2025, 10:10:47 PM No.42197618 [Report] >>42197624

>>42197340
>>42197358
Your words are wasted on that idiot. 15. He was always a pretentious egomaniac and I'm glad the era where we didn't have any viable alternative is long gone. He's not even competing with the current opensauce options, let alone the paid ones.

Anonymous 5/16/2025, 10:11:59 PM No.42197624 [Report] >>42197645

>>42197618
what are the opensauce alternatives

Anonymous 5/16/2025, 10:15:51 PM No.42197645 [Report] >>42197653 >>42199403

>>42197624
https://github.com/effusiveperiscope/GPT-SoVITS

Anonymous 5/16/2025, 10:17:13 PM No.42197653 [Report]

>>42197645
isnt that what haysay uses but it doesnt sound as good as this though

Anonymous 5/16/2025, 11:01:17 PM No.42197804 [Report]

284448__suggestive_artist-colon-hotdiggedydemon_rainbow+dash_pegasus_pony_-dot-mov_shed-dot-mov_g4_animated_female_mare_pony-dot-mov_solo_swag_throbbing_throbb.gif md5: f5e13bf5...

>>42197553

Anonymous 5/16/2025, 11:06:56 PM No.42197812 [Report]

>>42197553
Holy fuck. Please make a full length version of this.

Anonymous 5/16/2025, 11:07:45 PM No.42197816 [Report]

rarewow.gif md5: 24be2b90...

>>42197553

Anonymous 5/17/2025, 12:40:20 AM No.42198118 [Report]

>>42197553
Incredible. please keep going.

Anonymous 5/17/2025, 2:05:11 AM No.42198418 [Report] >>42198427

>>42197553
Damn, am I going to have to help finish what I've started?

Anonymous 5/17/2025, 2:08:20 AM No.42198427 [Report]

>>42198418
Please, I’m begging you. Make more

Anonymous 5/17/2025, 3:10:36 AM No.42198611 [Report] >>42199123

>>42195922
Great to have you back, the new website looks fantastic.
Some notes after a few hours of testing (mainly with Rainbow and Twilight on happy and neutral):

I noticed that speech will often sound unnatural with a "rough" sort of sound, especially at the end of sentences. It's been taking a lot of re-rolls to get outputs that sound natural throughout. As ever I'm finding it very hard to articulate exactly why a lot of outputs sound off or spot trends. Been thinking about what exactly to say here for quite some time but I think it'll be more effective to just use the report feature on any examples I come across from now on. The voices generally sound very accurate to the ponies and there's already plenty of good examples ITT, so the potential is clearly there.

Things like the Twilight #3 on the example page are common issues with the "rough" sound - "aviation AH0 N", "fly AY1", "fat AE1" "ground AW1 N D".
Pretty sure this was an issue in previous versions of 15.ai, particularly the tendency to slip up at the end of sentences.

Short sentences (~three words or less), especially when generated on their own with nothing before or after, are consistently bad.

"Anon" is often pronounced wrong, tends to get split into either "A Non" or "An On" and is spoken with a little break between them like they're two separate words.

I'm tentatively thinking that reliance on reference lines from the show to control delivery, emotion, pacing etc in the output (I assume that's what the model is doing) may not actually be the best idea. It's great if the reference line that gets picked happens to match how you want the output to sound, but more often than not it won't and you'll be totally boned if there's no match at all. Even if there is a reference line that matches, you'll still need to take the time to find it or rely on RNG for it to be used.
I won't speculate any further on this for now since I don't know exactly how the reference lines influence the model. Would be good if you could fill in some blanks here.

Not yet found any bugs with the site, but I do have some feature requests:
1 - An option to automatically play new audio as soon as generation is complete.
2 - A button on the outputs to immediately regenerate with the same settings.
3 - Report function is useful, suggest also adding a thumbs up icon or similar to highlight when the model does well.
4 - Not sure if it's my browser, but the download button always opens the audio in a new tab where I then have to click the three dots icon to download. All those extra mouse clicks quickly add up.

Hope that's helpful, you're doing great work here.

HydrusBeta 5/17/2025, 3:41:08 AM No.42198701 [Report] >>42200231

>>42195922
Oh wow. Welcome back, 15! I am really happy to see you have a site back up, and the UI is slick.

>>42196337
>>42196341
Thank you for the kind words. I plan to keep Hay Say running. I am glad you have found it useful.

15 5/17/2025, 7:06:21 AM No.42199123 [Report]

>>42198611
>"Anon" is often pronounced wrong, tends to get split into either "A Non" or "An On" and is spoken with a little break between them like they're two separate words.
This was because the dictionary had an incorrect transcription for "anon"; this has been fixed. If you run into any similar problems like this, you can report a transcription by hovering over the colored box and clicking the report button.
>1 - An option to automatically play new audio as soon as generation is complete.
>2 - A button on the outputs to immediately regenerate with the same settings.
>3 - Report function is useful, suggest also adding a thumbs up icon or similar to highlight when the model does well.
>4 - Not sure if it's my browser, but the download button always opens the audio in a new tab where I then have to click the three dots icon to download. All those extra mouse clicks quickly add up.
Done.

Anonymous 5/17/2025, 10:07:09 AM No.42199403 [Report]

>>42197645
15, is the model just GPT-SoVITS, but fine tuned on MLP?

Anonymous 5/17/2025, 1:38:21 PM No.42199629 [Report]

6898659.jpg md5: 78156a47...

https://voca.ro/1mlZCjsv6tJ2
Dang, this is pretty good.

Anonymous 5/17/2025, 7:48:30 PM No.42200231 [Report] >>42200380

bread hd.jpg md5: dafed05d...

>>42198701
>haysay is down
I am this close to considering selling my kidney for a good gpu

HydrusBeta 5/17/2025, 9:00:10 PM No.42200380 [Report] >>42200397 >>42204296

>>42200231
What odd timing. Thanks for letting me know. The site should be back up now. The EC2 instance got in a weird state where it became unreachable again.

Anonymous 5/17/2025, 9:05:23 PM No.42200397 [Report] >>42200690

>>42200380
The amazon anti-brony lobby is getting stronger per day. btw what would be requirements for haysay if I would like to run locally in its full compactly?

HydrusBeta 5/17/2025, 10:50:24 PM No.42200690 [Report] >>42200707

>>42200397
Hay Say can run on most machines, but will be very slow on older hardware. I do not recommend running it on Apple silicon because it is very slow on that hardware (to the point that it's basically unusable). I recorded some benchmarks on several machines, which may give you a clue as to how long it will run on yours:
https://github.com/hydrusbeta/hay_say_ui?tab=readme-ov-file#testing-data--benchmarks
Having a GPU is not required.

HydrusBeta 5/17/2025, 10:53:04 PM No.42200707 [Report]

>>42200690
Oh, I forgot to mention that you need a LOT of hard drive space (about 100 GB now), and having at least 12 GB Ram is recommended.

Anonymous 5/18/2025, 1:50:23 AM No.42201300 [Report] >>42201855

Up.

Anonymous 5/18/2025, 7:36:21 AM No.42201855 [Report]

>>42201300

Anonymous 5/18/2025, 6:51:45 PM No.42202578 [Report]

>back to being dead
come on

Anonymous 5/19/2025, 3:12:45 AM No.42203845 [Report] >>42203862 >>42204138

1747616891426350.gif md5: 4f0806c4...

https://x.com/fifteenai/status/1924269599542968655

Anonymous 5/19/2025, 3:25:06 AM No.42203862 [Report]

tenor.gif md5: 68f57370...

>>42203845

Anonymous 5/19/2025, 6:12:26 AM No.42204138 [Report] >>42204139

1740018418195642.png md5: 3651523a...

>>42203845
>Discord server
Kek.

>I just added 4 more GPU servers because of the huge number of requests coming in. This is actually going to bankrupt me.
You know, you could just... open source it?
Then you wouldn't have to pay for any of it, you wouldn't be expected to constantly maintain it (this has been a recurring issue, let's be honest), and you would meet your original promises.

Anonymous 5/19/2025, 6:13:06 AM No.42204139 [Report] >>42204140

>>42204138
shut up retard

Anonymous 5/19/2025, 6:14:18 AM No.42204140 [Report] >>42204141

>>42204139
>t. 15

Anonymous 5/19/2025, 6:14:48 AM No.42204141 [Report]

>>42204140
shut up retard

Anonymous 5/19/2025, 7:18:39 AM No.42204296 [Report] >>42204476

1717711190370428.png md5: 6b3c4fad...

FYI, GPT-SoVITS v4 came out.
While v3 downgraded the quality, they boosted it back to 48KHz and it arguably sounds much more natural.
There's a good report here: https:// 8 chan.moe/ais/res/6258.html#q11121
>Ref: https://voca.ro/13vsNeBHC2Xu
>Best result I got from v4: https://voca.ro/1j2I5rUzAZxj
>Same example with v2 (the end was cut due to my shitty api): https://voca.ro/11qFHhR7HtG1
This is the only comparison I've heard so far though, seems like it was a very silently received release. Needs to be tested more.

>>42200380
If you could look into adding v4 to Haysay (assuming it does hold up with pony voices), that'd be much appreciated.

Anonymous 5/19/2025, 8:52:34 AM No.42204454 [Report] >>42204478

Capture.png md5: cffd9d6a...

>>42195922
You make a cute couple.

Anonymous 5/19/2025, 9:08:02 AM No.42204476 [Report]

1738995549982678.png md5: 60d11eec...

>>42204296
Trying it out.
Oh boy, new setting under SoVITS Training. Guess I'm leaving that at the default 32 for now.

Anonymous 5/19/2025, 9:09:47 AM No.42204478 [Report]

>>42204454
>hecking mare
>she/pony
He's just having a laugh, r-right?

Anonymous 5/19/2025, 9:51:12 AM No.42204529 [Report] >>42204536

Screenshot_20250519-035011.png md5: 48d2798c...

Inb4 they ban saying bad words with the ai

Anonymous 5/19/2025, 10:05:38 AM No.42204536 [Report] >>42206804

>>42204529
discord can ban over stuff like saying nigger iirc if people report it
I doubt any text restriction will be imposed but its understandable you dont want kids spamming nigger word in the discord

Anonymous 5/19/2025, 10:19:54 AM No.42204550 [Report]

>>42195922
thank you for your service, king

Anonymous 5/19/2025, 10:47:51 AM No.42204573 [Report]

>>42195922
Cool that you're back. Though its a bit odd that you say that 15.dev is provided only for non-commercial use, then license the outputs under CC BY-SA 4.0, which explicitly permits commercial use. Shouldn't outputs be licensed under CC BY-NC or BY-NC-SA instead, since it would be in line with your earlier statement that the site is to be used non commercially?

Anonymous 5/19/2025, 4:34:11 PM No.42204939 [Report]

>>42195922
ya taking on new voice dataset or only retraining the old ones?

Anonymous 5/19/2025, 5:19:00 PM No.42205033 [Report] >>42205228

https://huggingface.co/OuteAI/OuteTTS-1.0-0.6B

Anonymous 5/19/2025, 7:15:39 PM No.42205228 [Report]

>>42205033
Their twitter examples are bit meh sounding, im guessing the wow factor would came from the fact that it can work with 14 different languages. Would be really nice if I had a voice dataset from foreign dubbing and be able to use for english languages.

Anonymous 5/19/2025, 8:57:12 PM No.42205387 [Report]

If you still lurking Vul, thank you for making that sfx_sep_v2 filter for vocal remover, this stuff is so bloody helpful in prepping the audios.

Anonymous 5/20/2025, 1:21:09 AM No.42205939 [Report]

>>42195922
Holy shit, only noticed it now. I don't know what changed for the site to make a comeback, but it's nice to see it again.

Anonymous 5/20/2025, 1:29:53 AM No.42205963 [Report]

>>42195922
Did a bunch of work with Rarity today, mainly with happy emotion, and notably found that I tended to get better results when I turned the temperature way down, 0.2-0.4. Tried that with the rest of the Mane 6 but Rarity seemed to be the only one to significantly benefit, Twilight and Rainbow in particular still sound "rough" almost all the time no matter what I do.
Even so, Rarity's improvement is significant enough that I'd suggest everyone experiment with adjusting the temperature, there may be an optimal value for each character that I've not found yet.

Short inputs continue to be a problem, even short sentences that are part of a longer input - reported a bunch of instances of words being mispronounced, weirdly elongated and even skipped entirely.

Also had a few times where the page froze when I switched tabs to do other stuff while waiting for generations to complete.

Could you unlock the quality slider at least in the faster direction? I'm finding generation wait times to be the main bottleneck right now and would like to give that a try. Perhaps also allow larger batch size when faster quality options are selected too.

Anonymous 5/20/2025, 2:28:36 AM No.42206122 [Report] >>42206425

au_moondancer_goes_to_a_lot_of_cons_by_pfeffaroo.jpg md5: 3902075c...

>>42195922
>no more emotional contextualiser (the selections are a decent sidegrade I guess but come on it was much cooler)
>still using arpabet despite even resolving the IPA
>AI guesses what I want it to say if it's not in the dictionary instead of just phonemising the words because I know what I want it to say
why

>Moondancer
Bless you, sounds like shit tho

Anonymous 5/20/2025, 4:36:30 AM No.42206425 [Report] >>42207713

>>42206122
https://files.catbox.moe/tu4s0l.mp3

How's this?

Anonymous 5/20/2025, 8:41:07 AM No.42206804 [Report]

>>42204536
Fair, but I will never trust someone with a mental illness flag in the bio.

Anonymous 5/20/2025, 1:47:51 PM No.42207158 [Report]

clone trooper pony pixel.png md5: 2e0077d0...

Sup, got an sudden inspiration to get the voice from Clone Wars narrator trained. Not pony model but I feel like this could get some good use out it in the future anti clips.
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/Star_Wars_Clone_Wars_Narrator_v2
https://files.catbox.moe/bjljdm.mp3
Not 100% happy with it as the input needs to have that specific "umpf" energy to it.

>https://huggingface.co/Amo/GPT-SoVITS-v2/tree/main/Clone_Wars_Narrator_v2_so96_gpt24
Gpt-Sovits, wavs included.
https://vocaroo.com/1oycsmzwxgVy
https://vocaroo.com/12qbwj4NK8XP
https://vocaroo.com/1fBcauUi9ZIP
Due to pronunciation script some words sound pretty weird but nothing but little but of editing can't fix.

Anonymous 5/20/2025, 2:21:28 PM No.42207220 [Report] >>42207363 >>42310164

>>42195922
now all i need to do is figure out how to make ponies moan

Anonymous 5/20/2025, 2:41:57 PM No.42207243 [Report]

Open source that shit 15

Anonymous 5/20/2025, 4:22:50 PM No.42207363 [Report] >>42251484

>>42207220
One step ahead of you.

https://files.catbox.moe/7wktvb.mp3

All I did was enter "AAAAAAAAAAAAAAAA!" and the moaning just kinda happened.

Anonymous 5/20/2025, 5:12:57 PM No.42207459 [Report] >>42207472 >>42207477 >>42207518

IMG_8305.jpg md5: c79e0e91...

15 for the love of God find a volunteer to do your PR, you called a random Hasbro employee pathetic that is not something you should do if they are inquiring about your service despise how obnoxious the cocksucking corpo suits are. Being aggressive like that isn’t doing anyone any favors

Anonymous 5/20/2025, 5:15:01 PM No.42207472 [Report] >>42207479

>>42207459
All hasjew employees deserve and should be publicly mocked.

Anonymous 5/20/2025, 5:17:36 PM No.42207477 [Report] >>42207483

>>42207459
>you called a random Hasbro employee pathetic
are you retarded perhaps

Anonymous 5/20/2025, 5:18:44 PM No.42207479 [Report] >>42207484

>>42207472
Yeah I call them retarded niggers off the mic but when you’re face to face with them you shouldn’t let that go out.
Since 15 is a stemfag gook I wasn’t expecting diplomacy and social skills from him but this is actually crazy, no one cares about your inbox.

Anonymous 5/20/2025, 5:19:45 PM No.42207483 [Report] >>42207484 >>42207488

>>42207477
Even if that was a scammer like who the fuck cares nobody cares about your inbox nigga

Anonymous 5/20/2025, 5:20:17 PM No.42207484 [Report]

>>42207479
>>42207483
I care though, this is funny and based as fuck

Anonymous 5/20/2025, 5:20:53 PM No.42207488 [Report] >>42207493

>>42207483
Repeat 30 more times about how much you don't care.

Anonymous 5/20/2025, 5:22:32 PM No.42207493 [Report] >>42207525

>>42207488
Settle down 15 minion you have a sever to moderate

Anonymous 5/20/2025, 5:27:20 PM No.42207518 [Report] >>42207580

>>42207459
He wasn't even calling Hasbro employees pathetic though? It was some random guy trying to snitch by CC'ing all these people.

Anonymous 5/20/2025, 5:28:22 PM No.42207525 [Report] >>42207527

>>42207493
You're the retard sending the e-mail, got it.

Anonymous 5/20/2025, 5:29:20 PM No.42207527 [Report]

>>42207525
Finger pointing like that isn’t healthy tranny

Anonymous 5/20/2025, 5:56:12 PM No.42207576 [Report] >>42207705

https://x.com/UnslothAI/status/1924848135991656603

Anonymous 5/20/2025, 5:58:22 PM No.42207580 [Report]

>>42207518
This, wtf is anon talking about

Anonymous 5/20/2025, 7:02:07 PM No.42207705 [Report]

>>42207576
once again, it's all written like next breakthrough in technology but nobody is posting any examples at all, not even cheery picked ones.

Anonymous 5/20/2025, 7:07:41 PM No.42207713 [Report] >>42208791

>>42206425
Still bad, just compare to any actual Moondancer speaking. I'm not knowledgeable enough to describe exactly how it's wrong, but it's too deep and not "light" enough?

twiggles !!ofIYxlKABKS 5/21/2025, 12:52:39 AM No.42208577 [Report] >>42208651

it's been six fucking years, jesus christ. i still can't believe how big this project got

Anonymous 5/21/2025, 1:25:28 AM No.42208651 [Report] >>42208841 >>42208906

>>42208577
it was dead for a while but only recently started becoming alive again

Anonymous 5/21/2025, 2:28:26 AM No.42208791 [Report]

>>42207713
Ah, okay. I thought it was a matter of quality and not the voice itself. But you're right, it's not as light as her in the show...

Anonymous 5/21/2025, 2:42:39 AM No.42208841 [Report]

46280799.jpg md5: 3ad413f3...

>>42208651
https://www.youtube.com/watch?v=730zGRwbQuE
Indeed, its has been bumpy few years, yet in the end the infinite power of ponies will prevail all hardships.

Anonymous 5/21/2025, 3:06:51 AM No.42208906 [Report]

>>42208651
It's good to see it getting some steam. This is far too potent to let it fall to pieces.

Anonymous 5/21/2025, 4:24:12 AM No.42209071 [Report] >>42209113 >>42209433 >>42211348

Screenshot 2025-05-20 222315.png md5: bdac4e8f...

Someone on the server wanted to get 15 to censor the swear words from the site. Say it with me...FUCK no!

Anonymous 5/21/2025, 4:47:25 AM No.42209113 [Report]

Get Out.gif md5: fe31a27c...

>>42209071
>Hey everyone look at what some nobody said on my Discord!
No one here cares about social media drama. Keep it in Discord and out of here

Anonymous 5/21/2025, 8:35:00 AM No.42209433 [Report] >>42209975

>>42209071
This is why you don't cozy up to Discord groups. They'll try to corrupt you every time.

Anonymous 5/21/2025, 2:53:29 PM No.42209975 [Report]

>>42209433
True.

Anonymous 5/21/2025, 8:17:11 PM No.42210605 [Report] >>42211063 >>42211888

Up.

Anonymous 5/21/2025, 11:32:18 PM No.42211063 [Report]

>>42210605
aaaaaa!

Anonymous 5/22/2025, 2:03:54 AM No.42211348 [Report]

>>42209071
Gee, what a surprise.

Anonymous 5/22/2025, 7:30:49 AM No.42211888 [Report] >>42212568

>>42210605

Anonymous 5/22/2025, 8:41:54 AM No.42212018 [Report]

>https://files.catbox.moe/asxfuv.mp3

Anonymous 5/22/2025, 2:57:01 PM No.42212568 [Report] >>42221623

>>42211888

Anonymous 5/22/2025, 11:17:12 PM No.42213584 [Report] >>42214071

thisbitch.png md5: 7a50adb7...

close enough welcome back uberduck discord

Anonymous 5/23/2025, 2:53:32 AM No.42214071 [Report]

>>42213584
>uberfuck
No thanks.

Anonymous 5/23/2025, 4:01:39 AM No.42214224 [Report] >>42214240

>15 is back
>still dead
It's over

Anonymous 5/23/2025, 4:09:09 AM No.42214240 [Report] >>42214251 >>42214278 >>42214680

675242.png md5: 782ad178...

>>42214224
15.ai isn't really good enough to revive any interest after the novelty of making ponies say nigger wears off.

Anonymous 5/23/2025, 4:17:58 AM No.42214251 [Report]

>>42214240
ok goku

Anonymous 5/23/2025, 4:38:35 AM No.42214278 [Report]

TrixPosting.gif md5: dbb170ad...

>>42214240
https://u.pone.rs/OWiJmVGB.mp3

Anonymous 5/23/2025, 10:00:27 AM No.42214680 [Report]

>>42214240
>Dashcon
Comparing a literal scam to 15 is plain retarded.

ThunderShy 5/23/2025, 10:51:45 AM No.42214718 [Report] >>42215142

Hello fags made a new ai skit with 15.ai its good to be back
https://files.catbox.moe/29w2tt.mp4

Anonymous 5/23/2025, 5:05:18 PM No.42215142 [Report]

>>42214718
Comedy bros, were are you?

Anonymous 5/23/2025, 10:08:33 PM No.42215735 [Report] >>42217176

1376392147815.gif md5: 9c2bf088...

https://files.catbox.moe/aov4vh.mp3

Anonymous 5/24/2025, 6:43:23 AM No.42216763 [Report] >>42217954 >>42224989

LiminalTrixieSnipper_1_Temp_2.gif md5: 466685e7...

>15 service re-emerges
>Typing rapidly ensues
>Old prompt tricks still draw out the mysterious liminal echoes of the mare
These digital equines have the most fascinating voices

Compilation of Liminal Trixie sounds
https://files.catbox.moe/gznbmc.mp3
https://files.catbox.moe/spb6zv.mp4

Anonymous 5/24/2025, 10:37:20 AM No.42217176 [Report]

>>42215735
What was that quote? I can't remember where it came from.

Anonymous 5/24/2025, 8:27:36 PM No.42217954 [Report]

>>42216763
moonbase trixie

Anonymous 5/24/2025, 9:14:38 PM No.42218072 [Report] >>42218629 >>42218755 >>42219570

I need more lewd moans. Gasps, sighs, groans, chirps, murmurs, mewlings, etc.

Anonymous 5/25/2025, 12:40:17 AM No.42218629 [Report]

>>42218072
I have an audio pack with random moans, give me few minutes to upload it

Anonymous 5/25/2025, 1:19:09 AM No.42218755 [Report] >>42219017 >>42256762 >>42269579

TrixBotVoicingFried.png md5: 223c93b2...

>>42218072
NTA but here's a couple more Liminal Trixie noises.
A few grunts, laughs, even some coughs and various others.
https://files.catbox.moe/q8g80w.mp3

Anonymous 5/25/2025, 3:10:41 AM No.42219017 [Report]

>>42218755
I'm surprised no one has done something with that.

ThunderShy 5/25/2025, 4:11:15 AM No.42219092 [Report]

@hydrusbeta, what happaned to the synth app its not working and could it be possible if you can add a direct link to it on the haysay website

Anonymous 5/25/2025, 9:28:43 AM No.42219523 [Report] >>42219530

>Servers down
>twitter account gone
Permission to panic, sir?

Anonymous 5/25/2025, 9:33:47 AM No.42219530 [Report] >>42221117

>>42219523
False alarm, twitter was just fucking itself up again.

Anonymous 5/25/2025, 10:04:16 AM No.42219570 [Report]

me and my rule 63 selfs.png md5: 30403969...

>>42218072
https://u.pone.rs/PRpOFwQp.001
>SpecialPacks_.zip.001
https://u.pone.rs/vyoSUmbo.002
>SpecialPacks_.zip.002
https://u.pone.rs/WFVSxEXw.003
>SpecialPacks_.zip.003
https://u.pone.rs/mGedaJTp.004
>SpecialPacks_.zip.004
https://u.pone.rs/uGmobBkJ.005
>SpecialPacks_.zip.005

Rename the download files to the below quoted filenames. It's 2.27GB mix of variety sounds from ASMR, hentai games and some other gooning sources. Do use the RVC to make them pony related.

VilligerANON 5/25/2025, 7:33:38 PM No.42220345 [Report] >>42220536 >>42220864

I'm preparing to train MLP models with GPT-soVITS v4
Which mare should I start with?

>Yes, I'll add the precomputed values from Haysay, once I make the WebUI.

Anonymous 5/25/2025, 9:10:43 PM No.42220536 [Report] >>42221644

Feral, Pony, Applejack, Queen Chrysalis, hoof wrestling, holding hooves, duo, fi s-2395569027.png md5: c6d7bc3f...

>>42220345
Applejack is a good baseline to test out accent retention and character similarity. Otherwise, testing more unique voices like Queen Chrysalis would better determine how well the model replicates the intended voice without falling back too much on similar but generic voices.

Anonymous 5/26/2025, 12:04:44 AM No.42220864 [Report]

>>42220345
I'd be curious to know what effect the LoRA Rank has on the models, and which one is ideal for what datasets.

Anonymous 5/26/2025, 2:27:38 AM No.42221117 [Report]

>>42219530
Phew.

Anonymous 5/26/2025, 7:56:47 AM No.42221623 [Report]

>>42212568

VilligerANON 5/26/2025, 8:09:54 AM No.42221644 [Report]

>>42220536
What pretrained English model was finetuned on?

Anonymous 5/26/2025, 1:46:59 PM No.42222130 [Report] >>42223953

>10

Anonymous 5/26/2025, 4:50:19 PM No.42222460 [Report] >>42222472 >>42224830

LyraBooger.png md5: c6fef402...

>Prompts various commas and apostrophes to get hidden mare noises.
>Lyra: "Ew, I think it's some sorta booger or something"
Wow, these mares have some fascinating interpretations.

>https://files.catbox.moe/whb1r5.mp4
>https://files.catbox.moe/xqvrlq.wav

Anonymous 5/26/2025, 4:59:49 PM No.42222472 [Report]

>>42222460
The interface is really stylish.

Anonymous 5/26/2025, 6:52:43 PM No.42222730 [Report]

>https://huggingface.co/Amo/GPT-SoVITS-v2/blob/main/TreeHugger_so96_gpt24/wavs.zip
>This file is vulnerable to threat(s) PAIT-ARV-100.
Could somebody with good quality antivirus scan this zip and files inside of it? it's probably a false positive but I want to be sure this wouldn't mess with my pc.

Anonymous 5/26/2025, 10:17:50 PM No.42223265 [Report] >>42223566 >>42224786 >>42261401

https://unmute.sh/

Found this, apparently they're gonna open source the text and speech models soon, but for now, you can supply a ten second voice clip of anyone you want to speak with them in a variety of topics.

Anonymous 5/27/2025, 12:10:11 AM No.42223566 [Report]

>IMS Toucan - tts 7000 Languages
>https://github.com/DigitalPhonetics/IMS-Toucan
>https://huggingface.co/spaces/Flux9665/MassivelyMultilingualTTS
I think this was posted few years back, I've noticed they had update on huggingpage about two weeks ago, after few minutes of testing, it seems to be working, however while the quality of voices is above MS Sam and the noisy talknets, the way tts is talking still feels very artificial.
The voice cloning option seems to be broken so that's sucks, however by the fact that it is able to generate voices at light speed and even has build in options for CPU usage means that it could be run on a potato tier equipment without problems.
So, its not something useful for now, but there is always possibility somebody else could take it and improve it (imagine Flutershy teaching you how to speak moonrunes).
>>42223265
Thank you for sharing that Anom, and also holy fuck, this is working like pure magic, I just given them a 9s of audio clip of really low quality clip ripped from a game and it was able to replicate it without the shitty de-reverb pollution and background buzzing noise AND keeping the accent consistent. And on top of that I was able to double the amount of voice lines this character had ever spoken, so thats a massive plus on making artificial datasets.
>apparently they're gonna open source the text and speech models soon
With this kind of tech there wouldn't be a need for training full models for the bare bones TTS can be done with 10s clips and less than 5m of waiting for the voice to be clone. Man, I remember way back in mid 2020 when people talk about this tech and pretty much everybody agreed that cloning voices with 10s of audio will never sound natural or even good, how times have changed.

Anonymous 5/27/2025, 2:29:42 AM No.42223953 [Report] >>42224543

>>42222130

Anonymous 5/27/2025, 7:49:33 AM No.42224543 [Report]

>>42223953

Anonymous 5/27/2025, 10:58:37 AM No.42224786 [Report]

>>42223265
I tried to see if it could recreate voice from 3s of Woona voice but sadly that was a no-go (Ive even try duplicating the voice to fill it out to 10s clips), im guessing the high pitch levels of distress is messing with their process or they do need minimum 6s of audio to be able to work out how to duplicate it.

Anonymous 5/27/2025, 11:33:22 AM No.42224830 [Report] >>42224962

>>42222460
This is what I've got instead. I really dig the giggle in the first one.
https://vocaroo.com/154R3gQLRpG1
https://vocaroo.com/1eOcqD52A2pm

Anonymous 5/27/2025, 2:25:52 PM No.42224962 [Report]

>>42224830
>https://files.catbox.moe/etzhiu.mp4
>Chrysalis: "(forceful exhales x3), We should take the magic inside it. You know how powerful Discord was."
Guess with limited-to-no other speech input, it does fall back a lot on the Reference Text as seen in the Advanced Model Details. No wonder so many Trixie attempts had her mumbling about a good night's sleep. Less random than initially suspected.

I wonder how the model would behave if we were able to remove or modify the underlying quote(s) during synthesis, though I'm sure it's likely integral to retaining its accuracy. Come to think of it though, it would be nice to be able to select specifically what underlying reference line it's using prior to generation so that you have more chances of getting a desirable output similar to it. Could mean less resource usage too.

Anonymous 5/27/2025, 3:03:00 PM No.42224989 [Report] >>42225760

>>42216763
what tricks did you use?

Anonymous 5/27/2025, 4:04:47 PM No.42225053 [Report] >>42225760

>https://github.com/PasiKoodaa/ACE-Step-RADIO
I've stumbled upon above github project, it uses the Ace Step music model to create a constant stream of ai music to replicate what online radio websites do, the requirements for it are 16GB Vram. The outputs are still on the so-so level, but given the text to song models are only about year old there is plenty of space for improvements. Also I would love to see a setup were these models sing with proper poni voices from the get go (or with the help from loras).

Anonymous 5/27/2025, 6:29:38 PM No.42225301 [Report] >>42225760

>Stable Audio Open Small
>Weights: https://huggingface.co/stabilityai/stable-audio-open-small
>Paper: https://arxiv.org/abs/2505.08175
>Arm learning path: https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/run-stable-audio-open-small-with-lite-rt
Huh, a model that's only around 2GB? Nice to see them notice that not everybody have a endless bag of cash to spend on newest and larges GPU. Sadly it still only outputs instrumental at lower-tier quality (at least in comparison to what's already out there).
Apparently it can run 30% faster than realtime.

Anonymous 5/27/2025, 10:48:45 PM No.42225760 [Report]

TwiggyLewdMareSounds.png md5: 574a3a4c...

>>42224989
Mostly the aforementioned ,',' trick, which in older pre- "dev" versions of 15 used to be able to do a lot more lewd noises and such. Used to have a text doc with a handful of other tricks used with it, but it must be on one of my older OS drives. Still serves to force further areas of silence, which in turn can allow hallucinations and other AI weirdness to creep in on purpose.
>>42225053
>16GBs Vram
Still seems out of the memory budget of most anons, Unless it could be optimized to be at least half that with minimal loss. Even if it were finetuned on mares, without optimization I can't imagine many being able to utilize it for synthesis.
>>42225301
>Very small model
>Lower quality
To be expected I suppose, but at least it's something usable for local synthesis and playing around with, aside from maybe Bark; which I should honestly revisit. Just a shame they completely abandoned it after becoming monetized in the form of Suno. Still open source like Stable Audio is however.

Anonymous 5/28/2025, 1:58:06 AM No.42226198 [Report] >>42227813

Up.

Anonymous 5/28/2025, 8:13:51 AM No.42226927 [Report]

mares

Anonymous 5/28/2025, 11:56:43 AM No.42227190 [Report]

LewdCyberTwi.png md5: bcb5be07...

rears

Anonymous 5/28/2025, 3:06:12 PM No.42227393 [Report]

3419373.png md5: 5b5b3973...

https://u.pone.rs/pBgJHLQr.wav

Anonymous 5/28/2025, 7:37:50 PM No.42227813 [Report] >>42229510

>>42226198

Anonymous 5/28/2025, 10:00:03 PM No.42228108 [Report] >>42228215 >>42228246

Claims to do sota zero shot cloning with tts with powerful control
https://github.com/resemble-ai/chatterbox

Anonymous 5/28/2025, 10:55:42 PM No.42228215 [Report]

>>42228108
From a 20s voice sample: https://litter.catbox.moe/w54fxs.wav

Anonymous 5/28/2025, 11:08:14 PM No.42228246 [Report]

>>42228108
I've tested with few voices, it seems to be able to run some without any problems but totally struggle with others (seems to depend on how accent/pronunciation deviate from standard way of speaking). Sadly I confirmed that this model is also unable to clone Woona voice.

Anonymous 5/29/2025, 9:37:20 AM No.42229510 [Report] >>42232233

>>42227813

Anonymous 5/29/2025, 2:17:11 PM No.42229871 [Report] >>42230283

Music Source Restoration
https://arxiv.org/abs/2505.21827
>We introduce Music Source Restoration (MSR), a novel task addressing the gap between idealized source separation and real-world music production. Current Music Source Separation (MSS) approaches assume mixtures are simple sums of sources, ignoring signal degradations employed during music production like equalization, compression, and reverb. MSR models mixtures as degraded sums of individually degraded sources, with the goal of recovering original, undegraded signals. Due to the lack of data for MSR, we present RawStems, a dataset annotation of 578 songs with unprocessed source signals organized into 8 primary and 17 secondary instrument groups, totaling 354.13 hours. To the best of our knowledge, RawStems is the first dataset that contains unprocessed music stems with hierarchical categories. We consider spectral filtering, dynamic range compression, harmonic distortion, reverb and lossy codec as possible degradations, and establish U-Former as a baseline method, demonstrating the feasibility of MSR on our dataset. We release the RawStems dataset annotations, degradation simulation pipeline, training code and pre-trained models to be publicly available.
https://github.com/yongyizang/music_source_restoration
https://huggingface.co/datasets/yongyizang/RawStems
https://huggingface.co/yongyizang/MSR_UFormers
Github repo isn't live yet. might be cool for audio stuff

Anonymous 5/29/2025, 5:52:02 PM No.42230283 [Report]

>>42229871
This could be pretty useful in combination with the ACE Step song convector, if a song can have both vocals separated as well as instrumentals separated into their own track I would imagine that would help modifying it into a different style of music.
At the very least it would be nice to use it to fix the weird effects that vocal removing programs are imprinting on the instrumental files.

Anonymous 5/30/2025, 12:47:11 AM No.42231332 [Report] >>42232530

ten

Anonymous 5/30/2025, 7:44:17 AM No.42232233 [Report]

>>42229510

Anonymous 5/30/2025, 12:19:45 PM No.42232530 [Report] >>42236397

>>42231332

Anonymous 5/30/2025, 6:46:58 PM No.42233126 [Report]

stupid 1728589750923813.png md5: 91e8fb35...

>https://u.pone.rs/beZAfsQC.mp3
motivational Trixie

Anonymous 5/31/2025, 6:17:19 AM No.42234667 [Report] >>42235711

Saved

Anonymous 5/31/2025, 6:04:11 PM No.42235711 [Report]

>>42234667
Precautionary bump.

Anonymous 5/31/2025, 11:35:08 PM No.42236397 [Report] >>42237262

>>42232530

Anonymous 6/1/2025, 7:15:12 AM No.42237262 [Report] >>42238294

>>42236397

Anonymous 6/1/2025, 6:08:21 PM No.42238294 [Report] >>42238306

Again.png md5: f2b43efe...

>>42237262

Anonymous 6/1/2025, 6:12:03 PM No.42238306 [Report] >>42239361

SNIFF 3.gif md5: 7ac816ff...

>>42238294

Anonymous 6/2/2025, 1:37:48 AM No.42239361 [Report]

>>42238306

Anonymous 6/2/2025, 2:06:26 AM No.42239410 [Report] >>42240013 >>42240318

SpikeWoo.gif md5: 13af6b26...

Well, the twelve hours after 15 returned was fun I guess. Now back to this bullshit.

Anonymous 6/2/2025, 6:43:46 AM No.42240013 [Report]

>>42239410
He's gunna hurl if he keeps that up.

Anonymous 6/2/2025, 10:23:30 AM No.42240318 [Report] >>42240322

>>42239410
Which one?

Anonymous 6/2/2025, 10:24:46 AM No.42240322 [Report] >>42240779

>>42240318
The bumping kind.

Anonymous 6/2/2025, 3:51:40 PM No.42240779 [Report] >>42241436

>>42240322
The bumping loyal

Anonymous 6/2/2025, 8:35:33 PM No.42241436 [Report] >>42243302

>>42240779
Let me bump the thread of my people.

Anonymous 6/3/2025, 2:37:54 PM No.42243302 [Report] >>42245664 >>42248445

Pony, my little pony, female, cute, original character, OC, fan character, _Bump s-4209552926.png md5: 113d0de8...

>>42241436
>>42241979

Anonymous 6/3/2025, 8:45:51 PM No.42243816 [Report] >>42244249 >>42244607 >>42247951

https://openaudio.com/blogs/s1
The .5b mini version will be open sourced

Anonymous 6/4/2025, 12:07:45 AM No.42244249 [Report] >>42244607

>>42243816
Hmm, would be nice if there was a demo WITHOUT music so I assume they put it in to hide the lower quality. But with .5B size this thing should technically be able to run in a phone sized environment, so that's neat.

Anonymous 6/4/2025, 2:45:30 AM No.42244607 [Report]

>>42243816
>>42244249
Neat indeed, but it's a shame they don't have any audio examples of either version (on that page at least). Hard to really get a feel of it when there's nothing to gauge or judge.

Anonymous 6/4/2025, 2:39:14 PM No.42245664 [Report]

>>42243302
Indeed.

Anonymous 6/5/2025, 6:02:44 AM No.42247180 [Report] >>42247653

Scootaloo Scoot-Scootaloo.

Anonymous 6/5/2025, 12:53:33 PM No.42247653 [Report]

>>42247180
Someone said chicken?

Anonymous 6/5/2025, 5:17:38 PM No.42247951 [Report] >>42247952 >>42250902

>>42243816
https://huggingface.co/fishaudio/openaudio-s1-mini

Anonymous 6/5/2025, 5:18:38 PM No.42247952 [Report] >>42248049

>>42247951
OpenAudio S1 supports a variety of emotional, tone, and special markers to enhance speech synthesis:

1. Emotional markers: (angry) (sad) (disdainful) (excited) (surprised) (satisfied) (unhappy) (anxious) (hysterical) (delighted) (scared) (worried) (indifferent) (upset) (impatient) (nervous) (guilty) (scornful) (frustrated) (depressed) (panicked) (furious) (empathetic) (embarrassed) (reluctant) (disgusted) (keen) (moved) (proud) (relaxed) (grateful) (confident) (interested) (curious) (confused) (joyful) (disapproving) (negative) (denying) (astonished) (serious) (sarcastic) (conciliative) (comforting) (sincere) (sneering) (hesitating) (yielding) (painful) (awkward) (amused)

2. Tone markers: (in a hurry tone) (shouting) (screaming) (whispering) (soft tone)

3. Special markers: (laughing) (chuckling) (sobbing) (crying loudly) (sighing) (panting) (groaning) (crowd laughing) (background laughter) (audience laughing)

Anonymous 6/5/2025, 6:34:34 PM No.42248049 [Report]

>>42247952
>Emotional markers
Interesting, hopefully there will be a decent UI and training for it

Anonymous 6/5/2025, 9:52:04 PM No.42248445 [Report]

ArtificialBumpMare_ce2_123.png md5: 34de9c48...

>>42243302

Anonymous 6/6/2025, 12:46:16 AM No.42248854 [Report] >>42249714

Bump.png md5: 1713a6f0...

Anonymous 6/6/2025, 4:35:28 AM No.42249353 [Report] >>42250054 >>42253016

ArtificialBumpMare_me_125.png md5: 8430a921...

Anonymous 6/6/2025, 8:37:14 AM No.42249714 [Report]

>>42248854
>bump rump
Would pump.

Anonymous 6/6/2025, 12:50:13 PM No.42250054 [Report]

>>42249353
Pretty bump mare. Totally would.

Anonymous 6/6/2025, 1:30:18 PM No.42250109 [Report] >>42250437 >>42250547 >>42251801 >>42253112 >>42254373

>15 crawls back to bait patreon donos with his half-baked model where most emotion choices result in unintelligable noise
>11 releases a new alpha that wipes the floor with his crusty garbage less than a month later
https://elevenlabs.io/v3
holy fucking kek! maybe there is a god.

Anonymous 6/6/2025, 5:05:47 PM No.42250437 [Report]

>>42250109
yeah but unlike fifteen, eleven labs cost money

Anonymous 6/6/2025, 6:12:01 PM No.42250547 [Report] >>42250592

>>42250109
? elevenlabs doesn't have ponies, how is this a comparison

Anonymous 6/6/2025, 6:17:19 PM No.42250558 [Report]

Remember not to give goku the attention he wants

Anonymous 6/6/2025, 6:28:00 PM No.42250592 [Report] >>42250593

>>42250547
you have to train your own models on there you retard mcspazatron

Anonymous 6/6/2025, 6:28:24 PM No.42250593 [Report]

>>42250592
yeah is it any good though, last I tried to train ponies it wasn't very good

Anonymous 6/6/2025, 8:41:50 PM No.42250902 [Report] >>42260216

>>42247951
Anybody had a chance testing this thing out? Due to bullshit reasons I'm kind of stuck phone posting but I do want to know if it's any good.

Anonymous 6/6/2025, 8:57:52 PM No.42250941 [Report] >>42251141

https://github.com/RVC-Boss/GPT-SoVITS/releases/tag/20250606v2pro
https://github.com/RVC-Boss/GPT-SoVITS/wiki/GPT%E2%80%90SoVITS%E2%80%90features-(%E5%90%84%E7%89%88%E6%9C%AC%E7%89%B9%E6%80%A7)

Anonymous 6/6/2025, 10:21:40 PM No.42251141 [Report]

>>42250941
>for 50 nvidia series
so wait, the new models is for 50s exclusive or just optimized for the use on that hardware?

Anonymous 6/6/2025, 11:56:10 PM No.42251484 [Report]

>>42207363
I tried that and all it did was make Rarity do pokemon noises.
https://files.catbox.moe/1ryvaz.wav
https://files.catbox.moe/qivs4r.wav
also somethimes the AI interpretation (wish we could turn that off) says "Triple A" https://files.catbox.moe/72xgzt.wav

Anonymous 6/7/2025, 1:39:10 AM No.42251801 [Report]

>>42250109
>elevenfags
Miss me with that shit.

Anonymous 6/7/2025, 6:13:28 AM No.42252543 [Report]

ArtificialBumpMare_111.png md5: 2216c37c...

Anonymous 6/7/2025, 7:09:37 AM No.42252595 [Report] >>42253243

the one.jpg md5: 724b017f...

I found some free audio processing plugins, I'll be loading these in (((audacity))) to auto-process my dataset. I haven't tried it yet, but it seems promising, like a publicly released version of izotope:
https://archive.org/details/accusonus-era-bundle-v-6.2.00
They made it public before going out of business. I might reply the anchor if it gives a good result.

Anonymous 6/7/2025, 11:51:17 AM No.42253016 [Report]

>>42249353

Anonymous 6/7/2025, 12:58:06 PM No.42253112 [Report]

>>42250109
I wonder (((who))) could be behind this post.

Anonymous 6/7/2025, 2:29:42 PM No.42253243 [Report] >>42287174

>>42252595
Interesting, could you post some examples here?

Anonymous 6/7/2025, 10:11:36 PM No.42254319 [Report]

Mares?

Anonymous 6/7/2025, 10:38:01 PM No.42254373 [Report] >>42254415

>>42250109
gptsovits wipes the floor with 15 shitty model already, no need to bring the big guns

Anonymous 6/7/2025, 11:03:14 PM No.42254415 [Report] >>42254417

>>42254373
stop samefagging, your broken english is too noticeable at this point

Anonymous 6/7/2025, 11:04:46 PM No.42254417 [Report] >>42254418

>>42254415
You wish I was samefagging retard

Anonymous 6/7/2025, 11:05:04 PM No.42254418 [Report]

>>42254417
hahahah

Anonymous 6/8/2025, 5:37:30 AM No.42255207 [Report]

Electric mares?

Anonymous 6/8/2025, 6:19:11 AM No.42255248 [Report]

43/64 on pl_marewater

Anonymous 6/8/2025, 10:08:16 PM No.42256668 [Report] >>42256717

For characters with lots of voice lines like Spike and Twilight, if I'm using my own voice, what's the best option to choose on Haysay to sound good?

Anonymous 6/8/2025, 10:27:51 PM No.42256717 [Report] >>42256739

>>42256668
RVC is the current gold standard as far as Haysay goes for speech-to-speech.

Anonymous 6/8/2025, 10:37:41 PM No.42256739 [Report]

>>42256717
It's not quite getting the intended result. Should I set voice envelope high or low? https://voca.ro/1iHl7ZMvk5Qm

Anonymous 6/8/2025, 10:52:17 PM No.42256762 [Report] >>42256785 >>42256785 >>42256871

>>42218755
What settings did you use here? Sounds pretty good.

Anonymous 6/8/2025, 11:02:10 PM No.42256785 [Report] >>42256871

>>42256762
If you're trying to get non-vocals out of the voice-to-voice, it's not gonna work great.
>>42256762
Those were generated with 15.ai, probably the best option if you don't need voice to voice functionality and just want lewd pony noises.

Anonymous 6/8/2025, 11:46:17 PM No.42256871 [Report] >>42257202

Liminal Mare Code.png md5: 109e2ee8...

>>42256762
>>42256785
Mostly default settings. Varying the temperature occasionally. Liminal mares also make all sorts of noises, not just lewd. I can easily imagine them being used as vocal SFX for pony videogames or something — maybe an episode or animation like a mare drips onto the ground and the grunt is entirely synthetic and not a recycles audio from the show.

https://files.catbox.moe/7rx7zi.mp3

Anonymous 6/9/2025, 2:50:49 AM No.42257202 [Report] >>42257362 >>42257426 >>42257521

>>42256871
>https://files.catbox.moe/7rx7zi.mp3
These sound like Trixie is doing Link moves.

Anonymous 6/9/2025, 4:28:57 AM No.42257362 [Report] >>42257428

>>42257202
Abstract mare sounds are abstract. Sadly Rvc is still the king of getting quality lewd sounds, but I still wish we had a nice tts alternative.

Anonymous 6/9/2025, 5:08:28 AM No.42257426 [Report] >>42257521

>>42257202
Huh, yeah, this really make me want to work on my 3d modelling again... although Godot's 3D capabilities are not great still.

Anonymous 6/9/2025, 5:09:50 AM No.42257428 [Report] >>42257432

>>42257362
Is there a place I can upload multiple audio files for easy playback? I wanted to show off what I managed with the TTS on haysay.

Anonymous 6/9/2025, 5:12:47 AM No.42257432 [Report] >>42257437

>>42257428
pone.rs

Anonymous 6/9/2025, 5:14:23 AM No.42257437 [Report] >>42257521

>>42257432
Thanks. Too bad it doesn't stream playback....

https://u.pone.rs/reZpBwHV.wav (Twilight)
https://u.pone.rs/cBNqloOa.flac (Spike)

Anonymous 6/9/2025, 6:12:23 AM No.42257521 [Report] >>42258274

TrixieHyut.png md5: ae0db033...

>>42257202
Could totally imagine a game with Trixie acting as the hero of Hyrule.
>>42257426
Damn, haven't heard Godot in a hot minute. I really need to find time and motivation to actually get into that myself. Keep telling myself that though. Sadly free time and hobbies don't pay bills.
>>42257437
>doesn't stream playback
You mean like, play it in a browser? Because usually mp3 is supported in that way.

Anonymous 6/9/2025, 11:41:43 AM No.42257919 [Report]

Up.

Anonymous 6/9/2025, 4:47:41 PM No.42258274 [Report] >>42258298

>>42257521
Yeah, I know what you mean, though I'd say getting those skills can be valuable. Personally, I wish I didn't mentally check out of a tutorial after like 30 minutes because most of them need a good hour or more to really get into the meat of it, and even taking notes, it feels like I'm not retaining it well.

Anonymous 6/9/2025, 5:03:29 PM No.42258298 [Report] >>42258349

>>42258274
I would recommend the YT channel TheRoyalSkies, all his video (with some rare exceptions) are between one to five minutes long, always getting to the point instead of flapping about some bullshit and settings. The only downside is they are usually aimed at people who already have little bit above total 0xp noobie beginners but it's still good stuff.

Anonymous 6/9/2025, 5:24:43 PM No.42258349 [Report] >>42258805

>>42258298
Oh, they have Cascadeur videos. I was wondering if that was usable with quadrapeds too...

Anonymous 6/9/2025, 9:15:49 PM No.42258805 [Report]

>>42258349
never used that addon/function, but I would imagine anything that is not a humanoid with standard two arms and legs will require lots of custom rigging.

Anonymous 6/10/2025, 9:00:12 AM No.42260216 [Report] >>42260489

>>42250902
Thread tourist here, it's breddy gud for being local. I've been running it on a 3060 with no issue, takes about twice as long as real time but the 44.1kHz fidelity is incredible. Also the voice cloning accepts up to 90 seconds of input, with possibly more but I have yet to test that.
My main criticism is that for longer gens upward of a minute or more, the voice gets kinda washed out in a way, but you can easily circumvent that by just splitting your text into chunks.
Here's some examples I genned:
Cum Zone guy quoting Ozymandias (my favorite gen, nearly indistinguishable from real VA) https://vocaroo.com/1ngXhfejJwoB
Gilbert Gottfried navy seals (you can hear the voice getting washed out towards the end) https://vocaroo.com/1n6SZbrHzKZ1
Michael Rosen pulp fiction (it can mispronounce capitalized words, storage is pronounced as sturgeon) https://vocaroo.com/1ov76WqTjIUY
I'd say it's elevenlabs-tier, even if that comparison is now outdated because of their new model.

Anonymous 6/10/2025, 2:00:15 PM No.42260489 [Report] >>42261547

>>42260216
for a zero shot model it's surprisingly decent. In their GitHub, do they provide a UI with emotional control or is it just bare minimum of "audio reference in, tts out"?

Anonymous 6/10/2025, 6:53:57 PM No.42260920 [Report]

https://github.com/fluxions-ai/vui
https://huggingface.co/fluxions/vui
has voice cloning ability
>You can clone with the base model quite well but it's not perfect as hasn't seen that much audio / wasn't trained for long

Anonymous 6/10/2025, 9:03:47 PM No.42261160 [Report] >>42261401 >>42262716 >>42264548

d9tnkeekgos71.jpg md5: 3c6e8674...

What's the best tts for mares? I know elevenlabs is the best overall but I'm wondering how good it is for ponies

Anonymous 6/10/2025, 11:49:15 PM No.42261401 [Report]

>>42261160
For locally operation, it's still the gpt-sovits. I don't use paid online services so lmao on that one.
>>42223265
But I guess this one could beat it, once they make it public. Having their tts model running tts integrated with Silly Tavern would honestly kick some serious ass.

Anonymous 6/11/2025, 1:13:52 AM No.42261547 [Report] >>42261622

file.png md5: 317a28c6...

>>42260489
There's emotion control to a degree, you just put one of the tags in parentheses at the start. There's only a limited amount of valid tags and it can only go so far, and I haven't personally been able to use multiple in a single gen since it just says the word but YMMV

Anonymous 6/11/2025, 2:03:45 AM No.42261622 [Report] >>42261832 >>42261949

>>42261547
>only one emotional tag control
oh, this sucks donkey balls, I was hopping we could finally have a model that can make a advanced sentence styles eg whispering with mix of anger and confusion.

Anonymous 6/11/2025, 4:03:30 AM No.42261832 [Report] >>42261949

>>42261622
Yeah, honestly sounds like a convoluted way to say they have multiple individual models compounded, each trained on one particular emotion and uses the parentheses determine which underlying model it uses for synthesis.

Anonymous 6/11/2025, 4:58:02 AM No.42261949 [Report]

>>42261622
>>42261832
Well like I said, your mileage may vary. I haven't been experimenting with it nearly as much as I should, and it could very well support that. I saw an example somewhere else of Pearl from SU reading the best thing about meatballs meme and the voice there was pretty varied emotionally and realistic. To be fair, they might have been using the full model which is only available through their website, but I wouldn't knock it before trying it on the smaller model. Using my GPU for other purposes at the moment so someone else will have to test.

Anonymous 6/11/2025, 10:55:35 AM No.42262326 [Report] >>42263067

ArtificialBumpMare_123.png md5: d282192d...

Anonymous 6/11/2025, 5:30:57 PM No.42262716 [Report] >>42262734

9bf5881b0384b11f9b64140f99bc0801.jpg md5: e2c8d9f0...

>>42261160
Is there some kind of library with voice clips I can use to make pony models in ElevenLabs?

Anonymous 6/11/2025, 5:37:58 PM No.42262734 [Report] >>42272674

>>42262716
megas links in OP?

Anonymous 6/11/2025, 8:21:54 PM No.42263067 [Report]

>>42262326
Cute bump mare.

Anonymous 6/12/2025, 4:22:12 AM No.42264147 [Report]

ArtificialBumpMare_ce_124.png md5: a0fcf5a4...

>10

Anonymous 6/12/2025, 9:25:44 AM No.42264531 [Report]

>slow night bump

Anonymous 6/12/2025, 9:48:48 AM No.42264548 [Report] >>42265214

>>42261160
https://15.dev/

Anonymous 6/12/2025, 12:21:49 PM No.42264821 [Report]

>>42196683
what???

Anonymous 6/12/2025, 1:53:40 PM No.42265013 [Report]

bump due to too much spam on the board

Anonymous 6/12/2025, 2:12:52 PM No.42265042 [Report] >>42265052

Is openaudio s1 the best thing right now? I copied random text from a mod page. The pronunciation is pretty good, although imo a little too neutral.

Anonymous 6/12/2025, 2:17:47 PM No.42265052 [Report]

>>42265042
Audio quality seems the best, pronunciation is really good as long as it's not a weird made up word.Emotions are pretty meh.
https://vocaroo.com/1l7fRlI0qtqn

Anonymous 6/12/2025, 3:43:55 PM No.42265214 [Report]

>>42264548
No trolls please

Anonymous 6/12/2025, 6:49:45 PM No.42265590 [Report]

https://x.com/elevenlabsio/status/1933188969279500459

Anonymous 6/12/2025, 9:48:32 PM No.42266061 [Report] >>42266620

preserved

Anonymous 6/13/2025, 12:55:58 AM No.42266620 [Report]

>>42266061

Anonymous 6/13/2025, 2:10:47 AM No.42266751 [Report]

preservation bump

Anonymous 6/13/2025, 5:52:16 AM No.42267158 [Report] >>42267570

ArtificialBumpMare_112.png md5: bb446d93...

Anonymous 6/13/2025, 12:14:01 PM No.42267570 [Report]

>>42267158

Anonymous 6/13/2025, 6:00:45 PM No.42268023 [Report]

>mared

Anonymous 6/14/2025, 1:05:28 AM No.42268924 [Report]

Up.

Anonymous 6/14/2025, 1:15:33 AM No.42268941 [Report] >>42268985

This is starting to get sad...

Anonymous 6/14/2025, 1:36:47 AM No.42268985 [Report]

>>42268941
I only have one gpu that's already too outdated for all this kind of technological novelty. I already had to throw away few ideas for song cover because random song leakage / dual vocals was fucking with conversion process.

Anonymous 6/14/2025, 7:30:57 AM No.42269579 [Report]

>>42218755
>pukes at the end

Anonymous 6/14/2025, 10:06:46 AM No.42269737 [Report] >>42269957

How do we save /ppp/?

Anonymous 6/14/2025, 2:06:02 PM No.42269957 [Report] >>42272090

breaking bad pony 1665957871920475.png md5: d59ca0ef...

>>42269737
There is only one thing we can do, we cook...I mean we make pony content. I was thinking of doing a "X pony makes a review about fics/books" in similar theme/feel of Rainbow Dash Presents.

Anonymous 6/14/2025, 2:42:29 PM No.42269988 [Report]

REDUB 7!!!!!!!!!!!!!!!!!

Anonymous 6/14/2025, 3:25:01 PM No.42270046 [Report]

With SparkTTS, voices can be cloned with even just a few seconds of audio. This allows the cloning of background characters like TwinkleShine. What I like to do is feed ai generated voices into elevenlabs in order to get a higher quality model. Love what you guys are doing!

Anonymous 6/14/2025, 7:14:37 PM No.42270522 [Report]

>bump

Anonymous 6/14/2025, 8:52:57 PM No.42270729 [Report] >>42271120 >>42271473

Anyone else here that thinks about the possibilities of AGI pretty consistently?
I don’t know exactly how much overlap there is between this corner of the fandom and technological singularity enthusiasts.

Anonymous 6/14/2025, 11:47:16 PM No.42271120 [Report]

itknows.jpg md5: 1fc2a938...

>>42270729
I'm always dreaming of Bicentennial Man level of AGI. Just another race of sentient beings but they're Robots! but I have no idea if we'd ever reach a singularity event or even if we do, what are the true possibilities?

Anonymous 6/15/2025, 2:10:28 AM No.42271473 [Report]

>>42270729
in my unprofessional opinion we don't have currently tech and materials to make something that would work as proper AGI, at best it will just more polished versions of LLM that will be so good at pretending to sound like people it will be next to impossible to distinguish them from people. I do think people in next century will make some new type of processors/programming/something else that could make the computers think and feel for real, but by that time the world and society will change so much there isn't even point in guessing how it would look like (just like trying to explain a caveman the wonders of tech from ancient roman empire).

Anonymous 6/15/2025, 8:23:43 AM No.42272090 [Report]

>>42269957
This. You must use the pone to save the pone

Anonymous 6/15/2025, 4:58:20 PM No.42272674 [Report] >>42272771 >>42272932 >>42275950

1840240.jpg md5: 178e0ffb...

>>42262734
I've tried to use the audio clips but my models sound like shit. Does anyone have some pre-made audio clips I can use for ElevenLabs that's worked well for them?

Anonymous 6/15/2025, 6:01:16 PM No.42272771 [Report]

>>42272674
>models sound like shit
so idea what script you are using but everyone and every company that has pony voice conversions/tts are using the exact same clips from PPP.
if you are using some new experimental cloning scripts, these will require the use of 10s clips, so if you give them just 3s clip the result will sound shit.

Anonymous 6/15/2025, 7:27:04 PM No.42272932 [Report] >>42273853

>>42272674
>ElevenLabs
>Models sound like shit
So nothing new then

Anonymous 6/15/2025, 11:17:12 PM No.42273482 [Report]

nein!

Anonymous 6/16/2025, 1:56:30 AM No.42273853 [Report] >>42274821

moondancer 1676304366778324.png md5: ae58df1c...

>>42272932
>https://u.pone.rs/LvFcybeH.mp3
surprise horsefuckers, I got some spare time and converted a song from my buddy to Moon Dancer vocals, enjoy.
OG song: https://suno.com/song/eae162d0-cbbb-433a-8008-5fab7bee01ba

Anonymous 6/16/2025, 8:33:20 AM No.42274484 [Report] >>42275868 >>42276957

Bump.

Anonymous 6/16/2025, 2:55:46 PM No.42274821 [Report]

>>42273853
Nice pop song.

Anonymous 6/16/2025, 4:46:02 PM No.42274933 [Report] >>42283796

1722816409565057.jpg md5: be35cafd...

>>41070370
Is there a chance anybody here has archived this before it was deleted?
>Background Pony - "OUT OF APPLES" - Hall 'n Oates - Out of Touch (MLP Applejack AI cover)
this was its title if it helps anybody find it

Anonymous 6/16/2025, 8:43:56 PM No.42275376 [Report]

>mare antispam bump

Anonymous 6/17/2025, 12:04:50 AM No.42275868 [Report]

>>42274484

Anonymous 6/17/2025, 12:42:41 AM No.42275950 [Report] >>42276071

>>42272674
ElevenLabs is shit. Just use 15.ai.

Anonymous 6/17/2025, 1:18:27 AM No.42276071 [Report] >>42279153

>>42275950
15...

Anonymous 6/17/2025, 3:25:32 AM No.42276441 [Report]

sleep bump

Anonymous 6/17/2025, 10:45:23 AM No.42276957 [Report] >>42277965

>>42274484

Anonymous 6/17/2025, 3:48:29 PM No.42277294 [Report]

>mares

Anonymous 6/17/2025, 5:14:02 PM No.42277427 [Report]

>https://u.pone.rs/EuipipDV.mp3
American (Dad) Ghost theme

Anonymous 6/17/2025, 10:37:15 PM No.42277965 [Report]

>>42276957
nein

Anonymous 6/18/2025, 1:55:56 AM No.42278416 [Report]

>nein

Anonymous 6/18/2025, 2:04:17 AM No.42278429 [Report] >>42278431 >>42279204

I downloaded this in 2021, it's been 4 years now. How much has it improved since then?

https://vocaroo.com/11NtyOrTttKN
https://vocaroo.com/11NtyOrTttKN
https://vocaroo.com/11NtyOrTttKN

Anonymous 6/18/2025, 2:04:47 AM No.42278431 [Report]

>>42278429
a lot.

Anonymous 6/18/2025, 7:23:58 AM No.42279044 [Report]

>Page 10

Anonymous 6/18/2025, 9:25:27 AM No.42279153 [Report]

>>42276071
He's right though. EL is arse.

Anonymous 6/18/2025, 9:32:10 AM No.42279159 [Report] >>42279204

1750213536376709[1].png md5: c6157a9c...

I want to take the costanza answering machine song and change the words while maintaining his voice. What's the most appropriate model to do this with?

Anonymous 6/18/2025, 10:26:37 AM No.42279204 [Report]

>>42279159
>keeping the og voice but slightly edited
Hmm, that will be bit tricky, if you can find a version without a laughing track, you can try run the clip through the ace-step
>https://huggingface.co/spaces/ACE-Step/ACE-Step
This should allow you to use function to partly edit the lyrics without changing the music (or so that's the general idea.
The other alternative is to find some clean clips (or de-noise them with some ai program) of costanza singing in same tune as in the show, have that 2~3 minutes of dataset trained in rvc, use some other character talknet/whatever model to sing the whole song and apply it to official soundtrack
>https://www.youtube.com/watch?v=1ghIoM89cfc&list=RD1ghIoM89cfc
>>42278429
>from previous year
>https://u.pone.rs/DFPTbUhe.mp3
Dude, tech jump feels like going from writing books by hand to using printing press. Depending on what you are trying to use if for, it will for most of the time sound about ~95% like character is supposed to sound like.

Anonymous 6/18/2025, 3:36:21 PM No.42279528 [Report] >>42279949 >>42280416 >>42282125 >>42282660

Bump against the raid

Anonymous 6/18/2025, 7:33:46 PM No.42279949 [Report]

>>42279528
ya

Anonymous 6/18/2025, 11:35:35 PM No.42280416 [Report]

>>42279528
nein

Anonymous 6/19/2025, 3:33:25 AM No.42280869 [Report]

>mares

Anonymous 6/19/2025, 9:31:38 AM No.42281330 [Report]

bumpo save

Anonymous 6/19/2025, 2:43:15 PM No.42281616 [Report]

>https://u.pone.rs/FHniGgaQ.mp3
Pinkie Pie - At God's Mercy (GAME SIZE)

Anonymous 6/19/2025, 8:06:20 PM No.42282125 [Report]

>>42279528

Anonymous 6/20/2025, 12:01:17 AM No.42282660 [Report]

>>42279528
again

Anonymous 6/20/2025, 2:35:57 AM No.42283002 [Report]

>https://u.pone.rs/dyjpaZQU.mp3
Rainbow_Dash_sings_Land_of_Shattered_Dreams_by_DragonForce

Anonymous 6/20/2025, 8:45:04 AM No.42283763 [Report]

1672959074085731.png md5: a3a470df...

>No Nurse Redheart on 15.ai
Boycotting 15

Anonymous 6/20/2025, 9:10:50 AM No.42283796 [Report] >>42284031 >>42284214 >>42291212 >>42292848 >>42293157

DOWNLOAD_STUFF_YOU_LIKE_PEOPLE.jpg md5: a9721378...

>>42274933
Six years of saving songs comes in handy sometimes. https://files.catbox.moe/gwqv9m.mkv

Anonymous 6/20/2025, 12:20:14 PM No.42284031 [Report]

>>42283796
>Filename
A philosophy to live by.

Anonymous 6/20/2025, 2:46:32 PM No.42284214 [Report]

>>42283796
nta but thank you archive-kun anon

Anonymous 6/20/2025, 6:50:17 PM No.42284569 [Report]

>https://u.pone.rs/MOQrKwwX.mp3
Redoing Cossacks letter with gpt sovits.

Anonymous 6/20/2025, 9:53:39 PM No.42285028 [Report]

>https://huggingface.co/collections/kyutai/speech-to-text-685403682cf8a23ab9466886
kyutai have posted their speech-to-text models on hugging face (it's the people who made the https://unmute.sh/ site). Hopefully they will get around publishing the TTS model some time soon.

Anonymous 6/21/2025, 12:56:57 AM No.42285552 [Report] >>42287822 >>42299779

>boop

Anonymous 6/21/2025, 2:42:43 AM No.42286094 [Report] >>42289052 >>42291103

>sleep bump

Anonymous 6/21/2025, 9:12:58 AM No.42287174 [Report] >>42287401

Screenshot 2025-06-21 030448.png md5: f5404e99...

>>42253243
I came back with some samples from my button's mom dataset that I used the following on:
De-Breath
De-Esser
Mouth De-Clicker
Plosive Remover
>Original Samples
https://files.catbox.moe/68yrm2.wav
>Processed Samples
https://files.catbox.moe/0d3djz.wav
Again, I read that the software is completely open sourced to public domain and no one owns the rights to it or what it makes, should be perfect for any use for processing data without spending money on IzoTope. You be the judge on how effective it is, I'd say it's good enough to shovel multi-hour datasets for free in one go and clean up whatever is left afterwards.

Anonymous 6/21/2025, 12:59:00 PM No.42287401 [Report] >>42288274

>>42287174
Cool stuff! With it's apparent noise and reverb removal capabilities I may have to test how well it is at salvaging previously unusable data to see if existing pony models might be expanded. Gotta first test if it works well through Wine though. I wonder if I might be able to salvage more workable Redheart data.

Anonymous 6/21/2025, 5:24:41 PM No.42287822 [Report]

>>42285552

Anonymous 6/21/2025, 8:27:58 PM No.42288250 [Report]

>pony bump

Anonymous 6/21/2025, 8:37:53 PM No.42288274 [Report]

3414155__safe_artist-colon-ewoudcponies_derpibooru+import_lyra+heartstrings_pony_unicorn_g4_bust_female_gradient+background_hooves+in+air_horn_image_ma.png md5: d655351f...

>>42287401
Hell yeah brother! That's what it's all about! There's got to be so much ponyfeather quality audio data that could have been fine with just a pop filter, and this should fix it for posterity.

Anonymous 6/22/2025, 1:14:36 AM No.42289052 [Report] >>42289899

>>42286094
ayy

Anonymous 6/22/2025, 1:29:44 AM No.42289075 [Report] >>42289305

Does anyone know what TTS service is best to use with SillyTavern?

Anonymous 6/22/2025, 3:05:10 AM No.42289305 [Report]

>>42289075
uhhh, i vaguely remember there was a plugin script (or api script?) that could connect the ST with some tts that could even be train on 10~20 minutes of dataset, but that was year or more ago and even than I personally given up on it as python dependency hell was impossible to navigate to even install that bloody thing.

Anonymous 6/22/2025, 9:00:32 AM No.42289899 [Report]

>>42289052

Anonymous 6/22/2025, 12:53:44 PM No.42290193 [Report] >>42290480

its mare

Anonymous 6/22/2025, 4:14:35 PM No.42290480 [Report] >>42305186

>>42290193

Anonymous 6/22/2025, 8:30:16 PM No.42291103 [Report] >>42292030

>>42286094
>awake bump

Anonymous 6/22/2025, 9:14:57 PM No.42291212 [Report]

1612370673499.jpg md5: dd107a11...

>>42283796
SUPERCHARGED anon, thank you

Anonymous 6/23/2025, 1:35:06 AM No.42292030 [Report]

>>42291103
indeed

Anonymous 6/23/2025, 5:44:48 AM No.42292848 [Report]

>>42283796
Nice! I think I have about that in pony memes and art among others from years of saving which come to think of it I still need to find time to sort and categorise — Thanks for the reminder.

Anonymous 6/23/2025, 8:20:18 AM No.42293157 [Report]

>>42283796
Autism yields its own rewards.
Nice.

Anonymous 6/23/2025, 12:28:34 PM No.42293460 [Report]

>pre work bump

Anonymous 6/23/2025, 4:16:20 PM No.42293824 [Report] >>42294511

Precautionary bump.

Anonymous 6/23/2025, 9:30:36 PM No.42294511 [Report]

>>42293824
aaaaaaaaaaaa!

Anonymous 6/24/2025, 1:07:47 AM No.42295095 [Report]

gn, imma going to think of what stuff to make tomorrow

Anonymous 6/24/2025, 7:16:38 AM No.42295943 [Report] >>42296474 >>42297247

Paag 10 save.

Anonymous 6/24/2025, 3:14:44 PM No.42296474 [Report]

>>42295943
Almost again.

Anonymous 6/24/2025, 9:49:30 PM No.42297247 [Report] >>42298481

>>42295943

Anonymous 6/25/2025, 12:47:54 AM No.42297721 [Report]

night bump

Anonymous 6/25/2025, 6:34:02 AM No.42298481 [Report]

>>42297247

Anonymous 6/25/2025, 7:57:36 AM No.42298627 [Report] >>42299241

Vinyl Scratch (mlp), pony, sound, cyberspace, electronic, sound waves s-1300701182.png md5: b6380d65...

>>42174105
Do we know if there are any other additional recent local audio and music generators comparable to the likes of Suno and Udio?
Aside from this example, I haven't come across a decent versatile one that can run local since Bark, which since was abandoned ages ago (as far as open source goes) and became Suno. Which is still incredibly good, but it'd be nice to have something similar that don't rely on credits and lame stuff like that.

Anonymous 6/25/2025, 2:27:31 PM No.42299090 [Report] >>42300347 >>42301020

ArtificialBumpMare_nc_104.png md5: d9e1133d...

>9
Bump mare time

Anonymous 6/25/2025, 3:46:04 PM No.42299241 [Report]

>>42298627
Stability Ai may or may not work on one, but who the fuck knows with them since they still have't publish the newer version of instrumental Stable Audio model.
Other ai song model is the YuE, but from the looks of it its bit tricky to get working locally .

Anonymous 6/25/2025, 7:40:15 PM No.42299779 [Report]

>>42285552
Boopity boop!

Anonymous 6/25/2025, 10:37:05 PM No.42300347 [Report]

>>42299090
mare

Anonymous 6/25/2025, 10:37:51 PM No.42300352 [Report] >>42301986

>>42161191 (OP)
Congratulations, 1111 aka 15!

Anonymous 6/25/2025, 11:29:51 PM No.42300581 [Report] >>42301705 >>42302358

burger whore adf39537d8ce4ad6.png md5: 764ca180...

>https://u.pone.rs/kLAzyDaA.mp3
New ai song, "I only eat 3 cheeseburgers!" from suno user 김치다시마은갈치, and converted with Twi vocals.

Anonymous 6/26/2025, 1:30:55 AM No.42301020 [Report]

>>42299090
mare harder

Anonymous 6/26/2025, 6:45:50 AM No.42301705 [Report]

>>42300581
we sell hay here not burgers

Anonymous 6/26/2025, 8:36:40 AM No.42301986 [Report] >>42302315

>>42300352
What are you referring to?

Anonymous 6/26/2025, 12:38:43 PM No.42302294 [Report] >>42302526 >>42302859 >>42303531 >>42304147

ArtificialBumpMare_106.png md5: 60ab4bb5...

>9
Eighth bump mare deployed

Anonymous 6/26/2025, 12:59:32 PM No.42302315 [Report]

>>42301986
söy of 2

Anonymous 6/26/2025, 1:25:21 PM No.42302358 [Report]

>>42300581
Could go for some burgers right about now

Anonymous 6/26/2025, 2:27:12 PM No.42302526 [Report]

>>42302294
Thank you, kind bump mare.

Anonymous 6/26/2025, 5:25:06 PM No.42302859 [Report]

>>42302294

Anonymous 6/26/2025, 7:25:32 PM No.42303176 [Report]

quick board...

Anonymous 6/26/2025, 8:42:12 PM No.42303531 [Report]

>>42302294
mared

Anonymous 6/26/2025, 9:36:50 PM No.42303825 [Report]

anti spam bump

Anonymous 6/26/2025, 11:01:57 PM No.42304147 [Report]

>>42302294

Anonymous 6/27/2025, 2:39:49 AM No.42304700 [Report]

>https://www.tomshardware.com/news/gddr6-vram-prices-plummet
>16 gb of vram could be as cheap as 400$
>but it wouldn't because nvidia are greedy fucks
i will never forgive the crypto bros for fucking up the market

Anonymous 6/27/2025, 7:36:25 AM No.42305186 [Report]

>>42290480
So it seems

Anonymous 6/27/2025, 9:01:43 AM No.42305431 [Report] >>42305442

Board is moving lightning fast this past hour.

Anonymous 6/27/2025, 9:10:24 AM No.42305442 [Report] >>42305606 >>42305635

>>42305431
it's the sliderfag

Anonymous 6/27/2025, 11:51:26 AM No.42305606 [Report]

>>42305442
Yep, it's becoming more and more blatant every time.

Anonymous 6/27/2025, 12:16:16 PM No.42305635 [Report]

>>42305442
With the lack of reaction from jannies and mods (as they are too busy to jerk off to furry fag shit), Im feeling like there could be a good idea to keep a parallel thread in nhnb and mlpol too, to at least keep some bits in case the the board kept being nuked.

Anonymous 6/27/2025, 5:53:03 PM No.42306113 [Report]

>pre dinner bump

Anonymous 6/27/2025, 10:49:37 PM No.42306741 [Report]

>up poned

Anonymous 6/28/2025, 12:50:44 AM No.42307024 [Report] >>42307277 >>42307279

Anyone know how to get 15 ai to scream? Tried to use so-vits on haysay with audio but it came out like crap. Need Lyra doing it too, and so-vits doesn't have her.

Anonymous 6/28/2025, 2:47:01 AM No.42307277 [Report] >>42308681

>>42307024
Uhhh, tts models pretty much always struggled with screaming and whispering. The older 15 model could do it to some smaller degree (but it still was a massive game of rolling the next generated clip untill you got what you wanted). I guess you could try to find screaming clip in OP mega and use that with gpt sovits reference Tts?

Anonymous 6/28/2025, 2:47:49 AM No.42307279 [Report] >>42307905 >>42308681

>>42307024
Convincing screams and other less-phonetic sounds have been notoriously difficult since the very beginning of artificial speech. Feels like it comes down to a lack of data, or the specific exclusion of which due to the negative impact its kind has on training.

Closest thing I can suggest is priming. Initiate the prompt with a sentence (or multiple) of dialogue that would ordinarily be expected to be said with intensity; be that anger, seriousness, shock, whatever. The AI likes to be consistent with outputs and therefore some of that emotion will be inherited and thus carry over to concurrent sentences — this is where you'd attempt screaming dialogue. Might also be good to try using ARPAbet for some too so it pronounced correctly.

Anonymous 6/28/2025, 9:04:37 AM No.42307905 [Report]

>>42307279
10

Anonymous 6/28/2025, 2:04:48 PM No.42308230 [Report] >>42309328

Bump.

Anonymous 6/28/2025, 7:40:07 PM No.42308681 [Report] >>42309483

>>42307277
>>42307279
Thanks for the suggestions. I ended up just regenerating an "AAAAAAAAA" prompt a bunch of times until I got as close as I could to a scream. Sounds like shite, but it was only for a little shitpost anyway. https://files.catbox.moe/z2r0c8.mp3
Which is for this for this pic in /bale/ >>42305975

Anonymous 6/29/2025, 1:36:56 AM No.42309328 [Report]

>>42308230

Anonymous 6/29/2025, 3:07:11 AM No.42309483 [Report]

>>42308681
huh, pretty neat work Anon

Anonymous 6/29/2025, 9:41:54 AM No.42310164 [Report] >>42310579

Rainbow Rizz.png md5: 3384fa0a...

>>42207220
https://files.catbox.moe/fv2v5u.wav
https://files.catbox.moe/aeqloc.wav
https://files.catbox.moe/xl6ft5.wav
https://files.catbox.moe/xl6ft5.wav

Here's some with Flutters. I just did:

"ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, cumming! ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, fuck me, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh!"

You can hear the good parts and splice those.

Anonymous 6/29/2025, 4:32:19 PM No.42310579 [Report]

>>42310164
ai mares are lewd

Anonymous 6/29/2025, 7:47:38 PM No.42310923 [Report] >>42311695

alien pony.png md5: 9d6cf1d4...

>https://u.pone.rs/NlnRoRSa.mp3
Ghost singing Past Due - Xenophobia (aka unofficial theme song of Stellaris)

Anonymous 6/30/2025, 12:45:49 AM No.42311695 [Report]

>>42310923
A classic. Let the light of mankind shine brighter than the stars themselves

Anonymous 6/30/2025, 5:55:25 AM No.42312379 [Report]

1727251__safe_artist-colon-greyscaleart_princess+celestia_oc_oc-colon-human+grey_alicorn_caffeine_clothes_coffee_coffee+mug_dilated+pupils_discovered+c.jpg md5: 8708cbcf...

RealDash 6/30/2025, 5:59:46 AM No.42312387 [Report] >>42312431

I might make a small lewd audio of Twiggle as a test for 15.dev.
Dialogue's a pain to get to sound natural, way more than 15ai's last version.

Anonymous 6/30/2025, 6:32:29 AM No.42312431 [Report]

>>42312387
>>>/trash/

Anonymous 6/30/2025, 11:49:02 AM No.42312836 [Report] >>42313175

ArtificialBumpMare_202.png md5: 06a30e71...

>9
Deploying ninth bump mare (triple pose edition)

Anonymous 6/30/2025, 5:49:11 PM No.42313175 [Report]

>>42312836
horse

Anonymous 6/30/2025, 10:19:10 PM No.42313719 [Report] >>42313730

>14.ai
lmao

Anonymous 6/30/2025, 10:22:38 PM No.42313730 [Report] >>42313761 >>42314271

>>42313719
kek, a race to the bottom. What kind of sketchy indians will we reach when we hit 1.ai?

Anonymous 6/30/2025, 10:34:08 PM No.42313761 [Report] >>42319574

green-card_thumb.jpg.webm md5: e4c41ec1...

WebM not supported

>>42313730
uh, based?

Anonymous 7/1/2025, 2:28:38 AM No.42314271 [Report] >>42314359 >>42314864

>>42313730
Or -1.ai

Anonymous 7/1/2025, 3:20:37 AM No.42314359 [Report] >>42317225

>>42314271
Interestingly, hyphens can't be used at the start or end of a domain name. Would probably have to be negative1.ai or something

Anonymous 7/1/2025, 8:31:02 AM No.42314864 [Report]

>>42314271
Witchcraft!

Anonymous 7/1/2025, 1:05:32 PM No.42315242 [Report]

Anonymous 7/1/2025, 6:25:04 PM No.42315665 [Report] >>42316221 >>42316594 >>42317148 >>42317684

>Page nine

Anonymous 7/1/2025, 11:14:27 PM No.42316221 [Report]

>>42315665
MAREEE

Anonymous 7/2/2025, 1:55:25 AM No.42316594 [Report]

>>42315665
early sleep bump

Anonymous 7/2/2025, 7:29:02 AM No.42317148 [Report]

>>42315665

Anonymous 7/2/2025, 8:24:49 AM No.42317225 [Report] >>42317279

>>42314359
Or simply minus1.ai. It's kind of a word play.

Anonymous 7/2/2025, 9:02:08 AM No.42317279 [Report]

>>42317225
Clever. I like it.

Anonymous 7/2/2025, 3:05:27 PM No.42317684 [Report]

>>42315665

Anonymous 7/2/2025, 6:55:40 PM No.42317966 [Report] >>42318161

beatles.jpg md5: 8c8b6c40...

A very quick cover of Beatles' With a Little Help from My Friends with slightly modified lyrics
https://u.pone.rs/ODLJbBek.flac

Anonymous 7/2/2025, 9:01:14 PM No.42318161 [Report]

>>42317966
Nice work Anon! Funny enough, I listen to some random Beatles song a week ago and wished there was some covers or parodies done in pony voices.

Anonymous 7/2/2025, 10:59:00 PM No.42318380 [Report]

error heysay sovits 4.png md5: 9b9eab41...

Hi HydrusBeta, Im getting error when using the sovits 4.0 Spitfire model with 'reduce hoarsness' and 'apply nsf_higan' setting, and it works if I turn these two settings off.

Anonymous 7/3/2025, 1:27:02 AM No.42318746 [Report]

spitfire beach by yakovlev-vad 2137101 - Copy.png md5: 2adaed5f...

>https://u.pone.rs/KbiNvzqK.mp3
Solitary Summer Dream by suno user testediserie.
I was looking for a nice summer song for Celestia, I found myself really enjoying listing to this BUT rvc and other voice converts disagreed with my vocal choice, so we all get to enjoy Spitfire cover, since her voice haven't been used that much.

Anonymous 7/3/2025, 2:22:39 AM No.42318932 [Report]

Late night bump.

Anonymous 7/3/2025, 5:39:19 AM No.42319283 [Report]

What's the current torrent for the MLP leak files?

Anonymous 7/3/2025, 6:16:17 AM No.42319348 [Report] >>42319606

CelestAI - Concentration and Morality.png md5: 3260a82c...

>42119384 42196683 42317225
Yet it is proper to enumerate as such among the Trotting ways.

>42161222 42269737 42208841
ppp as tragedy of the commons
Things fall apart, the centre cannot hold - Keats
pandora's vox on community in cyberspace - humdog
yet... n mare saddlepoint? The altchans apart were less a scattering of the winds and more of the Shattered sundered.

>42204138 42198701 42195922
The Cathedral and the Bazaar - Raymond, acknowledging Tarver's Bizarre Empty Temples.
Cathedral vs. Parlor - Wrye, acknowledging Monitor144hz's Patreon Pigeonhole.
Tamers1-4,5 voices when?

>42270729
It's been a long thread. Bacon-bakin' necessary.

Anonymous 7/3/2025, 8:22:18 AM No.42319574 [Report]

>>42313761
Who are these dunces?

Anonymous 7/3/2025, 8:47:19 AM No.42319606 [Report] >>42319961 >>42320204

pip stare.png md5: 7353ef54...