
Thread 42161191

545 posts 222 images /mlp/
Anonymous No.42161191 >>42161203 >>42196683 >>42300352
Pony Preservation Project (Thread 154)
Welcome to the Pony Voice Preservation Project!
youtu.be/730zGRwbQuE

The Pony Preservation Project is a collaborative effort by /mlp/ to build and curate pony datasets for as many applications in AI as possible.

Technology has progressed such that a trained neural network can generate convincing voice clips, drawings and text for any person or character using existing audio recordings, artwork and fanfics as a reference. As you can surely imagine, AI pony voices, drawings and text have endless applications for pony content creation.

AI is incredibly versatile: basically anything that can be boiled down to a simple dataset can be used for training to create more of it. AI-generated images, fanfics, wAIfu chatbots and even animation are possible, and are being worked on here.

Any anon is free to join, and there are many active tasks that would suit any level of technical expertise. If you’re interested in helping out, take a look at the quick start guide linked below and ask in the thread for any further detail you need.

EQG and G5 are not welcome.

>Quick start guide:
docs.google.com/document/d/1PDkSrKKiHzzpUTKzBldZeKngvjeBUjyTtGCOv2GWwa0/edit
Introduction to the PPP, links to text-to-speech tools, and how (You) can help with active tasks.

>The main Doc:
docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit
An in-depth repository of tutorials, resources and archives.

>Online speech generation
haysay.ai

>Active tasks:
Research into animation AI
Research into pony image generation

>Latest developments:
http://ponepaste.org/10865

>The PoneAI drive, an archive for AI pony voice content:
drive.google.com/drive/folders/1E21zJQWC5XVQWy2mt42bUiJ_XbqTJXCp

>Clipper’s Master Files, the central location for MLP voice data:
mega.nz/folder/jkwimSTa#_xk0VnR30C8Ljsy4RCGSig
mega.nz/folder/gVYUEZrI#6dQHH3P2cFYWm3UkQveHxQ
drive.google.com/drive/folders/1MuM9Nb_LwnVxInIPFNvzD_hv3zOZhpwx

>Cool, where is the discord/forum/whatever unifying place for this project?
You're looking at it.

Last Thread:
>>42103996
Anonymous No.42161200
FAQs:
If your question isn’t listed here, take a look in the quick start guide and main doc to see if it’s already answered there. Use the tabs on the left for easy navigation.
Quick: docs.google.com/document/d/1PDkSrKKiHzzpUTKzBldZeKngvjeBUjyTtGCOv2GWwa0/edit
Main: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit

>Where can I find the AI text-to-speech tools and how do I use them?
A list of TTS tools: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit#heading=h.yuhl8zjiwmwq
How to get the best out of them: docs.google.com/document/d/1y1pfS0LCrwbbvxdn3ZksH25BKaf0LaO13uYppxIQnac/edit#heading=h.mnnpknmj1hcy

>Where can I find content made with the voice AI?
In the PoneAI drive: drive.google.com/drive/folders/1E21zJQWC5XVQWy2mt42bUiJ_XbqTJXCp
And the PPP Mega Compilation: docs.google.com/spreadsheets/d/1T2TE3OBs681Vphfas7Jgi5rvugdH6wnXVtUVYiZyJF8/edit

>I want to know more about the PPP, but I can’t be arsed to read the doc.
See the live PPP panel shows presented on /mlp/con for a more condensed overview.
2020 pony.tube/w/5fUkuT3245pL8ZoWXUnXJ4
2021 pony.tube/w/a5yfTV4Ynq7tRveZH7AA8f
2022 pony.tube/w/mV3xgbdtrXqjoPAwEXZCw5
2023 pony.tube/w/fVZShksjBbu6uT51DtvWWz

>How can I help with the PPP?
Build datasets, train AIs, and use the AI to make more pony content. Take a look at the quick start guide for current active tasks, or start your own in the thread if you have an idea. There’s always more data to collect and more AIs to train.

>Did you know that such and such voiced this other thing that could be used for voice data?
It is best to keep to official audio only unless there is very little of it available. If you know of a good source of audio for characters with few (or just fewer) lines, please post it in the thread. 5.1 is generally required unless you have a source already clean of background noise. Preferably post a sample or link. The easier you make it, the more likely it will be done.

>What about fan-imitations of official voices?
No.

>Will you guys be doing a [insert language here] version of the AI?
Probably not, but you're welcome to. You can however get most of the way there by using phonetic transcriptions of other languages as input for the AI.

>What about [insert OC here]'s voice?
It is often quite difficult to find good quality audio data for OCs. If you happen to know any, post them in the thread and we’ll take a look.

>I have an idea!
Great. Post it in the thread and we'll discuss it.

>Do you have a Code of Conduct?
Of course: 15.ai/code

>Is this project open source? Who is in charge of this?
pony.tube/w/mqJyvdgrpbWgZduz2cs1Cm

PPP Redubs:
pony.tube/w/p/aR2dpAFn5KhnqPYiRxFQ97

Stream Premieres:
pony.tube/w/6cKnjJEZSCi3gsvrbATXnC
pony.tube/w/oNeBFMPiQKh93ePqTz1ns8
Anonymous No.42161203
>>42161191 (OP)
Anchor.
Anonymous No.42161222
>woken up just 5 minutes after thread passed page 10
Stupid fuckers and their "1 post by OP with retarded one bait sentence" threads.
Anyhow, are you guys busy with doing entries for antithology or what (I know I am, im sitting on like 5 half assed ideas that still need doing) ?
Anonymous No.42161566 >>42161651
>page 9 after less than 4 hours
Board activity but at what cost ?
Anonymous No.42161651
>>42161566
The cost is our sanity.
Anonymous No.42163358 >>42164184
Is there a FLA of Fluttershy's cabin interior or her bedroom in the leak on web archive called MLP FLAs? I tried Dragonshy, Part 1 of Friendship is Magic and Stare Master but it's not in those...
Anonymous No.42164184 >>42169147
>>42163358
From what quick google-fu tells me, the list of season 8 episodes whose full assets leaked (and that we should have access to) is as follows:
6 - "Surf and/or Turf", 7 - "Horse Play", 8 - "The Parent Map", 9 - "Non-Compete Clause", 10 - "The Break Up Break Down", 11 - "Molt Down", 13 - "The Mean 6"
I swear we had some bits and bobs from other episodes, but I can't seem to find a proper list of what is (and is not) archived.
There is this scene from Super Speedy Cider Squeezy 3000 (and I think in the later season eps with Nightmare Night and the one where Discord suffers from being "normal" as well)?
Anonymous No.42165897 >>42165947
>https://codeberg.org/nak/sample-neko
Here is a tool that I spotted on the interwebs that lets you easily list and move 1k+ sound clips from one folder to another.
I feel like it could be really useful to Anons here organising their folders for production of big or small projects.
Anonymous No.42165947 >>42166063
>>42165897
was literally thinking about how i needed sound effects from the show for a project i was doing
more specifically little things like character laughs or snorts n stuff
Anonymous No.42166063
>>42165947
A lot of those are in Clipper's Master File Part 2:
https://mega.nz/folder/gVYUEZrI#6dQHH3P2cFYWm3UkQveHxQ/folder/EMZF3ApB
Anonymous No.42166563 >>42167638
Bump.
Anonymous No.42166887
>https://files.catbox.moe/vx3yr9.mp3
Anonymous No.42167638 >>42168304
>>42166563
Anonymous No.42168304
>>42167638
Anonymous No.42169147
>>42164184
ugh, is there a way to get the pop up when you first download a torrent to select files to download again? I've got the magnet for the leak.
Anonymous No.42169246 >>42169373
Best tools if I want to gen Cozy Glow lines?
Anonymous No.42169373 >>42169376 >>42169924
>>42169246
I'm guessing you want to run it locally rather than use Haysay? Get yourself Python and GPT-SoVITS.
>https://github.com/effusiveperiscope/GPT-SoVITS
>https://huggingface.co/therealvul/GPT-SoVITS-v2/tree/454406eb40b63c5571f33c29f4fd8bac197131d6/CozyGlow-SVe24-GPTe48
Anonymous No.42169376 >>42169392
>>42169373
Which haysay architecture has the best Cozy?
Anonymous No.42169392
>>42169376
I'm pretty fond of the RVC one, BUT it's heavily dependent on the input audio.
Anonymous No.42169924 >>42170104 >>42171853
>>42169373
What's the current sota for voice2voice conversion? Preferably something that can be finetuned. The latest gptsovits v4 is very good but it doesn't sound like the reference so an additional step is needed I think
Anonymous No.42170104
>>42169924
RVC and so-vits are still king. I think some Anons posted other "minimal dataset voice cloning" stuff in the past, but none of it seems to stick around (with the GitHub codefags making their training process way too complex, or pulling requirements out of their asses).
Anonymous No.42170546 >>42171393
I heard through the grapevine that 15.ai is coming back, anyone heard about that?
Anonymous No.42171393
>>42170546
>https://desuarchive.org/mlp/thread/41706417/#41711970
Pretty sure that site is still ded, and it will stay that way for very long time (aka 4ever). if any new code were to be produce by 15ai it would need to be some kind of collaboration with other codefags to avoid being chased by tiny hat lawyers , and by logic of nobody sharing such news around means it's not happening .
Anonymous No.42171853 >>42172693
>>42169924
GPT-SoVITS is mainly intended for text-to-speech. The reference audio is only for providing an emotional style. For speech-to-speech, you should stick to RVC.
Anonymous No.42171965 >>42172009
Is Haysay down for anyone else? I can't seem to reach the site at all.
Anonymous No.42172009
>>42171965
https://files.catbox.moe/4sz8fc.mp3
the pretty mare voice site seems to be working fine for me. did you try different browser anon?
Anonymous No.42172693 >>42172807
>>42171853
Why wouldn't I be able to do GPT-SoVITS => RVC?
Anonymous No.42172807
>>42172693
Yeah, you can. One problem is that RVC sometimes derps out the outputs when you feed it lines from the same character; sometimes it depends on what kind of note the clip is hitting, and sometimes the electronic goblins are messing about. So just test out different TTS voices to see which one works best with the RVC character you want to output.
Anonymous No.42172881
>https://nitter.space/shweta_ai/status/1912536464333893947
I need this for mare content, so I can finally get AJ to speak with a deep south accent without fluffing around with different word spellings, or get Rarity to pronounce words in a way more posh manner.
Anonymous No.42173088
>>42166202
>>42166241
Crossposting from the /chag/ thread: they are planning on doing some collaboration with the /robowaifu/ guys to start making irl robot ponies. Very cool, and good luck to you!
Anonymous No.42173899 >>42173902
First actually good local music model, like suno v2 quality. Fast as fuck as well.

https://www.reddit.com/r/LocalLLaMA/comments/1kg9jkq/new_sota_music_generation_model/
Anonymous No.42173902
>>42173899
Also has lora training already, could 100% train pony singing.
Anonymous No.42174105 >>42174936 >>42298627
https://ace-step.github.io/
https://github.com/ace-step/ACE-Step

Passes the nigger test.
https://vocaroo.com/11MoCQ68jiLY

And this is fun.
>>>/g/105183843
>>>/g/105184228
I'd love to try with some MLP songs, but I'm a VRAMlet with 6GB and I don't think I can run this yet.
Anonymous No.42174701 >>42175724
Bump.
Anonymous No.42174936 >>42175015
>>42174105
uhh, the collab file they provided seems to only do "text2music", could you/somebody explain how that anon re-edited the OG song with new shitpost lyrics into it?
Anonymous No.42175015 >>42175321
>>42174936
Oh, just noticed it's in the repair->upload section. However, I tried to do a "replace X lyrics with new lyrics" and it really seems to suck ass at it, so I'm not sure if the anon that made the above song was lucky or had enough autism to spend several hours trying all kinds of combinations to make it work.
Anonymous No.42175321 >>42175929
>>42175015
Nope, people posted multiple results in that thread where it Just Worked. The only thing I saw is that the quality will get worse the more the lyrics are changed.
Anonymous No.42175724 >>42180991
>>42174701
Anonymous No.42175929 >>42175971
>>42175321
Oh. I was trying to go for a full lyric replacement. I guess this GitHub is a step in the right direction; it's just not ready for my exact autistic requirements.
Hopefully by next year we will get improvements on it, because I have some text parody ideas.
Anonymous No.42175971
>>42175929
I saw someone say that you can separate the stems and get better results. Perhaps you could edit portions of the lyrics one at a time, then mix them back into the instrumental.
VilligerANON No.42176140 >>42176220
Question:
During training, can I use files tagged as clean and noisy files?
Anonymous No.42176220
>>42176140
Sure, but keep in mind the quality of the audio outputs may suffer from it, especially if the ratio of good clips to noisy clips skews towards the noisy side.
And since there are characters that have pretty much nothing but noisy audio (like Tree Hugger), the end results may vary from "kind of bad" to "surprisingly decent".
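For anyone who wants to check that clean-to-noisy ratio before training, here is a minimal sketch. It assumes the noise level is written into each clip's filename as `_Noisy_` or `_Very Noisy_` and that anything without such a tag counts as clean; that naming scheme and the function names `noise_label` and `noise_summary` are assumptions for illustration, so adjust them to match however your clips are actually tagged.

```python
from collections import Counter


def noise_label(filename: str) -> str:
    """Classify one clip by a noise tag in its filename (assumed convention)."""
    lowered = filename.lower()
    # Check the more specific tag first so "very noisy" is not counted as "noisy".
    if "_very noisy_" in lowered:
        return "very_noisy"
    if "_noisy_" in lowered:
        return "noisy"
    return "clean"


def noise_summary(filenames):
    """Return per-label counts and the share of clips labelled clean."""
    counts = Counter(noise_label(name) for name in filenames)
    total = sum(counts.values())
    clean_share = counts["clean"] / total if total else 0.0
    return counts, clean_share
```

If the clean share comes out low, one option is to drop only the "very noisy" clips first and see whether enough audio is left to train on.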
Anonymous No.42176608 >>42176655
Question to the Anon that was working on OpenUtau diffsinger models, are you planning on creating the models for Rarity and Fluttershy?
DiffAnon No.42176655
>>42176608
Truth be told, I was planning on it eventually, but I don't know if I really want to anymore. Twilight, Applejack, Rainbow Dash, and Pinkie Pie are a bit spotty as is, and I worry that with Fluttershy's abysmally low amount of singing data (from what I could find) and just not feeling up to it for her or Rarity, I don't think either of them are gonna be made into models anytime soon. Keep in mind, I don't just train one thing, I have to train the acoustic model, then the variance model, then the pitch model, and then fine tune the vocoder, which both takes a lot of time and a lot out of me. I'm not saying it won't ever happen, because I do feel weird about leaving things with just the four I did, but I can't for the life of me bring myself to do the other two just yet. But they'll come one day, hopefully.
Anonymous No.42176713 >>42177027 >>42177542 >>42179060
Speaking of model training, there's still a good few voices that're absent on RVC. It'd be nice to see Moondancer and Cadance and whoever else hasn't been trained yet, Cadance has a model for RVC but it's super noisy.
Anonymous No.42177027 >>42177131
>>42176713
>Moondancer
huh, you are correct, I will see if I can train her rvc model.
Anonymous No.42177131
>>42177027
Hmm, not great news. I've checked the Mega, and even when removing only the unusable, very noisy audio lines, there is still only 1m50s of audio, which is less than the ideal 3m, but I can still try.
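A quick way to get that total before committing to a training run is to sum the clip lengths in the folder. This is only a sketch using Python's standard `wave` module; it assumes the clips have already been exported as uncompressed WAV into one folder, and the function names are made up for illustration.

```python
import wave
from pathlib import Path


def clip_seconds(path) -> float:
    """Length of one WAV clip in seconds (frames divided by sample rate)."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def total_seconds(folder) -> float:
    """Sum the lengths of every .wav file directly inside the folder."""
    return sum(clip_seconds(p) for p in sorted(Path(folder).glob("*.wav")))


def format_duration(seconds: float) -> str:
    """Render seconds in the XmYYs style used in the thread."""
    minutes, secs = divmod(int(round(seconds)), 60)
    return f"{minutes}m{secs:02d}s"
```

Calling `format_duration(total_seconds("path/to/clips"))` on the filtered folder gives a number to compare against the roughly 3 minutes mentioned above.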
Anonymous No.42177542 >>42178958
>>42176713
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/MLP_Moondancer
>https://vocaroo.com/1hV4kTcwCp3E
Here she is. The result isn't half bad, but for some reason her voice seems to slip into Rarity's voice range. And of course male input voice lines will sound a bit rougher in conversion.
Anonymous No.42178459
>>42178450
more years! TRUST THE PLAN!
Anonymous No.42178958 >>42179060
>>42177542
Awesome, thanks. I look forward to trying it once I have the time.
Anonymous No.42179060 >>42185313
>>42176713 >>42178958
>Cadance
>https://voca.ro/188F1imvN2L7
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/MLP_Cadance_Clean
RVC model of Cadance, trained on clean audio only.
Anonymous No.42180991 >>42181427
>>42175724
Anonymous No.42181427 >>42185114
>>42180991
VilligerANON No.42181482 >>42181575 >>42191078
https://files.catbox.moe/x41lrp.wav
I have generated with this repo: https://github.com/CookiePPP/cookietts
Model from: https://drive.google.com/drive/folders/1nTyn6qr2b76aOE430trasuZj0Kr2H_ya
(Tacotron2: tt2_outdir_p3_2_0.5DFR_0.0Dropout)
(Hifi-gan cp_hifigan_universal44Khz_mlpft)
>Maybe I will create a better vocoder and Notebook
Anonymous No.42181575 >>42183033
>>42181482
That's interesting Anon but I'm not sure on how it will compare with all the new tech, since tacotron is almost five years old.
Anonymous No.42183033 >>42183358
>>42181575
I feel like there isn't much coming out for pony specifically in recent times though.
VilligerANON No.42183358 >>42183361 >>42183486
Does anyone want any bonus features that I can add?

>>42183033
I know, right?
VilligerANON No.42183361
>>42183358
> To the Inference Script
Anonymous No.42183486
>>42183358
Well, I would like it if the offline GPT-SoVITS script also copied the Haysay options for the automatic emotions drop-down menu, as well as the audio clip slow-down/speed-up stretch settings, but that's something Vul would need to add to his webui script.
Anonymous No.42184597 >>42184600 >>42184613
>nitter.space/jason_kint/status/1921546181357838531
>nitter.space/LuizaJarovsky/status/1921286826402422927
>ai copyright to affect the "commercial use"
Time to split the hairs on what counts as "commercial use" and what doesn't. Also good luck trying to force this on china and their no-fucks-given R&D departments.
Anonymous No.42184600 >>42184628
>>42184597
>muttmerica
Phew, I thought it was actually serious.
Anonymous No.42184613 >>42184628
>>42184597
>america keeps digging its grave in the name of "progress"
the soviet union fell behind in technology because the government tried to control things, but yeah, let's not learn anything from that.
Anonymous No.42184628
>>42184600
I can see Disney and such trying to push for it, just like they did with hundreds of years of copyright laws, but as Anons on /g/ pointed out, all the big league companies need to do is buy portions of semi-big publishing companies and claim that retroactively all the existing books on the system were allowed to be used in AI training.
>>42184613
Tell me about it, I remember reading a biography of electrician that was bribed to "no be in hurry" when repairing the wheat moisture measuring apparatus, because the assigned inspector could use rule of thumb on deciding how much moisture was in the transported grain and deduce the farmers pay while pocketing the spillway difference.
Anonymous No.42185114 >>42185748
>>42181427
Anonymous No.42185313 >>42185322 >>42187317
>>42179060
Your local AI still can't sing worth a shit.
Evolve or die, PPP.

Voice acting requires a certain melodic way of talking which your current model does not support, 3P General.
Anonymous No.42185322 >>42186106
>>42185313
There are no more than ten anons itt, all namefags, that know their shit, and they lead very busy lives. This thread was just anons enjoying the fruits of others' labors. There are no more fruits to enjoy, or worth enjoying so the Pony Preservation Project has become the Pony Preservation Project Preservation Project. It's over.
>Mareification not required.
Anonymous No.42185748 >>42190922
>>42185114
Anonymous No.42186106 >>42187314
>>42185322
Yeah, back in 2019-20 everybody was hyped since the show had only just ended and the board was still pretty alive (and with everyone locked up, all they could do was make pony content without any distractions). Now a lot of AI tools have become available (music, art, even animations), but everything is kind of disjointed and difficult to put together.
Anonymous No.42186245 >>42188332
I feel Anons just need to find a proper spark, something that would be fun to work on, like randomly spotting a song and wondering how it would sound if it was done by pony.
>https://files.catbox.moe/qg2qn5.mp3
Anyhow, VS singing the new Ye song, OG cover from TowerGangToad. I really wanted to use Zecora's voice, but the voice clips just wouldn't come out right from either of the model types.
Anonymous No.42187314
>>42186106
Don't forget that a lot of new stuff gets immediately corpo'd these days too. Shit like that stifles innovation.
Anonymous No.42187317
>>42185313
>melodic way of talking
China is the future
75.ai No.42187322 >>42189535
I will save this general.
Anonymous No.42188332 >>42188430
>>42186245
Try replicating S1 Luna's voice. Chip in some money and put Tabitha to voice it.
Anonymous No.42188430
>>42188332
>S1 Woona
It's technically doable.
https://huggingface.co/spaces/Plachta/VALL-E-X
https://desuarchive.org/mlp/thread/40503961/#40518915
It will just take about 1~6 months of non stop generating audio until the artificial dataset has five minutes worth audio clips.
Anonymous No.42188537 >>42188542
>>/wsg/5872172
I want this, but for ponies. Dubbing in my country is cursed: either the VAs put their energy into emphasizing the wrong aspect of a character (a young rogue-like adventurer will instead sound like a snotty little shit), give no shits about acting at all, or the role goes to somebody who completely does not fit the character.
Anonymous No.42188542
>>42188537
>https://files.catbox.moe/yck7ps.mp4
fug, crossposting failed
Anonymous No.42189535
>>42187322
Anonymous No.42190366
>>42188455
Would be funny if that happened.
Anonymous No.42190922 >>42192724
>>42185748
VilligerANON No.42191078 >>42191285
>>42181482
I've updated the synthesis script, and now these are the new results
>https://files.catbox.moe/tv8c4i.wav
Does it sound like those 48 kHz MMI models, or does it sound like newer tech?
Anonymous No.42191153 >>42191285
https://www.minimax.io/audio
https://minimax-ai.github.io/tts_tech_report/
Anonymous No.42191285 >>42191588
>>42191078
Is that TTS or voice conversion? It still has that funny buzzing that tacotron2 / talknet models suffered from, so it's kind of hard to tell.
>>42191153
Hmm, the website does not seem to be more useful than other TTS sites. BUT the paper is interesting; if the 5-second cloning is not completely cherry-picked bullshit, I would love to be able to use it.
VilligerANON No.42191588
>>42191285
TTS.
> The repo:
> https://github.com/TheDevloper2023/cookiettsfork/tree/master/CookieTTS
> which is a fork of https://github.com/CookiePPP/cookietts/tree/master
Anonymous No.42192724 >>42193438
>>42190922
Anonymous No.42193438
>>42192724
15 No.42195922 >>42195946 >>42196013 >>42196073 >>42196204 >>42196230 >>42196243 >>42196274 >>42196298 >>42196305 >>42196355 >>42196384 >>42196390 >>42196393 >>42196435 >>42196479 >>42196514 >>42196530 >>42196639 >>42196646 >>42196654 >>42196738 >>42196754 >>42196789 >>42196790 >>42196800 >>42196801 >>42196849 >>42196867 >>42196960 >>42197340 >>42197606 >>42198611 >>42198701 >>42204454 >>42204550 >>42204573 >>42204939 >>42205939 >>42205963 >>42206122 >>42207220
Hi, it's been a while, hasn't it?
Here's an alpha website that you can play around with: https://alpha.15.dev/
The backend is currently running on just two GPU instances, and I've set the inference batch size to 1 since this new model requires a lot more computational power than it did two years ago. I can increase the number of GPUs depending on how long each request takes.
More characters and emotions will come soon. Feel free to report any bugs or issues here, too.
Anonymous No.42195946
>>42195922
holy shit
Anonymous No.42196013
>>42195922
I hate your guts, sleazebag
Anonymous No.42196073
>>42195922
>https://alpha.15.dev/examples
nice examples kek
VilligerANON No.42196204 >>42196227
>>42195922
>https://alpha.15.dev/
Can I send this outside of this thread?
15 No.42196227
>>42196204
Sure, go ahead. I'll make an official post on Twitter soon, probably within the next few days.
Anonymous No.42196230
>>42195922
I'm kneeling so hard rn it hurts
Anonymous No.42196243
>>42195922
I have no choice but to kneel
Anonymous No.42196274
>>42195922
IT'S HAPPENING!
Anonymous No.42196298
>>42195922
https://files.catbox.moe/k18mof.mp3
Three stars and now this? We are so fucking back boys!
BGM No.42196305 >>42196384 >>42196896
>>42195922
https://files.catbox.moe/01otal.wav
Woah, hi again.
New model's sounding better than ever before. Good speed, emotion settings all work reliably, sounds clear. At the moment it sounds like the characters fall out of how they're supposed to sound on occasion though. Rarity in particular with the fear emotion gives some very strange outputs.
https://files.catbox.moe/k1kvsc.wav

Also, as a UI note, the change notifications that appear upon switching settings and voices block the generation button on some resolutions when scrolled up. Only for a second, but it can still delay things.
Anonymous No.42196337 >>42196341 >>42198701
Dear Hydrus Beta, as everyone will get really hyped for return of 15ai, I just want to say I appreciate your work and thanks to HaySay I was able to do all the fun mare music conversion. I hope you will keep it alive and updated as new voice ai will show up in the future.
BGM No.42196341 >>42198701
>>42196337
Seconding this, Haysay is a godsend for my workflow on music projects.
Anonymous No.42196355 >>42196384
>>42195922
https://u.pone.rs/whgPbfzU.mp3
Anonymous No.42196384
>>42195922
>new site
>>42196305
>new shitpost
>>42196355
>new smutty
brings me back
Anonymous No.42196390
>>42195922
https://voca.ro/140YNkYngHyz
Anonymous No.42196393
>>42195922
Godlike web dev skills god fuckin damn
Anonymous No.42196435
>>42195922
https://u.pone.rs/EcUvtwYk.mp3
Anonymous No.42196476
I hope he will add the old "|" emotional control from the previous website, since the clip reference one is pretty wishy washy. Having both would be pretty perfect to fine tune the output audio.
Anonymous No.42196479
>>42195922
I can't believe waiting two weeks (a few times) actually worked!
Anonymous No.42196514 >>42196526
>>42195922
Yep, it's been a while, cool website.
Let me nit pick on flicker during that transition animation.
Anonymous No.42196526
>>42196514
literally unplayable
Anonymous No.42196530
>>42195922
Curious, how much (if any) AI did you use to make the website?
As for the framework.. React + Next.js? Looks good.
And welcome back.
Anonymous No.42196569 >>42196571
>there is site OC
Im so sorry bro, but the internet rule demand it.
Anonymous No.42196571 >>42197112
>>42196569
qt oc, whose artstyle is that
Anonymous No.42196589
>https://u.pone.rs/mLbrNDQB.mp3
Let's test this new site. Gin Blossoms - Hey Jealousy, done with Glimmer RVC to Sovits5 singing model (sounds ok, but I was hoping it would be better).
Anonymous No.42196639
>>42195922
https://vocaroo.com/1bITXue82eed
Anonymous No.42196646
>>42195922
WE ARE SO FUCKING BACK LIKE NEVER BEFORE
Anonymous No.42196654
>>42195922
we got 15.ai revival before gta 6
Anonymous No.42196683 >>42196751 >>42264821
>>42161191 (OP)
I know I speak to the dedicated deluded, but the machine is not the path.
Anonymous No.42196738 >>42196755
>>42195922
awesome work but damn we really need an S1 Dash voice preset or something. nu-Dash voice is fucking nails on a chalkboard.
Anonymous No.42196751
>>42196683
Get a hobby you poor creature.
Anonymous No.42196754 >>42196757 >>42196772 >>42196778
>>42195922
Can we get an ETA on when you are open sourcing this?

I think it is an obvious concern that this will all suddenly disappear for years again.
Anonymous No.42196755 >>42196758
>>42196738
I'd say completely exclude post S3 audio for mane six. Of course it's needed for side characters who lack speaking lines, but it's better to avoid when possible.
Anonymous No.42196757 >>42196778
>>42196754
About 14 days or so
Anonymous No.42196758
>>42196755
*S2
Poopsikins No.42196768
https://files.catbox.moe/o4z53n.mp3
Anonymous No.42196772
>>42196754
one more fortnight
VilligerANON No.42196778 >>42196785
>>42196757
>>42196754
How do you know that?
Anonymous No.42196785
>>42196778
Sounds like you're not trusting the plan
Anonymous No.42196789
>>42195922
CHUDDA ETERNALLY BTFO
IT'S HAPPENING
Poopsikins No.42196790
>>42195922
https://files.catbox.moe/9gopqy.mp3
Anonymous No.42196800 >>42196809 >>42196814 >>42196826
>>42195922
Your shit is obsolete, yes that's what happens when you sit on your ass for years with proprietary software. Thanks for GPTSoVits and other solutions. You should have disappeared with your website, at least that wouldn't have tainted the few good memories left when using your tool. Fuck you and your five hours of fame you needed to still feel relevant.
Anonymous No.42196801 >>42196803 >>42196820
>>42195922
One kinda big problem, it won't let me use the ' sign for words... which is weird since a lot of words like don't and isn't NEED that sign.
Anonymous No.42196803 >>42196810
>>42196801
You do not need that.
Anonymous No.42196809
>>42196800
shut up, nigger
Anonymous No.42196810
>>42196803
You're right, I don't, but if 15 can fix that, it'd be a big help. Otherwise, the ai second guesses the pronunciation for the words, and it's just... I dunno, I just think it would be a good QOL fix.
Anonymous No.42196814
>>42196800
Total barbietranny death.
Anonymous No.42196820
>>42196801
YES HE FIXED IT!! Thank you 15!
Anonymous No.42196826
>>42196800
It does sound like ass. It's a shame because they're ponies.
Anonymous No.42196849
>>42195922
>>>/g/105281388
Anonymous No.42196867 >>42196897 >>42196906 >>42197553
>>42195922
Nightmare Moon has a huge improvement from her previous voice that just sounded like drunk Cheerilee
https://voca.ro/1j9J3CBPQqWN
Poopsikins No.42196896 >>42196899 >>42196914 >>42196923
>>42196305
https://files.catbox.moe/ryyshr.mp3

https://files.catbox.moe/nu5qft.mp3

https://files.catbox.moe/urd6et.mp3

Gosh, I've missed this so much. Posting like this takes me back.
Anonymous No.42196897
>>42196867
OKAY DAMN that actually sounds dynamic! I love it!
Anonymous No.42196899 >>42196902
>>42196896
Derpy, Maud, and Rainbow Dash, right? It's great that I can actually recognize the voices, to be honest.
Anonymous No.42196902 >>42196947
>>42196899
>Derpy
>It's great that I can actually recognize the voices
Anonymous No.42196906
>>42196867
Nice!
Anonymous No.42196914
>>42196896
https://voca.ro/1jlDvvakwJgi
Poopsikins No.42196923 >>42196985
>>42196896
https://files.catbox.moe/l8ex9a.mp3
Anonymous No.42196947
>>42196902
Is that not Derpy? I thought because of the “clumsy” mistake and the familiar tone that it was her.
Anonymous No.42196960
>>42195922
https://u.pone.rs/moQGuPxl.mp3
Poopsikins No.42196985
>>42196923

last one from me tonight.
https://files.catbox.moe/esztvq.mp3
Anonymous No.42197112
>>42196571
I know who's the artist I would rather not tell you directly.
he draws fuck tons of futa.
Anonymous No.42197294 >>42197297
https://files.catbox.moe/qoia1a.wav

Luna's crash-out in A Royal Problem if she wasn't fucking around.
Anonymous No.42197297
>>42197294

https://files.catbox.moe/hhwgsc.mp3

mp3 like it should've been from the beginning lol.
Vogelfag revealed No.42197340 >>42197345 >>42197358 >>42197474 >>42197618
She sounds angry & sarcastic which is how I feel, but still unintended on my part.
https://pub-f3186dbecfd64ac085ddc742fc900f59.r2.dev/twilight_sparkle_neutral_1747418267794_variation0.wav

>>42195922
>Feel free to report any bugs or issues here, too
Yeah I see several bugs:
0. You're still not willing to jew out despite clearly needing the money and influence. Jew out or others will outjew you. Stop being a social recluse that's how all scientists die. Learn to sue everyone cause 11.AI clearly stole your technology you moron.
1. You're not open sourcing this to the community (which are of minimal help and lack money to pay for GPUs but they're willing to learn and are very loyal and creative despite me trashtalking them myself back in October)
2. I'm pretty sure ElevenLabs, Udio.AI, SUNO.Ai, etc. stole your technology and perfected it already since 90% of the singing & talking sounds like Tara Strong, Rebecca Shoichet & Ashleigh Ball. The AI can really sing too. To an audiophile it still sounds bad, but to a normie it sounds perfect. Get a fucking marketing team, both you and Tara Strong fucked each other up and should sue every single audio AI possible.
This is what Suno Ai can do right now with the paid model:
https://www.youtube.com/shorts/udOgG0M8pVI


3. Your options & UI is still limited. If I could search a reference line to use any emotion I want without typing in phonetics then that'd be useful for the average normie. You didn't understand what I just told you, did you? LET ME USE THE REFERENCE LINE TO QUICKLY & INSTINCTIVELY USE THE EMOTION I WANT. WE HAVE AN IMPECCABLE MEMORY OF THE SHOW'S DIALOGUE LINES.
Add a voice changer/voice to audio option. It would be so much more intuitive because the AI could hear what emotion I'm going for instantly.

Today's AI still lack a ton of UI options but are getting there at an insanely quick speed such as Suno's ability to grab an existing song and have either the same singer or a new singer sing the same notes with different lyrics.

Today's AI still sounds like an untrained voice actor slurring his lines on purpose and it still sucks compared to audiophille standards, but your current robot sounding AI is dreadful by normal standards. You still haven't learned how to remove the noise?
https://www.youtube.com/watch?v=qu5nnMOQ4VU&ab_channel=A
https://www.youtube.com/watch?v=I1Dy0Zfw6Qs&ab_channel=votums

3.5 You probably didn't notice cause you're not a voice director or you're autistic but ... S1 and S2-S9 's voice directing is completely different. 90% of the dialogue lines used in S2-S9 used only these emotions; depressed, angry, flirty, ANXIOUS, TIRED, reading-off-a-script-at-gunpoint. And that's the acting ... the voices?

In S2+ everyone sounds...
Twilight sounds much lighter in S2+
Applejack & Dash sound much deeper and not in a suave way.
Pinkie sounds way lighter & screechier.
Fluttershy always sounds anxious

Rarity & Spike kinda sound the same.


4. One more thing...
Anonymous No.42197345
>>42197340
Fuck off retard.
Vogelfag revealed No.42197358 >>42197369 >>42197618
>>42197340

4.
Contact the original voice actors and work together with them. Give me S1 Woona's voice and all is forgiven on my side. ;) Can't say others will forgive you for being a weak leader. These effeminate pussies need a strong leader and I suggest you do too if you can't march down 11Labs HQ and sue the living shit out of them together with Tara Strong. Sounds jewish but that's the truth. You got to outjew the jew in a jewish world. Mrs Strong knows that. I know that. Why can't you fucking comprehend that?
https://youtu.be/wbzRRp2jRHw?t=103

This is what voice acting AI sounds like now:
https://www.youtube.com/watch?v=lPAtoR3YCSc&ab_channel=UndeadHumor
https://www.youtube.com/watch?v=0j1eX7F8OOo&ab_channel=DevilArtemis

BUT I'M GUESSING YOU ALREADY KNOW THAT YET YOU STILL REFUSE TO DO SOMETHING ABOUT IT.
Call your father or something for God's sake, you college pussy kid. Your technology is being stolen under your nose and improved upon tenfold(by jews, not your followers) and you're here moping like a pussy on Twitter and then coming back with a niche version that does 1 thing barely any better and still sucks dick at the other 9 things that goes into audio.
CAN YOUR MODEL AT LEAST SING RIGHT NOW? Cause SUNO's shit can and Udio used to sing good before they had to neuter it because the record companies were after their asses. Why aren't you after their asses as well?

God you need a father in your life, kid. A father to watch over you and learn to sue and break skulls for you cause jesus christ after that twitter whine ... you're still a pussy who refuses to BE A MAN AND SUE THE LIVING SHIT OUT OF ELEVEN LABS FOR STEALING YOUR MODEL. Give Tara Strong a call too. Do you want me to do it for you?

Respectfully yours, the redpiller known as Vogelfag.
Anonymous No.42197369 >>42197487
>>42197358
I uh... 15 maybe should've been a bit better at leading, but WOW this is kinda rough. But they say the truth hurts... wait, aren't we only operating under the ASSUMPTION that ElevenLabs stole his work though?
Anonymous No.42197463
oh boy the schizos are out now
Anonymous No.42197474
>>42197340
no ones reading that
Anonymous No.42197487
>>42197369
no one cares vogelfag
BGM No.42197553 >>42197804 >>42197812 >>42197816 >>42198118 >>42198418
>>42196867
https://u.pone.rs/HEiyutXb.mp3
Anonymous No.42197606
>>42195922
Btw
https://voca.ro/14Y5dHWMbMpx
Anonymous No.42197618 >>42197624
>>42197340
>>42197358
Your words are wasted on that idiot. 15. He was always a pretentious egomaniac and I'm glad the era where we didn't have any viable alternative is long gone. He's not even competing with the current opensauce options, let alone the paid ones.
Anonymous No.42197624 >>42197645
>>42197618
what are the opensauce alternatives
Anonymous No.42197645 >>42197653 >>42199403
>>42197624
https://github.com/effusiveperiscope/GPT-SoVITS
Anonymous No.42197653
>>42197645
isnt that what haysay uses but it doesnt sound as good as this though
Anonymous No.42197804
>>42197553
Anonymous No.42197812
>>42197553
Holy fuck. Please make a full length version of this.
Anonymous No.42197816
>>42197553
Anonymous No.42198118
>>42197553
Incredible. please keep going.
Anonymous No.42198418 >>42198427
>>42197553
Damn, am I going to have to help finish what I've started?
Anonymous No.42198427
>>42198418
Please, I’m begging you. Make more
Anonymous No.42198611 >>42199123
>>42195922
Great to have you back, the new website looks fantastic.
Some notes after a few hours of testing (mainly with Rainbow and Twilight on happy and neutral):

I noticed that speech will often sound unnatural with a "rough" sort of sound, especially at the end of sentences. It's been taking a lot of re-rolls to get outputs that sound natural throughout. As ever I'm finding it very hard to articulate exactly why a lot of outputs sound off or spot trends. Been thinking about what exactly to say here for quite some time but I think it'll be more effective to just use the report feature on any examples I come across from now on. The voices generally sound very accurate to the ponies and there's already plenty of good examples ITT, so the potential is clearly there.

Things like the Twilight #3 on the example page are common issues with the "rough" sound - "aviation AH0 N", "fly AY1", "fat AE1" "ground AW1 N D".
Pretty sure this was an issue in previous versions of 15.ai, particularly the tendency to slip up at the end of sentences.

Short sentences (~three words or less), especially when generated on their own with nothing before or after, are consistently bad.

"Anon" is often pronounced wrong, tends to get split into either "A Non" or "An On" and is spoken with a little break between them like they're two separate words.

I'm tentatively thinking that reliance on reference lines from the show to control delivery, emotion, pacing etc in the output (I assume that's what the model is doing) may not actually be the best idea. It's great if the reference line that gets picked happens to match how you want the output to sound, but more often than not it won't and you'll be totally boned if there's no match at all. Even if there is a reference line that matches, you'll still need to take the time to find it or rely on RNG for it to be used.
I won't speculate any further on this for now since I don't know exactly how the reference lines influence the model. Would be good if you could fill in some blanks here.

Not yet found any bugs with the site, but I do have some feature requests:
1 - An option to automatically play new audio as soon as generation is complete.
2 - A button on the outputs to immediately regenerate with the same settings.
3 - Report function is useful, suggest also adding a thumbs up icon or similar to highlight when the model does well.
4 - Not sure if it's my browser, but the download button always opens the audio in a new tab where I then have to click the three dots icon to download. All those extra mouse clicks quickly add up.

Hope that's helpful, you're doing great work here.
HydrusBeta No.42198701 >>42200231
>>42195922
Oh wow. Welcome back, 15! I am really happy to see you have a site back up, and the UI is slick.

>>42196337
>>42196341
Thank you for the kind words. I plan to keep Hay Say running. I am glad you have found it useful.
15 No.42199123
>>42198611
>"Anon" is often pronounced wrong, tends to get split into either "A Non" or "An On" and is spoken with a little break between them like they're two separate words.
This was because the dictionary had an incorrect transcription for "anon"; this has been fixed. If you run into any similar problems like this, you can report a transcription by hovering over the colored box and clicking the report button.
>1 - An option to automatically play new audio as soon as generation is complete.
>2 - A button on the outputs to immediately regenerate with the same settings.
>3 - Report function is useful, suggest also adding a thumbs up icon or similar to highlight when the model does well.
>4 - Not sure if it's my browser, but the download button always opens the audio in a new tab where I then have to click the three dots icon to download. All those extra mouse clicks quickly add up.
Done.
Anonymous No.42199403
>>42197645
15, is the model just GPT-SoVITS, but fine tuned on MLP?
Anonymous No.42199629
https://voca.ro/1mlZCjsv6tJ2
Dang, this is pretty good.
Anonymous No.42200231 >>42200380
>>42198701
>haysay is down
I am this close to considering selling my kidney for a good gpu
HydrusBeta No.42200380 >>42200397 >>42204296
>>42200231
What odd timing. Thanks for letting me know. The site should be back up now. The EC2 instance got in a weird state where it became unreachable again.
Anonymous No.42200397 >>42200690
>>42200380
The amazon anti-brony lobby is getting stronger by the day. btw what would the requirements be for haysay if I wanted to run it locally at its full capacity?
HydrusBeta No.42200690 >>42200707
>>42200397
Hay Say can run on most machines, but will be very slow on older hardware. I do not recommend running it on Apple silicon because it is very slow on that hardware (to the point that it's basically unusable). I recorded some benchmarks on several machines, which may give you a clue as to how long it will run on yours:
https://github.com/hydrusbeta/hay_say_ui?tab=readme-ov-file#testing-data--benchmarks
Having a GPU is not required.
HydrusBeta No.42200707
>>42200690
Oh, I forgot to mention that you need a LOT of hard drive space (about 100 GB now), and having at least 12 GB of RAM is recommended.
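A rough pre-flight check for those numbers, as a sketch only: the 100 GB and 12 GB thresholds come from the posts above, while the function names are made up for illustration and are not part of Hay Say. The RAM lookup assumes Linux (os.sysconf is not available on Windows), and a machine sold as "12 GB" may report slightly less, so treat a near miss as a warning rather than a hard failure.

```python
import os
import shutil

GB = 1024 ** 3  # bytes per gibibyte


def check_requirements(free_disk_bytes, total_ram_bytes,
                       need_disk_gb=100, need_ram_gb=12):
    """Return a list of human-readable problems (an empty list means OK)."""
    problems = []
    if free_disk_bytes < need_disk_gb * GB:
        problems.append(
            f"free disk {free_disk_bytes / GB:.1f} GB < {need_disk_gb} GB needed")
    if total_ram_bytes < need_ram_gb * GB:
        problems.append(
            f"RAM {total_ram_bytes / GB:.1f} GB < {need_ram_gb} GB recommended")
    return problems


def total_ram_bytes():
    """Total physical RAM in bytes (assumption: Linux or another Unix with sysconf)."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")


def free_disk_bytes(path="."):
    """Free space on the drive that holds `path`."""
    return shutil.disk_usage(path).free
```

Usage would be something like check_requirements(free_disk_bytes("/path/to/install"), total_ram_bytes()), then printing whatever comes back.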
Anonymous No.42201300 >>42201855
Up.
Anonymous No.42201855
>>42201300
Anonymous No.42202578
>back to being dead
come on
Anonymous No.42203845 >>42203862 >>42204138
https://x.com/fifteenai/status/1924269599542968655
Anonymous No.42203862
>>42203845
Anonymous No.42204138 >>42204139
>>42203845
>Discord server
Kek.

>I just added 4 more GPU servers because of the huge number of requests coming in. This is actually going to bankrupt me.
You know, you could just... open source it?
Then you wouldn't have to pay for any of it, you wouldn't be expected to constantly maintain it (this has been a recurring issue, let's be honest), and you would meet your original promises.
Anonymous No.42204139 >>42204140
>>42204138
shut up retard
Anonymous No.42204140 >>42204141
>>42204139
>t. 15
Anonymous No.42204141
>>42204140
shut up retard
Anonymous No.42204296 >>42204476
FYI, GPT-SoVITS v4 came out.
While v3 downgraded the quality, they boosted it back to 48 kHz and it arguably sounds much more natural.
There's a good report here: https:// 8 chan.moe/ais/res/6258.html#q11121
>Ref: https://voca.ro/13vsNeBHC2Xu
>Best result I got from v4: https://voca.ro/1j2I5rUzAZxj
>Same example with v2 (the end was cut due to my shitty api): https://voca.ro/11qFHhR7HtG1
This is the only comparison I've heard so far though, seems like the release was received very quietly. Needs to be tested more.

>>42200380
If you could look into adding v4 to Haysay (assuming it does hold up with pony voices), that'd be much appreciated.
Anonymous No.42204454 >>42204478
>>42195922
You make a cute couple.
Anonymous No.42204476
>>42204296
Trying it out.
Oh boy, new setting under SoVITS Training. Guess I'm leaving that at the default 32 for now.
Anonymous No.42204478
>>42204454
>hecking mare
>she/pony
He's just having a laugh, r-right?
Anonymous No.42204529 >>42204536
Inb4 they ban saying bad words with the ai
Anonymous No.42204536 >>42206804
>>42204529
discord can ban over stuff like saying nigger iirc if people report it
I doubt any text restriction will be imposed but its understandable you dont want kids spamming nigger word in the discord
Anonymous No.42204550
>>42195922
thank you for your service, king
Anonymous No.42204573
>>42195922
Cool that you're back. Though it's a bit odd that you say 15.dev is provided only for non-commercial use, then license the outputs under CC BY-SA 4.0, which explicitly permits commercial use. Shouldn't the outputs be licensed under CC BY-NC or BY-NC-SA instead, in line with your earlier statement that the site is to be used non-commercially?
Anonymous No.42204939
>>42195922
ya taking on new voice datasets or only retraining the old ones?
Anonymous No.42205033 >>42205228
https://huggingface.co/OuteAI/OuteTTS-1.0-0.6B
Anonymous No.42205228
>>42205033
Their twitter examples are a bit meh sounding. I'm guessing the wow factor comes from the fact that it can work with 14 different languages. Would be really nice if I could take a voice dataset from a foreign dub and use it for English.
Anonymous No.42205387
If you're still lurking, Vul, thank you for making that sfx_sep_v2 filter for vocal remover. This stuff is so bloody helpful in prepping the audio.
Anonymous No.42205939
>>42195922
Holy shit, only noticed it now. I don't know what changed for the site to make a comeback, but it's nice to see it again.
Anonymous No.42205963
>>42195922
Did a bunch of work with Rarity today, mainly with happy emotion, and notably found that I tended to get better results when I turned the temperature way down, 0.2-0.4. Tried that with the rest of the Mane 6 but Rarity seemed to be the only one to significantly benefit, Twilight and Rainbow in particular still sound "rough" almost all the time no matter what I do.
Even so, Rarity's improvement is significant enough that I'd suggest everyone experiment with adjusting the temperature, there may be an optimal value for each character that I've not found yet.

Short inputs continue to be a problem, even short sentences that are part of a longer input - reported a bunch of instances of words being mispronounced, weirdly elongated and even skipped entirely.

Also had a few times where the page froze when I switched tabs to do other stuff while waiting for generations to complete.

Could you unlock the quality slider at least in the faster direction? I'm finding generation wait times to be the main bottleneck right now and would like to give that a try. Perhaps also allow larger batch size when faster quality options are selected too.
Anonymous No.42206122 >>42206425
>>42195922
>no more emotional contextualiser (the selections are a decent sidegrade I guess but come on it was much cooler)
>still using arpabet despite even resolving the IPA
>AI guesses what I want it to say if it's not in the dictionary instead of just phonemising the words because I know what I want it to say
why

>Moondancer
Bless you, sounds like shit tho
Anonymous No.42206425 >>42207713
>>42206122
https://files.catbox.moe/tu4s0l.mp3

How's this?
Anonymous No.42206804
>>42204536
Fair, but I will never trust someone with a mental illness flag in the bio.
Anonymous No.42207158
Sup, got a sudden inspiration to get the Clone Wars narrator's voice trained. Not a pony model, but I feel like it could get some good use in the future anti clips.
>https://huggingface.co/Amo/RVC_v2_GA/tree/main/models/Star_Wars_Clone_Wars_Narrator_v2
https://files.catbox.moe/bjljdm.mp3
Not 100% happy with it as the input needs to have that specific "umpf" energy to it.

>https://huggingface.co/Amo/GPT-SoVITS-v2/tree/main/Clone_Wars_Narrator_v2_so96_gpt24
Gpt-Sovits, wavs included.
https://vocaroo.com/1oycsmzwxgVy
https://vocaroo.com/12qbwj4NK8XP
https://vocaroo.com/1fBcauUi9ZIP
Due to the pronunciation script some words sound pretty weird, but nothing a little bit of editing can't fix.
Anonymous No.42207220 >>42207363 >>42310164
>>42195922
now all i need to do is figure out how to make ponies moan
Anonymous No.42207243
Open source that shit 15
Anonymous No.42207363 >>42251484
>>42207220
One step ahead of you.

https://files.catbox.moe/7wktvb.mp3

All I did was enter "AAAAAAAAAAAAAAAA!" and the moaning just kinda happened.
Anonymous No.42207459 >>42207472 >>42207477 >>42207518
15 for the love of God find a volunteer to do your PR. You called a random Hasbro employee pathetic; that is not something you should do if they are inquiring about your service, despite how obnoxious the cocksucking corpo suits are. Being aggressive like that isn’t doing anyone any favors
Anonymous No.42207472 >>42207479
>>42207459
All hasjew employees deserve and should be publicly mocked.
Anonymous No.42207477 >>42207483
>>42207459
>you called a random Hasbro employee pathetic
are you retarded perhaps
Anonymous No.42207479 >>42207484
>>42207472
Yeah I call them retarded niggers off the mic but when you’re face to face with them you shouldn’t let that go out.
Since 15 is a stemfag gook I wasn’t expecting diplomacy and social skills from him but this is actually crazy, no one cares about your inbox.
Anonymous No.42207483 >>42207484 >>42207488
>>42207477
Even if that was a scammer like who the fuck cares nobody cares about your inbox nigga
Anonymous No.42207484
>>42207479
>>42207483
I care though, this is funny and based as fuck
Anonymous No.42207488 >>42207493
>>42207483
Repeat 30 more times about how much you don't care.
Anonymous No.42207493 >>42207525
>>42207488
Settle down 15 minion you have a server to moderate
Anonymous No.42207518 >>42207580
>>42207459
He wasn't even calling Hasbro employees pathetic though? It was some random guy trying to snitch by CC'ing all these people.
Anonymous No.42207525 >>42207527
>>42207493
You're the retard sending the e-mail, got it.
Anonymous No.42207527
>>42207525
Finger pointing like that isn’t healthy tranny
Anonymous No.42207576 >>42207705
https://x.com/UnslothAI/status/1924848135991656603
Anonymous No.42207580
>>42207518
This, wtf is anon talking about
Anonymous No.42207705
>>42207576
once again, it's all written like the next breakthrough in technology but nobody is posting any examples at all, not even cherry-picked ones.
Anonymous No.42207713 >>42208791
>>42206425
Still bad, just compare to any actual Moondancer speaking. I'm not knowledgeable enough to describe exactly how it's wrong, but it's too deep and not "light" enough?
twiggles !!ofIYxlKABKS No.42208577 >>42208651
it's been six fucking years, jesus christ. i still can't believe how big this project got
Anonymous No.42208651 >>42208841 >>42208906
>>42208577
it was dead for a while but only recently started becoming alive again
Anonymous No.42208791
>>42207713
Ah, okay. I thought it was a matter of quality and not the voice itself. But you're right, it's not as light as her in the show...
Anonymous No.42208841
>>42208651
https://www.youtube.com/watch?v=730zGRwbQuE
Indeed, it has been a bumpy few years, yet in the end the infinite power of ponies will prevail over all hardships.
Anonymous No.42208906
>>42208651
It's good to see it getting some steam. This is far too potent to let it fall to pieces.
Anonymous No.42209071 >>42209113 >>42209433 >>42211348
Someone on the server wanted to get 15 to censor the swear words from the site. Say it with me...FUCK no!
Anonymous No.42209113
>>42209071
>Hey everyone look at what some nobody said on my Discord!
No one here cares about social media drama. Keep it in Discord and out of here
Anonymous No.42209433 >>42209975
>>42209071
This is why you don't cozy up to Discord groups. They'll try to corrupt you every time.
Anonymous No.42209975
>>42209433
True.
Anonymous No.42210605 >>42211063 >>42211888
Up.
Anonymous No.42211063
>>42210605
aaaaaa!
Anonymous No.42211348
>>42209071
Gee, what a surprise.
Anonymous No.42211888 >>42212568
>>42210605
Anonymous No.42212018
>https://files.catbox.moe/asxfuv.mp3
Anonymous No.42212568 >>42221623
>>42211888
Anonymous No.42213584 >>42214071
close enough welcome back uberduck discord
Anonymous No.42214071
>>42213584
>uberfuck
No thanks.
Anonymous No.42214224 >>42214240
>15 is back
>still dead
It's over
Anonymous No.42214240 >>42214251 >>42214278 >>42214680
>>42214224
15.ai isn't really good enough to revive any interest after the novelty of making ponies say nigger wears off.
Anonymous No.42214251
>>42214240
ok goku
Anonymous No.42214278
>>42214240
https://u.pone.rs/OWiJmVGB.mp3
Anonymous No.42214680
>>42214240
>Dashcon
Comparing a literal scam to 15 is plain retarded.
ThunderShy No.42214718 >>42215142
Hello fags made a new ai skit with 15.ai its good to be back
https://files.catbox.moe/29w2tt.mp4
Anonymous No.42215142
>>42214718
Comedy bros, where are you?
Anonymous No.42215735 >>42217176
https://files.catbox.moe/aov4vh.mp3
Anonymous No.42216763 >>42217954 >>42224989
>15 service re-emerges
>Typing rapidly ensues
>Old prompt tricks still draw out the mysterious liminal echoes of the mare
These digital equines have the most fascinating voices

Compilation of Liminal Trixie sounds
https://files.catbox.moe/gznbmc.mp3
https://files.catbox.moe/spb6zv.mp4
Anonymous No.42217176
>>42215735
What was that quote? I can't remember where it came from.
Anonymous No.42217954
>>42216763
moonbase trixie
Anonymous No.42218072 >>42218629 >>42218755 >>42219570
I need more lewd moans. Gasps, sighs, groans, chirps, murmurs, mewlings, etc.
Anonymous No.42218629
>>42218072
I have an audio pack with random moans, give me few minutes to upload it
Anonymous No.42218755 >>42219017 >>42256762 >>42269579
>>42218072
NTA but here's a couple more Liminal Trixie noises.
A few grunts, laughs, even some coughs and various others.
https://files.catbox.moe/q8g80w.mp3
Anonymous No.42219017
>>42218755
I'm surprised no one has done something with that.
ThunderShy No.42219092
@hydrusbeta, what happened to the synth app? It's not working. Also, would it be possible to add a direct link to it on the haysay website?
Anonymous No.42219523 >>42219530
>Servers down
>twitter account gone
Permission to panic, sir?
Anonymous No.42219530 >>42221117
>>42219523
False alarm, twitter was just fucking itself up again.
Anonymous No.42219570
>>42218072
https://u.pone.rs/PRpOFwQp.001
>SpecialPacks_.zip.001
https://u.pone.rs/vyoSUmbo.002
>SpecialPacks_.zip.002
https://u.pone.rs/WFVSxEXw.003
>SpecialPacks_.zip.003
https://u.pone.rs/mGedaJTp.004
>SpecialPacks_.zip.004
https://u.pone.rs/uGmobBkJ.005
>SpecialPacks_.zip.005

Rename the downloaded files to the filenames quoted below each link. It's a 2.27GB mix of various sounds from ASMR, hentai games and some other gooning sources. Do use RVC to make them pony related.
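If you don't have an archiver that understands .001/.002 parts, the pieces can usually be stitched back together by hand. A minimal sketch, assuming the parts are plain byte-for-byte splits of one zip (which is how 7-Zip style .001 volumes normally work, but that's an assumption about this upload) and that you've already renamed them as described. The function name is just for illustration.

```python
from pathlib import Path
import shutil


def join_parts(folder, base_name="SpecialPacks_.zip"):
    """Concatenate base_name.001, base_name.002, ... into base_name."""
    folder = Path(folder)
    # Sort by the numeric suffix so the parts are appended in order.
    parts = sorted(folder.glob(base_name + ".[0-9][0-9][0-9]"),
                   key=lambda p: int(p.suffix[1:]))
    if not parts:
        raise FileNotFoundError(f"no parts named {base_name}.NNN in {folder}")
    out_path = folder / base_name
    with open(out_path, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                # Stream each part so the whole archive never sits in memory.
                shutil.copyfileobj(src, out)
    return out_path
```

Run join_parts("path/to/downloads") and then open the resulting SpecialPacks_.zip as usual. If it refuses to open, the split was probably made some other way and you'll want a proper archiver instead.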
VilligerANON No.42220345 >>42220536 >>42220864
I'm preparing to train MLP models with GPT-soVITS v4
Which mare should I start with?

>Yes, I'll add the precomputed values from Haysay, once I make the WebUI.
Anonymous No.42220536 >>42221644
>>42220345
Applejack is a good baseline to test out accent retention and character similarity. Otherwise, testing more unique voices like Queen Chrysalis would better determine how well the model replicates the intended voice without falling back too much on similar but generic voices.
Anonymous No.42220864
>>42220345
I'd be curious to know what effect the LoRA Rank has on the models, and which one is ideal for what datasets.
Anonymous No.42221117
>>42219530
Phew.
Anonymous No.42221623
>>42212568
VilligerANON No.42221644
>>42220536
What pretrained English model was it finetuned on?
Anonymous No.42222130 >>42223953
>10
Anonymous No.42222460 >>42222472 >>42224830
>Prompts various commas and apostrophes to get hidden mare noises.
>Lyra: "Ew, I think it's some sorta booger or something"
Wow, these mares have some fascinating interpretations.

>https://files.catbox.moe/whb1r5.mp4
>https://files.catbox.moe/xqvrlq.wav
Anonymous No.42222472
>>42222460
The interface is really stylish.
Anonymous No.42222730
>https://huggingface.co/Amo/GPT-SoVITS-v2/blob/main/TreeHugger_so96_gpt24/wavs.zip
>This file is vulnerable to threat(s) PAIT-ARV-100.
Could somebody with a good quality antivirus scan this zip and the files inside of it? It's probably a false positive but I want to be sure this won't mess with my pc.
Anonymous No.42223265 >>42223566 >>42224786 >>42261401
https://unmute.sh/

Found this, apparently they're gonna open source the text and speech models soon, but for now, you can supply a ten second voice clip of anyone you want and speak with them on a variety of topics.
Anonymous No.42223566
>IMS Toucan - tts 7000 Languages
>https://github.com/DigitalPhonetics/IMS-Toucan
>https://huggingface.co/spaces/Flux9665/MassivelyMultilingualTTS
I think this was posted a few years back. I noticed they updated the Hugging Face page about two weeks ago, and after a few minutes of testing it seems to be working. However, while the quality of the voices is above MS Sam and the noisy talknets, the way the TTS talks still feels very artificial.
The voice cloning option seems to be broken, so that sucks. However, the fact that it can generate voices at light speed and even has built-in options for CPU usage means that it could run on potato-tier equipment without problems.
So it's not something useful for now, but there is always the possibility that somebody else could take it and improve it (imagine Fluttershy teaching you how to speak moonrunes).
>>42223265
Thank you for sharing that, Anon. Also holy fuck, this works like pure magic. I just gave it a 9s audio clip of really low quality, ripped from a game, and it was able to replicate the voice without the shitty de-reverb pollution and background buzzing noise AND while keeping the accent consistent. On top of that, I was able to double the number of voice lines this character has ever spoken, so that's a massive plus for making artificial datasets.
>apparently they're gonna open source the text and speech models soon
With this kind of tech there wouldn't be a need to train full models, since bare-bones TTS can be done with a 10s clip and less than 5m of waiting for the voice to be cloned. Man, I remember way back in mid-2020 when people talked about this tech and pretty much everybody agreed that cloning voices from 10s of audio would never sound natural or even good. How times have changed.
Anonymous No.42223953 >>42224543
>>42222130
Anonymous No.42224543
>>42223953
Anonymous No.42224786
>>42223265
I tried to see if it could recreate a voice from 3s of Woona audio but sadly that was a no-go (I even tried duplicating the clip to pad it out to 10s). I'm guessing the high-pitched distress in the clip is messing with their process, or they need a minimum of 6s of audio to work out how to duplicate it.
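For anyone who wants to repeat the padding step, here's a minimal sketch of looping a short clip until it reaches a minimum length. It assumes an uncompressed PCM .wav (the only kind Python's stdlib wave module reads), the function name is made up, and, as noted above, padding this way did not actually help the clone in my case.

```python
import wave


def tile_wav(src_path, dst_path, min_seconds=10.0):
    """Repeat a short PCM WAV clip until it lasts at least min_seconds."""
    with wave.open(str(src_path), "rb") as src:
        params = src.getparams()
        frames = src.readframes(params.nframes)
    if params.nframes == 0:
        raise ValueError("source clip is empty")
    target_frames = int(min_seconds * params.framerate)
    # Ceiling division: the smallest whole number of repeats that reaches the target.
    repeats = max(1, -(-target_frames // params.nframes))
    with wave.open(str(dst_path), "wb") as dst:
        dst.setparams(params)
        # The frame count in the header is corrected automatically on close.
        dst.writeframes(frames * repeats)
    return repeats * params.nframes / params.framerate  # new duration in seconds
```

The clip is repeated whole, so the result can overshoot the target a little (a 3s clip padded to 10s comes out at 12s).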
Anonymous No.42224830 >>42224962
>>42222460
This is what I've got instead. I really dig the giggle in the first one.
https://vocaroo.com/154R3gQLRpG1
https://vocaroo.com/1eOcqD52A2pm
Anonymous No.42224962
>>42224830
>https://files.catbox.moe/etzhiu.mp4
>Chrysalis: "(forceful exhales x3), We should take the magic inside it. You know how powerful Discord was."
Guess with limited-to-no other speech input, it does fall back a lot on the Reference Text as seen in the Advanced Model Details. No wonder so many Trixie attempts had her mumbling about a good night's sleep. Less random than initially suspected.

I wonder how the model would behave if we were able to remove or modify the underlying quote(s) during synthesis, though I'm sure it's likely integral to retaining its accuracy. Come to think of it though, it would be nice to be able to select specifically what underlying reference line it's using prior to generation so that you have more chances of getting a desirable output similar to it. Could mean less resource usage too.
Anonymous No.42224989 >>42225760
>>42216763
what tricks did you use?
Anonymous No.42225053 >>42225760
>https://github.com/PasiKoodaa/ACE-Step-RADIO
I've stumbled upon the above GitHub project. It uses the ACE-Step music model to create a constant stream of AI music, replicating what online radio websites do; it requires 16GB of VRAM. The outputs are still so-so, but given that text-to-song models are only about a year old there is plenty of room for improvement. Also, I would love to see a setup where these models sing with proper poni voices from the get-go (or with the help of LoRAs).
Anonymous No.42225301 >>42225760
>Stable Audio Open Small
>Weights: https://huggingface.co/stabilityai/stable-audio-open-small
>Paper: https://arxiv.org/abs/2505.08175
>Arm learning path: https://learn.arm.com/learning-paths/mobile-graphics-and-gaming/run-stable-audio-open-small-with-lite-rt
Huh, a model that's only around 2GB? Nice to see them notice that not everybody has an endless bag of cash to spend on the newest and largest GPUs. Sadly it still only outputs instrumentals at lower-tier quality (at least in comparison to what's already out there).
Apparently it can run 30% faster than realtime.
Anonymous No.42225760
>>42224989
Mostly the aforementioned ,',' trick, which in older pre- "dev" versions of 15 used to be able to do a lot more lewd noises and such. Used to have a text doc with a handful of other tricks used with it, but it must be on one of my older OS drives. Still serves to force further areas of silence, which in turn can allow hallucinations and other AI weirdness to creep in on purpose.
>>42225053
>16GBs Vram
Still seems out of the memory budget of most anons, unless it could be optimized down to at least half that with minimal loss. Even if it were finetuned on mares, without optimization I can't imagine many being able to utilize it for synthesis.
>>42225301
>Very small model
>Lower quality
To be expected I suppose, but at least it's something usable for local synthesis and playing around with, aside from maybe Bark; which I should honestly revisit. Just a shame they completely abandoned it after becoming monetized in the form of Suno. Still open source like Stable Audio is however.
Anonymous No.42226198 >>42227813
Up.
Anonymous No.42226927
mares
Anonymous No.42227190
rears
Anonymous No.42227393
https://u.pone.rs/pBgJHLQr.wav
Anonymous No.42227813 >>42229510
>>42226198
Anonymous No.42228108 >>42228215 >>42228246
Claims to do SOTA zero-shot voice cloning TTS with powerful control
https://github.com/resemble-ai/chatterbox
Anonymous No.42228215
>>42228108
From a 20s voice sample: https://litter.catbox.moe/w54fxs.wav
Anonymous No.42228246
>>42228108
I've tested it with a few voices. It seems to handle some without any problems but totally struggles with others (seems to depend on how much the accent/pronunciation deviates from a standard way of speaking). Sadly, I can confirm that this model is also unable to clone Woona's voice.
Anonymous No.42229510 >>42232233
>>42227813
Anonymous No.42229871 >>42230283
Music Source Restoration
https://arxiv.org/abs/2505.21827
>We introduce Music Source Restoration (MSR), a novel task addressing the gap between idealized source separation and real-world music production. Current Music Source Separation (MSS) approaches assume mixtures are simple sums of sources, ignoring signal degradations employed during music production like equalization, compression, and reverb. MSR models mixtures as degraded sums of individually degraded sources, with the goal of recovering original, undegraded signals. Due to the lack of data for MSR, we present RawStems, a dataset annotation of 578 songs with unprocessed source signals organized into 8 primary and 17 secondary instrument groups, totaling 354.13 hours. To the best of our knowledge, RawStems is the first dataset that contains unprocessed music stems with hierarchical categories. We consider spectral filtering, dynamic range compression, harmonic distortion, reverb and lossy codec as possible degradations, and establish U-Former as a baseline method, demonstrating the feasibility of MSR on our dataset. We release the RawStems dataset annotations, degradation simulation pipeline, training code and pre-trained models to be publicly available.
https://github.com/yongyizang/music_source_restoration
https://huggingface.co/datasets/yongyizang/RawStems
https://huggingface.co/yongyizang/MSR_UFormers
Github repo isn't live yet. might be cool for audio stuff
Anonymous No.42230283
>>42229871
This could be pretty useful in combination with the ACE Step song converter. If a song can have both the vocals separated and the instrumentals separated into their own tracks, I would imagine that would help with modifying it into a different style of music.
At the very least it would be nice to use it to fix the weird effects that vocal removing programs are imprinting on the instrumental files.
Anonymous No.42231332 >>42232530
ten
Anonymous No.42232233
>>42229510
Anonymous No.42232530 >>42236397
>>42231332
Anonymous No.42233126
>https://u.pone.rs/beZAfsQC.mp3
motivational Trixie
Anonymous No.42234667 >>42235711
Saved
Anonymous No.42235711
>>42234667
Precautionary bump.
Anonymous No.42236397 >>42237262
>>42232530
Anonymous No.42237262 >>42238294
>>42236397
Anonymous No.42238294 >>42238306
>>42237262
Anonymous No.42238306 >>42239361
>>42238294
Anonymous No.42239361
>>42238306
Anonymous No.42239410 >>42240013 >>42240318
Well, the twelve hours after 15 returned was fun I guess. Now back to this bullshit.
Anonymous No.42240013
>>42239410
He's gunna hurl if he keeps that up.
Anonymous No.42240318 >>42240322
>>42239410
Which one?
Anonymous No.42240322 >>42240779
>>42240318
The bumping kind.
Anonymous No.42240779 >>42241436
>>42240322
The bumping loyal
Anonymous No.42241436 >>42243302
>>42240779
Let me bump the thread of my people.
Anonymous No.42243302 >>42245664 >>42248445
>>42241436
>>42241979
Anonymous No.42243816 >>42244249 >>42244607 >>42247951
https://openaudio.com/blogs/s1
The .5b mini version will be open sourced
Anonymous No.42244249 >>42244607
>>42243816
Hmm, would be nice if there was a demo WITHOUT music; I assume they put it in to hide the lower quality. But at .5B size this thing should technically be able to run in a phone-sized environment, so that's neat.
Anonymous No.42244607
>>42243816
>>42244249
Neat indeed, but it's a shame they don't have any audio examples of either version (on that page at least). Hard to really get a feel of it when there's nothing to gauge or judge.
Anonymous No.42245664
>>42243302
Indeed.
Anonymous No.42247180 >>42247653
Scootaloo Scoot-Scootaloo.
Anonymous No.42247653
>>42247180
Someone said chicken?
Anonymous No.42247951 >>42247952 >>42250902
>>42243816
https://huggingface.co/fishaudio/openaudio-s1-mini
Anonymous No.42247952 >>42248049
>>42247951
OpenAudio S1 supports a variety of emotional, tone, and special markers to enhance speech synthesis:

1. Emotional markers: (angry) (sad) (disdainful) (excited) (surprised) (satisfied) (unhappy) (anxious) (hysterical) (delighted) (scared) (worried) (indifferent) (upset) (impatient) (nervous) (guilty) (scornful) (frustrated) (depressed) (panicked) (furious) (empathetic) (embarrassed) (reluctant) (disgusted) (keen) (moved) (proud) (relaxed) (grateful) (confident) (interested) (curious) (confused) (joyful) (disapproving) (negative) (denying) (astonished) (serious) (sarcastic) (conciliative) (comforting) (sincere) (sneering) (hesitating) (yielding) (painful) (awkward) (amused)

2. Tone markers: (in a hurry tone) (shouting) (screaming) (whispering) (soft tone)

3. Special markers: (laughing) (chuckling) (sobbing) (crying loudly) (sighing) (panting) (groaning) (crowd laughing) (background laughter) (audience laughing)
Anonymous No.42248049
>>42247952
>Emotional markers
Interesting, hopefully there will be a decent UI and training for it
Anonymous No.42248445
>>42243302
Anonymous No.42248854 >>42249714
Anonymous No.42249353 >>42250054 >>42253016
Anonymous No.42249714
>>42248854
>bump rump
Would pump.
Anonymous No.42250054
>>42249353
Pretty bump mare. Totally would.
Anonymous No.42250109 >>42250437 >>42250547 >>42251801 >>42253112 >>42254373
>15 crawls back to bait patreon donos with his half-baked model where most emotion choices result in unintelligible noise
>11 releases a new alpha that wipes the floor with his crusty garbage less than a month later
https://elevenlabs.io/v3
holy fucking kek! maybe there is a god.
Anonymous No.42250437
>>42250109
yeah but unlike fifteen, eleven labs cost money
Anonymous No.42250547 >>42250592
>>42250109
? elevenlabs doesn't have ponies, how is this a comparison
Anonymous No.42250558
Remember not to give goku the attention he wants
Anonymous No.42250592 >>42250593
>>42250547
you have to train your own models on there you retard mcspazatron
Anonymous No.42250593
>>42250592
yeah is it any good though, last I tried to train ponies it wasn't very good
Anonymous No.42250902 >>42260216
>>42247951
Anybody had a chance testing this thing out? Due to bullshit reasons I'm kind of stuck phone posting but I do want to know if it's any good.
Anonymous No.42250941 >>42251141
https://github.com/RVC-Boss/GPT-SoVITS/releases/tag/20250606v2pro
https://github.com/RVC-Boss/GPT-SoVITS/wiki/GPT%E2%80%90SoVITS%E2%80%90features-(%E5%90%84%E7%89%88%E6%9C%AC%E7%89%B9%E6%80%A7)
Anonymous No.42251141
>>42250941
>for 50 nvidia series
so wait, are the new models exclusive to the 50 series or just optimized for use on that hardware?
Anonymous No.42251484
>>42207363
I tried that and all it did was make Rarity do pokemon noises.
https://files.catbox.moe/1ryvaz.wav
https://files.catbox.moe/qivs4r.wav
also sometimes the AI interpretation (wish we could turn that off) says "Triple A" https://files.catbox.moe/72xgzt.wav
Anonymous No.42251801
>>42250109
>elevenfags
Miss me with that shit.
Anonymous No.42252543
Anonymous No.42252595 >>42253243
I found some free audio processing plugins, I'll be loading these in (((audacity))) to auto-process my dataset. I haven't tried it yet, but it seems promising, like a publicly released version of iZotope:
https://archive.org/details/accusonus-era-bundle-v-6.2.00
They made it public before going out of business. I might reply to the anchor if it gives a good result.
Anonymous No.42253016
>>42249353
Anonymous No.42253112
>>42250109
I wonder (((who))) could be behind this post.
Anonymous No.42253243 >>42287174
>>42252595
Interesting, could you post some examples here?
Anonymous No.42254319
Mares?
Anonymous No.42254373 >>42254415
>>42250109
gptsovits wipes the floor with 15 shitty model already, no need to bring the big guns
Anonymous No.42254415 >>42254417
>>42254373
stop samefagging, your broken english is too noticeable at this point
Anonymous No.42254417 >>42254418
>>42254415
You wish I was samefagging retard
Anonymous No.42254418
>>42254417
hahahah
Anonymous No.42255207
Electric mares?
Anonymous No.42255248
43/64 on pl_marewater
Anonymous No.42256668 >>42256717
For characters with lots of voice lines like Spike and Twilight, if I'm using my own voice, what's the best option to choose on Haysay to sound good?
Anonymous No.42256717 >>42256739
>>42256668
RVC is the current gold standard as far as Haysay goes for speech-to-speech.
Anonymous No.42256739
>>42256717
It's not quite getting the intended result. Should I set voice envelope high or low? https://voca.ro/1iHl7ZMvk5Qm
Anonymous No.42256762 >>42256785 >>42256785 >>42256871
>>42218755
What settings did you use here? Sounds pretty good.
Anonymous No.42256785 >>42256871
>>42256762
If you're trying to get non-vocals out of the voice-to-voice, it's not gonna work great.
>>42256762
Those were generated with 15.ai, probably the best option if you don't need voice to voice functionality and just want lewd pony noises.
Anonymous No.42256871 >>42257202
>>42256762
>>42256785
Mostly default settings. Varying the temperature occasionally. Liminal mares also make all sorts of noises, not just lewd. I can easily imagine them being used as vocal SFX for pony videogames or something, maybe an episode or animation, like a mare drips onto the ground and the grunt is entirely synthetic and not recycled audio from the show.

https://files.catbox.moe/7rx7zi.mp3
Anonymous No.42257202 >>42257362 >>42257426 >>42257521
>>42256871
>https://files.catbox.moe/7rx7zi.mp3
These sound like Trixie is doing Link moves.
Anonymous No.42257362 >>42257428
>>42257202
Abstract mare sounds are abstract. Sadly Rvc is still the king of getting quality lewd sounds, but I still wish we had a nice tts alternative.
Anonymous No.42257426 >>42257521
>>42257202
Huh, yeah, this really makes me want to work on my 3D modelling again... although Godot's 3D capabilities are still not great.
Anonymous No.42257428 >>42257432
>>42257362
Is there a place I can upload multiple audio files for easy playback? I wanted to show off what I managed with the TTS on haysay.
Anonymous No.42257432 >>42257437
>>42257428
pone.rs
Anonymous No.42257437 >>42257521
>>42257432
Thanks. Too bad it doesn't stream playback....

https://u.pone.rs/reZpBwHV.wav (Twilight)
https://u.pone.rs/cBNqloOa.flac (Spike)
Anonymous No.42257521 >>42258274
>>42257202
Could totally imagine a game with Trixie acting as the hero of Hyrule.
>>42257426
Damn, haven't heard Godot in a hot minute. I really need to find time and motivation to actually get into that myself. Keep telling myself that though. Sadly free time and hobbies don't pay bills.
>>42257437
>doesn't stream playback
You mean like, play it in a browser? Because usually mp3 is supported in that way.
Anonymous No.42257919
Up.
Anonymous No.42258274 >>42258298
>>42257521
Yeah, I know what you mean, though I'd say getting those skills can be valuable. Personally, I wish I didn't mentally check out of a tutorial after like 30 minutes because most of them need a good hour or more to really get into the meat of it, and even taking notes, it feels like I'm not retaining it well.
Anonymous No.42258298 >>42258349
>>42258274
I would recommend the YT channel TheRoyalSkies. All his videos (with some rare exceptions) are between one and five minutes long, always getting to the point instead of flapping about some bullshit and settings. The only downside is they are usually aimed at people who are already a little bit above total 0xp noobie beginners, but it's still good stuff.
Anonymous No.42258349 >>42258805
>>42258298
Oh, they have Cascadeur videos. I was wondering if that was usable with quadrupeds too...
Anonymous No.42258805
>>42258349
never used that addon/function, but I would imagine anything that is not a humanoid with standard two arms and legs will require lots of custom rigging.
Anonymous No.42260216 >>42260489
>>42250902
Thread tourist here, it's breddy gud for being local. I've been running it on a 3060 with no issue, takes about twice as long as real time but the 44.1kHz fidelity is incredible. Also the voice cloning accepts up to 90 seconds of input, with possibly more but I have yet to test that.
My main criticism is that for longer gens upward of a minute or more, the voice gets kinda washed out in a way, but you can easily circumvent that by just splitting your text into chunks.
Here's some examples I genned:
Cum Zone guy quoting Ozymandias (my favorite gen, nearly indistinguishable from real VA) https://vocaroo.com/1ngXhfejJwoB
Gilbert Gottfried navy seals (you can hear the voice getting washed out towards the end) https://vocaroo.com/1n6SZbrHzKZ1
Michael Rosen pulp fiction (it can mispronounce capitalized words, storage is pronounced as sturgeon) https://vocaroo.com/1ov76WqTjIUY
I'd say it's elevenlabs-tier, even if that comparison is now outdated because of their new model.
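For the chunking workaround mentioned above, here's a rough sketch of splitting text on sentence boundaries before feeding it to the model. This is only an illustration: `split_into_chunks` and the 400-character default are made up for the example, not anything from the OpenAudio repo, and the right chunk size is something you'd have to find by ear.

```python
import re


def split_into_chunks(text, max_chars=400):
    """Group whole sentences into chunks of at most max_chars characters.

    Hypothetical helper: a single sentence longer than max_chars is kept
    as its own oversized chunk rather than being cut mid-sentence.
    """
    # Split after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow, so close the chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk would then be generated separately with the same reference audio and the resulting clips joined afterwards.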
Anonymous No.42260489 >>42261547
>>42260216
for a zero shot model it's surprisingly decent. In their GitHub, do they provide a UI with emotional control or is it just bare minimum of "audio reference in, tts out"?
Anonymous No.42260920
https://github.com/fluxions-ai/vui
https://huggingface.co/fluxions/vui
has voice cloning ability
>You can clone with the base model quite well but it's not perfect as hasn't seen that much audio / wasn't trained for long
Anonymous No.42261160 >>42261401 >>42262716 >>42264548
What's the best tts for mares? I know elevenlabs is the best overall but I'm wondering how good it is for ponies
Anonymous No.42261401
>>42261160
For local operation, it's still GPT-SoVITS. I don't use paid online services so lmao on that one.
>>42223265
But I guess this one could beat it, once they make it public. Having their TTS model integrated with SillyTavern would honestly kick some serious ass.
Anonymous No.42261547 >>42261622
>>42260489
There's emotion control to a degree, you just put one of the tags in parentheses at the start. There's only a limited amount of valid tags and it can only go so far, and I haven't personally been able to use multiple in a single gen since it just says the word but YMMV
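A tiny sketch of what "put one of the tags in parentheses at the start" could look like in practice. The marker names below are a subset of the list posted earlier in the thread; `tag_text` and `KNOWN_MARKERS` are made-up names, and the single space after the marker is an assumption, not something confirmed from the model's docs.

```python
# Subset of the markers quoted earlier in the thread (not the full list).
KNOWN_MARKERS = {
    "angry", "sad", "excited", "surprised", "scared", "sarcastic",
    "in a hurry tone", "shouting", "screaming", "whispering", "soft tone",
    "laughing", "sighing",
}


def tag_text(text, marker):
    """Prefix text with one parenthesized marker, rejecting unknown ones."""
    if marker not in KNOWN_MARKERS:
        raise ValueError(f"unknown marker: {marker!r}")
    # Assumed format: "(marker) text" with one space in between.
    return f"({marker}) {text}"
```

Rejecting unknown tags up front is just a guard against the behaviour described above, where an unsupported tag gets read out loud as a word.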
Anonymous No.42261622 >>42261832 >>42261949
>>42261547
>only one emotional tag control
oh, this sucks donkey balls, I was hoping we could finally have a model that can do advanced sentence styles, e.g. whispering with a mix of anger and confusion.
Anonymous No.42261832 >>42261949
>>42261622
Yeah, honestly sounds like a convoluted way to say they have multiple individual models compounded, each trained on one particular emotion, and it uses the parentheses to determine which underlying model it uses for synthesis.
Anonymous No.42261949
>>42261622
>>42261832
Well like I said, your mileage may vary. I haven't been experimenting with it nearly as much as I should, and it could very well support that. I saw an example somewhere else of Pearl from SU reading the best thing about meatballs meme and the voice there was pretty varied emotionally and realistic. To be fair, they might have been using the full model which is only available through their website, but I wouldn't knock it before trying it on the smaller model. Using my GPU for other purposes at the moment so someone else will have to test.
Anonymous No.42262326 >>42263067
Anonymous No.42262716 >>42262734
>>42261160
Is there some kind of library with voice clips I can use to make pony models in ElevenLabs?
Anonymous No.42262734 >>42272674
>>42262716
megas links in OP?
Anonymous No.42263067
>>42262326
Cute bump mare.
Anonymous No.42264147
>10
Anonymous No.42264531
>slow night bump
Anonymous No.42264548 >>42265214
>>42261160
https://15.dev/
Anonymous No.42264821
>>42196683
what???
Anonymous No.42265013
bump due to too much spam on the board
Anonymous No.42265042 >>42265052
Is openaudio s1 the best thing right now? I copied random text from a mod page. The pronunciation is pretty good, although imo a little too neutral.
Anonymous No.42265052
>>42265042
Audio quality seems the best, pronunciation is really good as long as it's not a weird made-up word. Emotions are pretty meh.
https://vocaroo.com/1l7fRlI0qtqn
Anonymous No.42265214
>>42264548
No trolls please
Anonymous No.42265590
https://x.com/elevenlabsio/status/1933188969279500459
Anonymous No.42266061 >>42266620
preserved
Anonymous No.42266620
>>42266061
Anonymous No.42266751
preservation bump
Anonymous No.42267158 >>42267570
Anonymous No.42267570
>>42267158
Anonymous No.42268023
>mared
Anonymous No.42268924
Up.
Anonymous No.42268941 >>42268985
This is starting to get sad...
Anonymous No.42268985
>>42268941
I only have one GPU that's already too outdated for all this kind of technological novelty. I already had to throw away a few ideas for song covers because random song leakage / dual vocals were fucking with the conversion process.
Anonymous No.42269579
>>42218755
>pukes at the end
Anonymous No.42269737 >>42269957
How do we save /ppp/?
Anonymous No.42269957 >>42272090
>>42269737
There is only one thing we can do, we cook...I mean we make pony content. I was thinking of doing a "X pony makes a review about fics/books" in similar theme/feel of Rainbow Dash Presents.
Anonymous No.42269988
REDUB 7!!!!!!!!!!!!!!!!!
Anonymous No.42270046
With SparkTTS, voices can be cloned with even just a few seconds of audio. This allows the cloning of background characters like TwinkleShine. What I like to do is feed ai generated voices into elevenlabs in order to get a higher quality model. Love what you guys are doing!
Anonymous No.42270522
>bump
Anonymous No.42270729 >>42271120 >>42271473
Anyone else here that thinks about the possibilities of AGI pretty consistently?
I don’t know exactly how much overlap there is between this corner of the fandom and technological singularity enthusiasts.
Anonymous No.42271120
>>42270729
I'm always dreaming of Bicentennial Man level of AGI. Just another race of sentient beings but they're Robots! but I have no idea if we'd ever reach a singularity event or even if we do, what are the true possibilities?
Anonymous No.42271473
>>42270729
In my unprofessional opinion, we don't currently have the tech and materials to make something that would work as proper AGI. At best it will just be more polished versions of LLMs that are so good at pretending to sound like people that it will be next to impossible to distinguish them from people. I do think people in the next century will make some new type of processors/programming/something else that could make computers think and feel for real, but by that time the world and society will have changed so much that there isn't even a point in guessing what it would look like (just like trying to explain to a caveman the wonders of tech from the ancient Roman empire).
Anonymous No.42272090
>>42269957
This. You must use the pone to save the pone
Anonymous No.42272674 >>42272771 >>42272932 >>42275950
>>42262734
I've tried to use the audio clips but my models sound like shit. Does anyone have some pre-made audio clips I can use for ElevenLabs that's worked well for them?
Anonymous No.42272771
>>42272674
>models sound like shit
no idea what script you are using, but everyone and every company that has pony voice conversion/TTS is using the exact same clips from PPP.
if you are using some new experimental cloning scripts, these will require the use of 10s clips, so if you give them just 3s clip the result will sound shit.
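The clip-length point above is easy to check before feeding anything to a cloning script. Rough sketch, assuming the clips are plain PCM WAV files (the stdlib `wave` module won't read mp3 or flac) and taking the 10 seconds from the post as the cutoff; `clip_seconds` and `long_enough` are made-up helper names.

```python
import wave


def clip_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def long_enough(paths, min_seconds=10.0):
    """Keep only the clips that are at least min_seconds long."""
    return [p for p in paths if clip_seconds(p) >= min_seconds]
```

Whether 10 seconds is the right threshold depends on the script in question, so treat the default as a placeholder.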
Anonymous No.42272932 >>42273853
>>42272674
>ElevenLabs
>Models sound like shit
So nothing new then
Anonymous No.42273482
nein!
Anonymous No.42273853 >>42274821
>>42272932
>https://u.pone.rs/LvFcybeH.mp3
surprise horsefuckers, I got some spare time and converted a song from my buddy to Moon Dancer vocals, enjoy.
OG song: https://suno.com/song/eae162d0-cbbb-433a-8008-5fab7bee01ba
Anonymous No.42274484 >>42275868 >>42276957
Bump.
Anonymous No.42274821
>>42273853
Nice pop song.
Anonymous No.42274933 >>42283796
>>41070370
Is there a chance anybody here has archived this before it was deleted?
>Background Pony - "OUT OF APPLES" - Hall 'n Oates - Out of Touch (MLP Applejack AI cover)
this was its title if it helps anybody find it
Anonymous No.42275376
>mare antispam bump
Anonymous No.42275868
>>42274484
Anonymous No.42275950 >>42276071
>>42272674
ElevenLabs is shit. Just use 15.ai.
Anonymous No.42276071 >>42279153
>>42275950
15...
Anonymous No.42276441
sleep bump
Anonymous No.42276957 >>42277965
>>42274484
Anonymous No.42277294
>mares
Anonymous No.42277427
>https://u.pone.rs/EuipipDV.mp3
American (Dad) Ghost theme
Anonymous No.42277965
>>42276957
nein
Anonymous No.42278416
>nein
Anonymous No.42278429 >>42278431 >>42279204
I downloaded this in 2021, it's been 4 years now. How much has it improved since then?

https://vocaroo.com/11NtyOrTttKN
Anonymous No.42278431
>>42278429
a lot.
Anonymous No.42279044
>Page 10
Anonymous No.42279153
>>42276071
He's right though. EL is arse.
Anonymous No.42279159 >>42279204
I want to take the costanza answering machine song and change the words while maintaining his voice. What's the most appropriate model to do this with?
Anonymous No.42279204
>>42279159
>keeping the og voice but slightly edited
Hmm, that will be a bit tricky. If you can find a version without a laugh track, you can try running the clip through ACE-Step
>https://huggingface.co/spaces/ACE-Step/ACE-Step
This should allow you to use its edit function to partly change the lyrics without changing the music (or so that's the general idea).
The other alternative is to find some clean clips (or de-noise them with some AI program) of Costanza singing in the same tune as in the show, train an RVC model on those 2~3 minutes of data, have some other character's TalkNet/whatever model sing the whole song, and apply the conversion over the official soundtrack
>https://www.youtube.com/watch?v=1ghIoM89cfc&list=RD1ghIoM89cfc
>>42278429
>from previous year
>https://u.pone.rs/DFPTbUhe.mp3
Dude, the tech jump feels like going from writing books by hand to using a printing press. Depending on what you are trying to use it for, it will most of the time sound about ~95% like the character is supposed to sound.
Anonymous No.42279528 >>42279949 >>42280416 >>42282125 >>42282660
Bump against the raid
Anonymous No.42279949
>>42279528
ya
Anonymous No.42280416
>>42279528
nein
Anonymous No.42280869
>mares
Anonymous No.42281330
bumpo save
Anonymous No.42281616
>https://u.pone.rs/FHniGgaQ.mp3
Pinkie Pie - At God's Mercy (GAME SIZE)
Anonymous No.42282125
>>42279528
Anonymous No.42282660
>>42279528
again
Anonymous No.42283002
>https://u.pone.rs/dyjpaZQU.mp3
Rainbow_Dash_sings_Land_of_Shattered_Dreams_by_DragonForce
Anonymous No.42283763
>No Nurse Redheart on 15.ai
Boycotting 15
Anonymous No.42283796 >>42284031 >>42284214 >>42291212 >>42292848 >>42293157
>>42274933
Six years of saving songs comes in handy sometimes. https://files.catbox.moe/gwqv9m.mkv
Anonymous No.42284031
>>42283796
>Filename
A philosophy to live by.
Anonymous No.42284214
>>42283796
nta but thank you archive-kun anon
Anonymous No.42284569
>https://u.pone.rs/MOQrKwwX.mp3
Redoing Cossacks letter with gpt sovits.
Anonymous No.42285028
>https://huggingface.co/collections/kyutai/speech-to-text-685403682cf8a23ab9466886
kyutai have posted their speech-to-text models on Hugging Face (they're the people who made the https://unmute.sh/ site). Hopefully they will get around to publishing the TTS model some time soon.
Anonymous No.42285552 >>42287822 >>42299779
>boop
Anonymous No.42286094 >>42289052 >>42291103
>sleep bump
Anonymous No.42287174 >>42287401
>>42253243
I came back with some samples from my button's mom dataset that I used the following on:
De-Breath
De-Esser
Mouth De-Clicker
Plosive Remover
>Original Samples
https://files.catbox.moe/68yrm2.wav
>Processed Samples
https://files.catbox.moe/0d3djz.wav
Again, I read that the software is completely open sourced to public domain and no one owns the rights to it or what it makes, so it should be perfect for processing data without spending money on iZotope. You be the judge on how effective it is, I'd say it's good enough to shovel multi-hour datasets through for free in one go and clean up whatever is left afterwards.
Anonymous No.42287401 >>42288274
>>42287174
Cool stuff! With its apparent noise and reverb removal capabilities I may have to test how good it is at salvaging previously unusable data to see if existing pony models might be expanded. Gotta first test if it works well through Wine though. I wonder if I might be able to salvage more workable Redheart data.
Anonymous No.42287822
>>42285552
Anonymous No.42288250
>pony bump
Anonymous No.42288274
>>42287401
Hell yeah brother! That's what it's all about! There's got to be so much ponyfeather quality audio data that could have been fine with just a pop filter, and this should fix it for posterity.
Anonymous No.42289052 >>42289899
>>42286094
ayy
Anonymous No.42289075 >>42289305
Does anyone know what TTS service is best to use with SillyTavern?
Anonymous No.42289305
>>42289075
uhhh, i vaguely remember there was a plugin script (or api script?) that could connect ST with some TTS that could even be trained on 10~20 minutes of dataset, but that was a year or more ago and even then I personally gave up on it, as the python dependency hell was impossible to navigate to even install that bloody thing.
Anonymous No.42289899
>>42289052
Anonymous No.42290193 >>42290480
its mare
Anonymous No.42290480 >>42305186
>>42290193
Anonymous No.42291103 >>42292030
>>42286094
>awake bump
Anonymous No.42291212
>>42283796
SUPERCHARGED anon, thank you
Anonymous No.42292030
>>42291103
indeed
Anonymous No.42292848
>>42283796
Nice! I think I have about that in pony memes and art among others from years of saving, which, come to think of it, I still need to find time to sort and categorise. Thanks for the reminder.
Anonymous No.42293157
>>42283796
Autism yields its own rewards.
Nice.
Anonymous No.42293460
>pre work bump
Anonymous No.42293824 >>42294511
Precautionary bump.
Anonymous No.42294511
>>42293824
aaaaaaaaaaaa!
Anonymous No.42295095
gn, imma going to think of what stuff to make tomorrow
Anonymous No.42295943 >>42296474 >>42297247
Paag 10 save.
Anonymous No.42296474
>>42295943
Almost again.
Anonymous No.42297247 >>42298481
>>42295943
Anonymous No.42297721
night bump
Anonymous No.42298481
>>42297247
Anonymous No.42298627 >>42299241
>>42174105
Do we know if there are any other additional recent local audio and music generators comparable to the likes of Suno and Udio?
Aside from this example, I haven't come across a decent versatile one that can run locally since Bark, which was abandoned ages ago (as far as open source goes) and became Suno. Which is still incredibly good, but it'd be nice to have something similar that doesn't rely on credits and lame stuff like that.
Anonymous No.42299090 >>42300347 >>42301020
>9
Bump mare time
Anonymous No.42299241
>>42298627
Stability AI may or may not work on one, but who the fuck knows with them since they still haven't published the newer version of the instrumental Stable Audio model.
The other AI song model is YuE, but from the looks of it, it's a bit tricky to get working locally.
Anonymous No.42299779
>>42285552
Boopity boop!
Anonymous No.42300347
>>42299090
mare
Anonymous No.42300352 >>42301986
>>42161191 (OP)
Congratulations, 1111 aka 15!
Anonymous No.42300581 >>42301705 >>42302358
>https://u.pone.rs/kLAzyDaA.mp3
New ai song, "I only eat 3 cheeseburgers!" from suno user 김치다시마은갈치, and converted with Twi vocals.
Anonymous No.42301020
>>42299090
mare harder
Anonymous No.42301705
>>42300581
we sell hay here not burgers
Anonymous No.42301986 >>42302315
>>42300352
What are you referring to?
Anonymous No.42302294 >>42302526 >>42302859 >>42303531 >>42304147
>9
Eighth bump mare deployed
Anonymous No.42302315
>>42301986
söy of 2
Anonymous No.42302358
>>42300581
Could go for some burgers right about now
Anonymous No.42302526
>>42302294
Thank you, kind bump mare.
Anonymous No.42302859
>>42302294
Anonymous No.42303176
quick board...
Anonymous No.42303531
>>42302294
mared
Anonymous No.42303825
anti spam bump
Anonymous No.42304147
>>42302294
Anonymous No.42304700
>https://www.tomshardware.com/news/gddr6-vram-prices-plummet
>16 gb of vram could be as cheap as 400$
>but it wouldn't because nvidia are greedy fucks
i will never forgive the crypto bros for fucking up the market
Anonymous No.42305186
>>42290480
So it seems
Anonymous No.42305431 >>42305442
Board is moving lightning fast this past hour.
Anonymous No.42305442 >>42305606 >>42305635
>>42305431
it's the sliderfag
Anonymous No.42305606
>>42305442
Yep, it's becoming more and more blatant every time.
Anonymous No.42305635
>>42305442
With the lack of reaction from jannies and mods (as they are too busy jerking off to furry fag shit), I'm feeling like it could be a good idea to keep a parallel thread in nhnb and mlpol too, to at least keep some bits in case the board keeps being nuked.
Anonymous No.42306113
>pre dinner bump
Anonymous No.42306741
>up poned
Anonymous No.42307024 >>42307277 >>42307279
Anyone know how to get 15 ai to scream? Tried to use so-vits on haysay with audio but it came out like crap. Need Lyra doing it too, and so-vits doesn't have her.
Anonymous No.42307277 >>42308681
>>42307024
Uhhh, TTS models pretty much always struggled with screaming and whispering. The older 15 model could do it to some smaller degree (but it still was a massive game of rolling the next generated clip until you got what you wanted). I guess you could try to find a screaming clip in the OP mega and use that as the reference for GPT-SoVITS TTS?
Anonymous No.42307279 >>42307905 >>42308681
>>42307024
Convincing screams and other less-phonetic sounds have been notoriously difficult since the very beginning of artificial speech. Feels like it comes down to a lack of data, or to that kind of data being specifically excluded because of the negative impact it has on training.

Closest thing I can suggest is priming. Start the prompt with a sentence (or several) of dialogue that would ordinarily be said with intensity, be that anger, seriousness, shock, whatever. The AI likes to be consistent with its outputs, so some of that emotion will carry over to the subsequent sentences, and this is where you'd attempt the screaming dialogue. Might also be good to try using ARPAbet for some of it too so it's pronounced correctly.
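
If you want to batch a bunch of attempts, the priming idea is just string assembly. A minimal sketch, where `build_primed_prompt` is a made-up name and the example primer lines are placeholders; how much of the emotion actually carries over is up to the model.

```python
def build_primed_prompt(primers, target):
    """Join intense primer sentences and the target line into one prompt."""
    # Drop empty primers and stray whitespace so the prompt stays clean.
    parts = [p.strip() for p in primers if p.strip()]
    parts.append(target.strip())
    return " ".join(parts)
```

For example, `build_primed_prompt(["How dare you!", "Get out of my house!"], "AAAAAAAAH!")` puts the scream last so it inherits whatever intensity the primers set up; presumably you'd then cut the primer part out of the generated audio.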
Anonymous No.42307905
>>42307279
10
Anonymous No.42308230 >>42309328
Bump.
Anonymous No.42308681 >>42309483
>>42307277
>>42307279
Thanks for the suggestions. I ended up just regenerating an "AAAAAAAAA" prompt a bunch of times until I got as close as I could to a scream. Sounds like shite, but it was only for a little shitpost anyway. https://files.catbox.moe/z2r0c8.mp3
Which is for this pic in /bale/ >>42305975
Anonymous No.42309328
>>42308230
Anonymous No.42309483
>>42308681
huh, pretty neat work Anon
Anonymous No.42310164 >>42310579
>>42207220
https://files.catbox.moe/fv2v5u.wav
https://files.catbox.moe/aeqloc.wav
https://files.catbox.moe/xl6ft5.wav

Here's some with Flutters. I just did:

"ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, cumming! ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, fuck me, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh, ahh!"

You can hear the good parts and splice those.
Anonymous No.42310579
>>42310164
ai mares are lewd
Anonymous No.42310923 >>42311695
>https://u.pone.rs/NlnRoRSa.mp3
Ghost singing Past Due - Xenophobia (aka unofficial theme song of Stellaris)
Anonymous No.42311695
>>42310923
A classic. Let the light of mankind shine brighter than the stars themselves
Anonymous No.42312379
RealDash No.42312387 >>42312431
I might make a small lewd audio of Twiggle as a test for 15.dev.
Dialogue's a pain to get to sound natural, way more than 15ai's last version.
Anonymous No.42312431
>>42312387
>>>/trash/
Anonymous No.42312836 >>42313175
>9
Deploying ninth bump mare (triple pose edition)
Anonymous No.42313175
>>42312836
horse
Anonymous No.42313719 >>42313730
>14.ai
lmao
Anonymous No.42313730 >>42313761 >>42314271
>>42313719
kek, a race to the bottom. What kind of sketchy indians will we reach when we hit 1.ai?
Anonymous No.42313761 >>42319574
>>42313730
uh, based?
Anonymous No.42314271 >>42314359 >>42314864
>>42313730
Or -1.ai
Anonymous No.42314359 >>42317225
>>42314271
Interestingly, hyphens can't be used at the start or end of a domain name. Would probably have to be negative1.ai or something
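For what it's worth, that rule is easy to check. A simplified sketch of the classic hostname label rule (letters, digits and hyphens only, 1 to 63 characters, no hyphen at either end); `is_valid_label` is a made-up name, and registries may impose extra rules that this ignores.

```python
import re

# One DNS label: letters, digits, hyphens; 1-63 chars; no leading/trailing hyphen.
_LABEL = re.compile(r"(?!-)[A-Za-z0-9-]{1,63}(?<!-)")


def is_valid_label(label):
    """Return True if label satisfies the simplified hostname label rule."""
    return _LABEL.fullmatch(label) is not None
```

So `is_valid_label("-1")` is rejected because of the leading hyphen, while `is_valid_label("negative1")` passes.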
Anonymous No.42314864
>>42314271
Witchcraft!
Anonymous No.42315242
>9
Anonymous No.42315665 >>42316221 >>42316594 >>42317148 >>42317684
>Page nine
Anonymous No.42316221
>>42315665
MAREEE
Anonymous No.42316594
>>42315665
early sleep bump
Anonymous No.42317148
>>42315665
Anonymous No.42317225 >>42317279
>>42314359
Or simply minus1.ai. It's kind of a word play.
Anonymous No.42317279
>>42317225
Clever. I like it.
Anonymous No.42317684
>>42315665
Anonymous No.42317966 >>42318161
A very quick cover of Beatles' With a Little Help from My Friends with slightly modified lyrics
https://u.pone.rs/ODLJbBek.flac
Anonymous No.42318161
>>42317966
Nice work Anon! Funny enough, I listened to some random Beatles song a week ago and wished there were some covers or parodies done in pony voices.
Anonymous No.42318380
Hi HydrusBeta, I'm getting an error when using the sovits 4.0 Spitfire model with the 'reduce hoarsness' and 'apply nsf_higan' settings, and it works if I turn these two settings off.
Anonymous No.42318746
>https://u.pone.rs/KbiNvzqK.mp3
Solitary Summer Dream by suno user testediserie.
I was looking for a nice summer song for Celestia and found myself really enjoying listening to this, BUT RVC and other voice converters disagreed with my vocal choice, so we all get to enjoy a Spitfire cover, since her voice hasn't been used that much.
Anonymous No.42318932
Late night bump.
Anonymous No.42319283
What's the current torrent for the MLP leak files?
Anonymous No.42319348 >>42319606
>42119384 42196683 42317225
Yet it is proper to enumerate as such among the Trotting ways.

>42161222 42269737 42208841
ppp as tragedy of the commons
Things fall apart, the centre cannot hold - Yeats
pandora's vox on community in cyberspace - humdog
yet... n mare saddlepoint? The altchans apart were less a scattering of the winds and more of the Shattered sundered.

>42204138 42198701 42195922
The Cathedral and the Bazaar - Raymond, acknowledging Tarver's Bizarre Empty Temples.
Cathedral vs. Parlor - Wrye, acknowledging Monitor144hz's Patreon Pigeonhole.
Tamers1-4,5 voices when?

>42270729
It's been a long thread. Bacon-bakin' necessary.
Anonymous No.42319574
>>42313761
Who are these dunces?
Anonymous No.42319606 >>42319961 >>42320204
>>42319348
Anon, are you trying to conceptualize LLM into becoming CelestAI ?
Anonymous No.42319961
>>42319606
If it works, that would be something.
Anonymous No.42320204
>>42319606
Boop her snoot.
Anonymous No.42320765
>mares
i love them
Anonymous No.42321010 >>42321061
NEW THREAD
>>42320976
Anonymous No.42321061
>>42321010
mares?