/ldg/ - Local Diffusion General
Anonymous
9/6/2025, 2:44:57 AM
No.106497283
>>106497298
All round town anons ask me if I know of the "blessed thread" and do you know what I tell them
Comfy needs WebUI style workflow or node improvements for users. Swarm and SDNext aren't solutions. People want Gradio.
Anonymous
9/6/2025, 2:46:54 AM
No.106497298
>>106497375
>>106497283
Please tell me
Anonymous
9/6/2025, 2:48:03 AM
No.106497311
>>106497293
people dont know what they want until you give it to them desu
>>106497293
ComfyTip:
Group reusable nodes using Nest Node Builder.
Load/ungroup them when needed instead of adding individual nodes.
Save time with saved nested nodes (ControlNet, Prompt, Upscale, etc.).
Export as JSON to transfer/share.
ComfyUI is faster and more convenient than Forge once a workflow is finalized, trust me.
Anonymous
9/6/2025, 2:50:53 AM
No.106497324
Blessed thread of frenship
>>106497316
Comfy It's trash, a big pile of ugly frustrating trash, and why the hell is it called comfy ui? it anything but comfy its awful. I'm will never using it again, fuck !
Graphical UIs suck. Why can't we write code directly?
A basic workflow is less than 10 lines of code:
text_encoder = load_text_encoder("te.safetensor")
model = load_model("model.safetensor")
vae = load_vae("vae.safetensor")
model.load_lora("lora.safetensor")
latent = new_latent(width=1024, height=1024)
latent = latent.ksampler(steps=20, cfg=4, seed=12345)
image = vae.decode(latent)
image.save("out.png")
Ability to write functions make abstraction possible:
latent = new_latent(width=1024, height=1024)
latent = my_custom_first_pass_sample(latent, steps=20, cfg=4)
latent = my_custom_hires_fix(latent, steps=30, cfg=4, denoise=0.7)
latent = my_custom_adetailer(latent, ...)
No more spaghetti. No more janky node to workaround node UI deficiencies like reroute, switch, number to string to number, numeric operations on numbers, etc.
Not to mention with loop statements it's easy to make custom XYZ plots, which node UI just can't do.
With a real programming language, it has the power of both forge-style UI and node-based UI do while also better than both in flexibility and cleanness.
Pair it with a Jupyter notebook style UI, you can nicely iterate and inspect results anywhere in the middle of generation and nicely iterate until you get a good result.
Anonymous
9/6/2025, 2:52:29 AM
No.106497335
>>106497325
>Comfy It's trash
Can I push a big orange button to convert a "comfy" workflow to python or some real scripting language and hack that?
Anonymous
9/6/2025, 2:54:52 AM
No.106497345
>>106497355
>>106497316
What is killing me on my Comfy are the updates that and other shenanigans that keeps breaking constantly , like, the project is great but it's so fragile and takes a misplaced period to almost make you pc explode.
Anonymous
9/6/2025, 2:54:52 AM
No.106497346
>>106497334
Easy peasy, lemon squeezey ! What imports do I need?
Anonymous
9/6/2025, 2:55:53 AM
No.106497351
>>106497334
this is the best
Anonymous
9/6/2025, 2:56:54 AM
No.106497355
>>106497345
Seriously, they should have guideline like Forge and enforce them before allowing them to be integrated into WebUI. Seriously, that's what is keeping me from using it as my main stable diffusion ui.
Anonymous
9/6/2025, 2:57:32 AM
No.106497358
>>106497334
the only thing i want to write is a prompt
Anonymous
9/6/2025, 2:58:42 AM
No.106497369
Anonymous
9/6/2025, 3:00:12 AM
No.106497375
>>106497451
>>106497298
that gap man
THAT FUCKING GAP
it does things to me
Anonymous
9/6/2025, 3:02:13 AM
No.106497384
After all this time trying comfy, I still absolutley hate it's fking guts. I tried, I learned, I made mistakes, I studied, I failed, I learned again. Debugging and debugging and debugging... I'm so sick of it. I hated it from my first git clone up until now, with my last right click delete of the repository. I have been using A1111, reForge, and Forge as my daily before Comfy. I tried Invoke, foocus, and SwarmUI. Comfy is at the bottom. I don't just not enjoy it, it is a huge nightmare everytime I start it. I wanted something simple, plug n play, push power button and grab a controller, type of ui. Comfy is not only 'not it' for me, it is the epitome of what I hate in life.
Why do I hate it so much? Here's some back ground if you care. When I studied to do IT 14 years ago I had a choice to choose my specialty. I had to learn everything from networking, desktop, database, server, etc... Guess which specialties I ACTIVELY avoided? Database and coding/dev. The professors would suggest once every month to do it. I refused with deep annoyance at them. I dropped out of Visual Basic class because I couldn't stand it. I purposely cut my Linux courses because I hated command line, I still do. I want things in life to be as easy and simple as possible.
Anonymous
9/6/2025, 3:03:23 AM
No.106497391
>>106497385
okay .. well you're kinda retarded so you're not helping the rest of us with our case
>>106497385
Comfy is like browsing the internet in a browser with html format only. Imagine a wall of code, a functional wall of code. It's not really the spaghetti that bothers me, it's the jumbled bunch of blocks I am supposed to make work. The constant scrolling in and out is annoying but the breaking of comfy from all the nodes (missing nodes) was what killed it for me. Everyone has a custom workflow. I'm tired of reading dependencies over and over and over again.
I swear to Odin I tried my best. I couldn't do it. I just want to point and click and boom image. I don't care for hanyoon, huwanwei, whatever it's called. I don't care for video and all these other tools, I really don't. I just want an outstanding checkpoint and an amazing inpainter.
Am I stupid? yeah sure call me that if you want. I don't care. I open forge. I make image. I improve image. I leave. That's how involved I am in the AI space. DESU, 90% of the new things, cool things, new posts in this sub is irrelevant to me.
You can't pay me enough to use comfy. If it works for you great, more power to you and I'm glad it's working out for you. Comfy was made for people like you. GUI was made for people who couldn't be bothered with microscoptic details. I applaud you for using Comfy. It's not a bad tool, just absolutely not for people like me. It's the only and the most power ui out there. It's a shame that I couldn't vibe with it.
Anonymous
9/6/2025, 3:04:37 AM
No.106497403
>>106497392
I use comfy for video, forge for the images. I do a fair amount of nodes in unreal and blender at work, so it's not too scary, but I won't add another node system I need to master unless I have to!
Anonymous
9/6/2025, 3:04:45 AM
No.106497405
>chatgpt write schizobabble from the perspective of someone who hates comfyui
Anonymous
9/6/2025, 3:05:16 AM
No.106497411
>>106497385
>. I tried, I learned, I made mistakes, I studied, I failed, I learned again.
Engaging but not empowering. sigh
Anonymous
9/6/2025, 3:06:20 AM
No.106497420
>>106497392
I completely get where you are coming from. I have tried ComfyUI too, and while it is insanely powerful, it always feels like I am stuck fixing broken LEGO instructions made by 10 different people. Every time I load someoneβs workflow, half the nodes are either deprecated, custom, or renamed, and I end up spending more time debugging or hunting missing certain type of files than generating.
Anonymous
9/6/2025, 3:08:04 AM
No.106497436
>>106497477
>>106497428
You ruined that skindentation with that ugly face.
Anonymous
9/6/2025, 3:08:20 AM
No.106497438
>>106497447
>>106497428
dat phase shift
Anonymous
9/6/2025, 3:08:33 AM
No.106497440
>>106497385
That is why I stick with Stable Diffusion Forge for images. It just works. PNG Info gives me everything I need at a glance, even if the image was not made in Forge. Prompt, model, LoRA,
Anonymous
9/6/2025, 3:09:31 AM
No.106497447
>>106497438
that was on purpose.. it took so goddamn many tries to get it to do it too
Anonymous
9/6/2025, 3:09:51 AM
No.106497451
>>106497504
>>106497334
Can't you just use the tensor library in python and kinda just do that?
Implementing/porting all functionality of custom nodes might be a pain, though.
That being said, I don't mind ComfyUI. I kinda like the previews/compare nodes. If you'd want that in pure code it'll require a UI again, anyway.
>>106497375
This is a blue board, this is all friendly fun.
Anonymous
9/6/2025, 3:10:19 AM
No.106497454
>>106497392
ComfyUI has its fans and massive support and I respect that, but it is clearly not built for everyone. If you like clean workflows and hate excessive tinkering, Forge hits the sweet spot. So you are not alone!
Anonymous
9/6/2025, 3:12:27 AM
No.106497467
>>106498333
blue board, blue balls
Anonymous
9/6/2025, 3:13:06 AM
No.106497473
did I tell you before? I don't like comfyui
I know there are other webuis, but I don't care about them, I don't even actually gen, I just want to say I don't like comfyui
Anonymous
9/6/2025, 3:13:27 AM
No.106497477
>>106497436
that was actually on purpose and it took many attempts to pull it off
Anonymous
9/6/2025, 3:14:28 AM
No.106497484
>>106502045
Anonymous
9/6/2025, 3:14:37 AM
No.106497486
>>106497501
>>106497316
You'd be surprised at the number of nodes people include in their workflows that are completely unnecessary. I've even seen requests to install a custom node package to set an integer or just to hide 2 spaghettis.
Anonymous
9/6/2025, 3:15:15 AM
No.106497494
It's all well and good to want to write high-level code and click run but it does have some problems. Main one being that importing PyTorch and then loading your models takes forfuckingever so you're going to need to have some sort of server that holds onto that stuff for you and somehow knows what models to cache when, otherwise every time you fiddle with your gen you have to wait the best part of a minute just for it to get going. Maybe you could do it with some combination of Jupyter notebooks idk, but I've never used Jupyter and you'd also need to implement some way to easily embed those notebooks into your outputs and then convince everybody to adopt that standard.
Anonymous
9/6/2025, 3:15:51 AM
No.106497498
with all this fresh pasta i dont need the noodles XD
Anonymous
9/6/2025, 3:15:53 AM
No.106497499
To AntiComfySchizo:
You don't hate comfy, you hate python. And rightly so, it's a fully trash peer dependency environment.
Anonymous
9/6/2025, 3:16:23 AM
No.106497501
>>106497486
It's very annoying when workflow use random exotic nodes while perfectly fine versions exist in core or even well known packs.
I just convert them usually.
Anonymous
9/6/2025, 3:17:31 AM
No.106497503
>>106497535
I HATE PYTHON
Anonymous
9/6/2025, 3:17:52 AM
No.106497504
>>106497672
>>106497451
>This is a blue board, this is all friendly fun.
it makes me horny in a friendly way
Anonymous
9/6/2025, 3:18:28 AM
No.106497515
I LOVE SNAKES
Anonymous
9/6/2025, 3:18:41 AM
No.106497518
>>106497528
>>106497502
FUCK OF NETA LUMINA SCAMMER
SHARE WORKFLOW OR GTFO
Anonymous
9/6/2025, 3:19:05 AM
No.106497521
Anonymous
9/6/2025, 3:19:49 AM
No.106497528
give me prompt suggestions pls
Anonymous
9/6/2025, 3:20:36 AM
No.106497535
>>106497503
>I HATE PYTHON
I like python. I hate python package management and dependency hell.
Anonymous
9/6/2025, 3:21:36 AM
No.106497545
>>106497581
>>106497534
A cute 1girl sleeping under the sun.
Anonymous
9/6/2025, 3:22:06 AM
No.106497551
>>106497581
>>106497534
"An old man punching a horse"
Anonymous
9/6/2025, 3:22:38 AM
No.106497556
>>106497582
>>106495675
anyone? though reading past the last thread Iβm gonna assume that even if it is possible, itβs not as easy as I think it is.
iβm basically modifying an image of a hand pulling something out of a box, and I want to use the reference image as the item being pulled out.
Still using an unmodified workflow from the guide here.
Anonymous
9/6/2025, 3:22:39 AM
No.106497557
>>106497581
>>106497534
1gil, looking at viewer, waving
Anonymous
9/6/2025, 3:23:36 AM
No.106497566
>>106497581
>>106497534
an indian uncle harassing a hot girl for sexi sex
Anonymous
9/6/2025, 3:25:59 AM
No.106497581
>>106497545
>>106497551
>>106497557
>>106497566
i could inpaint all of these into a single awesome gen but i use comfy so you know im not fucking around with that shit kek
Anonymous
9/6/2025, 3:26:01 AM
No.106497582
Anonymous
9/6/2025, 3:28:43 AM
No.106497600
>>106497612
I've started to notice some similarity on the wan 2.2 faces, is there anything I can do about it without prompting face traits or ethnicity?
Anonymous
9/6/2025, 3:30:45 AM
No.106497612
>>106497600
use loras, or use i2v instead
>>106497504
I'm not going to give you a friendly handshake, that's for sure.
Anonymous
9/6/2025, 3:40:46 AM
No.106497697
>>106497672
that's ok anon
Anonymous
9/6/2025, 3:44:24 AM
No.106497725
>>106497525
You asuka sucks you know?
Anonymous
9/6/2025, 3:47:01 AM
No.106497747
tfw ani was right
Anonymous
9/6/2025, 3:47:21 AM
No.106497751
>>106497525
pretty good Asuka
Anonymous
9/6/2025, 3:48:22 AM
No.106497762
k enough of the discord invasion
Anonymous
9/6/2025, 3:57:17 AM
No.106497819
Am i really the only cumfartnigger who sees this in vram usage during every vram allocation until it stabilizes and works normally? Started happening in the last couple of days
Anonymous
9/6/2025, 4:08:07 AM
No.106497895
>>106497882
Maybe comfyui should be dragged on the streets and ACKed
Anonymous
9/6/2025, 4:09:02 AM
No.106497901
>>106497908
man we really need better interpolation software
Anonymous
9/6/2025, 4:10:10 AM
No.106497907
>>106497892
pretty damn good
Anonymous
9/6/2025, 4:10:31 AM
No.106497908
>>106497901
film vfi is good enough for 16 to 32, and we just need some cracked topaz node connection in comfyui for slightly higher quality to 32 and for anything more
Anonymous
9/6/2025, 4:16:56 AM
No.106497955
Anonymous
9/6/2025, 4:17:46 AM
No.106497959
>>106497882
>mfw ram usage expands
Anonymous
9/6/2025, 4:21:56 AM
No.106497991
i want these goddamn snakes out of my goddamn computer
Anonymous
9/6/2025, 4:27:43 AM
No.106498024
Anonymous
9/6/2025, 4:38:35 AM
No.106498098
>>106502095
Anonymous
9/6/2025, 4:44:28 AM
No.106498141
>>106498172
Anonymous
9/6/2025, 4:47:33 AM
No.106498151
>>106497325
>I'm will
you will be sorely missed, frenchy.
Anonymous
9/6/2025, 4:50:24 AM
No.106498169
Anonymous
9/6/2025, 4:51:21 AM
No.106498172
>>106498378
>>106498141
why is that man drooling like a retard
Anonymous
9/6/2025, 4:52:51 AM
No.106498188
Anonymous
9/6/2025, 4:54:11 AM
No.106498196
Anonymous
9/6/2025, 4:58:27 AM
No.106498231
>>106498237
Anonymous
9/6/2025, 4:59:33 AM
No.106498237
Anonymous
9/6/2025, 5:03:23 AM
No.106498256
Anonymous
9/6/2025, 5:11:53 AM
No.106498295
>>106498310
Don't get all the shit Comfy gets, when the entire community has had to rely on Civit for far too long. We're talking actual tangible, long term damage from such a garbage platform thriving.
Anonymous
9/6/2025, 5:13:52 AM
No.106498309
>>106499172
Anonymous
9/6/2025, 5:14:05 AM
No.106498310
>>106498295
Yet people here rarely talk about real problems in need of an urgent solution.
Anonymous
9/6/2025, 5:17:59 AM
No.106498325
>>106498344
>>106497275
whoever genned this, please PLEASE catbox
Anonymous
9/6/2025, 5:20:33 AM
No.106498333
>>106497467
Not if you visit /adt/
I mean, it's crazy what they get away with posting there.
Anonymous
9/6/2025, 5:23:36 AM
No.106498344
>>106498325
PLEASE get better taste
Anonymous
9/6/2025, 5:28:59 AM
No.106498370
Anonymous
9/6/2025, 5:30:38 AM
No.106498378
>>106498437
Anonymous
9/6/2025, 5:43:13 AM
No.106498437
Anonymous
9/6/2025, 5:43:24 AM
No.106498439
>>106498530
Anonymous
9/6/2025, 5:58:01 AM
No.106498530
>>106498439
Typical DiT manlet (because manlet syndrome isn't unique to kontext or QIE, it is some kind of transformer-wide intrinsic property, noticable in Dalles, too. Young meatbag artists also share it with transformers when they run out of bottom margins on their real world paper but still need to draw feet.)
Anonymous
9/6/2025, 6:00:53 AM
No.106498539
Anonymous
9/6/2025, 6:02:09 AM
No.106498545
>>106497428
2000 years post wall
Anonymous
9/6/2025, 6:06:53 AM
No.106498571
Anonymous
9/6/2025, 6:12:13 AM
No.106498601
Anonymous
9/6/2025, 6:38:32 AM
No.106498708
>>106498718
Can any of you anons make me a believable black-and-white photo of Sigmund Freud mixed with George Floyd? I want to print it and
Anonymous
9/6/2025, 6:39:36 AM
No.106498718
>>106498708
...and put it on my best friend's wall.
Is GenJam never coming back?
Anonymous
9/6/2025, 6:52:24 AM
No.106498773
>>106498746
it will return
Anonymous
9/6/2025, 7:06:44 AM
No.106498840
>>106498746
Just say it with, move your lips when reading it so you can feel how good it feels saying.
"GOON JAM"
Anonymous
9/6/2025, 7:14:48 AM
No.106498878
>>106498923
Anonymous
9/6/2025, 7:18:16 AM
No.106498895
wan 2.2 vace WHEN
Anonymous
9/6/2025, 7:25:58 AM
No.106498922
>>106502499
>>106498746
>didn't participate
>somehow miss it
feels weird
Anonymous
9/6/2025, 7:26:07 AM
No.106498923
>>106498929
>>106498878
I'm curious of the prompt
Anonymous
9/6/2025, 7:26:31 AM
No.106498929
>>106498923
A scenic view of an old bicycle in a field. The camera pushes out to reveal scraps of torn and ripped clothing strewn about and broken beer bottles on the ground. There are smears from bloody handprints on the tree. There is a puddle of of blood in the grass and on the ground near the torn clothes. It looks like a murder scene.
Anonymous
9/6/2025, 7:27:34 AM
No.106498936
>>106498996
How much RAM you guys have? I plan to upgrade from 64 to 128GB, but not sure if it's worth the upgrade
>>106497872
kek, fucked up the heels
Anonymous
9/6/2025, 7:44:09 AM
No.106498996
>>106499232
>>106498936
noob and a lora trained on 80 imgs curated from
https://x.com/schauermannx2
i think it can be much better tho
>>106498961
perspective wise i think the video makes more sense heh
Anonymous
9/6/2025, 7:51:32 AM
No.106499026
>>106499215
>>106498958
Downloaded RAM from 64 to 128 and it reduced my Wan2.2 gens by 80 seconds. Worth it.
Anonymous
9/6/2025, 7:58:46 AM
No.106499047
>>106501005
>>106499032
where can u download ram from???
Anonymous
9/6/2025, 7:59:47 AM
No.106499051
Anonymous
9/6/2025, 8:02:05 AM
No.106499064
Are there wan2.2 vace or alternatives yet?
Anonymous
9/6/2025, 8:12:13 AM
No.106499100
Anonymous
9/6/2025, 8:23:14 AM
No.106499147
Anonymous
9/6/2025, 8:27:43 AM
No.106499172
Anonymous
9/6/2025, 8:30:49 AM
No.106499186
Anonymous
9/6/2025, 8:36:00 AM
No.106499215
>>106499237
>>106499026
no screenshot? comeon hommie, flex that dick
Anonymous
9/6/2025, 8:37:04 AM
No.106499218
>>106499244
>>106496741
>Now I kind of want to make a Suno.ai song with this phrase as the chorus:
>>he made his own bed
>>by forcing the model to run faster starting at v30, >it all went downhill from there
got ya senpai
https://vocaroo.com/1loPzeLJD8qK
Anonymous
9/6/2025, 8:40:11 AM
No.106499232
>>106498961
>>106498996
true, i think i got it right this time though
Anonymous
9/6/2025, 8:40:44 AM
No.106499237
>>106499247
Anonymous
9/6/2025, 8:41:46 AM
No.106499244
Anonymous
9/6/2025, 8:42:42 AM
No.106499247
>>106499250
>>106499237
dawg I got a 5090, thought you had that blackwell pro, tryin to jerk off over here
Anonymous
9/6/2025, 8:43:13 AM
No.106499250
>>106499247
no 96 gig ram not vram
Anonymous
9/6/2025, 8:44:14 AM
No.106499255
Anonymous
9/6/2025, 8:46:12 AM
No.106499264
odd how onetrainer wont let you define sampler/scheduler/etc for training samples. i wonder how training with comfy compares in general
Anonymous
9/6/2025, 8:53:05 AM
No.106499300
>>106499316
>>106497264 (OP)
Middle right is amazing
Anonymous
9/6/2025, 8:56:55 AM
No.106499316
>>106499300
Ikr, I didn't find any flaw on that one, I kinda expect Wan 3.0 to have this kind of quality consistently
Anonymous
9/6/2025, 8:57:56 AM
No.106499318
>>106498206
LMAOOO, this is a gem
Anonymous
9/6/2025, 8:59:43 AM
No.106499331
>>106499113
now I get why Microsoft wants to shut this model down kek
Anonymous
9/6/2025, 9:12:21 AM
No.106499420
>>106500838
>>106499405
the corrected 7b model is "on the way", it'll be a lobotomized version of the one we already have lol
I just realized WAN's generation are tuned for 16 fps. Increasing fps makes the motion fast most of the time.
Anonymous
9/6/2025, 9:19:05 AM
No.106499463
>>106499405
because that obviously is not the original microsoft repo
Anonymous
9/6/2025, 9:19:46 AM
No.106499468
>>106499455
yep, I hope their next version will be at 24 fps, that's the threshold where it doesn't look chopped as fuck
https://www.theverge.com/anthropic/773087/anthropic-to-pay-1-5-billion-to-authors-in-landmark-ai-settlement
Anthropic to pay $1.5 billion to authors in landmark AI settlement
holy fuck dude, this is bad, like really really bad
Anonymous
9/6/2025, 9:25:44 AM
No.106499505
>>106499509
>>106499455
>I just realized WAN's generation are tuned for 16 fps.
I have no idea where people got the idea that it isn't. It's been a really hard myth to dispell.
Anonymous
9/6/2025, 9:26:24 AM
No.106499509
>>106499505
>I have no idea where people got the idea that it isn't.
that's because the 5b version is actually working at 24fps, so I also thought the 14b version would be too
Anonymous
9/6/2025, 9:26:45 AM
No.106499513
>>106499502
That's like 80% of what I've spent gooning to opus 4.1
Anonymous
9/6/2025, 9:30:52 AM
No.106499544
>>106499502
>Let's fuck with the development of this groundbreaking technology because some fat bitch wants money for her chad thundercock schlock novel
Grim. I hate the antichrist.
>>106499502
It's actually good in the sense that they're only paying for illegally downloading the books, not for using them in training. 1.5b is nothing to them but (like everyone's been saying) this case further widens the gap between the big guys and the little guys (which is what the authors proclaim they are against kek).
Anonymous
9/6/2025, 9:34:49 AM
No.106499578
>>106499455
The frame interpolators are decent for solving this issue. I use GIMM-VFI, which you can search for with comfyui custom nodes manager and install it.
I believe these are quite a bit better than the old interpolated frames you'd get with TVs. Though they're obviously not perfect, as they're only spending like 60 seconds to generate the interpolated frames for your entire video.
The way it works is your frame count gets increased (like if every 2nd frame is interpolated then it goes from 81 --> 160 or whatever), so now your video is in slow motion, but then you fix this by increasing your FPS to make it faster. And then it'll look right.
Anonymous
9/6/2025, 9:34:59 AM
No.106499580
>>106499604
Testing Chroma Radiance
I think the output is pretty interesting. It has sort of a weird mottling pattern that seems unique. It's probably something that should go away with further training progress, but I actually like it.
>>106499502
>>106499556
The judge explicitly ruled that training on protected works is fair use which is the biggest win for AI bros
Anonymous
9/6/2025, 9:36:42 AM
No.106499592
>>106499638
https://vocaroo.com/1eCSSHLSRPJ0
You know, for the first time using vibe voice it's actually pretty good. I was expecting another so-so model but it's actually pretty good.
Anonymous
9/6/2025, 9:37:45 AM
No.106499596
>>106499665
>>106499585
yeah we won but it'll be reported as a loss.
as usual, we pretty much have to just wait about 12 fucking years until the whiners die out and the AI zoomers take over, then they'll start making a bunch of youtube videos finally correcting the record for all the misinformation being spread.
>>106499580
Poor anatomy and other Chroma problems seem about the same as the regular models
>>106499604
Problem is radiance is slow. And raising the resolution increases memory requirements like crazy.
Anonymous
9/6/2025, 9:41:10 AM
No.106499624
>>106499655
>>106499613
Whoa. It's almost like the Vae exists for a reason.
Anonymous
9/6/2025, 9:42:34 AM
No.106499634
>havent updated in months because comfy runs fine
>start it up, suddenly it crashes when loading a specific controlnet
>switch to different controlnet and it works fine
Wha
Anonymous
9/6/2025, 9:43:15 AM
No.106499638
>>106499592
Yeah the 7B model is crazy good.
Anonymous
9/6/2025, 9:43:30 AM
No.106499641
>>106499665
>>106499585
>The judge explicitly ruled that training on protected works is fair use which is the biggest win for AI bros
then why do they have to pay to use it? that's the fucking problem
Anonymous
9/6/2025, 9:43:59 AM
No.106499645
>>106499604
Unfortunately it looks like the biggest thing hoped to improve with Radiance isn't any better. Small, high frequency details are still get melted and deformed. Not using VAE isn't helping. I think it was already doomed when they decided to train with 512x.
>>106499613
I am overly GPU-rich so I didn't really notice, but I thought skipping the VAE could have lead to speed improvements eventually?
Anonymous
9/6/2025, 9:45:43 AM
No.106499655
>>106499762
>>106499624
>Whoa. It's almost like the Vae exists for a reason.
for edit models, vae is a disaster, you want the model to only modify certain parts of the image but with a vae you have a compression loss on all the pixels, vae-less is the way to go for edit models
Anonymous
9/6/2025, 9:48:08 AM
No.106499665
>>106499678
>>106499596
I cannot think of a bigger win for training other than forcing those included in datasets to pay the trainers. It's that big.
>yeah we won but it'll be reported as a loss.
True but in time, it won't.
>>106499641
>then why do they have to pay to use it?
Not to use it, anon. To obtain it. I agree it's still gay and retarded but until the entire copyright apparatus is taken down that's the way it'll be.
Anonymous
9/6/2025, 9:50:27 AM
No.106499678
>>106499694
>>106499665
>Not to use it, anon. To obtain it. I agree it's still gay and retarded but
I don't think you realize how fucked up this is, every uncoming company will need billions of dollars to get the data needed to train their models, it'll kill everything, only giant companies will afford to do that, the US eldorado is over
Anonymous
9/6/2025, 9:50:36 AM
No.106499681
>>106497892
same seed and prompt, different epochs of a new version
need to figure out why noob lineart cnet crashes comfy tho
Anonymous
9/6/2025, 9:53:36 AM
No.106499694
>>106499678
>I don't think you realize how fucked up this is,
See
>>106499556
>this case further widens the gap between the big guys and the little guys (which is what the authors proclaim they are against kek).
It sucks but I'm a half glass full kind of person. It does prove that the artists and authors suing are either 1. hypocrites or 2. being fooled by large copyright holders but we've all suspected as such already.
Anonymous
9/6/2025, 9:58:35 AM
No.106499722
>>106499740
>>106499556
>It's actually good in the sense that they're only paying for illegally downloading the books, not for using them in training.
3000 dollars for a single book? really? they're not paying to buy a book, they're paying the extra to use it for training, how is that "fair use"? the judge is fucking RETARDED
Anonymous
9/6/2025, 9:58:35 AM
No.106499723
>>106499734
Anonymous
9/6/2025, 9:59:56 AM
No.106499734
>>106500904
>>106499723
What I've been seeing going around is the first frame being the image and the second frame being the box display, so like the character walks up on to the desk.
Anonymous
9/6/2025, 10:01:43 AM
No.106499740
>>106499748
>>106499755
>>106499722
Put yourself in Anthropic's shoes. Spending 1.5b to save 183b is a steal.
>the judge is fucking RETARDED
Pretty sure that number was reached between the two parties. Actually I think it was Anthropic that came out and said "that's fine we'll pay it.
Anonymous
9/6/2025, 10:03:35 AM
No.106499748
>>106499740
for anthropic it's fine, but this sets a precent, now every company that wants to replicate their success know they will have to first need billions of dollars to make their first model, this will be impossible for almost everyone, the US is dead, China has won
Anonymous
9/6/2025, 10:03:48 AM
No.106499751
>>106499648
>VAE could have lead to speed improvements eventually
Theory is supposed to learn better so technically but in practice I doubt it. We only have one pixel model in the wild and it's really meh cause it was undercooked, which I am assuming is from the high training resource this technique requires.
Anonymous
9/6/2025, 10:04:39 AM
No.106499755
>>106499740
$183 billion, but they are still operating at a loss, they are still not profitable.
Anonymous
9/6/2025, 10:04:49 AM
No.106499756
>>106499794
>>106499714
It kinda starting speeding up like crazy near the end but speech was pretty natural until then. Don't know who it is though.
Anonymous
9/6/2025, 10:06:05 AM
No.106499762
>>106499960
>>106499648
>Not using VAE isn't helping.
it is
>>106499655
Anonymous
9/6/2025, 10:11:14 AM
No.106499790
I bet if you're able to obtain your dataset via the clear web and not torrents or other illegal means it'd be fine.
Anonymous
9/6/2025, 10:11:41 AM
No.106499794
>>106500848
>>106499756
It's Dagoth Ur from morrowind. It sounds pretty much exactly like him but then again I think he's one of the easiest voices to replicate.
Anonymous
9/6/2025, 10:20:15 AM
No.106499836
>>106499856
Anonymous
9/6/2025, 10:23:11 AM
No.106499856
>>106499879
>>106499836
kek, who's voice is it?
Anonymous
9/6/2025, 10:29:13 AM
No.106499879
>>106499897
Anonymous
9/6/2025, 10:31:48 AM
No.106499897
>>106499941
>>106499879
so it managed to replicate the voice with only 7 seconds of examples? that's insane
Anonymous
9/6/2025, 10:39:29 AM
No.106499941
>>106499951
>>106499897
anudda victory for the OGs
Anonymous
9/6/2025, 10:40:37 AM
No.106499951
>>106499941
https://files.catbox.moe/81zqj5.flac
A short clip of megumin in Japanese from youtube.
Anonymous
9/6/2025, 10:42:43 AM
No.106499960
>>106499962
>>106499762
Good thing Chroma is an edit model oh wait
>>106499960
>Good thing Chroma is an edit model oh wait
that's funny because he wants chroma to be an edit model
https://xcancel.com/LodestoneE621/status/1963467050501992811#m
Anonymous
9/6/2025, 10:45:51 AM
No.106499981
>>106499962
>Base model still mangles anything and everything a lot of the time
>let's waste some more compute/money to become a bad editing model
Can he see one thing trough for once and make it good before his ADHD kicks in?
Anonymous
9/6/2025, 10:45:55 AM
No.106499982
>>106500029
>>106500466
>>106499962
This feels so ill conceived at this point. Chroma should be marked as done and he should use those resources on more promising models.
Anonymous
9/6/2025, 10:46:38 AM
No.106499987
>>106499993
remember when he said he was going to do a wan tune :(
Anonymous
9/6/2025, 10:47:47 AM
No.106499993
>>106499987
No, because I don't hang on to his every word like some people here seem to.
Anonymous
9/6/2025, 10:50:47 AM
No.106500006
Anonymous
9/6/2025, 10:52:16 AM
No.106500014
Anonymous
9/6/2025, 10:55:26 AM
No.106500029
>>106502328
>>106499982
this, at this point he should just finetune Qwen Image Edit so that it doesn't zooms in randomly, can do porn and has nice skin texture, it won't be too expensive and he will really be considered a legend, trying to undistill schnell was a giant mistake, you can't save distilled models, now it's obvious
Anonymous
9/6/2025, 10:56:49 AM
No.106500032
>>106500054
Chroma Radiance creates this mosaic artifact if you try to generate at very high resolution. Obviously it's not expected to actually produce good output in this case, but the artifact is interesting. It's almost like it's trying to extrapolate the scale of the pixels themselves instead of the number of pixels.
Anonymous
9/6/2025, 10:57:58 AM
No.106500050
What's the best way to make pixelated low resolution gens? Like anime style pixel art with flat colors and simple shadows.
Anonymous
9/6/2025, 10:58:32 AM
No.106500054
>>106500032
that's because this pixel method uses square patches, so it's literally some 16x16 mosaics, I wonder how they can fix that
Anonymous
9/6/2025, 10:59:40 AM
No.106500060
>>106500071
Anonymous
9/6/2025, 11:01:58 AM
No.106500071
>>106500076
>>106500060
kek, but for real though, indians will use this techology to reproduce a real american voice and their scam will be harder to notice
Anonymous
9/6/2025, 11:02:54 AM
No.106500076
>>106500083
>>106500071
I mean... have you seen the leadership at microsoft lately?
Anonymous
9/6/2025, 11:04:09 AM
No.106500083
>>106500093
>>106500076
I do, and I'm glad it's a jeet at the top, look at their fuckup, now we have a good model on the wild because if their incompetence (I still believe they did this shit in purpose to help the local ecosystem forward, like the """leak""" of llama1)
Anonymous
9/6/2025, 11:05:11 AM
No.106500088
Can you convert flux loras to Qwen?
Anonymous
9/6/2025, 11:05:34 AM
No.106500093
>>106500083
It's honestly really good. I can't believe I almost ignored this model.
Anonymous
9/6/2025, 11:06:01 AM
No.106500096
>>106500141
>>106500165
Since VibeVoice large is like 17gigs I'm currently trying to implement block swapping for the custom nodes I found so you can offload parts of the model to DRAM and run it on lower VRAM systems.
The normal nodes didn't offload anything for me and OOM during initialization of the model.
Can some Anon with 16gb VRAM tell me what kinda nodes they're using for the large model before I waste any more time?
Anonymous
9/6/2025, 11:07:52 AM
No.106500105
Anonymous
9/6/2025, 11:13:15 AM
No.106500137
Anonymous
9/6/2025, 11:13:58 AM
No.106500141
>>106500146
>>106500096
What is that node anyway? The node I'm using doesn't look like that
Anonymous
9/6/2025, 11:14:56 AM
No.106500146
>>106500159
>>106500141
That's from
https://github.com/Enemyx-net/VibeVoice-ComfyUI
That's why I'm asking, what nodes are you using?
Anonymous
9/6/2025, 11:15:10 AM
No.106500149
Anonymous
9/6/2025, 11:16:07 AM
No.106500153
>>106500188
>>106500423
Anonymous
9/6/2025, 11:17:18 AM
No.106500159
>>106500176
>>106500781
>>106500146
https://github.com/wildminder/ComfyUI-VibeVoice/commits/main/
I can't help you though, I'm on 24gb and it just works. The node has a quantization option for lower vram users though. Good luck with your block swapping.
Anonymous
9/6/2025, 11:19:03 AM
No.106500165
>>106500176
>>106500096
I'm on 12gb and I use quants.
Anonymous
9/6/2025, 11:20:47 AM
No.106500176
>>106500159
Yeah, block swapping works already with the code I wrote, so it loads on 16GB VRAM, but I'm still doing the compute on DRAM right now because I think I'm using wrong logic to get the source of the tensor.
But your node seems to have offloading already... so I just did that for nothing.
Joke's on me for not searching for another node and relying on fucking plebbit of all places.
Thanks man.
>>106500165
I'd rather run full models and offload most of the time.
Anonymous
9/6/2025, 11:21:11 AM
No.106500177
Anonymous
9/6/2025, 11:22:20 AM
No.106500183
>>106499962
>10 more epochs
he has zero patience and will introduce random experiments and distillations that will fuck it up halfway
Anonymous
9/6/2025, 11:23:30 AM
No.106500188
>>106500195
>>106500153
that doesn't sound like him at all
>t. looks at his videos everyday to get news of the new woke slop drama
Anonymous
9/6/2025, 11:26:02 AM
No.106500195
>>106500188
I think it's the cadence that ruins it.
Anonymous
9/6/2025, 11:30:52 AM
No.106500215
Some super lewd WAN gens
files.catbox.moe/ buvot5.mp4
files.catbox.moe/ opnfsd.mp4
files.catbox.moe/ ulztfl.mp4
came out pretty good I'd say
Anonymous
9/6/2025, 11:44:23 AM
No.106500279
>>106500283
>>106500261
>Put a space in the url
>Censored the images on catbox anway
You are a real piece of shit, you know that?
Anonymous
9/6/2025, 11:44:34 AM
No.106500281
>>106500289
>>106499714
vibevoice or chatterbox?
Anonymous
9/6/2025, 11:44:55 AM
No.106500283
>>106500485
>>106500279
I don't want a forced vacation ok
tryin to play it safe
Anonymous
9/6/2025, 11:46:18 AM
No.106500289
>>106500281
Everything in this thread is vibe voice.
Anonymous
9/6/2025, 11:56:42 AM
No.106500344
>>106500380
Anonymous
9/6/2025, 12:02:58 PM
No.106500380
>>106500344
ogopogo! ogopogo!
Anonymous
9/6/2025, 12:07:48 PM
No.106500413
Damn. VibeVoice works kinda alright for non-English languages.
Neat model.
Anonymous
9/6/2025, 12:08:34 PM
No.106500423
>>106500153
>https://files.catbox.moe/hdi8ls.flac
>Asmon comes out as trans.
Do a Total Biscuit WTF podcast, returning from the grave to review trash games.
Anonymous
9/6/2025, 12:13:11 PM
No.106500458
>taggui can't do batches
Literally what is the point then
Anonymous
9/6/2025, 12:13:35 PM
No.106500466
>>106499982
>>106499962
One more training bro
Anonymous
9/6/2025, 12:15:46 PM
No.106500485
>>106500283
anon, catbox is fine
Anonymous
9/6/2025, 12:16:34 PM
No.106500492
>having fun generating generic npcs
>the final boss appears
pretty rad
Anonymous
9/6/2025, 12:26:51 PM
No.106500561
>>106500261
why would you censor them
Anonymous
9/6/2025, 1:01:04 PM
No.106500745
>>106500780
>>106499032
Is this sarcasm or real? im thinking of increasing ram too
Anonymous
9/6/2025, 1:04:17 PM
No.106500757
>>106500781
the large vibevoice doesn't run with 16gb even with quants, meh
Anonymous
9/6/2025, 1:09:36 PM
No.106500780
>>106500745
I also increased ram from 64 to 128 but I have two gpu to feed, I don't think you need more than 64 for 1 gpu.
Anonymous
9/6/2025, 1:09:39 PM
No.106500781
>>106500789
>>106500757
I'm running it on 12 with this node
>>106500159
Anonymous
9/6/2025, 1:10:08 PM
No.106500784
>>106500796
What's so good about the voice model leaking? Can it do nsfw?
Anonymous
9/6/2025, 1:11:02 PM
No.106500789
>>106500781
I just get OOM. Oh my fucking god it's Comfy again isnt it
>>106500784
>leaking?
It didn't leak. It got the wizard LM 2 treatment. Nobody bothered to look at it until Microsoft pulled it for being too good. And yeah, it does NSFW.
It's just all around solid.
Anonymous
9/6/2025, 1:13:07 PM
No.106500807
>>106500816
>prompt WAN i2v
>action looked good
>increase length by 12 frames
>action changes completely
we will never get exactly what we want
Anonymous
9/6/2025, 1:13:19 PM
No.106500808
>>106500818
>>106500796
>And yeah, it does NSFW
can someone show a catbox of that
unless nsfw is just "I can make someone say tits ass"
Anonymous
9/6/2025, 1:14:09 PM
No.106500816
>>106500807
you mean you stitched two videos or you actually added 12 frames to the 81?
Anonymous
9/6/2025, 1:14:23 PM
No.106500818
>>106500838
Anonymous
9/6/2025, 1:14:44 PM
No.106500821
>>106500796
>Microsoft pulled it for being too good
Oh I see, so basically a happy accident.
Anonymous
9/6/2025, 1:14:51 PM
No.106500823
>>106501056
>>106499415
I can easily fap to this
>>106500818
That's actually better than anything else I've heard locally. Can you control emotions? Or just write text and it figures it out?
>>106499420
>the corrected 7b model is "on the way", it'll be a lobotomized version of the one we already have lol
This will be funny to watch, and see the differences, so we'll know if it's the nsfw abilities or the "too good for local" that triggered its deletion.
Anonymous
9/6/2025, 1:18:53 PM
No.106500848
>>106499794
the old 11labs ones are better imo
Anonymous
9/6/2025, 1:19:36 PM
No.106500854
>>106500838
Basically you have to have those kinda sound in the reference and it kinda figures how to implement them through context.
Anonymous
9/6/2025, 1:19:54 PM
No.106500856
>>106500838
>Can you control emotions?
no but if the base model is good people will figure out ways to create ""loras"" out of them
would be happy to hear a jav one for sure
Anonymous
9/6/2025, 1:26:28 PM
No.106500885
comfy fork when
Anonymous
9/6/2025, 1:27:10 PM
No.106500889
>>106500898
>>106500796
>Nobody bothered to look at it until Microsoft pulled it for being too good. And yeah, it does NSFW.
congrats Microsoftrannies, the streisland is at full effect kek
Anonymous
9/6/2025, 1:29:02 PM
No.106500898
>>106500889
I downloaded it just because someone said they deleted it. I'm not even interested in voice but it had to be done.
Anonymous
9/6/2025, 1:30:06 PM
No.106500904
>>106500796
> it does NSFW.
what kind of nsfw? just moaning
>>106500941
>what kind of nsfw? just moaning
I feel stupid for needing to ask this, but what kind of NSFW stuff do you expect beyond moaning for a voice model?
Anonymous
9/6/2025, 1:46:21 PM
No.106500981
>>106501017
>>106500961
nta but shlops and plaps
Anonymous
9/6/2025, 1:49:59 PM
No.106501005
Anonymous
9/6/2025, 1:50:55 PM
No.106501017
Anonymous
9/6/2025, 1:59:10 PM
No.106501056
>>106501099
>>106500823
very nice style mix
Anonymous
9/6/2025, 2:00:58 PM
No.106501068
>>106501091
>>106499455
>I just realized WAN's generation are tuned for 16 fps
nigger this was explicitly known since wan 2.1
Anonymous
9/6/2025, 2:04:44 PM
No.106501091
>>106501068
the wan 2.2 5b model is 24fps though
Anonymous
9/6/2025, 2:08:15 PM
No.106501106
>>106501028
Wow, this model is nuts
Anonymous
9/6/2025, 2:08:48 PM
No.106501109
>>106501124
https://xcancel.com/bdsqlsz/status/1964279441305030725#m
seems like one of the 2 edit chink models that will be released (one of them will be local) is Seedream 4.0
Anonymous
9/6/2025, 2:10:09 PM
No.106501124
>>106501129
>>106501028
>>106501109
do you think the normies will accept the fact the Simpsons will use the Homer/Marge original AI voices once the actors will die of old age?
Anonymous
9/6/2025, 2:10:20 PM
No.106501126
>>106501319
>>106497264 (OP)
Why haven't you trained that LoRA yet?
>>106501028
Decent Homer
Anonymous
9/6/2025, 2:10:47 PM
No.106501129
>>106501124
I give marge weeks before she is dead. Have you heard her lately?
Anonymous
9/6/2025, 2:11:14 PM
No.106501134
>>106501116
good on them for showing failure cases, not many model makers do that
Anonymous
9/6/2025, 2:12:10 PM
No.106501137
>>106501155
Anonymous
9/6/2025, 2:14:55 PM
No.106501155
>>106501137
>Gollum
desu this should be the benchmark for audio models, he has the perfect voice to test out a model's limit
Anonymous
9/6/2025, 2:20:15 PM
No.106501194
>>106501116
the left one is impressive
Anonymous
9/6/2025, 2:22:00 PM
No.106501203
>>106501197
Oops, I changed "dream" to "gen" and it hit the wrong way. This is what I meant to post
>>106499648
>Not using VAE isn't helping
the entire chroma finetuning project amounts to lodestone going "guys, I may have conceived an idea most ingenious!" only to find out there's a reason no other model does it.
Anonymous
9/6/2025, 2:27:23 PM
No.106501240
>>106501099
This belongs to Anime Diffusion Thread.
Anonymous
9/6/2025, 2:27:43 PM
No.106501241
>>106501252
>>106501231
Donating to lodestones right now is basically throwing your money in a shredder.
Satisfying his curiosity on other's dime.
>>106501231
>>106501241
to be fair, I want a future without VAEs, so lodestrone trying that PixNerd paper and see if it actually works seems like a good experiment
Anonymous
9/6/2025, 2:31:43 PM
No.106501260
I am going to fucking kill myself
Anonymous
9/6/2025, 2:31:47 PM
No.106501261
>>106501272
Anonymous
9/6/2025, 2:32:52 PM
No.106501269
>>106501116
LOL! seedream 3 (mogao) is one of the top models right now. there is no way they release v4 openly. and even that still looks behind nano banana.
Anonymous
9/6/2025, 2:33:05 PM
No.106501272
>>106501231
idk it's cool that he explores in the open. It's not like lode's a know-it-all- he's just curious.
>>106501261
ty much better
Anonymous
9/6/2025, 2:36:10 PM
No.106501291
>>106501312
>>106501252
I haven't used a VAE in like a year
Anonymous
9/6/2025, 2:40:49 PM
No.106501312
>>106501336
Anonymous
9/6/2025, 2:41:46 PM
No.106501319
>>106501411
>>106501126
>Why haven't you trained that LoRA yet?
Been testing different settings. Adamw8bit + cosine seems to be the way to go. I don't know if OneTrainer knows Huber loss, would like to try with that. Chroma noise offset settings are also one mystery.
Anonymous
9/6/2025, 2:44:02 PM
No.106501335
>>106501231
I'm ashamed to say that but if I was on his shoes I would do the exact same shit, with a shit ton of money all I would do would be trying all the obscure papers and see if something sticks, there's probably gold in there
Anonymous
9/6/2025, 2:44:08 PM
No.106501336
>>106501345
>>106501312
idk just bein myself I guess
Anonymous
9/6/2025, 2:44:36 PM
No.106501338
Anonymous
9/6/2025, 2:46:40 PM
No.106501354
Anonymous
9/6/2025, 2:46:47 PM
No.106501355
>>106501345
I just tried it and it didn't do anything. I stopped when models started saying that they had the vae baked in.
Anonymous
9/6/2025, 2:50:12 PM
No.106501376
>>106501562
>>106501361
kek I was going to post that on twitter in a bit but this one is better, thanks
Anonymous
9/6/2025, 2:55:11 PM
No.106501411
>>106501462
>>106501319
>huber loss
I tried huber loss on diffusion pipe (quick code edit) a while back and it was worse. I only tested a few delta values, so ymmv
>>106501411
Have you tried multiple concepts per lora? I get pretty terrible character bleed
Anonymous
9/6/2025, 3:09:39 PM
No.106501508
>>106502741
>>106501462
>multiple concepts per lora
No, but it sounds fun! I'll try that today. Maybe the bleed can make new people with desired attributes
Anonymous
9/6/2025, 3:21:25 PM
No.106501574
>>106501603
>>106501562
can you get a jab from the left so we can see more of her face in pain?
Anonymous
9/6/2025, 3:24:04 PM
No.106501593
>>106501562
>>106501361
motion really highlights the downs syndrome features
Anonymous
9/6/2025, 3:25:52 PM
No.106501603
>>106501574
gotta go wagie
Anonymous
9/6/2025, 3:49:53 PM
No.106501788
>>106501834
>>106501462
Have you tried training separate loras and then merging them using some fancy concept-retaining (allegedly) merger like k-lora? Or even simple svd?
>>106501252
>a future without VAEs
What did the vaes ever did to you?
Anonymous
9/6/2025, 3:56:47 PM
No.106501834
>>106501788
>Have you tried training separate loras and then merging them using some fancy concept-retaining (allegedly) merger like k-lora? Or even simple svd?
first time hearing about such thing
Anonymous
9/6/2025, 4:20:37 PM
No.106502045
>>106502821
>>106497484
Very nice, abtract geometric art is underrated
Anonymous
9/6/2025, 4:25:53 PM
No.106502095
>>106497872
>>106497892
>>106498098
These are really good, very cool style as well
Friendship ended with chatterbox.
VIibeVoice is my new best friend.
https://files.catbox.moe/rhxshe.wav
Anonymous
9/6/2025, 4:35:12 PM
No.106502168
Anonymous
9/6/2025, 4:45:04 PM
No.106502264
>>106502143
Can it do Microsoft Ashley voice?
Anonymous
9/6/2025, 4:49:05 PM
No.106502295
>>106502326
>>106502143
Can the moans work on any voice or does it require moans in the sample vocie?
Anonymous
9/6/2025, 4:53:14 PM
No.106502326
>>106502379
>>106502295
The sample voice I used didn't have any moans. Took a few gens though and tweaking the script.
Anonymous
9/6/2025, 4:53:49 PM
No.106502328
>>106502339
>>106500029
>it won't be too expensive
are you stupid?
chroma is a 8.9b model and cost near 150k+
qwen is 20b, the training would then also take a magnitude of time to train
Anonymous
9/6/2025, 4:54:54 PM
No.106502339
>>106502429
>>106502328
he doesn't need millions of images to save qwen image edit though, the model is solid, it just need a few examples to learn porn and how to make normal skin
>>106500261
> space in url
> censored
never ever post here again
Anonymous
9/6/2025, 4:58:29 PM
No.106502379
>>106502326
Sick, time to gen some degen.
Anonymous
9/6/2025, 5:00:17 PM
No.106502397
i adore the fact that they tried to remove the model because it can degen so easily. if they hadn't, i would have never learnt about it.
thank you streisand effect
Anonymous
9/6/2025, 5:02:18 PM
No.106502416
>>106502357
>>106500261
>never ever post here again
this
Anonymous
9/6/2025, 5:03:54 PM
No.106502429
>>106502491
>>106502339
Yes, you need millions of real world images to remove the synthetic slop look of Qwen, it's arguably even more overtrained that Flux, with the exception of the 'flux chin'
Anonymous
9/6/2025, 5:11:49 PM
No.106502489
>>106502771
>>106502143
sick, how did you prompt the moans?
Anonymous
9/6/2025, 5:11:55 PM
No.106502491
>>106502548
>>106502429
>you need millions of real world images to remove the synthetic slop look of Qwen
I don't think you need that many images though
https://civitai.com/models/1927710?modelVersionId=2181911
Anonymous
9/6/2025, 5:12:28 PM
No.106502499
Anonymous
9/6/2025, 5:19:25 PM
No.106502548
>>106502491
But this is just a small scope lora for a specific look, with very limited variation and applied on a concept Qwen already knows well (non-nude human beings), it nowhere near a full finetune.
If you are happy with this then why are you clamoring for said full finetune of qwen to begin with ? Just download / train loras.
Anonymous
9/6/2025, 5:20:02 PM
No.106502552
>>106501795
eh compresses my image and doesn't afraid of anything
Anonymous
9/6/2025, 5:25:18 PM
No.106502588
>>106503383
uninstalled
Anonymous
9/6/2025, 5:27:15 PM
No.106502604
>>106502646
>>106502357
its takes 2 seconds to fix the url
Anonymous
9/6/2025, 5:27:53 PM
No.106502611
>>106501795
Lose detail, particularly in complex anatomy like hands
Anonymous
9/6/2025, 5:31:01 PM
No.106502646
>>106502656
>>106502604
and yet the videos are still censored
Anonymous
9/6/2025, 5:31:52 PM
No.106502656
>>106502725
>>106502646
Well I thought I would get offd for 3 days if they werent!
Anonymous
9/6/2025, 5:32:44 PM
No.106502667
>>106502683
>>106502430
VibeVoice-ComfyUI seems to be more updated but I have no idea if it's better.
I need to try it, the samples posted here are pretty titillating.
>>106502667
the only thing i find interesting is the moans and being able to direct it more through prompts. quality wise it's pretty bland compared to rvc+xtts using alltalk
Anonymous
9/6/2025, 5:40:01 PM
No.106502725
>>106502656
fair. the jannies here are unhinged and ban based on mood.
anyone have a prompt guide for vibevoice? i can't find shit.
Anonymous
9/6/2025, 5:40:19 PM
No.106502728
>>106502744
>>106502683
so alltalk is better in everything except moans and lewd voices?
Anonymous
9/6/2025, 5:41:39 PM
No.106502741
>>106503275
>>106501462
>character bleed
Confirmed bleeding all over the place
>>106501508
>new people with desired attributes
It kinda happened. Excuse the cataracts from 512px training. It was a quick 1 1/2hr run just to see what happens
Anonymous
9/6/2025, 5:42:43 PM
No.106502744
>>106502792
>>106502728
imo yes, from the samples posted here.
actually no, rvc and xtts can't generate as long but eh.
Anonymous
9/6/2025, 5:45:46 PM
No.106502771
>>106502489
Just ohhh, mmm, hmm that kind of stuff. I think contextually the words in the script helped it figure out what I wanted.
Anonymous
9/6/2025, 5:47:44 PM
No.106502785
Baker?
Anonymous
9/6/2025, 5:48:30 PM
No.106502792
>>106502824
>>106502744
RVC requires a sample to convert and xtts tends to be extremely monotone. I find the emotive aspects of VibeVoice way better than xtts, not just the moaning and stuff, breathing, inflection of voice, that kind of stuff is better.
Anonymous
9/6/2025, 5:49:41 PM
No.106502799
>>106502855
Hmm...
Anonymous
9/6/2025, 5:52:05 PM
No.106502821
>>106502045
>Very nice, abtract geometric
thanks
Anonymous
9/6/2025, 5:52:29 PM
No.106502824
>>106502792
just finished downloading vibevoice.
i take everything i said back. vibevoice is gonna make me lose my balls.
it can flawlessly copy asmr voices without breaking.
it's so joever
>>106502799
>black Miku with jewish nose
...
Anonymous
9/6/2025, 5:57:12 PM
No.106502880
>>106502855
she's mixed race kek
Anonymous
9/6/2025, 5:58:25 PM
No.106502898
>>106502921
>>106502855
It's not a Jewish nose. The prompt was:
>an illustration of a dark-skinned Hatsune Miku sitting at a table and reading a book. She has black African features, including a big flat nose. The book's title is just "DAS RITE KAPITAL". Behind her on the wall is the logo of the Black Panther Party.
And I meant a very different nose. Chroma just decided that that means the nose of a proboscis monkey.
Anonymous
9/6/2025, 6:00:56 PM
No.106502921
>>106502974
>>106502898
Anyway, I shouldn't have blown my VERY FUNNY "Das rite Kapital" joke. which apparently no one else has ever made, on this gen.
Anonymous
9/6/2025, 6:05:57 PM
No.106502974
>>106502921
Surely one to go down in the anals of history
Anonymous
9/6/2025, 6:13:36 PM
No.106503063
I miss the times when the collage had no videos. Thinking of becoming a new reviled personality on /ldg/.
Anonymous
9/6/2025, 6:15:16 PM
No.106503077
>>106503249
no bake?
Anonymous
9/6/2025, 6:15:54 PM
No.106503085
>>106498746
as one of the two participants of genjam 3, i would like to say that yes, i am down for another round
Anonymous
9/6/2025, 6:31:39 PM
No.106503249
>>106503077
be the change you want to see, faggot.
Anonymous
9/6/2025, 6:33:21 PM
No.106503275
>>106503425
>>106502741
>Excuse the cataracts from 512px training
Have you tried 768? 1024 kills my machine
Anonymous
9/6/2025, 6:39:36 PM
No.106503353
>>106503381
I thought there were schizo shills patrolling here 24/7?? Make a new thread, I have stuff to say.
Anonymous
9/6/2025, 6:42:11 PM
No.106503381
Anonymous
9/6/2025, 6:42:27 PM
No.106503383
Anonymous
9/6/2025, 6:43:25 PM
No.106503397
>>106503466
>>106502683
I don't know what you are doing, but your posted example is much worse than what vibevoice 7b gives me
Anonymous
9/6/2025, 6:44:14 PM
No.106503408
Anonymous
9/6/2025, 6:45:38 PM
No.106503425
>>106503275
>Have you tried 768? 1024 kills my machine
Yes. 640 is decent too. I can go up to 1280 without block swapping on batch size 1 with float8 (24GB VRAM). Tested float8 with validation and it's within a 0.0001-0.0005 difference. The loss is so little that I just leave it on for flexibility
Anonymous
9/6/2025, 6:49:14 PM
No.106503466
>>106503397
wrong anon. i didn't post shit.
Anonymous
9/6/2025, 8:40:07 PM
No.106504711
>>106502430
where do you get the models from? looks like they're taken down from huggingface