/ldg/ - Local Diffusion General - /g/ (#106133377)

Anonymous
8/4/2025, 4:47:58 AM No.106133377
highlights_g_106130699_1754273667_thumb.jpg
highlights_g_106130699_1754273667_thumb.jpg
md5: 1bb00584519a747b6796df4ae649d845๐Ÿ”
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106130699

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Replies: >>106133533 >>106133661 >>106134205
Anonymous
8/4/2025, 4:50:15 AM No.106133401
>no genjam
it's over
Replies: >>106133777
Anonymous
8/4/2025, 4:51:23 AM No.106133410
why wanvideo video have no preview?
Replies: >>106133415
Anonymous
8/4/2025, 4:52:06 AM No.106133415
>>106133410
the org simply wanted us not to have it
Anonymous
8/4/2025, 4:55:16 AM No.106133428
1693062795574302
1693062795574302
md5: 17bab44dbbe3c1d71fcbcf6cb9b313fb๐Ÿ”
I'm completely lost for what's the best and fastest workflows and tricks to use wan 2.2.
Replies: >>106133438 >>106133468 >>106133487
Anonymous
8/4/2025, 4:56:50 AM No.106133438
>>106133428
just use the all in one and use it like 2.1
Replies: >>106133505
Anonymous
8/4/2025, 5:03:13 AM No.106133468
>>106133428
for what it's worth i was I2Ving a picture of a woman sitting with her legs spread trying to get her to turn around, and with the rapid AIO workflow I got a a dozen gens of her reverting to the starting position or weird body morphing. with kijai's it got it on the first try but it's kinda bright and washed out
Replies: >>106133505
Anonymous
8/4/2025, 5:05:57 AM No.106133487
>>106133428
just wait new light lora for 2.2. for now, t2v gens are meh. only i2v worth it
Replies: >>106133505
Anonymous
8/4/2025, 5:06:32 AM No.106133492
WVI2V_CC_INT_03-08-25-22-53_00001_thumb.jpg
WVI2V_CC_INT_03-08-25-22-53_00001_thumb.jpg
md5: 8e593a1aff9f648dd299ae9099f07933๐Ÿ”
Replies: >>106133500
Anonymous
8/4/2025, 5:07:23 AM No.106133500
>>106133492
why post a fuck-up? do you need help or something?
Anonymous
8/4/2025, 5:07:54 AM No.106133503
ComfyUI_00831_
ComfyUI_00831_
md5: aee49649165d1982e00c2b9b5f588b2d๐Ÿ”
>>106133101
>>106133308
Well I got deformed output when I enabled v-param as well (same seed, nothing else changed).
So does this mean scaled v pred loss is mandatory? Or is it conflicting with stuff like Min SNR gamma, pyramid noise, etc?
I am about the test the first hypothesis now but wanted to ask in case it doesn't result in any fix.
Replies: >>106133704 >>106134424
Anonymous
8/4/2025, 5:08:18 AM No.106133505
>>106133468
>>106133438
>>106133487
Guess I'll try kijai, hopefully it's not completely obtuse with weird nodes.
I'm mostly i2v anyway.
Replies: >>106133557
Anonymous
8/4/2025, 5:09:50 AM No.106133514
1753826115530323
1753826115530323
md5: 561a123be8cc7f6590e93520243157fd๐Ÿ”
EVERY DAY UNTIL I DIE
Replies: >>106133528 >>106134197
Anonymous
8/4/2025, 5:10:33 AM No.106133517
WVI2V_CC_INT_03-08-25-23-01_00001_thumb.jpg
WVI2V_CC_INT_03-08-25-23-01_00001_thumb.jpg
md5: d6e9c70a3a9805ecc7cff3123916e95e๐Ÿ”
Replies: >>106133531
Anonymous
8/4/2025, 5:12:02 AM No.106133528
>>106133514
You know vegans are a minority when you see them constantly telling everyone else how veganism is amazing.
Anonymous
8/4/2025, 5:12:33 AM No.106133531
>>106133517
we can't help you if you don't type the issues you're having anon. I get you are struggling to get a good output but you need to show us the nodes so we can fix it
Replies: >>106133538
Anonymous
8/4/2025, 5:12:39 AM No.106133532
ComfyUI_08223_
ComfyUI_08223_
md5: 12eb84221b85a49dd8b0a03645da26f6๐Ÿ”
Replies: >>106134193
Anonymous
8/4/2025, 5:12:56 AM No.106133533
>>106133377 (OP)
checkd
Anonymous
8/4/2025, 5:13:53 AM No.106133538
>>106133531
that's wanschizo. he actually thinks that's good
Anonymous
8/4/2025, 5:15:37 AM No.106133550
WVI2V_CC_INT_03-08-25-23-04_00001_thumb.jpg
WVI2V_CC_INT_03-08-25-23-04_00001_thumb.jpg
md5: 4218fed31e365124aac12a766b4f478f๐Ÿ”
Anonymous
8/4/2025, 5:16:59 AM No.106133557
>>106133505
if you like i2v, comfyui wan.2.2 workflow is enough for fun
Replies: >>106133566
Anonymous
8/4/2025, 5:17:17 AM No.106133558
why does this hobby attract so many mentally ill schizos?
Anonymous
8/4/2025, 5:18:16 AM No.106133566
>>106133557
Yeah, I'm mostly thrown off by latest speed enhancing lora thing, and the fact wan2.2 is actually 2 models.
Anonymous
8/4/2025, 5:21:20 AM No.106133578
>torch.OutOfMemoryError: CUDA out of memory.
I'm getting this message on both Easy Scripts and Kohya_ss while trying to train an Illustrious Lora. Any ideas as to what might be causing it? I have 12GB of VRAM and I feel like this error shouldn't be happening.
Replies: >>106133631 >>106133704
Anonymous
8/4/2025, 5:27:41 AM No.106133609
WVI2V_CC_INT_03-08-25-23-17_00001_thumb.jpg
WVI2V_CC_INT_03-08-25-23-17_00001_thumb.jpg
md5: b8e87b4b8755beb7673f30a4756dbeb1๐Ÿ”
Replies: >>106133761
Anonymous
8/4/2025, 5:32:01 AM No.106133631
>>106133578
It could be so many things it makes my head spin just thinking about it.

The first and most likely issue is that you're training at too high a resolution, rank and or batch size for your card.
After that it could be anything. Could be your torch version, could be your driver, could be anything.
Replies: >>106133686
Anonymous
8/4/2025, 5:36:42 AM No.106133658
Man the WanVideoWrapper really doesn't like my 12GB VRAMlet ass. I always get an OOM even with GGUFs and block swap set high.
Replies: >>106133664
Anonymous
8/4/2025, 5:37:10 AM No.106133661
>>106133377 (OP)
anything that can run on AMD hardware?
Replies: >>106133672 >>106133912
Anonymous
8/4/2025, 5:37:19 AM No.106133664
>>106133658
>really doesn't like my 12GB VRAMlet ass
I don't like your vramlet ass either.
Replies: >>106133867
Anonymous
8/4/2025, 5:38:20 AM No.106133672
>>106133661
I'll get you a box of crayons and you can pretend to gen with us.
Anonymous
8/4/2025, 5:40:20 AM No.106133686
>>106133631
For once I am thankful for redditors because I just stumbled on a thread that solved the problem. I had to turn on Gradient Checkpointing and Cache text encoder outputs in Kohya and it started training. I spent almost all day on this trying to learn how to train a lora as well as troubleshooting that fucking OOM error message.
Replies: >>106133700 >>106133704
Anonymous
8/4/2025, 5:41:43 AM No.106133700
>>106133686
Yeah that would almost certainly be an issue if those weren't turned on.
Anonymous
8/4/2025, 5:42:17 AM No.106133704
ComfyUI_00832_
ComfyUI_00832_
md5: 210d163344057a56442585620a2f71ae๐Ÿ”
>>106133503
Well it is still bad.
I hope it doesn't take too long to trial and error what is causing this...
>>106133578
Batch size? Shouldn't be higher than 2 I think.
LR predicting Prodigy uses extra VRAM.
Should enable Xformers and gradient checkpointing probably.
Also 12gb vramlet. I am also a noob trying to figure this out
>>106133686
Oh well glad you figured it out.
Replies: >>106134424
Anonymous
8/4/2025, 5:42:53 AM No.106133710
1726741065835846_thumb.jpg
1726741065835846_thumb.jpg
md5: c19b38dcba334439a7c04531859fc2ed๐Ÿ”
I have put every word ever that describes lip movement and talking into negative prompt but she still moves her mouth. I got the boobs smaller but it seems talking is not possible to fix with just prompts.
Replies: >>106133733 >>106133734 >>106133784
Anonymous
8/4/2025, 5:43:15 AM No.106133714
i will suck off a chinaman if they release a model that can extend videos with no degradation or color shift
Anonymous
8/4/2025, 5:46:24 AM No.106133733
>>106133710
but did you do it in chinese?
Replies: >>106133766
Anonymous
8/4/2025, 5:46:25 AM No.106133734
>>106133710
If you put closed mouth, muted or any synonym of it? Dunno never used wan but have you tried?
Replies: >>106133766
Anonymous
8/4/2025, 5:47:50 AM No.106133743
3090 bros i didn't get any boost from moving cuda 12.8 to cuda 12.6...
Replies: >>106133917 >>106134395
Anonymous
8/4/2025, 5:50:40 AM No.106133761
>>106133609
Why do you like Asuka? I like Rei because stoic, unemotional women melt my heart. But Asuka's fans make me curious.
Anonymous
8/4/2025, 5:52:07 AM No.106133766
>>106133733
yes I tried chinese translation too
>>106133734
pos: closed mouth, nonverbal, silent, holds her breath
various things like that
neg: talking, speaking, mouth, mouth movement, lips moving, inside mouth, throat, lips, teeth, tongue, open mouth, open smile, mouth animation, moving mouth, gums, screaming, shouting, etc.
The neg does seem to get rid of things like teeth and make the mouth a bit smaller but it never stops a seed that has mouth animation.
Anonymous
8/4/2025, 5:53:23 AM No.106133777
Colla2_thumb.jpg
Colla2_thumb.jpg
md5: 38577073f1b36749eecdd9e7a85f4041๐Ÿ”
GenJam2 is GO.

Album: https://e.pcloud.link/publink/show?code=kZox1EZMxwWS1tRTwhF3jQElno88yLqtv97

>>106133401
Slept in.
Replies: >>106133787 >>106133797 >>106133814 >>106134069
Anonymous
8/4/2025, 5:53:46 AM No.106133780
202501030047_thumb.jpg
202501030047_thumb.jpg
md5: f6f6aa7328ba52993af46618a15f8580๐Ÿ”
>>106133088
made a comparison with new WF, non light is 30 steps total

https://files.catbox.moe/3rxj1k.json
Replies: >>106133807 >>106134401
Anonymous
8/4/2025, 5:54:07 AM No.106133784
>>106133710
Did you try shift the focus of atention to sonething else to the AI?
Anonymous
8/4/2025, 5:54:38 AM No.106133787
>>106133777
nice
Anonymous
8/4/2025, 5:56:43 AM No.106133797
>>106133777
Also welcoming any volunteer collage makers.
Anonymous
8/4/2025, 5:57:19 AM No.106133799
Misato best girl
Anonymous
8/4/2025, 5:58:46 AM No.106133807
>>106133780
Am I retard if I say I like the light one better?
Replies: >>106133834 >>106133848
Anonymous
8/4/2025, 5:58:59 AM No.106133809
Should I spin for GenJam 3 now or wait for euros to wake up? I'm thinking the latter.
Anonymous
8/4/2025, 5:59:31 AM No.106133813
file
file
md5: d2019fb5b6390adc712e68858f9b8ad2๐Ÿ”
where does Wan2.2 save files?
also this line should be changed in the wan_autoinstall.bat file, this is an old version of triton that didn't work with my PyTorch
Replies: >>106133817 >>106133850
Anonymous
8/4/2025, 5:59:40 AM No.106133814
>>106133777
Also by the way you can still submit post-deadline. No hard deadlines on any of these; I'll just add it to the album.
Anonymous
8/4/2025, 6:00:53 AM No.106133817
>>106133813
nigga that depends on what ui you're using
Replies: >>106133919
Anonymous
8/4/2025, 6:01:50 AM No.106133821
oo oo there should be prizes for winning gen jam like amazon gift cards and GPUs
Anonymous
8/4/2025, 6:02:58 AM No.106133834
>>106133807
no?
Anonymous
8/4/2025, 6:05:32 AM No.106133848
>>106133807
On average quick gen distills will always be worse by their very nature but they can occasionally gen better than normal steps by sheer chance.
So no, you are not retarded.
Anonymous
8/4/2025, 6:06:03 AM No.106133850
>>106133813
output folder, or "temp" folder if you're using kijai's workflow (save_output should be switched on or you'll lose your gens)
Replies: >>106133863 >>106133919
Anonymous
8/4/2025, 6:07:59 AM No.106133860
1749952523221934
1749952523221934
md5: e08eaf37d50d6c213f9f0a16dbd2d33c๐Ÿ”
yt women b like
Replies: >>106133864 >>106135272
Anonymous
8/4/2025, 6:08:08 AM No.106133863
>>106133850
The gens in my heart are never lost.
Anonymous
8/4/2025, 6:08:18 AM No.106133864
>>106133860
delete this
Anonymous
8/4/2025, 6:08:30 AM No.106133867
p01192_pic2_737648ac4f26cf34
p01192_pic2_737648ac4f26cf34
md5: 44aa0bb689a6b62c78ff7348c235b41a๐Ÿ”
>>106133664
Wait until you get a hold of me.
Anonymous
8/4/2025, 6:08:53 AM No.106133869
>most character loras on civitai are 200mbs+
>mine is 40mb
Can someone explain? Did they do like 500 images with like 20 repeats and 5 batches or something?
Replies: >>106133877 >>106133890
Anonymous
8/4/2025, 6:10:35 AM No.106133877
>>106133869
what rank did you train it at? that is all that matters for size, for smaller / not complicated loras a lower rank is ok
Replies: >>106133887
Anonymous
8/4/2025, 6:12:44 AM No.106133887
>>106133877
rank?

Sorry I'm new to this.
Replies: >>106133895
Anonymous
8/4/2025, 6:13:05 AM No.106133890
>>106133869
Rank dictates the size of the LoRA.
Anonymous
8/4/2025, 6:14:20 AM No.106133895
>>106133887
Basically how much of the model's layers you actually trained. For most simple stuff rank 32 / 64 is plenty, some people do 128 or even 256 for small gains in quality
Replies: >>106133902
Anonymous
8/4/2025, 6:16:23 AM No.106133902
file
file
md5: 46918780434d4f9a264043ef2159e1ad๐Ÿ”
>>106133895
This setting?
Replies: >>106133910 >>106133922
Anonymous
8/4/2025, 6:17:34 AM No.106133910
>>106133902
yes, 8 should be ok if its just something like a single subject, for styles / concepts / multi concepts you would want more
Replies: >>106133935
Anonymous
8/4/2025, 6:17:37 AM No.106133912
>>>>106133661
Wanvideo on ComfyUI works on a 7900 XTX based on my testing. You need to install ROCm Pytorch on a Linux distro however, and VAE Decode is super slow unless you run tiled decode on the minimum possible tile size.
Anonymous
8/4/2025, 6:18:26 AM No.106133917
>>106133743
3090 on 12.9 here & saw that about 12.6, whats a good baseline test people are using?

14b fp8 848x480 81 frame 8 step vids here are 150 seconds (though it depends on the sampler/scheduler)
Replies: >>106134395
Anonymous
8/4/2025, 6:18:45 AM No.106133919
WanVideo2_2_I2V_00002_thumb.jpg
WanVideo2_2_I2V_00002_thumb.jpg
md5: fd04f6a4cc806089fdc0d4b9776e3342๐Ÿ”
>>106133850
>>106133817
OK I found it, for some reason it made a ComfyUI/ComfyUI/output folder
Replies: >>106133932
Anonymous
8/4/2025, 6:19:04 AM No.106133922
>>106133902
Just be aware, bigger =\= better. For most purposes 32 - 64 is more than enough, 128 if you're sure about it. The higher you go, the more likely you are to just deep fry (furk) your model.
Replies: >>106133933
Anonymous
8/4/2025, 6:19:17 AM No.106133923
i'm trying kijai's 2.2 workflow with both e4m3fn and e5m2 models on the same seed with my prompt of "woman turns around". the outputs are similar but sometimes significantly different. in one case she turns clockwise on one model and counter-clockwise with the other model. in another case the outputs are very similar but with e4m3fn her butt jiggled more. i don't know if one is necessarily better than the other but i'm leaning towards e4m3fn
Replies: >>106133957
Anonymous
8/4/2025, 6:19:34 AM No.106133927
still working on the WF, might be better to trade some high noise steps for some low noise ones
Anonymous
8/4/2025, 6:20:34 AM No.106133932
>>106133919
what does your video combine/save video node look like?
Anonymous
8/4/2025, 6:20:35 AM No.106133933
>>106133922
not if you adjust alpha accordingly, use about the same to double the rank
Replies: >>106133944
Anonymous
8/4/2025, 6:20:46 AM No.106133935
>>106133910
hmm kk thanks. Ill have to experiment further tommorow. Trying to make a lora for a character that looked fine in Pony models but looks like an ultra generic chinese doll in Illustrious models. The first lora I made is making a small difference but I need it to be a stronger change. I may also just need to create a better dataset.
Anonymous
8/4/2025, 6:21:28 AM No.106133938
Any other 3090 users here worried about the trend of higher cuda versions just being a straight downgrade?
I had to downgrade my cuda for diffusion pipe yesterday because it wouldn't let me train on multiple GPUs on 12.8
Replies: >>106133952
Anonymous
8/4/2025, 6:22:28 AM No.106133944
>>106133933
Using the alpha to account for the rank means you probably should have just used half the rank in the first place.
Replies: >>106133972
Anonymous
8/4/2025, 6:23:51 AM No.106133952
>>106133938
I doubt there's much point in upgrading CUDA versions anymore for a 3090 anyway. The most you could optimize is using SageAttention instead of default attention for inferencing (video gen).
Anonymous
8/4/2025, 6:25:13 AM No.106133957
>>106133923
They are different trade offs of representing 8-bit float precision. In general neither should be strictly "better" than the other.
>i don't know if one is necessarily better than the other but i'm leaning towards e4m3fn
This will typically require a lot of testing to say confidently, but roll with it if you like it.
I don't see why it will matter across many gens, just roll with one.
Replies: >>106133979
Anonymous
8/4/2025, 6:28:36 AM No.106133972
>>106133944
higher rank really is higher quality though if you dont burn it, also faster training
Replies: >>106134001
Anonymous
8/4/2025, 6:30:18 AM No.106133979
>>106133957
From what I remember, there's a "e4m3fn fast" option available too for quantization. There's a negligible difference in performance, so I just go with whatever generates the fastest.
Anonymous
8/4/2025, 6:33:25 AM No.106133994
Can we get a T2IV 5B workflow and guide in the new wan rentry OP?
Anonymous
8/4/2025, 6:35:01 AM No.106134001
>>106133972
That is a big if.
Anonymous
8/4/2025, 6:35:03 AM No.106134002
use case for krea?
Replies: >>106134008 >>106134009 >>106134016
Anonymous
8/4/2025, 6:36:02 AM No.106134008
>>106134002
Schizo collages.
Anonymous
8/4/2025, 6:36:25 AM No.106134009
>>106134002
shilling for bfl but that's about it
Anonymous
8/4/2025, 6:37:19 AM No.106134016
>>106134002
diy stock images
Anonymous
8/4/2025, 6:40:18 AM No.106134027
WanVideoWrapper_T2v_00016_thumb.jpg
WanVideoWrapper_T2v_00016_thumb.jpg
md5: fb2678b1a8b56f3cebc650fbf064da16๐Ÿ”
ok, giving the low noise model 1 starting step without light / with cfg was also helpful, still 12 total steps

https://files.catbox.moe/fmlcrd.json
Anonymous
8/4/2025, 6:47:35 AM No.106134049
202501030047_thumb.jpg
202501030047_thumb.jpg
md5: 669c2f2b7c85342e2ee8c882872b030a๐Ÿ”
new comparison, the gen time is like 10 secs off now but whatever
Anonymous
8/4/2025, 6:49:48 AM No.106134063
any tips for making your wan 2.2 gens not so frantic? im trying to make a girl shake her ass but she turns into like a bunny demon that shakes at like 500x the normal speed, basically all the motion is way too frantic and fast ??
Replies: >>106134081 >>106134166
Anonymous
8/4/2025, 6:50:36 AM No.106134069
>>106133777
fucking waldorf
Anonymous
8/4/2025, 6:52:20 AM No.106134081
>>106134063
just set the fps lower
Replies: >>106134210
Anonymous
8/4/2025, 6:54:36 AM No.106134094
>chroma v48
when will it finish training?
Replies: >>106134103 >>106134125
Anonymous
8/4/2025, 6:55:34 AM No.106134103
>>106134094
Chrome is done. This is it.
Anonymous
8/4/2025, 6:59:55 AM No.106134125
>>106134094
v49 & v50 will take months due to 1024x1024 training. it most likely won't be finished until october.
Replies: >>106134130 >>106134139 >>106134163
Anonymous
8/4/2025, 7:01:13 AM No.106134130
>>106134125
training is quadratic, so if its 2x as big it will take 4x as long, so if it was about every 4 days it will be about every 16 days per epoch. But that is if he uses the same amount of compute
Anonymous
8/4/2025, 7:02:09 AM No.106134139
>>106134125
why not 2048x2048
Replies: >>106134143
Anonymous
8/4/2025, 7:03:42 AM No.106134143
>>106134139
that would take 16x as long / 16x as much compute
Anonymous
8/4/2025, 7:05:28 AM No.106134149
This epoch for sure guys!
Replies: >>106134164 >>106134170 >>106134183
Anonymous
8/4/2025, 7:08:26 AM No.106134163
>>106134125
Very much doubt it, they will likely just throw more compute at the last two epochs, I would expect Chroma v49 to drop within days, and the final release to be this month
Anonymous
8/4/2025, 7:08:47 AM No.106134164
>>106134149
i mean its good at its res as is, just not as good as wan
Anonymous
8/4/2025, 7:09:23 AM No.106134166
>>106134063
Lower lora strength on the high noise model.
Replies: >>106134210
Anonymous
8/4/2025, 7:10:28 AM No.106134170
>>106134149
Yeah I am not hopeful at all for chroma lol.
Anonymous
8/4/2025, 7:12:22 AM No.106134182
I don't see the hate there, its amazing for artsy stuff and unlike midjourney its not censored, it being able to do 1024 x 1024 will make it the best local model in that regard
Replies: >>106135287
Anonymous
8/4/2025, 7:12:32 AM No.106134183
>>106134149
With two 1024 resolution epochs left to go, Chroma is already easily the best model behind Wan, and it's faster both to use and to train.

Loras give great results both in realism and artstyles, it's also uncensored meaning you don't have to endlessly fight the model when you want to do NSFW.

In short, it will be THE new community model for general purpose.
Anonymous
8/4/2025, 7:13:15 AM No.106134188
they always say it's great but they never post a gen
Replies: >>106134190
Anonymous
8/4/2025, 7:13:56 AM No.106134190
>>106134188
here is a few thousand, some of them are mine
https://civitai.com/models/1330309/chroma
Anonymous
8/4/2025, 7:14:58 AM No.106134193
>>106133532
yjk
Anonymous
8/4/2025, 7:16:09 AM No.106134197
>>106133514
Based kek
Anonymous
8/4/2025, 7:17:17 AM No.106134200
91302711
91302711
md5: dd896e088ebc5a8a7ec2d7756f634cf6๐Ÿ”
its better than midjourney already imo and if you want hardcore smut / copyrighted stuff unlike midjourney it wont stop you
Anonymous
8/4/2025, 7:18:00 AM No.106134203
light2xv for wan2.2 when??????
Replies: >>106134271
Anonymous
8/4/2025, 7:18:21 AM No.106134205
>>106133377 (OP)
>2.2 Guide: https://rentry.org/wan22ldgguide
is it me or is the links to the T2V workflows broken
Replies: >>106134251
Anonymous
8/4/2025, 7:18:22 AM No.106134206
five seconds is not enough reeeeeeeeee. chinks get on it
Anonymous
8/4/2025, 7:18:24 AM No.106134207
chroma_91684261
chroma_91684261
md5: de161e3be1940dfca0fc99e7de4e788b๐Ÿ”
Anonymous
8/4/2025, 7:19:02 AM No.106134210
>>106134081
i cant seem to find how to do that? do i lower the length on the gen? brand new to this and im using a prebuilt runpod that says it comes set to 60fps
>>106134166
ty ill try that
Replies: >>106134220
Anonymous
8/4/2025, 7:19:44 AM No.106134215
chroma_91143238
chroma_91143238
md5: 74134537d7f9c56ed6858e6e14c0e955๐Ÿ”
Anonymous
8/4/2025, 7:20:26 AM No.106134220
>>106134210
>do i lower the length on the gen?
no
>im using a prebuilt runpod
wut
Replies: >>106134228
Anonymous
8/4/2025, 7:21:14 AM No.106134221
chroma_00229_
chroma_00229_
md5: 81f9e69aaf3b2335218fd3401d815e91๐Ÿ”
Replies: >>106135047
Anonymous
8/4/2025, 7:23:00 AM No.106134228
>>106134220
i have a shit gpu so im using a cloud gpu service, people can prebuild this gpus with settings so they are easy to use, the one im using is called Wan_i2V_60fps
Replies: >>106134238
Anonymous
8/4/2025, 7:23:04 AM No.106134229
chroma_3854399381
chroma_3854399381
md5: b4846e3e446718665e84cae0f5b903c2๐Ÿ”
Anonymous
8/4/2025, 7:24:24 AM No.106134237
chroma_00015_
chroma_00015_
md5: df82e720c85a241f8be43539aa90285c๐Ÿ”
Anonymous
8/4/2025, 7:24:27 AM No.106134238
>>106134228
that doesn't answer anything
Replies: >>106134249
Anonymous
8/4/2025, 7:25:33 AM No.106134241
chroma_00023_
chroma_00023_
md5: 895dec8fa0b4a89c52c06b38f9f93395๐Ÿ”
Anonymous
8/4/2025, 7:27:12 AM No.106134248
1731695807063451
1731695807063451
md5: 761df789e5a7ce87e0427a5998ab164c๐Ÿ”
Anonymous
8/4/2025, 7:27:18 AM No.106134249
>>106134238
i am saying within my comfy workflow i do not see the option to modify my fps
Replies: >>106134263 >>106134271
Anonymous
8/4/2025, 7:27:19 AM No.106134251
>>106134205
yeah theyre broken, im mostly doing T2I with the T2V models tho, pretty decent
Anonymous
8/4/2025, 7:30:43 AM No.106134263
>>106134249
fps is usually set in the final node which is typically video combine or save video, but how should anyone know what you're talking about if you give no information?
Replies: >>106134289 >>106134308
Anonymous
8/4/2025, 7:32:18 AM No.106134271
>>106134203
The only thing we know is that the lightx2v team is working on it.

>>106134249
What is your VAE Decode (or WanVideo Decode) node connected to? Post a screenshot.
Replies: >>106134289
Anonymous
8/4/2025, 7:36:47 AM No.106134289
decode
decode
md5: 5b8ad21e681c016b5201b2bc2d65bf1e๐Ÿ”
>>106134263
im sorry ive never used comfy so i am lost, i am trying to edit the fps in the final node now, i didnt think that would help as the preview is still really frantic, but hoping it works!
>>106134271
is this the one?
Replies: >>106134323 >>106134338
Anonymous
8/4/2025, 7:38:39 AM No.106134299
screenshot.1754285874
screenshot.1754285874
md5: f5c6f15e5a09f16a118ba9d1b4234e1e๐Ÿ”
reminder there are people on youtube getting half a million views just by making basic shit.
Replies: >>106134309 >>106134311 >>106134425
Anonymous
8/4/2025, 7:39:21 AM No.106134302
1748866125252738
1748866125252738
md5: b8b3b398867015a50693e105074a00d4๐Ÿ”
Replies: >>106134330
Anonymous
8/4/2025, 7:41:39 AM No.106134308
>>106134263
changing the final output fps seemed to help a lot, ty
Anonymous
8/4/2025, 7:41:48 AM No.106134309
>>106134299
You can tell they used midjourney because the results actually look good, unlike the shit in this thread.
Replies: >>106134548
Anonymous
8/4/2025, 7:42:10 AM No.106134311
Anime Biker_thumb.jpg
Anime Biker_thumb.jpg
md5: 60411a8607842d6b656531a5bff3ef31๐Ÿ”
>>106134299
Late 80's, early 90's anime artstyle is still the best artstyle to date. I can't wait for AI to be good enough that we can go back to it.
Replies: >>106135788 >>106135799
Anonymous
8/4/2025, 7:44:13 AM No.106134323
>>106134289
I think the biggest problem people have with learning Comfy is that the first workflow(s) they are exposed to is some insane spaghetti with tons of third-party nodes, 90% of which are superflous since the functionality already exist in base
Replies: >>106134331 >>106134337
Anonymous
8/4/2025, 7:46:31 AM No.106134330
>>106134302
Nice, reminds me of those old Japanese scroll illustrations
Replies: >>106134412
Anonymous
8/4/2025, 7:46:32 AM No.106134331
>>106134323
This is a big factor. I don't know why they do it, but many people who share their workflows have a fetish for making them overly complex and full of obscure nodes that offer zero functionality to the final output. It's like adding lucky charms to their outfit to look more impressive or something.
Anonymous
8/4/2025, 7:47:25 AM No.106134337
00
00
md5: 16454f1bb6502ef0f397dbdb5fc2874c๐Ÿ”
>>106134323
this is a real workflow that someone was proud of and posted for people to use
Replies: >>106134367 >>106134375 >>106134376 >>106134384 >>106134424
Anonymous
8/4/2025, 7:47:27 AM No.106134338
>>106134289
The framerate is set to 24, you want to change that to 16, without seeing the rest of nodes that handle the interpolation and their values I can't say for certain what the end result will be like.
Anonymous
8/4/2025, 7:51:51 AM No.106134367
>>106134337
I can smell the mental illness
Anonymous
8/4/2025, 7:52:47 AM No.106134375
>>106134337
well what does it do tho
Anonymous
8/4/2025, 7:53:07 AM No.106134376
>>106134337
>pic
That HAS to be a fucking joke.
Replies: >>106134378
Anonymous
8/4/2025, 7:53:38 AM No.106134378
>>106134376
https://www.reddit.com/r/comfyui/comments/1mg46fi/spaghettification/
Replies: >>106134405 >>106134427
Anonymous
8/4/2025, 7:54:48 AM No.106134384
>>106134337
holy nodes
Anonymous
8/4/2025, 7:58:04 AM No.106134395
>>106133743
>>106133917
Maybe it has been corrected since then, or maybe it's just impacting slower cards like 3060s.
To test it's easy, literally try any wan, fix the seed, and run it on 12.6,12.8,12.9 (is it out?).
You can easily do that by creating new venv and downloading pytorch for the version.
Replies: >>106135509
Anonymous
8/4/2025, 7:59:09 AM No.106134401
>>106133780
lightx2v fucks up the rain
Anonymous
8/4/2025, 7:59:51 AM No.106134402
so who won genjam 2?
Replies: >>106134408 >>106134411 >>106134419 >>106135036
Anonymous
8/4/2025, 8:00:12 AM No.106134405
>>106134378
I wonder how long it took to zoom out/in
Anonymous
8/4/2025, 8:01:00 AM No.106134408
>>106134402
feds won
Anonymous
8/4/2025, 8:01:12 AM No.106134411
>>106134402
me, I won
Anonymous
8/4/2025, 8:01:17 AM No.106134412
1752569146877762
1752569146877762
md5: 7684208688a40f3a7bad44c82e189295๐Ÿ”
>>106134330
Ty
Anonymous
8/4/2025, 8:02:21 AM No.106134419
>>106134402
i have seen no indication thus far that it was a contest
Anonymous
8/4/2025, 8:02:42 AM No.106134424
ComfyUI_00835_
ComfyUI_00835_
md5: cad1a54424687db8f47ee8b4902459a6๐Ÿ”
>>106133503
>>106133704
Well, I just can't get v param working. I am out of trivial ideas.
I guess it is possible that this shit in the sticky https://github.com/derrian-distro/LoRA_Easy_Training_Scripts is bugged (it doesn't seem to be too actively maintained anymore) or the other anon might have mislead me but whatever I am going to bed now.
I will figure this out another day...
>>106134337
I regularly waste hours making spergy experiments about shit no one cares about but I will never reach THIS level of autism.
Anonymous
8/4/2025, 8:02:47 AM No.106134425
>>106134299
that sakura looks really good
Anonymous
8/4/2025, 8:03:13 AM No.106134427
ComfyUI_temp_pvuqs_00004_
ComfyUI_temp_pvuqs_00004_
md5: 1d278a490344ac92ff31b6ceae2a5a64๐Ÿ”
>>106134378
>2700+ nodes
Replies: >>106134458
Anonymous
8/4/2025, 8:03:14 AM No.106134428
Trying to get lightx2v to work is pointless. It fucks up too much. Just wait till they update it.
Anonymous
8/4/2025, 8:03:16 AM No.106134429
jam'd and gen'd
Anonymous
8/4/2025, 8:07:35 AM No.106134458
>>106134427
this guy is probably employed to do this
Replies: >>106134481
Anonymous
8/4/2025, 8:11:01 AM No.106134481
>>106134458
That image has the same legal authority as an unemployment certificate from a previous employer.
Replies: >>106134500
Anonymous
8/4/2025, 8:13:18 AM No.106134500
>>106134481
it should be a case for why comfy shouldn't be employed
Replies: >>106134510
Anonymous
8/4/2025, 8:14:33 AM No.106134510
>>106134500
is comfy technically employed?
Replies: >>106134552
Anonymous
8/4/2025, 8:21:28 AM No.106134548
>>106134309
It doesn't look that good, the mouth animations are pathetic, the characters are otherwise mostly static, and there's a lot of asymmetry and blobbiness in the fine details
Replies: >>106134684
Anonymous
8/4/2025, 8:21:49 AM No.106134552
>>106134510
Depends on how he set up the company structure, but he owns the company, so it's 'employee' on paper only.
Replies: >>106134561
Anonymous
8/4/2025, 8:22:52 AM No.106134561
>>106134552
>he owns the company
technically, his ceo owns the company. he just has more equity than the drooling retards working under him
Anonymous
8/4/2025, 8:45:45 AM No.106134684
>>106134548
doesn't matter. that guy got 500k views. if he can maybe videos like that every 2-4 weeks, he'd be making $100k+ a year. ridiculous
Replies: >>106134703 >>106134705
Anonymous
8/4/2025, 8:49:19 AM No.106134703
>>106134684
I made one of those harry potter Balenciaga videos. It got like a million views, I got monetized and then like three days later You tube demonetized me permanently because my content was unoriginal and low effort. Which is true.
Replies: >>106134713 >>106134752
Anonymous
8/4/2025, 8:49:47 AM No.106134705
>>106134684
>10k subs
>videos are all less than 2 minutes
They aren't making anything because the videos aren't monetized.
Replies: >>106136146
Anonymous
8/4/2025, 8:51:18 AM No.106134713
>>106134703
Did you atleast manage to get any money within those 3 days?
Replies: >>106134741
Anonymous
8/4/2025, 8:56:34 AM No.106134741
>>106134713
Yeah like 200 bucks.
Anonymous
8/4/2025, 8:59:27 AM No.106134752
>>106134703
>unoriginal and low effort.
it's higher effort than reaction videos but whatever.
Replies: >>106134758
Anonymous
8/4/2025, 9:00:53 AM No.106134758
>>106134752
desu, I spent like all day making them because I was using Stable diffusion 2 at the time. It was by no means an easy process. It took like two days per video of non stop genning.
Anonymous
8/4/2025, 9:04:55 AM No.106134775
Hear me out. Batch size of 1 when training details.
Thoughts?
Replies: >>106134813 >>106134852 >>106134878 >>106134885
Anonymous
8/4/2025, 9:05:42 AM No.106134780
Wan22WVI2V_KJ_RAW__00148_thumb.jpg
Wan22WVI2V_KJ_RAW__00148_thumb.jpg
md5: 0115422adc1d9bb16f69ae403a7fc1c6๐Ÿ”
>There are people on /sdg/ and /ldg/ that didn't go all in on nvidia stocks when SD1.5 kick off.

ngmi. Whoring out your digital waifu for coomer bucks is pathetic. Learn to make money with money. If you're spineless and has weak hands, the SP500 should protect against inflation at the minimum.
Replies: >>106134804 >>106134844 >>106135716 >>106135740 >>106135757
Anonymous
8/4/2025, 9:06:49 AM No.106134789
insufferable prick
Anonymous
8/4/2025, 9:07:57 AM No.106134804
AniStudio_InterOpTest-00736_thumb.jpg
AniStudio_InterOpTest-00736_thumb.jpg
md5: 6c6b9794e4eac10fa1c566e22d90aff5๐Ÿ”
>>106134780
rude
Anonymous
8/4/2025, 9:10:31 AM No.106134813
>>106134775
Does it matter?
Replies: >>106134820
Anonymous
8/4/2025, 9:11:23 AM No.106134820
>>106134813
I genuinely do not know. What does a large batch size look like to a model compared to a batch size of 1?
Replies: >>106134828 >>106134860 >>106134931
Anonymous
8/4/2025, 9:12:11 AM No.106134828
>>106134820
the same
Anonymous
8/4/2025, 9:15:17 AM No.106134844
>>106134780
I like how the AI can't decide whether to make an axe or sword and constantly switches back and forth between them.
Replies: >>106134848
Anonymous
8/4/2025, 9:16:08 AM No.106134848
>>106134844
I don't like that it's a repost and it's probably not the author
Anonymous
8/4/2025, 9:16:49 AM No.106134852
axe with sling
axe with sling
md5: 268aed4da7c2384059fcb379e224f31a๐Ÿ”
>>106134775

I do details training by separating items from the character and include it in the dataset. My logic is that if I can generate that particular trinket by itself in high res, it should improve details. And that does indeed work upon generation. Dataset is king after all. Optimal and efficient? Who knows. Prove me wrong.
Anonymous
8/4/2025, 9:18:34 AM No.106134860
>>106134820
I imagine it'd highly depend on the thing you're training. You won't ever get consistent results. I just stick to batch size 1.
Anonymous
8/4/2025, 9:21:09 AM No.106134878
>>106134775
I did a bunch of tests back on Flux and SDXL, and yes, batch 1 was overall best quality, both in details and overall capture of concept.

However, it's also much slower, just going from batch 1 to batch 2 is ~25-30% faster depending on resolution / hardware / model, I typically land at batch 4 which is ~40-45% faster for me.

Also you need to increase the LR from what is good on batch 1 when you go to higher batches, else the results will suffer.
Anonymous
8/4/2025, 9:21:45 AM No.106134880
What are your best videos you have seen so far /ldg/?
Replies: >>106134893
Anonymous
8/4/2025, 9:21:57 AM No.106134885
>>106134775
it won't really matter on most settings I've tried
Anonymous
8/4/2025, 9:22:52 AM No.106134893
>>106134880
a meme or porn. pretty much all this tech is good for. throwaway content
Replies: >>106134915 >>106136301
Anonymous
8/4/2025, 9:26:44 AM No.106134910
Fuck I hate comfy ui

What nodes do i use if I DON'T want to preprocess my depth/pose/canny control net? I already have my maps generated. in voldie's i could just choose preprocessor type none but that's not a thing
Replies: >>106134920 >>106134921
Anonymous
8/4/2025, 9:26:56 AM No.106134915
>>106134893
not much you can do with 5 seconds aside from memes/porn
Anonymous
8/4/2025, 9:27:44 AM No.106134920
>>106134910
bypass the node
noob
Replies: >>106135004
Anonymous
8/4/2025, 9:27:51 AM No.106134921
>>106134910
input image and load the preprocessed image retard
Replies: >>106135004 >>106135043
Anonymous
8/4/2025, 9:29:11 AM No.106134931
>>106134820
The difference is in how it updates its learning, with batch 1, the model is at its 'optimal state' in terms of the images it has learned, since it learns all images sequentually and thus the model learning can 'grok' the next image based upon all the other images it has learned.

When you go above batch 1, you are learning several images at the same time, independantly, so they get nothing from eachother in terms of learning, and the more images you train simultaneously (higher batch) the more this hampers learning quality.

The reason you want higher batches is for speed, not quality, I've seen some people argue that the gradients are more normalised when using higher batches which should help in learning, but every single test I've done and have seen others do, show that batch 1 gives the best quality. But again, unless you are traininig small amounts of images, it's too much performance to throw away by not using higher batches.
Replies: >>106134954
Anonymous
8/4/2025, 9:33:47 AM No.106134954
>>106134931
unless you're getting paid to make loras, there is no point in trying to aim for speed while sacrificing quality. personally if I got paid to make loras, I'd use runpod or something and just run dozens of training in parallel at batch 1.

SDXL in particular only takes 1-2 hours on a 3090 anyway. I doubt you need to make 20+ loras a day.
Replies: >>106135231
Anonymous
8/4/2025, 9:41:02 AM No.106134983
2025-08-04T04.26.34_1
2025-08-04T04.26.34_1
md5: fc81c06dd37baf5d955119d687fd41bb๐Ÿ”
Anonymous
8/4/2025, 9:43:40 AM No.106135001
231415214312
231415214312
md5: 688895f7f652a6a3cb675357e0062289๐Ÿ”
Replies: >>106135081
Anonymous
8/4/2025, 9:43:43 AM No.106135003
For some reason saving the output of the high noise model always produces garbage. Why? It looks fine in the sample preview. Has to be a bug or something
Anonymous
8/4/2025, 9:43:54 AM No.106135004
apply
apply
md5: 089df8fd13b67b5a665809ef045a1331๐Ÿ”
>>106134921
>>106134920
I mean what node do I replace the comfyui control net node with if I want to run it without a preprocessor?
Replies: >>106135019 >>106135043 >>106135046 >>106135114
Anonymous
8/4/2025, 9:47:26 AM No.106135019
>>106135004
i dont know what you're doing so I could be off but there's a custom node called ComfyUI-Advanced-ControlNet you can install which has more options
Anonymous
8/4/2025, 9:49:38 AM No.106135036
>>106134402
you did! congrats
Anonymous
8/4/2025, 9:50:47 AM No.106135043
>>106135004
>>106134921
Replies: >>106135122
Anonymous
8/4/2025, 9:51:15 AM No.106135046
>>106135004
The image input should be your preprocessed image bypass any processing on image going into it.
Replies: >>106135122
Anonymous
8/4/2025, 9:51:26 AM No.106135047
>>106134221
nice
Anonymous
8/4/2025, 10:00:53 AM No.106135081
>>106135001
She's crying because of her malformed twin behind her.
Replies: >>106135142
Anonymous
8/4/2025, 10:06:19 AM No.106135114
00
00
md5: d9e0367c07ac4b20671b2658f15164bb๐Ÿ”
>>106135004
again, bypass the node. I setup a bool so all i have to do is click a button to disable it once my image has been processed.
Anonymous
8/4/2025, 10:08:03 AM No.106135122
>>106135046
>>106135043
I figured it out. The node is not preprocessing the images and it's fine to put depth maps in directly.

1. I copied a flux workflow that was putting normal images directly into that node
2. I assumed the node was doing preprocessing somewhere because why would you do that if it wasn't
3. I tested anyway putting in a depth map that usually gets good results in SDXL
4. The good depth map got shit results and assumed it was preprocessing the depth badly

turns out I had a bad controlnet model and the workflow I followed was also using it wrong. I replaced it with the flux union one and now i'm getting good results. Sorry for the trouble.
Replies: >>106135389
Anonymous
8/4/2025, 10:10:00 AM No.106135142
>>106135081
heh. you know a character lora is bad when clones start popping up. OVERFITTED
Replies: >>106135197
Anonymous
8/4/2025, 10:21:02 AM No.106135197
>>106135142
I wish I still had the video of the highly overfitted bog LoRA where it was deepfried and everyone looked like a bog.
Anonymous
8/4/2025, 10:27:02 AM No.106135231
>>106134954
Sure, if the extra time doesn't bother you, why not go for the best quality
Anonymous
8/4/2025, 10:36:16 AM No.106135272
1753231752766398
1753231752766398
md5: e41546fd7dbe17aaba99f22064c2a2c2๐Ÿ”
>>106133860
Replies: >>106135435
Anonymous
8/4/2025, 10:39:04 AM No.106135287
>>106134182
the hate is from 3060 vramlets who cant run it, as usual
Replies: >>106135291
Anonymous
8/4/2025, 10:40:05 AM No.106135291
>>106135287
(fast)
Anonymous
8/4/2025, 10:54:26 AM No.106135389
>>106135122
> falsely blaming ComfyUI again, ep.1333
Anonymous
8/4/2025, 11:02:54 AM No.106135435
83142225007-samantha-white-mug
83142225007-samantha-white-mug
md5: e8790a6856fac5ed4eab3dfdb47d6c37๐Ÿ”
>>106135272
https://eu.news-press.com/story/news/crime/2025/04/17/lee-county-woman-gets-prison-for-having-sex-with-household-pets/83137545007/
Replies: >>106136912
Anonymous
8/4/2025, 11:19:23 AM No.106135509
>>106134395
Same for my 3060, uninstalled pytorch+cuda128 and installed pytorch+cuda126,I tested multiple gens and there's no noticeable improvement. Might be already fixed? Or anon was just trolling?
Replies: >>106135648 >>106136671
Anonymous
8/4/2025, 11:24:57 AM No.106135534
sometimes im wondering if my prompt isn't good enough or if I'm getting trolled by bad seed rolls
Replies: >>106135610
Anonymous
8/4/2025, 11:26:42 AM No.106135547
comfy should be dragged out on the street and shot
Anonymous
8/4/2025, 11:32:56 AM No.106135591
>>106134119
Turns out if I use the low noise model solo in kijai workflow it does this shit. I prompted for character knocking on viewer's screen with "changing scene" in NAG and it still did it.
Anonymous
8/4/2025, 11:36:21 AM No.106135610
>>106135534
Cursed GPU I'm afraid.
Anonymous
8/4/2025, 11:44:20 AM No.106135648
>>106135509
The reddit thread about it dates from months ago, so my guess is whatever the issue, it's not there anymore.
Or maybe it's linked to driver version too?
https://www.reddit.com/r/LocalLLaMA/comments/1jlofc7/performance_regression_in_cuda_workloads_with/

You can try:
Driver Version: 560.35.05
CUDA Version: 12.6
Replies: >>106136642
Anonymous
8/4/2025, 11:45:42 AM No.106135657
file
file
md5: b892c4becb5f7f1429fa3dcf4e889332๐Ÿ”
>106135547
Very organic.
Anonymous
8/4/2025, 11:56:16 AM No.106135716
>>106134780
i was 15 when sd1.5 kicked off
im only about to get a bank account
Replies: >>106135750
Anonymous
8/4/2025, 12:01:11 PM No.106135740
>>106134780
hard to invest when you have no money
Replies: >>106135750
Anonymous
8/4/2025, 12:03:43 PM No.106135750
>>106135716
Fuck off zoomer

>>106135740
Fuck off poor fag
Replies: >>106135759 >>106135879
Anonymous
8/4/2025, 12:04:23 PM No.106135757
>>106134780
>didn't go all in on nvidia stocks when SD1.5 kick off
I COULD BE A MILLIONNAIRE
well, it is what it is
Anonymous
8/4/2025, 12:04:59 PM No.106135759
file
file
md5: f27eb239211f15b6847b65516c5bf323๐Ÿ”
kek
>>106135750
fuck off r*dditor
Replies: >>106135939 >>106135947
Anonymous
8/4/2025, 12:09:29 PM No.106135775
So are nvidia chips like salvaged from a giant alien wreck in in Taiwan or something? Why can't any other company even come close to their product? And don't bullshit me with lies like AMD being held back by software alone. I know it's shit hardware too.
Replies: >>106135801 >>106135857 >>106135878 >>106135879 >>106135886
Anonymous
8/4/2025, 12:11:20 PM No.106135788
>>106134311
It's really doesn't look any worse than animation out of that era either. The originals could be pretty janky
Anonymous
8/4/2025, 12:13:17 PM No.106135799
>>106134311
Kinda agree
Anonymous
8/4/2025, 12:13:23 PM No.106135801
>>106135775
Because it will take years and a lot of money to catch them. Even mainland chinese tried and failed.
Anonymous
8/4/2025, 12:21:19 PM No.106135857
>>106135775
Mainly because of CUDA, and they're also in the bleeding edge performance wise.
Anonymous
8/4/2025, 12:25:04 PM No.106135878
>>106135775
Nvidia's hardware is better optimized for AI since they've been aboard on it for ~a decade, but the amount AMD lags behind is also a large part not having the same software optimizations.
Anonymous
8/4/2025, 12:25:27 PM No.106135879
file
file
md5: fdfa8d645e7317c0b06474d762cb90f9๐Ÿ”
>>106135775
1. CUDA
2. While AMD focused on gaming GPUs and CPUs, Nvidia went full all in on AI infrastructure.
3. First-Mover Advantage in AIโ€™s gold rush.
4. Nvidia bet on AI acceleration before it was cool (see: 2012 AlexNet on GPUs).
5. TSMCโ€™s 4nm/5nm nodes are bottlenecked, and Nvidia booked capacity years ahead. AMD has to fight for scraps (MI300X is TSMC 5nm/6nm), while Nvidiaโ€™s H100s are printing money.
6. Smaller players (Cerebras, Graphcore) canโ€™t scale due to costs.
7. AMD is juggling CPUs (Ryzen/EPYC), GPUs (Radeon), and now FPGAs (Xilinx). Nvidiaโ€™s entire existence is โ€œaccelerated computing.โ€ Focus matters.
8. That said, AMDโ€™s MI300X is competitive in raw specsโ€”but without CUDA, itโ€™s stuck selling to hyperscalers (Microsoft, Meta) who can afford to port code. For everyone else? CUDA or die.
lets say you're a company trying to catch up on ai, you want the best of the best and dont want to be taking risks, especially when it comes to software. writing software by yourself also costs money, remember wages are like 100K$/year/person in land of the free
>>106135750
nice bait but ill take it, are you jealous that i got into the ai field so early and all im doing in my free time is gooning? cope more wagie, while you're grinding your ai skills to catch up i've been casually consooming all ai models since 2022 and the only thing i've been using them for is gooning
you will never be celibate again, while i'm keeping my virginity for a custom made open source robot, you're simping for coworkers or getting divorce raped
Replies: >>106136504
Anonymous
8/4/2025, 12:26:14 PM No.106135886
>>106135775
Unsurprisingly shit software that issues twice the number of instructions is slower
Anonymous
8/4/2025, 12:35:08 PM No.106135939
>>106135759
DUN DUUUN
Anonymous
8/4/2025, 12:37:05 PM No.106135947
>>106135759
cabbage ass
Anonymous
8/4/2025, 12:53:16 PM No.106136030
can I add my dataset to chroma training?
Replies: >>106136036 >>106136055
Anonymous
8/4/2025, 12:53:53 PM No.106136036
>>106136030
???
Replies: >>106136203
Anonymous
8/4/2025, 12:56:33 PM No.106136053
1567195715489
1567195715489
md5: 3bfa741332031a2fa84a9cd25a41171c๐Ÿ”
Can I train a wan lora that is just T2I or does it need to be video dataset?
Replies: >>106136066 >>106136068
Anonymous
8/4/2025, 12:56:47 PM No.106136055
>>106136030
Yes, if got at least 7 votes of the Chroma Council.
Anonymous
8/4/2025, 12:57:52 PM No.106136066
>>106136053
You can train wan t2v and i2v with pictures too.
Anonymous
8/4/2025, 12:57:58 PM No.106136068
>>106136053
Nope. You don't even strictly need video to train T2V. It will hurt the motion though.
Anonymous
8/4/2025, 1:07:13 PM No.106136122
ComfyUI_temp_eodyh_00013_
ComfyUI_temp_eodyh_00013_
md5: c227e203e349136c64b27c80ae62336e๐Ÿ”
What VLMs can take fetish content? Alternatively, how do I automate batch tagging of images for wan? I have SDXL datasets that are tagged with just booru tags and I need to convert it for WAN use.
Anonymous
8/4/2025, 1:07:26 PM No.106136123
file
file
md5: 705ab8cc6020cc92d7b2869b150c666c๐Ÿ”
Anonymous
8/4/2025, 1:08:42 PM No.106136129
flux-t5-adapter-2
flux-t5-adapter-2
md5: 4fcdc98d05be2caf001ff5800f0bf181๐Ÿ”
>T5-XXL vs T5-small with adapter
Better, the frog has the sign now
This run uses 11x the prompts and more layers, still saves ~8GB compared to T5-XXL
Replies: >>106136142
Anonymous
8/4/2025, 1:10:41 PM No.106136142
>>106136129
what do you mean better?
what are you even talking about?
Replies: >>106136160
Anonymous
8/4/2025, 1:11:06 PM No.106136146
>>106134705
>10k subs
Irrelevant I have 2k subs and I am monetized.
>2 minutes
Shorter the content the less you make yes.
Anonymous
8/4/2025, 1:12:15 PM No.106136160
flux-t5-adapter
flux-t5-adapter
md5: f9ec2f9d67a4d7e7e431d8dbe94fb187๐Ÿ”
>>106136142
Compared to the last run
I'm training an adapter that turns T5-small embeds into T5-XXL embeds
Replies: >>106136206
Anonymous
8/4/2025, 1:19:16 PM No.106136203
>>106136036
!!!
Anonymous
8/4/2025, 1:20:26 PM No.106136206
file
file
md5: 61b28179f5f8a17d4d61dbf82ac728ca๐Ÿ”
>>106136160
thats interesting, so are you distilling T5-XXL into T5-Small? what rig are you doing it on? are you planning to open source it?
you should think of a license before you open source it (im not talking about cuck license im talking about a license that will prevent big tech from using your trained model)
https://opensource.google/documentation/reference/using/agpl-policy
>WARNING: Code licensed under the GNU Affero General Public License (AGPL) MUST NOT be used at Google.
Replies: >>106136275
Anonymous
8/4/2025, 1:32:13 PM No.106136273
ComfyUI_00018_
ComfyUI_00018_
md5: 15fd687a93d1e33d3bb7b7d57b3dfd74๐Ÿ”
Replies: >>106136281 >>106136486
Anonymous
8/4/2025, 1:32:48 PM No.106136275
flux-t5-adapter-3
flux-t5-adapter-3
md5: 6c1b3ba74b543d1c69ec0ed6c20f914c๐Ÿ”
>>106136206
I guess it could count as distilling.
The trained model is just a bunch of linear layers and activations, T5-small is 512 dim, T5-XXL is 4096, the first layer is 512->4096 and the rest are 4096->4096.
The dataset is embeds from T5-small and T5-XXL, training is T5-small embed -> adapter -> target is T5-XXL embed
Dataset is precomputed, saved to webdataset, with some custom tensor serialization with compression because T5-XXL embeds are huge
I was using A40 on runpod but it's too slow, now I'm using gpu_1x_gh200 on Lambda, that's ARM64 + H100, 64 vCPUs, 432 GiB RAM, 4 TiB SSD for only $1.49 / hr
If it ends up working good enough then yeah I'll release it
Replies: >>106136289
Anonymous
8/4/2025, 1:33:09 PM No.106136281
>>106136273
Nice cock
Anonymous
8/4/2025, 1:34:16 PM No.106136289
>>106136275
that's very kewl
Anonymous
8/4/2025, 1:36:21 PM No.106136301
>>106134893
That's 90% of what people go to the internet for, this technology is winning
Anonymous
8/4/2025, 1:37:06 PM No.106136307
Respect copyright. Remember to gen with models that are properly licensed.
Replies: >>106136338
Anonymous
8/4/2025, 1:41:58 PM No.106136338
>>106136307
In other words no model, not a single model has licensed the images or videos they train on
Anonymous
8/4/2025, 1:55:38 PM No.106136427
00090-3564197447
00090-3564197447
md5: f44c9873f6428ab8967efaa9de99f084๐Ÿ”
Anonymous
8/4/2025, 2:00:39 PM No.106136464
flux-t5-adapter-4
flux-t5-adapter-4
md5: b60f478eb3fd42ff8f296f8487501c8d๐Ÿ”
Replies: >>106136471
Anonymous
8/4/2025, 2:02:07 PM No.106136471
file
file
md5: 18ead6964b807cc794cff47674f9a36d๐Ÿ”
>>106136464
very nice, it got close with "word"
Replies: >>106136486 >>106136648
Anonymous
8/4/2025, 2:04:26 PM No.106136486
>>106136273
>>106136471
You two, get a hotel room.
Replies: >>106136555
Anonymous
8/4/2025, 2:07:26 PM No.106136504
>>106135879
incelibate*
Anonymous
8/4/2025, 2:12:10 PM No.106136545
big qwen t2i eventually https://github.com/huggingface/diffusers/pull/12055
this better not suck
Replies: >>106136555 >>106136601
Anonymous
8/4/2025, 2:13:27 PM No.106136555
file
file
md5: 9a75ac0d92fff5951f8ab845036ed04c๐Ÿ”
>>106136486
a diamond statue of a teen girl with small perky breasts and a tight pussy is bent over and getting fucked by a copper statue of a big veiny cock
>>106136545
it will suck because qwen LLMs are very cucked
Anonymous
8/4/2025, 2:19:07 PM No.106136601
>>106136545
What could it realistically deliver of wan?
Replies: >>106136705
Anonymous
8/4/2025, 2:21:36 PM No.106136621
wan 2.2 vace when?????
Anonymous
8/4/2025, 2:25:26 PM No.106136642
N-vidya
N-vidya
md5: feec0e5483606801f6c6357bd6d2a852๐Ÿ”
>>106135648
Can't seem to find that driver version equivalent for windows, maybe I can try with a windows driver near the same release date but I can't seem to track down when that specific linux driver released. I'm using drivers released back in April right now.
Replies: >>106136671
Anonymous
8/4/2025, 2:25:55 PM No.106136646
1533423826134
1533423826134
md5: 336aada94f2bc288c209b1a7bd6e0250๐Ÿ”
I'm trying to install diffusion pipe, but when I try to requirements.txt I'm missing some modules that those requirements need. Is there a way to chain it so it automatically pulls everything it needs? Using miniconda since the diff pipe git says to use that
Replies: >>106136689 >>106136718
Anonymous
8/4/2025, 2:26:02 PM No.106136648
>>106136471
Yeah it's getting close. Interesting that the frog seems less plastic too.
Anonymous
8/4/2025, 2:30:49 PM No.106136671
>>106135509
you need to install the old cuda 12.6 not just the pytorch packages...
check what version of cuda you have active by doing nvcc --version
>>106136642
driver isnt that big of a deal, cuda version is
Replies: >>106136708
Anonymous
8/4/2025, 2:33:23 PM No.106136689
1740726323967180
1740726323967180
md5: 6ebf0969af7796d709f0e2c4c0adf1a8๐Ÿ”
>>106136646
nvm, I didn't notice that deepspeed is fucked
Replies: >>106136731 >>106136857
Anonymous
8/4/2025, 2:34:38 PM No.106136704
>there are still people ITT that dont know about venvs
Replies: >>106136712 >>106136927
Anonymous
8/4/2025, 2:34:40 PM No.106136705
>>106136601
Hopefully not a complete disgrace to the qwen team. Otherwise, why bother releasing it.
Anonymous
8/4/2025, 2:35:00 PM No.106136708
N-vidya
N-vidya
md5: 40c854d2787ba40091191138c0ee35ce๐Ÿ”
>>106136671
Yeah I did both, I installed 12.6 and installed 126 pytorch packages.
Replies: >>106136719
Anonymous
8/4/2025, 2:35:47 PM No.106136712
>>106136704
What's venv?
Replies: >>106136872
Anonymous
8/4/2025, 2:36:56 PM No.106136718
>>106136646
List the missing modules and maybe we can help, like holy shit do you think people are psychic ?
Replies: >>106136731
Anonymous
8/4/2025, 2:37:01 PM No.106136719
>>106136708
interesting, what are you testing it with? i had a speedup on wan 2.2/2.1 kijai sageattention workflow
ill grab newest drivers and cuda 12.8 to test
Replies: >>106136824
Anonymous
8/4/2025, 2:38:24 PM No.106136731
>>106136718
>>106136689
It's the deepspeed, but it turns out it just refuses to work at all on windows. Gonna need a WSL
Replies: >>106136944
Anonymous
8/4/2025, 2:49:56 PM No.106136824
>>106136719
My WF is just lightx2v and sageattention, I went full retard and forgot to screenshot my 12.8 results but on 2nd sampler pass they were around 80-90 seconds. On 12.6 they were still in that range, same WF, same seed, same initial image.
Anonymous
8/4/2025, 2:52:16 PM No.106136840
flux-t5-adapter-5
flux-t5-adapter-5
md5: 20c093c3ea45f65a2e6740e3ec6cbb43๐Ÿ”
Will do more runs later
Anonymous
8/4/2025, 2:54:04 PM No.106136857
>>106136689
If you read the repo, you'd know it only works on linux of wsl2
Anonymous
8/4/2025, 2:55:35 PM No.106136872
>>106136712
Something you don't need to worry about. Only retards without jobs care about them. Just pip install everything to your home environment.
Replies: >>106136956
Anonymous
8/4/2025, 3:00:25 PM No.106136912
1730894775469365
1730894775469365
md5: 10d77ad279562a7dce898591f1769794๐Ÿ”
>>106135435
Anonymous
8/4/2025, 3:03:08 PM No.106136927
>>106136704
you mean docker?
Anonymous
8/4/2025, 3:05:37 PM No.106136944
>>106136731
WSL2 to be exact
Anonymous
8/4/2025, 3:06:53 PM No.106136956
>>106136872
What's pip?
Replies: >>106136959 >>106137239
Anonymous
8/4/2025, 3:07:35 PM No.106136959
>>106136956
peepee in poopoo
Replies: >>106136990
Anonymous
8/4/2025, 3:10:21 PM No.106136974
any nsfw loras for krea? or what nsfw flux loras work with krea?
Anonymous
8/4/2025, 3:11:05 PM No.106136982
wan22_00001_thumb.jpg
wan22_00001_thumb.jpg
md5: e5dbd27bb9e0b63361f07d8e057dd055๐Ÿ”
Anonymous
8/4/2025, 3:12:57 PM No.106136990
>>106136959
I'm twitching rn...
Anonymous
8/4/2025, 3:19:51 PM No.106137041
wan22_00012_thumb.jpg
wan22_00012_thumb.jpg
md5: 1162350008050890c8f0f9d1401aaace๐Ÿ”
Anonymous
8/4/2025, 3:21:26 PM No.106137058
Does anyone have any tips for generating NSFW audio with mmaudio? Trying to get that glrrrrrrrk glrrrrrrk sound
Replies: >>106137080 >>106137240
Anonymous
8/4/2025, 3:23:44 PM No.106137080
>>106137058
you mean thinksound?
Replies: >>106137087 >>106137240
Anonymous
8/4/2025, 3:24:42 PM No.106137087
>>106137080

Is that what everyone is using now for audio?
Anonymous
8/4/2025, 3:27:37 PM No.106137111
BDSM Emma Watson
BDSM Emma Watson
md5: 1a631112499878cda5aa19bfb8458d7c๐Ÿ”
Can I make a request for one of you to animate this image?
Replies: >>106137118 >>106137131 >>106137160
Anonymous
8/4/2025, 3:28:29 PM No.106137118
>>106137111
no, demand it instead
Anonymous
8/4/2025, 3:30:03 PM No.106137131
>>106137111
No, beg for it instead
Anonymous
8/4/2025, 3:33:40 PM No.106137160
1484309379
1484309379
md5: 17483471d0e90f3f06e656d467736343๐Ÿ”
>>106137111
that's pretty ancient, surely someone can shop something better by now
Replies: >>106137210
Anonymous
8/4/2025, 3:37:48 PM No.106137191
VTFG9U46m5k
VTFG9U46m5k
md5: 9b66887bbbf7c154618b06a5d0c295bc๐Ÿ”
Wait, do I have to reinstall the diffusion pipe entirely within the WSL linux environment?
Replies: >>106137217 >>106137232
Anonymous
8/4/2025, 3:40:06 PM No.106137210
WAN22_00007_thumb.jpg
WAN22_00007_thumb.jpg
md5: 52d38d2308a2b972f825075733fca970๐Ÿ”
>>106137160
pretty sure new Emma manipulation tech is superior
Anonymous
8/4/2025, 3:40:48 PM No.106137217
>>106137191
you mean install it in WSL period? It does not work outside of linux or wsl
Anonymous
8/4/2025, 3:41:49 PM No.106137232
>>106137191
https://civitai.com/articles/12837/full-setup-guide-wan21-lora-training-on-wsl-with-diffusion-pipe
Anonymous
8/4/2025, 3:41:52 PM No.106137234
cuda comparison 1
cuda comparison 1
md5: f38671cc8628b69d63eb8c33d08d50a1๐Ÿ”
===UPLIFTING NEWS===
570.133.07 with cuda 12.8 is magically no longer fucked up, maybe old pytorch cu128 had fucked up kernels (source: cudadev)
in fact it's faster now!
Anonymous
8/4/2025, 3:42:26 PM No.106137239
>>106136956
Python environment package installer
Replies: >>106137265
Anonymous
8/4/2025, 3:42:33 PM No.106137240
>>106137058
mmaudio is kinda ok at it, hit and miss really

>>106137080
is there any gguf of it out there? i cant find any. the model is like 21gb kek, apparently here's the comfyui version https://github.com/Yuan-ManX/ComfyUI-ThinkSound
Anonymous
8/4/2025, 3:44:56 PM No.106137264
1645708170631
1645708170631
md5: 083a87d879320c5d46d6971e138564b6๐Ÿ”
>ctrl c and v doesn't work in linux
I already want to kill myself
Anonymous
8/4/2025, 3:44:58 PM No.106137265
>>106137239
What's Python?
Anonymous
8/4/2025, 3:46:34 PM No.106137285
soyblonde
soyblonde
md5: cbc8f7e588ccbba3b7a66539dfdcdb2f๐Ÿ”
ITS FUCKING HAPPENING