← Home ← Back to /g/

Thread 105857606

320 posts 166 images /g/
Anonymous No.105857606 [Report] >>105857618 >>105857622
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105852577

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows/home

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105857618 [Report]
>>105857606 (OP)
good mornin :3
Anonymous No.105857622 [Report] >>105857642 >>105857684 >>105858149
>>105857606 (OP)
Anonymous No.105857624 [Report]
ポストカード No.105857642 [Report] >>105857684
>>105857622
Anonymous No.105857684 [Report]
>>105857622
>>105857642
shoo shoo , tranny
go back to your containment board
Anonymous No.105857688 [Report]
Blessed thread of frenship
Anonymous No.105857707 [Report] >>105857724 >>105857727 >>105857740 >>105863838 >>105864006
What's the deal with a1111 ui? Why is it not recommended anymore?
Anonymous No.105857724 [Report]
>>105857707
don't think it's been updated in a long while me thinks
Anonymous No.105857727 [Report]
>>105857707
No longer updated. Check the dates here : https://github.com/AUTOMATIC1111/stable-diffusion-webui
Anonymous No.105857740 [Report]
>>105857707
This is a look-alike program : https://github.com/lllyasviel/stable-diffusion-webui-forge
Anonymous No.105857755 [Report] >>105857764 >>105857887
Any lightweight or new models on the horizon?
Anonymous No.105857764 [Report] >>105857782
>>105857755
no, but wanx 2.2 soon™®
Anonymous No.105857782 [Report] >>105857794
>>105857764
I weep
Anonymous No.105857794 [Report] >>105857827
>>105857782
it might get hunyuan'd by a better model out of nowhere, wouldn't that be something
Anonymous No.105857804 [Report]
WHy don't deepseek faggots get in SD?
Anonymous No.105857822 [Report] >>105857869 >>105857919 >>105860805
Anonymous No.105857827 [Report] >>105857898
>>105857794
Not impossible, but unlikely.

Wan was such a big improvement, BFL even cancelled their upcoming video model the moment Wan dropped.

I'm still kinda in shock that we have this quality on consumer hardware from an open video model.
Anonymous No.105857841 [Report]
Anonymous No.105857861 [Report]
>>105852349
> I have an A770 16GB
Do you have black image output for fp8?
Anonymous No.105857869 [Report] >>105857898 >>105857908
>>105857822
>>105853928
Animatediff can do this now?
Anonymous No.105857887 [Report] >>105858187
>>105857755
Cosmos predict2 2b maybe. Relatively new and relatively lightweight.
Anonymous No.105857898 [Report] >>105858027 >>105858058
>>105857869
no, thats just the default file name.

>>105857827
>BFL even cancelled their upcoming video model the moment Wan dropped
[citation needed]
don't get me wrong, it is 200% cancelled but i doubt wan was the leading cause.
honestly might have been a godsend, think of how restrictive it would have been.
Anonymous No.105857908 [Report] >>105858058
>>105857869
that's just the default filename prefix for the video combine node. it's wan interpolated with gimm-vfi.
Anonymous No.105857919 [Report] >>105857942
>>105857822
never kissed anyone or what?
what the fuck am i even watching here
Anonymous No.105857937 [Report] >>105857959 >>105857968
>I'm thinking of trying my hand at becoming a new schizo with a pregnancy fetish angle. Thoughts?
Anonymous No.105857942 [Report] >>105857958
>>105857919
acktsually I have never kissed anyone, if you want to know the truth about it bitch
Anonymous No.105857945 [Report] >>105857959 >>105857968
I'm thinking of trying my hand at becoming a new schizo.
with a pregnancy fetish angle. Thoughts?
Anonymous No.105857958 [Report]
>>105857942
well I hate to be the bearer of bad news but that is not what it looks like at all
you think they would train on movie scenes
Anonymous No.105857959 [Report] >>105859482
>>105857945
>>105857937
do it
Anonymous No.105857968 [Report] >>105859482
>>105857937
>>105857945
sample? this isn't something you want to half-ass
Anonymous No.105858019 [Report] >>105858083 >>105858084 >>105858345 >>105858790 >>105859956
You're not a schizo unless you larp as thread moderator and post 16h every single day
Anonymous No.105858027 [Report]
>>105857898
>but i doubt wan was the leading cause.
We'll have to disagree then, the timing is spot on, and obviously there would be almost zero interest in their video model which would be worse than Wan and also undoubtable insanely censored.
Anonymous No.105858058 [Report]
>>105857908 >>105857898
Oh ok. Wanx is less surprising.
Anonymous No.105858083 [Report]
>>105858019
basedo
Anonymous No.105858084 [Report]
>>105858019
based
Anonymous No.105858149 [Report] >>105858256 >>105858278
>>105857622
Why is so slowmo? You can do better.
Anonymous No.105858187 [Report]
>>105857887
didn't expect there to be such a great gap between the 6000 workstation cards.
seems interesting, thank you for the suggestion.
ポストカード No.105858256 [Report] >>105858278 >>105858293 >>105858439
>>105858149
you're right sir,
I'll strive to do far better from now on;
i am sorry to disappoint you
standby for a funny from
a few threads ago
(its worse now)
ポストカード No.105858278 [Report] >>105858312
>>105858256
>>105858149
>Boolean logic
>wan will: werk
>wan will: fuck up
>wan will: fuck up because of what you typed
>wan will: fuck up ignoring some of what you typed
>wan will: fuck up ignoring ALL of what you typed
Anonymous No.105858293 [Report] >>105858301 >>105858439
>>105858256
Rocket spammer... you are a disgrace.
Anonymous No.105858301 [Report] >>105858321 >>105858330
>>105858293
Are you some kind of stalker ?
Anonymous No.105858305 [Report]
Anonymous No.105858312 [Report] >>105858321
>>105858278
You know Salem is gonna tap that catunny
ポストカード No.105858321 [Report]
>>105858312
i just want to make cute things ;c
>>105858301
i forgot to turn my trip back on so i slipped through his filters on accident..,.
Anonymous No.105858330 [Report] >>105859004
>>105858301
I'm a victim of a spammer.
Anonymous No.105858342 [Report]
What node do i use if i want to use Kontext Image Edit with a gguf model?
ポストカード No.105858345 [Report]
>>105858019
kek
Anonymous No.105858346 [Report] >>105858422
>cries about other peoples posts so much the general gets report bombed and deleted
Anonymous No.105858422 [Report] >>105858428
>>105858346
Nothing has been deleted, schizo. Here's an example of a thread moderator larper.
Anonymous No.105858428 [Report] >>105858790
>>105858422
>larper
"good morning"
Anonymous No.105858429 [Report]
GOOD MORNIN!
Anonymous No.105858439 [Report]
>>105858293
>>105858256
i dont get it? its pretty
Anonymous No.105858470 [Report]
What's the pirate spammer's favorite programming language?
Anonymous No.105858530 [Report] >>105858561 >>105858646 >>105858653 >>105859495 >>105862181 >>105866090 >>105866108
I'm trying to train a Chroma LORA based on body types that i like. My problem is that when i use it, i always seem to get the same exact body, which will be a combination of all the different body types i trained on. What i would like is that i randomly picks one of them, or a random combination of body types, to get more varied results.

Is this a limitation of LORA training, or can this be fixed by proper prompting?
Anonymous No.105858561 [Report] >>105858578
>>105858530
How did you tag them?
Anonymous No.105858578 [Report] >>105858598
>>105858561
I only described the setting and that there was a woman in the image.
Anonymous No.105858598 [Report] >>105858698
>>105858578
I think that's your problem right there.
Anonymous No.105858646 [Report]
>>105858530
chroma is a failbake, seek sdxl
Anonymous No.105858653 [Report] >>105858698
>>105858530
The way ai training works is that it will look for patterns, with images it's the images coupled with the caption, once it finds a pattern it will start generalising that pattern, that is what you are observing here. It's not a lora training limitation, is true for all ai training.

To mitigitate this you need to caption the bodytypes differently, and prompt for them accordingly. Like athletic, broad shouldered, hourglass figure etc.
Anonymous No.105858698 [Report] >>105858803
>>105858653
>>105858598
>To mitigitate this you need to caption the bodytypes differently, and prompt for them accordingly. Like athletic, broad shouldered, hourglass figure etc.
I was trying to avoid having to prompt for them individually, and instead wanted the lora to learn what type of bodies i wanted, and create similar looking ones whenever it's loaded. Tagging their features, as you say, i imagined that i would have to prompt for them. And not tagging them, seems to have molded them all into the same body. But i'll try to retag them tonight to see the difference and maybe come up with an idea from there.
Anonymous No.105858790 [Report]
>>105858428
>>105858019
Anonymous No.105858796 [Report]
Neat.
Anonymous No.105858803 [Report]
>>105858698
Again, that's how ai training works. You train a bunch of images tagged as 'woman', the ai will try to generalize it into what it considers the 'essense' of all the things you call 'woman', the ai has no idea what 'woman' is, it just recognizes patterns.

The base model has been fed thousands upon thousands of images containing the caption 'tree', so it understands how to represent 'tree', some of the images captioned with 'tree' also have 'pine', so it understands how to represent pine tree, etc. It's all pattern recognition from top to bottom.
Anonymous No.105858822 [Report]
>mfw
Anonymous No.105858847 [Report]
How is comfyUI/SD with inference? How well does it compete with Sora?
Anonymous No.105858955 [Report]
>>105857217
nice
Anonymous No.105858978 [Report] >>105859090
>>105852973
>>105858771
Noice
Anonymous No.105859004 [Report] >>105859239
>>105858330
In all fairness these threads get remade every 4 hours anyway & I don’t see that many rockets these days? Maybe the crazy pills are working for me
Anonymous No.105859090 [Report] >>105859289
>>105858978
https://www.youtube.com/watch?v=Hc31HotThA0
totally missed how our sperg makes a snide comment about cascade getting stalled with "red-teaming" at around 12:30ish
Anonymous No.105859188 [Report]
BAKUSHIIIN!!!
Post moar cute equines
Anonymous No.105859239 [Report]
>>105859004
they where crying about being rangebanned a few days ago, so it makes sense they're trying to not completely shit the bed and burn through proxies.
Anonymous No.105859289 [Report] >>105859301
>>105859090
He's cute but obviously autistic as hell.
Anonymous No.105859301 [Report]
>>105859289
you have to be to be willing to work on something like comfy lmao
Anonymous No.105859465 [Report] >>105859474 >>105859510
I've messed around with hidream a bit today and man, does it suck ass. Poses and hands are alright but Chroma is so much better in every other regard, it's kinda silly.
Anonymous No.105859474 [Report] >>105859493 >>105863454
>>105859465
WAN T2I is better than chroma imo
for realism at least
Anonymous No.105859482 [Report]
>>105857968
>>105857959
not him, but now I want to gen some pregnancy. pic unrelated
Anonymous No.105859486 [Report] >>105859499
So I just installed comfy
Where do I find all the controlnets from?
I thought all will be part of it already
Anonymous No.105859491 [Report] >>105862747
Comfy needs to go to the gym, but for real.
3 times a week would form him up nicely and he has the dosh and job to go do it without it interfering with his coding
Anonymous No.105859493 [Report] >>105859497 >>105863556
>>105859474
wasn't there supposed to be a dedicated text to image model by wan?
Anonymous No.105859495 [Report]
>>105858530
>The woman has {large breasts and narrow hips|large breasts and wide hips|small breasts and wide hips|small breasts and narrow hips}
This will randomize the tokens
Anonymous No.105859497 [Report] >>105859550
>>105859493
dont believe so
their t2i model does it perfectly though. just gotta set output to 1 frame
Anonymous No.105859499 [Report] >>105859517
>>105859486
read nigga, READ!
Anonymous No.105859510 [Report]
>>105859465
Yeah, underwhelmed is how I would describe my experience with Hidream, which is extra bad considering how large and slow it is.
Anonymous No.105859517 [Report] >>105859741
>>105859499
yeah sorry
should have read the OP
Anonymous No.105859550 [Report] >>105859622 >>105863531
>>105859497
I think it forces too much cinematic style, could be user error tho
Anonymous No.105859615 [Report] >>105859639
Anonymous No.105859622 [Report] >>105863531
>>105859550
If I recall correctly, one of the core researchers of Wan made the helloworldXL finetune which is a more aesthetic/cinematic focused model.
Anonymous No.105859639 [Report]
>>105859615
Absolutely based.
Anonymous No.105859741 [Report] >>105859860
>>105859517
now feel good about yourself for solving your own problem
Anonymous No.105859860 [Report] >>105860012
>>105859741
>Winning gold made it worth losing the index finger
You must be willing to sacrifice something if you want to be the best
Anonymous No.105859861 [Report]
Anonymous No.105859956 [Report]
>>105858019
mikubros... not like this...
Anonymous No.105860012 [Report]
>>105859860
Isn't that a truth
Anonymous No.105860053 [Report]
Anonymous No.105860126 [Report]
cozy bread
Anonymous No.105860319 [Report] >>105860343 >>105865301
Anonymous No.105860343 [Report]
>>105860319
Cool
Anonymous No.105860353 [Report]
>tfw spent 4k on a rtx 5090prebuilt for i2v and txt2 goon genning
>(16:9, 540p) in 63.9secs
Anonymous No.105860383 [Report] >>105860407 >>105860454
Anonymous No.105860407 [Report]
>>105860383
that's wonderful
Anonymous No.105860409 [Report] >>105860465 >>105860539
I hadn't tested i2v
might actually be better than t2v
Anonymous No.105860454 [Report]
>>105860383
is this the new local diffusion thread
Anonymous No.105860459 [Report]
Anonymous No.105860465 [Report] >>105860472
>>105860409
what did you prompt to get this apparently I suck at prompting... and did you use any LORAs?
Anonymous No.105860472 [Report]
>>105860465
The womans blue dress and sandals morph into a white top and tight black leather pants and black heels.
and yeah
Anonymous No.105860539 [Report]
>>105860409
is it possible to generate shape shifting transformation like human to wolf like true blood series.
Anonymous No.105860711 [Report] >>105860805
Anonymous No.105860725 [Report] >>105860747
Anonymous No.105860747 [Report] >>105860781
>>105860725
Anonymous No.105860781 [Report]
>>105860747
>Release of the Epstein client list, 2126, colorized
Anonymous No.105860784 [Report] >>105860805
Anonymous No.105860805 [Report] >>105860930 >>105861965
>>105860711
>>105860784
>>105857822
love this
Anonymous No.105860930 [Report] >>105861192
>>105860805
i don't
Anonymous No.105861076 [Report]
Messing around with some "realistic" models and now I can't stop laughing
Anonymous No.105861192 [Report]
>>105860930
Skill issue
Anonymous No.105861225 [Report]
https://www.linkedin.com/pulse/recap-ai-driven-content-creation-from-assistance-rvsmf?utm_source=share&utm_medium=member_android&utm_campaign=share_via
hmmmm
Anonymous No.105861289 [Report] >>105861326 >>105861613
ComfiSlop status?
Anonymous No.105861326 [Report] >>105861471
>>105861289
he was pretty butthurt yesterday because he knows cumfartui is stagnant and people are getting bored and pissed at his behavior
Anonymous No.105861336 [Report] >>105864888
Anonymous No.105861471 [Report] >>105861514 >>105861945 >>105862845
>>105861326
Any unslopped UI alternatives?
I'm almost ready to download Comfy, but those nodes feel more like an obstacle than a solution. The dev needs to map the 3D nodes to make it visually simpler without all the wire clutter.
Anonymous No.105861514 [Report] >>105861535
>>105861471
Make your own inference script in Python. It is quite simple because you can use ChatGPT to pajeet-code it.
Anonymous No.105861535 [Report] >>105861855
>>105861514
I'm a coomer, not a programmer. What I have to tell to GPT?
Anonymous No.105861613 [Report] >>105861841 >>105861936
>>105861289
Stalled to death. Yesterday ComfiAnon had to flex his namefagging power in 3 threads straight after he announced he’s gonna make a special Lora format just for ComfyUI because “now he has the power to do it”.
Anonymous No.105861692 [Report]
dead general
Anonymous No.105861784 [Report]
Anonymous No.105861841 [Report] >>105863541 >>105863556
>>105861613
https://vocaroo.com/16paEU4IoKb8
wayne june test
Anonymous No.105861852 [Report] >>105862241
Anonymous No.105861855 [Report]
>>105861535
You are too stupid to even use AI then. Sad, many such cases.
Anonymous No.105861936 [Report] >>105861958
>>105861613
What a faggot. That's why I only use AniStudio.
Anonymous No.105861945 [Report] >>105861958
>>105861471
Use AniStudio, it's much better, and the creator is actually a good person who cares about the space.
Anonymous No.105861957 [Report]
>special Lora format just for ComfyUI
People keep saying this and it is inaccurate. It would be better to call it "original model checkpoint lora format". When a new model drops, comfy copy pastes the original code into ComfyUI, with minimal changes, to add the implementation. If I want to train the model, I copy paste the original code into diffusion-pipe, then when saving the lora I literally just add a "diffusion_model." prefix to the keys in the state_dict, and it just werks in ComfyUI.

It is Diffusers that changes everything when they integrate a model. They have their own naming conventions, so the names of the submodules in the model change, so the state_dict key names change completely. That's why models like Wan have their own Diffusers version. Usually they add conversion code in the direction of "original format" -> "Diffusers format", but not the other way around. So if you use Diffusers and its native saving utilities, the checkpoints and loras you produce are incompatible with everything else. It is a massive pain in the ass. They could have a policy where they would always adhere to the original model key naming conventions, but they don't. In fact it seems like they are actively going out of their way to change the names of everything. I understand why comfy wants to stick closely to the original model checkpoint naming conventions.
Anonymous No.105861958 [Report]
>>105861936
>>105861945
the day this will be unironically true would be fucking hilarious
Anonymous No.105861965 [Report] >>105862001 >>105862241
>>105860805
Thanks
Anonymous No.105861976 [Report] >>105861989 >>105862027
if comfy ui is any good why are his fennac girl images so shit
Anonymous No.105861989 [Report]
>>105861976
he doesn't have a creative bone in his body which is why the front end design is so shit
Anonymous No.105862001 [Report]
>>105861965
slop
Anonymous No.105862027 [Report] >>105862032
>>105861976
His fennec girl is a image generation from the SOTA model he says is the future! (behind a paid API node :3)
Anonymous No.105862032 [Report]
>>105862027
no wonder why the real owners want the fennec girl dead
Anonymous No.105862047 [Report] >>105862100 >>105862341
How much time until ComfiUI gets a Tensor Art node?
Anonymous No.105862100 [Report]
>>105862047
Tthe same amount that if you want to add more nodes, you have to pay a monthly fee.
Anonymous No.105862181 [Report]
>>105858530
Anonymous No.105862241 [Report]
>>105861852
>>105861965
Quality
Anonymous No.105862306 [Report] >>105862375 >>105862880 >>105862979
Anyone know how I can replicate these torpedo boobs?
The model randomly spit these out and never again.
Anonymous No.105862341 [Report]
>>105862047
The moment someone makes one, the real question is why would anyone want one ?
Anonymous No.105862346 [Report]
Anonymous No.105862375 [Report]
>>105862306
Unless the model has trained on 'torpedo tits' in the captions, you're out of luck. Perhaps the image captions has described them in some other way though.
Anonymous No.105862424 [Report] >>105862526 >>105862623 >>105862934 >>105863985
Trying neta lumina,one of those latest pth files they have on their huggingface, it seems they are training it , and i'm all for it
Anonymous No.105862526 [Report] >>105862713
>>105862424
Isn't their beta model more 'complete' than the alpha ones?
What's the treshold so far for styles and characters with neta? Like, how much pic on danbooru for it to get the style/character right, and what's the most recent big anime/gacha character it can output?
They should be done with the training by the end of this month or the next, but I don't know what to expect from it
Anonymous No.105862623 [Report] >>105864487
>>105862424
looks nice... I also want to see neta succeed, lumina's prompt understanding + noob level tag and style understanding would be amazing. hope they can get it to that level. and lumina is not bloated, unlike chroma.
Anonymous No.105862713 [Report] >>105862831 >>105862926 >>105862934
>>105862526
i do quite agree on that but i also feel that this latest trained alpha is quite usable,i'm trying their 'aes'thetic version
Anonymous No.105862715 [Report] >>105862779 >>105862797
any video tutorial for a brainlet like me on how to generate images locally?
no login, everything local and open source
Anonymous No.105862747 [Report] >>105862818
>>105859491
He didn't say he's gay.
Anonymous No.105862779 [Report] >>105862798
>>105862715
>video tutorial
Just read the rentrys my dude. Video tutorials are made by jeets and for jeets.
Anonymous No.105862797 [Report]
>>105862715
you don't need a video tutorial just follow the "WanX (video)" rentry Guide in OP. It does images and video. I joined the thread 2 days ago and can do everything I want using Wan now.
Anonymous No.105862798 [Report] >>105862807
>>105862779
>rentrys
what's that?
Anonymous No.105862807 [Report]
>>105862798
Text guides. Read OP.
Anonymous No.105862818 [Report]
>>105862747
But he would benefit from some weight lifting and cycling
Anonymous No.105862831 [Report]
>>105862713
The regular latest has more knowledge than the latest aesthetic version but it's still severely undertrained both on the artist knowledge and the anatomy. All good looking examples I saw or attempted to gen are like yours, fuzzy. Styles with precise sharp lines - can't do them yet. So we wait.
Anonymous No.105862845 [Report]
>>105861471
Nodes are good, much better than hundreds of tabs, menus, checkboxes, fields and bars in your face at least, don't be afraid.
Anonymous No.105862880 [Report]
>>105862306
Collect the random spits and train a lora.
Anonymous No.105862926 [Report] >>105863697
>>105862713
that's very aesthetically pleasing, catbox?
ポストカード No.105862934 [Report]
>>105862713
>>105862424
one of my favorite franchise ;c
that & last blade
Anonymous No.105862979 [Report]
>>105862306
Floating or protruding breasts
Sag/saggy/sagging in the negatives
Anonymous No.105863201 [Report]
Anonymous No.105863245 [Report] >>105863267 >>105863319 >>105863390
bros i have a 4080 super and wanted to gen videos. followed the rentry and am using wan q6

my genning gets stuck at this step with gpu/vram at 100%

what do?????????
Anonymous No.105863267 [Report] >>105863319 >>105863390
>>105863245
forgot to mention, i've followed this:
https://rentry.org/wan21kjguide#vram-requirements-and-model-size-aka-can-my-4gb-gpu-run-this
ポストカード No.105863319 [Report] >>105863349
>>105863267
>>105863245
stuck for how long?
my 30 series takes about 40 minutes to cook each video anon
if you have already rebooted a few times\rerolled
& feel like giving up
>https://github.com/deepbeepmeep
is what i ended up going with
Anonymous No.105863349 [Report] >>105863377
>>105863319
I've re-run it a couple times so it's been at 0% for 10-15 mins?
I mean it shows no progress at all, so it's genning then? I'm doing a test run anyways (I actually need this for work, need to animate the company's stupid avatar with idle anims and I thought to give AI a go).
I didn't reboot, so that could be it I guess? I'm also not on the latest nvidia drivers since they like to crash(thanks jensen)
Anonymous No.105863357 [Report] >>105863375 >>105864534
What are you guys using for lewd anime image to video? Wan?
ポストカード No.105863375 [Report] >>105863389 >>105863441
>>105863357
a specific wan nsfw finetune +loras
Anonymous No.105863377 [Report]
>>105863349
https://github.com/deepbeepmeep/Wan2GP
just use this. comfy's memory management is all fucked so save yourself hours of troubleshooting
Anonymous No.105863389 [Report] >>105863430 >>105863441
>>105863375
>wan nsfw finetune +loras
Any recommendations? Going to mess around with base Wan first but would love to try out some lewd ones.
Anonymous No.105863390 [Report] >>105865209
>>105863245
>>105863267
fuck it just MOVED
is this for fucking real? lol
Anonymous No.105863400 [Report]
noob is such kino
ポストカード No.105863430 [Report] >>105863441 >>105863473 >>105863474
>>105863389
for the lora just grab what you want them to do
its not "needed" but can help facilitate specific
actions within the videos themselves..

chest bouce (kek):
https://tensor.art/models/852362314505033772

wan 2.1 finetune:
https://tensor.art/models/864231482397327022
ポストカード No.105863441 [Report] >>105863474
>>105863389
>>105863430
this one here >>105863375
used : https://tensor.art/models/839853388687731926
Anonymous No.105863451 [Report] >>105863463 >>105863467
how long has civitai been down?
Anonymous No.105863454 [Report]
>>105859474
>WAN T2I is better than chroma imo
>for realism at least

I tried Wan t2i early days. It's not feasible as t2i model, images tend to come out too blurry since it was trained on moving images and wait time is too long. Also try to do anything dynamic (that's not 1girl standing there) you get body horror. Also lacks NSFW knowledge level of Chroma.
ポストカード No.105863463 [Report]
>>105863451
who cares lol
honeypot shithole goes down every few days
you should have been BACKING UP the BACKUPS OF THE BACKUPS
i say this every thread
DOWNLOAD ALL
NOW
Anonymous No.105863467 [Report] >>105863491
>>105863451
It works just fine
Anonymous No.105863473 [Report]
>>105863430
Rocket spammer has traumatized me. Is this even fair? No it isn't.
Anonymous No.105863474 [Report] >>105863505
>>105863430
>>105863441
Thanks friend!
Anonymous No.105863491 [Report] >>105863526
>>105863467
weird, it works fine in all my other browsers except chrome
Anonymous No.105863505 [Report]
>>105863474
>>105863497
another example
dont sweat it <3
Anonymous No.105863526 [Report] >>105863535
>>105863491
Huh, that is weird.
Anonymous No.105863531 [Report] >>105863624
>>105859550
>>105859622
You can prompt Chroma directly for this aesthetic btw, just prompt for "DSLR" and add "bokeh" to your prompt. It's not special.
Anonymous No.105863535 [Report]
>>105863526
Working now.
Anonymous No.105863541 [Report]
>>105861841
Darkest dungeon kino
P0STCARD No.105863556 [Report]
>>105861841
>>105859493
niice doggy, niiiice doggy ;c
Anonymous No.105863579 [Report] >>105864325
Anonymous No.105863624 [Report] >>105863797 >>105865269
>>105863531
Is 'aesthetic 11' just a meme or is it a caption Chroma actually recognizes ?
Anonymous No.105863697 [Report] >>105863773
>>105862926
I'm using a workflow found on a civitai image, with sageattention and torch that help improve speed,and an upscaler

https://files.catbox.moe/z7qb0k.png
Anonymous No.105863765 [Report]
steady as she desus
Anonymous No.105863773 [Report] >>105863815
>>105863697
thanks, but that is a very schizo workflow lmao
Anonymous No.105863797 [Report] >>105865269
>>105863624
I think it's a meme. Just adding the word "aesthetic" probably makes a stylistic difference in the gen, but I don't think it necessarily improves them as some believe.
Anonymous No.105863812 [Report] >>105863935 >>105866976
dear baker,
please come to the castle
i have baked a moot for u
yours truly,
princess
poastbard
Anonymous No.105863815 [Report]
>>105863773
well yes. But aren't we all in a good or bad way
Anonymous No.105863838 [Report] >>105864072
>>105857707
A1111 and forge etc.. have been dead for a while. Unfortunately instead of just writing a more efficient UI auto has quit and disappeared. The forks are also no longer maintained.
Anonymous No.105863935 [Report]
>>105863812
kekked
Anonymous No.105863985 [Report] >>105863990
>>105862424
Lumina is too big a model for my shitbox :d
ポストカード No.105863990 [Report]
>>105863985
>out of storage
same sis, 3TB & climbing
Anonymous No.105864006 [Report]
>>105857707
Not only are they not updated and can't do video, but most developers also no longer develop extensions for it nor maintain existing ones. The only reason to use it is if you're allergic to Comfy at this point.
Anonymous No.105864010 [Report] >>105864097
What's the point of checkpoints built on noob (beyond super specific stuff like figure style etc)? Are they actually any better? Like I saw people saying 291h was good or something but what do those even do better?

Also what's the deal with stabilizer and detail loras and similar? Do they actually help?
Anonymous No.105864051 [Report] >>105864078
Does anyone here use subgraphs for Comfy? How is it
Anonymous No.105864072 [Report]
>>105863838
Forge still recieves updates (recently got Chroma support), but from other maintainers since Illya seems totally occupied with Framepack.

A1111 is dead though.
Anonymous No.105864078 [Report] >>105864104
>>105864051
bugged so I'm waiting for full release. not worth it beta testing if I'm not getting paid
Anonymous No.105864097 [Report]
>>105864010
They're easier to use but lose the ability to not look ai-generated
>stabilizer and detail loras
I've never needed them
Anonymous No.105864104 [Report] >>105864115 >>105864213
>>105864078
I see. What about this new Tangential Damping CFG node?
Anonymous No.105864115 [Report] >>105864137
>>105864104
why not check or ask the GitHub directly instead of using the jeet coded website?
Anonymous No.105864137 [Report] >>105864159 >>105864213
>>105864115
because there's 0 discussion about it on git + discord. a guy made a commit vaguely showing what it does and the request was approved. that's it.
Anonymous No.105864159 [Report] >>105864236
>>105864137
I normally don't trust cfg snake oils
Anonymous No.105864213 [Report]
>>105864104
>>105864137
by the looks of the descriptions they don't even know if it makes a meaningful difference.
Anonymous No.105864236 [Report] >>105864302 >>105864309
>>105864159
Me either, but if Comfy approved it, surely it must be legit? Feels like it came out of nowhere.

From what my retarded ass can discern, it seems to retain the prompt guidance of higher CFG values while keeping image quality. wish a proper example was provided to show exactly how it's supposed to be used
Anonymous No.105864302 [Report]
>>105864236
I'm sure sd3.0 is legit like he said as well
Anonymous No.105864309 [Report] >>105864404
>>105864236
>if Comfy approved it, surely it must be legit?
BWAHAHAHA
Anonymous No.105864325 [Report]
>>105863579
I love the texture you can get out of noob.
Anonymous No.105864404 [Report]
>>105864309
HOW DARE U
Anonymous No.105864487 [Report] >>105864527 >>105864719 >>105864746
>>105862623
Anonymous No.105864527 [Report]
>>105864487
>braps in the thread
why does this happen every time?
Anonymous No.105864534 [Report] >>105864553
>>105863357
read op
Anonymous No.105864553 [Report]
>>105864534
wrong
Anonymous No.105864586 [Report] >>105864613 >>105864791 >>105864838
Anonymous No.105864608 [Report] >>105864642 >>105864701
Tell me where I am fucking up, why doesn't this work? I am trying to load CLIP from outside the model.
Getting pitch black or black with random red shapes.
Anonymous No.105864613 [Report] >>105864749
>>105864586
quite stunning
catbox if you dont mind sharing please
Anonymous No.105864642 [Report] >>105864658
>>105864608
>I am trying to load CLIP from outside the model
you cant do that with sdxl?
Anonymous No.105864658 [Report] >>105864678
>>105864642
Is that so? Why? I swear I've read some stuff about it before. Hell, why is there sdxl option if that's the case?
Anonymous No.105864678 [Report]
>>105864658
I should rephrase, i dont think it works for most sdxl models. i tried in the past and never got it to work
Anonymous No.105864701 [Report] >>105864779
>>105864608
Pony trained clip or base clip won't work on an Illustrious model because Illustrious clip is trained.
Anonymous No.105864719 [Report] >>105864746
>>105864487
holy based. videochads I kneel
Anonymous No.105864746 [Report] >>105865041
>>105864719
>>105864487
guess you could say
she got the party sharted
Anonymous No.105864749 [Report] >>105864791 >>105864813 >>105864838 >>105865064
>>105864613
Thanks
https://files.catbox.moe/63hb1i.png
Anonymous No.105864779 [Report] >>105864901 >>105864933
>>105864701
Thanks, I see.
I checked text encoders here https://huggingface.co/OnomaAIResearch/Illustrious-xl-early-release-v0/tree/main and they indeed have different hashes compared to sdxl one. Does this mean I am stuck with them?
I was ultimately intending to do something like clip_l + t5xxl or clip_g + t5xxl and see if that improves output compared to clip_l + clip_g.
Anonymous No.105864784 [Report] >>105865684
ポストカード No.105864791 [Report] >>105865577
>>105864586
>>105864749
thnx<33
Anonymous No.105864798 [Report] >>105865072
Anonymous No.105864813 [Report] >>105865072 >>105865577
>>105864749
very based style
POSTCARD No.105864838 [Report] >>105865577
>>105864749
>>105864586
was >>105863583 you also?
all quite nice<3
Anonymous No.105864888 [Report]
>>105861336
naisu
Anonymous No.105864901 [Report] >>105864933
>>105864779
Can use CLIPSave node to export another Illustrious model's trained clip and use that but yes you are stuck using clips trained for illustrious. You cannot use t5xxl with a clip based model without an adaption layer or retraining the model. IDR if ostris got anywhere with this for SDXL after ELLA wasn't released for SDXL and only released for SD1.5.
Anonymous No.105864933 [Report]
>>105864779
So I downloaded the CLIPs from there. I managed to load them and get them running externally. The image they generated was coherent but not exactly the same as the current model's. I guess they got trained some more in the later versions.
Anyway I tried to combine both with t5xxl and got pitch black so I assume it just can't be run with sdxl?
>>105864901
Figured as much by now. Thanks anon.
Anonymous No.105865041 [Report]
>>105864746
Lmao
Anonymous No.105865064 [Report] >>105865577
>>105864749
this is beautiful
Anonymous No.105865067 [Report] >>105865100
Is there a wan i2v nsfw finetune or just the t2i one?
Anonymous No.105865072 [Report]
>>105864813
>>105864798
Lmao hahaha
Anonymous No.105865100 [Report] >>105865108 >>105865110
>>105865067
Its literally in this v thread cntrl+f
Anonymous No.105865108 [Report] >>105865636
>>105865100
i2v one is just wan i2v with a shitty lora merged in
Anonymous No.105865110 [Report]
>>105865100
Whoops I'm a retard thought that was t2v.
Anonymous No.105865121 [Report]
How is FramePack different from Wan?
Anonymous No.105865130 [Report]
Anonymous No.105865144 [Report] >>105865168 >>105865183
Wan doesn't know what a chupacabra is lmao
Anonymous No.105865168 [Report] >>105865219
>>105865144
How many videos do you think have a chupacabra?
Anonymous No.105865181 [Report] >>105865196
Anonymous No.105865183 [Report] >>105865207 >>105865208 >>105865219 >>105865263
>>105865144
fantasy creature with no footage wasn't trained into a video model? not that surprising.
Anonymous No.105865196 [Report]
>>105865181
Prompt?
Anonymous No.105865207 [Report]
>>105865183
>prompt for “goat sucker”
Hoo boy
Anonymous No.105865208 [Report] >>105865256
>>105865183
Worse, it's a fantasy creature with wildly different depictions
Anonymous No.105865209 [Report]
>>105863390
somewhat slow for a 480x832, but maybe you're still using your gpu too much and/or the nvidia driver's own very slow offloading to system ram has hit (probably turn it off in nvidia's control panel)
Anonymous No.105865219 [Report] >>105865256 >>105865265
>>105865168
there are plenty on youtube
>>105865183
>fantasy creature
ok retard
Anonymous No.105865239 [Report]
Anonymous No.105865256 [Report]
>>105865208
ouch. good luck with that.

>>105865219
>there are plenty on youtube
an educated guess how many have much/most or even all of youtube in their video model's training? no one.

if youtube is making it into the training data, it'll probably be a very filtered 0.0001% or so from the most popular footage if it's not too long.
Anonymous No.105865263 [Report] >>105865296 >>105865316
why in the living fuck do my gen times double if I don't clear the model and cache manually each time?

Why the hell does comfy go "prompt executed in 0.01 seconds" if I try to queue the same thing more than once without changing anything in the workflow, but if I change anything at all it will work fine?

why don't we have radial attention?

>>105865183
wow watch out here comes a fantasy creature
Anonymous No.105865265 [Report]
>>105865219
>there are plenty on youtube
Relative to what? You realize that in terms of ratios it's still lost in the noise right? You people are genuinely retarded.
Anonymous No.105865269 [Report] >>105865308
>>105863624
>>105863797
It IS something Chroma recognizes. aesthetic 0-10 are based on score by month of e621. aesthetic 11 is the trainer's own curated quality tag.
Anonymous No.105865296 [Report] >>105865332 >>105865332
>>105865263
the model stays cached but another copy of the model gets cached on top of that so you run out of VRAM and use RAM instead
Anonymous No.105865301 [Report]
>>105860319
catbox?
Anonymous No.105865308 [Report]
>>105865269
Thanks for the info anon!
Anonymous No.105865316 [Report] >>105865332
>>105865263
>why in the living fuck do my gen times double if I don't clear the model and cache manually each time?
maybe the nvidia driver or your OS swapping some stuff
Anonymous No.105865332 [Report] >>105865344 >>105865357
>>105865296
>>105865296
If that were the case I would be offloading to CPU after 1 gen. It also says it's clearing the cache literally 5 times before finishing the gen.

>>105865316
I don't see why, it's untouched for other uses at all times when it's busy
Anonymous No.105865344 [Report]
>>105865332
85% of my 32gb RAM is being used for some reason though
Anonymous No.105865357 [Report] >>105865383
>>105865332
The clearing cache is FILM VFI interpolation if you are using the wan rentry workflow and unrelated to holding models.
Anonymous No.105865383 [Report] >>105865420
>>105865357
There's an unload all models node I think I've seen before. Should I just add it to the end?

Also does anyone know why it won't queue the same thing over and over without skipping everything after the first one? I want to just press a button and come back in an hour. This happens regardless of workflow
Anonymous No.105865420 [Report] >>105865480
>>105865383
Couldn't hurt to try the LayerStyle node Purge VRAM V2. Also make sure you have CUDA - Sysmem Fallback Policy in Nvidia Control Panel set to Prefer No Sysmem Fallback so that if it does not offload to RAM properly through ComfyUI, it will just crash instead of doubling your generation time. Make sure your seed is changing between generations.
Anonymous No.105865480 [Report] >>105866110
>>105865420
It's set to "randomize" in the control after generate box.

I also just restarted my PC and still it's hitting 130s/it instead of 65 again without even getting a clean one first. I wonder why it's using my RAM. All I've done is change CFG lately
Anonymous No.105865567 [Report]
Anonymous No.105865577 [Report]
>>105864791
>>105864813
>>105865064
Thanks
>>105864838
Yup
Anonymous No.105865636 [Report] >>105865645
>>105865108
Lame. Can someone link the model that was being discussed a few threads back that was a proper finetune? I think it was on huggingface.
Anonymous No.105865645 [Report] >>105865685
>>105865636
that one is T2V only - it's by a group named NSFW-API
Anonymous No.105865684 [Report]
>>105864784
I don't trust that computer...
Anonymous No.105865685 [Report] >>105865700
>>105865645
Ok, hear me out, what if we extract an i2v lora from the difference of base wan i2v and t2v and apply it as a lora on top of it the nsfw t2v?
Anonymous No.105865694 [Report] >>105865974
Anonymous No.105865700 [Report]
>>105865685
already made and tested. anons reported that the final nsfw t2v model was too different from wan base, the lora extract produced extremely blurry results on base i2v
Anonymous No.105865881 [Report] >>105865895
Is there any lore in differences, strong and wakness between anime gen models? For example Noob vs Illustrious and pony? Slops in common and slops in difference etc?
For example my favourite or all round model for me its Nova Anime 8.0 and now 9.0
Anonymous No.105865895 [Report] >>105866160
>>105865881
3d general, go to pokemon
Anonymous No.105865944 [Report]
>thread up for 15 hours
yeah it's over
Anonymous No.105865969 [Report] >>105865975
No new model ever never
Anonymous No.105865974 [Report] >>105866093
>>105865694
cool styles anon. thanks for sharing.

I should just throw spandrel upscaling in this already since I can't find a decent c++ upscaling lib that just does it all.
Anonymous No.105865975 [Report] >>105866006 >>105866172
>>105865969
What exactly are you looking for in a new model?
Anonymous No.105866006 [Report] >>105866033
>>105865975
The hit of dopamine
Anonymous No.105866014 [Report]
Dead hobby
Anonymous No.105866033 [Report] >>105866046
>>105866006
Good news training Loras do that too, set up a dataset for a Lora concept, let it train for a while, dopamine when it works every time
Anonymous No.105866040 [Report]
https://github.com/bghira/SimpleTuner/pull/1548
>SimpleTuner Wan training was completely fucking broken this entire time because the latents weren't computed correctly
lol, lmao even
Anonymous No.105866046 [Report] >>105866057
>>105866033
Ia there any up to date guide on how to make Loras?
Anonymous No.105866057 [Report] >>105866061 >>105866072
>>105866046
Get images or video clips (img1.jpg, img2.jpg)
Write captions (img1.txt, img2.txt)
Use diffusion-pipe or ai-toolkit
They both have idiot proof examples and configs (if this filters you then you're a retard and really shouldn't post here any more, go to /b/ degen)
Anonymous No.105866058 [Report]
The Seeds of Forge to Comfi are still the same? Or i'm doing something wrong?
Anonymous No.105866061 [Report] >>105866073
>>105866057
Why do you asume that I'm a coomer?
Anonymous No.105866072 [Report] >>105866081
>>105866057
Can I caption using prose instead of tags slop for SDXL?
Anonymous No.105866073 [Report] >>105866106
>>105866061
Because that's the only thing retards use AI for because everything else is too abstract.
Anonymous No.105866081 [Report]
>>105866072
If you're training SDXL you need to match the caption style your model of choice uses.

Otherwise for Flux, Kontext, Wan, etc, it's prose or really anything text.
Anonymous No.105866090 [Report] >>105866180
>>105858530
The advice about not tagging the features you want the lora to learn is 100% bullshit. If you don't use tags, the lora can learn faster but will also overcook faster and won't generalize well. You just need to train longer if you use more tags.
Anonymous No.105866093 [Report] >>105866137
>>105865974
spandrel is sorta one of a kind in that regard tbqh
shame development slowed to a crawl though
Anonymous No.105866106 [Report] >>105866164
>>105866073
But i'm a programmer. And want to generate a C++ code Lora
Anonymous No.105866108 [Report]
>>105858530
Your caption should be something like:
[body type keyword] [Joycaption sentence description]
The model appreciates having a good caption and then it will spend the rest of the attention learning they keyword association.
Anonymous No.105866110 [Report] >>105866907
>>105865480
Did you change cfg from 1 to a higher value? Doing that doubles your gen time because it's generating two videos at the same time using negative and positive prompts
Anonymous No.105866111 [Report] >>105866118
Ok and minimum reuqeriments for Lora training? Can I for example go to work and let my pc turned on trainging the Lora? Or i need to skip a captcha or sonething else
Anonymous No.105866118 [Report] >>105866131
>>105866111
Don't even bother if you don't have at least 16 GB of VRAM or you'll just cope.
Anonymous No.105866131 [Report] >>105866149
>>105866118
AIIIEEE i have 12Vram, I don't mind waiting!
Anonymous No.105866137 [Report]
>>105866093
the only thing that sucks is two versions of opencv if I use pip. I'd have to manually bind the opencv I'm using c++ side. that's about it.
Anonymous No.105866149 [Report] >>105866228
>>105866131
You'll have to follow all the low VRAM methods available, see examples. You'll likely have to compromise training resolution and other factors. It's otherwise set it and forget, once it's running the training loop you can go to sleep or work or whatever.
Anonymous No.105866160 [Report]
>>105865895
wrong
Anonymous No.105866164 [Report]
>>105866106
You should head over to the lmg thread.
Anonymous No.105866172 [Report]
>>105865975
Scaling laws to be broken mainly
Anonymous No.105866180 [Report] >>105866279
>>105866090
I think you are misunderstanding what people mean with this, take a gun for example, the model knows what a gun is, but it typically has a bad understanding of what makes guns different from eachother.

So unstead of using 'gun' in your caption and tie it to the generalized idea the model has of 'gun' (which is a combination of all the images with the caption 'gun' it has trained on), you instead only caption the details of the gun, together with its 'trigger' identifier, like 'glock', thereby getting better results.
Anonymous No.105866228 [Report]
>>105866149
Ouch, i like the styles of Cloverworks, Ufotable and other sloppa anime studios. I wanted to make a lora based in their art styles. Its over?
Anonymous No.105866233 [Report] >>105866408 >>105867932
move
>>105866229
>>105866229
>>105866229
Anonymous No.105866279 [Report]
>>105866180
That makes sense when you put it that way. Using generic words like "gun" might be the reason why loras can get overcooked quickly.
Anonymous No.105866408 [Report]
>>105866233
Death thread
Stagnate hobby
Why did you make another geberak if the other its in page 4?
Anonymous No.105866907 [Report]
>>105866110
Thank you for saving my life
Anonymous No.105866976 [Report]
>>105863812
Newfags don't know this giga fag.
Anonymous No.105867932 [Report]
>>105866233
>it's still not archived