← Home ← Back to /g/

Thread 106217746

314 posts 210 images /g/
Anonymous No.106217746 [Report] >>106217761 >>106218167
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106213865

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Statler/Waldorf No.106217761 [Report] >>106217785
>>106217746 (OP)
b-b-blessed thread of.... BEAHAGAHAHAH
>try to post more than 40 images this time you fucking SHITTERS
Statler/Waldorf No.106217785 [Report]
>>106217761
>>106217102
>>106217237
>>106217259
>>106217322
resharing these since i know NO ONE would ever look at the previous thread, because its EVEN WORSE... BWAHAHAHA!!
Anonymous No.106217789 [Report] >>106217800
Statler/Waldorf No.106217800 [Report] >>106217900
>>106217789
>>106217514
>>106217474
>we have sloppa at HOME!
why even bother with local dalle?? fun?
not even trying to poke fun just curious
Anonymous No.106217805 [Report] >>106217829 >>106217900
these car interior gens are great

>>106217474
>>106217514
is this the dalle lora you just posted for chroma?
Anonymous No.106217815 [Report]
https://files.catbox.moe/54q7eg.png
Anonymous No.106217829 [Report] >>106217860 >>106217865
>>106217805
Anonymous No.106217842 [Report] >>106217854
it is so fucking obvious the schizo puts his own shitty gens in the collage everytime.
Anonymous No.106217843 [Report]
Anonymous No.106217849 [Report]
Qwen only kinda understands catgirls, but the refiner helps.
Anonymous No.106217854 [Report] >>106217881 >>106217893 >>106217908
>>106217842
waldorf cannot be in the collage it will encourage more of the behaviours
Anonymous No.106217857 [Report]
https://files.catbox.moe/i6eil4.png
Anonymous No.106217860 [Report] >>106217881
>>106217829
nice
Anonymous No.106217865 [Report]
>>106217829
Anonymous No.106217877 [Report]
https://files.catbox.moe/76strg.png
Anonymous No.106217881 [Report] >>106217893
>>106217854
>micromanaging other peoples "behavior" on 4chan
trying WAY too hard kid
>>106217860
shes got the goods but she sadly is sloppafied
Statler/Waldorf No.106217893 [Report]
>>106217854
>>106217881
i will return but sadly my GPU is done for the day its 100+ degrees here
Anonymous No.106217897 [Report] >>106218451
Why is /ldg/ so schizo at these hours? American hours?
Anonymous No.106217898 [Report]
https://files.catbox.moe/r7p9oz.png
Anonymous No.106217900 [Report]
>>106217800
It's capable of some truly great image compositions, even thought the image quality is not the best. I would love to retrain a lora using high quality dalle-like compositions but without what people regard as "slop". Maybe doing img2img on qwen-img with unslopped loras for it?

>>106217805
Yes
Anonymous No.106217903 [Report] >>106218015
do you need Patch Model Patcher Order for loras if you dont use torch compile/etc in wan2.2? my loras work fine without it
Statler/Waldorf No.106217908 [Report]
>>106217854
>british spelling
beagahaha
Anonymous No.106217916 [Report] >>106218036 >>106218059 >>106218580
Anyone remember Hunyuan video?
Anonymous No.106218002 [Report] >>106218044
Anonymous No.106218015 [Report]
>>106217903
No you don't need it
Anonymous No.106218036 [Report]
>>106217916
unironically, no. i got into gens right when wan released so i never used it.
Anonymous No.106218044 [Report]
>>106218002
not bad, now do waldorf\statler
Anonymous No.106218059 [Report] >>106218545 >>106218580
>>106217916
I remember their sota t2v model blew everyone away, then they announced their upcoming i2v model which caused much antipation, scheduled "soon".
December passed it never arrived but teased at january. January passed it never arrived, but teased at february. Wan dropped, the rest is history
Anonymous No.106218143 [Report] >>106218182
Anonymous No.106218161 [Report] >>106218176
Anonymous No.106218167 [Report]
>>106217746 (OP)

(?i)(chroma|\bv([1-9]|[1-4][0-9]|50)\b)

Remeber to filter Chromatards
Anonymous No.106218176 [Report]
>>106218161
>honey, it's time to stardust! :3
Anonymous No.106218177 [Report] >>106218181 >>106218195 >>106218222 >>106218275 >>106218630 >>106219824
qwen is good at photorealism... limited in style, but not slopped.
Anonymous No.106218181 [Report] >>106218287
>>106218177
Real life truly isn't fair..
Anonymous No.106218182 [Report] >>106218362
>>106218143
Miss landscape general...
Anonymous No.106218195 [Report]
>>106218177
Qwen realistic gens all look kind of airbrushed. Like they were filmed with a soap opera camera or something. A little fuzzy.
Anonymous No.106218198 [Report]
Which is more heavy-dense to txt2img?
WAN 2.2 or Qwen?
Anonymous No.106218213 [Report]
Anonymous No.106218217 [Report] >>106218226
Anonymous No.106218222 [Report] >>106218287
>>106218177
nice, crossed legs do a lot of heavy lifting in my appreciation
Anonymous No.106218226 [Report]
>>106218217
How does it just KNOW Asuka?
Anonymous No.106218235 [Report] >>106220219 >>106220410
>>106216911
>It seems like if you're using it just like Flux (i.e, not porn) then it has similar quality to Flux. But if you try to use it for something Chroma-specific (i.e, porn) then the quality sharply drops.

For what kinds of images? Compared to Flux, Chroma is better at everything in regards to realism, even with SFW prompts. You really notice this if you stress test a bunch of prompts, and compare output quality.

Take pic rel for instance. Not even me trying to gen porn. The prompt is

>Amateur photograph, a stunning Japanese female cosplaying as sailor, sitting in front of a restaurant at night, she is holding a drink with left hand, and food with right hand, she is candidly laughing

Flux, and for that matter, even Krea, can not do this, even if you ignore that the shot has to be candid. Because the models do have flexibility in what you can prompt for; they are opinionated.
Anonymous No.106218275 [Report] >>106218287
>>106218177
hooot. any stable qwen for vamlet? i have freezing slop
Anonymous No.106218287 [Report]
>>106218275
>>106218222
>>106218181

The fuck? Why are you all so intrigued by this basic bitch gen?
Anonymous No.106218299 [Report] >>106218489
idk why all these vid2vid experiments turn into 2.5d.
Anonymous No.106218340 [Report] >>106218360 >>106218568 >>106218683 >>106219696
I am training a Chroma lora on 2000s digital camera high resolution images. I will post the results when it's done. Fingers crossed it will:

1 - Improve fine details
2 - Make v50 unslopped again by default at 1024
Anonymous No.106218360 [Report] >>106218407
>>106218340
Oh you are doing that, that's great. How did you tag them?
Anonymous No.106218362 [Report]
>>106218182
remake it
Anonymous No.106218374 [Report] >>106218537
umt5_xxl_fp16 is the same quality as umt5_xxl_encoder_q8_0.gguf right?
fp16 is 10gb & q8 is 5gb.
Anonymous No.106218402 [Report]
It's funny to see the same posters seemingly every or every other thread bitch and moan. You'd think they'd get bored but... nope. Ldg must interest them greatly (despite what they portray).
Anonymous No.106218407 [Report]
>>106218360
I already had the dataset from long ago, I used gpt4o at the time, it's good enough.
Anonymous No.106218408 [Report]
I'm using the base template in the rentry, is there anyway I can load another reference image to use as a face?
Anonymous No.106218417 [Report]
What native resolutions does Chroma support? I want to train a lora, but I don't know if I should manually crop everything to 1024x1024 or just let bucketing handle the variable sizes.

for example, let's say I have an image that is 700x1600. Will bucketing resize/crop it to the nearest size chroma supports? For SDXL, that would be 640x1536

this is important
Anonymous No.106218423 [Report]
i miss R anon
Anonymous No.106218424 [Report]
Anonymous No.106218432 [Report]
>>106217257
This v46 result is one of the best result I got in Chroma for that prompt
https://desu-usergeneratedcontent.xyz/g/image/1753/67/1753676465813.png

Changing res changes output drastically so I can't really show you v49/v50 equivalent.
Though the Wan result looks a bit plastic and also the leg is duplicated.

>>106216877
Here's same seed, prompt was
>Amateur photograph of two beautiful Japanese idol girls, on the sidewalk at night in SHibuya. They are holding food and doing peace sign

neg
>3D, render, drawing
Anonymous No.106218442 [Report] >>106218481
>Dr. Doom laughs while shitposting on the internet
Anonymous No.106218451 [Report]
>>106217897
the baker is also a schizo, we're all schizo here
Anonymous No.106218481 [Report]
>>106218442
https://www.youtube.com/watch?v=syhOt6KS5X0
Anonymous No.106218489 [Report]
>>106218299
Nvm. It's the light LoRAs. They default to non-2d styles.
Anonymous No.106218505 [Report] >>106218752
Anonymous No.106218522 [Report]
Anonymous No.106218529 [Report] >>106220598
Anonymous No.106218537 [Report] >>106218547
>>106218374
Anon... no.
Anonymous No.106218545 [Report]
>>106218059
its coming out with pony 7

This hobby forces you to learn quickly to stop anticipating the numbered release of something, it'll likely be obsoleted by something you haven't heard of yet. We probably wont be using wan 3
Anonymous No.106218547 [Report] >>106218576 >>106218588
>>106218537
Anon..your comment..tells me..nothing..
Anonymous No.106218568 [Report]
>>106218340
I think it will be fine. Your VHS lora already has nice deslop effect.
Anonymous No.106218572 [Report] >>106218581 >>106218880
>randomly thought of using Q8 wan2.2 quants instead of Q5 on my 3060 shitbox
>it just works for some reason
>same speed as Q5
>better quality
>no OOM
why didn't i try this sooner
Anonymous No.106218576 [Report]
>>106218547
Sorry, I ment to say you can drop down to Q4 without almost zero quality loss. There's no reason to use anything larger.
Anonymous No.106218580 [Report]
>>106217916
>>106218059
I would love to see those mfs trying to outdo Wan, but if they do it will probably not be open as they (Tencent Hunyuan team) haven't been super keen on open-sourcing the good stuff lately.
Anonymous No.106218581 [Report]
>>106218572
Q8s are top dog mhm
Anonymous No.106218588 [Report]
>>106218547
It's not the same quality, fp16 is better, it's just that q8 is closer to fp16 than fp8.

>106218576
Not me.
Anonymous No.106218592 [Report] >>106220202
Anonymous No.106218596 [Report] >>106218603 >>106220447
Can I use the WAN guide on AMD?
Anonymous No.106218603 [Report]
>>106218596
yeah if you like bdsm
Anonymous No.106218630 [Report]
>>106218177
>not slopped
What? That looks like CGI
Anonymous No.106218632 [Report] >>106218654
I wonder if some modern artists purposefully draw "fucked up" anatomy (and then make it a part of their style) because they know it's a good way to disrupt training
Anonymous No.106218641 [Report]
Anonymous No.106218647 [Report]
Anonymous No.106218654 [Report]
>>106218632
no a lot of people are just shit at that, no need for intentional sabotage of their own art
Anonymous No.106218683 [Report] >>106218830
>>106218340
How do I into chroma lora training. I have datasets.
Anonymous No.106218686 [Report] >>106218719 >>106218891
>3090
>import example workflow from comfy for chroma
>launch
>OOM
I forgot the requirements for flux and chroma, do they really need more than 24GB at fp16 for a 1024x1024 image?
Anonymous No.106218695 [Report]
chroma flash gguf will save nintendo
Anonymous No.106218701 [Report] >>106218882
Anonymous No.106218719 [Report] >>106218987
>>106218686
try workflow from chroma repo
Anonymous No.106218727 [Report]
is it just my luck or is wan 2.2 good at animating 1024x1024?
Anonymous No.106218752 [Report]
>>106218505
Anonymous No.106218830 [Report] >>106218876 >>106219552
>>106218683
1 - read the rentry: https://rentry.org/mvu52t46
2 - install diffusion-pipe
3 - clone this repo: https://huggingface.co/lodestones/Chroma1-HD/tree/main , set it in the diffusers_path field
4 - set the path for Chroma-HD.safetensors path in transformer_path's field
5 - try training and meet a bunch of errors you will have to spend hours fixing with chatgpt or claude
Anonymous No.106218876 [Report]
>>106218830
It will probably complain about a bunch of missing folders, just download the missing folders and files from the flux schnell repo
Anonymous No.106218880 [Report]
>>106218572
Oh please share your workflow brother in goons&cooms
Anonymous No.106218882 [Report] >>106218907
>>106218701
how did you prompt for low poly graphics like that? or is that a LORA? this is the furthest I can get it to go in low poly
Anonymous No.106218891 [Report] >>106218987
>>106218686
https://files.catbox.moe/bigh6m.json
Anonymous No.106218907 [Report] >>106219232
>>106218882
>or is that a LORA?
I used video game screenshot lora 2 made by anon
Anonymous No.106218962 [Report] >>106218971
What's currently the best method for adding an object/person to an existing image, without using controlnets.(None exist for Chroma yet)
Anonymous No.106218971 [Report]
>>106218962
Flux Kontext
Anonymous No.106218987 [Report]
>>106218719
>>106218891
Thanks, I'll probably go with the gguf or maybe the fp8 e5m2 scaled if I can convert to it since only fp8 e4m3fn are available.
Anonymous No.106219017 [Report]
Anonymous No.106219149 [Report] >>106219159 >>106219185
Aside from retard girl poster. What regulars do you recognise at this point?

I know Japanese woman foot fetish chroma poster and /pol/ image poster.
Anonymous No.106219159 [Report]
>>106219149
Anonymous No.106219180 [Report] >>106219219 >>106219420
Jesus christ
Wan 2.2 is my dream
I can make whatever porn I want pretty much. I can't wait for a few months when this improved even further.
Probably gonna buy a second 5060Ti 16GB and SLI them.
Anonymous No.106219185 [Report] >>106219273 >>106219301
>>106219149
95% of the thread are regulars. I haven't seen a new anon in a long time
Anonymous No.106219219 [Report] >>106219318 >>106219335
>>106219180
stop being a short sighted gooner and just save up for a 5090 or wait for 6000 series in '26 or 27.
Anonymous No.106219232 [Report]
>>106218907
found it in the archive, awesome
Anonymous No.106219273 [Report] >>106219285 >>106219402 >>106220464
>>106219185
I agree, I havnt seen new gens in a while, I think its because newer image/video models have higher hardware requirements than before, that leaves out all the poorfags out, SDXL seems dated now :(
Anonymous No.106219285 [Report] >>106219378
>>106219273
i'm a long time lurker who had to sell the 4090 and become a nogen ;_;
Anonymous No.106219286 [Report] >>106219499
Anonymous No.106219301 [Report] >>106219408
>>106219185
I arrived here two months ago. It was a quiet place, but it quickly became schizo with multiple generals, shillers and UI wars,
Anonymous No.106219318 [Report]
>>106219219
But with that logic i will never goon
What if a car hits me?
Anonymous No.106219329 [Report]
There are new gens here everyday for those with eyes who can see
Anonymous No.106219335 [Report]
>>106219219
What's the point? By that time I may as well "Also save up" for the next coolest one coming out
Anonymous No.106219340 [Report]
Anonymous No.106219369 [Report] >>106219499
Anonymous No.106219378 [Report]
>>106219285
rip anon
Anonymous No.106219379 [Report] >>106219415 >>106219422 >>106219430 >>106219442 >>106219444 >>106219476 >>106219509 >>106220210
So this is the power of prompt adherence of qwen
Anonymous No.106219402 [Report] >>106219442
>>106219273
Is that still qwen?
Anonymous No.106219408 [Report]
>>106219301
It's been schizo a lot longer than that.
Anonymous No.106219411 [Report]
There is no spoon.
Anonymous No.106219415 [Report]
>>106219379
?
she's wearing a red sweater
Anonymous No.106219419 [Report]
Consumerist hobby, reminds me of gaming. Every year, new, heavier games dropped. Had to upgrade GPU, RAM, then my motherboard. Eventually, had to replace the whole computer...Depressing...History repeats.
Anonymous No.106219420 [Report] >>106219467
>>106219180
>5060Ti 16GB
You will be better off selling it and getting one 5090.

>SLI them
Not only this isn't possible because Nvidia got rid of that for non professional cards, but but you can't pool memory for image or video gen.
At best you can generate something low in vram with the 5060ti and something else in another session at the same time, but that needs double the memory.
Anonymous No.106219421 [Report] >>106219443 >>106219771 >>106219795 >>106219797
I made 271 dollars last month generating images on patreon. 166 dollars more than the month prior.
I'm pretty sure its going to keep going up and now I'm kinda scared. Why are people paying for this?
no I won't link it.
Anonymous No.106219422 [Report]
>>106219379
damn look at that goon meat in the top row
Anonymous No.106219430 [Report]
>>106219379
>She is wearing a buttchin.
Anonymous No.106219442 [Report]
>>106219402
chroma

>>106219379
the redditor who created that is obviously is a promplet, I would say that 80% of r/stablediffusion users are pajeets and vramlets, so don't take their posts too seriously
Anonymous No.106219443 [Report]
>>106219421
What kind of stuff do you make? At least post an example you can just generate now.
Most people gold rushing to that make literally pennies out of it.
Anonymous No.106219444 [Report]
>>106219379
Misleading image. Ignore it. You're mistaken. Show prompts first. If it states she is Japanese or Asian, Qwen is correct.
Anonymous No.106219467 [Report] >>106219536
>>106219420
Really? That sucks. I guess I will sell it then. Thanks.
Anonymous No.106219472 [Report]
so we're complaining about consistant characters now?
Anonymous No.106219476 [Report]
>>106219379
I get that 'experiment style'. Now you'll compare this model with yours that you've been pushing since Friday, right? KYS Iodestones.
Anonymous No.106219499 [Report] >>106219555 >>106219836
>>106219369
>>106219286
SDXL suffers from chronic same face. It's one thing I noticed when I tested bigASP
Anonymous No.106219506 [Report] >>106219680 >>106220211
Anonymous No.106219509 [Report]
>>106219379
should compare to chroma instead
Anonymous No.106219536 [Report] >>106219632
>>106219467
5090 is weird. 32GB is more than necessary to run high or best quality models but not quite enough to do anything else. the performance is great, though.
i'd either wait until the next 24GB refresh to decide or buy a 4090 24/48GB.
Anonymous No.106219552 [Report]
>>106218830
>3 - clone this repo: https://huggingface.co/lodestones/Chroma1-HD/tree/main , set it in the diffusers_path field
Can I like not use the models and encoders that I already have?
Anonymous No.106219555 [Report] >>106219674 >>106220210
>>106219499
Yeah I see the same face everywhere depending of the model, it actually drives me crazy but most people dont seem to notice at all, when a model gets popular and widespread and everyone starts using it I see it everywhere, In instagram there are 1000s of "AI influencers" that use flux and all have the same blonde generic 1girl buttchin face, In DeviantArt, I would say that 90% of realistic users are using that BigLust/Lusitfy model with that DMD lora that gives them the same face, now im seeing WAN videos with the lightx2v wan lora that also gives the same face, its annoying as fuck, distilled loras/models give you the same face
Anonymous No.106219565 [Report] >>106219576 >>106219579 >>106219668 >>106219692 >>106219848 >>106220016 >>106220049
>decide to try lightx2 again for wan2.2
>quality is absolute fucking shit
>motion is nuked to being almost non-existent
I just don't get the hype. The quality is so fucking bad compared to not using it. Am I doing something wrong? I'm using this with their native workflow.
https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-I2V-A14B-4steps-lora-rank64-Seko-V1
Anonymous No.106219567 [Report] >>106219842
so.. qwen + wan refiner is the meta for photoreal now?
Anonymous No.106219576 [Report] >>106219592
>>106219565
where can i find their recommended workflow? i want to try it
Anonymous No.106219579 [Report] >>106219592 >>106219692
>>106219565
This version of the lora is overbaked to super strength. You need to use the kijai lora
Anonymous No.106219582 [Report]
Anonymous No.106219592 [Report] >>106219658 >>106219692
>>106219576
https://huggingface.co/lightx2v/Wan2.2-Lightning/blob/main/Wan2.2-I2V-A14B-4steps-lora-rank64-Seko-V1/Wan2.2-I2V-A14B-4steps-lora-rank64-Seko-V1-NativeComfy.json

>>106219579
>You need to use the kijai lora
seriously? god damn i swear if this is bad im never touching lightx2 again
Anonymous No.106219614 [Report]
Delete it from OP
Anonymous No.106219632 [Report] >>106219920
>>106219536
>5090 is weird. 32GB is more than necessary to run high or best quality models but not quite enough to do anything else.
Doesn't really matter for inference as the system ram can compensate, it has enough compute to be fast.
I know, I have one, and did test exactly that.
For training though, sure, but most people play around generation, not training, even here.
Anonymous No.106219656 [Report] >>106219712
how do I do wan 2.2 t2i? isn't this a model for video gen? and what is low noise and high noise? it's all very confusing..
Anonymous No.106219658 [Report]
>>106219592
Also NAG with "slowmo, slowmotion" also helps a bit with WAN
Anonymous No.106219666 [Report]
Anonymous No.106219668 [Report]
>>106219565
it sucks anon, the only way to get a proper speed is to generate 121 frames and use 24 fps but the i2v model is fucked and cannot generate 121 frames with proper motion, I don't know if if its a comfy thing or whatever, you can generate 81 frames and it looks fine, you generate 121 and it looks jittery.
only t2v is decent for now :(
Anonymous No.106219674 [Report] >>106219789
>>106219555
tbdesu even if you had variations, most people just copy the "success recipe" of other and dream of getting it big that way
just look at anime stuff, there is almost every style and artist under the sun already baked in models or as lora, but what most people try to monetize are like 3-4 different styles, copied from each others
Anonymous No.106219680 [Report]
>>106219506
nice feet
Anonymous No.106219692 [Report] >>106219705 >>106220016
>>106219565
>>106219579
>>106219592
Even kijai's one isn't that good, I think it's just that it's worse than the 2.1 version.
I tried every variation, in the end, I just gen 15+15 normally. It takes me 20 minutes, but at least it looks fine.
Anonymous No.106219696 [Report]
>>106218340
I will restart the training with 1024x1024 cropped images, and I also recaptioned the data with shorter texts with no trigger words. I want it to absolutely fix the fine details and be unslopped out of the box regardless of "photo" prompt (make the model completely biased towards candid photos)
Anonymous No.106219699 [Report]
Anonymous No.106219705 [Report] >>106219719
>>106219692
to turn off the high speed lora do you just need to bypass it and increase the steps?
Anonymous No.106219712 [Report]
>>106219656
>how do I do wan 2.2 t2i
Like you would t2v, except you generate 1 frame only.

>isn't this a model for video gen
Videos are a succession of images.

>what is low noise and high noise
High Noise : first model, responsible for the overall motion and look.
Low Noise : second model, adding details and refining the scene.
Anonymous No.106219718 [Report] >>106219736
i know the supported resolutions are at a 16:9 ratio but what if I just wanna gen at a 1:1 aspect ratio? like, would there just be too much noise or would the quality be fucked up? like, 720x720 or something?
Anonymous No.106219719 [Report]
>>106219705
>bypass it and increase the steps?
Correct.
Anonymous No.106219733 [Report] >>106219758 >>106219775 >>106219873
can someone share a workflow to upscale with sdxl controlnet tile?
Anonymous No.106219736 [Report] >>106219879
>>106219718
it's usually fine
Anonymous No.106219758 [Report] >>106219873
>>106219733
nobody uses sdxl anymore
Anonymous No.106219771 [Report]
>>106219421
Damn, how niche is that niche?
Anonymous No.106219775 [Report]
>>106219733
Reforge UI or Forge. Don't waste time with Comfy if you are RAMlet
Anonymous No.106219780 [Report]
Anonymous No.106219789 [Report] >>106220694 >>106220712
>>106219674
yeah it all ends up in cannibalization, desu its really hard to make it too, IG is starting to ask for ID/face verification when you create a new account, soon all those ai influencers will dry out. Photorealistic ai content is really hard to monetize, now they are going for cartoon/2d stuff too, I have read tons and tons of artists getting their accounts banned by patreon for the most silly shit. In the west, you have to be really careful of what you post, if you offend someone and that person reports you, say goodbye your account, and if you're making it using AI you have to deal with AI haters that will mass report your account just out of spite, internet has truly become a toxic place
Anonymous No.106219795 [Report]
>>106219421
>Why are people paying for this?
>Patreon
I've no idea why anyone would pay for any kind of normie shit that would be allowed on Patreon either. There's alot of boomers/brain dead people to take advantage of.
Anonymous No.106219797 [Report]
>>106219421
based gl anon
Anonymous No.106219814 [Report] >>106219873
ddim + ddim_uniform seems to reduce noise somewhat for realistic v50 gens.
Anonymous No.106219824 [Report]
>>106218177
it's a fuckton worse than WAN T2I and Flux Krea in that regard by every conceivable metric lol
Anonymous No.106219836 [Report] >>106219901 >>106220613
>>106219499
BigASP is the least samefacey SDXL checkpoint that ever existed lol, even V2 had a dataset larger than that of Chroma (and V2.5 has one that's like, over twice the size of Chroma's)
Anonymous No.106219842 [Report]
>>106219567
maybe if you're retarded? Qwen has strengths but realism ain't one of them
Anonymous No.106219848 [Report]
>>106219565
i have been using heavy prompt weight for the high noise and then normal weight for low noise and its been turning out fine. the colours and details do get blown out a bit, but its not as bad as 2.1. I actually have not tried without high speed, will try it now
Anonymous No.106219873 [Report] >>106220009
>>106219733
this is what I've been using, probably could be improved upon
https://files.catbox.moe/i2btzt.png

>>106219758
it's still pretty good for fast upscales and some other niche uses.

>>106219814
sick gen, how?
Anonymous No.106219879 [Report] >>106219902
>>106219736
i forgot to mention for wan, specifically
Anonymous No.106219901 [Report]
>>106219836
i have not tried v2 but 2.5 is very clearly contaminated with a lot of ai in its training data
Anonymous No.106219902 [Report]
>>106219879
i figured. i usually just gen at whatever arbitrary resolution and aspect ratio, with i2v anyway, but anons will die on the hill of only using 832x480 or 1280x720
Anonymous No.106219920 [Report] >>106220709
>>106219632
You never know when you might need to train a niche lora, it's better to have more vram than not when you can.
Anonymous No.106219944 [Report] >>106219948 >>106219977 >>106219989 >>106220030
Where do I download celebrity loras? I remember 1 year ago that they were in civitai but now I can't find it anywhere on the internet
Anonymous No.106219948 [Report] >>106219989 >>106220035
>>106219944
they are illegal now anon, give it up
Anonymous No.106219977 [Report] >>106219989 >>106220035
>>106219944
let me help you out there-
>
shid :DDD
Anonymous No.106219989 [Report] >>106220035 >>106220045
>>106219944
>>106219948
>>106219977
guess it's about time we create our little secret lora club
Anonymous No.106219996 [Report]
>design workflow so that low noise uses the same loras as high noise but with option to adjust weight
>people are releasing loras for both high and low noise, breaking my design
ughhh
Anonymous No.106220009 [Report] >>106220063
>>106219873
>sick gen, how?
I just added anime + substituted her name in the prompt and inpainted over it twice.
Anonymous No.106220016 [Report] >>106220163 >>106220717
>>106219565
>>106219692
15+4 seems to work fine. The high noise step benefits from motion/negative prompt/teacache/slg and the low noise step benefits from increased speed.
Anonymous No.106220030 [Report]
>>106219944
don't tell him, hes bhira and will report any site that is hosting those loras
Anonymous No.106220035 [Report] >>106220061
>>106219989
>>106219977
>>106219948
so i cant make ai fakes of millie bobby brown anymore?
Anonymous No.106220045 [Report] >>106220070
>>106219989
loras are dumb. We should just make a model that can be instantly trained on a few images when needed. simple as
Anonymous No.106220049 [Report]
>>106219565

The civit cumskulls can't tell the difference but all of the loras kneecap 2.2. You might as well just use 1.1 at that point.
Anonymous No.106220061 [Report]
>>106220035
You have to train it yourself, or suck someone's cock to do it for you
Anonymous No.106220063 [Report]
>>106220009
oh inpainting, fair enough
Anonymous No.106220070 [Report] >>106220101
>>106220045

Isn't that pretty much Chroma? I can tell it to generate Minecraft Steve.
Anonymous No.106220077 [Report]
I hate chroma shills
Anonymous No.106220097 [Report] >>106220137
Enjoying using the scan tag in illustrious. Makes it looks like an uploaded magazine scan. You can even see the edge of the page
Anonymous No.106220101 [Report]
>>106220070
A jack of all trades, master of none
Also kontext and qwen image would fit that category too, yet they all still need loras in the end.
If anything, I am happy how fast we can train them nowadays.
Anonymous No.106220137 [Report]
>>106220097
scan, magazine scan, scan artifacts is kino
Anonymous No.106220140 [Report] >>106220177
Is it possible to inpaint video gens yet?
Anonymous No.106220163 [Report]
>>106220016
its still on slowmotion anon
Anonymous No.106220170 [Report] >>106220185
kek
Anonymous No.106220173 [Report]
Anonymous No.106220177 [Report]
>>106220140
https://huggingface.co/CCP6/FakeVace2.2/tree/main

YMMV, but it does "work."
Anonymous No.106220181 [Report]
Anonymous No.106220185 [Report] >>106220192 >>106220196
>>106220170
I bet that fucking retard browses this thread on a daily basis
Anonymous No.106220192 [Report] >>106220220
>>106220185
his first grift was pilfered from this thread fun fact
Anonymous No.106220196 [Report] >>106220220
>>106220185
I doubt it. He usually goes between posting on his personal subreddit, seething about Israel on twitter, stealth advertising and requesting features on github, posting about thing completely unrelated to the former on random subreddits and timing posts on r/stablediffusion to not break self promotion rules.

I'd wager he's largely ignorant to this place.
Anonymous No.106220202 [Report] >>106220613
>>106218592
catbox needed for my research :v
Anonymous No.106220204 [Report]
Anonymous No.106220207 [Report]
How many steps for the low noise model without any LoRAs do you think is really necessary? If you look at the high noise outputs by themselves, it's like 80% of the way there. I wonder how few steps you could get away with for a passable output.
Anonymous No.106220210 [Report] >>106220499
>>106219379
The meta for these new models has and always will be creative bankruptcy. From SDXL to Flux: Removed artist styles and names. Then from Flux to Qwen: As planned, remove all variety. And they have one thing in common, they are getting more and more slopped. Sure, the model may be getting stronger, but at what cost? The more slopped the dataset, the worse the results. But Plebbitors don't care

>>106219555
Yeah, it's part of the reason why Chroma is such a breath of fresh air. At least that has been the case in my experience. Plus with the prompting freedom it gives you, you can describe just about any type of unique woman and unique attributes.
Anonymous No.106220211 [Report]
>>106219506
workflow or it didn't happen D:
Anonymous No.106220219 [Report] >>106220224 >>106220519
>>106218235
wan2.2 version
Anonymous No.106220220 [Report] >>106220224
>>106220192
not surprised
>>106220196
>posting on his personal subreddit
>seething about Israel on twitter
>stealth advertising and requesting features on github
>posting about thing completely unrelated to the former on random subreddits
>timing posts on r/stablediffusion to not break self promotion rules.
I don't know, he sound like a total /g/ tranny, everything checks out.
Anonymous No.106220224 [Report] >>106220233 >>106220275 >>106220542
>>106220219
Wan 2.2 really does give the most plausibly realistic outputs. Especially a randomly selected frame in a video that has been upscaled and then reprocessed again. They're just so perfectly candid and different to the other outputs you'd see from an AI model.

>>106220220
He has distinctively bad English. I think he would have been seen by new. That and he has an aversion to NSFW images he often complains about.
Anonymous No.106220232 [Report]
Anonymous No.106220233 [Report]
>>106220224
>He has distinctively bad English.
>he has an aversion to NSFW images he often complains about.
Again, total /g/ tranny
Anonymous No.106220239 [Report] >>106220338 >>106220434
can anon compare chroma, qwen, and wan on photoreal gens?
I would like to see their limitations.
Anonymous No.106220275 [Report] >>106220300
>>106220224
>Wan 2.2 really does give the most plausibly realistic outputs. Especially a randomly selected frame in a video that has been upscaled and then reprocessed again.

you got an example?
Anonymous No.106220300 [Report] >>106220318 >>106220542
>>106220275
Not right now. Just my observations. A random frame from Wan usually has more realism and plausibility than any other model I've seen.

That doesn't make it the best at instruction following etc, but it simply makes the most sensical outputs.
Anonymous No.106220318 [Report] >>106220347
>>106220300
ahh I thought you were saying it using wan as a img2img model, I tried to do it, its good for inpainting and fixing hands, feet, whatever, but if you try to do img2img with the lightx2v lora all you get is a low quality image
Anonymous No.106220338 [Report]
>>106220239
yeah anon do this plz
Anonymous No.106220347 [Report] >>106220365
>>106220318
I wouldn't use the light LoRA with img2img. It's probably fast enough without it.
Anonymous No.106220365 [Report]
>>106220347
>It's probably fast enough without it.
I wouldn't call a 5x time reduction that way.
Anonymous No.106220410 [Report]
>>106218235
Can you post workflow/image on catbox? I couldn't get chroma to work or output anything remotely close to the quality of flux outputs
Anonymous No.106220434 [Report]
>>106220239
not doing the homework for you, retard.
qwen is the best. simple as
Anonymous No.106220441 [Report]
Anonymous No.106220447 [Report]
>>106218596
In OP? No.
Assuming Windows and depending on which card you have you could follow these instructions "New Install Method"
https://github.com/patientx/ComfyUI-Zluda/issues/188
From what I was reading (and managed to get working) you want a 7900XT or XTX as they're basically the only cards properly supported by HIP 6.2 AND have enough VRAM to handle WAN. Not to say you couldn't do it on their other cards, just more faffing around.
Anonymous No.106220464 [Report]
>>106219273
I don't post my gens because I'm embarrassed by them
Anonymous No.106220469 [Report] >>106220545
Anonymous No.106220475 [Report] >>106220496 >>106220500
>change workflow tab
>preview disappears

Why won't they fix this?
Anonymous No.106220496 [Report] >>106220506
>>106220475
Wait, that's a common issue? I am always messing around with the code and files so I thought I messed up something and never bother checking for fixes.
Anonymous No.106220498 [Report]
Anonymous No.106220499 [Report]
>>106220210
>Removed artist styles and names
They don't actually remove the artist styles, they train on a lot of them which you notice when you train a lora on an artist.

If they already trained on it, the learning is QUICK as opposed to an artist they never had in their dataset.

The problem is indeed that they don't use the artist name in the training captions, meaning there's no way of specifying an artist style in the base models.

Same goes for celebrities, they're all there, just seldom not by name. So there's no way to prompt for their specific characteristics on a name basis.
Anonymous No.106220500 [Report] >>106220527
>>106220475
>not using fixed seed and gen again
Anonymous No.106220503 [Report]
Anonymous No.106220506 [Report]
>>106220496
Very common issue. One that could probably be fixed by checking if the preview is showing after each step updated.
Anonymous No.106220519 [Report]
>>106220219
i cant decide if the film grain is really nice or really shit
Anonymous No.106220527 [Report] >>106220586 >>106220611 >>106221131
>>106220500
That's because the seed is changed after the gen is done, so the fixed one will be the new one after it was done genning, which I think is retarded and should be changed
Anonymous No.106220542 [Report] >>106220572
>>106220300
>>106220224

Well, Wan is a video model. It has a better understanding of the world than txt2img. But it's not necessarily knowledgeable with styles, hence it always gives that film grain look.
Anonymous No.106220545 [Report]
>>106220469
gib catbox ffs D":
Anonymous No.106220572 [Report] >>106220648
>>106220542
don't have any grain here if fastfilmgrain node is disabled ??
Anonymous No.106220574 [Report]
Anonymous No.106220585 [Report]
Anonymous No.106220586 [Report]
>>106220527
>not using fixed seed from the very start
Statler/waldorf No.106220598 [Report]
>>106218529
beahfahagahah
Anonymous No.106220611 [Report]
>>106220527
I've coped with the seed node from rgthree for at least three years
Anonymous No.106220613 [Report] >>106220732 >>106220820
>>106219836
Dunno, I think it was a bigASP based mix instead of just by itself
>>106220202
https://files.catbox.moe/ffq8z4.png
Anonymous No.106220620 [Report] >>106220631
statler/waldorf No.106220631 [Report]
>>106220620
$10 says this makes the collage
Anonymous No.106220637 [Report] >>106220654 >>106220667
>comfyanonymous.github.io
>docs.comfy.org
>www.runcomfy.com
difference?
Anonymous No.106220648 [Report]
>>106220572
No, but I mean it's naturally biased towards that look. This is the guy who trained it
https://civitai.com/user/LEOSAM
Anonymous No.106220654 [Report]
>>106220637
all the same
Anonymous No.106220662 [Report] >>106220678
>t2v-a14b
>ti2v-5b
which one should I use to gen video?
Anonymous No.106220667 [Report]
>>106220637
Except for
>www.runcomfy.com
you probably meant
>https://www.comfy.org/
Anonymous No.106220670 [Report] >>106220691 >>106220811 >>106221277
I am going to bed now, but I think I am happy with the 2000s camera lora results.

https://files.catbox.moe/hn8034.safetensors

Just prompt for photography and it should work.
As a flex, all of the images in picrel collage were made in Chroma Flash, not even the vanilla Chroma, yet it showcases the strong effect the lora has.

It's probably the best showcase of realism Chroma can offer so far, the downside is that it tends to make ugly bitches very often.
Anonymous No.106220678 [Report]
>>106220662
5b one is long forgotten.
Anonymous No.106220691 [Report]
>>106220670
this is cute, goodnight anon
Anonymous No.106220694 [Report]
>>106219789
this means only the most vanilla non confrontational ai art will remain monetized, and that's hard to monetize
kind of sad
Anonymous No.106220704 [Report]
why does wan 2.2 14b pair with wan 2.1 vae?
shouldn't it be wan 2.2 vae?
Anonymous No.106220709 [Report]
>>106219920
Sadly the only worth it for me is the 6000, which is like 9k€ where I like.
Anonymous No.106220712 [Report] >>106220738
>>106219789
Do you think the staff at IG will notice a pattern of all the "AI influencers" coming from one country when they see ID?
Anonymous No.106220717 [Report]
>>106220016
It fucks up the contrast for me.
Anonymous No.106220732 [Report] >>106220749 >>106221026
>>106220613
is it able to make manucured perfect looking feet?
Anonymous No.106220738 [Report]
>>106220712
what country? depending on what mental illness you have, it can probably be only 3
Anonymous No.106220741 [Report]
Anonymous No.106220749 [Report] >>106220763 >>106221026
>>106220732
It's made by chinks so obviously yes, they have a thing for feet somehow and I doubt they would miss the chance to include that.
Anonymous No.106220754 [Report]
Anonymous No.106220763 [Report]
>>106220749
good, our interests align in that fetish
I like that they obsess over cute and feminine feet, while western fetish is more about being dirty or smell stuff, not my thing
Anonymous No.106220803 [Report]
Anonymous No.106220811 [Report]
>>106220670
Nice, seems like a nice effect to test the OG myspace feel
Anonymous No.106220820 [Report]
>>106220613
you're da best :3
Anonymous No.106220836 [Report]
Anonymous No.106220841 [Report] >>106220860
Anonymous No.106220851 [Report]
Chroma is nice for realistic, HOWEVER, changing the prompt a bit to add nudity it looks like a whole other model... anyone else see this ?
Anonymous No.106220860 [Report] >>106220892
>>106220841
it's wan2.2 ? mind sharing a catbox
Anonymous No.106220892 [Report] >>106220909
>>106220860
https://files.catbox.moe/9j8glt.png

Image is a still from a generated video then latent upsaled/img2img. Same prompt at the video. Not quite what I wanted but loose shirts that show underboob is not Wans specialty. Don't mind the custom LoRAs, it's just a collection of random people
Anonymous No.106220896 [Report] >>106220902 >>106220905 >>106221039
>spent 5 hours hitting the generate button in comfyui before finally releasing the nut
i think i saw some article a while back that claimed staring at the latent noise while it generates over a long period of time has mental ramifications; nevermind edging, should I be worried?
Anonymous No.106220902 [Report]
>>106220896
>staring at the latent noise while it generates over a long period of time has mental ramifications

Sounds cursed. Let me know if you find it. I wanna read it.
Anonymous No.106220905 [Report]
>>106220896
no, nothing to ruin in our heads to begin with
Anonymous No.106220909 [Report]
The video in question
>>106220892
Anonymous No.106220914 [Report]
Anonymous No.106220917 [Report] >>106220934
>camera dolly in, gusty wind moves the grass and nature around, the car turn lights are blinking, the girl is standing enjoying the wind as it moves and rattles her clothes and hair
Anonymous No.106220934 [Report] >>106220971
>>106220917
Ghost car or driverless car?
Anonymous No.106220939 [Report] >>106220951
What're we trainin', fellas
Anonymous No.106220951 [Report]
>>106220939
Nothing right now. Did a qwen experiment and did a Wan experiment. Might look at doing another wan one later down the right but right now I don't have need of it.
Anonymous No.106220970 [Report] >>106220978
Once again asking where this is no send to vae button that cancels the generation at its current step and just sends it to the vae for decoding.
Anonymous No.106220971 [Report]
>>106220934
Probably a driverless car, it got scarred of the unnatural movements of the girl and nopped out of the place.
Anonymous No.106220978 [Report] >>106220987
>>106220970
I swear it did that at one point. I remember some half baked outputs.
Anonymous No.106220987 [Report]
>>106220978
Oh shit, now you're implanting those memories into my head. It did, didn't it or was that A111?
Anonymous No.106220993 [Report]
I've seen the lora manager thing. It looks nice, but it phones home and I don't like that.
Do we have anything else?
Anonymous No.106221020 [Report]
what malware have you tards installed?
Anonymous No.106221026 [Report] >>106221126
>>106220732
Chroma? But of course anon!
>>106220749
Not Chroma
Anonymous No.106221031 [Report]
Anonymous No.106221039 [Report] >>106221079 >>106221251
>>106220896
yeah you start doing inference in your head directly
Anonymous No.106221079 [Report]
>>106221039
>tfw you goon to too many denoising previews and accidentally biologically denoise the latent space demons into reality
Anonymous No.106221083 [Report] >>106221089 >>106221092 >>106221141
reminder qwen + wan is the photoreal meta
Anonymous No.106221089 [Report] >>106221141
>>106221083
I wonder if Wan could save the nonsense in Chroma outputs.
Anonymous No.106221092 [Report] >>106221141
>>106221083
if only it was the nsfw meta, it would be the sota local model
Anonymous No.106221099 [Report]
Anonymous No.106221126 [Report] >>106221172
>>106221026
Anonymous No.106221131 [Report]
>>106220527
Yeah, it's retarded and it being the default is only due to comfyanon ego at this point

You can thankfully change it though, in settings, scroll down to Node Widget and change Widget Control Mode from 'after' to 'before'
Anonymous No.106221141 [Report] >>106221263
>>106221083
>>106221089
>>106221092
i will never figure out how to wrangle t2i wan into giving serviceable details
Anonymous No.106221172 [Report] >>106221183
>>106221126
Seems to have forgotten a ton of concepts/fine details it nailed before with that lora unfortunately
Anonymous No.106221183 [Report] >>106221204
>>106221172
It probably hasn't, this is all Chroma Flash
Anonymous No.106221204 [Report] >>106221214
>>106221183
How fast is Chroma Flash vs normal ?
Anonymous No.106221214 [Report] >>106221239
>>106221204
22 seconds on a 3090, full precision
Anonymous No.106221239 [Report] >>106221269
>>106221214
What resolution and steps ?
Anonymous No.106221251 [Report]
>>106221039
blah blah all i see are blonde brunette blah blah blah
Anonymous No.106221263 [Report]
>>106221141
why would you use t2i for a refiner?
statler/waldorf No.106221265 [Report] >>106221272 >>106221278
>the absolute state
i'm sorry I can't bring myself to continue coming to this thread to do these forced memes
It's just so Abhorrent
you guys aren't even trying
I might make fun of the people in the anime general for a bit longer but don't expect much more for me you guys are fucking pathetic
Anonymous No.106221269 [Report] >>106221291
>>106221239
1024x1024
10 steps
heun beta
cfg 1
statler/waldorf No.106221272 [Report] >>106221278
>>106221265
blessed thread of discord fags talking to eachother on the slowest shill board
fags
Anonymous No.106221277 [Report]
>>106220670
TY bro
statler/waldorf No.106221278 [Report]
>>106221265
>>106221272
I mean I might still be around but im returning to no gen posting for sure, Literally not worth the effort
Anonymous No.106221283 [Report]
>>106221281
>>106221281
>>106221281
Anonymous No.106221291 [Report]
>>106221269
Thanks! That's a nice speedup

Also saw Nunchaku is adding support for Chroma, so you will likely get nice speedups without a significant quality drop shortly
Anonymous No.106221338 [Report]