← Home ← Back to /g/

Thread 106308594

380 posts 220 images /g/
Anonymous No.106308594 >>106309500 >>106310202 >>106311536
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106305640

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106308599
first for lolis
Anonymous No.106308622 >>106308665 >>106308681
>>106308571
>>106308559
bruh qwen is nuts
Anonymous No.106308625 >>106308649
β€œPlump” has got to be one of my favorite tags to throw onto any prompt I’m running lately, hands down.
Anonymous No.106308629 >>106308640 >>106308839
Anonymous No.106308640 >>106308905
>>106308629
neat
Anonymous No.106308649
>>106308625
Plump is my favorite body type.
Anonymous No.106308653 >>106308714 >>106308737 >>106309733
After months of being a nogen, I've finally somehow been bestowed with an RTX 6000 BBW today. Perfect timing for Qwen-Image-Edit.

Here's the very first test gen. Qwen-Image, straight from huggingface, diffusers, no uncomfyUI.

test prompt was only
>1girl, rtx 6000 blackwell

nothing impressive but it's working. 60GB of VRAM used.

Qwen-Image-Edit testing next as soon as I can get it to run.
Anonymous No.106308665 >>106308673 >>106308681 >>106308708
>>106308622
chroma... lol...
Anonymous No.106308667
>Still no Qwen-Image edit

I take it is a slow day for the Comfy team?
Anonymous No.106308673
>>106308665
>some keys outright broken
Pottery.
Anonymous No.106308681 >>106308689 >>106308705 >>106308737 >>106308743
>>106308622
>>106308665
you're seriously comparing a 20B model to chroma, in which that model's entire strength is their text?
Anonymous No.106308689
>>106308681
kys
Anonymous No.106308705
>>106308681
Just shut the fuck up. I'm sick of hearing about chroma every time a new model comes out.
Anonymous No.106308708 >>106308747 >>106308758
>>106308665
Well, yeah, I'd expect that out of a 20B parameter model.
Anonymous No.106308714
>>106308653
based and welcome into the fold, fren
Anonymous No.106308737
>>106308681
yes, why not? and please show me any model that can render a keyboard this well, local or SAAS.

we should hold smaller models to a higher standard if we want to escape from parameter bloat. I think a more advanced 9b param model will one day easily be able to render a keyboard like that.

>>106308653
welcome, you joined during a great time for local diffusion.
Anonymous No.106308738 >>106309524
Anyone want to have a go animating this?
Anonymous No.106308739 >>106308756
>more console wars
why can't you just use qwen for sfw and chroma for nsfw and leave it at that?
Anonymous No.106308743
>>106308681
Bro really thinks a frankenstein Flux Schnell is with T5 is going to save local, lul.
Anonymous No.106308747 >>106308758 >>106308770
>>106308708
Another Flash limitation, bizarrely enough doesn't always recognize she's supposed to be in Japan. Though maybe lack of negs is playing a role I don't realize here on my simple prompt (and obviously some engineering will lead to better results)

>Amateur photograph, view from behind as cute Japanese woman rides a bike, bunny cosplay, thick thighs and fishnet leggings, heels, hair tied to a bun
Anonymous No.106308756
>>106308739
Current forefront human society is void of any meaningful confrontation or strife that we are compelled to create meaningless competition.
That, and bad actors.
Anonymous No.106308758
>>106308708
>>106308747
Hahaha imagine if she decided to like as a joke of course just sit on your face haha wouldn’t that be hilarious
Anonymous No.106308769 >>106308820
chroma flash works fine for single-subject prompts but if i do anything multi-subject it immediately slops the outputs. hyper + rescale loras with 8 steps tends to work better.
Anonymous No.106308770 >>106308802 >>106309024
>>106308747
>down Japanese street

Much better result
elf-hugger No.106308790
>get up
>turn on peecee
>gen up some sounds for listening while making tea
>gen cuties
>check to see if robots can do my housework yet
>gen podcast contractions of books
>poast
>sleep
Anonymous No.106308802
>>106308770
nobody parks on the side of the road like that in Japan.
Anonymous No.106308820
>>106308769
Some multiple subject results at a higher res are very, very good. But the issue is prompt adherence compared to full HD model unfortunately. But this Flash model is based on V48 apparently. So perhaps a proper merge with V50/V49 or an experimental HD model (none of which I've tested) could help.
Anonymous No.106308831 >>106308856 >>106309007
holy qwen
Anonymous No.106308837
amateur and professional style photography can live together in harmony
Anonymous No.106308839 >>106308905
>>106308629
I gotta say, this is the first time I see consistency in the sunlight on the surface and on the planets/moons in the sky. That always bugs me too much and this one is correct. Thank you very much.
Anonymous No.106308856 >>106309007
>>106308831
Looks like it could be a still from a 90s HK movie, neato
Anonymous No.106308875 >>106308973
THE SECOND QWEN EDIT PR HAS HIT THE GIT HISTORY

https://github.com/comfyanonymous/ComfyUI/pull/9412
https://github.com/comfyanonymous/ComfyUI/pull/9412
https://github.com/comfyanonymous/ComfyUI/pull/9412
Anonymous No.106308905
>>106308640
>>106308839
thx
Anonymous No.106308945
can one of you 24GB chads to a video of Art the Clown walking in on gen z boss and a mini?
Anonymous No.106308947
baby shark do do
Anonymous No.106308951 >>106308972 >>106308985 >>106309007
babby's second Qwen-Image test on RTX 6000, with an actual prompt this time

>prompt = '''high-contrast manga-style illustration of a female space marine wearing a form-fitting robotic exoskeleton. she has a black bob haircut and narrow blue glasses studded with blinking LEDs. her armored suit is gray and blue. she is leaning back against a railing, in front of a large window looking out into space from low Earth orbit. Half of the Earth's face occupies the view out the window. Mechanical contrivances are visible dangling around the periphery of the frame. The illustration is highly detailed, and the character design is in the style of Ilya Kuvshinov and Tsutomu Nihei.'''
Anonymous No.106308959 >>106309007
why does this infinite canvas not satiate my feelings of loneliness and isolation
Anonymous No.106308966 >>106308972 >>106310901
Anonymous No.106308972
>>106308951
>>106308966
good stuff
Anonymous No.106308973
>>106308875
im getting annoyed at comfy shilling this shit so hard.
Anonymous No.106308985 >>106309007
>>106308951
God I love natural language prompting so much bros, wish my potato could run a decent model for it. Unless there’s a good sdxl offshoot that can do it and do anime well
Anonymous No.106308990 >>106309042 >>106309096 >>106309137 >>106309985
Anonymous No.106308995
qwen loras are all garbage so far, they ruin stuff more than help
Anonymous No.106309007 >>106309030
>>106308831
>>106308856
this is with the emotional photography lora on civit.

>>106308951
not a bad result for a beginner with qwen.

>>106308959
sounds like you need to start using local chatbots.

>>106308985
your best bet is sd35m or lumina2.
Anonymous No.106309024
>>106308770
>jap street on a residential area
>more than one lane
Anonymous No.106309030
>>106309007
I like it, very wong kar wai
Anonymous No.106309042 >>106309055 >>106309063 >>106309080 >>106309090
>>106308990
if the roles were reversed you'd get banned for this shit
Anonymous No.106309055 >>106309066
>>106309042
very reddit post
Anonymous No.106309063
>>106309042
Why would a man be in a miniskirt and have a bodacious ass though?
Anonymous No.106309066
>>106309055
where do you think we are
Anonymous No.106309080
>>106309042
i dont want to see some juicy man ass bending over tho
Anonymous No.106309088 >>106309096 >>106309103 >>106309110 >>106309152 >>106309985
>if the roles were reversed you'd get banned for this shit
well yeah no one wants to see a man in a thong you faggot
Anonymous No.106309090
>>106309042
he's already gotten banned pretty sure he's evading
Anonymous No.106309096
>>106308990
>>106309088
MOOOOOOOOOOOOOOOOOOOOOOOOOOOOODS
Anonymous No.106309103
>>106309088
>plump
Hnnnnnngh
Anonymous No.106309108 >>106309964
Anonymous No.106309110 >>106309121
>>106309088
The role equivalent would be a man in tight clothes with a huge penis bulge walking in a sexy fashion towards a loli.
Anonymous No.106309121 >>106309132
>>106309110
there is no "equivalent" you faggot child molester. men and women are fundamentally different.
Anonymous No.106309132 >>106309148
>>106309121
you aren't fooling anyone with your mommy x shota fetish.
Anonymous No.106309137 >>106309148
>>106308990
I wonder whats the stuff you don't upload
Anonymous No.106309139
>>106307460
The 600 watt, this one. https://www.pny.com/nvidia-rtx-pro-6000-blackwell-ws
Anonymous No.106309144 >>106309153
Anonymous No.106309148
>>106309132
i have many fetishes, none of them change the validity of what i said

>>106309137
i upload lots of stuff on /b/
it mostly stays non-nude except for topless though just because of wan's limitation
but lots of boys feeding their latina maids drippy popsicles and stuff
Anonymous No.106309152 >>106309183
>>106309088
your vids are so realistic now. mind to share the workflow?
Anonymous No.106309153 >>106309314
>>106309144
this would've been so much better if the guy was in a furry costume
Anonymous No.106309158
Anonymous No.106309183 >>106309208
>>106309152
i shared a bunch of upskirts actually if you search "catbox upskirt" or something on desuarchive you should find them and they have workflows attached

i have been happy with using 2.2 lighting v1.1 at 0.6 and 2.1 lightx2v at 0.4. feels smart, pretty and better movement. using Q8_0 and 6 steps euler
Anonymous No.106309197 >>106309312
Anonymous No.106309208
>>106309183
thanks my bro. keep the diamond talent
Anonymous No.106309210 >>106309312
20 secs per gen is insane, like total footfag victory
Anonymous No.106309246
Anonymous No.106309261 >>106309292
for chroma: phrase your negatives like flux sentences: "This is a low resolution digital painting. The aesthetic appeal of this painting is lacking

literally night and day difference in gen quality
Anonymous No.106309262 >>106309273 >>106309310 >>106309661
Actual glowie here. Be warned that I have been closely monitoring all the nonces and antisemitic posts on these threads.
Anonymous No.106309273
>>106309262
nonce = pro-semitic. so you're just monitoring everyone.
Anonymous No.106309292
>>106309261
aw fuck reddit spacing, I have to write MD for work
Anonymous No.106309310
>>106309262
Why is this video super slow motion
Anonymous No.106309312
>>106309197
>>106309210
more like body horror victory, stop posting your crap please, or at least try to fix it before you start spamming your chroma shit
Anonymous No.106309314 >>106309322
>>106309153
Anonymous No.106309322 >>106309403
>>106309314
Why is this video super choppy
Anonymous No.106309359
getting black screen again with qwen. seems to happen with a longer prompt. i dont have sage attention active or anything like that. do i have to limit my qwen prompts now
Anonymous No.106309389 >>106309430
https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main
specifically which one of these files should I download? I heard that the 4step lora might give better quality than the 8step lora.
im assuming bf16 is worse quality because it's smaller
Anonymous No.106309399
>janny deletes some of the only good videos posted in a while
Typical trannyjanny
Anonymous No.106309403
>>106309322
idk. I don't think it's the upscale method.
Anonymous No.106309410 >>106309453
For Comfy, where do I get the bong_tangent scheduler?
Anonymous No.106309415 >>106309475
why are they shilling chroma so much? tired of this shit
Anonymous No.106309430
>>106309389
the 4steps is cool
Anonymous No.106309453 >>106309513
>>106309410
res4lyf
Anonymous No.106309458 >>106309483
First testing of Qwen-Image-Edit. I think I picked something too difficult so I'll try an easier one next.

Original image on the left is an astronaut looking out through a window into space.
Goal is to change the camera location so that we're looking at him through the window from outside.

try 1:
>prompt = "Look at the man through the window from outside the space capsule"
try 2:
>prompt = "Change the viewpoint so that the man's face is seen through the window from the outside of the space capsule. The outside of the space capsule is visible surrounding the window."
try 3:
>prompt = "Move the camera so that it aims directly at the front of the man's face."
try 4:
>prompt = "Move the camera so that it aims directly at the front of the man's face, looking at him through the window from outside the spacecraft."

try 3a - 2nd generation edit of 3:
>prompt = "Move the camera away, so that it is looking at the man through the window from outside the spacecraft."
Anonymous No.106309475 >>106309502 >>106309671
>>106309415
there is literally a faggot in this thread shilling qwen dude.
Anonymous No.106309483
>>106309458
mislabled it, the 2nd generation edit is of 2 rather than 3, but you get the idea
Anonymous No.106309500
>>106308594 (OP)
how do I find something like infinite worlds without it being cucked?

better yet how do I have my own?
Anonymous No.106309502 >>106309541 >>106309552
>>106309475
>The chroma fag can't allow the existence of other models than chroma.
Anonymous No.106309513
>>106309453
Thanks.
Anonymous No.106309524
>>106308738
Anonymous No.106309538
https://huggingface.co/Comfy-Org/Qwen-Image-Edit_ComfyUI/tree/main/split_files/diffusion_models

yay
Anonymous No.106309541 >>106309552
>>106309502
cope, I only use wan2.2 to make fetish videos you vramlet. I don't care about shit you do with your 3090.
Anonymous No.106309552 >>106309751
>>106309541
>>106309502
Anonymous No.106309576 >>106309619 >>106309653 >>106309825
Where did the not using chroma = vramlet thing even come from? It arguably the least demanding of the models we discuss here.
Anonymous No.106309581
Not much luck with Qwen-Image-Edit so far

>prompt = "change to isometric perspective"
Anonymous No.106309619
>>106309576
huh? it's pretty demanding
Anonymous No.106309632 >>106309655 >>106309696
This worked a bit better, and I suppose it's technically correct

>prompt = "make the line weight heavy on the girl"
Anonymous No.106309643
anyone got tips for ClownsharKSampling with Wan? I feel like it's a resample of some kind, but I think that the way that res2_s schedules doesn't really work with that
Anonymous No.106309653 >>106309699 >>106309712
>>106309576
poorfag projection really. chroma is a shitty model but 3080jeets (caste mentality) think it's a step up from SDXL (it's not) so they clap their feet over generating blurry artifacted garbage. the same people who would use plastic SDXL shitmixes and claim they're superior to the base finetunes.
Anonymous No.106309655
>>106309632
uhh that planet is about to crash into the other planet
Anonymous No.106309661
>>106309262
>I have been closely monitoring all the nonces and antisemitic posts on these threads
>nonces
4chan is banned in your shithole country paki go rape a teenager the police don't care

Also don't worry I pro-semitic nonce post often e.g a beautiful little Jewish princess getting her feet massaged by two Americans wearing maga hats so I play both sides and come out ahead. I love Jewish girls so much mr glowie they're the only girls I can tolerate acting bratty and entitled and superior in real life because they actually are and nothing is hotter than a princess who knows she's a princess who is actually a princess
Anonymous No.106309671
>>106309475
but qwen is a new, no wonder that people try and share results
Anonymous No.106309678 >>106309685 >>106309764
Please post better gens. Your life depends on it.
Anonymous No.106309685
>>106309678
>life
my bad ment *dick
Anonymous No.106309696
>>106309632
Trying to give it more specific instructions doesn't seem to have helped much.

>prompt = "make the line weight heavy on the girl. do not alter the shading. do not modify the image except the girl."

Trying negative prompt next. I don't even know if Qwen-Image-Edit is even supposed to support negatives, but we'll see
Anonymous No.106309699 >>106309725
>>106309653
This feels most likely. I'm just sitting here pumping away on two GPUs running wan and some guy calling me a vramlet has me puzzled.
Anonymous No.106309707
Anonymous No.106309712
>>106309653
care to post your sdxl realism? oh wait, that will literally never happen, lmao
Anonymous No.106309725
>>106309699
describe your setup. how do you run 2 gpus and what are they
Anonymous No.106309733
>>106308653
based
looks cool
Anonymous No.106309741
Anonymous No.106309745 >>106309754 >>106309756
I don't normally say this, but I think we need to split the general to include /cdd/ I can't put up with their delusions and narcissism anymore.
Anonymous No.106309751
>>106309552
now is the time to start flipping burgers
Anonymous No.106309754 >>106309766 >>106309767
>>106309745
cdd??
Anonymous No.106309756 >>106309766 >>106309767
>>106309745
And CDD stands for?
Anonymous No.106309760 >>106309794 >>106309892
Qwen-Image-Edit reaction to negative prompt

>prompt = "make the line weight heavy on the girl."
>negative_prompt = "use flat shading. alter the style of the background."

Interesting at least
Anonymous No.106309764
>>106309678
My GPU has started making a clicking sound when fans are spinning because Nvidia quality control is dogshit so I need to make sure my wife is asleep before I can start generating i hope you understand

If you give me a suggestion for something to gen in the meantime I could take it into consideration
Anonymous No.106309766
>>106309754
>>106309756
cock defiling degens
Anonymous No.106309767
>>106309756
>>106309754
Chroma Diffusion Deneral.
Anonymous No.106309778
antichroma trolls out in full force huh
Anonymous No.106309784 >>106309808
Anonymous No.106309794 >>106309880
>>106309760
Ask it to make a depth map of the image.
Anonymous No.106309808
>>106309784
deep
Anonymous No.106309814
Anonymous No.106309825
>>106309576
This place has a lot of weird memes that are contradictory to real life, like Apple users are accused of being poor poos, who in real life can’t even afford Apple devices and use like shitty TCL phones and such. Don’t try to make sense of it.
Anonymous No.106309826 >>106309831 >>106309841
qwen edit is not good, it changes the pose of the subject, doesnt follow prompts, what a letdown
Anonymous No.106309831 >>106309869
>>106309826
No example no believe.
Anonymous No.106309841 >>106309845 >>106309849 >>106309869
>>106309826
Needs proper workflow, needs prompt tests
kontext needed a lot of fiddling to change the image minimally too
Anonymous No.106309845
>>106309841
She looks better with small breasts.
Anonymous No.106309849
>>106309841 (me)
That image is old tip for kontext btw, not qwen image edit
Anonymous No.106309868 >>106309985
Anonymous No.106309869 >>106309878 >>106309889 >>106309891
>>106309831
I'm using the same prompt examples that are on their hf page, if you look at them, most of the images are different from the original input, another thing it could be is just a bad comfy implementation (not rare tbqh)

pic rel. "obtain back view"

>>106309841
stfu, you're always making up some schizo workflows thats just snake oil
Anonymous No.106309878 >>106309936
>>106309869
try
Change the point of view to be from the woman's back without changing anything about the woman or the rest of the image.
Anonymous No.106309880 >>106309892 >>106309895
>>106309794
>prompt = "make a depth map of the image"
Anonymous No.106309889 >>106309896 >>106309896
>>106309869
another example, by default it changes the hands position to a neutral pose, it probably because of the training
Anonymous No.106309891 >>106309905 >>106309914
>>106309869
>pic rel. "obtain back view"
This is the prompt IQ of your average non-vramlet Chroma hater btw.
Anonymous No.106309892
>>106309760
>>106309880
cnet preprocessors on suicide watch
Anonymous No.106309895
>>106309880
Hmm intradesting.
Anonymous No.106309896 >>106309901
>>106309889
obtain the back-side

>>106309889
I'm using the examples qwen posted on their hf retard
https://huggingface.co/Qwen/Qwen-Image-Edit
Anonymous No.106309901 >>106309936
>>106309896
Try with your own prompts, like rotate subject 180 degrees.
Anonymous No.106309904 >>106309917 >>106310448 >>106310652
Anonymous No.106309905 >>106309962
>>106309891
I'm using the examples qwen posted on their hf, you retard
https://huggingface.co/Qwen/Qwen-Image-Edit

Even their examples are fucked
Anonymous No.106309912 >>106310108
Anonymous No.106309914 >>106309962 >>106309965
>>106309891
Anonymous No.106309917
>>106309904
so this is why its so style locked and it apparently would be really hard to get out of that without over cooking the model further
Anonymous No.106309919 >>106309931 >>106309937
why should i, a regular anon, trust the words of a degenerate furfag
Anonymous No.106309931 >>106309953
>>106309919
his track record speaks for itself, doesn't it?
Anonymous No.106309936 >>106309941 >>106309942
>>106309878
>>106309901
here comes the qwen schizos cope:
>t-the model is good i swear, i-i-ts the prompt that isnt right

pic related:
"Change the point of view to be from the woman's back without changing anything about the woman or the rest of the image."
Anonymous No.106309937
>>106309919
by having used flux and then qwen
Anonymous No.106309941
>>106309936
coped so hard he imagined a gen
Anonymous No.106309942
>>106309936
Anonymous No.106309947 >>106309970 >>106309985
Anonymous No.106309950
we'll have to wait for the miku anon to thoroughly test it as he did kontext
Anonymous No.106309953
>>106309931
Yeah he really deep fried and fucked that model.
Anonymous No.106309962 >>106309973
>>106309905
>>106309914
Kontext devs also didn't give a good workflow that won't change the image, the community needed to find the proper prompt and workflow to make the model mask or not mask what you want while having no destructive changes beyond the VAE loss, the image edit developers all just want their model to be usable by social media normies that are not gonna care about this when they say "make my pfp into gibli", if anything, they want the model to "fix it up" more.
Anonymous No.106309963
the only thing it can do right is change the background and lighting
Qwen cannot handle big resolutions either, so whats then point? downscale a big image to change its lighting?
I better use kontext for that, what a let down indeed
Anonymous No.106309964
>>106309108

>Trying a chroma prompt in qwen
Anonymous No.106309965
>>106309914
notice the two examples of the right, it changed the pose of the subject
Anonymous No.106309970
>>106309947
>tfw its not a blue prius
Anonymous No.106309973
>>106309962
at least kontext doesnt change the subject pose
Anonymous No.106309978
>Boss returns from lunch.
Anonymous No.106309980 >>106309995 >>106310122
Imagine if LDG pooled all its compute and knowledge and trained its own model. How would it fare?
Anonymous No.106309981 >>106309993
Anonymous No.106309985 >>106309994 >>106310044
>>106308990
>>106309088
>>106309947
>>106309868
sorry to be that guy. I have never made anything AI related. what would I download to make similar videos? there are so many different options in the OP
Anonymous No.106309986 >>106310010 >>106310138
Can one of your chromafags post a realistic workflow and prompt example? The basic one I set up (v50 annealed, fp16 t5) is alright, but despite verbose prompting I can't get anywhere near realistic looking gens.

Follow up - are any of the following needed?

* Hyper-FLUX.1-dev-16steps lora?
* Any sort of upscaler? (I've seen basic integrated hires fix and then completely dedicated upscaling workflows)
* Anything else of note?
Anonymous No.106309991 >>106310191
Anonymous No.106309992 >>106310000 >>106310020 >>106310029 >>106310119 >>106310209 >>106310702
guys don't underestimate the text encoder, I've noticed you always get way better quality if you use the largest one you can.
so for example don't use qwen_2.5_vl_7b_fp8_scaled, use qwen_2.5_vl_7b.
this applies to Flux as well.
Anonymous No.106309993 >>106309999
>>106309981
blurred but kino and perhaps showing some inclings of soul
Anonymous No.106309994
>>106309985
https://rentry.org/ldg-lazy-getting-started-guide
>ctrl f video
Anonymous No.106309995
>>106309980
There's probably a MAX of 30 users in a thread at its busiest. It would be abysmal.
Anonymous No.106309999
>>106309993
here's your soul https://files.catbox.moe/6z2tng.png
Anonymous No.106310000 >>106310013
>>106309992
I cringe whenever I see a workflow with a quanted text encoder. It's basically guaranteeing you will lose quality.
Anonymous No.106310007
Anonymous No.106310010
>>106309986
Use regular v50 and +128px to your height and width
https://files.catbox.moe/ssp56q.png

While we wait for a new version to be (re)trained
Anonymous No.106310013
>>106310000
I bet a lot of people probably think that the text encoder and the model need to fit into the VRAM together at the same time. That's why people keep making this mistake.
Anonymous No.106310020
>>106309992
>i-ts not the model, I swear, it has to be something else...i-i-ts the text encoder
Anonymous No.106310026
Anonymous No.106310029 >>106310033
>>106309992
got a comparison?
Anonymous No.106310031
Anonymous No.106310033 >>106310080 >>106310119
>>106310029
nta, but I thought this was common knowledge. Idk when it stopped being common practice to use the fp16 encoder.
Anonymous No.106310039 >>106310050
Should I tick these Never OOM boxes in Forge?
Anonymous No.106310044
>>106309985
Be ashamed of needing this spoon-feed you lazy fuck just click on things
Ok to be fair linking to the Wan-Video GitHub account isn't actually that useful and that's the only one that says video in the OP so now I don't blame you
Anonymous No.106310045 >>106310153
chroma is extremely diverse art style wise, its the first model to actually beat midjourney for non slop images, here's hoping he can fix it for higher res
Anonymous No.106310049
What is the best Cuda version right now for comfyui? 12.8? I am still on 12.6
Anonymous No.106310050
>>106310039
not unless you want your latents to NaN
Anonymous No.106310067 >>106310144
Trying to make a Wan cum tribute LoRA
What should I caption?
>a man jerks his penis and cums on the screen
??
Anonymous No.106310080 >>106310119
>>106310033
same for the model. 8bit wan is trash compared to fp16
Anonymous No.106310102 >>106310186
Anonymous No.106310108 >>106310112 >>106310117
>>106309912
the angry tree near hogwarts grabs flying blue toyota prius 2015
Anonymous No.106310112
>>106310108
>angry tree near hogwarts
Whomping Willow
Anonymous No.106310117
>>106310108
It's called the Whomping Willow.
Anonymous No.106310119 >>106310140
>>106310080
>>106310033
>>106309992
>use uncompressed qwen TE
>Given normalized_shape=[3584], expected input with shape [*, 3584], but got input of size[1, 176, 2048]
what the fuck?
Anonymous No.106310122
>>106309980
imagine retards stopped buying 4090s and 5090s+ and paid chinese guys for optimisations instead
Anonymous No.106310138 >>106310184 >>106310208
>>106309986

Annealed is a bad epoch that lode really should not have publicly released. It's not your fault (there's zero documentation anywhere), but don't use that one. V48 is generally seen as the current best all-rounder epoch. You need to describe the style of image you want, so realism requires a description like "a 35mm film photograph" or "a candid, casually taken photograph" depending on the look you want. It pulls a lot of information based on time of day, lighting, and setting descriptions as well.
Anonymous No.106310140 >>106310159 >>106310160
>>106310119
Likely using the wrong text encoder. There are like 50 T-5s floating around at this point.
Anonymous No.106310144
>>106310067
well
well
well?
Anonymous No.106310153
>>106310045

It's what I love about it. All the other models since 2024 still generally have that slop look to them. Krea just buries everything in yellow-tinted noise to hide it.
Anonymous No.106310158
kek
west: kontext dead, long live qwen edit
chinese: qwen edit shit, long live kontext
chinese fanboy mindset is just different
Anonymous No.106310159
>>106310140
you're right, nm
Anonymous No.106310160
>>106310140
you're right, nm
Anonymous No.106310167 >>106310239
Is flan_xxl better than the old t5_xxl for Chroma?
Anonymous No.106310169
Anonymous No.106310172 >>106310178
The OP is confusing and unintuitive should be structured more like

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106305640 (Cross-thread)

READ THIS FIRST
https://rentry.org/ldg-lazy-getting-started-guide

## Images
### Image Checkpoints
- Qwen-Image:
- Chroma: https://huggingface.co/lodestones/Chroma1-HD/tree/main
- Training: https://rentry.org/mvu52t46
- Illustrious
- 1girl and Beyond: https://rentry.org/comfyui_guide_1girl
- Tag Explorer: https://tagexplorer.github.io/

### Image UIs
- ComfyUI: https://github.com/comfyanonymous/ComfyUI
- SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
- re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
- SD.Next: https://github.com/vladmandic/sdnext
- Wan2GP: https://github.com/deepbeepmeep/Wan2GP

## Videos
### Video Checkpoints
- Wan Video
- GitHub: https://github.com/Wan-Video
- 2.1 Guide: https://rentry.org/wan21kjguide
- 2.2 Guide: https://rentry.org/wan22ldgguide
- Additional Info: https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

## Resources
### Checkpoints & Models
- Civitai: https://civitai.com
- CivitAI Archive: https://civitaiarchive.com/
- Tensor Art: https://tensor.art
- OpenModelDB: https://openmodeldb.info
- OpenArt Workflows: https://openart.ai/workflows

### Tuning
- Demystifying SD Fine-tuning: https://github.com/spacepxl/demystifying-sd-finetuning
- OneTrainer: https://github.com/Nerogar/OneTrainer
- SD Scripts: https://github.com/kohya-ss/sd-scripts/tree/sd3
- LoRA Easy Training Scripts: https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
- Diffusion Pipe: https://github.com/tdrussell/diffusion-pipe

## Misc

## Neighbors
Anonymous No.106310178
>>106310172
Oh and have a video UIs section for wan2gp
Anonymous No.106310182 >>106310222 >>106310232
erm, i think my android boy is malfunctioning
Anonymous No.106310184 >>106310208 >>106310230
>>106310138
Thanks anon, I'll give V48 a go. Is ChatGPT or any LLM I can run on a 24gb card good at giving the T5 style prompts?

>A professionally lit studio fashion photograph taken on a canon r5 with an 85mm f1.4 lens of a young pale white woman in her early-twenties with fair skin, d-cup breasts, a slender hourglass build with a tiny waist and wide hips, and jet black shoulder-length hair in two neat pigtails.

That's how I start my prompt, then proceed to describe her stance, outfit and location in different blocks. Not sure if sperging about the camera specs is the way to unlock the goodness, but that's what I've got so far.
Anonymous No.106310186
>>106310102
police should be gunning down the furfags
Anonymous No.106310191
>>106309991

I like it.
Anonymous No.106310202 >>106310995
>>106308594 (OP)
Anyone that pushes a repo with a dependency version constraint on pytorch should be shot and their body dragged through the streets like an assassinated dictator
FUCK OFF I am adult I will figure it out, you guessed wrong bitch
Anonymous No.106310208 >>106310241
>>106310184
>>106310138

Oh yeah, also - is the general rule of thumb to get just the numbered version, or the version that "sounds better"

There's straight up v48 and chroma-unlocked-v48-detail-calibrated. I wish I didn't have to ask all this crap, it's very ironic how documentation is now nonexistent despite having robots that can write for us now...
Anonymous No.106310209 >>106310221
>>106309992
not seeing a huge difference between the two. and fp8 T5 is faster.
Anonymous No.106310221
>>106310209
its another anon bullshitting again episode, tonight episode: "t-the text encoder is wrong I swear"
Anonymous No.106310222
>>106310182
Just catbox next time retard
Anonymous No.106310230 >>106310359
>>106310184

You can use gemini or gemma just fine if you need help with prompting. Just be aware that they tend to love inserting purple prose that the encoder has no use for. The camera stuff is totally on point, so don't worry. Line breaks also work fine with t5 for splitting up the prompt into paragraphs, don't use BREAK or any old sdxl stuff. Chroma understands booru tags but they skew the image toward an anime/furry style.
Anonymous No.106310232 >>106310245 >>106310350
>>106310182
since when is kissing nsfw?
white people kiss their kids on the lips all the time
Anonymous No.106310238
Anonymous No.106310239 >>106310252
>>106310167
>Is flan_xxl better than the old t5_xxl for Chroma?
Some said it is (wrong opinion). I tested on 2 images and it was worse (brute fact).
Anonymous No.106310241
>>106310208

To my understanding, detail calibrated is the same but with some of the training datasets being in a higher resolution. Functionally, I'm not sure how much difference there is because Lode refuses to elaborate on this stuff lmao.
Anonymous No.106310245 >>106310344
>>106310232
Does anyone do that but some retard parents and maybe americans?
Anonymous No.106310252 >>106310264
>>106310239

Have you tried GNER? These all seem like placebos.
Anonymous No.106310258
>pull
>everything's changed
>again
Since when does comfyui has a native node manager?
Anonymous No.106310264 >>106310330
>>106310252
No, the only relatively obscure thing that can be an improvement to chroma is res_2s bong, and only on some gens while making others unstable
Anonymous No.106310282 >>106310294 >>106310308
sooo is gguf Q8 better than fp8? even speed wise?
Anonymous No.106310294
>>106310282
Hard to say without knowing the model.
Anonymous No.106310308
>>106310282
I had better gens on FP8 Kijai nodes wan2.2
Anonymous No.106310328
>He doesn't use fp8 high and Q8 low.
Anonymous No.106310330 >>106310355
>>106310264

Gotcha. Would you say fp16 text encoder is any visibly better than a q8 or fp8?
Anonymous No.106310339 >>106310347 >>106310353
Anonymous No.106310344 >>106310375
>>106310245
>Does anyone do that but some retard parents and maybe americans?
kissing on the lips between parents and children happens in 90% of cultures you gremlin that was raised without love

Yes, it is true that kisses on the lips between parents and children are more common than romantic kissing across cultures worldwide.

According to the sources, while romantic kissing is not universal and varies significantly by region, with prevalence rates ranging from 46% to 73% in different areas, parental kissing is observed in approximately 90% of cultures. This indicates a higher prevalence of parental kissing globally compared to romantic kissing.

Answer: Yes, kisses on the lips between parents and children are more common than romantic kissing across cultures worldwide.
Anonymous No.106310347 >>106310353
>>106310339
Anonymous No.106310350
>>106310232
implying i didn't delete my own post to dodge the jannies
Anonymous No.106310353 >>106310371
>>106310339
>>106310347
model?
Anonymous No.106310355 >>106310371
>>106310330
No reason not to have fp16 encoder since its not held in vram, there is a difference.

Picrel are model quants not text encoder quants but they show you the difference, minimal between q8 and fp16, and then everything else is noticably worse.
Anonymous No.106310359 >>106310393
>>106310230
Sweet. I don't know what booru tags are and I'm attracted to human females, so my retardation is helpful on that front.

Is Qwen V3 alright for prompt help, or is Gemma/ini the one to go for?
Anonymous No.106310371 >>106310376 >>106310379
>>106310353
qwen

>>106310355
>No reason not to have fp16 encoder
it can be slower though
Anonymous No.106310375 >>106310381
>>106310344
>npc quoted chatgpt and also equivocated something that exists in some form in most cultures as being popular in all of them
Maybe for you family members kissing on the lips was normal because your parents are siblings? Lmao retard.
Anonymous No.106310376
>>106310371
I knew it. The consistent netting, the plausible background that look like what they're supposed to.
Anonymous No.106310379 >>106310413
>>106310371
>it can be slower though
Compared to Q8 not really and there's no reason to not gain a little more quality very cheaply, especially in the area of prompt adherence and quality, something that's the biggest problem for models.
Anonymous No.106310381
>>106310375
good bait made me smile have a (You)
Anonymous No.106310393 >>106310421
>>106310359
Just use Gemini, it's free.
Anonymous No.106310397
pic of u
Anonymous No.106310413
>>106310379
fp8 scaled TE shaves off ~15 seconds for me with qwen, and I haven't found a comparison where it made a big difference. I think this might matter more with T5 models
Anonymous No.106310415 >>106310431
>just realized you can use inpainting to mask AND adetailer at once
I feel so dumb
it works great for fixing the eyes without fucking up the face when you draw tiny little masks on the eyes
Anonymous No.106310421
>>106310393
I want to self host! The L in /ldg/ stands for local, silly (fag).

Is picrel helping or hindering (the LoRA)?
Anonymous No.106310431 >>106310447
>>106310415
>adetailer

This thread is for people 24gb of vram and over. Scram
Anonymous No.106310447
>>106310431
rude I just recently got into local gen
Anonymous No.106310448
>>106309904
I sure hope nobody puts any weight on what this retard says.
He is constantly schizo rambling about his own personal theories of how diffusion models work.
He is a crack-pot.
In the training of chroma, he has done at least 5 or 6 things blatantly wrong. You can literally just read his training code and see it for yourself.
He burned 150k USD on chroma and is now desperate to put down every other model and claim it's shit.
Anonymous No.106310450 >>106310460 >>106310464
Anonymous No.106310460
>>106310450
>wan 2.1
Anonymous No.106310464 >>106310480 >>106310590
>>106310450
>weird ass panties
>sword phases through the guy

Someone needs to train some LoRAs on Realise/Pharfaite style swimsuits/lingerie - picrel.
Anonymous No.106310480 >>106310509
>>106310464
>phases through the guy
You mean cuts so cleanly that not even he knows he has been sliced?
Anonymous No.106310493 >>106310520 >>106310530
Anonymous No.106310494
Anonymous No.106310496
>Buys 5090
>Fails to motherboard post due to outdated driver
>Comfyui Cuda error, Triton error, sageattention error, troubleshooting galore.
>200s per gen reduced to 120s gen with VRAM to spare for higher res.

Worth it. Now.. time to fuck around with reforge dependencies
Anonymous No.106310509 >>106310514
>>106310480
yes, my bad sen pai, i didnt realise it was some

>omayy wah moo shin deroo

typa shit
Anonymous No.106310514 >>106310529
>>106310509
It wasn't. I was just justifying it after the fact.
Anonymous No.106310520 >>106310530
>>106310493
Anonymous No.106310529
>>106310514
yeah i know its called a joke you asperger having doublenigger
Anonymous No.106310530 >>106310549
>>106310520
>>106310493
Kontextsister? Our response?
Anonymous No.106310549 >>106310553
>>106310530
not kontext, qwen edit Q5
https://huggingface.co/QuantStack/Qwen-Image-Edit-GGUF/tree/main
Anonymous No.106310553
>>106310549
yeah i know its called a joke you asperger having doublenigger
Anonymous No.106310559 >>106310571 >>106310572 >>106310591
Anonymous No.106310565
Anonymous No.106310571
>>106310559
Her bag and sunglasses are gone.
Anonymous No.106310572
>>106310559
Interested in your general workflow and model if you don't mind - I love realistic gens with that PVC/Latex look.
Anonymous No.106310590 >>106310613
>>106310464
>Someone needs to train some LoRAs on Realise/Pharfaite style swimsuits/lingerie - picrel.
Add Leohex and Bitysie to that list even if they're mostly chink bootleg Realise. Both brands have some promotional videos which could be used to train for wan even.
Anonymous No.106310591 >>106310628
>>106310559
double steps (40)
Anonymous No.106310613 >>106310625 >>106310636 >>106310658
>>106310590
I have thousands of studio quality pictures of my gf in realise/leohex/bitysie/pharfaite. Can these be used to train a lora? Sorry if this is a dumb question im brand fuckin new.
Anonymous No.106310625 >>106310709
>>106310613
share them
I can train a LoRA for you
Anonymous No.106310628
>>106310591
>Aug 2025
>We still haven't solved the same face problem
Imgen absolute state of
Anonymous No.106310636 >>106310709
>>106310613
>I have thousands of studio quality pictures of my gf in realise/leohex/bitysie/pharfaite.
I'm so fucking jealous.
>Can these be used to train a lora?
Short answer is yes. You'll have to look up some guides on how to do it.
Anonymous No.106310649 >>106310668 >>106310866
Wan really doesn't do well when the movement gets too fast.
Anonymous No.106310652 >>106310763
>>106309904
So how did he fuck the v49-v50 so hard if he has so much knowledge on training? I mean he did do good for the first v48 but even those versions werent learning artists, anatomy and things like that, and then v49 v50 v50-annealed get created and he makes the decision to name v50 as v1 before people test it more and realize how cooked the 1024x1024 generation is on v50, and how slopped it is in general now. It mostly fixed the anatomy but these were all major fuckups.

At least he bit the bullet and is retraining on v48, which most of the other retards in this world never want to do, say a lot of their work and money was wasted and try again.
Like pony v7, although maybe he had some kind of a contract early on too and couldn't switch to train on flux.
Anonymous No.106310654
>qwen image edit 50 steps cfg 4
bros I wanna go FAST WHAT THE FUCK
Anonymous No.106310658 >>106310709
>>106310613
Load them in your trainer, describe as much as possible of the girl (hair color, breast size, where she's looking at, full body, portrait etc) except the swimsuit in the tagging. Have instead the same trigger word like "pharfaite" in every description.
Lora will learn that the swimsuit is essential for the generated image, but everything else (what you described individually) can vary.
Anonymous No.106310664 >>106310681
well thats not good
Anonymous No.106310668
>>106310649
Will we ever reach a point where AI stops being an idiot?
Anonymous No.106310681 >>106310754
>>106310664
Are you running the workflow without changing anything? It stores the previous result.
Anonymous No.106310702
>>106309992
This has always been my approach. Full or nearly full text encoder, smaller gguf for the model.
Anonymous No.106310706 >>106310713 >>106310721
Create a view from outside the car looking in.
Anonymous No.106310709 >>106310729 >>106310800
>>106310625
funny guy
>>106310636
it's fun. she'll also wear them under a shirt+skirt or dress and we'll go walk in the forest/park. 50% nature/landscape shots, 50% really naughty stuff when nobody's around.
>>106310658
Which trainer should I use? Please dump me some resources for this, I think it would be a great learning exercise + immortalizing my fetish in ai image gen.
Anonymous No.106310713 >>106310718 >>106310721
>>106310706
>Now create a view from the front of the car looking in.
Anonymous No.106310718 >>106310721
>>106310713
>Sitting next to him in the passenger seat is Sonic The Hedgehog.
Anonymous No.106310721 >>106310741
>>106310706
>>106310713
>>106310718
Qwen Edit?
Anonymous No.106310729
>>106310709
>Which trainer should I use?
Look into OP.
>Tuning
Anonymous No.106310731 >>106310737
ChromaChads - T5 min_padding: 0 or 1?

Also, ChromaChads - are we negative prompting or leaving it blank?
Anonymous No.106310737 >>106310747
>>106310731
I've had mine on 1 and no complaints
Anonymous No.106310738 >>106310750
bros.... qwen edit bros.... I lost!!!!!!!
Anonymous No.106310741
>>106310721
Yeah the online version. idk. seems kinda shit to me.
Anonymous No.106310747 >>106310782
>>106310737
Nice. I saw some chinks saying 0 was better, but 1 has been fine for me as well. What about your negative prompt?
Anonymous No.106310750 >>106310767
>>106310738
Based 1050chad
Anonymous No.106310754
>>106310681
Trying the infinite gen wan 2.2 workflow
https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
Anonymous No.106310760
Anonymous No.106310763 >>106310771 >>106310785
>>106310652
It's nothing personal, but I don't understand the principle of getting so emotionally involved with someone else's failure.
if it does work out: nice
if it continues to fail: who cares?
anything apart from that I would call emotionally unstable
Anonymous No.106310767
>>106310750
I'm actually on 4080S 16gb... im a vramlet brosssssssss
Anonymous No.106310771 >>106310786 >>106310811
>>106310763
Who's getting emotionally involved? I responded to his own discord comments by mentioned basic facts on what happened
Anonymous No.106310773
Anonymous No.106310781 >>106310799
I've been trying to run wan, but it crashes after a few generations
4090
64gb of ram
Anonymous No.106310782
>>106310747
I don't negative prompt in Chroma. It never turns out well for me.
Anonymous No.106310785
>>106310763
Think of it from the perspective of the coombrain who is using the model - it's his defacto sexual partner, so he'll obviously have some sort of attachment to it . It's that unhealthy blinding lusty "love" that dudes get dicknapped by. Now instead of that energy being poured into a woman, it's directed to a furrynigger trained AI model that he's decided to feed his biological imperative (the need to reproduce for all of the illiterate jeets here) with.

Sad.
Anonymous No.106310786 >>106310815
>>106310771
>I'm not emotionally involved
>I actively comment on his discord

which is it?
Anonymous No.106310799 >>106311186
>>106310781

Post error log dumb ass. Then someone "may" offer advice.
Anonymous No.106310800 >>106310813
>>106310709
>it's fun. she'll also wear them under a shirt+skirt or dress and we'll go walk in the forest/park. 50% nature/landscape shots, 50% really naughty stuff when nobody's around.
Bro just had to twist the knife in the wound.
Anonymous No.106310805 >>106310821 >>106310843 >>106310856 >>106310863
Can anyone point me in the right direction as to where all the faceswapping, deepfaking folks went after mrdeepfakes forum and unstablediffusion and the likes were kill? i simply can not find anything, been crawling archive sites for hours every day like a full time job and i can not come up with anything. it is censored to the fucking max where i am, looking up deepfakes only brings up news on how dangerous and disgusting it is, looking up deepfakes forum only brings up how mrdeepfakes was wiped from the face of the earth. can share very nice deleted loras in return
Anonymous No.106310811 >>106310823
>>106310771
is good. you just have to let it go. we'll help you
Anonymous No.106310813
>>106310800
You can have it too. I'm nothing special, at all. There's lots of nice girls out there that will let you corrupt them (lovingly).
Anonymous No.106310815
>>106310786
I obviously meant I responded to his own discord comments in this thread, retard
Anonymous No.106310821
>>106310805
it died because nobody cares
Anonymous No.106310823 >>106310830
>>106310811
If you dont want local model discussion you're in the wrong thread
Anonymous No.106310830 >>106310838
>>106310823
>if you don't want me to give endless monologues about my love-hate relationship to furry nigger then just say so
I did, didn't I?
Anonymous No.106310838
>>106310830
You seem mentally unwell from all that ad absurdum schizobabble.
Anonymous No.106310843
>>106310805
>Can anyone point me in the right direction as to where all the faceswapping, deepfaking folks went after mrdeepfakes forum and unstablediffusion and the likes were kill?
we made a new tube site
celebfakes dot ru
Anonymous No.106310849 >>106310950
So is 1024x1024 the max in chroma, or can I take it higher if I want to? I don't mind waiting longer for better gens if that's the case.
Anonymous No.106310856
>>106310805
simpcity has ai threads for specific people
Anonymous No.106310863 >>106311093
>>106310805
were you a creator or sunbscriber or just a glowie trying to take us down again?
If a creator let me know your discord I'll invite you to the Survivor's group
Anonymous No.106310866 >>106310879 >>106311185
>>106310649
fp8
vs
Q8 (pic related)

Who wins?
Anonymous No.106310879
>>106310866
you need to make your pussies a little puffier.
Anonymous No.106310883 >>106310894 >>106310895 >>106310899 >>106310938 >>106310959
uhmm bros???
Anonymous No.106310894 >>106310900
>>106310883
Somehow I think Wan won't respond to "Puffy pussy lips."
Anonymous No.106310895
>>106310883
am I a promptlet... whats the issue here??? SAD!!!
Anonymous No.106310899 >>106310918
>>106310883
>niggers using light lora even for image gen
grim
Anonymous No.106310900
>>106310894
somehow I think you need to get a little more creative, bitch.
Anonymous No.106310901
>>106308966
nice
Anonymous No.106310918
>>106310899
you can use the lighting lora even for edit, BRAH:
https://huggingface.co/Qwen/Qwen-Image-Edit/discussions/6
Anonymous No.106310933
>video gen randomly takes 5 minutes longer
>it's botched
whyyyyyy
Anonymous No.106310938
>>106310883
oh fuck im dumb these niggers made a new node:
TextEncodeQwenImageEdit
had to dig into the fucking code
Anonymous No.106310950 >>106310954
>>106310849
You can go higher. 2.0 megapixels is probably the max (like Flux). Try to gen at resolutions that are divisible by 64.
Anonymous No.106310954 >>106311017
>>106310950
Noted. Is it better to go straight for 2MP, or do a 512 or 1024 and upscale?
Anonymous No.106310959 >>106311003
>>106310883
turned out better than expected with the 4 steps lora
Anonymous No.106310965
qwen edit nunchaku when??????
Anonymous No.106310968 >>106310974 >>106310982 >>106311083
Anonymous No.106310974
>>106310968
>no hint of chocolate starfish
Anonymous No.106310982
>>106310968
my mongolian wife...
Anonymous No.106310995
>>106310202
Also did you know that .post1 explicit non-version-bump hotfixes are considered a higher version by pip’s absolutely gangrenous system?
Amazing
Anonymous No.106311003 >>106311060
>>106310959
I don't really know who this character is, so I don't know how badly the change butchered her, but it looks like the style changed pretty dramatically.
Anonymous No.106311017 >>106311075
>>106310954
Upscaling is probably better at this point as some versions of Chroma do better at higher resolutions and others not so much.

Look up "Iterative Latent Upscale" it's a great way to upscale, also look into "skimmed CFG", it allows you to increase CFG for better anatomy without burning the image.
Anonymous No.106311060 >>106311065
>>106311003
did some more testing and yeah, the lighting loras fucking butcher HARD
Anonymous No.106311065
>>106311060
>Blows out colors and ruins compositions
Sasuga lighting LoRA sama.
Anonymous No.106311070 >>106311099
Wan always puts tails in weird spots.
Anonymous No.106311073 >>106311081
getting "mat1 and mat2 shapes cannot be multiplied" with Q8 qwen image text encoder. works fine with the fp16 text encoder.
dafuq
Anonymous No.106311075 >>106311183 >>106311265
>>106311017
Will do - is there a generally agreed upon range for CFG for chroma? As I understand it lower CFG gets you better image quality but more creativity (read: bad prompt adherence) whereas a higher CFG gets you better prompt adherence but with a higher likelihood of having a deep fried gen - does that sound like an alright understanding?
Anonymous No.106311081
>>106311073
are you using the TextEncodeQwenImageEdit node? I was getting the same error and decided to switch back to the normal reference latent shit, which works.
Anonymous No.106311083 >>106311098 >>106311164
>>106310968
There was some freak in the last /ldg/ talking about a butthole lora (I think it was called BBFW, better buttholes for wan) for doing upskirts on wan, perhaps that's of interest to you.
Anonymous No.106311093
>>106310863
i want to create, but reactor and facedetailer just wont do what i want to do, which is especially occlusion aware faceswapping trained on cum and dicks to facilitate BJ and facial faceswaps. however after the nuke its almost impossible to find ANY info, the internet is scrubbed CLEAN.
name darkenrahl1
Anonymous No.106311098 >>106311132
>>106311083
>perhaps that's of interest to you.
I like butts but I paradoxically do not like assholes.
Anonymous No.106311099
>>106311070
>Prompter asks for a tail on a female booty ?
>He probably wants a tail butt plug !
Based Wan
Anonymous No.106311132 >>106311140 >>106311162
>>106311098
Interesting. I was in the same boat until I got to lick some. Now I'm addicted.

I still have yet to lick a woman's though, that's why I'm on /ldg/.
Anonymous No.106311140 >>106311155
>>106311132
>until I got to lick some. Now I'm addicted.
you got parasites
Anonymous No.106311143 >>106311338
Why do my gens look plasticky on Chroma? Is this a wf issue, prompting issue, or something else?
Anonymous No.106311155
>>106311140
unvaxxed and ivermectin says otherwise.
Anonymous No.106311162
>>106311132
Not my thing.
Anonymous No.106311164
>>106311083
https://huggingface.co/jermaloves69/WAN_Passionate_Kissing_v1/tree/main

this includes the BBFW lora and a lot of other good ones that are since deleted
Anonymous No.106311182
Anonymous No.106311183
>>106311075

The low end for Chroma is 3.0. it will still recognize the prompt but the results will be wackier. Top end is 5.0, which works best for very specific prompting. Anything higher burns the image most of the time. I believe the default workflow keeps it at 4.0 for a nice balance
Anonymous No.106311185 >>106311194
>>106310866
Fp8 looks better
Anonymous No.106311186
>>106310799
Requested to load WanVAE
loaded completely 1905.931640625 242.02829551696777 True
E:\ComfyUI\ComfyUI_Wan\python_embeded\Lib\site-packages\torch\nn\modules\module.py:1784: UserWarning: Using padding='same' with even kernel lengths and odd dilation may require a zero-padded copy of the input be created (Triggered internally at C:\actions-runner\_work\pytorch\pytorch\pytorch\aten\src\ATen\native\Convolution.cpp:1028.)
return forward_call(*args, **kwargs)
Comfy-VFI: Clearing cache... Done cache clearing
Comfy-VFI: Clearing cache... Done cache clearing
Comfy-VFI: Clearing cache... Done cache clearing
Comfy-VFI: Clearing cache... Done cache clearing
Comfy-VFI: Clearing cache... Done cache clearing
Comfy-VFI: Final clearing cache... Done cache clearing
Prompt executed in 512.94 seconds
got prompt
Requested to load CLIPVisionModelProjection
loaded completely 6174.734595108032 1208.09814453125 True
Requested to load WanTEModel

E:\ComfyUI\ComfyUI_Wan>pause
Press any key to continue . . .
Anonymous No.106311194
>>106311185
I don't think so. I think fp8 often fails to "converge" on things and creates a messier output.
Anonymous No.106311197 >>106311206
anyone know where i can find faceswapping / deepfake resources? for example pretrained xseg models for cum and penis detection to mask them out for faceswapping without swapping over the nsfw bits
Anonymous No.106311206
>>106311197
Glowie glowie go away.
Anonymous No.106311265 >>106311271
>>106311075
Yes to your question. For Chroma you normally want to be around 3.5 to 4.5 CFG, I have mine set to 6 on the KSampler with Skimmed (linear) CFG set to 3.5. Low CFG doesn't give you better image quality, rather it can give more realistic skin or appearance, you'll know when the CFG is too low when the image quality ends up looking washed out and you get very frequent bad anatomy. Also if you are going for realism don't use euler as you'll end up with ai slop skin. In my own testing I found that one of the better samplers for realism is dpm_2_ancestral/beta but it's slow. If you get the RES4LYF nodes you get access to a lot more samplers/schedulers to experiment with (deis_3m/beta is another favorite of mine in terms of speed and quality).
Anonymous No.106311271
>>106311265
That sounds really unintuitive. I think I'll just switch to qwen image. Thanks.
Anonymous No.106311283
Anonymous No.106311338
>>106311143
>Chroma
Use a good model instead
Anonymous No.106311378 >>106311417 >>106311518
so many models and yet sally sameface lives. how is this possible anons?
Anonymous No.106311417
>>106311378
Don't use Chroma. It's that simple.
Anonymous No.106311427 >>106311431
Does anyone know how to apply controlnet to chroma? I was only able to find a "solution" on reddit and they would share how to actually do it other than "just stick node between here" which just gives me errors
Anonymous No.106311431
>>106311427
*they wouldn't share
Anonymous No.106311494
>safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooSmall
I'm very new to Comfy, my guess is I don't have enough memory to load all the loras? Had 100% RAM usage before getting this error.
Anonymous No.106311518
>>106311378
It's an core part of how machine learning works, it looks for patterns and when it finds one it generalizes that pattern.

So if you prompt for hot redhead woman with green eyes, you will get the generalization of all the redhead women with green eyes it has trained on. This 'problem' is further compounded due to the increase of synthetic data in training, which makes for even more generalization and loss of variation.

Back when the base models trained on celebrities by name, you could circumvent this by prompting for several celebrity names and thus get non-generic looking faces, nowadays you can do this with loras or different people, which truth be told is a more efficient way, but requires additional external data.
Anonymous No.106311536 >>106311765
>>106308594 (OP)
Anon with 3090 reporting in.
I managed to run wan2.2 lightx2v workflow (finally!), by switching models to e5m2 scaled, as suggested by another Anon yesterday.
I also installed pytorch==2.7.1 directly, because commands from oppost guide installs 2.8.0, and triton-windows==3.3.1.post19 (default installs 3.4.0, wich is not compatible with torch 2.7.x)

Another problem i faced, triton-windows simply does not want to work.
I detached WanVideoTorchCompile node and workflow runs fine, but with it i keep getting some strange errors like pic, sometimes Comfy process just dies without any errors. Have no idea, how to fix it
Anonymous No.106311549
Anonymous No.106311663 >>106311706
Boi... qwen edit has been very underwhelming.
Anonymous No.106311706 >>106311713
>>106311663
I mean, it is better than Kontext from what people have showed but I have no idea what you people were thinking in terms of expectations, it's kinda hard to train a model to do this from most editing models we've seen. Also, I have no clue if quanting has an effect on how it does since you all most likely don't have RTX Pro 6000 Blackwell GPUs.
Anonymous No.106311713
>>106311706
>it is better than Kontext
It doesn't make dwarves. But it also doesn't seem to want to do anything.
Anonymous No.106311765
>>106311536
make sure you installed the latest cuda from nvidia site. use torch-2.7.1+cu218 and triton widows 3.3.1 post 19