/ldg/ - Local Diffusion General - /g/ (#106185803) [Archived: 41 hours ago]

Anonymous
8/8/2025, 5:00:38 AM No.106185803
highlights_g_106180771_1754621276_thumb.jpg
highlights_g_106180771_1754621276_thumb.jpg
md5: a89e6bbe3a80626a7f81712bfc2dc3e6🔍
Qwenimageeditmodelbros Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106180771

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Replies: >>106186105 >>106188173
Anonymous
8/8/2025, 5:08:33 AM No.106185861
Trying to download the model files for qwen image so I can test LoRA training. Why is hugginface cli so shit?
Replies: >>106185903 >>106186057
Anonymous
8/8/2025, 5:11:37 AM No.106185903
>>106185861
>hugginface cli
i see we have our best on the job
Replies: >>106185911
Anonymous
8/8/2025, 5:12:27 AM No.106185911
>>106185903
You get what you pay for.
Anonymous
8/8/2025, 5:18:05 AM No.106185951
How do I get into this as a total newfag? Do I need a PC that costs thousands?
Replies: >>106185985 >>106186009 >>106186134 >>106187478 >>106189325
Anonymous
8/8/2025, 5:18:58 AM No.106185956
>>106184597
shut the fuck up retard
Anonymous
8/8/2025, 5:23:06 AM No.106185985
>>106185951
Depends on how coherent you want your waifu to look
Anonymous
8/8/2025, 5:26:05 AM No.106186009
>>106185951
If you even have to ask about this, just stick to the web saas/API models. I am serious
Anonymous
8/8/2025, 5:33:45 AM No.106186057
>>106185861
>git pull huggingface_repo_url
Anonymous
8/8/2025, 5:34:43 AM No.106186064
1741659305672780_thumb.jpg
1741659305672780_thumb.jpg
md5: b916cd2cfb3007d124bb8a885f534a43🔍
the man smiles and holds a cardboard cutout of an anime style Miku Hatsune, standing in the snow.

based kijai making the 2.2 i2v lora work normally (1 strength for both)
Replies: >>106186113 >>106186691
Anonymous
8/8/2025, 5:36:10 AM No.106186074
>>106181046
>is there even any point of the resize image v2 node? why don't i just plug the image straight in to the WanVideo ImageToVideo Encode? I can just set the dimensions there.
kijai's sampler will sperg out if it doesn't like the dimensions. i dunno
Anonymous
8/8/2025, 5:42:56 AM No.106186105
>>106185803 (OP)
Can someone share the json from the wan rentry?
/ldg/ Comfy T2V 480p FAST workflow (by bullerwins): ldg_2_2_t2v_14b_480p.json (updated 2nd August 2025)
https://files.catbox.moe/ygfoxx.json

I get a white page.
Replies: >>106186222
Anonymous
8/8/2025, 5:44:35 AM No.106186113
>>106186064
i'll also try i2v. wan2.2 t2v is painful for now
Anonymous
8/8/2025, 5:44:52 AM No.106186117
1748090300784021_thumb.jpg
1748090300784021_thumb.jpg
md5: 619bf4f8abd0d14ce169633a69b41d4e🔍
the man looks up at the sky and sees a giant cardboard cutout of an anime style Miku Hatsune, standing in the snow.

kino
Replies: >>106186123 >>106186166
Anonymous
8/8/2025, 5:46:05 AM No.106186123
>>106186117
>not a hologram
are you even trying?
Replies: >>106186140
Anonymous
8/8/2025, 5:49:00 AM No.106186134
LittleDirtyGhosts
LittleDirtyGhosts
md5: 39f800a081a040585aa354c2c22b5042🔍
>>106185951
idle here for a week and then decide if this is who you want to become.
Anonymous
8/8/2025, 5:49:57 AM No.106186138
1727729697345330
1727729697345330
md5: 98c4f689b8492b419ab17124bbe5aec0🔍
Anonymous
8/8/2025, 5:50:17 AM No.106186140
>>106186123
patience! one idea at a time. i'll get there
Replies: >>106186147
Anonymous
8/8/2025, 5:51:19 AM No.106186147
1742307174817273_thumb.jpg
1742307174817273_thumb.jpg
md5: e5ce23f9d639f8e2f365c84982a62d98🔍
>>106186140
here

the man looks up at the sky and sees a giant hologram of an anime style Miku Hatsune, standing in the snow.

it'd be easier if it was a screenshot at night, but it works!
Replies: >>106186166
Anonymous
8/8/2025, 5:52:16 AM No.106186158
how much vram qwen does actually consume when quantized?
Anonymous
8/8/2025, 5:53:16 AM No.106186166
>>106186117
>>106186147
i2v looks so kino compared to t2v
Replies: >>106186177
Anonymous
8/8/2025, 5:54:42 AM No.106186173
1752015862715615_thumb.jpg
1752015862715615_thumb.jpg
md5: d5800828e075fa663160e298b795fe4c🔍
got a BIG miku this time.
Replies: >>106186183 >>106186926
Anonymous
8/8/2025, 5:55:43 AM No.106186177
>>106186166
i2v is fun cause you avoid most of the randomness, you're just prompting what you want to happen next. often with hilarious results.
Anonymous
8/8/2025, 5:57:15 AM No.106186183
1737396031371136_thumb.jpg
1737396031371136_thumb.jpg
md5: 62406892e82338fddb0c867bf712c43d🔍
>>106186173
this time, "gigantic hologram":
Anonymous
8/8/2025, 5:58:25 AM No.106186189
Wan22WVI2V_KJ_RAW__00325_thumb.jpg
Wan22WVI2V_KJ_RAW__00325_thumb.jpg
md5: a379eec0c01c56917b5bd71aa870ea5f🔍
Fight scenes are definitely possible with wan2.2. I think I might have unlocked the Ryona fetish.
Replies: >>106186228 >>106186235
Anonymous
8/8/2025, 5:59:49 AM No.106186198
1752891614265205
1752891614265205
md5: 013196d2535abb41ec13d2a0fb0546a1🔍
I don't think my wan kijai txt to image works as intended lol
Anonymous
8/8/2025, 6:03:40 AM No.106186222
>>106186105
https://huggingface.co/bullerwins/Wan2.2-T2V-A14B-GGUF/blob/main/wan2_2_14B_t2v_example.png
it's not set up with light though
Replies: >>106186246
Anonymous
8/8/2025, 6:04:47 AM No.106186228
>>106186189
kino
Anonymous
8/8/2025, 6:06:05 AM No.106186235
>>106186189
>the strongest woman vs the weakest man
Anonymous
8/8/2025, 6:07:23 AM No.106186246
>>106186222
Thank you anon, and that's fine that part is easy to set up, I was mainly wondering what to use instead of WanVideo Sampler kijai uses.
Anonymous
8/8/2025, 6:16:55 AM No.106186292
1748292149041592_thumb.jpg
1748292149041592_thumb.jpg
md5: 8407073104f517ea48a8ffeb28dc4db9🔍
the man looks up at the sky and sees a gigantic hologram of an anime style Miku Hatsune waving hello at the man.

there, beeg miku
Replies: >>106186363
Anonymous
8/8/2025, 6:17:22 AM No.106186302
test_thumb.jpg
test_thumb.jpg
md5: de43a0bdaf6185cb57ee76d2cbb0af09🔍
Anonymous
8/8/2025, 6:22:31 AM No.106186335
poorfag
poorfag
md5: 43dd2ac54a7a21fb9edbdd344ea9331e🔍
why are you not making money with your shit, /ldg/?
Replies: >>106186358 >>106186396 >>106186399 >>106186409 >>106186792 >>106188520
Anonymous
8/8/2025, 6:26:56 AM No.106186358
>>106186335
Who is even paying to look at slop?
Anonymous
8/8/2025, 6:27:57 AM No.106186363
>>106186292
I don't think wan knows what a hologram is
Anonymous
8/8/2025, 6:28:53 AM No.106186367
1724621637808617_thumb.jpg
1724621637808617_thumb.jpg
md5: aa7851dbab85d8349e76d6aee4a8d608🔍
different image:
Replies: >>106186375
Anonymous
8/8/2025, 6:29:54 AM No.106186375
>>106186367
prompt (slightly diff): the man looks up at the sky and sees a gigantic hologram of an anime style Miku Hatsune singing with a microphone.
Anonymous
8/8/2025, 6:34:01 AM No.106186396
>>106186335
I have a good paying job, I want to generate stuff for me.
Anonymous
8/8/2025, 6:34:43 AM No.106186399
>>106186335
for 1 successful, 100 with like this at 5$ per month total
Replies: >>106188520
Anonymous
8/8/2025, 6:34:53 AM No.106186401
AnimateDiff_00129_thumb.jpg
AnimateDiff_00129_thumb.jpg
md5: 7b1445ef56b0f570576d92ab32a3007c🔍
Anonymous
8/8/2025, 6:35:30 AM No.106186404
For anons having a 5090, I'd like to change the nvidia-smi power target to something lower.
What is the sweet spot for inference? 350W?
Replies: >>106186448
Anonymous
8/8/2025, 6:36:45 AM No.106186409
>>106186335
I'd have to change too much of myself to be successful with that sort of thing
Anonymous
8/8/2025, 6:37:53 AM No.106186414
file
file
md5: f52b24186afff9fa67150313f79a4491🔍
Is this snakeoil any good? It's been slapped on every other wan2.1 workflow I came across but not on 2.2
Replies: >>106186472 >>106186481 >>106187020
Anonymous
8/8/2025, 6:41:21 AM No.106186431
jonah-hill-cut-it-out
jonah-hill-cut-it-out
md5: 4e6dac25463929e6f828f0766783adcc🔍
>>106185333
>bottom line is qwen image is our deepseek

on the right track but i don't know about calling it the deepseek of image gen. doesn't sound quite right.
Anonymous
8/8/2025, 6:44:53 AM No.106186448
>>106186404
>350W is power limiting territory for a 5090
>mfw I'm running my 3090 at 210W
Is electricity free where you live or something?
Replies: >>106186458
Anonymous
8/8/2025, 6:47:21 AM No.106186458
>>106186448
My 3090 is power limited at 260W, it's a good compromise. 210 sounds a bit low.
The 5090 I have no idea what to set yet but its tdp is 575W.
Anonymous
8/8/2025, 6:49:14 AM No.106186472
>>106186414
i don't think anyone, even the creator of it, knows what it does
Replies: >>106186527
Anonymous
8/8/2025, 6:49:59 AM No.106186477
chroma_00723_
chroma_00723_
md5: 13135e6f52dbeffc163c5f5563ad206e🔍
>>106185524
a chroma centaur. note this is an old gen, this is chroma 33 so a newer one would likely be a bit better. but clearly it's superior to qwen's abortive attempt
Anonymous
8/8/2025, 6:51:18 AM No.106186481
>>106186414
you can probably rename it to cargo cult node
Anonymous
8/8/2025, 6:58:21 AM No.106186515
Untitled
Untitled
md5: 031418db8146414f3940c3a5a1523af2🔍
Okay, got qwen to begin training at fp16 across X2 rtx 3090s.

It was a bit of headache to setup. Mostly because the diffusion pipe repo casually forgot to mention I would need to upgrade transformers as well. I'm not sure how I was the only one experiencing that issue.
Replies: >>106186523
Anonymous
8/8/2025, 6:59:22 AM No.106186523
>>106186515
1024x1024 btw
Replies: >>106186538
Anonymous
8/8/2025, 7:00:33 AM No.106186527
>>106186472
It does some weird shit where it averages out some numbers on each step or something, idk really. It's extremely subtle. It's not fake and gay, but it may as well be due to how little it impacts the output
Anonymous
8/8/2025, 7:02:22 AM No.106186538
Untitled
Untitled
md5: d1f2d4c71eb1175039d982009687ced9🔍
>>106186523
Getting about this much vram usage at 1024 at a batch size of one and rank of 32.

Seems perfectly trainable to me desu.
Anonymous
8/8/2025, 7:05:16 AM No.106186553
1752216146107904_thumb.jpg
1752216146107904_thumb.jpg
md5: 8a27afcb175af6f785fbe08e3026238c🔍
and this is why i2v is fun, silly shit

the man hits a baseball with a baseball bat.
Replies: >>106186581
Anonymous
8/8/2025, 7:09:55 AM No.106186581
1729486389581260_thumb.jpg
1729486389581260_thumb.jpg
md5: bb092ad3a17209767cb99c9bbbdf95cf🔍
>>106186553
bowling:
Replies: >>106186594
Anonymous
8/8/2025, 7:11:13 AM No.106186594
>>106186581
Is this with the new light LoRAs? I've noticed my I2V outputs desperately resist the subjecte turning around in them. Might just be my imagination though.
Replies: >>106186625
Anonymous
8/8/2025, 7:15:06 AM No.106186625
>>106186594
2.2 i2v, just testing random stuff atm.
Replies: >>106186633
Anonymous
8/8/2025, 7:16:07 AM No.106186633
>>106186625
*it's the lightning lora but the one kijai posted, 1 str for both
Replies: >>106186670
Anonymous
8/8/2025, 7:20:49 AM No.106186670
AnimateDiff_00346_thumb.jpg
AnimateDiff_00346_thumb.jpg
md5: 48e3222b30ade1025344a855bb80ac29🔍
>>106186633
Yeah just something I noticed when I didn't before since updating, they subject doesn't like to change directions. Then again, I don't have many examples.
Replies: >>106186736 >>106187052
Anonymous
8/8/2025, 7:23:54 AM No.106186691
>>106186064
is that the repack?
Replies: >>106186711
Anonymous
8/8/2025, 7:26:37 AM No.106186711
>>106186691
just kijai workflow with the 2.2 i2v loras

https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo2_2_I2V_A14B_example_WIP.json

https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Wan22-Lightning
Replies: >>106186725 >>106188845
Anonymous
8/8/2025, 7:29:40 AM No.106186724
agi-any-moment
agi-any-moment
md5: 7a4c636396a41ba3a741cdf6e9f2707e🔍
>agi here
Anonymous
8/8/2025, 7:29:50 AM No.106186725
>>106186711
oh those, they're fast but censored :/
Anonymous
8/8/2025, 7:32:06 AM No.106186736
>>106186670
ay yo i din kno cap picard was slick wid it like that!
Anonymous
8/8/2025, 7:33:32 AM No.106186744
genning i2v at 720p, 121 frames. tried kijai's 2.2 loras. switched back to his original workflow with "lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16.safetensors" at 3.0 on high and 1.0 on low. i find the 2.1 loras have superior prompt adherence, faster motion and resist looping better. i don't know if the 2.2 loras are meant for 480p but the visual quality is about the same. wtf is going on with this shit
Anonymous
8/8/2025, 7:41:22 AM No.106186792
>>106186335
I started a few months back, got a sub, then life happened and I had to stop. Then I lost that sub recently. Welp. No idea how these guys got hundreds to thousands of subs though. Insane. I put in a lot of effort and it was all high quality stuff.
Replies: >>106186850
Anonymous
8/8/2025, 7:55:23 AM No.106186850
>>106186792


I made a porn game that brought in a few K every month. I couldn't maintain it after I got a new job though, and I was creatively drained. I was no longer enjoying the fetish I was facilitating.
Replies: >>106186881
Anonymous
8/8/2025, 7:56:22 AM No.106186857
Alright, I've been having a good bit of fun with 2.1 (mostly getting my NSFW images to move). How does 2.2 compare? How is it with weirder sizes? I know with 2.1 if you didn't stick exactly at 832x480, things could go pretty soft in terms of detail. And for some reason fluids exclusively genned as torrents and/or giant globs. Like say you prompt saliva, that bitch was ejecting giant globs of spit out of her mouth. I kinda figured that was down to some weird shenanigans with resolution and denoising, where it can't see the small details in the image and kinda resolved it with larger ones, but I'm not sure.
But yeah, is it a bit more flexible with resolutions and framerates?
Base 2.1 tended to be pretty shit where it'd do the slow motion stuff or loop back too.

I'm guessing requirements are mostly the same too? It's just an incremental update, so I figure it's more just optimizing and improving what's there.
Replies: >>106186867
Anonymous
8/8/2025, 7:57:13 AM No.106186866
I thought there was a comfyui node that let you run a python script but I can't find it for the life of me
Anonymous
8/8/2025, 7:57:18 AM No.106186867
>>106186857
I can't believe you haven't moved to 2.2 already. What the fuck.
Replies: >>106186874
Anonymous
8/8/2025, 7:58:20 AM No.106186874
>>106186867
I was on vacation, then I had to work ;_;
Haven't had time to pick back up until now.
Replies: >>106186884
Anonymous
8/8/2025, 7:59:41 AM No.106186881
>>106186850
story of every indie porn game developet
Replies: >>106186892
Anonymous
8/8/2025, 7:59:48 AM No.106186883
I had an idea on how to make like a video with say, only 5 frames. This would save a fuckload of gen time.
Obviously setting Length to 5 doesn't work, it has to be set to 81 for the structure to be established. But what if you establish the structure by generating for 1 step, then remove all the other latent frames, then continue generation? Would it work?

There's nodes like "TrimVideoLatent" and "Frames Slice Latent" which allow you to remove latent frames, we would just need a node that only keeps every 16th frame (when using 81 length, to make it 5 frames).
Replies: >>106186983
Anonymous
8/8/2025, 8:00:02 AM No.106186884
>>106186874
Go, go download 2.2. It's basically better in every conceivable way.
Replies: >>106187055
Anonymous
8/8/2025, 8:01:42 AM No.106186892
>>106186881
It was just too much man, and the DMs asking me to add in stuff I found revolting. The insane impossible requests that would be entire games in and of them selves. The fear of people being unhappy with the update.

Yeah. I might pick it up again one day, but it will be a new project that I actually want to work on. Passion is something intangible that users can feel.
Anonymous
8/8/2025, 8:02:41 AM No.106186896
DAE not think Qwen is actually very good at all? It looks like absolute shit compared to Flux Krea for everything I want to gen personally
Replies: >>106186924 >>106186928
Anonymous
8/8/2025, 8:06:43 AM No.106186922
1727110285316131_thumb.jpg
1727110285316131_thumb.jpg
md5: 3687d96be9f52d534e120031cb025f81🔍
>music stops:
Anonymous
8/8/2025, 8:06:54 AM No.106186924
genning-time
genning-time
md5: 628c7eb4ad92cccccb7083c71e5e2a9c🔍
>>106186896
haven't tried Krea, but I'm having a blast genning heaps of shit with qwen image, especially character reference frames to feed into hunyuan3d2.1
>a 2 panel comic portraiting Hatsune Miku, and John Wick shopping. the left panel shows John Wick holding a GPU saying "VRAM..." with Hatsune Miku looking excited. the right panel shows John Wick and Hatsune Miku walking away together saying "it's GENNING time". Ultra HD, 4K, comic, anime.

qwen image + wan2.2 back to back is hours of fun
Anonymous
8/8/2025, 8:07:53 AM No.106186926
vid_fn_00042_thumb.jpg
vid_fn_00042_thumb.jpg
md5: 9a9cecc83697dfeeb695f34f51e057d7🔍
>>106186173
Replies: >>106186929 >>106186957 >>106189291
Anonymous
8/8/2025, 8:08:19 AM No.106186928
>>106186896
I think the outputs all look "solid". Like they are very clean. Especially for anime stuff. If you do an honest comparison to default flux. It trounces it pretty handily. It's also not distilled right off the bat which makes it a much more attractive proposition (vramlets don't @ me)
Replies: >>106186993
Anonymous
8/8/2025, 8:08:30 AM No.106186929
>>106186926
kek, what did you prompt
Replies: >>106186939 >>106186967
Anonymous
8/8/2025, 8:09:39 AM No.106186939
>>106186929
Hehe yeah what did you prompt? lol. I wanna know hehe. It would be really funny if a giant naked lady just picked up a tiny little man right? heh. You know, just a fun thing?
Replies: >>106186960
Anonymous
8/8/2025, 8:12:01 AM No.106186957
>>106186926
miku noooooo
Anonymous
8/8/2025, 8:12:36 AM No.106186960
>>106186939
oh maybe prompting it for attack on titan might work?
Replies: >>106186972
Anonymous
8/8/2025, 8:13:17 AM No.106186967
>>106186929
i sent a basic prompt about him being grabbed by a giant miku through grok and got this:

In the haunting, snow-laden climax of Blade Runner 2049, K, the weary replicant portrayed by Ryan Gosling, sits slumped on the icy steps outside the Wallace Corporation, his bloodied face and tattered coat bathed in the soft glow of falling snow. As he gazes upward into the swirling, pale sky, a colossal 2D anime hologram of Hatsune Miku materializes, her vibrant teal twin-tails cascading like neon waterfalls, dominating the desolate urban horizon. Towering over K, her luminous figure radiates an ethereal warmth against the cold, dystopian backdrop. The camera slowly pulls back, revealing the staggering scale of Miku’s hologram as she fixes her playful, glowing eyes on him. With a single, fluid motion, her enormous hand descends, effortlessly scooping K from the steps like a fragile doll. She lifts him skyward, his body suspended weightlessly against the stormy expanse, snowflakes swirling around him as her vibrant presence contrasts with his quiet resolve, the city fading below in a breathtaking ascent.
Replies: >>106186982 >>106187104
Anonymous
8/8/2025, 8:13:46 AM No.106186972
AnimateDiff_00350_thumb.jpg
AnimateDiff_00350_thumb.jpg
md5: 40f8d22d242020c9cd405e876a6442aa🔍
>>106186960
Nah doesn't work for Wan. I actually tried something similar early today.
Anonymous
8/8/2025, 8:15:35 AM No.106186982
>>106186967
>teal
Isn't she more of a turquoise?
Anonymous
8/8/2025, 8:15:38 AM No.106186983
>>106186883
ah it seems it's the node called "Select Every Nth Latent" that allows you to do this.
Anonymous
8/8/2025, 8:15:39 AM No.106186985
Anyone know how to get previews working for the Phr00t AiO workflow in the Rentry?
Replies: >>106187008
Anonymous
8/8/2025, 8:16:40 AM No.106186993
>>106186928
default Flux sure but Krea takes a shit on anything that isn't WAN for photographic gens, nothing else is remotely close to as detailed in that regard
Replies: >>106187021 >>106187042
Anonymous
8/8/2025, 8:19:24 AM No.106187008
>>106186985
Nevermind, I'm a blind retard.
Just had to pan down.
Anonymous
8/8/2025, 8:21:12 AM No.106187020
>>106186414
All the snakeoils were more useful on 2.1. And 2.2 fixed of of the issue the old model had so these don't do as much as they used to.
Anonymous
8/8/2025, 8:21:17 AM No.106187021
>>106186993
True, but Krea seems very much in that niche of realism. Qwen feels like a blank canvas. Think back to SDXL and its release and how god awful it was and it still turned out great. I don't know if Qwen can achieve that due to its size, but the potential to be truly great is there.
Replies: >>106187119
Anonymous
8/8/2025, 8:26:32 AM No.106187042
>>106186993
Nah, Chroma is still the king of photorealism. It's not even close. There are hundreds of different things you can do with Chroma, you can't do with Krea, that's due to its uncensored nature. The Krea pretty images, you can get out of Chroma with good prompt engineering.
Replies: >>106187067 >>106187130
Anonymous
8/8/2025, 8:28:18 AM No.106187052
kijai_wan22_light_00010_thumb.jpg
kijai_wan22_light_00010_thumb.jpg
md5: ef7d4d0ca090a01b27eefffb27004291🔍
>>106186670
it works no problem with 2.1 light lora
Replies: >>106187075
Anonymous
8/8/2025, 8:28:46 AM No.106187055
Wan2.2_thumb.jpg
Wan2.2_thumb.jpg
md5: 7a8d1db6b7f5cbde42ae31c99f349006🔍
>>106186884
You weren't kidding. This shit is way better. And it takes like half the time (110s for 2.2 on a first gen, compared to 200s for 2.1) .
Only problem is that things are a little grainy. I grabbed the Phr00t workflow from the rentry as a bit of a "quick start" to try shit out, so maybe that has something to do with it. Or maybe it's just more sensitive to resolution than before?
Replies: >>106187066
Anonymous
8/8/2025, 8:31:40 AM No.106187066
kijai_wan22_light_00008_thumb.jpg
kijai_wan22_light_00008_thumb.jpg
md5: 595de2b80d9cf2171f04497e90b956e4🔍
2.1 light, 161 frames. not great but look how coherent
>>106187055
the AIO is ass
Replies: >>106187074
Anonymous
8/8/2025, 8:31:58 AM No.106187067
>>106187042
One simple and often overlooked example. Soles. Qwen can do them, but they are blurry and slopped. Know what else is blurry and slopped? Cloudshit models. From Reve, to Gemini, to Imagen, to GPT 4o, all slopped. Chroma is unique in that it's the only unslopped model that can do soles in any situation.
Anonymous
8/8/2025, 8:33:05 AM No.106187073
chroma is king of random noise and garbage
Anonymous
8/8/2025, 8:33:16 AM No.106187074
>>106187066
>the AIO is ass
Sheeeeeeit.
The fuck do I do then? The rentry is kind of a weird mix of 2.1/2.2 info. 2.2 isn't just a drop in replacement, is it?
Replies: >>106187097
Anonymous
8/8/2025, 8:33:17 AM No.106187075
>>106187052
I saw a r*ddit post that said they got amazing movement combining the 2.1 at strength 3 and 2.2 at 1 on the high model

and 2.2 at 1 on the low model with 2.1 at 0.25.

Both with a cfg of 2.5
Replies: >>106187082 >>106187150 >>106187360 >>106188021 >>106188074
Anonymous
8/8/2025, 8:34:17 AM No.106187082
>>106187075
The light LoRA that is.
Anonymous
8/8/2025, 8:34:33 AM No.106187083
the professional style foot pics were like a breath of fresh air
Replies: >>106187101
Anonymous
8/8/2025, 8:37:44 AM No.106187097
>>106187074
use kijai's workflow
https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo2_2_I2V_A14B_example_WIP.json
Anonymous
8/8/2025, 8:38:04 AM No.106187101
>>106187083
If I wanted pro style foot pics I would just prompt for them. They'd still come out better than distilled Seedream 3.0 slop. That's the freedom that models like Chroma give you. But you are an idiot.
Anonymous
8/8/2025, 8:38:32 AM No.106187104
>>106186967
neat, I might try that with lm studio too but prob have to unload the model (for vram), grok/etc are probably ideal options for generating detailed stuff.
Replies: >>106187120
Anonymous
8/8/2025, 8:41:19 AM No.106187116
5 epochs in on Qwen training. I'm wondering how it will handle respecting the captions. Flux had an awful tendency to bleed characters and everyone called me a schizo when I called it out.
Anonymous
8/8/2025, 8:41:47 AM No.106187119
>>106187021
HiDream was MIT licensed and the Full version wasn't distilled, and AFAIK would be much easier to train than Qwen in terms of resource needs.
Replies: >>106187128 >>106187155
Anonymous
8/8/2025, 8:41:55 AM No.106187120
1735700913658487_thumb.jpg
1735700913658487_thumb.jpg
md5: a3b9ea68357a2be2a337bc135085a9e5🔍
>>106187104
Generate an image of a man standing in the center of a large, empty room with high ceilings, dressed in casual attire - jeans and a plain white t-shirt, with a look of wonder and awe on his face as he gazes upwards at a giant holographic image of Hatsune Miku suspended above him by an invisible field of energy or magic. The holographic Miku should be at least 10 times larger than the man, with intricate details visible even from this distance, her facial expression one of calm serenity and gentle benevolence as she looks down at the man, her eyes cast directly at him as if seeing right through to his soul. A halo of soft, pulsing light surrounds Miku's image, diffused and scattered throughout the room, creating an otherworldly ambiance that makes the viewer feel like they're witnessing something truly magical, with subtle hints of digital code or programming languages floating in the background.

neat
Replies: >>106187126
Anonymous
8/8/2025, 8:43:09 AM No.106187126
>>106187120
and the prompt in lm studio was: make a detailed stable diffusion prompt for a man looking up at a giant holographic Miku Hatsune in one paragraph.

will prob use grok so I dont have to unload the model, it uses like 5-6gb.
Replies: >>106187198
Anonymous
8/8/2025, 8:43:22 AM No.106187128
>>106187119
Nobody really gave a shit about HiDream. Myself included. I don't think any of the outputs really wowed anybody.
Replies: >>106187136
Anonymous
8/8/2025, 8:43:59 AM No.106187130
>>106187042
No you can't lmao, Chroma straight up does not have that level of fidelity in it at the moment due to being trained at 512px up until now. It also has half the context length of both regular Dev and Krea because Schnell did (256 tokens versus 512 token).
Replies: >>106187215
Anonymous
8/8/2025, 8:45:58 AM No.106187136
>>106187128
I haven't really seen a single "wow" Qwen output either quite frankly. Just a lot of people using extremely easy prompts as supposed evidence of the great prompt adherence.
Replies: >>106187142 >>106187189
Anonymous
8/8/2025, 8:47:12 AM No.106187142
>>106187136
There were some that came out during the release of Flux to test how far T5 could go, someone had a long elaborate prompt for a black woman. Too lazy to look it up.
Replies: >>106187169
Anonymous
8/8/2025, 8:47:50 AM No.106187150
>>106187075
lmaoing at the jeets there that can't figure out how to connect a lora node
Replies: >>106187172
Anonymous
8/8/2025, 8:48:29 AM No.106187155
>>106187119
>4 (four) text encoders
Anonymous
8/8/2025, 8:49:40 AM No.106187162
What wan 2.1/2.2 i2v model that gives me reasonable speed with a 4070s and 32gb of ram?
I'm using wangp, wan2.1 Image2video 480p 14B, and profile 4 (12gb vram and 32gb ram), it takes 7 minutes to make 5 seconds
Replies: >>106187192
Anonymous
8/8/2025, 8:50:24 AM No.106187169
1724017559405
1724017559405
md5: 1d4e72fd8d40b10c54ca68dc04628c1b🔍
>>106187142
I found one lazily search back in the archives from last year.
This is a digitally drawn anime-style image featuring Hatsune Miku. She is seated at a wooden desk in a modern office setting. On the desk is part of a half-eaten hot dog and crumbs, the hot dog has a missing part that was bitten off and it's incomplete. She has a serious expression as she extends her right hand to shake hands with a person off-screen to the left. Likely an office colleague. Indicating a break or snack time. The desk is cluttered with various office supplies, including a pencil cup filled with colored pens and markers, a calculator, and a notebook. A green potted plant is visible on the left side of the desk, adding a touch of nature to the otherwise busy workspace. The background features a large window with multiple panes, allowing sunlight to stream in and illuminate the room. Outside the window, lush green trees are visible, suggesting an office with a view of nature. The walls are adorned with bookshelves filled with neatly organized binders and books.[\code]
This is what Flux cooked up.
Replies: >>106187175 >>106187180
Anonymous
8/8/2025, 8:50:58 AM No.106187172
>>106187150
Don't lmao too hard. I saw a guy here asking how to connect two LoRA nodes together over the space of 24 hours.
Anonymous
8/8/2025, 8:51:25 AM No.106187175
>>106187169
Fuck me. I'll just quote it.
>This is a digitally drawn anime-style image featuring Hatsune Miku. She is seated at a wooden desk in a modern office setting. On the desk is part of a half-eaten hot dog and crumbs, the hot dog has a missing part that was bitten off and it's incomplete. She has a serious expression as she extends her right hand to shake hands with a person off-screen to the left. Likely an office colleague. Indicating a break or snack time. The desk is cluttered with various office supplies, including a pencil cup filled with colored pens and markers, a calculator, and a notebook. A green potted plant is visible on the left side of the desk, adding a touch of nature to the otherwise busy workspace. The background features a large window with multiple panes, allowing sunlight to stream in and illuminate the room. Outside the window, lush green trees are visible, suggesting an office with a view of nature. The walls are adorned with bookshelves filled with neatly organized binders and books.
Replies: >>106187191
Anonymous
8/8/2025, 8:52:01 AM No.106187180
>>106187169
Someone plug this into Qwen, curious to see how it handles it and my GPUs are all occupied right now.
Anonymous
8/8/2025, 8:53:21 AM No.106187189
dipsy+miku
dipsy+miku
md5: 0dc4d05ae887ca8f699f5aaf4761f8cf🔍
>>106187136
got an idea for a prompt you'd like to try? I can run it if you want
Replies: >>106187191
Anonymous
8/8/2025, 8:53:45 AM No.106187191
>>106187189
Please do >>106187175
Replies: >>106187199 >>106187251
Anonymous
8/8/2025, 8:53:58 AM No.106187192
>>106187162
(10 steps)
Anonymous
8/8/2025, 8:55:25 AM No.106187198
1747201031626252_thumb.jpg
1747201031626252_thumb.jpg
md5: 62c7cbf24aa997e9a795a66b544bd4cf🔍
>>106187126
this time grok:

A gritty cyberpunk metropolis at night, rain-slicked streets glowing with neon reflections, a lone man in a worn trench coat staring upward in awe, a colossal holographic Miku Hatsune dominating the skyline, her vibrant teal twin-tails shimmering with intricate digital patterns, her form translucent yet luminous, surrounded by floating data streams, towering dystopian skyscrapers and flickering holographic billboards in the background, bathed in moody cyan and magenta neon hues, cinematic lighting, ultra-detailed, in the high-tech, noir aesthetic of Blade Runner 2077, immersive, futuristic atmosphere.

trippy
Replies: >>106187253
Anonymous
8/8/2025, 8:55:26 AM No.106187199
>>106187191
running now, will make 2 versions (wide and square)
Anonymous
8/8/2025, 8:57:19 AM No.106187212
Best settings for character training in wan 2.2 for high and low?
Is rank 64 or higher bucket size worth it?
Replies: >>106187220
Anonymous
8/8/2025, 8:58:01 AM No.106187215
>>106187130
Yet only with Chroma can you do proper feet, creepshots, gore, nudity, sex, bondage, yoga, contortions, etc... the list goes and on anon. And also Chroma follows the prompt better than Flux dev/Krea for these reasons.
Replies: >>106187235
Anonymous
8/8/2025, 8:58:54 AM No.106187218
AnimateDiff_00140_thumb.jpg
AnimateDiff_00140_thumb.jpg
md5: 7fec0ed8779561e82dc9cad1064237ed🔍
Replies: >>106187225
Anonymous
8/8/2025, 8:59:26 AM No.106187220
>>106187212
I did some rudimentary experiments with 2.2 at rank 64. If you're training low, you can plug the LoRA for the character into the low node and the output will look like that character while the motion remains in tact. I just did 1024x1024 images only as a test. Video I also tried but I don't have the will to really sus it out yet.
Anonymous
8/8/2025, 9:00:27 AM No.106187225
>>106187218
Looks a bit noisy are you putting LoRAs you shouldn't in the low noise output or at too high a strength.
Anonymous
8/8/2025, 9:02:06 AM No.106187235
>>106187215
got an example prompt of creepshots? do you mean images that are peeping or cctv? I've tried running cctv prompts in qwen image and it's OK but I think I'm a promptlet at directing how and where the camera is (can't get top down camera sitting in the corner of a room shot)
Replies: >>106187264
Anonymous
8/8/2025, 9:02:34 AM No.106187237
Does anyone know which ones from the following params ComfyUI uses by default?
[-h] [--listen [IP]] [--port PORT] [--tls-keyfile TLS_KEYFILE] [--tls-certfile TLS_CERTFILE] [--enable-cors-header [ORIGIN]]
[--max-upload-size MAX_UPLOAD_SIZE] [--base-directory BASE_DIRECTORY] [--extra-model-paths-config PATH [PATH ...]] [--output-directory OUTPUT_DIRECTORY]
[--temp-directory TEMP_DIRECTORY] [--input-directory INPUT_DIRECTORY] [--auto-launch] [--disable-auto-launch] [--cuda-device DEVICE_ID]
[--cuda-malloc | --disable-cuda-malloc] [--force-fp32 | --force-fp16]
[--fp32-unet | --fp64-unet | --bf16-unet | --fp16-unet | --fp8_e4m3fn-unet | --fp8_e5m2-unet | --fp8_e8m0fnu-unet] [--fp16-vae | --fp32-vae | --bf16-vae]
[--cpu-vae] [--fp8_e4m3fn-text-enc | --fp8_e5m2-text-enc | --fp16-text-enc | --fp32-text-enc | --bf16-text-enc] [--force-channels-last]
[--directml [DIRECTML_DEVICE]] [--oneapi-device-selector SELECTOR_STRING] [--disable-ipex-optimize] [--supports-fp8-compute]
[--preview-method [none,auto,latent2rgb,taesd]] [--preview-size PREVIEW_SIZE] [--cache-classic | --cache-lru CACHE_LRU | --cache-none]
[--use-split-cross-attention | --use-quad-cross-attention | --use-pytorch-cross-attention | --use-sage-attention | --use-flash-attention]
[--disable-xformers] [--force-upcast-attention | --dont-upcast-attention] [--gpu-only | --highvram | --normalvram | --lowvram | --novram | --cpu]
[--reserve-vram RESERVE_VRAM] [--async-offload] [--default-hashing-function {md5,sha1,sha256,sha512}] [--disable-smart-memory] [--deterministic]
[--fast [FAST ...]] [--mmap-torch-files] [--dont-print-server] [--quick-test-for-ci] [--windows-standalone-build] [--disable-metadata]

Had to cut some out due to character limit. I know that vae is run in bf16 by default for example, I am asking like that.
Replies: >>106187312 >>106187369
Anonymous
8/8/2025, 9:05:48 AM No.106187251
prompt-test-office-miku-wide
prompt-test-office-miku-wide
md5: 13a04a4409d9da6ee9f5dec50feee486🔍
>>106187191
Made 2 gens, one gen gets the office items positioned better but messes up the handshake and this one gets the items and handshake just in the wrong position. Neither had a bite out of the hotdog.
cfg 4.5, steps 50
Replies: >>106187265 >>106187286 >>106187324
Anonymous
8/8/2025, 9:06:13 AM No.106187253
1732025051615385_thumb.jpg
1732025051615385_thumb.jpg
md5: 6d5d024178e01f9c100181c7b3ce1b8e🔍
>>106187198
Anonymous
8/8/2025, 9:07:33 AM No.106187264
>>106187235
Any kind of image anon.

>Amateur photograph, a Japanese woman dressed as a maid, sleeping on the Tokyo Metro, her panties are slightly visible

That is one example of the kind of stuff Chroma gets right. You could do cctv, walking up a flight of stairs, peeping, etc... any kind of creepshot that has a natural description, you can do, (though Chroma just like other models benefits from a good prompt, you can enhance with VLMs)
Replies: >>106187383 >>106187503 >>106187924
Anonymous
8/8/2025, 9:07:36 AM No.106187265
>>106187251
Now you gotta do the Chinese version
Replies: >>106187383 >>106187960
Anonymous
8/8/2025, 9:10:49 AM No.106187286
prompt-test-office-miku-square
prompt-test-office-miku-square
md5: 2ab0d252f40039c517dd9689ffbe7d2c🔍
>>106187251
square aspect
Replies: >>106187324
Anonymous
8/8/2025, 9:12:59 AM No.106187312
>>106187237
Check the code
Replies: >>106187369
Anonymous
8/8/2025, 9:17:21 AM No.106187324
>>106187251
>>106187286
I think both are certainly more coherent than FLUX, but it seems like there is still a ways to go on the prompt adherence front. It is a noticeable step up though.
Anonymous
8/8/2025, 9:18:09 AM No.106187330
So with the lightx2v 2.2 workflow in the rentry, where do I set the virtual VRAM usage type thing like it was in the 2.1 workflow (the UnetLoaderGGUFDisTorchMultiGPU node)?
Gens are still around the same time as 2.1 even without it, but I don't know if I'm fucking myself over or not.
Also 2 more questions.
How do I plug loras into this? Do I just put them inline after the lightx2v loras (assuming I use a high/low lora)?
What's the difference between the e4m3fn and e5m2 versions of the i2v models?
Replies: >>106187684 >>106187716 >>106187716
Anonymous
8/8/2025, 9:25:48 AM No.106187360
>>106187075
might be something to that, tried it and got results that looked better compared to without, but the 2.5 cfg deep fries it, keeping it at 1
Anonymous
8/8/2025, 9:27:46 AM No.106187369
>>106187237
>>106187312
Well I guess there is no lovely default params list somewhere out there isn't it?
Anyway just trial and error'd what I wanted to learn, it uses fp16 precision for text encoder by default, at least on my system.
Replies: >>106187413 >>106187649
Anonymous
8/8/2025, 9:30:47 AM No.106187383
dipsy-test-office-square
dipsy-test-office-square
md5: 862f18de41cd85f7e295ee74275274f3🔍
>>106187265
here ya go
>>106187264
I'll give it a try now
Replies: >>106187423 >>106187596
Anonymous
8/8/2025, 9:35:19 AM No.106187413
>>106187369
It's comfyui so probably not, check the code, it should all be in one place, but then again it's comfyui so probably not
Anonymous
8/8/2025, 9:36:48 AM No.106187423
1729687276986
1729687276986
md5: fbbf53848b704cae54acf60c98ca2c3d🔍
>>106187383
I have another one when you have the chance, just to see at what point it gets overloaded in the description with characters.
>This is a colorful digital drawing in an anime style, featuring four young girls playing a chess game on a pink table in a bedroom. The girls are dressed in school uniforms with white sailor collars and blue skirts. The girl on the left is Sailor Moon and has long blonde hair tied into twin ponytails, the girl in the center has pink hair styled in pigtails, the girl on the right has dark blue hair, the girl in the bottom right is Hatsune Miku, and there's a small black cat sitting on the bed on the far right. They are all sitting on the floor, focused on the game. Behind them, there is a large bed with a blue and yellow striped blanket. The room has pastel-colored walls with a window that shows a bright blue sky. The overall atmosphere is playful and cheerful, with bright colors and simple, clean lines typical of anime art.
Flux failed to gen Miku and the image is severely degraded.
Replies: >>106187440 >>106187538
Anonymous
8/8/2025, 9:39:59 AM No.106187440
>>106187423
here's a link to the thread where I was sharing some stuff.
https://desuarchive.org/g/thread/106170414/#106172707
and trying out qwen as an image ref for hunyuan 3d2.1
https://desuarchive.org/g/thread/106174863/#q106175342

I'll try the prompt you shared now in a moment, I'm testing the peeping prompt at the moment
Replies: >>106187523
Anonymous
8/8/2025, 9:40:21 AM No.106187442
1739575698751613_thumb.jpg
1739575698751613_thumb.jpg
md5: 137a4cd6249e70578c2c1e15291ac5a7🔍
A rain-soaked cyberpunk city at night, neon reflections shimmering on wet streets, Ryan Gosling as a rugged man in a sleek trench coat, pointing upward with intensity and awe, a colossal holographic Miku Hatsune dominating the skyline, dynamically dancing and singing into a glowing microphone, her teal twin-tails swirling with vibrant digital patterns, her translucent form radiating ethereal light, surrounded by pulsating data streams and musical notes, dystopian skyscrapers and flickering holographic billboards in the background, drenched in moody cyan and magenta neon hues, cinematic lighting, ultra-detailed, in the gritty, high-tech noir style of Blade Runner 2077, immersive and atmospheric.

neat, I need to llm-max prompts more often, just get the basic idea and let the model elaborate/add detail.
Replies: >>106187504
Anonymous
8/8/2025, 9:41:34 AM No.106187445
I saw a blue Prius while walking today and laughed
Anonymous
8/8/2025, 9:46:53 AM No.106187478
>>106185951

its like some of you guys outright refuse to read the stickies
Anonymous
8/8/2025, 9:47:44 AM No.106187488
Wan22WVI2V_KJ_RAW__00341_thumb.jpg
Wan22WVI2V_KJ_RAW__00341_thumb.jpg
md5: 1859be9b81d36e5615c4c48e1d10ccfc🔍
Anonymous
8/8/2025, 9:50:28 AM No.106187503
sleeping-main-tall
sleeping-main-tall
md5: c9cadf310db91e0c4e67acdd3d7b8cba🔍
>>106187264
doesn't get the hint on the panties but I think if I added "her legs are slightly spread apart" it might get it.
Replies: >>106187511 >>106187604 >>106187867
Anonymous
8/8/2025, 9:50:43 AM No.106187504
>>106187442
It's interesting, with enough patience you could remake whole movies into memes

You know someone will do this
Anonymous
8/8/2025, 9:51:55 AM No.106187511
sleeping-maid-wide
sleeping-maid-wide
md5: c47b1b80e77d37217a83845ab371a21d🔍
>>106187503
wide version. almost got it but the pillow just out of nowhere lmao
Replies: >>106187604
Anonymous
8/8/2025, 9:51:58 AM No.106187512
1736673352353387_thumb.jpg
1736673352353387_thumb.jpg
md5: b565607628246330053c0d2d602b9fb9🔍
there we go, slight change to the prompt request.

A rain-slicked cyberpunk city at night, neon lights casting vibrant reflections on wet pavement, Ryan Gosling as a rugged man in a sleek trench coat, gently holding hands with a life-sized holographic Miku Hatsune, her translucent form glowing softly as she smiles warmly, her teal twin-tails shimmering with intricate digital patterns, faint data streams swirling around her, dystopian skyscrapers and flickering holographic billboards in the background, bathed in moody cyan and magenta neon tones, cinematic lighting, ultra-detailed, in the gritty, high-tech noir style of Blade Runner 2077, intimate and atmospheric.
Replies: >>106187518
Anonymous
8/8/2025, 9:53:07 AM No.106187518
>>106187512
and all I asked grok (free) was: make a stable diffusion prompt for a man holding hands with a holographic Miku Hatsune who is smiling, in the style of Blade Runner 2077, with Ryan Gosling.
Anonymous
8/8/2025, 9:53:41 AM No.106187523
1724720617536
1724720617536
md5: 39e0d87e794f8f80c811d8105fce7274🔍
>>106187440
Thanks. Seems like it will probably make the same mistakes as what I linked.
One last prompt, forgive me.
>This image is a digitally drawn cartoon in a typical comic strip format. The scene is set in an art gallery, with a girl on the left side wearing a teal blazer and light brown pants, pointing to a framed painting on the wall. The painting, which is green with a yellow border, depicts a bowl of fruit including apples, grapes, and bananas, with a price tag of "$500" attached to the lower right corner. Another identical painting, identical in style and content, hangs on the wall to the right, priced at "$1500". In the foreground, two people are standing, observing the paintings. One person, a bald man with a blue plaid shirt and brown pants, is looking at the paintings with a confused expression. The other person, a woman with dark hair and a sleeveless dress, is standing behind the bald man, watching the scene with a neutral expression. The background features a beige wall with a few other paintings, and the gallery is lit with soft, even lighting. A humorous caption at the bottom of the image reads: "It is more expensive because it took the artist several weeks to paint it, while the other one was generated in 10 seconds on my computer."
Should test out text and formatting to the extreme.
Replies: >>106187715
Anonymous
8/8/2025, 9:56:45 AM No.106187538
sailor-girls-chess-game
sailor-girls-chess-game
md5: d3edb8f9d302e30ed39ae985e428f508🔍
>>106187423
here you go. seems to handle multiple subjects very well in a prompt. I'm satisfied with qwen-image a lot and it is a night and day improvement over flux for me
Replies: >>106187614
Anonymous
8/8/2025, 10:05:59 AM No.106187596
>>106187383
>here ya go
Either you misunderstood me or you are a funny guy.
Replies: >>106187831
Anonymous
8/8/2025, 10:06:02 AM No.106187597
1747406675846403_thumb.jpg
1747406675846403_thumb.jpg
md5: 402fd55a3a41a0e88b0faa4753e18eca🔍
A rain-soaked cyberpunk city at night, neon lights casting vibrant reflections on slick streets, Ryan Gosling as a rugged man in a sleek trench coat, standing captivated as he gazes at a massive billboard displaying a holographic Miku Hatsune, her translucent form reaching out toward him with a gentle, inviting gesture, her teal twin-tails glowing with intricate digital patterns, faint data streams swirling around her, dystopian skyscrapers and flickering holographic signs in the background, drenched in moody cyan and magenta neon hues, cinematic lighting, ultra-detailed, in the gritty, high-tech noir style of Blade Runner 2077, immersive and atmospheric.

cool
Anonymous
8/8/2025, 10:07:00 AM No.106187604
>>106187503
>>106187511
Something weird I've noticed with Qwen is panties often come with a thigh strap.
Replies: >>106187632
Anonymous
8/8/2025, 10:07:54 AM No.106187614
>>106187538
Yeah, it's definitely an improvement and got all the major elements. Still has some ways to go with the chess pieces and hands but that is minor.
Anonymous
8/8/2025, 10:11:09 AM No.106187632
>>106187604
I've noticed it does that too. But I managed to get it not do so when trying to gen magazine photoshoot photos. I'd post here but it's a blue board
Anonymous
8/8/2025, 10:15:06 AM No.106187649
ComfyUI CLIP precision
ComfyUI CLIP precision
md5: 5a5545416978ea17633bb9e8c7725711🔍
>>106187369
I know no one cares but to add on, even FP32 models are loaded in FP16 unless you manually launch with --fp32-text-enc.
Kinda weird behavior desu. It definitely affects images.
Replies: >>106187668 >>106188353
Anonymous
8/8/2025, 10:18:16 AM No.106187668
>>106187649
I'd need to see more examples to really care. Good to know though.
Replies: >>106187757 >>106187774
Anonymous
8/8/2025, 10:19:52 AM No.106187684
1739591363355992
1739591363355992
md5: c0fb8f66ec79b115ba0c3c752fcfb364🔍
>>106187330
>So with the lightx2v 2.2 workflow in the rentry, where do I set the virtual VRAM usage type thing like it was in the 2.1 workflow (the UnetLoaderGGUFDisTorchMultiGPU node)?
You don't, instead you set the number of "blocks" you offload to swap. See picrel.
There are a total of 40 blocks in wan, and swapping 20 allows for 81 frames generated in 720p on a 24GB card.
If you send the whole 40 to swap, then you can go above at the price of longer generation time.
Anonymous
8/8/2025, 10:19:57 AM No.106187687
what's the best version of the rapid all in one wan? only just now moving from 2.1 to 2.2
Replies: >>106188682
Anonymous
8/8/2025, 10:23:30 AM No.106187715
gallery-test1
gallery-test1
md5: ceb1f49d13fbed9c3e38b8227817b533🔍
>>106187523
That prompt seemed to trip it up a bit. I tried cfg of 3.5, 4.5, and 5.5 with a batch of 2 seed 42. zipped all the attempts for comparison
https://files.catbox.moe/zi1948.zip
Replies: >>106187774
Anonymous
8/8/2025, 10:23:32 AM No.106187716
>>106187330
>How do I plug loras into this? Do I just put them inline after the lightx2v loras (assuming I use a high/low lora)?
Yeah, add a WanVideo Lora Select Multi and connect it behind the lightx lora loader with prev_lora. One for each lora loader.

>>106187330
>What's the difference between the e4m3fn and e5m2 versions of the i2v models?
e5m2 -> use with 3000 cards
e4m3fn -> use with 4000/5000 cards
Replies: >>106187802
Anonymous
8/8/2025, 10:29:58 AM No.106187757
>>106187668
The image I posted may or may not have been cherrypicked but yeah, I guess this is still one area that needs work which is mixed text and subject mixed prompts. Thanks for the hard work, I really appreciated the time and energy you spent satisfying my curiosity.
Replies: >>106187774
Anonymous
8/8/2025, 10:32:56 AM No.106187770
1724658664910528_thumb.jpg
1724658664910528_thumb.jpg
md5: d02fd8eb5be8235fc7559306d1d747f1🔍
lmao

A sleek, futuristic car interior from the driver's seat perspective, Ryan Gosling gripping the steering wheel with intensity, his face lit by the soft glow of a high-tech dashboard, driving at dusk on a winding road through a lush tropical island, a massive, eerie sign reading "EPSTEIN ISLAND" in bold, neon-lit letters looming ahead, surrounded by dense jungle and turquoise ocean views, vibrant sunset casting orange and purple hues, cinematic lighting, ultra-detailed, in a suspenseful, noir-inspired style, immersive and atmospheric.

asked grok to make a prompt of him driving a car on an island with a sign.
Replies: >>106187791
Anonymous
8/8/2025, 10:34:05 AM No.106187774
>>106187757
Meant to quote >>106187715
>>106187668
I wanted to post this separately.
https://www.ai-image-journey.com/2024/12/image-difference-t5xxl-clip-l.html
Use Q6_K or higher GGUF or FP8_scaled if you absolutely need to quantize your text encoders.
Replies: >>106187831
Anonymous
8/8/2025, 10:37:06 AM No.106187791
1732942177647268_thumb.jpg
1732942177647268_thumb.jpg
md5: bc76cca74debbeb26a56381bd6393bad🔍
>>106187770
revision, blue sky with clouds.
Anonymous
8/8/2025, 10:39:07 AM No.106187802
>>106187716
Didn't know about the select multi node. I was putting a normal lora select before the lightx2v lora in the chain. Outputs were fucked beyond belief. They were sped up, and incoherent blobs.
>e4m3fn -> use with 4000/5000 cards
Good to know.

Last question. How does framerate factor into this one? I know before it output at 16fps, you'd interpolate to 32. But when Riflex was a thing, you'd do like 121 frames and output straight to 24fps. That still the same?
Replies: >>106187823
Anonymous
8/8/2025, 10:41:59 AM No.106187823
>>106187802
>Last question. How does framerate factor into this one? I know before it output at 16fps, you'd interpolate to 32. But when Riflex was a thing, you'd do like 121 frames and output straight to 24fps. That still the same?
No idea for the interpolation part as I'm using 16-60 fps on videos I like on topaz instead of adding it to the wf. I don't think interpolation was added in the rentry wf but I modified mine so I'm not sure.
Anonymous
8/8/2025, 10:42:55 AM No.106187831
dipsy-office-once-more
dipsy-office-once-more
md5: 5926b16ddf5c6ab9fbe5f2c06addf090🔍
>>106187774
no worries mate
>>106187596
I tried mate, but I just can't figure out how to get bite marks. If you want, share a prompt and I'll see if it makes it china enough for ya
Replies: >>106187848
Anonymous
8/8/2025, 10:47:21 AM No.106187848
>>106187831
I think he means putting the prompt in Chinese and seeing if it does better. The dude is dumb for saying way too little about what he wants and expecting it to fall out of the sky magically.
Replies: >>106187863 >>106187960
Anonymous
8/8/2025, 10:51:17 AM No.106187863
2025-08-06_16-14-45
2025-08-06_16-14-45
md5: 4cf1fb0953decadb6975d50e2d20c823🔍
>>106187848
If that's the case there'd be no tells on what got gen'd of if it were done via chinese text prompt or not. But heck, I'll try that too. I'll ask deepseek to translate the prompt and run it through
Anonymous
8/8/2025, 10:52:53 AM No.106187867
dicksout_00181_thumb.jpg
dicksout_00181_thumb.jpg
md5: 99b8bebed4fe1397837782c9eac58a1e🔍
>>106187503
Just gotta have a move a little!
Replies: >>106187883
Anonymous
8/8/2025, 10:55:31 AM No.106187883
>>106187867
Man I really wanna get the large models up and running (without comfy). I can only run the 5B video model. Gotta look into loading the quanted models and edit the inference code to be able to load the ggufs. If I can't get it done in a week, I'll probably cave and install comfy
Anonymous
8/8/2025, 11:00:23 AM No.106187906
AnimateDiff_00141_thumb.jpg
AnimateDiff_00141_thumb.jpg
md5: d75740181a91eee650c7e5f9b0ffc101🔍
Noisy first frame. Used https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne V4 model
Anonymous
8/8/2025, 11:03:44 AM No.106187924
sleeping-maid
sleeping-maid
md5: 5065820ec6abe954cbc1631a4691a5de🔍
>>106187264
got it with a slightly modified prompt
>Amateur photograph, a Japanese woman dressed as a maid, sleeping on the Tokyo Metro, her thighs are spread slightly apart and her panties are slightly visible.
>iPhone photo, 4K, Ultra HD.
took testing 2 seeds though. cfg 4.5, seed 43, steps 45
Anonymous
8/8/2025, 11:04:34 AM No.106187931
00104-2754963139-dd68ea70-258b2bf1e9
00104-2754963139-dd68ea70-258b2bf1e9
md5: b152b930ec94eaf6b9c84e530e46a3d7🔍
Anonymous
8/8/2025, 11:11:01 AM No.106187960
dipsy-mandarin-prompt
dipsy-mandarin-prompt
md5: a0d10875d3cec64306ac3a2ef1200d64🔍
>>106187848
>>106187265
Got deepseek to translate the prompt, and this is the first seed output. 2nd one is baking.
cfg 4.5, seed 42, steps 45
original prompt
>This is a digitally drawn anime-style image featuring a Chinese warrior woman with hair buns, foggy round glasses with spirals on them, and wearing a blue dress with whale symbols all over it. She is seated at a wooden desk in a modern office setting. On the desk is part of a half-eaten hot dog and crumbs, the hot dog has a missing part that was bitten off and it's incomplete. She has a serious expression as she extends her right hand to shake hands with a person off-screen to the left. Likely an office colleague. Indicating a break or snack time. The desk is cluttered with various office supplies, including a pencil cup filled with colored pens and markers, a calculator, and a notebook. A Chinese flag is on the right side of the desk. A green potted plant is visible on the left side of the desk, adding a touch of nature to the otherwise busy workspace. The background features a large window with multiple panes, allowing sunlight to stream in and illuminate the room. Outside the window, lush green trees are visible, suggesting an office with a view of nature. The walls are adorned with bookshelves filled with neatly organized binders and books.
Replies: >>106187968
Anonymous
8/8/2025, 11:12:27 AM No.106187968
dipsy-mandarin-prompt2
dipsy-mandarin-prompt2
md5: cf490ffc1ee1a5f601340bd3e63aba10🔍
>>106187960
seed 43
deepseek translation
>这是一幅数字绘制的动漫风格图像,描绘了一位中国女武士。她梳着发髻,戴着雾面圆框螺旋纹眼镜,身穿蓝色连衣裙,裙上布满鲸鱼图案。她坐在现代办公室的木桌前,桌上有一个被咬了一半的热狗和碎屑,热狗缺了一块,显然被咬过。她表情严肃,正伸出右手与画面左侧的屏幕外人物握手,可能是同事,暗示休息或零食时间。桌上凌乱地摆放着各种办公用品,包括装满彩色笔和马克笔的笔筒、计算器和笔记本。桌子右侧有一面中国国旗,左侧有一盆绿色盆栽,为繁忙的工作空间增添了一丝自然气息。背景是一扇多格大窗,阳光透过窗户洒进房间。窗外可见茂密的绿树,表明办公室外是自然景观。墙上装饰着书架,整齐地摆满了文件夹和书籍。
Anonymous
8/8/2025, 11:20:08 AM No.106188009
00113-4089121767-3ca7ec28-258b2bf1e9
00113-4089121767-3ca7ec28-258b2bf1e9
md5: f58425ec7c56aa605b9e5042a496f46f🔍
Anonymous
8/8/2025, 11:22:03 AM No.106188021
WAN_00055_thumb.jpg
WAN_00055_thumb.jpg
md5: 27db4804b948a085849c384227ae04b4🔍
>>106187075
it gives more movement but changes things from the original image, lmao
Anonymous
8/8/2025, 11:30:33 AM No.106188055
v_thumb.jpg
v_thumb.jpg
md5: 8d600c8134667ffeb7d54c669ea1d28f🔍
Anonymous
8/8/2025, 11:34:50 AM No.106188074
>>106187075
were these using kijai loras?
Anonymous
8/8/2025, 11:41:25 AM No.106188102
vid_fn_00086_thumb.jpg
vid_fn_00086_thumb.jpg
md5: 548d12a45500bf849ee3930a413a51ce🔍
Replies: >>106188140
Anonymous
8/8/2025, 11:43:42 AM No.106188119
What is lightning in the context of wan2.2?
Anonymous
8/8/2025, 11:45:09 AM No.106188129
>udpate comfy yesterday
>bricked everything
>fresh isntall, fresh nodes, updated from cuda 12.4 to .8 and updated to python 3.12, installed triton 3.3, sageattention 2.2.1
>old gens before the fresh install
>2 min
>new gens after fresh install
>10 min

Fuck sake. Does anyone know whats possibly happening here? Spent 4 hours today and got it to finally gen but its slow as fuck
Replies: >>106188150 >>106188183 >>106188228
Anonymous
8/8/2025, 11:46:57 AM No.106188140
vid_fn_00088_thumb.jpg
vid_fn_00088_thumb.jpg
md5: 55b571b1b04fc63cf61c006ae63a7bd4🔍
>>106188102
damn this is a tricky one
Anonymous
8/8/2025, 11:47:57 AM No.106188150
>>106188129
portable moment
ポストカード !!FH+LSJVkIY9
8/8/2025, 11:50:32 AM No.106188173
>>106185803 (OP)
>made the collage again
neato :3
Replies: >>106188900 >>106189103
Anonymous
8/8/2025, 11:51:32 AM No.106188180
is the lightning workflow now the best workflow for i2v? does it beat kijai's workflow?
Anonymous
8/8/2025, 11:51:53 AM No.106188183
>>106188129
Same shit here. New Comfy update is completely broken. I get warnings like this when I try to video gen.
>Lib\site-packages\torch\_inductor\utils.py:1436] [0/0] Not enough SMs to use max_autotune_gemm mode
>\ComfyUI\ComfyUI_windows_portable\python_embeded\Lib\site-packages\torch\_inductor\compile_fx.py:282: UserWarning: TensorFloat32 tensor cores for float32 matrix multiplication available but not enabled. Consider setting `torch.set_float32_matmul_precision('high')` for better performance.
Replies: >>106188208 >>106188247
Anonymous
8/8/2025, 11:52:47 AM No.106188187
chroma v50 is out
Replies: >>106188191 >>106188285 >>106189027
Anonymous
8/8/2025, 11:53:21 AM No.106188191
>>106188187
>the final model is here
Anonymous
8/8/2025, 11:54:57 AM No.106188207
i thought v50 would be a 2nd high res epoch but it's a merge of the one high res epoch
Anonymous
8/8/2025, 11:55:03 AM No.106188208
>>106188183
>Not enough SMs
Means your gpu is too old https://arnon.dk/matching-sm-architectures-arch-and-gencode-for-various-nvidia-cards/
Replies: >>106188216
Anonymous
8/8/2025, 11:56:14 AM No.106188215
>49 and 50
Why even bother with 49?
Anonymous
8/8/2025, 11:56:17 AM No.106188216
>>106188208
>Means your gpu is too old
4070ti super. Not the newest or the best, but I think it should still work.
Anonymous
8/8/2025, 11:57:49 AM No.106188228
>>106188129
12.8 is for triton 3.4
Replies: >>106188347
Anonymous
8/8/2025, 12:00:45 PM No.106188247
>>106188183
I went back to an older version of comfyui, an early july release when it worked. It works now but I can only imagine it has something to do with custom nodes (probably wavespeed or some shit).

In your case, you might have to copy the python folders:

C:\Users\ieatassallday\AppData\Local\Programs\Python\Python31(YOUR VERSION NUMBER)\include
C:\Users\ieatassallday\AppData\Local\Programs\Python\Python31(YOUR VERSION NUMBER)\libs

To your comfyui python embedded

C:\ai\yourcomfyuifolder\python_embeded\include
C:\ai\yourcomfyuifolder\python_embeded\libs

Then again, unironically chatgpt5 helped with my issue, yours could be different
Replies: >>106188349
Anonymous
8/8/2025, 12:02:06 PM No.106188255
>the final chroma model meant for others to train on is a slopmerge and not an actual trained epoch
smoothbrained dev
Replies: >>106188344
Anonymous
8/8/2025, 12:04:35 PM No.106188277
I juyst fducking SHARTED
Anonymous
8/8/2025, 12:05:20 PM No.106188285
Chroma-ComfyUI_00306_
Chroma-ComfyUI_00306_
md5: bf807925fe368edb6634a13ca8999cde🔍
>>106188187
Yes! Finally time to retrain my v44 loras
Anonymous
8/8/2025, 12:08:15 PM No.106188294
Chroma-ComfyUI_00321_
Chroma-ComfyUI_00321_
md5: fbff41480867d99c242ff045db19a577🔍
Replies: >>106188705
Anonymous
8/8/2025, 12:09:09 PM No.106188299
sage attention fixed for gwen WHEN?????????
Anonymous
8/8/2025, 12:14:12 PM No.106188321
chromacope
chromacope
md5: 4646eceda53d158f10ddad9d0b822924🔍
AHAHAHAHAHAHA
Anonymous
8/8/2025, 12:18:47 PM No.106188344
>>106188255
It is an actual epoch (v49), not two epochs though.
Anonymous
8/8/2025, 12:19:09 PM No.106188347
vace
vace
md5: 2dc8ba68129103c53218275180cd120f🔍
>>106188228
I'm actually retarded, I had to replace a wan node for it to work and forgot to change the settings from 20 steps back to 4, kek

Backing up this install 5 times, fuck doing that again
Anonymous
8/8/2025, 12:19:50 PM No.106188349
>>106188247
>I went back to an older version of comfyui, an early july release when it worked.
Which version is it? Have to try it if python file copy doesnt work
Replies: >>106188372
Anonymous
8/8/2025, 12:20:05 PM No.106188353
CLIP_FP16_vs_FP32_comparison_
CLIP_FP16_vs_FP32_comparison_
md5: fd6f5a8dcdd38cadc75b001c0862efd9🔍
Well I did some moar testing on this FP32 CLIP.
Seems to have potential imo. There is also another FP32 Illustrious CLIP floating around that I want to get around to testing.
Not shown here, I have tested another FP16 CLIP, which seems to have exhibited behavior more similar to the FP16 CLIP inside the model(don't have a huge experiment sample size for that, admittedly).
While whatever "restoration" this guy did also has a significant effect on quality most probably, I believe that the text encoder benefits from FP32 precision. Further evidenced by the change seen here when loading FP32 CLIP as FP16>>106187649. (Conversely, the image also changes when you load the FP16 CLIP as FP32, not necessarily for the better or worse though)
Replies: >>106188369
Anonymous
8/8/2025, 12:23:02 PM No.106188369
>>106188353
and what are you comparing on that pic?
Replies: >>106188378
Anonymous
8/8/2025, 12:23:22 PM No.106188372
>>106188349
>https://github.com/comfyanonymous/ComfyUI/releases/tag/v0.3.44

Yeah its always worth a revert. Others been having issues with the new update too, so I'm going to wait until further updates on a separate install
Anonymous
8/8/2025, 12:24:46 PM No.106188378
>>106188369
I believe it is written rather in a self-explanatory manner at the top of the image.
Replies: >>106188383
Anonymous
8/8/2025, 12:25:48 PM No.106188383
>>106188378
so fp16 is much better than fp32, thanks
Replies: >>106188394
Anonymous
8/8/2025, 12:26:19 PM No.106188386
250808_12h25m28s_screenshot
250808_12h25m28s_screenshot
md5: cf626ca9dee4c616b801f88defc02d4a🔍
>chroma-unlocked-v50.safetensors
Aaahhh shit, here we go again.
Anonymous
8/8/2025, 12:27:10 PM No.106188390
Say it with me guys. TWO MORE EPOCHS
Anonymous
8/8/2025, 12:27:54 PM No.106188394
>>106188383
(You) (You) (You) (You) (You)
(You) (You) (You) (You) (You)
(You) (You) (You) (You) (You)
Replies: >>106188421
Anonymous
8/8/2025, 12:30:40 PM No.106188405
is 50 final?
Replies: >>106188418 >>106188427
Anonymous
8/8/2025, 12:32:06 PM No.106188415
AnimateDiff_00146_thumb.jpg
AnimateDiff_00146_thumb.jpg
md5: 304ededcf3ca24a9499cee5401df0a30🔍
Anonymous
8/8/2025, 12:32:10 PM No.106188416
How do I prevent ComfyUI from eating all the RAM when changing loras? I've tried different "unload" and "free memory" nodes, toggling smart memory, but it still eats extra 20GBs and I have to restart manually. It's unbearable.
Replies: >>106188423
Anonymous
8/8/2025, 12:32:40 PM No.106188418
>>106188405
Yep.
Anonymous
8/8/2025, 12:33:20 PM No.106188421
>>106188394
???
Replies: >>106188519
Anonymous
8/8/2025, 12:33:49 PM No.106188423
>>106188416
download more ram
Replies: >>106188431
Anonymous
8/8/2025, 12:34:19 PM No.106188427
>>106188405
Yes, it's finally done, now off to train loras!
Anonymous
8/8/2025, 12:35:21 PM No.106188431
>>106188423
I already bought extra 32GBs just for wan.
Anonymous
8/8/2025, 12:37:13 PM No.106188441
20240329_183408000_iOS
20240329_183408000_iOS
md5: 79c0139b4e5ab41b0b9c98a303457358🔍
>upgrade WAN to 2.2 in Comfy
>now every other gen crashes my PC and causes it to reboot
Alright
What the fuck is going on
I thought it was power spiking causing my 5090 to freak out at first but I've been monitoring the power usage and it draws less than gaming at peak load so that can't be it
And image generation still doesn't cause any issues
Replies: >>106188486 >>106188950
Anonymous
8/8/2025, 12:37:35 PM No.106188444
AnimateDiff_00156_thumb.jpg
AnimateDiff_00156_thumb.jpg
md5: e47fd68db66d6642e3f52fac78de673c🔍
Anonymous
8/8/2025, 12:43:56 PM No.106188477
when adding the lightning i2v lora to kiji's workflow, do you have to change any weights or cfg?
Anonymous
8/8/2025, 12:44:52 PM No.106188486
>>106188441
Does it only happen when you are using Comfy ? As in no problems when gaming etc ?
Anonymous
8/8/2025, 12:51:12 PM No.106188519
>>106188421
>Peach
Her gem is no longer floating on her chest and there is only one brick instead of them getting spammed in classic AI slop fashion even though the prompt says "a brick"
>Lara
Her hair is worse and so is her costume, arguably, but background has improved
>Green hair girl
FP32 looks better
>Venom
No longer faded watermark and has more detail
>Peach and Rosalina
Better image. Rosalina no longer has ghost arm and deformed hand.
>Juri
Tossup, but you could argue the original is better
>Grey hair girl
Tossup, you might argue about lack of background, but prompt says nothing about it so it is not text encoder's fault
>Samus
Pure tossup
>Wizard
The ONLY one where FP32 performed undoubtedly worse
>Zelda
Tossup, but I prefer FP32 one.
So you have like 1 image where fp32 performed worse, the rest are either better or equal.
Replies: >>106188646
Anonymous
8/8/2025, 12:51:13 PM No.106188520
>>106186335
I tried but ended up like >>106186399 said, you need an already big community/a lot of followers on RS, or start back when 1.5 released, AND also play it safe/censor yourself from anything that would get you banned from patreon
Anonymous
8/8/2025, 12:53:05 PM No.106188529
>tried to get into comfy UI
>use WAN i2v
>generated a blurry mess

I'm now #redpilled against diffusion.
Fuck this.
Replies: >>106188535 >>106188555
Anonymous
8/8/2025, 12:53:50 PM No.106188535
>>106188529
>he gave up after his first failure
i bet thats gotten you far in life
Replies: >>106188536
Anonymous
8/8/2025, 12:54:04 PM No.106188536
>>106188535
It works, it's just shit.
Anonymous
8/8/2025, 12:57:22 PM No.106188555
>>106188529
wan2gp link in op. comfyorg is currently destroying their ui making it as unstable and uncomfortable as possible
Anonymous
8/8/2025, 1:02:14 PM No.106188588
file
file
md5: 18ddbdb25dfbf003cc15c9d97e433683🔍
what is the annealed chroma v50?
Replies: >>106188603 >>106188825
Anonymous
8/8/2025, 1:04:46 PM No.106188602
vid_fn_00111_thumb.jpg
vid_fn_00111_thumb.jpg
md5: fe4da92669ab03eb3f2b039b756d74e6🔍
Replies: >>106188619
Anonymous
8/8/2025, 1:05:02 PM No.106188603
>>106188588
It's a form of model optimization, from early tests this version seems the be the best
Replies: >>106188825 >>106189027
Anonymous
8/8/2025, 1:05:17 PM No.106188608
1727720103782722
1727720103782722
md5: df91ff10186685fb06b839e9868462e1🔍
>gotoh hitori holding a guitar from the TV Anime bocchi the rock!
so this is the power of Chroma
Replies: >>106188652
Anonymous
8/8/2025, 1:05:33 PM No.106188610
Is there any reason why video generation has such wildly varying speeds? like sometimes it's only 30s/it and then next gen it's 80s/it.
Replies: >>106188675 >>106188731
Anonymous
8/8/2025, 1:06:36 PM No.106188619
>>106188602
>Tsukasa Jun art
based
Anonymous
8/8/2025, 1:09:23 PM No.106188646
>>106188519
by pairs
"holds a brick" in the positive - fp16 follows
"green swimsuit" in the positive - fp16 follows
"bra" in the positive - fp16 follows
etc
flawless fp16 victory

>looks worse/better
subjective, needs more samples, useless otherwise
Anonymous
8/8/2025, 1:09:57 PM No.106188652
>>106188608
wow dude nice gen!
Anonymous
8/8/2025, 1:11:47 PM No.106188670
suffering
suffering
md5: c2b419ee37bc18f57ea68eca84302def🔍
why does it do this with wan video gen
Replies: >>106188687 >>106188697
Anonymous
8/8/2025, 1:12:17 PM No.106188675
>>106188610
the sampler has an effect. try out euler instead of unipc it's more consistent in gen times
Anonymous
8/8/2025, 1:12:52 PM No.106188682
>>106187687
none
Anonymous
8/8/2025, 1:13:31 PM No.106188687
OK I am convinced that it is an LLM instructed to troll now, well played but I am done with giving (You)s, cya.
>>106188670
You are not gonna get far without posting workflow I think.
You are probably using a wrong node somewhere.
Anonymous
8/8/2025, 1:14:36 PM No.106188697
>>106188670
Update comfy
Anonymous
8/8/2025, 1:15:51 PM No.106188705
>>106188294
my jewgyptian snow white wife
Anonymous
8/8/2025, 1:19:52 PM No.106188731
Untitled
Untitled
md5: 76fe096d959cf9151afac4ea845e0a86🔍
>>106188610
go into nvidia settings and set this to Prefer No Sysmem Fallback so that Comfyui will stop trying to fuck you over with normal RAM gens.

for example you might open a side application that uses 1GB of VRAM and then your gens will start trying to use normal RAM which is slow.
Replies: >>106188742
Anonymous
8/8/2025, 1:21:12 PM No.106188742
>>106188731
I mean I'm only using 28GB out of 32 on the vram department, so that's not really an issue.
Anonymous
8/8/2025, 1:32:45 PM No.106188825
>>106188588
>>106188603
>chromaDev samefaging again,
Stop shilling man, it's not funny, move on with your life lodestones. Chroma is dead, people here aren't using your furry shit. MOVE ON!
Anonymous
8/8/2025, 1:33:00 PM No.106188828
is it me or is chroma v50 more coherent but also more sloppy?
Replies: >>106188836 >>106188884
Anonymous
8/8/2025, 1:34:02 PM No.106188836
>>106188828
Can you post some examples? Waiting for the gguf.
Replies: >>106188884
Anonymous
8/8/2025, 1:34:38 PM No.106188843
the day of cope has arrived
Anonymous
8/8/2025, 1:34:52 PM No.106188845
>>106186711
I tried this and my outputs start turning red. How should I set the weights? Kijai has it 3 for high noise and 1 for low noise but I don't know if that works with the new lightning 2.2 loras.
Anonymous
8/8/2025, 1:40:24 PM No.106188884
>>106188828
>>106188836
>
Anonymous
8/8/2025, 1:42:21 PM No.106188894
It's still slopped...
Anonymous
8/8/2025, 1:42:34 PM No.106188896
so what happened to 1 month to cook at 1024 for chroma?
Replies: >>106188903
Anonymous
8/8/2025, 1:42:40 PM No.106188899
>Chroma still has jacked up hands.

Well, it was a good run fellas.
ポストカード !!FH+LSJVkIY9
8/8/2025, 1:42:40 PM No.106188900
landscapediffusiongeneralwasGOODbitch
landscapediffusiongeneralwasGOODbitch
md5: f580629e594df71138f272919e0f8542🔍
>>106188173
smell ya later <3
Replies: >>106188976
Anonymous
8/8/2025, 1:42:55 PM No.106188901
2.1 lightx2v > 2.2 lightning lora

At least for anime. No doubt in my mind I'm getting better results with kijai's workflow using the old lora.
The new one clearly has more 3D-like motion, which doesn't work well for anime.
Replies: >>106188961
Anonymous
8/8/2025, 1:43:28 PM No.106188903
>>106188896
You throw more GPU at it, then it goes faster
Replies: >>106189090
Anonymous
8/8/2025, 1:48:51 PM No.106188930
chromacope5
chromacope5
md5: da33e8bd1c17bc1e6dbf33b7e20ef418🔍
why are doomGODS always right?? hopetards continue to guzzle slop to the point of embarrassment. SDXL remains winning over 2 years later
Anonymous
8/8/2025, 1:51:04 PM No.106188950
>>106188441
You can hard-lock your system if you mix up model datatypes. You might see them show up as "shape" errors. Anyway, I think what happens is the GPU locks up and falls off the PCIe bus, and then the video driver get rugpulled, and at that point your system is crashed and the kernel/windows reboots it.
Anonymous
8/8/2025, 1:51:55 PM No.106188961
>>106188901
yeah it's pretty ass, I'll keep using lightx2v.
ポストカード !!FH+LSJVkIY9
8/8/2025, 1:54:07 PM No.106188976
wan21stillgotitbb_thumb.jpg
wan21stillgotitbb_thumb.jpg
md5: 363b997175ace317dd092cda4504e061🔍
>>106188900
since dubs, get one free<3
>LOVE & LOVE IS THE ONLY THING
Anonymous
8/8/2025, 2:02:07 PM No.106189027
ComfyChroma50a_00307_
ComfyChroma50a_00307_
md5: 72d90613684796841e5ca4b67af47419🔍
>>106188187
>>106188603
Flux pro at home for free! Thank you lodestones
Replies: >>106189060
Anonymous
8/8/2025, 2:03:22 PM No.106189030
ComfyChroma50a_00304_
ComfyChroma50a_00304_
md5: b9a4640180c09acda7ad8d86215da812🔍
Anonymous
8/8/2025, 2:07:15 PM No.106189060
>>106189027
Nice. So our universe sits in a dew drop, well, at least it's better than it all being a simulation.
Anonymous
8/8/2025, 2:12:41 PM No.106189090
>>106188903
it went x3 faster than expected and both last versions came at the same time?
Replies: >>106189118 >>106189123
Anonymous
8/8/2025, 2:14:02 PM No.106189103
SOUPYY_thumb.jpg
SOUPYY_thumb.jpg
md5: 85b1f71081b837916d093f3eac2accd6🔍
>>106188173
Replies: >>106189113
Anonymous
8/8/2025, 2:15:25 PM No.106189113
>>106189103

This nigga is making collage bait!
Anonymous
8/8/2025, 2:15:42 PM No.106189118
ComfyChroma50a_00315_
ComfyChroma50a_00315_
md5: 6a42eada8d519bed2555d3bfd8d107c7🔍
>>106189090
Different anon, also confused about 49+50 concurrent release. Though, I don't feel like it was getting any sharper after testing this every day https://huggingface.co/lodestones/chroma-debug-development-only/tree/main/staging_base_4
Anonymous
8/8/2025, 2:16:49 PM No.106189123
>>106189090
AFAIK they didn't train the last epoch (v50) fully, instead they merged it with another high resolution training fork they had made.

So technically v49 is the 'true' release since it was a full 1024 resolution epoch and not merged with a fork. Of course the only thing that matters is which gives the best results.
Replies: >>106189683
Anonymous
8/8/2025, 2:18:19 PM No.106189143
>final chroma version
>anatomy still fucked

alright, what will be shilled next?
Replies: >>106189388
Anonymous
8/8/2025, 2:23:22 PM No.106189180
ComfyUI_temp_jpite_00002_
ComfyUI_temp_jpite_00002_
md5: 7449eb732fe7833f8f57f3b56c655f06🔍
Anonymous
8/8/2025, 2:34:18 PM No.106189256
>chroma
this is a QWEN thread, poorfags!!!
Anonymous
8/8/2025, 2:36:19 PM No.106189270
chroma_00004_
chroma_00004_
md5: ab2d9f486b490e47666cdc86ed548c71🔍
https://huggingface.co/lodestones/Chroma1-HD/tree/main

its up, hands seem to be fixed at least
Replies: >>106189696
Anonymous
8/8/2025, 2:36:49 PM No.106189274
ComfyChroma50a_00330_
ComfyChroma50a_00330_
md5: eeaae9921d5e472e3f6a18ca976f8a67🔍
Anonymous
8/8/2025, 2:38:15 PM No.106189286
chroma_00001_
chroma_00001_
md5: 56cd3c17060ef4491906670da1ad84a2🔍
same with eyes on larger images
Replies: >>106189335 >>106189466
Anonymous
8/8/2025, 2:39:23 PM No.106189291
>>106186926
That one is particularly nice.
Anonymous
8/8/2025, 2:40:14 PM No.106189296
chroma_00003
chroma_00003
md5: 6d3ef05def0d50617bda866935403325🔍
Anonymous
8/8/2025, 2:41:19 PM No.106189304
chroma_00008
chroma_00008
md5: b14269bff72299e05576278f452152b0🔍
Replies: >>106189351
Anonymous
8/8/2025, 2:42:21 PM No.106189309
chroma_00350_
chroma_00350_
md5: 627b2a59ecba17afced9744a755dc566🔍
yea, all small details seem to be fixed now, and the prompt following is great, maybe not quite qwen level, but its also not style locked like qwen is
Replies: >>106189327
Anonymous
8/8/2025, 2:44:03 PM No.106189321
So do I grab both versions of chroma?
Anonymous
8/8/2025, 2:44:46 PM No.106189325
>>106185951
2080TI Will suffice for most picture genning up to Illustrious
Anonymous
8/8/2025, 2:45:01 PM No.106189327
>>106189309
she has different eye color and ear piercing also long nails
Anonymous
8/8/2025, 2:45:54 PM No.106189335
>>106189286
Looks great! What prompts did you use to get that cinematic look?
Anonymous
8/8/2025, 2:46:59 PM No.106189341
some nsfw ones
https://files.catbox.moe/txo33c.jpeg
https://files.catbox.moe/op2bl4.png
https://files.catbox.moe/655vue.png
Replies: >>106189372
Anonymous
8/8/2025, 2:47:24 PM No.106189351
>>106189304
HOLY FUGGIN SLOPPA
Anonymous
8/8/2025, 2:49:31 PM No.106189372
>>106189341

These aren't even made with the newest checkpoint though kek, doing it a disservice.
Replies: >>106189444
Anonymous
8/8/2025, 2:52:09 PM No.106189388
>>106189143
Seriously, why is chroma often broken for most basic ass shit?
I am getting anatomy errors that weren't common in SD1.5 days.
Did they fuck up training params so much that they destroyed base model's knowledge?
This was such a wasted opportunity to become the next big thing in local genning.
Shame.
Replies: >>106189397 >>106189458 >>106189571
Anonymous
8/8/2025, 2:53:34 PM No.106189397
>>106189388
can you show me a example? maybe your using a bad sampler combo
Replies: >>106189628
Anonymous
8/8/2025, 2:57:02 PM No.106189418
ComfyChroma50a_00338_
ComfyChroma50a_00338_
md5: 8d5116f6305d2d3cc0245752858c9744🔍
>MFW we failed to solve the nogen negativity disease
Replies: >>106189683
Anonymous
8/8/2025, 2:57:37 PM No.106189424
bonging my tangent
Anonymous
8/8/2025, 2:59:09 PM No.106189434
So Chroma-annealed is just the new name for detail-calibrated then?
Replies: >>106189445 >>106189558
Anonymous
8/8/2025, 3:00:04 PM No.106189444
>>106189372
>it must be the NEWEST
>if i can recognize it, its SHIT!!
you are autistic and annoying as fuck bro
Anonymous
8/8/2025, 3:00:16 PM No.106189445
>>106189434
I don't think so? I could not find any info on what that is
Replies: >>106189471
Anonymous
8/8/2025, 3:01:18 PM No.106189458
>>106189388
because it was trained at 512x512 on a lobotomized version of the already underperforming flux schnell. not only did it have to re-learn basic coherence which was lost during de-distillation, it also had to try and learn new anatomy/tags on top. chroma is a foundational model project being trained on a SDXL finetune budget. he constantly tweaks things and merges things every other epoch. the dataset kept shrinking as the epochs were 'taking too long'.

anyone who was a veteran of the "resonance cascade" furfag failbake knew what to expect with this one. 'locking in' isn't a thing, you can tell by epoch 13 whether or not a model will sort itself out. chroma could've worked if it was trained normally on a bigger dataset with more compute, but compute (money) remains the ultimate moat keeping local NSFW finetunes from ever reaching their full potential.
Replies: >>106189571 >>106189628
Anonymous
8/8/2025, 3:01:46 PM No.106189466
>>106189286
Nice, looks like a promotional film still from a movie ~2010
Anonymous
8/8/2025, 3:02:09 PM No.106189471
1747311024748924
1747311024748924
md5: caa583bffa3973db768143e4727dbd25🔍
>>106189445
Looks like he wants to keep it a mystery or maybe he'll explain it in the new model card.
Anonymous
8/8/2025, 3:02:14 PM No.106189473
>replying to yourself to sound "smart"
ooffff
Anonymous
8/8/2025, 3:03:58 PM No.106189490
>doomfags were right again
that's it, i'm subbing to midjourney
Anonymous
8/8/2025, 3:04:41 PM No.106189500
1676256605978298
1676256605978298
md5: f352722f191564a5c5929d8dbfd36c28🔍
any checkpoints for making looping animations\gifs\webm???
Anonymous
8/8/2025, 3:06:42 PM No.106189522
>q8 out
lessgo
Replies: >>106189547
Anonymous
8/8/2025, 3:09:02 PM No.106189540
apparently this is the TE to use?
https://huggingface.co/silveroxides/flan-t5-xxl-encoder-only-GGUF/blob/main/flan-t5-xxl-Q8_0.gguf

at least according to https://civitai.com/models/1825018/chroma-wf-done-properly
Anonymous
8/8/2025, 3:09:50 PM No.106189547
>>106189522

WHERE AT BIG DAWG
Anonymous
8/8/2025, 3:10:16 PM No.106189552
QUICK someone besides the schizo make the fucking bake
Replies: >>106190193
Anonymous
8/8/2025, 3:11:10 PM No.106189558
>>106189434
My guess is 50a is a 49 detail merge with a smidge of extra training at a lower LR
Anonymous
8/8/2025, 3:12:49 PM No.106189571
>>106189388
>>106189458
Obvious samefag, stop being so pathethic

Chroma is easily the best base model for photorealism and equally good as any other for art, and yes, despite being trained on a shoestring budget compared to its competition.

Like with every previous successful model, the potential comes with loras and finetunes, and this excels with loras already, super easy to train a person or style lora.

And of course it is uncensored, with understanding of genitals trained back in, and no mutilated nipples, so training NSFW loras for this will be a breeze.
Replies: >>106189628
Anonymous
8/8/2025, 3:14:40 PM No.106189589
I miss the R guy
Replies: >>106189608
Anonymous
8/8/2025, 3:16:37 PM No.106189608
>>106189589
hes literally in the thread still dumbass
Anonymous
8/8/2025, 3:18:23 PM No.106189625
I will never not be smug about the failure of chroma.
Anonymous
8/8/2025, 3:18:37 PM No.106189628
muh samefag
muh samefag
md5: 3d58f08881b76f4637734cdf2e496cf6🔍
>>106189397
I delete most of the deformed slop I get but here are some that I forgot to:
https://litter.catbox.moe/kdmmggdk3wlmwaom.png
https://litter.catbox.moe/10q16jjat4sxih9w.png
https://litter.catbox.moe/258kgr18nsq123dz.png
>>106189458
Thanks for the response.
More or less what I expected to hear.
>>106189571
Stop being a schizo.
I think chroma CAN make good gens under some select circumstances but the overall package is too damaged to be worthwhile.
I can just gen NSFW on SDXL finetunes which are much faster and reliable.
Chroma does have the advantage of better text and prompting, but this is rather niche for NSFW.
Replies: >>106189643 >>106189685
Anonymous
8/8/2025, 3:19:25 PM No.106189635
Oh also I trained that qwen LoRA as a test and yeah it works but maybe I'm just fucking crazy but it like doubled inference time and the results were kind of meh. I probably way undertrained it though, only around 2000 steps.
Replies: >>106189935
Anonymous
8/8/2025, 3:20:39 PM No.106189643
>>106189628
that is V48 though, try V50
also it looks like your using the wrong text encoder
Replies: >>106189716
Anonymous
8/8/2025, 3:24:29 PM No.106189683
>>106189123
what an absolute clusterfuck of autism, but what can you expect from a furfag
>>106189418
that is complete shit
Anonymous
8/8/2025, 3:24:42 PM No.106189685
>>106189628
>I can just gen NSFW on SDXL finetunes which are much faster and reliable
And you will be able to gen even better NSFW on Chroma loras and finetunes, do you not know the HUGE difference between SDXL and said finetunes ?

Like SDXL and Flux etc, Chroma is a BASE model, it will not excel at specific things because it is a general model made to be extended through loras and finetunes

Nobody uses plain SDXL or Flux for anything, stop being retarded
Replies: >>106189708 >>106189716
Anonymous
8/8/2025, 3:25:46 PM No.106189695
here is my first try with chroma https://files.catbox.moe/6cg7to.png
Anonymous
8/8/2025, 3:26:01 PM No.106189696
>>106189270
>-HD
whats the diff with v50?
Anonymous
8/8/2025, 3:26:53 PM No.106189708
>>106189685
WOW surely you have a link to those chroma finetunes
Replies: >>106189717 >>106189748
Anonymous
8/8/2025, 3:27:25 PM No.106189716
>>106189643
I EXTREMELY STRONGLY doubt all the problems magically went away in last two epochs, but I eventually intend to check it out.
Also t5 xxl is indeed the correct text encoder, even as mentioned in chroma's hugginface.
>>106189685
I DON'T expect it to get all the fetishes and styles out of the box right
I DO expect it NOT to make unbelievably simple anatomy errors we don't deal with in any other model, and I DON'T expect any finetune or LORA to fix such grave, foundational problems.
Anonymous
8/8/2025, 3:27:26 PM No.106189717
>>106189708
nta but you realize it just released today right?
Replies: >>106189733
Anonymous
8/8/2025, 3:29:22 PM No.106189733
>>106189717
you realize that there are 45643216549 "BASE" models that have never gotten trained to be actually usable
this moronic over optimism is crazy, and it happens every time a model releases
Replies: >>106189787 >>106189791 >>106189801 >>106189805
Anonymous
8/8/2025, 3:30:17 PM No.106189748
>>106189708
It came out 2 hours ago you absolute mong

How long do you think it took for SDXL to get finetunes ? Like holy shit how stupid are you ?
Replies: >>106189790
Anonymous
8/8/2025, 3:33:45 PM No.106189787
Chroma-ComfyUI_00298_
Chroma-ComfyUI_00298_
md5: b85e2eb8cb463a3bf0761ea7a8fe3fc3🔍
>>106189733
All it takes to get great results on Chroma is to train a fucking lora, which you can do on low end cards like a 3060 in ~4 hours
Replies: >>106189794 >>106189814
Anonymous
8/8/2025, 3:33:54 PM No.106189788
>went from "chroma is the finetune flux needed!" to "chroma is just a base model for future finetunes!"
the absolute goalpost moving cope. 512x512 training killed the potential. even 20 epochs at 1024x1024 would've resulted in a better model. holy shit it's 2022-tier
Replies: >>106189841
Anonymous
8/8/2025, 3:34:09 PM No.106189790
>>106189748
how long do you think its going to take for a model like chroma that is several times more expensive than sdxl then? by the time there is a team willing to drop a huge sack of money on it, there will be a new next gen meme
Replies: >>106189799
Anonymous
8/8/2025, 3:34:10 PM No.106189791
here is that anon's batman one on v50

>>106189733
I dont get why you are so negative? since when did we ever get such a uncensored model? sd1.5 was the closest and that wasn't even close in how uncensored it was on top of the prompt following / quality difference
Anonymous
8/8/2025, 3:34:30 PM No.106189794
>>106189787
>great results
oooffff
Anonymous
8/8/2025, 3:34:39 PM No.106189796
Screenshot 2025-08-08 093316
Screenshot 2025-08-08 093316
md5: 90162ca07f9827b13c2857cd588f9d03🔍
>downloaded wan 2.2 from the op, can i2v generate fine
>get a t2i prompt so i can then use those images to i2v for funsies
>get this after downloading all requirements for the t2i workflow
what am i doing wrong exactly
Replies: >>106189872
Anonymous
8/8/2025, 3:34:50 PM No.106189799
>>106189790
???
All the finetunes on Pony are on significantly smaller datasets.
Replies: >>106189817
Anonymous
8/8/2025, 3:34:55 PM No.106189801
>>106189733
chroma was usable months ago already, sis
Anonymous
8/8/2025, 3:35:10 PM No.106189805
>>106189733
I dont get why you are so negative? since when did we ever get such a uncensored model? sd1.5 was the closest and that wasn't even close in how uncensored it was on top of the prompt following / quality difference
Replies: >>106189915
Anonymous
8/8/2025, 3:35:56 PM No.106189814
>>106189787

Best trainer software for Chroma?
Replies: >>106189855
Anonymous
8/8/2025, 3:36:02 PM No.106189817
>>106189799
show me 1 (one) finetune of pony that has meaningfully improved it
Replies: >>106189822 >>106189826 >>106189845
Anonymous
8/8/2025, 3:36:51 PM No.106189822
>>106189817
nearly any of them?
there are characters\danbooru tags etc
have you been under a rock the last 1000 days?
Replies: >>106189843
Anonymous
8/8/2025, 3:36:56 PM No.106189826
>>106189817
are you serious right now? There are like hundreds all for different styles or content focuses. I think your just retarded anon
Replies: >>106189843
Anonymous
8/8/2025, 3:38:12 PM No.106189841
>>106189788
You know this is something I don't understand.
How did they expect it to work out? Asking this rather seriously.
Models learn to generate resolutions they are trained at. (With some wiggle room for nearby resolutions)
A 512x model will shit itself when trying to generate 1024x1024, conversely a 1024x model won't make a good 512x512 image.
If you mix both, instead of getting a model that can do both, you get a confused model that can do neither well.
Anonymous
8/8/2025, 3:38:14 PM No.106189843
>>106189822
>>106189826
a lora for a specific style or a character is NOT a finetune
Replies: >>106189863 >>106189878
Anonymous
8/8/2025, 3:38:18 PM No.106189845
>>106189817
You mean like Pony Realism? The one everyone used for months?
Replies: >>106189977
Anonymous
8/8/2025, 3:38:59 PM No.106189855
>>106189814
I've been using Diffusion-Pipe which works great, there's also AI-Toolkit and I think Kohya is adding support

Hopefully OneTrainer will get it as well, but it seems to be in a development hiatus
Replies: >>106189882
Anonymous
8/8/2025, 3:39:21 PM No.106189863
>>106189843
for style it is, there are realism finetunes that are night and day, cartoon looking ones, 2d, 2.5d, there are ones trained on different prompting rules, one better at furry, ones better at animie , ones better at horror...
Replies: >>106189977
Anonymous
8/8/2025, 3:40:13 PM No.106189872
>>106189796
wrong text encoder probably
Replies: >>106189913
Anonymous
8/8/2025, 3:40:53 PM No.106189878
>>106189843
>goalposts moved
just admit you are wrong for once
Replies: >>106189892
Anonymous
8/8/2025, 3:41:14 PM No.106189882
>>106189855

AI-Toolkit has been kinda ass for chroma and wan for me personally, dunno why.
Replies: >>106189898 >>106189901
Anonymous
8/8/2025, 3:41:51 PM No.106189892
>>106189878
He can't because he'd have to admit he can't run Chroma on his mother's laptop.
Replies: >>106189917
Anonymous
8/8/2025, 3:42:14 PM No.106189898
>>106189882
nta but I always liked diffusion pipe best, needs wsl2 though, here is help on installing it
https://civitai.com/articles/12837/full-setup-guide-wan21-lora-training-on-wsl-with-diffusion-pipe
Anonymous
8/8/2025, 3:42:34 PM No.106189901
>>106189882
Hmm.. perhaps give Diffusion-Pipe a try, if you are on Windows you will have to run it through WSL2 though
Anonymous
8/8/2025, 3:44:21 PM No.106189913
>>106189872
is there a decent t2i guide? i dont see one in the op and id love to play around with this shit while i work
Anonymous
8/8/2025, 3:44:50 PM No.106189915
>>106189805
This guy has been hating on Chroma AND defending BFL for ages, samefagging like a madman

Enter conversations with him knowing that
Replies: >>106189977
Anonymous
8/8/2025, 3:45:03 PM No.106189917
>>106189892
kekked
Anonymous
8/8/2025, 3:45:31 PM No.106189923
im downloading chroma v50 I will compare the slops with qwen.

give me prompts I will do some runs
Replies: >>106189948 >>106190187 >>106190340
Anonymous
8/8/2025, 3:46:31 PM No.106189935
>>106189635
What was the VRAM usage? Similar to inference? Guessing you used an H100?
Anonymous
8/8/2025, 3:47:11 PM No.106189948
>>106189923
Make sure to do porn prompts so you can learn the real difference.
Replies: >>106189970
Anonymous
8/8/2025, 3:48:55 PM No.106189970
>>106189948
qwen can half do breasts but that is about it. THe main issue with qwen after a few days of using it though is that it is overcooked, you will only get super samey images, though they look good. And good luck changing the style much
Replies: >>106190006 >>106190255
Anonymous
8/8/2025, 3:49:17 PM No.106189977
>>106189915
im not whoever you think i am because i also dislike flux and the entirety of bfl
>>106189863
>>106189845
i will concede pony, im from the anime side and there was pretty much no progress until illustrious, which was hardly ideal
Replies: >>106189987 >>106189992
Anonymous
8/8/2025, 3:50:03 PM No.106189983
file
file
md5: f69eecd24188bd43add81aad0474a754🔍
It seems good enough to me.
Anonymous
8/8/2025, 3:50:43 PM No.106189987
>>106189977
>pretty much no progress
your wrong, pony realism was still night and day better till just super recently and now chroma, and I still use some pony tunes for certain styles instead of illustrious / noob
Replies: >>106190019
Anonymous
8/8/2025, 3:51:05 PM No.106189992
>>106189977
What progress do you think can be made? SDXL is a very shitty architecture with one of the worst text encoders imaginable.
Replies: >>106190000 >>106190019
Anonymous
8/8/2025, 3:52:21 PM No.106190000
>>106189992
>if i see PONY I WILL CALL IT SLOPPAAAA!!
yep, its HIM.
Replies: >>106190012
Anonymous
8/8/2025, 3:52:56 PM No.106190006
>>106189970
I haven't had style change issues, granted I've only tried 3DCG, pixel art, comic, anime, crayon and pencil drawings, and realism. It failed to do CCTV and grainy film footage but I probably didn't prompt well enough. Breasts were on average ok but depending on seed they get better. Genitals is just ugly bulges and weird shapes.
Replies: >>106190129
Anonymous
8/8/2025, 3:53:15 PM No.106190012
>>106190000
Have you seen SDXL outputs or are you a Jeet and can't tell what's AI slop?
Replies: >>106190032
Anonymous
8/8/2025, 3:54:00 PM No.106190019
>>106189987
yes and you can see all the annoying quirks of pony in those styles
>>106189992
what progress do you think can be made with chroma? sure there will be lora styles and celebrities and smaller scale trainings, but i doubt anyone will pretty much retrain the entire model AGAIN to remove all its problems
Replies: >>106190064 >>106190065
Anonymous
8/8/2025, 3:55:01 PM No.106190032
>>106190012
you can use depreciated models\software and still create art with merit anon
you are an autist
Anonymous
8/8/2025, 3:58:23 PM No.106190064
>>106190019
Finetunes for style and coherence are significantly smaller with less than 10,000 images. A base model ultimately is throwing lots of shit into the model's knowledge and it's like creating lots of radio stations, a finetune locks in the signal of a specific radio station.
Replies: >>106190111
Anonymous
8/8/2025, 3:58:30 PM No.106190065
>>106190019
BigASP guy already stated he is planning to use Chroma as a base for a finetune
Replies: >>106190111 >>106190134
Anonymous
8/8/2025, 4:02:45 PM No.106190111
>>106190065
i think his trainings are very cool and the stuff he writes about them and his tools, but i dont think many people use them in practice
>>106190064
from what i can tell from their discord lodestone used some sort of RL and step distilled it, im not so sure about the base model status
Replies: >>106190153
Anonymous
8/8/2025, 4:04:01 PM No.106190124
WanVideo2_2_I2V_00016_thumb.jpg
WanVideo2_2_I2V_00016_thumb.jpg
md5: 0e822143b9711de6640af8b167674edd🔍
How do I make this motion more aggressive? It has the general fluidity of a punch; its just slow and weak as fuck
Replies: >>106190146
Anonymous
8/8/2025, 4:04:16 PM No.106190129
>>106190006
now try getting different looking images with somewhat the same prompt and different seeds
Anonymous
8/8/2025, 4:04:30 PM No.106190134
file
file
md5: 8b0b6bafc8fad32902e2bc05debb9176🔍
>>106190065
i thought he was still deliberating?
Anonymous
8/8/2025, 4:05:46 PM No.106190146
>>106190124
she almost looks native alaskan kek
Anonymous
8/8/2025, 4:06:22 PM No.106190153
file
file
md5: 55cda642d2c38a30fa6dbdc94e63c1e2🔍
>>106190111
It trains just fine.
Replies: >>106190157 >>106190165
Anonymous
8/8/2025, 4:07:00 PM No.106190157
>>106190153
OOOOFFFFFF
Anonymous
8/8/2025, 4:08:17 PM No.106190165
>>106190153
i can go make a lora for base sdxl or something as well, but that doesnt say shit about how larger scale training will go
Replies: >>106190175
Anonymous
8/8/2025, 4:09:11 PM No.106190171
1boy, 1girl, couple, hug from behind, hand on own chin, hands on another's waist, wariza, art by Incase, from above, limited palette, orange theme, color lineart, woven hatching, blue outline, muted color, slice of life, chiaroscuro, stage lights, acrylic paint (medium), rating:general
Replies: >>106190187 >>106190447
Anonymous
8/8/2025, 4:09:23 PM No.106190175
>>106190165
"Large scale". Hate to break it to you anon, but no one does 100k+ image finetunes. They're using 150 synthetic images.
Replies: >>106190205
Anonymous
8/8/2025, 4:09:54 PM No.106190181
>use 2.2 guide
>turn animated previews on like it suggests
>no animated preview
what gives
Replies: >>106190235 >>106190302
Anonymous
8/8/2025, 4:10:14 PM No.106190187
>>106190171
>>106189923
Anonymous
8/8/2025, 4:10:49 PM No.106190193
>>106189552
Anonymous
8/8/2025, 4:11:55 PM No.106190205
>>106190175
kek well yeah but then we have come full circle to what i was saying in the beginning that its not wise to expect for a finetune to "save" chroma or any other """base""" model
Replies: >>106190266
Anonymous
8/8/2025, 4:14:19 PM No.106190235
>>106190181
the 2.2 guide is honestly fucking garbage. the t2v workflow it provides doesn't even work with the stuff it makes you download.
Anonymous
8/8/2025, 4:16:16 PM No.106190255
>>106189970
its not overcooked, its trained on long detailed prompts, so when you use simple words only it will result in the samey appearance
Replies: >>106190338
Anonymous
8/8/2025, 4:17:17 PM No.106190266
file
file
md5: 33da72f07a7581ec34cb18614cc9edd2🔍
>>106190205
Anon, your assertion on its face is retarded. You can use Chroma right now, it doesn't need to be "saved". And you're not the king of diffusion models, so you being unable to run Chroma (the real problem) is not my problem. But it's funny though, you're the reason why the distill these models to produce a very strict range of images so you will clap like a retarded seal.
Replies: >>106190316
Anonymous
8/8/2025, 4:17:28 PM No.106190269
1731082114272609
1731082114272609
md5: 6476a45cfd5f9b29a845904841153196🔍
Replies: >>106190289
Anonymous
8/8/2025, 4:17:44 PM No.106190272
Ok gonna try chroma after only gooning with SDXL for the most part
How do I prompt it?
Replies: >>106190291 >>106190305
Anonymous
8/8/2025, 4:19:05 PM No.106190289
>>106190269
r e n t f r e e
Anonymous
8/8/2025, 4:19:34 PM No.106190291
>>106190272
Boomer prompt it to the max.
Anonymous
8/8/2025, 4:20:26 PM No.106190302
>>106190181
Open the comfyui manager and there's an option for previews, set it to auto
Anonymous
8/8/2025, 4:20:36 PM No.106190305
>>106190272
Don't forget to get t5 as well.
Replies: >>106190345
Anonymous
8/8/2025, 4:21:45 PM No.106190316
>>106190266
well if i wanted silly meme images i can just use qwen or base flux with some lora, but i doubt it can make realism even close to wan or some pony finetune for porn or anime close to illustrious/noob or novelai
Replies: >>106190327
Anonymous
8/8/2025, 4:22:51 PM No.106190327
>>106190316
> but i doubt it can make realism even close to wan or some pony finetune
Okay you're just trolling.

>A cinematic screencap from an action movie. Master Chief from Halo holds a radio to his face and the subtitle caption reads "Get the poorfags out of here!" The frame at the top show him talking into a radio. Kirby can be seen floating in the background, he is a soft fuzzy round character. The shot is letterboxed. The background depicts a wartorn cityscape.
Pick your favorite SDXL finetune.
Replies: >>106190366
Anonymous
8/8/2025, 4:22:52 PM No.106190328
soo do i pick the annealed verison or not?
Replies: >>106190360
Anonymous
8/8/2025, 4:23:16 PM No.106190330
chroma1 hd fp8 where please and thank you
Anonymous
8/8/2025, 4:23:26 PM No.106190334
>sliding this trash off the catalog
Replies: >>106190372
Anonymous
8/8/2025, 4:23:51 PM No.106190338
>>106190255
I did, I used the qwen prompt extender and changed a few sentences, still almost the same generation which means the model actually over trained. It could maybe be fixed with a finetune but just saying
Anonymous
8/8/2025, 4:23:55 PM No.106190340
>>106189923
I am testing v50 annealed. One thing I notce is higher color saturation.
Replies: >>106190398
Anonymous
8/8/2025, 4:24:09 PM No.106190345
>>106190305
I have t5 from flux days, I guess this hasn't changed?
Replies: >>106190427
Anonymous
8/8/2025, 4:24:26 PM No.106190352
chroma has been a great model for many epochs. its already better at most things than most models. anatomy issues can be somewhat alleviated with a higher cfg. the model however does need a realism LoRA to stabilize outputs.
Anonymous
8/8/2025, 4:24:53 PM No.106190360
>>106190328
I think annealed means easier to finetune somehow? At least going by the definition, more malleable
Anonymous
8/8/2025, 4:25:54 PM No.106190366
>>106190327
yes a nice meme image that is full of blur
Replies: >>106190380
Anonymous
8/8/2025, 4:26:17 PM No.106190372
>>106190334
what do you mean anon?
Replies: >>106190397
Anonymous
8/8/2025, 4:26:38 PM No.106190374
Is huggingface chroma workflow good enough or is there something better?
Replies: >>106190396
Anonymous
8/8/2025, 4:27:01 PM No.106190380
>>106190366
Where's that SDXL greyslop again? I assume you tried the prompt and gave up.
Replies: >>106190389
Anonymous
8/8/2025, 4:28:07 PM No.106190389
>>106190380
how is he supposed to compare with 4GB of integrated graphics, anon?
Replies: >>106190405
Anonymous
8/8/2025, 4:28:29 PM No.106190396
>>106190374
It should work fine, perhaps use min_padding 1 instead of the Comfy default 0
Anonymous
8/8/2025, 4:28:32 PM No.106190397
>>106190372
>page 10
fuck you ai niggers im bumping all other threads until you die
Replies: >>106190408
Anonymous
8/8/2025, 4:28:37 PM No.106190398
>>106190340
It looks less responsive to a character Lora I have. I am going to try v50 and compare.
Anonymous
8/8/2025, 4:29:20 PM No.106190405
>>106190389
>muh pissin contest
your 50 series will not get you laid anon and your gens are still shiddy\fardy\brappy
Replies: >>106190411 >>106190418
Anonymous
8/8/2025, 4:29:38 PM No.106190408
>>106190397
What a sad life, such limp dick energy
Anonymous
8/8/2025, 4:29:59 PM No.106190411
>>106190405
you being poor and angry hurts your chances thats for sure
Anonymous
8/8/2025, 4:30:26 PM No.106190418
>>106190405
You know, because we're not Zoomers we don't need your approval to enjoy anything. You should try it sometime.
Replies: >>106190736
Anonymous
8/8/2025, 4:31:04 PM No.106190424
Ok tech retard here, I got venv for comfy with python 3.12, how can I update it to 3.13? Or do I have to redownload all of the packages?
Replies: >>106190438 >>106190446
Anonymous
8/8/2025, 4:31:15 PM No.106190427
>>106190345
Nope. Uses same text encoder.
Anonymous
8/8/2025, 4:31:41 PM No.106190438
>>106190424
Why the fuck are you upgrading Python?
Anonymous
8/8/2025, 4:32:39 PM No.106190446
>>106190424
YES
Delete venv. Create a 3.13 venv. source venv/bin/active. pip install -r requirements.txt
Anonymous
8/8/2025, 4:32:41 PM No.106190447
1742498928066792
1742498928066792
md5: 4ac45c1d0f598734e0409cd20c8c10b6🔍
>>106190171
chroma (sloppy) cfg 4 seed 42.
Replies: >>106190471
Anonymous
8/8/2025, 4:33:33 PM No.106190454
>>106190450
>>106190450
>>106190450
>>106190450
>>106190450
Replies: >>106190619
Anonymous
8/8/2025, 4:35:15 PM No.106190471
1738904759535301
1738904759535301
md5: a9404aa6fe3bc14a28856f4387558144🔍
>>106190447
qwen.. i dont understand whats happening with chroma, I've used both the integrated comfy workflow and the one they have on the model card
Replies: >>106190619
Anonymous
8/8/2025, 4:48:15 PM No.106190591
ComfyUI_00008_
ComfyUI_00008_
md5: e12887eabf1faeae114b5b164caa658f🔍
qwen-image just keeps growing on me. It looks great at 50 steps cfg 3.5 ddim/ddim_uniform. It handles a complex prompt well.
I'm very much looking forward to loras and finetunes for this model.
Replies: >>106190619
Anonymous
8/8/2025, 4:50:43 PM No.106190619
>>106190471
>>106190591

>>106190454
Anonymous
8/8/2025, 4:51:04 PM No.106190624
>>106190556 ?
Anonymous
8/8/2025, 5:01:25 PM No.106190736
>>106190418
meanwhile you do nothing but hate on Rnon