Thread 107006468

329 posts 178 images /g/

Anonymous 10/25/2025, 8:37:12 PM No.107006468 [Report] >>107006514 >>107008984

/ldg/ - Local Diffusion General

highlights_g_107001451_1761417397_thumb.jpg.webm md5: 414601fd...

v7 Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107001451

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo

Anonymous 10/25/2025, 8:38:35 PM No.107006484 [Report]

what is opinion of pony v7, krea video, lightx2v lora nwe

Anonymous 10/25/2025, 8:38:57 PM No.107006487 [Report]

Blessed thread of frenship

Anonymous 10/25/2025, 8:41:45 PM No.107006514 [Report] >>107006522 >>107006527

>>107006468 (OP)
>not a single anime girl
:(

Anonymous 10/25/2025, 8:42:39 PM No.107006522 [Report]

>>107006514
anime died with pony

Anonymous 10/25/2025, 8:43:12 PM No.107006527 [Report]

1743952343601771_thumb.jpg.webm md5: 6c069dd3...

WebM not supported

>>107006514
>but there's a clown girl
a massive upgrade, then.

Anonymous 10/25/2025, 8:45:35 PM No.107006544 [Report] >>107006567 >>107006760 >>107006793

fae_1292.jpg md5: d3c23881...

>>107006326
It is more associated with "pointy ears" than elf, as I can get it with "demon girl" or "fairy" as in pic related.
>>107006523
Wow those do look a lot like the earrings I keep getting, especially when not trying to prompt around it. But does Chroma know "Frieren"? I thought it had characters and styles removed.

Anonymous 10/25/2025, 8:45:44 PM No.107006545 [Report] >>107006658

>It isn't that half bad desu
it makes sense that the guy who shills netadogshit would think that ponyv7 isn't half bad. horrendous shit taste.

Anonymous 10/25/2025, 8:48:09 PM No.107006562 [Report]

my prompts are too strong for you, anon

Anonymous 10/25/2025, 8:48:45 PM No.107006567 [Report]

>>107006544
>But does Chroma know "Frieren"? I thought it had characters and styles removed.
Yeah chroma knows many characters and styles, just need longer prompting than just name

Anonymous 10/25/2025, 8:50:50 PM No.107006585 [Report] >>107006627

anon, i tell you i am going to 1girl, and i want only your strongest prompts

Anonymous 10/25/2025, 8:51:46 PM No.107006588 [Report]

(you) can't handle my strongest 1girl

Anonymous 10/25/2025, 8:56:13 PM No.107006627 [Report]

>>107006585
masterpiece, loli, photorealistic

Anonymous 10/25/2025, 8:56:31 PM No.107006629 [Report] >>107006704

>>107005962
Same reason Princess Peach always has an Iron Man core embedded in her chest no matter what clothes she's wearing, if almost 100% of the examples of a given object have that particular feature then as far as that model is concerned it is an inherent aspect of that object. It's not like these models make any inherent distinction between clothing and body parts.

Anonymous 10/25/2025, 8:58:50 PM No.107006647 [Report] >>107006656 >>107006703 >>107006742

so did astralite comment about why v7 shat the bed so hard or is he pivoting straight to another grift

Anonymous 10/25/2025, 8:59:46 PM No.107006656 [Report]

>>107006647
Pony v6 was simply a fluke.
When he announced v7 and what was going on with it that already was a clear sign that the model is going to be a failure.

Anonymous 10/25/2025, 8:59:55 PM No.107006658 [Report]

>>107006545
>singular

Anonymous 10/25/2025, 9:01:42 PM No.107006675 [Report]

Anybody who defends NetaLumina or Ponyv7 is deranged and cannot be trusted.

Anonymous 10/25/2025, 9:03:47 PM No.107006690 [Report]

dot.jpg md5: 648c38da...

Anonymous 10/25/2025, 9:05:23 PM No.107006703 [Report]

>>107006647
The latter

Anonymous 10/25/2025, 9:05:24 PM No.107006704 [Report] >>107006720 >>107006731

elf-skelly-wat.jpg md5: 56a01bd3...

>>107006629
Well I know there are plenty of elf images that don't have earrings like that. And other models like the SDXL-based ones can make earring-less elves easily enough. It might be that I am pass in a cartoon style image in the workflow, that does not itself have earrings but this nudges the model as well. Perhaps it can realistic pointy earsor other styles just fine without sticking earrings on. Or perhaps if I fed it a empty latent image it wouldn't have the problem. Haven't tested it. Just noticed it was st range. At any rate, I have added frieren to negatives (along with earrings) but I don't start losing the earrings until the cfg gets all the way up to around 5.0 at which point it is looking rather fried so I guess it is not going to fix the problem.

Anonymous 10/25/2025, 9:06:56 PM No.107006720 [Report]

>>107006704
> It might be that I am passing in a cartoon style image*
and
> Perhaps it can do realistic pointy ears or other styles just fine*

Anonymous 10/25/2025, 9:08:09 PM No.107006731 [Report] >>107006840

>>107006704
bretty kino gen

Anonymous 10/25/2025, 9:10:05 PM No.107006742 [Report]

>>107006647
>pivoting straight to another grift
this, as to why anyone would support him after v7, the amount of retards far outweigh people with common sense.

Anonymous 10/25/2025, 9:12:36 PM No.107006760 [Report]

>>107006544
The annoying thing is I can get the earrings to go away by turning down cfg below the correct value (1), but then the image fails to denoise correctly.

Anonymous 10/25/2025, 9:16:46 PM No.107006793 [Report] >>107006829 >>107006840

Chroma-484501471818556_00001_.png md5: 07c4b46f...

>>107006544
I finally succeeded. I was uhh only trying to accomplish no earrings, nothing else in particular

Anonymous 10/25/2025, 9:21:46 PM No.107006829 [Report] >>107006856

>>107006793
prompt?

Anonymous 10/25/2025, 9:22:28 PM No.107006840 [Report] >>107006947 >>107007058

mermaid-girl-aquarium-cartoon.jpg md5: 8329bfa5...

>>107006793
Nice job and nice tits. Did you do anything in particular to make them go away or was it just a lucky gen?
>>107006731
Thanks, you can get some weird results when you use wd14 to interrogate an image, then use its prompt to make a new picture.

Anonymous 10/25/2025, 9:25:48 PM No.107006856 [Report] >>107006947

>>107006829
>boobas, bazoongas, over the shoulder boulder holders
one can only imagine

Anonymous 10/25/2025, 9:29:17 PM No.107006880 [Report]

view.png md5: 3f9d970e...

No finetune is going to fix that trash pony v7 model. Pic unrelated

Anonymous 10/25/2025, 9:39:27 PM No.107006947 [Report] >>107007052

>>107006840
Turning cfg down to ~0.85 had a lot to do with it, but largely just luck

>>107006856
In this case it was "tall and voluptuous" plus "[her] big flabby sagging breasts are tightly bound in her fraying smock and squeezed together for a ton of cleavage"

I found this was um necessary to make the earrings go away

Anonymous 10/25/2025, 9:42:27 PM No.107006975 [Report] >>107007117

Local models should unironically be banned

Anonymous 10/25/2025, 9:45:34 PM No.107007001 [Report]

>nigbobumping this thread

Anonymous 10/25/2025, 9:48:11 PM No.107007022 [Report] >>107007256

is the info of style clustering ANY FUCKING WHERE for ponyv7? I searched in the HF and civitai page, NOT a single fucking link to check these fucking clusters.
Yes I know that pony is shit, but I still wanted to experiment a bit with this new toy.
FML

Anonymous 10/25/2025, 9:51:45 PM No.107007052 [Report] >>107007058

>>107006947
and what about the comic style name?

Anonymous 10/25/2025, 9:53:03 PM No.107007058 [Report] >>107007082 >>107007244

>>107006840
Actually I don't know why I said it was only luck, I did a lot otherwise to try to make it happen (luck was still important of course)

I added no-makeup hashtags, removed anything signifying richness or ornateness, used a lot of words like "plain" "natural" "rustic" "barefaced" etc., tried to force a feral/pauper/tattered appearance, tried to get a retro pulp fantasy aesthetic to avoid modern "character design" slop, described the character as boyish and avoided things suggesting a stately older elf, etc.

But all those things failed until I turned cfg down a little bit. Now some of the gens have earrings and some don't.

>>107007052
"A blurry grainy scan of an old pulp fantasy illustration from 1957." I'm sure there's a lot of room for improvement there.

Anonymous 10/25/2025, 9:55:27 PM No.107007076 [Report] >>107007122 >>107007604

file.png md5: 3d06fc86...

res2s, beta57, 20 steps, 4 cfg.
man these fucking hands
my maximum permitted gen time is around 100s, this gen took 110s.

Anonymous 10/25/2025, 9:55:54 PM No.107007082 [Report]

>>107007058
>I'm sure there's a lot of room for improvement there.
E.g., I am now going to try "pulp adventure" instead of fantasy because the word fantasy is too closely associated with modern slop

Anonymous 10/25/2025, 9:59:22 PM No.107007107 [Report]

how do i gen girlfailures?

Anonymous 10/25/2025, 10:04:34 PM No.107007117 [Report]

>>107006975
>t. /de3/ vramlet jelly of /ldg/ chads' epic booba gens
lmao

Anonymous 10/25/2025, 10:04:34 PM No.107007122 [Report]

file.png md5: 2c3df5c7...

>>107007076
adding this negative:
deformed hands, bad anatomy, extra limbs, poorly drawn hands, poorly drawn face, mutation, deformed, extra eyes, extra arms, extra legs, malformed limbs, fused fingers, too many fingers, long neck, cross‑eyed, bad proportions, missing arms, missing legs, extra digit, fewer digits

seems to have fixed some of the issues actually. I prefered the older image overall composition and tone tho.

Anonymous 10/25/2025, 10:09:05 PM No.107007138 [Report]

1745396937741.png md5: 20f68e1a...

Can't make Minthy/Rouwei-T5Gemma-adapter_v0.2 work. Provided workflow requires full Gemma so I add a gguf loader node, but then I get a picrel error. LLM SDXL nodepack updooted to 3.0.1.

Anonymous 10/25/2025, 10:45:29 PM No.107007243 [Report] >>107007257 >>107007292 >>107007329 >>107007430

1739337936477317.png md5: d0fce10e...

Pony v7 q8 , fp32 vae and clip,
official comfyui workflow, 30 steps
stress test

style_cluster_1610, score_9, Detailed photograph RAW of seven smiling friends of different races that are at a nightclub concert with dim lighting that is shining on their faces, behind them is a crowd of people dancing while fighting with large swords, everyone is holding a sword in their left hand and an intricate beer glass with differently colored beer in the right hand. Far behind them above the DJ there is a sign which has "Minimum drinKing age 021!" written on it in stylized cursive letters.

Anonymous 10/25/2025, 10:45:31 PM No.107007244 [Report] >>107007549

0064.jpg md5: 17f4248e...

>>107007058
I tried some of those but in my case I got the earrings still and even lowering the denoising to 0.5 didn't get rid of it. Interestingly, I don't have the problem with the non flash chrome models, such as Chroma-DC-2K-T2-SL4-bf16

Anonymous 10/25/2025, 10:47:07 PM No.107007256 [Report] >>107007267

>>107007022
that v6 tagmine spreadsheet wasnt created by the author so

Anonymous 10/25/2025, 10:47:03 PM No.107007257 [Report] >>107007269 >>107007292 >>107007329

1739355381960379.png md5: d1c30ec6...

>>107007243
Different seed

Anonymous 10/25/2025, 10:47:49 PM No.107007267 [Report]

>>107007256
since I've made that post I've read on the colab the styles groups go from 1 to 2048

Anonymous 10/25/2025, 10:48:10 PM No.107007269 [Report] >>107007292 >>107007299

1745791288575489.png md5: 4d0c9443...

>>107007257
Different seed and without "style_cluster_1610" in the prompt

Anonymous 10/25/2025, 10:50:32 PM No.107007292 [Report]

>>107007243
>>107007257
>>107007269
takes me back to good ol' SD1.4 days

Anonymous 10/25/2025, 10:50:44 PM No.107007297 [Report] >>107007307 >>107007353

the style_cluster thing for sure is cumbersome, but atleast it has artists in it in some form. I don't mind if I have to look up a table. wish we had that in chroma instead of fucking NOTHING AT ALL.

Anonymous 10/25/2025, 10:50:54 PM No.107007299 [Report] >>107007361

1752473720551238.png md5: 399a3984...

>>107007269
Different seed and without "style_cluster_1610, score_9" in the prompt

Anonymous 10/25/2025, 10:52:12 PM No.107007307 [Report] >>107007341

>>107007297
i assume the clusters do not allow for prompting individual artists which is a huge fucking kick in the nuts for no reason other than muh morals

Anonymous 10/25/2025, 10:54:22 PM No.107007329 [Report]

>>107007243
>>107007257
Goddamn, the sd1.5 and chroma merge lookin fire

Anonymous 10/25/2025, 10:55:56 PM No.107007338 [Report] >>107007346

Stop it with the Pony posts that's like seeing gore

Anonymous 10/25/2025, 10:56:22 PM No.107007341 [Report] >>107007385

>>107007307
what the fuck? I assumed that was the whole point of clusters. Maybe I should actually read the docs.

Anonymous 10/25/2025, 10:57:32 PM No.107007346 [Report]

>>107007338
You must have knowledge of the turd to appreciate the beauty of better models like Pixart Sigma.

Anonymous 10/25/2025, 10:58:17 PM No.107007353 [Report] >>107007443

>>107007297
Chroma is a base model. Why should a base model have ridiculous style tags?

Chroma as can be tuned by anyone for any purpose. That's what makes it special.

Anonymous 10/25/2025, 10:59:26 PM No.107007361 [Report] >>107007365 >>107007369 >>107007383

1739366408357497.png md5: 4584d819...

>>107007299
score_9, medieval magical intricate and detailed world, princess taking a selfie in a pink ball dress, long ginger hair, pale skin, huge breasts, smile

Anonymous 10/25/2025, 11:00:13 PM No.107007365 [Report] >>107007467

>>107007361
30 steps is too low, try 40

Anonymous 10/25/2025, 11:00:29 PM No.107007369 [Report] >>107007383 >>107007467

1733811948766349.png md5: 07a4fb84...

>>107007361
Same seed without "score_9"

Anonymous 10/25/2025, 11:00:52 PM No.107007375 [Report] >>107007388

1753642459104436.png md5: 4339e5ee...

randomizing cluster styles now

Anonymous 10/25/2025, 11:01:39 PM No.107007382 [Report]

ponyv7 is atrociously bad what the actual fuck

Anonymous 10/25/2025, 11:01:39 PM No.107007383 [Report]

>>107007361
>>107007369
horrible

Anonymous 10/25/2025, 11:01:48 PM No.107007385 [Report] >>107007395 >>107007438

>>107007341
the author has a strange heretic perversion to releasing models to the public which can be prompted with artist names. his own secret versions however do not have this problem
what a faggot

Anonymous 10/25/2025, 11:02:11 PM No.107007388 [Report] >>107007405

Ponyv7_20251025_00001_.png md5: ee4d98b8...

>>107007375
same seed

Anonymous 10/25/2025, 11:03:21 PM No.107007395 [Report]

>>107007385
>perversion
*aversion

Anonymous 10/25/2025, 11:04:33 PM No.107007405 [Report] >>107007427

Ponyv7_20251025_00002_.png md5: 0ad4ca4f...

>>107007388

Anonymous 10/25/2025, 11:07:16 PM No.107007427 [Report] >>107007568

Ponyv7_20251025_00004_.png md5: 2f5dcd13...

>>107007405
this model is so fucking bad.
gradually losing ALL hope

Anonymous 10/25/2025, 11:07:24 PM No.107007430 [Report]

1611853324372.png md5: 1fb936a3...

>>107007243
Netalumina v3.5, without style and score
lol

Anonymous 10/25/2025, 11:08:39 PM No.107007438 [Report]

>>107007385
if he released the artist ids, I bet people would even overlook the massive flaws

Anonymous 10/25/2025, 11:09:02 PM No.107007443 [Report] >>107007464

>>107007353
Wasn't it supposed to have them but they fucked up the captioning or something? I don't know, just something I read. You think we're gonna get a chroma finetune? As a dumb user, I don't really care what type the model is. All I know is chroma with artist styles would be sweet.

Anonymous 10/25/2025, 11:12:30 PM No.107007464 [Report]

>>107007443
it's not something you can mess up by mistake

Anonymous 10/25/2025, 11:13:31 PM No.107007467 [Report] >>107007572

1746136109386459.png md5: 40ee7abe...

>>107007369
>>107007365
40 steps

score_9, Attractive medieval princess taking a selfie in a pink ball dress, long ginger hair, pale skin, large breasts, smile. She is at the top of a tall stone tower, with a large window behind her that overlooks a huge and crowded medieval city at sunrise.

Anonymous 10/25/2025, 11:19:24 PM No.107007499 [Report] >>107007538 >>107007616 >>107008250

https://xcancel.com/JustinLin610/status/1982052327180918888#m
>Alibaba's CEO is asking himself why Open Source doesn't have udio at home
be the change you want to see, make Qwen Audio or something lol

Anonymous 10/25/2025, 11:22:35 PM No.107007526 [Report] >>107007687

is this stupid faggot going to post every single gen he makes? fuck off already

Anonymous 10/25/2025, 11:23:48 PM No.107007536 [Report] >>107007643 >>107007659 >>107007724 >>107007875

1739566236477949.png md5: f820c04c...

https://github.com/fal-ai/flashpack
Then do it Comfy, I'd like to load my models faster, especially with Wan 2.2 when this model is all about unloading/reloading between the HIGH and the LOW model

Anonymous 10/25/2025, 11:24:26 PM No.107007538 [Report] >>107007565

>>107007499
>we r working on it and it won't be far. i am just curious about the status

Why talk? Talk is cheap. Give me something that is Udio tier, Apache 2 licensed or I sleep. We don't want another Songbloom or ACE Step.

Anonymous 10/25/2025, 11:25:45 PM No.107007549 [Report]

>>107007244
Yeah flash is harder, which is partly what makes it fun.

As frustrating as models like that can be, fighting against them feels more like a game. Whereas with something more broad like Chroma base it's hard to know what you can do other than wait and get lucky

Anonymous 10/25/2025, 11:27:56 PM No.107007565 [Report] >>107007718

>>107007538
this, they can definitely do it, do it chinks!

Anonymous 10/25/2025, 11:28:30 PM No.107007568 [Report]

Ponyv7_20251025_00015_.png md5: 0ad03f13...

>>107007427

Anonymous 10/25/2025, 11:28:55 PM No.107007572 [Report] >>107007586

1760941687907711.png md5: 0abd2271...

>>107007467
Same except 1536x1536, which takes ~7s per step on a 3090, making this take 4-5 minutes per image. Almost the same time it takes to generate a full coherent 5s 32fps video today with Wan 2.2 lightx2v.

Unless the model will somehow be saved with "proper" prompting to take out the style knowledge which will also somehow fix the detail gore and almost make it into a completely new and better model too, it's sadly DOA.

Anonymous 10/25/2025, 11:29:04 PM No.107007576 [Report]

butiful

Anonymous 10/25/2025, 11:29:59 PM No.107007586 [Report]

1756522413972141.png md5: 0a695bdd...

>>107007572
Different seed.

Anonymous 10/25/2025, 11:30:22 PM No.107007590 [Report] >>107007598 >>107007651

so what's the best wan 2.2 lora combo with the new loras?

Anonymous 10/25/2025, 11:31:28 PM No.107007598 [Report] >>107007603 >>107008143 >>107008619 >>107008752 >>107009030 >>107009316

>>107007590
New HIGH:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors

Old LOW:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors

4 steps, cfg 1, unipc

Anonymous 10/25/2025, 11:31:38 PM No.107007599 [Report] >>107008430 >>107008443 >>107008684

>>107005507
Nice, did anyone try these SVI loras with wan2.2? What weight did you use? How did you make it work for longer videos?

Anonymous 10/25/2025, 11:32:48 PM No.107007603 [Report] >>107007609

>>107007598
1 strength for both?

Anonymous 10/25/2025, 11:33:01 PM No.107007604 [Report]

>>107007076
>res2s
res3m should be superior and faster too

Anonymous 10/25/2025, 11:33:20 PM No.107007609 [Report]

>>107007603
Yes

Anonymous 10/25/2025, 11:34:17 PM No.107007616 [Report]

>>107007499
good if he makes something

Anonymous 10/25/2025, 11:34:29 PM No.107007619 [Report] >>107007625 >>107007647

My friend's cousin works for OpenAI and he says they have a secret internal model not ready for public release yet, it's so powerful that you can type in your street address and it will show you pictures of your house, you can even prompt inside and you'll see yourself

Anonymous 10/25/2025, 11:35:37 PM No.107007625 [Report]

>>107007619
i will finally know what my oneitis' vagina looks like

Anonymous 10/25/2025, 11:36:02 PM No.107007628 [Report]

1757969225956232.jpg md5: 9b3c3f7c...

>sky, up in the clouds, heaven, pearly gates, the kingdom of heaven

Anonymous 10/25/2025, 11:38:18 PM No.107007643 [Report] >>107007715 >>107007715

file.png md5: 0bba3ed5...

>>107007536
Does that also apply to gguf? Or files in general?

Anonymous 10/25/2025, 11:38:45 PM No.107007647 [Report] >>107007661

>>107007619
My uncle works at Nintendo and he said the next Zelda is gonna be fully dynamically generated by a next-gen GPT model that runs on a VR brain implant

Anonymous 10/25/2025, 11:38:55 PM No.107007651 [Report]

>>107007590
>lora combo
the one with the stuff you want in your video

Anonymous 10/25/2025, 11:40:37 PM No.107007659 [Report]

HANKH.jpg md5: 3959e53a...

>>107007536
Model load is the most frustrating thing about comfyui...

>wan2.1
>takes minutes at the sampler then starts genning or clip keeps offloading then loads forever or memory leaks after 5 gens where I have to force close comfy
>all-in-slop
>constantly offloads the entire fucking model and have to wait another 10 minutes for it to all load again
>wan 2.2
>while the fastest and least pain in the ass, constant and increasing pausing in between high and low generation

Anonymous 10/25/2025, 11:40:41 PM No.107007661 [Report]

>>107007647
it would still be better than nu-Open World slop zelda

Anonymous 10/25/2025, 11:41:40 PM No.107007666 [Report] >>107007682

00011-2854334830.png md5: 16879b18...

Anonymous 10/25/2025, 11:43:43 PM No.107007682 [Report]

>>107007666
how could you do this to me?
>>107007665

Anonymous 10/25/2025, 11:43:48 PM No.107007685 [Report]

1740937397271342.jpg md5: b01a2e5c...

Anonymous 10/25/2025, 11:44:05 PM No.107007687 [Report] >>107007880

>>107007526
Discussion of free and open source models, faggot

Anonymous 10/25/2025, 11:48:33 PM No.107007715 [Report] >>107007724 >>107007730

>>107007643
>gguf
No.

>>107007643
>files in general
No, it's for safetensor files and you need to convert them to a flashpack format.

Anonymous 10/25/2025, 11:49:08 PM No.107007718 [Report] >>107007754

5121255412154.png md5: 025ae4a6...

>>107007565
Local has a lot of work to do.

Audio inpainting. Audio upscaling/etc... The bar is literally just a decent model that can do it all.

Anonymous 10/25/2025, 11:50:00 PM No.107007724 [Report] >>107007767

>>107007715
>you need to convert them to a flashpack format.
Comfy said you don't need to convert them, just use their methods on safetensors (look at pircel) >>107007536

Anonymous 10/25/2025, 11:50:40 PM No.107007730 [Report]

>>107007715
Can't wait to load my sdxl models super fast!

Anonymous 10/25/2025, 11:54:05 PM No.107007754 [Report] >>107007777 >>107007934

>>107007718
udio is the sota on this, but it has the same limitations as any other closed models :
- you can't ask it to make music "in the style of" (make me music like michael jackson "man in the mirror" -> moderation backend and blocked), though now you can send music to it, but it's not the same.
- you can't train it, make specialized "loras".
- anything sexual is moderated (think a sensual song).

Anonymous 10/25/2025, 11:56:06 PM No.107007767 [Report]

>>107007724
Would be nice but I'm not sure anyone would work on that.
It can shave off quite a lot of time with complex multistage samplers setups.

Anonymous 10/25/2025, 11:56:56 PM No.107007777 [Report] >>107007816 >>107007836

>>107007754
>- anything sexual is moderated (think a sensual song).
that's why it's the most hated audio software of female rappers
https://www.youtube.com/watch?v=1Gt9TTjAMvw

Anonymous 10/26/2025, 12:02:05 AM No.107007816 [Report]

>>107007777
Sure, but that's not sensual, that's just crude and vulgar, never been into these songs. Zero eroticism.

Anonymous 10/26/2025, 12:05:00 AM No.107007836 [Report]

>>107007777
yeah I guess kek

Anonymous 10/26/2025, 12:05:24 AM No.107007843 [Report] >>107007882 >>107007918

Are they ever gonna make shorter GPUs? I can't fit anything longer than 300mm in my midtower so I'm stuck with 10 VRAM

Anonymous 10/26/2025, 12:06:34 AM No.107007849 [Report] >>107007879 >>107007914 >>107008271

So does anyone have replacement recommendations to this?
https://github.com/1038lab/ComfyUI-JoyCaption
It refuses to use CUDA and does inference on the CPU. Taking a whole minute.

Anonymous 10/26/2025, 12:09:31 AM No.107007875 [Report]

1741893554978335.png md5: 98b37110...

>>107007536
kek

Anonymous 10/26/2025, 12:09:45 AM No.107007879 [Report] >>107007927

>>107007849
any VLM model will use cuda, if you're gonna figure out how to make it work for one then it may as well be joycap

Anonymous 10/26/2025, 12:09:45 AM No.107007880 [Report]

>>107007687
he's talking to himself and spamming the same 1girl, kill yourself, no discussion is being had

Anonymous 10/26/2025, 12:09:54 AM No.107007882 [Report] >>107007925

>>107007843
With the way things are going, I doubt it.
Well they will, but you'll get the less powerful stuff.

Anonymous 10/26/2025, 12:10:55 AM No.107007890 [Report] >>107007904

if I gen a 10s (161 frames) video on wan, is there a way to prompt it to do one thing then another without the second taking over immediatly?
"she types on a computer for 3 seconds, then she gets up and walks away"

Anonymous 10/26/2025, 12:12:14 AM No.107007904 [Report] >>107007943

>>107007890
what if the second thing is waiting, then the actual second thing becomes the third thing

Anonymous 10/26/2025, 12:12:18 AM No.107007908 [Report]

00017-1949654307.png md5: fb4f8f71...

Anonymous 10/26/2025, 12:12:40 AM No.107007914 [Report] >>107007927

>>107007849
the joycaption repo has a gradio interface right?

Anonymous 10/26/2025, 12:13:13 AM No.107007918 [Report]

>>107007843
no, we reached the limits on the size of a transistor, so the only way for them to get more powerful gpus is to make them bigger, the gold rush is over

Anonymous 10/26/2025, 12:13:22 AM No.107007920 [Report] >>107007931

Does anyone here know about superesolution models?

I want to train a model with my own dataset, because my dataset shares the same colours, patterns and style, but it has low resolution images, so I want to upscale them as faithfully as possible.

Please somebody help me

Anonymous 10/26/2025, 12:13:42 AM No.107007925 [Report]

>>107007882
the only options for me are the ada 48GB which is not worth it and theres one 5070 ti thats like 285mm but also not worth it. I guess I'll have to wait because I happen to think the 5090 is also a bad investment

Anonymous 10/26/2025, 12:13:50 AM No.107007927 [Report] >>107007933

>>107007879
I have no idea what you are trying to say.
It doesn't load anything to VRAM, has no GPU usage, CPU at 100% and is slow.
Maybe some other bug or whatever but it's not working as intended.
I asked for alternatives for joy caption inference.
>>107007914
If you are referring to hugging face one that has usage limits.
I am trying to mass tag images for lora training.
That's why I am trying to set up local.

Anonymous 10/26/2025, 12:14:13 AM No.107007931 [Report] >>107008242

>>107007920
just play with seedvr2 to upscale them

Anonymous 10/26/2025, 12:14:21 AM No.107007933 [Report] >>107007967

>>107007927
>If you are referring to hugging face one that has usage limits.
no, I mean the github repo

Anonymous 10/26/2025, 12:14:23 AM No.107007934 [Report] >>107007957

>>107007754
I do recall people making stuff in the same style just by inputting lyrics back when that was allowed.

Look at this
https://www.404media.co/listen-to-the-ai-generated-ripoff-songs-that-got-udio-and-suno-sued/

Obviously there's more, including a popular one
https://www.udio.com/songs/nDKNwPUB6GrMhEfvM6v2u1

Though it's more like a cover

Anonymous 10/26/2025, 12:15:27 AM No.107007943 [Report]

>>107007904
ok worth a try, thanks anon

Anonymous 10/26/2025, 12:16:37 AM No.107007957 [Report]

>>107007934
Yeah, this is where local would shine.

Anonymous 10/26/2025, 12:17:27 AM No.107007967 [Report] >>107007979

>>107007933
Gradio interfaces are typically hosted at hf and github repo links to hf for online demo as well.
Unless you are referring to something else.

Anonymous 10/26/2025, 12:18:28 AM No.107007979 [Report] >>107008007

>>107007967
a1111 and all it's forks are using gradio locally. you are being retarded

Anonymous 10/26/2025, 12:19:42 AM No.107007994 [Report] >>107008166

If anons here use torch nightly wheels, when I updated from the one in the beginning of October to the 22nd one (2.10.0.dev20251022+cu130), suddenly sage attention broke completely, it looks like an issue where everything defaulted to CPU instead of CUDA, making the sampler throw an error.
Going back to the 1002 version made it work fine again.

Anonymous 10/26/2025, 12:21:32 AM No.107008007 [Report] >>107008032

>>107007979
Oh you mean this?
https://github.com/fpgaminer/joycaption/tree/main/gradio-app
I guess I can try that.
When you said it like that I expected some sort of link to somewhere.

Anonymous 10/26/2025, 12:23:05 AM No.107008032 [Report]

>>107008007
I am too lazy to look for you but you figured it out. gold star for you

Anonymous 10/26/2025, 12:29:05 AM No.107008071 [Report]

>monthly pytorch mismatch between custom nodes that requires a reinstall
here we go

Anonymous 10/26/2025, 12:30:20 AM No.107008079 [Report] >>107008096 >>107008680

>50s/it WAN with random crashes on ROCM 7
>100s/it WAN with guaranteed stability on ROCM 6
suffering

Anonymous 10/26/2025, 12:32:28 AM No.107008096 [Report] >>107008111

>>107008079
>he broughtered'ed AMD
why??

Anonymous 10/26/2025, 12:33:45 AM No.107008111 [Report] >>107008135 >>107008183

>>107008096
Because fuck nvidia. Also gaming under Linux is less of a hassle with AMD.
It's fine, I don't have a fried attention span. I can cope.

Anonymous 10/26/2025, 12:36:50 AM No.107008135 [Report]

>>107008111
>It's fine, I don't have a fried attention span. I can cope.
I'm not sure you cope this well, you literally complained about the lack of speed here lool

Anonymous 10/26/2025, 12:37:34 AM No.107008143 [Report]

>>107007598
thank you anon the pajeet doesnt deserve your grace

Anonymous 10/26/2025, 12:40:16 AM No.107008166 [Report]

>>107007994
I had the same issue, if it's this : https://github.com/pytorch/pytorch/issues/166104

then it's "working as expected" apparently, so it means we need to get sage attention team to update or be stuck with early october torch

Anonymous 10/26/2025, 12:42:29 AM No.107008179 [Report]

1760966609883945.jpg md5: 925dfb71...

Anonymous 10/26/2025, 12:42:53 AM No.107008183 [Report]

>>107008111
Why did you make a post kvetching about speed and stability if you were going to immediately get defensive and coping lol.

Anonymous 10/26/2025, 12:47:32 AM No.107008205 [Report]

00023-3922286591.png md5: 8936c4a9...

Anonymous 10/26/2025, 12:52:19 AM No.107008242 [Report] >>107008372

>>107007931
That's not what I need, I want to train a resolution model with my own database.

The idea is to have pairings of images and teach the model what pairings are a correct upscaling.

Anonymous 10/26/2025, 12:52:55 AM No.107008250 [Report]

1596061505572.png md5: 39388eae...

>>107007499
>make Qwen Audio or something lol
they will do it, but for api, kek

Anonymous 10/26/2025, 12:54:29 AM No.107008261 [Report] >>107008489

Does ComfyUI patch lora weights into the model by default? Doesn't seem so, why isn't this the case? Wouldn't it help a lot for vram size for multiple loras? Can it be enabled somehow?

Anonymous 10/26/2025, 12:56:05 AM No.107008271 [Report]

>>107007849
skill issue literally. installa a llama-cpp-python version that has CUDA compiled in it, otherwise manually build the wheel using the correct compile flags (literally contained in this node repo through a script):
https://github.com/1038lab/ComfyUI-JoyCaption/blob/main/llama_cpp_install/llama_cpp_install.py
you're fucking retarded and should kys unironically retarded faggot brown

Anonymous 10/26/2025, 1:02:42 AM No.107008318 [Report]

1755978351307033.png md5: db42d51f...

Anonymous 10/26/2025, 1:04:23 AM No.107008334 [Report] >>107008364 >>107008509

>you still have to wait a few minutes to OOM on the first comfy video gen before the second allocates properly and works from then on
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

Anonymous 10/26/2025, 1:08:13 AM No.107008364 [Report] >>107008376

>>107008334
stop being poor and buy a proper video card faggot

Anonymous 10/26/2025, 1:09:32 AM No.107008372 [Report]

>>107008242
>The idea is to have pairings of images and teach the model what pairings are a correct upscaling.
Elaborate

Anonymous 10/26/2025, 1:10:03 AM No.107008376 [Report] >>107008393

1747274203092719.png md5: dc090bdd...

>>107008364
Write proper memory alloc code faggot

Anonymous 10/26/2025, 1:12:56 AM No.107008393 [Report] >>107008398 >>107008414 >>107008425 >>107008448

file.png md5: e5df3d07...

>>107008376
>24gb
>not vramlet cope tier
stop being poor jamal, youre embarassing yourself

Anonymous 10/26/2025, 1:13:55 AM No.107008398 [Report]

>>107008393
Write proper memory alloc code faggot

Anonymous 10/26/2025, 1:14:53 AM No.107008409 [Report]

frames.jpg md5: c9b0146f...

i..I'M GOING TO OOM, AHHHHHHHH

Anonymous 10/26/2025, 1:15:07 AM No.107008414 [Report]

>>107008393
>0% utilization
embarassing indeed

Anonymous 10/26/2025, 1:16:33 AM No.107008425 [Report] >>107008475

>>107008393
>all that compute for 1girl, standing, ai-generated

Anonymous 10/26/2025, 1:16:59 AM No.107008430 [Report] >>107008529 >>107008541

>>107007599
didn't work at 5 strength for me so i gave up because wan 2.1 is shit and wan 2.2 5b will be even worse because wan 5b has no speed up lora's as far as i know, second wan2.2 vae is fucking slow and it oom sometime unless --reserve-vram 1.0 is used, oh an 5b produces shit t resolutions below 1280 x 704 or what ever.

total waste of time unless they train it on wan 2.2 14b 720p or heck even 480p, what's more is we don't even know if we are using it correctly. Yeah i think it will be forgotten about and never heard of again.

Anonymous 10/26/2025, 1:19:17 AM No.107008443 [Report] >>107008541

>>107007599
well reading that post maybe it will be fixed and working, but i won't hold my breath for it because its already limited in what it can actually do with wan 2.1 and its limited to 832 x 480 or it is slow as fuck.

Anonymous 10/26/2025, 1:19:40 AM No.107008448 [Report] >>107008475

>>107008393
>600 watts idle
waow

Anonymous 10/26/2025, 1:22:51 AM No.107008475 [Report] >>107008497 >>107008506 >>107008519 >>107008627

>>107008425
im actually running REAL LLMs in here, sadly at q8 quant (GLM 4.6, 400gb~), fully in VRAM unlike you poors who make do with q2ks poverty tier quants and offload to CPU anyway lmao. I use the spare 200gb~ to load FULL precision video/audio/image models to delive a superior and immersive chat experience, with SOTA imagen/textgen/audiogen/voicegen all happening automatically as I rp with my waifus.
>>107008448
these are rented in a datacenter, no way I can run this shit at home. Also memed my company into doing it, bunch of clueless retard, im feeding them shit from bedrock itself while using the real cluster for myself.

Anonymous 10/26/2025, 1:24:33 AM No.107008489 [Report]

>>107008261
you can minimize the vram use by merging, but I don't think it's possible to do on the fly so we are stuck with model + lora sizes

Anonymous 10/26/2025, 1:25:22 AM No.107008497 [Report]

>>107008475
enjoy it while it lasts bruh

Anonymous 10/26/2025, 1:26:23 AM No.107008506 [Report]

>>107008475
maybe if you put in as much effort into real life you could have a real girl friend?

Anonymous 10/26/2025, 1:26:27 AM No.107008508 [Report] >>107008523 >>107008710

Every now and again I dream about picking up a data center GPU for fire sale prices after the crash happens and then I remember the insane power draw and the fact that they apparently use total loss water cooling.

Anonymous 10/26/2025, 1:26:30 AM No.107008509 [Report] >>107008524

>>107008334
I don't have the issue, what video gen resolution and how many frames? Can you share your wf?

Anonymous 10/26/2025, 1:27:33 AM No.107008519 [Report] >>107008545

>>107008475
>Also memed my company into doing it, bunch of clueless retard, im feeding them shit from bedrock itself while using the real cluster for myself.
They don't even see they pay twice?

Anonymous 10/26/2025, 1:28:01 AM No.107008523 [Report]

>>107008508
>after the crash happens
Keep dreaming

Anonymous 10/26/2025, 1:28:03 AM No.107008524 [Report] >>107008559

>>107008509
Maximum resolution and frames, Q8
https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper

Anonymous 10/26/2025, 1:28:53 AM No.107008529 [Report] >>107008541

>>107008430
>total waste of time unless they train it on wan 2.2 14b 720p or heck even 480p, what's more is we don't even know if we are using it correctly. Yeah i think it will be forgotten about and never heard of again.
They released the training code so maybe some rich anon will do it.

Anonymous 10/26/2025, 1:30:11 AM No.107008541 [Report] >>107008588

>>107008430
>>107008443
>>107008529
OK I guess we'll wait then.

Anonymous 10/26/2025, 1:31:04 AM No.107008545 [Report] >>107008564

psxhr_flux.krea_0009.png md5: 5f9b8301...

>>107008519
>believing larpers

Anonymous 10/26/2025, 1:32:29 AM No.107008559 [Report]

1749447214725219.png md5: cf7d4604...

>>107008524
>Maximum resolution and frames
720p and 81 frames? Do you blockswap?
Try block swapping 5 blocks for example, see if it works. As long as you have enough ram, the difference in speed is minimal.

Anonymous 10/26/2025, 1:33:00 AM No.107008564 [Report] >>107008597

>>107008545
wtf bfl, this was almost unsafe

Anonymous 10/26/2025, 1:36:10 AM No.107008588 [Report]

>>107008541
sorry for the black pill bro but i've done some testing the base wan 2.2 5b model today. it don't work with lightx speed lora's so its actually slower than just using wan 2.2 high and low on my machine. The vae is a pain in the ass as well, its really slow to decode on my rtx 3060. it might be alright some people but the quality is shit if using resolutions below 1280 x 720

so yeah its slow and i really don't know why people with lower vram use it when they could just use Q4 gguf high and low models and get much better quality and faster due to speed lora's. So I'm not gonna bother when they release the wan2.2 5B version.

tl;dr is was DOA

Anonymous 10/26/2025, 1:37:15 AM No.107008597 [Report] >>107008604

psxzstyle_flux.krea_0009.png md5: 87806cac...

>>107008564
we definitely keep it safe around here

>https://files.catbox.moe/aasfd1.png

Anonymous 10/26/2025, 1:37:53 AM No.107008604 [Report] >>107008773

>>107008597
wish it was good at making filled used condoms

Anonymous 10/26/2025, 1:39:13 AM No.107008614 [Report] >>107008632

wan gen :
[Subject Description] + [Scene Description] + [Motion Description] + [Aesthetic Control] + [Stylization]

Example: "A young woman in a red dress (subject), standing in a bustling neon-lit city street at night (scene), walks forward then stops to look up at the rain, slow motion tracking shot (motion), cinematic lighting, moody atmosphere (aesthetic), film noir style (stylization)"

Anonymous 10/26/2025, 1:40:09 AM No.107008619 [Report] >>107008791

1745458766296140_thumb.jpg.webm md5: ed4e3170...

WebM not supported

>>107007598
forgot to set size for the vid but it still worked pretty good.

the anime girl runs to the left out the door and closes it.

ty anon

Anonymous 10/26/2025, 1:41:03 AM No.107008627 [Report]

>>107008475
How many wan frames can you load and how fast is your wan gens?

Anonymous 10/26/2025, 1:41:37 AM No.107008632 [Report]

>>107008614
I just wish wan could handle longer prompts for more actions. It hardly ever works for me even when using context windows and 161+ frames at 81 frame chunks with overlap.

Anonymous 10/26/2025, 1:49:53 AM No.107008680 [Report] >>107008699

>>107008079
My ROCM 7 has been rock solid after upgrading to the official stuff. ComfyUI amd memory management update also swagged my shit out.
50s/it seems pretty good for wan with amd, which card you got?

Anonymous 10/26/2025, 1:50:21 AM No.107008684 [Report] >>107008719

Screenshot_2025-10-26_00-46-32.png md5: bc070564...

>>107007599
https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1519#issuecomment-3440759925

I tried telling anons this when it first dropped, it can't be as easy as just 1 lastframe because base wan does not have any concept of the previous genned video. It treats each as a new video, so to me it looks more like how wananimate works in taking 5 previous frames to continue the motion since 1 frame isn't enough to continue motion with.

People were making videos with it but those would have been placebo gens.

Anonymous 10/26/2025, 1:51:51 AM No.107008699 [Report] >>107008739

>>107008680
9070 xt. Not sure what you mean by "official stuff", I'm using the nightly URL from the official Pytorch page. I haven't upgraded Comfy in a while either.

Anonymous 10/26/2025, 1:53:07 AM No.107008710 [Report] >>107008725 >>107008731

>>107008508
>and the fact that they apparently use total loss water cooling
is this one of those cases of turbo-niggering the environment to save $0.01?

Anonymous 10/26/2025, 1:54:18 AM No.107008719 [Report] >>107008726

1754900427998614.png md5: bd8d29b4...

>>107008684
How to even feed 5 frames to next video? I don't think it's possible right now, last time I asked it was only 1 frame starting the next video.
So on top of their lora, we need some node to "inject" 5 frames instead of 1 as latent into a sampler.
Basically picrel but 5 frames.

Anonymous 10/26/2025, 1:55:03 AM No.107008725 [Report]

>>107008710
Gigawatt in equals gigawatt out. All that energy has to eventually turn into heat, and there ain't an air conditioning system on this earth that can dissipate 1 GW. That being said, I don't know the specifics, only the basic laws of physics.

Anonymous 10/26/2025, 1:55:20 AM No.107008726 [Report] >>107008748 >>107008788

>>107008719
feed first 5 frames 0 denoise then the rest on the next sampler

Anonymous 10/26/2025, 1:56:01 AM No.107008731 [Report]

>>107008710
it's water in water out, they don't inject wastes from the Gange in it anon

Anonymous 10/26/2025, 1:56:36 AM No.107008739 [Report]

>>107008699
you should upgrade your comfy to at least 0.3.65, very good amd improvements
https://github.com/ROCm/TheRock
I think it's these but I see you're on linux so nevermind. Official windows support I meant.

Anonymous 10/26/2025, 1:57:44 AM No.107008748 [Report] >>107008790

>>107008726
Images in wan aren't processed in series, all the frames are processed at the same time, it's just that the first one is "fixed" in latent, rest is sent as noise.
What we need is to "fix" the 5 first instead.

Anonymous 10/26/2025, 1:58:19 AM No.107008752 [Report] >>107008769

>>107007598
willing to give this a try, i'm assuming scheduler simple?

Anonymous 10/26/2025, 2:01:00 AM No.107008769 [Report] >>107008792

>>107008752
Yes

Anonymous 10/26/2025, 2:01:22 AM No.107008773 [Report]

psxzstyle_0018.png md5: 370de40c...

>>107008604
train a lora man, it takes like two hours

Anonymous 10/26/2025, 2:04:12 AM No.107008788 [Report] >>107008790 >>107008842

>>107008726
won't work, this is shit we tried months ago I'm sure, it just ignores them. and how you even gonna do this? de-noise in advanced sampler the first 5? it won't matter because wan does all frames at once as a new video. it won't magically know there are 5 frames already done. KJ was wrong to assume it needed no codes changes, we gonna need a new node.

Anonymous 10/26/2025, 2:04:47 AM No.107008790 [Report]

>>107008748
>>107008788
vibe code it

Anonymous 10/26/2025, 2:04:51 AM No.107008791 [Report] >>107008892

1749642696327592_thumb.jpg.webm md5: 9bb4428f...

WebM not supported

>>107008619
the new 2.2 MoE high lora works so much better for motion/fluidity. ty kijai for fixing it

Anonymous 10/26/2025, 2:05:16 AM No.107008792 [Report] >>107008797 >>107008853

>>107008769
thanks, but why KJ lora's? Is there something different about them? Or are just extracted from their model?

Anonymous 10/26/2025, 2:05:51 AM No.107008797 [Report] >>107008878

>>107008792
I tested different loras and this combination had the best result

Anonymous 10/26/2025, 2:13:22 AM No.107008842 [Report]

>>107008788
>we gonna need a new node.
Yep, and even better if we do import latent corresponding to the last 5 images instead of degraded images through vae decode, but I'm not even sure that's possible.

Anonymous 10/26/2025, 2:14:08 AM No.107008846 [Report] >>107008851 >>107009795

1756528451241047_thumb.jpg.webm md5: 849cb5f1...

WebM not supported

Anonymous 10/26/2025, 2:15:37 AM No.107008851 [Report]

>>107008846
it started so well, but we didn't get glorious cleavage bouncing

Anonymous 10/26/2025, 2:15:41 AM No.107008853 [Report] >>107008890

>>107008792
NTA, but the LoRAs released by Lightx2v were extracted wrong initially so you had to use KJ extracted LoRAs. They Lightx2v ones were then re-uploaded with correctly working versions at some point. KJ didn't extract the newest I2V Lightx2v LoRAs because they actually did it right on the first try this time. https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main

Anonymous 10/26/2025, 2:20:11 AM No.107008878 [Report] >>107008900 >>107008913

>>107008797
well its no where near as good as my settings and setup, its fucking blurry at 720 x 720

Anonymous 10/26/2025, 2:22:14 AM No.107008890 [Report] >>107008913

>>107008853
lol its still shit i will prove it...

Anonymous 10/26/2025, 2:22:35 AM No.107008892 [Report] >>107009160

1736965521980822_thumb.jpg.webm md5: e58fd9c1...

WebM not supported

>>107008791

Anonymous 10/26/2025, 2:23:59 AM No.107008900 [Report] >>107008913 >>107009030 >>107009641

>>107008878
im using q8 2.2 with https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper

Anonymous 10/26/2025, 2:25:41 AM No.107008909 [Report]

Ok needed to figure out how to integrate CUDA Toolkit into my docker setup but Joy Caption is working now with GPU acceleration, 4 times faster.
You are the thread schizo who regularly shits it up. As such I won't give a (You), but credit where due thank you, bastard.

Anonymous 10/26/2025, 2:26:28 AM No.107008913 [Report] >>107008927

>>107008900
>>107008890
>>107008878
wait a minute i forgot to change the god damn steps start and end... This has probably been why its so blurry lol. I'll check it again.

Anonymous 10/26/2025, 2:27:38 AM No.107008927 [Report] >>107008960

>>107008913
I tried those LoRAs as well and I got some weird hyperspace zoom effect, but I'm just using whatever quants of WAN22 come with Comfy's default workflow.

Anonymous 10/26/2025, 2:35:57 AM No.107008959 [Report] >>107008966 >>107008983 >>107008992 >>107009180

output_t2v_refine_1_thumb.jpg.webm md5: fbaf34be...

WebM not supported

I was able to run the LongCat Video demo. This is the stock prompt:

>prompt = "In a realistic photography style, a white boy around seven or eight years old sits on a park bench, wearing a light blue T-shirt, denim shorts, and white sneakers. He holds an ice cream cone with vanilla and chocolate flavors, and beside him is a medium-sized golden Labrador. Smiling, the boy offers the ice cream to the dog, who eagerly licks it with its tongue. The sun is shining brightly, and the background features a green lawn and several tall trees, creating a warm and loving scene."
>negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"

Generation is in 3 stages (initial, distilled, refined) that each output a video. It took 24 minutes to generate this and 74gb of vram (FP32).

Going to try the long video (1min) generation next and expecting it to take hours.

Anonymous 10/26/2025, 2:36:02 AM No.107008960 [Report]

>>107008927
I'm a quality improving study tonight, so we will see which is best.

Anonymous 10/26/2025, 2:37:31 AM No.107008966 [Report] >>107008971 >>107008974

>>107008959
That is good quality, how long did it take?

Anonymous 10/26/2025, 2:38:28 AM No.107008971 [Report]

>>107008966
>It took 24 minutes to generate this

Anonymous 10/26/2025, 2:39:04 AM No.107008974 [Report]

>>107008966
nvm i didn't read full post 24 mins 74GB vram I'm gonna cry :(

Anonymous 10/26/2025, 2:40:32 AM No.107008983 [Report] >>107009009 >>107009018

>>107008959
>Generation is in 3 stages (initial, distilled, refined)
ok that might mean it could run on smaller cards?

Anonymous 10/26/2025, 2:40:33 AM No.107008984 [Report] >>107008991

>>107006468 (OP)
can you guys make me some realistic apustajas?

Anonymous 10/26/2025, 2:41:57 AM No.107008991 [Report]

>>107008984
https://civitai.com/models/175781/apu-apustaja-model-sd-xl
https://civitai.com/models/679189/apu-apustaja

Anonymous 10/26/2025, 2:42:06 AM No.107008992 [Report] >>107009003

psxannie_0007.png md5: e22eb52e...

>>107008959
thanks for doing this anon, i ran oom on my 3090 multiple times before giving up

Anonymous 10/26/2025, 2:43:30 AM No.107009003 [Report] >>107009016

>>107008992
scully?

Anonymous 10/26/2025, 2:44:42 AM No.107009009 [Report] >>107009018 >>107009024 >>107009048

output_t2v_thumb.jpg.webm md5: 8dd8d837...

WebM not supported

>>107008983
It already uses 55gb on the first pass, but keep in mind it's at FP32. At Q8 the 74gb peak should be down to 18.5, so it would work on a 24gb card.

The first two passes aren't really meant to be used as-is, anyway. First stage here.

Anonymous 10/26/2025, 2:45:21 AM No.107009014 [Report] >>107009030

Kind of annoying how the lightx2v ruins videos with end frames. It distorts right at the ending but without the lora it works fine

Anonymous 10/26/2025, 2:45:34 AM No.107009016 [Report] >>107009038

>>107009003
allison brie you fucking cretin

Anonymous 10/26/2025, 2:45:45 AM No.107009018 [Report]

output_t2v_distill_thumb.jpg.webm md5: 88aec1be...

WebM not supported

>>107009009
>>107008983
Second stage (distill)

Anonymous 10/26/2025, 2:46:34 AM No.107009024 [Report]

>>107009009
>letting your dog lick chocolate syrup
fucking retarded kid

Anonymous 10/26/2025, 2:47:24 AM No.107009030 [Report] >>107009042 >>107009641

>>107009014
Everyone complaining about lightx2v color distortion, flickering, or blurryness or anything else is a workflow issue, i never had any of those with >>107008900
>>107007598

Anonymous 10/26/2025, 2:49:17 AM No.107009038 [Report] >>107009081

>>107009016
she looked better younger

Anonymous 10/26/2025, 2:49:53 AM No.107009042 [Report]

>>107009030
I don't have them either with latest version, pretty nice.

Anonymous 10/26/2025, 2:50:16 AM No.107009048 [Report]

>>107009009
the first stage looks alright desu.
>At Q8 the 74gb peak should be down to 18.5, so it would work on a 24gb card.
Yeah I think we will be eating good again soon.

Anonymous 10/26/2025, 2:54:54 AM No.107009081 [Report]

>>107009038
they all do

Anonymous 10/26/2025, 2:01:33 AM No.107009116 [Report]

output_long_video_0_thumb.jpg.webm md5: 444afc52...

WebM not supported

Running the LongCat 1min demo. It generates 11 segments and chains them together. I'm guessing it'll take about 4.5 hours if it doesn't fail. Here's the initial step of the first 11th.

>prompt = "realistic filming style, a person wearing a dark helmet, a deep-colored jacket, blue jeans, and bright yellow shoes rides a skateboard along a winding mountain road. The skateboarder starts in a standing position, then gradually lowers into a crouch, extending one hand to touch the road surface while maintaining a low center of gravity to navigate a sharp curve. After completing the turn, the skateboarder rises back to a standing position and continues gliding forward. The background features lush green hills flanking both sides of the road, with distant snow-capped mountain peaks rising against a clear, bright blue sky. The camera follows closely from behind, smoothly tracking the skateboarder’s movements and capturing the dynamic scenery along the route. The scene is shot in natural daylight, highlighting the vivid outdoor environment and the skateboarder’s fluid actions."
>negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"

Anonymous 10/26/2025, 2:04:24 AM No.107009129 [Report]

ComfyUI_temp_dbeyl_00046_.png md5: f3285ffd...

Anonymous 10/26/2025, 2:09:51 AM No.107009152 [Report] >>107009157

ComfyUI_temp_dbeyl_00047_.png md5: 479f0a01...

captchas are failing, 4chan is going down

Anonymous 10/26/2025, 2:11:39 AM No.107009157 [Report]

>>107009152
[audience] wooooOOO

Anonymous 10/26/2025, 2:11:56 AM No.107009160 [Report]

1743105653542370_thumb.jpg.webm md5: 23cd1f96...

WebM not supported

>>107008892

Anonymous 10/26/2025, 2:17:05 AM No.107009178 [Report]

ComfyUI_03653_.png md5: ffad9e28...

Anonymous 10/26/2025, 2:17:16 AM No.107009180 [Report] >>107009246

>>107008959
6 second gen took 24 mins to do? that's rough

Anonymous 10/26/2025, 2:20:25 AM No.107009196 [Report]

ComfyUI_temp_ptasu_00097_.png md5: 1193579a...

Anonymous 10/26/2025, 2:23:01 AM No.107009222 [Report]

ComfyUI_temp_ptasu_00099_.png md5: 36fc4553...

Anonymous 10/26/2025, 2:26:07 AM No.107009243 [Report] >>107009301

ComfyUI_temp_ptasu_00101_.png md5: fbbb57cd...

Anonymous 10/26/2025, 2:26:22 AM No.107009246 [Report]

>>107009180
Yeah but so is wan with out speed loras.

now imagine what this thing could do in future.

Anonymous 10/26/2025, 2:32:00 AM No.107009301 [Report] >>107009330

>>107009243
what model?

Anonymous 10/26/2025, 2:32:29 AM No.107009306 [Report]

1730557881155378_thumb.jpg.webm md5: 2e031c67...

WebM not supported

the man in the blue shirt turns and fires a blue energy beam at the plane, which explodes into fire and smoke.

live action dragonball. used unipc instead of euler this time.

Anonymous 10/26/2025, 2:33:19 AM No.107009316 [Report] >>107009333

>>107007598
I'm guessing you mean 8 total steps 4/4 ? because with only 4 total steps its blurry, I'm now trying with 8 total steps with your settings.

Anonymous 10/26/2025, 2:36:05 AM No.107009330 [Report] >>107009349 >>107009357 >>107010268

ComfyUI_temp_dbeyl_00050_.png md5: c9c783f6...

>>107009301
chroma

Anonymous 10/26/2025, 2:36:29 AM No.107009333 [Report] >>107009364

>>107009316
No it's 4 steps total, unipc, 720x1280, 81 frames, q8 wan, umt5 bf16

Anonymous 10/26/2025, 2:39:40 AM No.107009349 [Report]

>>107009330
catbox?

Anonymous 10/26/2025, 2:40:41 AM No.107009357 [Report] >>107009402

>>107009330
>Chroma
Ew

Anonymous 10/26/2025, 2:41:25 AM No.107009364 [Report] >>107009378

>>107009333
its not enough for 720 x 720 that's for sure, its looking a lot better using 8 steps total but I am using q4, umt5 16fp

Anonymous 10/26/2025, 2:42:40 AM No.107009378 [Report] >>107009387 >>107009406 >>107009438 >>107009478

>>107009364
Wan was trained primarily for 1280x720 and 720x1280, and Q4 is too low even for full res anyway

Anonymous 10/26/2025, 2:44:04 AM No.107009387 [Report] >>107009395

>>107009378
>low even for full res anyway
i think not :)

Anonymous 10/26/2025, 2:45:26 AM No.107009395 [Report]

>>107009387
The blurry output certainly thinks so :)

Anonymous 10/26/2025, 2:47:15 AM No.107009402 [Report] >>107009505

ComfyUI_temp_xhrxd_00054_.png md5: 47e09f8e...

>>107009357
Its pretty good once you figure it out how to use it

Anonymous 10/26/2025, 2:47:40 AM No.107009406 [Report] >>107009412 >>107009414

>>107009378
Mine
https://files.catbox.moe/qkl1j3.mp4
6 steps different settings

yours
https://files.catbox.moe/ir5zvj.mp4
8 steps only change to settings

mine looks a bit over cooked and i need to then adjust slight the cfg.

I'm using old 2.1 light lora at 5 strength in high, old 2.2 lora in low at 1.0

Anonymous 10/26/2025, 2:48:39 AM No.107009412 [Report] >>107009414

>>107009406
>10s

Anonymous 10/26/2025, 2:49:32 AM No.107009414 [Report]

>>107009406
same prompt, same seed btw. Q4

>>107009412
ping ponged because why not?

Anonymous 10/26/2025, 2:50:29 AM No.107009421 [Report] >>107009437 >>107010167

ComfyUI_temp_xhrxd_00055_.png md5: 7716a8a6...

Anonymous 10/26/2025, 2:52:53 AM No.107009437 [Report]

>>107009421
built

Anonymous 10/26/2025, 2:53:07 AM No.107009438 [Report]

>>107009378
>1280x720
and i can that res and it does look better even with q4, in fact at that res it looks amazing but it takes more time and i don't care for it. I'm just looking for better settings than what I'm using for the same res 720 x 720 since then i can do 81 frames no problems even with lots of lora. I can do the higher res but i'd need to use the context window nodes. Heh then i can gen really long videos but takes ages on a 3060

Anonymous 10/26/2025, 2:55:45 AM No.107009450 [Report]

1754720305810903.jpg md5: fadd621b...

Anonymous 10/26/2025, 2:59:03 AM No.107009472 [Report] >>107009645 >>107009668 >>107009897

why isn't there any useful chroma lora on civitai?

Anonymous 10/26/2025, 3:00:43 AM No.107009478 [Report] >>107009512

>>107009378
I mean the thing is mate, my settings use cfg so it follows the prompt better and i can use negative prompt if i want. That is entirely the point, its still only 6 total steps so its plenty fast enough, i'm just trying to tweak the cfg so its not so over cooked. The only problem is that these settings I use have a very fine threshold with cfg between the high and low and changing them can cause it to be all messed up.

I don't trust these new lightx lora recently uploaded and I'm not the only one. The old ones still work better with the right settings.

Anonymous 10/26/2025, 3:04:57 AM No.107009505 [Report] >>107009547 >>107009555

>>107009402
>Its pretty good once you figure it out how to use it
>posts a gen with the long torso issue

Anonymous 10/26/2025, 3:05:39 AM No.107009512 [Report] >>107009531 >>107009544 >>107009551

>>107009478
>be absolute retard
>literally every single setting that you can fuck up you fuck up
>while also using a toy quant
>then you also for good measure use lightning models without cfg set to 1
reminder these are your average retards that give their opinions on what is good or bad and complain about their outputs having problems on /ldg/

Anonymous 10/26/2025, 3:09:51 AM No.107009531 [Report]

>>107009512
>>then you also for good measure use lightning models without cfg set to 1
No is forced to do anything they are told and those retards can't even get a proper wan 2.2 version. using it still makes it much faster though which is why i use it. you couldn't get decent quality anything at 6 steps without it.

Anonymous 10/26/2025, 3:12:20 AM No.107009544 [Report]

>>107009512
and i will make eat you shit you just posted within a few minutes once i adjust the cfg from 4 to 3.5 and load the q8 and up the resolution to full 720p then you can eat shit and die

Anonymous 10/26/2025, 3:12:29 AM No.107009547 [Report] >>107009667 >>107009920

ComfyUI_temp_ptasu_00127_.png md5: b7a9bdc0...

>>107009505
>elbows too pointy

Anonymous 10/26/2025, 3:13:33 AM No.107009550 [Report] >>107009556 >>107009576

taking notes.gif md5: 1ffcfab9...

Can I use rule34 anime videos to train realistic LoRAs? After all I'm training just the motion, right?

Anonymous 10/26/2025, 3:13:46 AM No.107009551 [Report]

>>107009512
its not just here its everywhere in ai

Anonymous 10/26/2025, 3:14:46 AM No.107009555 [Report]

>>107009505
I agree that Chroma has its issues but to be fair to Chroma that's not the correct resolution for 1.75 ratio in a 1024p model.

Anonymous 10/26/2025, 3:14:51 AM No.107009556 [Report]

>>107009550
I'm talking Wan btw

Anonymous 10/26/2025, 3:16:06 AM No.107009563 [Report]

1752355531556982.jpg md5: 14434f13...

Anonymous 10/26/2025, 3:17:51 AM No.107009576 [Report]

>>107009550
Hypothetically with a very diverse dataset and zero overlearning random noise, yes.
In practice the realism will be lower than training on realistic/mixed dataset of same quality. The difference may or may not matter too much though.
Note that I never trained video diffusion. Just guessing from what I know about images.

Anonymous 10/26/2025, 3:26:52 AM No.107009641 [Report] >>107009659 >>107009764 >>107009978

>>107009030
I'm using the WAN 2.2 I2V workflow from >>107008900
but it's still fucking up the last frame or so when I set an end frame

Anonymous 10/26/2025, 3:27:09 AM No.107009645 [Report]

>>107009472
Most of its community still thinks Flux is SOTA. And then you have all the Chinks who have successfully shilled Qwen.

Anonymous 10/26/2025, 3:28:29 AM No.107009656 [Report]

1758160382086397.png md5: b79c7d9b...

Anonymous 10/26/2025, 3:28:51 AM No.107009659 [Report] >>107009764

>>107009641
Actually after checking the last few runs it does seem a little more stable but the first two got distorted for some reason

Anonymous 10/26/2025, 3:29:52 AM No.107009666 [Report] >>107009792

1754816819286029_thumb.jpg.webm md5: cdae2ae4...

WebM not supported

hatsune miku runs in from the left and waves hello.

new lora combo is smooth (MoE high + 2.2 lightning low)

Anonymous 10/26/2025, 3:30:06 AM No.107009667 [Report]

>>107009547
use an upscaler (with sampler) to fix those lines nigga

Anonymous 10/26/2025, 3:30:08 AM No.107009668 [Report] >>107009682 >>107009710

>>107009472
There's several good loras for Chroma just civitai are cucks and take them down almost as immediately as they get uploaded.

Anonymous 10/26/2025, 3:32:36 AM No.107009682 [Report]

>>107009668
why? Is Chroma against rules too? Did Flux krauts do this?

Anonymous 10/26/2025, 3:32:41 AM No.107009684 [Report]

want2v_00042_thumb.jpg.webm md5: bfe1bf09...

WebM not supported

cfg in high adjusted to 3.5 from 4.0 seems better, i noticed when not using other lora's the frame rate is slower, so next video will need to be 32fps instead of 16fps. I will gen now with q8 @ 1280 x 720 81 frames, slightly different prompt to include synthetic led lighting so that sun light is hopefully not present and change the seed and run at only 4 steps down from 6 total, 2 steps each sampler just to see what it produces. I bet you it will be better than that load of bollocks suggested in the thread, why? because I genned thousands of wan videos by now using these settings.

Anonymous 10/26/2025, 3:36:12 AM No.107009710 [Report] >>107009722

>>107009668
Can you give an example of what you are referring to here?
Celeb loras? NSFW in general?
I agree civit sucks regardless though.

Anonymous 10/26/2025, 3:38:29 AM No.107009722 [Report]

>>107009710
I lied. there's no such thing and chroma is shit model therefore no good lora

Anonymous 10/26/2025, 3:39:06 AM No.107009726 [Report] >>107009731 >>107009777

Reminder
>.\python_embeded\python.exe -m pip cache purge
Files removed: 368 (8132.7 MB)

Anonymous 10/26/2025, 3:40:06 AM No.107009731 [Report]

>>107009726
i don't get

Anonymous 10/26/2025, 3:40:18 AM No.107009732 [Report] >>107009822

ComfyUI_temp_kalqx_00016_.png md5: 27610215...

Anonymous 10/26/2025, 3:44:38 AM No.107009764 [Report]

Screenshot_2025-10-26_02-42-26.png md5: 22311520...

>>107009641
>>107009659
because its fucking shit, every workflow shared on civ or where ever is a fucking meme, a big giant pile of pointless crock of shit. smashed together in a about 5 minutes using someone else's shit and it becomes some Frankenstein turd. Let me guess KJ wrapper nodes even though the guy tells you to not use the wrapper once native nodes are available. People still use the wrapper node workflows and its like OMFG these people are retarded.

Anonymous 10/26/2025, 3:45:59 AM No.107009773 [Report] >>107009822

ComfyUI_temp_dbeyl_00053_.png md5: af4b1ded...

Anonymous 10/26/2025, 3:46:30 AM No.107009777 [Report] >>107009836

>>107009726
8 GB wow... You're just gonna have to re download those files when doing anything to install or update shit don't you know that?

Anonymous 10/26/2025, 3:49:12 AM No.107009792 [Report]

1752413595697592_thumb.jpg.webm md5: fb87168f...

WebM not supported

>>107009666
lmao the random icons.

Anonymous 10/26/2025, 3:49:26 AM No.107009795 [Report] >>107009901

>>107008846
The breasts sag and bounce are perfect. Is this with no lora?

Anonymous 10/26/2025, 3:54:06 AM No.107009821 [Report] >>107009897

Chroma UltraReal LoRA made for Flux sloppers-
https://www.reddit.com/r/StableDiffusion/comments/1o3bkgc/lenovo_ultrareal_chroma_lora/

Looks less realistic than original, but gives Chroma back the Flux aesthetic.

Anonymous 10/26/2025, 3:54:15 AM No.107009822 [Report] >>107009896

>>107009732
>>107009773
why would hair fly in the car?

Anonymous 10/26/2025, 3:56:02 AM No.107009834 [Report] >>107009895

ComfyUI_temp_dbeyl_00054_.png md5: a0c5ca43...

Anonymous 10/26/2025, 3:56:16 AM No.107009836 [Report] >>107009918 >>107009932

>>107009777
A lot of things are redundant old versions that won't be needed anymore or from nodes that were deleted after testing

Anonymous 10/26/2025, 4:04:31 AM No.107009885 [Report] >>107009893

What's the Light2v setup for Wan 2.2 T2V? Is it just the distilled 2.2 loras or do you guys use the old ones at a higher weight like the i2v worklfow?

Anonymous 10/26/2025, 4:07:29 AM No.107009893 [Report]

>>107009885
speed up loras are for losers

Anonymous 10/26/2025, 4:07:51 AM No.107009895 [Report]

>>107009834
>a car interior that isn't nonsense
Impressive!

Anonymous 10/26/2025, 4:08:04 AM No.107009896 [Report]

ComfyUI_temp_dbeyl_00055_.png md5: fe039379...

>>107009822
the car doesn't have any windows anon, pay more attention

Anonymous 10/26/2025, 4:08:12 AM No.107009897 [Report]

>>107009472
>>107009821
most if not all flux loras work in chroma so try them.

Anonymous 10/26/2025, 4:08:32 AM No.107009901 [Report] >>107010036

>>107009795
https://civitai.com/models/2008663/slop-twerk-wan-22-i2v

Anonymous 10/26/2025, 4:12:46 AM No.107009918 [Report] >>107009932

>>107009836
you don't understand when you install requirements.txt it will check and re download most probably.

Anonymous 10/26/2025, 4:13:09 AM No.107009920 [Report] >>107009935

1742364684649963_thumb.jpg.webm md5: 40a2bf9b...

WebM not supported

>>107009547

Anonymous 10/26/2025, 4:15:06 AM No.107009929 [Report]

>i2v
>inpaint and fix the last frame
>flf2v
easy

Anonymous 10/26/2025, 4:15:18 AM No.107009932 [Report]

>>107009836
>>107009918
unless its only removing old versions of files i guess, but i wouldn't bother for the sake of 8GB free disk space.

Anonymous 10/26/2025, 4:15:41 AM No.107009935 [Report]

>>107009920
got dang

Anonymous 10/26/2025, 4:21:46 AM No.107009971 [Report] >>107010043 >>107010298 >>107010391

I'm trying to gen i2v with comfyui's example workflow for wan 2.2 and the quality is dogshit compared to with the lightx2 loras. Anyone else willing to test this please? I'm trying to use the workflow at the bottom of comfyui's wan example page here:

https://comfyanonymous.github.io/ComfyUI_examples/wan22/

My light2x videos are generally fine, just had one that wasn't quite hitting the level of quality I wanted and back on 2.1 I could just switch to not using the loras and it would generally improve it, but 2.2 it's way worse. Way waaaay worse. Probably something wrong with my environment but would like someone else to test it or at least just tell me it works fine for them before I redo my venv.

Anonymous 10/26/2025, 4:22:35 AM No.107009978 [Report]

>>107009641
the last or first frames being messed up is probably due to not enough steps because that is what is happening to me right now when dropping from 6 steps to 4 steps. 4 total steps isn't enough, splitting steps like this is also retarded but its what we do with these speed up lora's.

for wan2.2 without the minimum would be something like

25 total steps
10 high 15 low or the other way round I forgot, but there is a chart somewhere on reddit that explains. Its based on shift settings that's all i remember.

Anonymous 10/26/2025, 4:27:29 AM No.107010005 [Report]

00110-885313502.png md5: 67763956...

Anonymous 10/26/2025, 4:32:41 AM No.107010036 [Report]

>>107009901
thanks

Anonymous 10/26/2025, 4:35:20 AM No.107010043 [Report] >>107010221 >>107010298

>>107009971
Comfyui local examples are purposely shittier so you pay API.

Anonymous 10/26/2025, 4:58:52 AM No.107010167 [Report]

1761357964189354_thumb.jpg.webm md5: 16e52885...

WebM not supported

>>107009421

Anonymous 10/26/2025, 5:06:51 AM No.107010206 [Report]

1737299827315456_thumb.jpg.webm md5: d70052a7...

WebM not supported

the anime girl stands up and dives into a swimming pool on the right.

Anonymous 10/26/2025, 5:08:51 AM No.107010221 [Report] >>107010298

>>107010043
this. only a matter of time before they start to fuck with the sampling

Anonymous 10/26/2025, 5:11:27 AM No.107010233 [Report]

1745777013437417.png md5: be0a3089...

https://www.reddit.com/r/StableDiffusion/comments/1og3u26/automatically_texturing_a_character_with_sdxl/

Anonymous 10/26/2025, 5:11:36 AM No.107010234 [Report] >>107010243 >>107010256

Is there a video gen for a <12GB vramlet?

Anonymous 10/26/2025, 5:13:20 AM No.107010243 [Report]

>>107010234
https://github.com/deepbeepmeep/Wan2GP

Anonymous 10/26/2025, 5:16:14 AM No.107010256 [Report] >>107010259

>>107010234
you can use wan 2.2 by offloading

Anonymous 10/26/2025, 5:17:20 AM No.107010259 [Report]

>>107010256
how much ram is recommended for offloading with that amount of vram?

Anonymous 10/26/2025, 5:18:37 AM No.107010264 [Report]

im back.jpg md5: b1ceeb78...

Anonymous 10/26/2025, 5:19:44 AM No.107010268 [Report]

>>107009330
prompt?

Anonymous 10/26/2025, 5:26:54 AM No.107010298 [Report] >>107010307 >>107010331

>>107009971
It's dogshit because it uses just 20 steps when the official settings use 40 steps for i2v. In my own testing you need at least 35. Also splitting the steps at half is stupid as the high noise model doesn't need that many steps compared to the low noise model (use something like the MoE Ksampler and set the boundary to 0.900 for i2v and let it determine the split). Lastly, fp8 scaled is inferior to Q8.

>>107010043
>>107010221
I suppose it never occurred to you that those dipshits at comfyui never extensively test workflows/settings/models, but rather just cobble the nodes together and say it's good enough and leave it at that? Retards.

Anonymous 10/26/2025, 5:29:23 AM No.107010307 [Report]

>>107010298
tbf when there are a bunch of redditors constantly screaming WEN COMFY NODE it may be hard to actually spend time testing first.

on the othe hand, it is a bit suspect when they deviate from the model developers' defaults.

Anonymous 10/26/2025, 5:34:35 AM No.107010331 [Report]

>>107010298
Thanks anon I'll try higher steps and switch between models earlier.

Anonymous 10/26/2025, 5:41:31 AM No.107010368 [Report]

Fresh

>>107010364
>>107010364
>>107010364

Anonymous 10/26/2025, 5:47:52 AM No.107010391 [Report]

want2v_00044_thumb.jpg.webm md5: 26742ad2...

WebM not supported

>>107009971
I got you anon just just tweaking to hit the right spot and i'm getting close to the sweet spot. this i think was only 4 steps using old wan2.1 lora rank 64 iv2 in high at 5.00 strength and old wan2.2 low in low at 1.0 strength. But I'm using unconventional sampler settings that some anons here think is retarded because it uses cfg but honestly the gen time is not all that much more time and you get high quality with prompt adherence so its balance I seek and not just the fastest speed.