/ldg/ - Local Diffusion General
Anonymous
10/25/2025, 8:38:35 PM
No.107006484
[Report]
what is opinion of pony v7, krea video, lightx2v lora nwe
Anonymous
10/25/2025, 8:38:57 PM
No.107006487
[Report]
Blessed thread of frenship
>>107006468 (OP)
>not a single anime girl
:(
Anonymous
10/25/2025, 8:42:39 PM
No.107006522
[Report]
>>107006514
anime died with pony
Anonymous
10/25/2025, 8:43:12 PM
No.107006527
[Report]
>>107006514
>but there's a clown girl
a massive upgrade, then.
>>107006326
It is more associated with "pointy ears" than elf, as I can get it with "demon girl" or "fairy" as in pic related.
>>107006523
Wow those do look a lot like the earrings I keep getting, especially when not trying to prompt around it. But does Chroma know "Frieren"? I thought it had characters and styles removed.
Anonymous
10/25/2025, 8:45:44 PM
No.107006545
[Report]
>>107006658
>It isn't that half bad desu
it makes sense that the guy who shills netadogshit would think that ponyv7 isn't half bad. horrendous shit taste.
Anonymous
10/25/2025, 8:48:09 PM
No.107006562
[Report]
my prompts are too strong for you, anon
Anonymous
10/25/2025, 8:48:45 PM
No.107006567
[Report]
>>107006544
>But does Chroma know "Frieren"? I thought it had characters and styles removed.
Yeah chroma knows many characters and styles, just need longer prompting than just name
Anonymous
10/25/2025, 8:50:50 PM
No.107006585
[Report]
>>107006627
anon, i tell you i am going to 1girl, and i want only your strongest prompts
Anonymous
10/25/2025, 8:51:46 PM
No.107006588
[Report]
(you) can't handle my strongest 1girl
Anonymous
10/25/2025, 8:56:13 PM
No.107006627
[Report]
>>107006585
masterpiece, loli, photorealistic
Anonymous
10/25/2025, 8:56:31 PM
No.107006629
[Report]
>>107006704
>>107005962
Same reason Princess Peach always has an Iron Man core embedded in her chest no matter what clothes she's wearing, if almost 100% of the examples of a given object have that particular feature then as far as that model is concerned it is an inherent aspect of that object. It's not like these models make any inherent distinction between clothing and body parts.
so did astralite comment about why v7 shat the bed so hard or is he pivoting straight to another grift
Anonymous
10/25/2025, 8:59:46 PM
No.107006656
[Report]
>>107006647
Pony v6 was simply a fluke.
When he announced v7 and what was going on with it that already was a clear sign that the model is going to be a failure.
Anonymous
10/25/2025, 8:59:55 PM
No.107006658
[Report]
Anonymous
10/25/2025, 9:01:42 PM
No.107006675
[Report]
Anybody who defends NetaLumina or Ponyv7 is deranged and cannot be trusted.
Anonymous
10/25/2025, 9:03:47 PM
No.107006690
[Report]
Anonymous
10/25/2025, 9:05:23 PM
No.107006703
[Report]
>>107006629
Well I know there are plenty of elf images that don't have earrings like that. And other models like the SDXL-based ones can make earring-less elves easily enough. It might be that I am pass in a cartoon style image in the workflow, that does not itself have earrings but this nudges the model as well. Perhaps it can realistic pointy earsor other styles just fine without sticking earrings on. Or perhaps if I fed it a empty latent image it wouldn't have the problem. Haven't tested it. Just noticed it was st range. At any rate, I have added frieren to negatives (along with earrings) but I don't start losing the earrings until the cfg gets all the way up to around 5.0 at which point it is looking rather fried so I guess it is not going to fix the problem.
Anonymous
10/25/2025, 9:06:56 PM
No.107006720
[Report]
>>107006704
> It might be that I am passing in a cartoon style image*
and
> Perhaps it can do realistic pointy ears or other styles just fine*
Anonymous
10/25/2025, 9:08:09 PM
No.107006731
[Report]
>>107006840
>>107006704
bretty kino gen
Anonymous
10/25/2025, 9:10:05 PM
No.107006742
[Report]
>>107006647
>pivoting straight to another grift
this, as to why anyone would support him after v7, the amount of retards far outweigh people with common sense.
Anonymous
10/25/2025, 9:12:36 PM
No.107006760
[Report]
>>107006544
The annoying thing is I can get the earrings to go away by turning down cfg below the correct value (1), but then the image fails to denoise correctly.
>>107006544
I finally succeeded. I was uhh only trying to accomplish no earrings, nothing else in particular
Anonymous
10/25/2025, 9:21:46 PM
No.107006829
[Report]
>>107006856
>>107006793
Nice job and nice tits. Did you do anything in particular to make them go away or was it just a lucky gen?
>>107006731
Thanks, you can get some weird results when you use wd14 to interrogate an image, then use its prompt to make a new picture.
Anonymous
10/25/2025, 9:25:48 PM
No.107006856
[Report]
>>107006947
>>107006829
>boobas, bazoongas, over the shoulder boulder holders
one can only imagine
Anonymous
10/25/2025, 9:29:17 PM
No.107006880
[Report]
No finetune is going to fix that trash pony v7 model. Pic unrelated
Anonymous
10/25/2025, 9:39:27 PM
No.107006947
[Report]
>>107007052
>>107006840
Turning cfg down to ~0.85 had a lot to do with it, but largely just luck
>>107006856
In this case it was "tall and voluptuous" plus "[her] big flabby sagging breasts are tightly bound in her fraying smock and squeezed together for a ton of cleavage"
I found this was um necessary to make the earrings go away
Anonymous
10/25/2025, 9:42:27 PM
No.107006975
[Report]
>>107007117
Local models should unironically be banned
Anonymous
10/25/2025, 9:45:34 PM
No.107007001
[Report]
>nigbobumping this thread
Anonymous
10/25/2025, 9:48:11 PM
No.107007022
[Report]
>>107007256
is the info of style clustering ANY FUCKING WHERE for ponyv7? I searched in the HF and civitai page, NOT a single fucking link to check these fucking clusters.
Yes I know that pony is shit, but I still wanted to experiment a bit with this new toy.
FML
Anonymous
10/25/2025, 9:51:45 PM
No.107007052
[Report]
>>107007058
>>107006947
and what about the comic style name?
>>107006840
Actually I don't know why I said it was only luck, I did a lot otherwise to try to make it happen (luck was still important of course)
I added no-makeup hashtags, removed anything signifying richness or ornateness, used a lot of words like "plain" "natural" "rustic" "barefaced" etc., tried to force a feral/pauper/tattered appearance, tried to get a retro pulp fantasy aesthetic to avoid modern "character design" slop, described the character as boyish and avoided things suggesting a stately older elf, etc.
But all those things failed until I turned cfg down a little bit. Now some of the gens have earrings and some don't.
>>107007052
"A blurry grainy scan of an old pulp fantasy illustration from 1957." I'm sure there's a lot of room for improvement there.
res2s, beta57, 20 steps, 4 cfg.
man these fucking hands
my maximum permitted gen time is around 100s, this gen took 110s.
Anonymous
10/25/2025, 9:55:54 PM
No.107007082
[Report]
>>107007058
>I'm sure there's a lot of room for improvement there.
E.g., I am now going to try "pulp adventure" instead of fantasy because the word fantasy is too closely associated with modern slop
Anonymous
10/25/2025, 9:59:22 PM
No.107007107
[Report]
how do i gen girlfailures?
Anonymous
10/25/2025, 10:04:34 PM
No.107007117
[Report]
>>107006975
>t. /de3/ vramlet jelly of /ldg/ chads' epic booba gens
lmao
Anonymous
10/25/2025, 10:04:34 PM
No.107007122
[Report]
>>107007076
adding this negative:
deformed hands, bad anatomy, extra limbs, poorly drawn hands, poorly drawn face, mutation, deformed, extra eyes, extra arms, extra legs, malformed limbs, fused fingers, too many fingers, long neck, cross‑eyed, bad proportions, missing arms, missing legs, extra digit, fewer digits
seems to have fixed some of the issues actually. I prefered the older image overall composition and tone tho.
Anonymous
10/25/2025, 10:09:05 PM
No.107007138
[Report]
Can't make Minthy/Rouwei-T5Gemma-adapter_v0.2 work. Provided workflow requires full Gemma so I add a gguf loader node, but then I get a picrel error. LLM SDXL nodepack updooted to 3.0.1.
Pony v7 q8 , fp32 vae and clip,
official comfyui workflow, 30 steps
stress test
style_cluster_1610, score_9, Detailed photograph RAW of seven smiling friends of different races that are at a nightclub concert with dim lighting that is shining on their faces, behind them is a crowd of people dancing while fighting with large swords, everyone is holding a sword in their left hand and an intricate beer glass with differently colored beer in the right hand. Far behind them above the DJ there is a sign which has "Minimum drinKing age 021!" written on it in stylized cursive letters.
Anonymous
10/25/2025, 10:45:31 PM
No.107007244
[Report]
>>107007549
>>107007058
I tried some of those but in my case I got the earrings still and even lowering the denoising to 0.5 didn't get rid of it. Interestingly, I don't have the problem with the non flash chrome models, such as Chroma-DC-2K-T2-SL4-bf16
Anonymous
10/25/2025, 10:47:07 PM
No.107007256
[Report]
>>107007267
>>107007022
that v6 tagmine spreadsheet wasnt created by the author so
>>107007243
Different seed
Anonymous
10/25/2025, 10:47:49 PM
No.107007267
[Report]
>>107007256
since I've made that post I've read on the colab the styles groups go from 1 to 2048
>>107007257
Different seed and without "style_cluster_1610" in the prompt
Anonymous
10/25/2025, 10:50:32 PM
No.107007292
[Report]
>>107007243
>>107007257
>>107007269
takes me back to good ol' SD1.4 days
the style_cluster thing for sure is cumbersome, but atleast it has artists in it in some form. I don't mind if I have to look up a table. wish we had that in chroma instead of fucking NOTHING AT ALL.
Anonymous
10/25/2025, 10:50:54 PM
No.107007299
[Report]
>>107007361
>>107007269
Different seed and without "style_cluster_1610, score_9" in the prompt
Anonymous
10/25/2025, 10:52:12 PM
No.107007307
[Report]
>>107007341
>>107007297
i assume the clusters do not allow for prompting individual artists which is a huge fucking kick in the nuts for no reason other than muh morals
Anonymous
10/25/2025, 10:54:22 PM
No.107007329
[Report]
>>107007243
>>107007257
Goddamn, the sd1.5 and chroma merge lookin fire
Anonymous
10/25/2025, 10:55:56 PM
No.107007338
[Report]
>>107007346
Stop it with the Pony posts that's like seeing gore
Anonymous
10/25/2025, 10:56:22 PM
No.107007341
[Report]
>>107007385
>>107007307
what the fuck? I assumed that was the whole point of clusters. Maybe I should actually read the docs.
Anonymous
10/25/2025, 10:57:32 PM
No.107007346
[Report]
>>107007338
You must have knowledge of the turd to appreciate the beauty of better models like Pixart Sigma.
Anonymous
10/25/2025, 10:58:17 PM
No.107007353
[Report]
>>107007443
>>107007297
Chroma is a base model. Why should a base model have ridiculous style tags?
Chroma as can be tuned by anyone for any purpose. That's what makes it special.
>>107007299
score_9, medieval magical intricate and detailed world, princess taking a selfie in a pink ball dress, long ginger hair, pale skin, huge breasts, smile
Anonymous
10/25/2025, 11:00:13 PM
No.107007365
[Report]
>>107007467
>>107007361
30 steps is too low, try 40
>>107007361
Same seed without "score_9"
Anonymous
10/25/2025, 11:00:52 PM
No.107007375
[Report]
>>107007388
randomizing cluster styles now
Anonymous
10/25/2025, 11:01:39 PM
No.107007382
[Report]
ponyv7 is atrociously bad what the actual fuck
Anonymous
10/25/2025, 11:01:39 PM
No.107007383
[Report]
>>107007341
the author has a strange heretic perversion to releasing models to the public which can be prompted with artist names. his own secret versions however do not have this problem
what a faggot
Anonymous
10/25/2025, 11:02:11 PM
No.107007388
[Report]
>>107007405
Anonymous
10/25/2025, 11:03:21 PM
No.107007395
[Report]
>>107007385
>perversion
*aversion
Anonymous
10/25/2025, 11:04:33 PM
No.107007405
[Report]
>>107007427
Anonymous
10/25/2025, 11:07:16 PM
No.107007427
[Report]
>>107007568
>>107007405
this model is so fucking bad.
gradually losing ALL hope
Anonymous
10/25/2025, 11:07:24 PM
No.107007430
[Report]
>>107007243
Netalumina v3.5, without style and score
lol
Anonymous
10/25/2025, 11:08:39 PM
No.107007438
[Report]
>>107007385
if he released the artist ids, I bet people would even overlook the massive flaws
Anonymous
10/25/2025, 11:09:02 PM
No.107007443
[Report]
>>107007464
>>107007353
Wasn't it supposed to have them but they fucked up the captioning or something? I don't know, just something I read. You think we're gonna get a chroma finetune? As a dumb user, I don't really care what type the model is. All I know is chroma with artist styles would be sweet.
Anonymous
10/25/2025, 11:12:30 PM
No.107007464
[Report]
>>107007443
it's not something you can mess up by mistake
Anonymous
10/25/2025, 11:13:31 PM
No.107007467
[Report]
>>107007572
>>107007369
>>107007365
40 steps
score_9, Attractive medieval princess taking a selfie in a pink ball dress, long ginger hair, pale skin, large breasts, smile. She is at the top of a tall stone tower, with a large window behind her that overlooks a huge and crowded medieval city at sunrise.
https://xcancel.com/JustinLin610/status/1982052327180918888#m
>Alibaba's CEO is asking himself why Open Source doesn't have udio at home
be the change you want to see, make Qwen Audio or something lol
Anonymous
10/25/2025, 11:22:35 PM
No.107007526
[Report]
>>107007687
is this stupid faggot going to post every single gen he makes? fuck off already
https://github.com/fal-ai/flashpack
Then do it Comfy, I'd like to load my models faster, especially with Wan 2.2 when this model is all about unloading/reloading between the HIGH and the LOW model
Anonymous
10/25/2025, 11:24:26 PM
No.107007538
[Report]
>>107007565
>>107007499
>we r working on it and it won't be far. i am just curious about the status
Why talk? Talk is cheap. Give me something that is Udio tier, Apache 2 licensed or I sleep. We don't want another Songbloom or ACE Step.
Anonymous
10/25/2025, 11:25:45 PM
No.107007549
[Report]
>>107007244
Yeah flash is harder, which is partly what makes it fun.
As frustrating as models like that can be, fighting against them feels more like a game. Whereas with something more broad like Chroma base it's hard to know what you can do other than wait and get lucky
Anonymous
10/25/2025, 11:27:56 PM
No.107007565
[Report]
>>107007718
>>107007538
this, they can definitely do it, do it chinks!
Anonymous
10/25/2025, 11:28:30 PM
No.107007568
[Report]
Anonymous
10/25/2025, 11:28:55 PM
No.107007572
[Report]
>>107007586
>>107007467
Same except 1536x1536, which takes ~7s per step on a 3090, making this take 4-5 minutes per image. Almost the same time it takes to generate a full coherent 5s 32fps video today with Wan 2.2 lightx2v.
Unless the model will somehow be saved with "proper" prompting to take out the style knowledge which will also somehow fix the detail gore and almost make it into a completely new and better model too, it's sadly DOA.
Anonymous
10/25/2025, 11:29:04 PM
No.107007576
[Report]
butiful
Anonymous
10/25/2025, 11:29:59 PM
No.107007586
[Report]
>>107007572
Different seed.
so what's the best wan 2.2 lora combo with the new loras?
>>107005507
Nice, did anyone try these SVI loras with wan2.2? What weight did you use? How did you make it work for longer videos?
Anonymous
10/25/2025, 11:32:48 PM
No.107007603
[Report]
>>107007609
>>107007598
1 strength for both?
Anonymous
10/25/2025, 11:33:01 PM
No.107007604
[Report]
>>107007076
>res2s
res3m should be superior and faster too
Anonymous
10/25/2025, 11:33:20 PM
No.107007609
[Report]
Anonymous
10/25/2025, 11:34:17 PM
No.107007616
[Report]
>>107007499
good if he makes something
My friend's cousin works for OpenAI and he says they have a secret internal model not ready for public release yet, it's so powerful that you can type in your street address and it will show you pictures of your house, you can even prompt inside and you'll see yourself
Anonymous
10/25/2025, 11:35:37 PM
No.107007625
[Report]
>>107007619
i will finally know what my oneitis' vagina looks like
Anonymous
10/25/2025, 11:36:02 PM
No.107007628
[Report]
>sky, up in the clouds, heaven, pearly gates, the kingdom of heaven
>>107007536
Does that also apply to gguf? Or files in general?
Anonymous
10/25/2025, 11:38:45 PM
No.107007647
[Report]
>>107007661
>>107007619
My uncle works at Nintendo and he said the next Zelda is gonna be fully dynamically generated by a next-gen GPT model that runs on a VR brain implant
Anonymous
10/25/2025, 11:38:55 PM
No.107007651
[Report]
>>107007590
>lora combo
the one with the stuff you want in your video
Anonymous
10/25/2025, 11:40:37 PM
No.107007659
[Report]
>>107007536
Model load is the most frustrating thing about comfyui...
>wan2.1
>takes minutes at the sampler then starts genning or clip keeps offloading then loads forever or memory leaks after 5 gens where I have to force close comfy
>all-in-slop
>constantly offloads the entire fucking model and have to wait another 10 minutes for it to all load again
>wan 2.2
>while the fastest and least pain in the ass, constant and increasing pausing in between high and low generation
Anonymous
10/25/2025, 11:40:41 PM
No.107007661
[Report]
>>107007647
it would still be better than nu-Open World slop zelda
Anonymous
10/25/2025, 11:41:40 PM
No.107007666
[Report]
>>107007682
Anonymous
10/25/2025, 11:43:43 PM
No.107007682
[Report]
>>107007666
how could you do this to me?
>>107007665
Anonymous
10/25/2025, 11:43:48 PM
No.107007685
[Report]
Anonymous
10/25/2025, 11:44:05 PM
No.107007687
[Report]
>>107007880
>>107007526
Discussion of free and open source models, faggot
>>107007643
>gguf
No.
>>107007643
>files in general
No, it's for safetensor files and you need to convert them to a flashpack format.
Anonymous
10/25/2025, 11:49:08 PM
No.107007718
[Report]
>>107007754
>>107007565
Local has a lot of work to do.
Audio inpainting. Audio upscaling/etc... The bar is literally just a decent model that can do it all.
Anonymous
10/25/2025, 11:50:00 PM
No.107007724
[Report]
>>107007767
>>107007715
>you need to convert them to a flashpack format.
Comfy said you don't need to convert them, just use their methods on safetensors (look at pircel)
>>107007536
Anonymous
10/25/2025, 11:50:40 PM
No.107007730
[Report]
>>107007715
Can't wait to load my sdxl models super fast!
>>107007718
udio is the sota on this, but it has the same limitations as any other closed models :
- you can't ask it to make music "in the style of" (make me music like michael jackson "man in the mirror" -> moderation backend and blocked), though now you can send music to it, but it's not the same.
- you can't train it, make specialized "loras".
- anything sexual is moderated (think a sensual song).
Anonymous
10/25/2025, 11:56:06 PM
No.107007767
[Report]
>>107007724
Would be nice but I'm not sure anyone would work on that.
It can shave off quite a lot of time with complex multistage samplers setups.
>>107007754
>- anything sexual is moderated (think a sensual song).
that's why it's the most hated audio software of female rappers
https://www.youtube.com/watch?v=1Gt9TTjAMvw
Anonymous
10/26/2025, 12:02:05 AM
No.107007816
[Report]
>>107007777
Sure, but that's not sensual, that's just crude and vulgar, never been into these songs. Zero eroticism.
Anonymous
10/26/2025, 12:05:00 AM
No.107007836
[Report]
>>107007777
yeah I guess kek
Are they ever gonna make shorter GPUs? I can't fit anything longer than 300mm in my midtower so I'm stuck with 10 VRAM
So does anyone have replacement recommendations to this?
https://github.com/1038lab/ComfyUI-JoyCaption
It refuses to use CUDA and does inference on the CPU. Taking a whole minute.
Anonymous
10/26/2025, 12:09:31 AM
No.107007875
[Report]
Anonymous
10/26/2025, 12:09:45 AM
No.107007879
[Report]
>>107007927
>>107007849
any VLM model will use cuda, if you're gonna figure out how to make it work for one then it may as well be joycap
Anonymous
10/26/2025, 12:09:45 AM
No.107007880
[Report]
>>107007687
he's talking to himself and spamming the same 1girl, kill yourself, no discussion is being had
Anonymous
10/26/2025, 12:09:54 AM
No.107007882
[Report]
>>107007925
>>107007843
With the way things are going, I doubt it.
Well they will, but you'll get the less powerful stuff.
Anonymous
10/26/2025, 12:10:55 AM
No.107007890
[Report]
>>107007904
if I gen a 10s (161 frames) video on wan, is there a way to prompt it to do one thing then another without the second taking over immediatly?
"she types on a computer for 3 seconds, then she gets up and walks away"
Anonymous
10/26/2025, 12:12:14 AM
No.107007904
[Report]
>>107007943
>>107007890
what if the second thing is waiting, then the actual second thing becomes the third thing
Anonymous
10/26/2025, 12:12:18 AM
No.107007908
[Report]
Anonymous
10/26/2025, 12:12:40 AM
No.107007914
[Report]
>>107007927
>>107007849
the joycaption repo has a gradio interface right?
Anonymous
10/26/2025, 12:13:13 AM
No.107007918
[Report]
>>107007843
no, we reached the limits on the size of a transistor, so the only way for them to get more powerful gpus is to make them bigger, the gold rush is over
Anonymous
10/26/2025, 12:13:22 AM
No.107007920
[Report]
>>107007931
Does anyone here know about superesolution models?
I want to train a model with my own dataset, because my dataset shares the same colours, patterns and style, but it has low resolution images, so I want to upscale them as faithfully as possible.
Please somebody help me
Anonymous
10/26/2025, 12:13:42 AM
No.107007925
[Report]
>>107007882
the only options for me are the ada 48GB which is not worth it and theres one 5070 ti thats like 285mm but also not worth it. I guess I'll have to wait because I happen to think the 5090 is also a bad investment
Anonymous
10/26/2025, 12:13:50 AM
No.107007927
[Report]
>>107007933
>>107007879
I have no idea what you are trying to say.
It doesn't load anything to VRAM, has no GPU usage, CPU at 100% and is slow.
Maybe some other bug or whatever but it's not working as intended.
I asked for alternatives for joy caption inference.
>>107007914
If you are referring to hugging face one that has usage limits.
I am trying to mass tag images for lora training.
That's why I am trying to set up local.
Anonymous
10/26/2025, 12:14:13 AM
No.107007931
[Report]
>>107008242
>>107007920
just play with seedvr2 to upscale them
Anonymous
10/26/2025, 12:14:21 AM
No.107007933
[Report]
>>107007967
>>107007927
>If you are referring to hugging face one that has usage limits.
no, I mean the github repo
Anonymous
10/26/2025, 12:14:23 AM
No.107007934
[Report]
>>107007957
>>107007754
I do recall people making stuff in the same style just by inputting lyrics back when that was allowed.
Look at this
https://www.404media.co/listen-to-the-ai-generated-ripoff-songs-that-got-udio-and-suno-sued/
Obviously there's more, including a popular one
https://www.udio.com/songs/nDKNwPUB6GrMhEfvM6v2u1
Though it's more like a cover
Anonymous
10/26/2025, 12:15:27 AM
No.107007943
[Report]
>>107007904
ok worth a try, thanks anon
Anonymous
10/26/2025, 12:16:37 AM
No.107007957
[Report]
>>107007934
Yeah, this is where local would shine.
Anonymous
10/26/2025, 12:17:27 AM
No.107007967
[Report]
>>107007979
>>107007933
Gradio interfaces are typically hosted at hf and github repo links to hf for online demo as well.
Unless you are referring to something else.
Anonymous
10/26/2025, 12:18:28 AM
No.107007979
[Report]
>>107008007
>>107007967
a1111 and all it's forks are using gradio locally. you are being retarded
Anonymous
10/26/2025, 12:19:42 AM
No.107007994
[Report]
>>107008166
If anons here use torch nightly wheels, when I updated from the one in the beginning of October to the 22nd one (2.10.0.dev20251022+cu130), suddenly sage attention broke completely, it looks like an issue where everything defaulted to CPU instead of CUDA, making the sampler throw an error.
Going back to the 1002 version made it work fine again.
Anonymous
10/26/2025, 12:21:32 AM
No.107008007
[Report]
>>107008032
>>107007979
Oh you mean this?
https://github.com/fpgaminer/joycaption/tree/main/gradio-app
I guess I can try that.
When you said it like that I expected some sort of link to somewhere.
Anonymous
10/26/2025, 12:23:05 AM
No.107008032
[Report]
>>107008007
I am too lazy to look for you but you figured it out. gold star for you
Anonymous
10/26/2025, 12:29:05 AM
No.107008071
[Report]
>monthly pytorch mismatch between custom nodes that requires a reinstall
here we go
>50s/it WAN with random crashes on ROCM 7
>100s/it WAN with guaranteed stability on ROCM 6
suffering
Anonymous
10/26/2025, 12:32:28 AM
No.107008096
[Report]
>>107008111
>>107008079
>he broughtered'ed AMD
why??
>>107008096
Because fuck nvidia. Also gaming under Linux is less of a hassle with AMD.
It's fine, I don't have a fried attention span. I can cope.
Anonymous
10/26/2025, 12:36:50 AM
No.107008135
[Report]
>>107008111
>It's fine, I don't have a fried attention span. I can cope.
I'm not sure you cope this well, you literally complained about the lack of speed here lool
Anonymous
10/26/2025, 12:37:34 AM
No.107008143
[Report]
>>107007598
thank you anon the pajeet doesnt deserve your grace
Anonymous
10/26/2025, 12:40:16 AM
No.107008166
[Report]
>>107007994
I had the same issue, if it's this :
https://github.com/pytorch/pytorch/issues/166104
then it's "working as expected" apparently, so it means we need to get sage attention team to update or be stuck with early october torch
Anonymous
10/26/2025, 12:42:29 AM
No.107008179
[Report]
Anonymous
10/26/2025, 12:42:53 AM
No.107008183
[Report]
>>107008111
Why did you make a post kvetching about speed and stability if you were going to immediately get defensive and coping lol.
Anonymous
10/26/2025, 12:47:32 AM
No.107008205
[Report]
Anonymous
10/26/2025, 12:52:19 AM
No.107008242
[Report]
>>107008372
>>107007931
That's not what I need, I want to train a resolution model with my own database.
The idea is to have pairings of images and teach the model what pairings are a correct upscaling.
Anonymous
10/26/2025, 12:52:55 AM
No.107008250
[Report]
>>107007499
>make Qwen Audio or something lol
they will do it, but for api, kek
Anonymous
10/26/2025, 12:54:29 AM
No.107008261
[Report]
>>107008489
Does ComfyUI patch lora weights into the model by default? Doesn't seem so, why isn't this the case? Wouldn't it help a lot for vram size for multiple loras? Can it be enabled somehow?
Anonymous
10/26/2025, 12:56:05 AM
No.107008271
[Report]
>>107007849
skill issue literally. installa a llama-cpp-python version that has CUDA compiled in it, otherwise manually build the wheel using the correct compile flags (literally contained in this node repo through a script):
https://github.com/1038lab/ComfyUI-JoyCaption/blob/main/llama_cpp_install/llama_cpp_install.py
you're fucking retarded and should kys unironically retarded faggot brown
Anonymous
10/26/2025, 1:02:42 AM
No.107008318
[Report]
>you still have to wait a few minutes to OOM on the first comfy video gen before the second allocates properly and works from then on
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Anonymous
10/26/2025, 1:08:13 AM
No.107008364
[Report]
>>107008376
>>107008334
stop being poor and buy a proper video card faggot
Anonymous
10/26/2025, 1:09:32 AM
No.107008372
[Report]
>>107008242
>The idea is to have pairings of images and teach the model what pairings are a correct upscaling.
Elaborate
Anonymous
10/26/2025, 1:10:03 AM
No.107008376
[Report]
>>107008393
>>107008364
Write proper memory alloc code faggot
>>107008376
>24gb
>not vramlet cope tier
stop being poor jamal, youre embarassing yourself
Anonymous
10/26/2025, 1:13:55 AM
No.107008398
[Report]
>>107008393
Write proper memory alloc code faggot
Anonymous
10/26/2025, 1:14:53 AM
No.107008409
[Report]
i..I'M GOING TO OOM, AHHHHHHHH
Anonymous
10/26/2025, 1:15:07 AM
No.107008414
[Report]
>>107008393
>0% utilization
embarassing indeed
Anonymous
10/26/2025, 1:16:33 AM
No.107008425
[Report]
>>107008475
>>107008393
>all that compute for 1girl, standing, ai-generated
>>107007599
didn't work at 5 strength for me so i gave up because wan 2.1 is shit and wan 2.2 5b will be even worse because wan 5b has no speed up lora's as far as i know, second wan2.2 vae is fucking slow and it oom sometime unless --reserve-vram 1.0 is used, oh an 5b produces shit t resolutions below 1280 x 704 or what ever.
total waste of time unless they train it on wan 2.2 14b 720p or heck even 480p, what's more is we don't even know if we are using it correctly. Yeah i think it will be forgotten about and never heard of again.
Anonymous
10/26/2025, 1:19:17 AM
No.107008443
[Report]
>>107008541
>>107007599
well reading that post maybe it will be fixed and working, but i won't hold my breath for it because its already limited in what it can actually do with wan 2.1 and its limited to 832 x 480 or it is slow as fuck.
Anonymous
10/26/2025, 1:19:40 AM
No.107008448
[Report]
>>107008475
>>107008393
>600 watts idle
waow
>>107008425
im actually running REAL LLMs in here, sadly at q8 quant (GLM 4.6, 400gb~), fully in VRAM unlike you poors who make do with q2ks poverty tier quants and offload to CPU anyway lmao. I use the spare 200gb~ to load FULL precision video/audio/image models to delive a superior and immersive chat experience, with SOTA imagen/textgen/audiogen/voicegen all happening automatically as I rp with my waifus.
>>107008448
these are rented in a datacenter, no way I can run this shit at home. Also memed my company into doing it, bunch of clueless retard, im feeding them shit from bedrock itself while using the real cluster for myself.
Anonymous
10/26/2025, 1:24:33 AM
No.107008489
[Report]
>>107008261
you can minimize the vram use by merging, but I don't think it's possible to do on the fly so we are stuck with model + lora sizes
Anonymous
10/26/2025, 1:25:22 AM
No.107008497
[Report]
>>107008475
enjoy it while it lasts bruh
Anonymous
10/26/2025, 1:26:23 AM
No.107008506
[Report]
>>107008475
maybe if you put in as much effort into real life you could have a real girl friend?
Every now and again I dream about picking up a data center GPU for fire sale prices after the crash happens and then I remember the insane power draw and the fact that they apparently use total loss water cooling.
Anonymous
10/26/2025, 1:26:30 AM
No.107008509
[Report]
>>107008524
>>107008334
I don't have the issue, what video gen resolution and how many frames? Can you share your wf?
Anonymous
10/26/2025, 1:27:33 AM
No.107008519
[Report]
>>107008545
>>107008475
>Also memed my company into doing it, bunch of clueless retard, im feeding them shit from bedrock itself while using the real cluster for myself.
They don't even see they pay twice?
Anonymous
10/26/2025, 1:28:01 AM
No.107008523
[Report]
>>107008508
>after the crash happens
Keep dreaming
Anonymous
10/26/2025, 1:28:03 AM
No.107008524
[Report]
>>107008559
Anonymous
10/26/2025, 1:28:53 AM
No.107008529
[Report]
>>107008541
>>107008430
>total waste of time unless they train it on wan 2.2 14b 720p or heck even 480p, what's more is we don't even know if we are using it correctly. Yeah i think it will be forgotten about and never heard of again.
They released the training code so maybe some rich anon will do it.
Anonymous
10/26/2025, 1:30:11 AM
No.107008541
[Report]
>>107008588
Anonymous
10/26/2025, 1:31:04 AM
No.107008545
[Report]
>>107008564
>>107008519
>believing larpers
Anonymous
10/26/2025, 1:32:29 AM
No.107008559
[Report]
>>107008524
>Maximum resolution and frames
720p and 81 frames? Do you blockswap?
Try block swapping 5 blocks for example, see if it works. As long as you have enough ram, the difference in speed is minimal.
Anonymous
10/26/2025, 1:33:00 AM
No.107008564
[Report]
>>107008597
>>107008545
wtf bfl, this was almost unsafe
Anonymous
10/26/2025, 1:36:10 AM
No.107008588
[Report]
>>107008541
sorry for the black pill bro but i've done some testing the base wan 2.2 5b model today. it don't work with lightx speed lora's so its actually slower than just using wan 2.2 high and low on my machine. The vae is a pain in the ass as well, its really slow to decode on my rtx 3060. it might be alright some people but the quality is shit if using resolutions below 1280 x 720
so yeah its slow and i really don't know why people with lower vram use it when they could just use Q4 gguf high and low models and get much better quality and faster due to speed lora's. So I'm not gonna bother when they release the wan2.2 5B version.
tl;dr is was DOA
Anonymous
10/26/2025, 1:37:15 AM
No.107008597
[Report]
>>107008604
Anonymous
10/26/2025, 1:37:53 AM
No.107008604
[Report]
>>107008773
>>107008597
wish it was good at making filled used condoms
Anonymous
10/26/2025, 1:39:13 AM
No.107008614
[Report]
>>107008632
wan gen :
[Subject Description] + [Scene Description] + [Motion Description] + [Aesthetic Control] + [Stylization]
Example: "A young woman in a red dress (subject), standing in a bustling neon-lit city street at night (scene), walks forward then stops to look up at the rain, slow motion tracking shot (motion), cinematic lighting, moody atmosphere (aesthetic), film noir style (stylization)"
Anonymous
10/26/2025, 1:40:09 AM
No.107008619
[Report]
>>107008791
>>107007598
forgot to set size for the vid but it still worked pretty good.
the anime girl runs to the left out the door and closes it.
ty anon
Anonymous
10/26/2025, 1:41:03 AM
No.107008627
[Report]
>>107008475
How many wan frames can you load and how fast is your wan gens?
Anonymous
10/26/2025, 1:41:37 AM
No.107008632
[Report]
>>107008614
I just wish wan could handle longer prompts for more actions. It hardly ever works for me even when using context windows and 161+ frames at 81 frame chunks with overlap.
Anonymous
10/26/2025, 1:49:53 AM
No.107008680
[Report]
>>107008699
>>107008079
My ROCM 7 has been rock solid after upgrading to the official stuff. ComfyUI amd memory management update also swagged my shit out.
50s/it seems pretty good for wan with amd, which card you got?
Anonymous
10/26/2025, 1:50:21 AM
No.107008684
[Report]
>>107008719
>>107007599
https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1519#issuecomment-3440759925
I tried telling anons this when it first dropped, it can't be as easy as just 1 lastframe because base wan does not have any concept of the previous genned video. It treats each as a new video, so to me it looks more like how wananimate works in taking 5 previous frames to continue the motion since 1 frame isn't enough to continue motion with.
People were making videos with it but those would have been placebo gens.
Anonymous
10/26/2025, 1:51:51 AM
No.107008699
[Report]
>>107008739
>>107008680
9070 xt. Not sure what you mean by "official stuff", I'm using the nightly URL from the official Pytorch page. I haven't upgraded Comfy in a while either.
>>107008508
>and the fact that they apparently use total loss water cooling
is this one of those cases of turbo-niggering the environment to save $0.01?
Anonymous
10/26/2025, 1:54:18 AM
No.107008719
[Report]
>>107008726
>>107008684
How to even feed 5 frames to next video? I don't think it's possible right now, last time I asked it was only 1 frame starting the next video.
So on top of their lora, we need some node to "inject" 5 frames instead of 1 as latent into a sampler.
Basically picrel but 5 frames.
Anonymous
10/26/2025, 1:55:03 AM
No.107008725
[Report]
>>107008710
Gigawatt in equals gigawatt out. All that energy has to eventually turn into heat, and there ain't an air conditioning system on this earth that can dissipate 1 GW. That being said, I don't know the specifics, only the basic laws of physics.
>>107008719
feed first 5 frames 0 denoise then the rest on the next sampler
Anonymous
10/26/2025, 1:56:01 AM
No.107008731
[Report]
>>107008710
it's water in water out, they don't inject wastes from the Gange in it anon
Anonymous
10/26/2025, 1:56:36 AM
No.107008739
[Report]
>>107008699
you should upgrade your comfy to at least 0.3.65, very good amd improvements
https://github.com/ROCm/TheRock
I think it's these but I see you're on linux so nevermind. Official windows support I meant.
Anonymous
10/26/2025, 1:57:44 AM
No.107008748
[Report]
>>107008790
>>107008726
Images in wan aren't processed in series, all the frames are processed at the same time, it's just that the first one is "fixed" in latent, rest is sent as noise.
What we need is to "fix" the 5 first instead.
Anonymous
10/26/2025, 1:58:19 AM
No.107008752
[Report]
>>107008769
>>107007598
willing to give this a try, i'm assuming scheduler simple?
Anonymous
10/26/2025, 2:01:00 AM
No.107008769
[Report]
>>107008792
Anonymous
10/26/2025, 2:01:22 AM
No.107008773
[Report]
>>107008604
train a lora man, it takes like two hours
>>107008726
won't work, this is shit we tried months ago I'm sure, it just ignores them. and how you even gonna do this? de-noise in advanced sampler the first 5? it won't matter because wan does all frames at once as a new video. it won't magically know there are 5 frames already done. KJ was wrong to assume it needed no codes changes, we gonna need a new node.
Anonymous
10/26/2025, 2:04:47 AM
No.107008790
[Report]
Anonymous
10/26/2025, 2:04:51 AM
No.107008791
[Report]
>>107008892
>>107008619
the new 2.2 MoE high lora works so much better for motion/fluidity. ty kijai for fixing it
>>107008769
thanks, but why KJ lora's? Is there something different about them? Or are just extracted from their model?
Anonymous
10/26/2025, 2:05:51 AM
No.107008797
[Report]
>>107008878
>>107008792
I tested different loras and this combination had the best result
Anonymous
10/26/2025, 2:13:22 AM
No.107008842
[Report]
>>107008788
>we gonna need a new node.
Yep, and even better if we do import latent corresponding to the last 5 images instead of degraded images through vae decode, but I'm not even sure that's possible.
Anonymous
10/26/2025, 2:15:37 AM
No.107008851
[Report]
>>107008846
it started so well, but we didn't get glorious cleavage bouncing
Anonymous
10/26/2025, 2:15:41 AM
No.107008853
[Report]
>>107008890
>>107008792
NTA, but the LoRAs released by Lightx2v were extracted wrong initially so you had to use KJ extracted LoRAs. They Lightx2v ones were then re-uploaded with correctly working versions at some point. KJ didn't extract the newest I2V Lightx2v LoRAs because they actually did it right on the first try this time.
https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main
>>107008797
well its no where near as good as my settings and setup, its fucking blurry at 720 x 720
Anonymous
10/26/2025, 2:22:14 AM
No.107008890
[Report]
>>107008913
>>107008853
lol its still shit i will prove it...
Anonymous
10/26/2025, 2:22:35 AM
No.107008892
[Report]
>>107009160
Anonymous
10/26/2025, 2:25:41 AM
No.107008909
[Report]
Ok needed to figure out how to integrate CUDA Toolkit into my docker setup but Joy Caption is working now with GPU acceleration, 4 times faster.
You are the thread schizo who regularly shits it up. As such I won't give a (You), but credit where due thank you, bastard.
Anonymous
10/26/2025, 2:26:28 AM
No.107008913
[Report]
>>107008927
>>107008900
>>107008890
>>107008878
wait a minute i forgot to change the god damn steps start and end... This has probably been why its so blurry lol. I'll check it again.
Anonymous
10/26/2025, 2:27:38 AM
No.107008927
[Report]
>>107008960
>>107008913
I tried those LoRAs as well and I got some weird hyperspace zoom effect, but I'm just using whatever quants of WAN22 come with Comfy's default workflow.
I was able to run the LongCat Video demo. This is the stock prompt:
>prompt = "In a realistic photography style, a white boy around seven or eight years old sits on a park bench, wearing a light blue T-shirt, denim shorts, and white sneakers. He holds an ice cream cone with vanilla and chocolate flavors, and beside him is a medium-sized golden Labrador. Smiling, the boy offers the ice cream to the dog, who eagerly licks it with its tongue. The sun is shining brightly, and the background features a green lawn and several tall trees, creating a warm and loving scene."
>negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"
Generation is in 3 stages (initial, distilled, refined) that each output a video. It took 24 minutes to generate this and 74gb of vram (FP32).
Going to try the long video (1min) generation next and expecting it to take hours.
Anonymous
10/26/2025, 2:36:02 AM
No.107008960
[Report]
>>107008927
I'm a quality improving study tonight, so we will see which is best.
>>107008959
That is good quality, how long did it take?
Anonymous
10/26/2025, 2:38:28 AM
No.107008971
[Report]
>>107008966
>It took 24 minutes to generate this
Anonymous
10/26/2025, 2:39:04 AM
No.107008974
[Report]
>>107008966
nvm i didn't read full post 24 mins 74GB vram I'm gonna cry :(
>>107008959
>Generation is in 3 stages (initial, distilled, refined)
ok that might mean it could run on smaller cards?
Anonymous
10/26/2025, 2:40:33 AM
No.107008984
[Report]
>>107008991
>>107006468 (OP)
can you guys make me some realistic apustajas?
Anonymous
10/26/2025, 2:41:57 AM
No.107008991
[Report]
Anonymous
10/26/2025, 2:42:06 AM
No.107008992
[Report]
>>107009003
>>107008959
thanks for doing this anon, i ran oom on my 3090 multiple times before giving up
Anonymous
10/26/2025, 2:43:30 AM
No.107009003
[Report]
>>107009016
>>107008983
It already uses 55gb on the first pass, but keep in mind it's at FP32. At Q8 the 74gb peak should be down to 18.5, so it would work on a 24gb card.
The first two passes aren't really meant to be used as-is, anyway. First stage here.
Anonymous
10/26/2025, 2:45:21 AM
No.107009014
[Report]
>>107009030
Kind of annoying how the lightx2v ruins videos with end frames. It distorts right at the ending but without the lora it works fine
Anonymous
10/26/2025, 2:45:34 AM
No.107009016
[Report]
>>107009038
>>107009003
allison brie you fucking cretin
Anonymous
10/26/2025, 2:45:45 AM
No.107009018
[Report]
>>107009009
>>107008983
Second stage (distill)
Anonymous
10/26/2025, 2:46:34 AM
No.107009024
[Report]
>>107009009
>letting your dog lick chocolate syrup
fucking retarded kid
>>107009014
Everyone complaining about lightx2v color distortion, flickering, or blurryness or anything else is a workflow issue, i never had any of those with
>>107008900
>>107007598
Anonymous
10/26/2025, 2:49:17 AM
No.107009038
[Report]
>>107009081
>>107009016
she looked better younger
Anonymous
10/26/2025, 2:49:53 AM
No.107009042
[Report]
>>107009030
I don't have them either with latest version, pretty nice.
Anonymous
10/26/2025, 2:50:16 AM
No.107009048
[Report]
>>107009009
the first stage looks alright desu.
>At Q8 the 74gb peak should be down to 18.5, so it would work on a 24gb card.
Yeah I think we will be eating good again soon.
Anonymous
10/26/2025, 2:54:54 AM
No.107009081
[Report]
Anonymous
10/26/2025, 2:01:33 AM
No.107009116
[Report]
Running the LongCat 1min demo. It generates 11 segments and chains them together. I'm guessing it'll take about 4.5 hours if it doesn't fail. Here's the initial step of the first 11th.
>prompt = "realistic filming style, a person wearing a dark helmet, a deep-colored jacket, blue jeans, and bright yellow shoes rides a skateboard along a winding mountain road. The skateboarder starts in a standing position, then gradually lowers into a crouch, extending one hand to touch the road surface while maintaining a low center of gravity to navigate a sharp curve. After completing the turn, the skateboarder rises back to a standing position and continues gliding forward. The background features lush green hills flanking both sides of the road, with distant snow-capped mountain peaks rising against a clear, bright blue sky. The camera follows closely from behind, smoothly tracking the skateboarder’s movements and capturing the dynamic scenery along the route. The scene is shot in natural daylight, highlighting the vivid outdoor environment and the skateboarder’s fluid actions."
>negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"
Anonymous
10/26/2025, 2:04:24 AM
No.107009129
[Report]
Anonymous
10/26/2025, 2:09:51 AM
No.107009152
[Report]
>>107009157
captchas are failing, 4chan is going down
Anonymous
10/26/2025, 2:11:39 AM
No.107009157
[Report]
>>107009152
[audience] wooooOOO
Anonymous
10/26/2025, 2:11:56 AM
No.107009160
[Report]
Anonymous
10/26/2025, 2:17:05 AM
No.107009178
[Report]
Anonymous
10/26/2025, 2:17:16 AM
No.107009180
[Report]
>>107009246
>>107008959
6 second gen took 24 mins to do? that's rough
Anonymous
10/26/2025, 2:20:25 AM
No.107009196
[Report]
Anonymous
10/26/2025, 2:23:01 AM
No.107009222
[Report]
Anonymous
10/26/2025, 2:26:07 AM
No.107009243
[Report]
>>107009301
Anonymous
10/26/2025, 2:26:22 AM
No.107009246
[Report]
>>107009180
Yeah but so is wan with out speed loras.
now imagine what this thing could do in future.
Anonymous
10/26/2025, 2:32:00 AM
No.107009301
[Report]
>>107009330
Anonymous
10/26/2025, 2:32:29 AM
No.107009306
[Report]
the man in the blue shirt turns and fires a blue energy beam at the plane, which explodes into fire and smoke.
live action dragonball. used unipc instead of euler this time.
Anonymous
10/26/2025, 2:33:19 AM
No.107009316
[Report]
>>107009333
>>107007598
I'm guessing you mean 8 total steps 4/4 ? because with only 4 total steps its blurry, I'm now trying with 8 total steps with your settings.
Anonymous
10/26/2025, 2:36:29 AM
No.107009333
[Report]
>>107009364
>>107009316
No it's 4 steps total, unipc, 720x1280, 81 frames, q8 wan, umt5 bf16
Anonymous
10/26/2025, 2:39:40 AM
No.107009349
[Report]
Anonymous
10/26/2025, 2:40:41 AM
No.107009357
[Report]
>>107009402
Anonymous
10/26/2025, 2:41:25 AM
No.107009364
[Report]
>>107009378
>>107009333
its not enough for 720 x 720 that's for sure, its looking a lot better using 8 steps total but I am using q4, umt5 16fp
>>107009364
Wan was trained primarily for 1280x720 and 720x1280, and Q4 is too low even for full res anyway
Anonymous
10/26/2025, 2:44:04 AM
No.107009387
[Report]
>>107009395
>>107009378
>low even for full res anyway
i think not :)
Anonymous
10/26/2025, 2:45:26 AM
No.107009395
[Report]
>>107009387
The blurry output certainly thinks so :)
Anonymous
10/26/2025, 2:47:15 AM
No.107009402
[Report]
>>107009505
>>107009357
Its pretty good once you figure it out how to use it
>>107009378
Mine
https://files.catbox.moe/qkl1j3.mp4
6 steps different settings
yours
https://files.catbox.moe/ir5zvj.mp4
8 steps only change to settings
mine looks a bit over cooked and i need to then adjust slight the cfg.
I'm using old 2.1 light lora at 5 strength in high, old 2.2 lora in low at 1.0
Anonymous
10/26/2025, 2:48:39 AM
No.107009412
[Report]
>>107009414
Anonymous
10/26/2025, 2:49:32 AM
No.107009414
[Report]
>>107009406
same prompt, same seed btw. Q4
>>107009412
ping ponged because why not?
Anonymous
10/26/2025, 2:52:53 AM
No.107009437
[Report]
Anonymous
10/26/2025, 2:53:07 AM
No.107009438
[Report]
>>107009378
>1280x720
and i can that res and it does look better even with q4, in fact at that res it looks amazing but it takes more time and i don't care for it. I'm just looking for better settings than what I'm using for the same res 720 x 720 since then i can do 81 frames no problems even with lots of lora. I can do the higher res but i'd need to use the context window nodes. Heh then i can gen really long videos but takes ages on a 3060
Anonymous
10/26/2025, 2:55:45 AM
No.107009450
[Report]
why isn't there any useful chroma lora on civitai?
Anonymous
10/26/2025, 3:00:43 AM
No.107009478
[Report]
>>107009512
>>107009378
I mean the thing is mate, my settings use cfg so it follows the prompt better and i can use negative prompt if i want. That is entirely the point, its still only 6 total steps so its plenty fast enough, i'm just trying to tweak the cfg so its not so over cooked. The only problem is that these settings I use have a very fine threshold with cfg between the high and low and changing them can cause it to be all messed up.
I don't trust these new lightx lora recently uploaded and I'm not the only one. The old ones still work better with the right settings.
>>107009402
>Its pretty good once you figure it out how to use it
>posts a gen with the long torso issue
>>107009478
>be absolute retard
>literally every single setting that you can fuck up you fuck up
>while also using a toy quant
>then you also for good measure use lightning models without cfg set to 1
reminder these are your average retards that give their opinions on what is good or bad and complain about their outputs having problems on /ldg/
Anonymous
10/26/2025, 3:09:51 AM
No.107009531
[Report]
>>107009512
>>then you also for good measure use lightning models without cfg set to 1
No is forced to do anything they are told and those retards can't even get a proper wan 2.2 version. using it still makes it much faster though which is why i use it. you couldn't get decent quality anything at 6 steps without it.
Anonymous
10/26/2025, 3:12:20 AM
No.107009544
[Report]
>>107009512
and i will make eat you shit you just posted within a few minutes once i adjust the cfg from 4 to 3.5 and load the q8 and up the resolution to full 720p then you can eat shit and die
>>107009505
>elbows too pointy
Can I use rule34 anime videos to train realistic LoRAs? After all I'm training just the motion, right?
Anonymous
10/26/2025, 3:13:46 AM
No.107009551
[Report]
>>107009512
its not just here its everywhere in ai
Anonymous
10/26/2025, 3:14:46 AM
No.107009555
[Report]
>>107009505
I agree that Chroma has its issues but to be fair to Chroma that's not the correct resolution for 1.75 ratio in a 1024p model.
Anonymous
10/26/2025, 3:14:51 AM
No.107009556
[Report]
>>107009550
I'm talking Wan btw
Anonymous
10/26/2025, 3:16:06 AM
No.107009563
[Report]
Anonymous
10/26/2025, 3:17:51 AM
No.107009576
[Report]
>>107009550
Hypothetically with a very diverse dataset and zero overlearning random noise, yes.
In practice the realism will be lower than training on realistic/mixed dataset of same quality. The difference may or may not matter too much though.
Note that I never trained video diffusion. Just guessing from what I know about images.
>>107009030
I'm using the WAN 2.2 I2V workflow from
>>107008900
but it's still fucking up the last frame or so when I set an end frame
Anonymous
10/26/2025, 3:27:09 AM
No.107009645
[Report]
>>107009472
Most of its community still thinks Flux is SOTA. And then you have all the Chinks who have successfully shilled Qwen.
Anonymous
10/26/2025, 3:28:29 AM
No.107009656
[Report]
Anonymous
10/26/2025, 3:28:51 AM
No.107009659
[Report]
>>107009764
>>107009641
Actually after checking the last few runs it does seem a little more stable but the first two got distorted for some reason
Anonymous
10/26/2025, 3:29:52 AM
No.107009666
[Report]
>>107009792
hatsune miku runs in from the left and waves hello.
new lora combo is smooth (MoE high + 2.2 lightning low)
Anonymous
10/26/2025, 3:30:06 AM
No.107009667
[Report]
>>107009547
use an upscaler (with sampler) to fix those lines nigga
>>107009472
There's several good loras for Chroma just civitai are cucks and take them down almost as immediately as they get uploaded.
Anonymous
10/26/2025, 3:32:36 AM
No.107009682
[Report]
>>107009668
why? Is Chroma against rules too? Did Flux krauts do this?
Anonymous
10/26/2025, 3:32:41 AM
No.107009684
[Report]
cfg in high adjusted to 3.5 from 4.0 seems better, i noticed when not using other lora's the frame rate is slower, so next video will need to be 32fps instead of 16fps. I will gen now with q8 @ 1280 x 720 81 frames, slightly different prompt to include synthetic led lighting so that sun light is hopefully not present and change the seed and run at only 4 steps down from 6 total, 2 steps each sampler just to see what it produces. I bet you it will be better than that load of bollocks suggested in the thread, why? because I genned thousands of wan videos by now using these settings.
Anonymous
10/26/2025, 3:36:12 AM
No.107009710
[Report]
>>107009722
>>107009668
Can you give an example of what you are referring to here?
Celeb loras? NSFW in general?
I agree civit sucks regardless though.
Anonymous
10/26/2025, 3:38:29 AM
No.107009722
[Report]
>>107009710
I lied. there's no such thing and chroma is shit model therefore no good lora
Reminder
>.\python_embeded\python.exe -m pip cache purge
Files removed: 368 (8132.7 MB)
Anonymous
10/26/2025, 3:40:06 AM
No.107009731
[Report]
Anonymous
10/26/2025, 3:40:18 AM
No.107009732
[Report]
>>107009822
Anonymous
10/26/2025, 3:44:38 AM
No.107009764
[Report]
>>107009641
>>107009659
because its fucking shit, every workflow shared on civ or where ever is a fucking meme, a big giant pile of pointless crock of shit. smashed together in a about 5 minutes using someone else's shit and it becomes some Frankenstein turd. Let me guess KJ wrapper nodes even though the guy tells you to not use the wrapper once native nodes are available. People still use the wrapper node workflows and its like OMFG these people are retarded.
Anonymous
10/26/2025, 3:45:59 AM
No.107009773
[Report]
>>107009822
Anonymous
10/26/2025, 3:46:30 AM
No.107009777
[Report]
>>107009836
>>107009726
8 GB wow... You're just gonna have to re download those files when doing anything to install or update shit don't you know that?
Anonymous
10/26/2025, 3:49:12 AM
No.107009792
[Report]
>>107009666
lmao the random icons.
Anonymous
10/26/2025, 3:49:26 AM
No.107009795
[Report]
>>107009901
>>107008846
The breasts sag and bounce are perfect. Is this with no lora?
Anonymous
10/26/2025, 3:54:06 AM
No.107009821
[Report]
>>107009897
Chroma UltraReal LoRA made for Flux sloppers-
https://www.reddit.com/r/StableDiffusion/comments/1o3bkgc/lenovo_ultrareal_chroma_lora/
Looks less realistic than original, but gives Chroma back the Flux aesthetic.
Anonymous
10/26/2025, 3:54:15 AM
No.107009822
[Report]
>>107009896
>>107009732
>>107009773
why would hair fly in the car?
Anonymous
10/26/2025, 3:56:02 AM
No.107009834
[Report]
>>107009895
>>107009777
A lot of things are redundant old versions that won't be needed anymore or from nodes that were deleted after testing
Anonymous
10/26/2025, 4:04:31 AM
No.107009885
[Report]
>>107009893
What's the Light2v setup for Wan 2.2 T2V? Is it just the distilled 2.2 loras or do you guys use the old ones at a higher weight like the i2v worklfow?
Anonymous
10/26/2025, 4:07:29 AM
No.107009893
[Report]
>>107009885
speed up loras are for losers
Anonymous
10/26/2025, 4:07:51 AM
No.107009895
[Report]
>>107009834
>a car interior that isn't nonsense
Impressive!
Anonymous
10/26/2025, 4:08:04 AM
No.107009896
[Report]
>>107009822
the car doesn't have any windows anon, pay more attention
Anonymous
10/26/2025, 4:08:12 AM
No.107009897
[Report]
>>107009472
>>107009821
most if not all flux loras work in chroma so try them.
Anonymous
10/26/2025, 4:08:32 AM
No.107009901
[Report]
>>107010036
Anonymous
10/26/2025, 4:12:46 AM
No.107009918
[Report]
>>107009932
>>107009836
you don't understand when you install requirements.txt it will check and re download most probably.
Anonymous
10/26/2025, 4:13:09 AM
No.107009920
[Report]
>>107009935
Anonymous
10/26/2025, 4:15:06 AM
No.107009929
[Report]
>i2v
>inpaint and fix the last frame
>flf2v
easy
Anonymous
10/26/2025, 4:15:18 AM
No.107009932
[Report]
>>107009836
>>107009918
unless its only removing old versions of files i guess, but i wouldn't bother for the sake of 8GB free disk space.
Anonymous
10/26/2025, 4:15:41 AM
No.107009935
[Report]
I'm trying to gen i2v with comfyui's example workflow for wan 2.2 and the quality is dogshit compared to with the lightx2 loras. Anyone else willing to test this please? I'm trying to use the workflow at the bottom of comfyui's wan example page here:
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
My light2x videos are generally fine, just had one that wasn't quite hitting the level of quality I wanted and back on 2.1 I could just switch to not using the loras and it would generally improve it, but 2.2 it's way worse. Way waaaay worse. Probably something wrong with my environment but would like someone else to test it or at least just tell me it works fine for them before I redo my venv.
Anonymous
10/26/2025, 4:22:35 AM
No.107009978
[Report]
>>107009641
the last or first frames being messed up is probably due to not enough steps because that is what is happening to me right now when dropping from 6 steps to 4 steps. 4 total steps isn't enough, splitting steps like this is also retarded but its what we do with these speed up lora's.
for wan2.2 without the minimum would be something like
25 total steps
10 high 15 low or the other way round I forgot, but there is a chart somewhere on reddit that explains. Its based on shift settings that's all i remember.
Anonymous
10/26/2025, 4:27:29 AM
No.107010005
[Report]
Anonymous
10/26/2025, 4:32:41 AM
No.107010036
[Report]
>>107009971
Comfyui local examples are purposely shittier so you pay API.
Anonymous
10/26/2025, 4:58:52 AM
No.107010167
[Report]
Anonymous
10/26/2025, 5:06:51 AM
No.107010206
[Report]
the anime girl stands up and dives into a swimming pool on the right.
Anonymous
10/26/2025, 5:08:51 AM
No.107010221
[Report]
>>107010298
>>107010043
this. only a matter of time before they start to fuck with the sampling
Anonymous
10/26/2025, 5:11:27 AM
No.107010233
[Report]
Is there a video gen for a <12GB vramlet?
Anonymous
10/26/2025, 5:13:20 AM
No.107010243
[Report]
Anonymous
10/26/2025, 5:16:14 AM
No.107010256
[Report]
>>107010259
>>107010234
you can use wan 2.2 by offloading
Anonymous
10/26/2025, 5:17:20 AM
No.107010259
[Report]
>>107010256
how much ram is recommended for offloading with that amount of vram?
Anonymous
10/26/2025, 5:18:37 AM
No.107010264
[Report]
Anonymous
10/26/2025, 5:19:44 AM
No.107010268
[Report]
>>107009971
It's dogshit because it uses just 20 steps when the official settings use 40 steps for i2v. In my own testing you need at least 35. Also splitting the steps at half is stupid as the high noise model doesn't need that many steps compared to the low noise model (use something like the MoE Ksampler and set the boundary to 0.900 for i2v and let it determine the split). Lastly, fp8 scaled is inferior to Q8.
>>107010043
>>107010221
I suppose it never occurred to you that those dipshits at comfyui never extensively test workflows/settings/models, but rather just cobble the nodes together and say it's good enough and leave it at that? Retards.
Anonymous
10/26/2025, 5:29:23 AM
No.107010307
[Report]
>>107010298
tbf when there are a bunch of redditors constantly screaming WEN COMFY NODE it may be hard to actually spend time testing first.
on the othe hand, it is a bit suspect when they deviate from the model developers' defaults.
Anonymous
10/26/2025, 5:34:35 AM
No.107010331
[Report]
>>107010298
Thanks anon I'll try higher steps and switch between models earlier.
Anonymous
10/26/2025, 5:41:31 AM
No.107010368
[Report]
Anonymous
10/26/2025, 5:47:52 AM
No.107010391
[Report]
>>107009971
I got you anon just just tweaking to hit the right spot and i'm getting close to the sweet spot. this i think was only 4 steps using old wan2.1 lora rank 64 iv2 in high at 5.00 strength and old wan2.2 low in low at 1.0 strength. But I'm using unconventional sampler settings that some anons here think is retarded because it uses cfg but honestly the gen time is not all that much more time and you get high quality with prompt adherence so its balance I seek and not just the fastest speed.