/ldg/ - Local Diffusion General
Anonymous
10/14/2025, 1:22:56 PM
No.106884402
>>106884374 (OP)
very shitty and retarded collage. besides the clever tobi gen, the rest are 1.5 tier slop.
Anonymous
10/14/2025, 1:23:55 PM
No.106884411
shit miku general of gay
Anonymous
10/14/2025, 1:25:19 PM
No.106884426
>>106887149
>>106887189
Anonymous
10/14/2025, 1:36:56 PM
No.106884501
>>106884547
Anonymous
10/14/2025, 1:41:08 PM
No.106884528
>>106884600
the blonde anime girl rolls down her car window and throws her tea at the people outside.
new lora, kino
Anonymous
10/14/2025, 1:43:14 PM
No.106884547
>>106884590
>>106884501
These look like 3D renders more than anything else. Is it deliberate?
Anonymous
10/14/2025, 1:46:04 PM
No.106884564
I've subjected myself to the Forbidden Dream where you can never wake up from and lived
Anonymous
10/14/2025, 1:48:21 PM
No.106884573
>>106884580
>>106886140
Worth upgrading from 64gb to 96gb ram, with a 5090?
Anonymous
10/14/2025, 1:49:39 PM
No.106884580
>>106884615
>>106884573
are you running out of memory and swapping? if not, more RAM doesn't automatically provide a performance boost.
Anonymous
10/14/2025, 1:50:55 PM
No.106884590
>>106884547
yes indeed 3d render
Anonymous
10/14/2025, 1:52:29 PM
No.106884600
>>106884528
the blonde anime girl sips her cup of tea as people outside the car are walking by.
Anonymous
10/14/2025, 1:54:58 PM
No.106884615
>>106884665
>>106884580
I am. But feels wasteful for such a "small" upgrade. There are no larger packs supported on the qvl list.
Anonymous
10/14/2025, 1:59:23 PM
No.106884652
>>106884677
the anime girl eats cereal from her cereal bowl, and gives a thumbs up.
neat, double bowl.
Anonymous
10/14/2025, 2:00:43 PM
No.106884660
>>106884687
what does wan VACE do, is it for replacing characters in scenes? if yes, can't that be done with wanimate already?
Anonymous
10/14/2025, 2:01:17 PM
No.106884665
>>106884998
>>106884615
I'm assuming DDR5, but email the motherboard vendor with the model of a 2x64GB kit and ask if it'll be supported. my board shipped with "max. 64GB" at launch, but they've increased that to 256GB now that there are larger DIMM sizes.
Anonymous
10/14/2025, 2:02:41 PM
No.106884677
>>106884652
for rife VFI is 47 or 49 better (recommends both)
this is with 49:
Anonymous
10/14/2025, 2:03:43 PM
No.106884687
>>106884660
editing, so they say
extending frames at front or back, extending the canvas, transferring poses, depth, outlines, and stuff I'm probably forgetting
Anonymous
10/14/2025, 2:07:26 PM
No.106884720
>>106884374 (OP)
>i didnt make it to the collage
shit collage kys
Anonymous
10/14/2025, 2:10:57 PM
No.106884737
the man with the beard on the left drinks his champagne, and the yellow cartoon character on the right throws 100 dollar bills into the air
Anonymous
10/14/2025, 2:27:46 PM
No.106884860
the white hair anime girl wearing a blindfold stands up and runs quickly to the right through a door in her house.
new lora, pretty good
Anonymous
10/14/2025, 2:34:26 PM
No.106884916
the anime girl is riding a blue skateboard around a racetrack.
Anonymous
10/14/2025, 2:43:10 PM
No.106884982
Anonymous
10/14/2025, 2:47:05 PM
No.106884998
>>106885148
>>106884665
Dear god going through asus support is absolutely horrible.
I ordered the 96gb pack.
Anonymous
10/14/2025, 2:51:40 PM
No.106885033
>>106885073
the camera pans out and an anime style Hatsune Miku puts her hand on the man's shoulder, as he kneels in the snow outside.
Anonymous
10/14/2025, 2:55:51 PM
No.106885057
>increase cfg in high noise
>the FFLF now ignores the last frame
Who do I blame for this?
Anonymous
10/14/2025, 2:58:16 PM
No.106885073
>>106885121
>>106885033
the camera pans out and an anime style Hatsune Miku gives the man a hug as he stands in front of neon signs.
new lora works very well. kijai says use his lora for high noise + the 2.1 i2v for low.
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/Wan22_Lightx2v
Anonymous
10/14/2025, 3:04:22 PM
No.106885121
>>106885154
>>106885073
one more test with this one.
the camera pans out and the man eats McDonalds french fries with his hands, that are on a round table.
Anonymous
10/14/2025, 3:08:21 PM
No.106885148
>>106884998
yeah, but not as bad when it's not an RMA. they didn't respond to my ticket, just closed it after they'd published the updated BIOS with a note about memory support.
Anonymous
10/14/2025, 3:09:05 PM
No.106885154
>>106885121
Pretty smooth
Anonymous
10/14/2025, 3:13:29 PM
No.106885181
Anonymous
10/14/2025, 3:16:51 PM
No.106885209
>tfw migu
:)
Anonymous
10/14/2025, 3:38:53 PM
No.106885370
>>106885346
did you use a style lora/tag for this? if so do you mind sharing
Anonymous
10/14/2025, 3:39:20 PM
No.106885375
>>106885484
>>106883990
If any anon here feels that doesnt belong remember that there is an actual place where we test these same models by actually using it creatively and having fun with them, no miku spamming or girl crunching pointing out at viewer spamming, instead actual good quality gens with same models, just saying. If you like being shilled, stay here.
Anonymous
10/14/2025, 3:40:26 PM
No.106885381
>>106885346
the more innocent she looks, the bigger the chance she'll push a finger up your bum while blowing
Anonymous
10/14/2025, 3:52:47 PM
No.106885484
>>106885375
>advertising sdg circlejerk bullshit general
>in ldg of all places, which was created specifically to get away from sdg cancer.
You fucking retard.
Anonymous
10/14/2025, 3:53:51 PM
No.106885492
>>106885505
>>106885512
These video gens with the fixed lora look like they're out of an entirely new model. local is eating good!
Anonymous
10/14/2025, 3:55:46 PM
No.106885505
>>106885541
Anonymous
10/14/2025, 3:56:34 PM
No.106885512
>>106885531
>>106885492
what's the gen time for single image?
Anonymous
10/14/2025, 3:58:55 PM
No.106885531
>>106885615
>>106885512
picrel is/was Chroma1-HD-Flash. Extracting text embeddings is 6-7 seconds, 8 steps heun is 9-10 seconds
Anonymous
10/14/2025, 3:59:57 PM
No.106885541
>>106885505
>I don't get it.
It's magic to me too. Ask the god of latents Kijai
Anonymous
10/14/2025, 4:02:24 PM
No.106885561
Anonymous
10/14/2025, 4:04:00 PM
No.106885584
>>106885615
Anonymous
10/14/2025, 4:06:57 PM
No.106885615
>>106885688
>>106885531
>>106885584
Still rocking the 1gb beast Emma lora?
Anonymous
10/14/2025, 4:13:46 PM
No.106885688
>>106885855
>>106885615
Rank 32 this time (213.8mb). I think it needs more baking. Ran out of captioned datasets and starting to update old ones
Anonymous
10/14/2025, 4:32:35 PM
No.106885855
>>106886502
>>106885688
Ah you captioned it properly this time. How does it benchmark against old version?
Anonymous
10/14/2025, 4:39:38 PM
No.106885903
30 minutes on a 12gb 3060 with wan 14b. 5b is 5 minutes but it sucks.
Anonymous
10/14/2025, 4:44:47 PM
No.106885958
>>106885980
>>106885829
it's been in comfy for months
'Tangential Damping CFG' node.
nobody knows how to use it or what exactly it does though so it was forgotten.
Anonymous
10/14/2025, 4:46:54 PM
No.106885980
>>106885958
it's a different thing, no?
Anonymous
10/14/2025, 4:57:06 PM
No.106886069
>>106886059
the wan2.2 i2v lora ive been using seems no different than the new one posted, so i dunno.
Anonymous
10/14/2025, 4:57:32 PM
No.106886073
>>106886059
Back from genning tiktok videos and 1grils? I dunno.
Anonymous
10/14/2025, 5:03:07 PM
No.106886111
>>106886059
we never left desu
Anonymous
10/14/2025, 5:06:19 PM
No.106886140
>>106886532
>>106886552
>>106884573
I upgraded from 64gb of ram to 96gb (also have a 5090) and it was worth it to me as I use the fp16 Wan models. The noticeable boost is that the model switching (high>low) takes like 30 seconds now as it doesn't constantly use all of my system ram the entire time. If you just use the Q8 models of Wan then 64gb is fine imo.
Anonymous
10/14/2025, 5:26:01 PM
No.106886292
Anonymous
10/14/2025, 5:41:56 PM
No.106886435
>>106886602
>>106886700
for wan if i wanted to create videos of a realistic character/celeb is it better to:
- generate a lora and use t2v
- use wan animate/vace to face swap
- something else?
Anonymous
10/14/2025, 5:48:02 PM
No.106886498
Anonymous
10/14/2025, 5:48:51 PM
No.106886502
>>106886529
>>106886998
>>106885855
Same captions. HD really dialed in the details
Anonymous
10/14/2025, 5:51:07 PM
No.106886526
Anyone have any joy running qwen on 12gb vram? i have watched a video and copied that claims it can but keeps failing.
Anonymous
10/14/2025, 5:51:15 PM
No.106886529
>>106886564
>>106886502
why does it look like that though
Anonymous
10/14/2025, 5:51:23 PM
No.106886532
>>106886700
>>106886140
does fp16 really make a difference over q8?
let's see how long fp16 takes, and if its worth the extra time
Anonymous
10/14/2025, 5:51:54 PM
No.106886538
>>106887145
>>106885829
every single one of these guidance methods are snake oil so nobody cares
Anonymous
10/14/2025, 5:53:01 PM
No.106886552
>>106886140
I'm running q8. I can run fp16, but with more loras it gets too heavy.
Anonymous
10/14/2025, 5:54:13 PM
No.106886564
>>106886661
>>106886529
You need to go outside and look at more women
It seems like unless you have a super computer local gen is a waste of time.
Anonymous
10/14/2025, 5:57:31 PM
No.106886602
>>106886661
>>106886435
Make a lora of them in SDXL/Chroma and then I2V those for the most consistency.
Anonymous
10/14/2025, 5:57:54 PM
No.106886607
>>106886568
4GB laptop gang represent
Wan 2.2 lightning lora testing results:
After trying many combinations including old loras and new rCM, I got the best results with:
New HIGH:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors
Old LOW:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors
4 steps, cfg 1, unipc
I think unipc does the most work here at low 4 steps to remove blur from motion, and the new HIGH lora does well to add more motion compared to the old HIGH lora.
Anonymous
10/14/2025, 6:00:08 PM
No.106886632
>>106886681
>>106886568
a high end consumer is not a 'super computer'.
you just need 3090/4090/5090 gpu and 64gb ddr5 ram. thats it.
Anonymous
10/14/2025, 6:01:45 PM
No.106886648
>>106886681
>>106886568
you can image gen with a 600-800$ pc and videogen with a 1-1.5k pc, if you dont have that by now while liking tech you got bigger problems
Anonymous
10/14/2025, 6:02:04 PM
No.106886655
>>106887607
An H200 is about $32k.
Solar panel setup to save on electricity are about $40k.
Man, I really want to access the full power of these models unrestricted..but the cost savings from using solar panels would be enormous..I could gen without ever worrying about how much electricity I use.
Such tough decisions.
Anonymous
10/14/2025, 6:02:22 PM
No.106886661
>>106886945
>>106886564
most women i see in real life are not violently saturated and pixelated
>>106886602
you mean generate pictures using sdxl/chroma then train a lora based on those? or generate an image in sdxl/chroma then use that for i2v?
Anonymous
10/14/2025, 6:04:52 PM
No.106886681
>>106886648
I should have clarified with image2video.
>>106886632
>you just need 3090/4090/5090 gpu and 64gb ddr5 ram
And that's faster than using a web gen?
Anonymous
10/14/2025, 6:07:00 PM
No.106886700
>>106886797
>>106886940
>>106886435
another option is i2v and switch the scene with the person (having them do whatever you want) through prompting.
>>106886532
A while back I tested Q8 and fp8 scaled against fp16 with the i2v models and the fp8 scaled just melted at higher video length (10 sec) while the Q8 was almost the same as the fp16 model (there were some additional minor movements in the background with the fp16 model). Honestly I just use the fp16 model because I can.
Anonymous
10/14/2025, 6:18:02 PM
No.106886797
>>106886700
i2v could work with a few tweaks
Anonymous
10/14/2025, 6:18:08 PM
No.106886798
Anonymous
10/14/2025, 6:19:01 PM
No.106886812
Anonymous
10/14/2025, 6:21:16 PM
No.106886827
>>106886841
>>106887696
https://www.reddit.com/r/StableDiffusion/comments/1o67ntj/comment/njfdj4a/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>A new somewhat interesting option is Nvidia's rCM distillation, which I also extracted as a LoRA:
>https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM
>It's for 2.1, so for 2.2 it needs to be used at higher strength, but it seems to have more/better motion and also bigger changes to the output than lightx2v, granted we may not have the exact scheduler they use implemented yet.
Anonymous
10/14/2025, 6:23:00 PM
No.106886841
>>106886864
Anonymous
10/14/2025, 6:24:47 PM
No.106886864
>>106886841
I'd like him to show a result with rCM and see how it compares to that new lora (and rCM is for t2v so it's irrelevant to I2V)
Anonymous
10/14/2025, 6:32:10 PM
No.106886940
>>106891081
>>106886700
nah you were right
Anonymous
10/14/2025, 6:32:26 PM
No.106886945
>>106886661
>most women i see in real life are not violently saturated and pixelated
You need to get out more
Anonymous
10/14/2025, 6:36:47 PM
No.106886991
Anonymous
10/14/2025, 6:37:37 PM
No.106886998
>>106887652
>>106886502
can you post the lora?
Anonymous
10/14/2025, 6:38:21 PM
No.106887007
>>106887229
Anonymous
10/14/2025, 6:46:51 PM
No.106887100
>>106885829
>oh wow guys, it looks better compared to SD3 at cfg 1
no shit, SD3 isn't supposed to run with no cfg
Anonymous
10/14/2025, 6:51:56 PM
No.106887145
>>106887204
>>106886538
>every single one of these guidance methods are snake oil so nobody cares
which is insane to me, like they invented CFG years ago and somehow it's the optimal method and we can't replace perfection, or else they were the luckiest motherfuckers on earth, or else it can actually be replaced at some point
Anonymous
10/14/2025, 6:52:12 PM
No.106887149
>>106884426
Local? For real? What settings did you use?
Anonymous
10/14/2025, 6:56:04 PM
No.106887189
>>106884426
looks like the bouncing boobs and leaning forward lora
Anonymous
10/14/2025, 6:57:27 PM
No.106887204
>>106887223
>>106887145
it's math. there is one true way of doing something as efficiently as possible. all these knock-offs are just papermaxxing someone's resume
Anonymous
10/14/2025, 6:59:10 PM
No.106887223
>>106887204
>there is one true way of doing something as efficiently as possible.
and usually it's not the first try it manages to do that, it gets improved over time, but not for CFG, they nailed that shit first try somehow
Anonymous
10/14/2025, 6:59:44 PM
No.106887229
>>106887007
>we’re also open sourcing a JAX nnx diffusion codebase that Willis @ma_nanye has been building
lol that's pretty jokes
Anonymous
10/14/2025, 7:02:47 PM
No.106887247
>>106887262
Lodestone is fucked since that new huggingface update lool
https://xcancel.com/bdsqlsz/status/1978114907598909724#m
Anonymous
10/14/2025, 7:04:23 PM
No.106887262
>>106887247
>Best-effort
>impactful work
the fuck?
best sampling settings for wan 2.2 WITHOUT lightning?
Anonymous
10/14/2025, 7:09:26 PM
No.106887311
>>106887275
the default euler ones
Anonymous
10/14/2025, 7:11:37 PM
No.106887332
>>106887378
>>106887275
uni_pc/beta for anime
deis/beta for realism
what i found personally the best.
Anonymous
10/14/2025, 7:16:28 PM
No.106887378
>>106887402
Anonymous
10/14/2025, 7:18:44 PM
No.106887402
>>106887378
1 cfg if using lightning
5 cfg without
5 shift for both
Anonymous
10/14/2025, 7:26:42 PM
No.106887477
>>106886613
>New HIGH:
>https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors
I'm not a big fan of that lora, yeah kijai fixed it with his own format, but I'm still getting ghosting shit, the motion is there but something else is gone (for euler at least, it's a bit better on uni_pc like you said)
Anonymous
10/14/2025, 7:34:10 PM
No.106887556
>>106887662
>>106889683
Chroma doesn't know how to type "fägäri" because ä. It's over!
Anonymous
10/14/2025, 7:37:51 PM
No.106887607
>>106887640
>>106886655
Unless you live in an actual hellhole (Cali, prepare to be arrested for using a local AI model) solar panels will never pay back over normal electricity prices. Solar panels are for paying for electrical independence, for for saving money.
Anonymous
10/14/2025, 7:40:05 PM
No.106887640
>>106887694
>>106887607
from what i hear some places give you big monetary incentives to buy solars which might make it worth but yes
Anonymous
10/14/2025, 7:40:48 PM
No.106887652
>>106887693
>>106891104
Anonymous
10/14/2025, 7:41:18 PM
No.106887662
Anonymous
10/14/2025, 7:43:16 PM
No.106887693
>>106887652
based emma enjoyer, danke.
Anonymous
10/14/2025, 7:43:17 PM
No.106887694
>>106887771
>>106887640
No, there is no payback and it's all a scam, even with stealing the money of your fellow taxpayers you're lucky to break even over 30 years -- assuming they last that long -- and it's pure sunk cost especially understanding that every 10 years there are major breakthroughs in solar panel and battery technology. Again, they're for energy independence and off-grid living, not a realistic way to save money. You'd be way better off and it'd be way cheaper to spend money on insulation with literally payback within several years.
Anonymous
10/14/2025, 7:43:23 PM
No.106887696
>>106887711
>>106886827
>New High + rCM Low (8 steps)
Neat combo for motion, but I think the new High introduced some gamma issues. Everything's been a touch darker since I've been using it.
Anonymous
10/14/2025, 7:44:35 PM
No.106887711
>>106888797
>>106887696
>8 steps
nahh, 4steps or bust, they managed to nail it on wan 2.1 I expect the same for wan 2.2
I'm trying to do a thing where we see a full-body illustration on one side of the image, and multiple close-ups of different parts of the girl's body on the other side of the image.
>ass focus, breast focus, crotch focus
>(full body:2.0), lower body, upper body
>close-up, multiple views
It only works 10% of the time.
Any tips on how to make this work more consistently? What kind of prompt would I use? Maybe there's already a lora for this?
Ordered my Spark, I think my first experiment will be trying to make a long video Wan LoRA.
Anonymous
10/14/2025, 7:48:32 PM
No.106887756
>>106887728
train a lora for it, you are on /g/
Anonymous
10/14/2025, 7:48:33 PM
No.106887757
>>106887728
first thing I would try is changing it to (multiple views:2.0)
Anonymous
10/14/2025, 7:49:32 PM
No.106887771
>>106887804
>>106887694
I pay $500/mo for electric. That's $6k a year. After 5 years, I'd be saving money if I can get my electric cut in half. They have 10-15 year warranties.
>You'd be way better off and it'd be way cheaper to spend money on insulation
All the insulation in the world isn't going to reduce how much electricity my gpu + server uses when running 24/7.
Anonymous
10/14/2025, 7:50:15 PM
No.106887777
>>106887789
>>106887752
cool bait anon
Anonymous
10/14/2025, 7:51:18 PM
No.106887789
>>106887814
>>106887777
Which part, that I got a Spark or that it can train a long frame LoRA?
Anonymous
10/14/2025, 7:51:56 PM
No.106887796
>>106887728
gen x amount of images and take the one that has good composition and put that one into ControlNet with depthanythingv2. This way you'll almost always get the thing you want. You can also use gimp/photoshop to make the original gen for depth
Anonymous
10/14/2025, 7:52:26 PM
No.106887804
>>106887771
>15 year warranty
Well you're in luck anon I have a warranty for a crypto coin to sell you.
Anonymous
10/14/2025, 7:53:00 PM
No.106887809
>>106887752
you're going to train a lora on a machine with just ram? lol
Anonymous
10/14/2025, 7:53:31 PM
No.106887814
>>106887825
>>106887789
spark is for llm's. the vram performance is worse than a GTX 1080. no idea what you're doing but good luck.
Anonymous
10/14/2025, 7:54:28 PM
No.106887825
>>106887854
>>106887814
>LLMs
You mean AI models that are also using Tranformers? Are you technically illiterate?
But yes, I'll also use it to finetune a Gemma model on smut.
Anonymous
10/14/2025, 7:55:15 PM
No.106887841
>>106887890
>>106887752
You will regret it. A Wan 2.2 i2v lora with just 100 640x360 takes almost 6 days on an Ada 6000, and that's far more powerful than a Spark.
Anonymous
10/14/2025, 7:56:13 PM
No.106887854
>>106887864
>>106887825
Here's your (You). Not wasting my time on you.
Anonymous
10/14/2025, 7:56:52 PM
No.106887864
>>106887878
>>106887854
You're wasting my time because you think LLMs and Diffusion models aren't fundamentally the same.
Anonymous
10/14/2025, 7:57:46 PM
No.106887878
>>106887864
>LLMs and Diffusion models are fundamentally the same
ohh that's a quality bait not gonna lie
Anonymous
10/14/2025, 7:58:25 PM
No.106887885
what a special little guy
>>106887752
>Ordered my Spark
>GB10 delivers up to 1 PFLOP of sparse FP4 tensor performance, placing its AI capability roughly between that of an RTX 5070 and 5070 Ti
Anonymous
10/14/2025, 7:58:57 PM
No.106887890
>>106888403
>>106887841
Regret what? Having a machine training over time with 128 GB of available RAM allowing for much larger model training? No wonder nothing is ever trained or finetuned because retards need 5 second gratification on literally everything. I don't care if it takes 2 months to train if it results in long videos.
Anonymous
10/14/2025, 7:59:41 PM
No.106887902
Poorfags absolutely malding
Anonymous
10/14/2025, 8:00:03 PM
No.106887907
>>106887888
Yeah I think the key point is it has 128 GB of RAM which is the bottleneck for most training especially video models.
Anonymous
10/14/2025, 8:00:50 PM
No.106887924
>>106886613
Their new lora model isn't great desu, they had much more success with the 2.2 T2V one
Anonymous
10/14/2025, 8:02:58 PM
No.106887950
>bro why buy a terminal in 1987, don't you realize it'll take you 6 months to code anything?
Anonymous
10/14/2025, 8:12:06 PM
No.106888059
ey gais ordered my spunk, gonna do some llms and ais on it ... k cya
Anonymous
10/14/2025, 8:12:28 PM
No.106888066
>>106888291
>>106887888
https://www.reddit.com/r/LocalLLaMA/comments/1lk5te5/nvidia_dgx_spark_whats_the_catch/
>It was never meant as a stand-alone product for inference or training beyond testing whether what you're trying to do will actually work.
Anonymous
10/14/2025, 8:13:56 PM
No.106888080
I tried to trick it
Anonymous
10/14/2025, 8:15:03 PM
No.106888089
>>106888124
Kill cogsuckers. Behead cogsuckers. Roundhouse kick a cogsucker into the concrete. Slam dunk cogsucker's babies into the trashcan. Crucify filthy cogsuckers. Defecate in a cogsucker's mouth. Launch cogsuckers into the sun. Stir fry cogsuckers in a wok. Toss cogsuckers into active volcanoes. Urinate into a cogsucker's face. Judo throw cogsuckers into a wood chipper. Twist cogsuckers heads off. Report cogsuckers to the IRS. Karate chop cogsuckers in half. Curb stomp cogsucker. Trap cogsuckers in quicksand. Crush cogsuckers in the trash compactor. Liquify cogsuckers in a vat of acid. Smack cogsuckers. Dissect cogsuckers. Exterminate cogsuckers in the gas chamber. Stomp cogsucker heads with steel toed boots. Cremate cogsuckers in the oven. Lobotimize cogsuckers. Mandatory prison sentences for cogsuckers. Grind cogsuckers in the garbage disposal. Drown cogsuckers in acid. Vaporize cogsuckers with thermite. Kick old cogsuckers down the stairs. Feed cogsuckers to lions. Slice cogsuckers heads off with laserbeams.
Anonymous
10/14/2025, 8:15:27 PM
No.106888094
>>106888189
>>106887888
isn't that super close to 6000 Ada performance?
Anonymous
10/14/2025, 8:17:59 PM
No.106888124
>>106888410
>>106888089
>cogsuckers
but CogVideo is a fine model though :(
https://github.com/zai-org/CogVideo
Anonymous
10/14/2025, 8:23:28 PM
No.106888189
>>106888094
That doesn't matter they can't afford and it's not 10 pflops for $350 with 512 GB of VRAM.
>>106887275
What hardware are you running? Curious what you need to gen without the lightning loras
Anonymous
10/14/2025, 8:26:37 PM
No.106888228
Anonymous
10/14/2025, 8:31:46 PM
No.106888291
>>106888435
>>106888066
>The NVIDIA RTX 3090 offers approximately 142 TFLOPS
>meant to be
According to who? There are people who still buy 3090s for "cheap" finetuning and it's operates at much higher wattage and slower speeds. People are ultimately mad the Spark is overpriced which is true, but if it was $1000 there be a 2 year waiting period to buy one.
Anonymous
10/14/2025, 8:35:19 PM
No.106888330
>>106888720
https://xcancel.com/Alibaba_Qwen/status/1978150959621734624#m
I have a feeling they'll use those new VL models as text encoder for their future video/edit models
Anonymous
10/14/2025, 8:36:16 PM
No.106888347
>>106888199
regular porn clips?
everything looks better without using lightning loras obviously, unless you're doing boring poses or something.
Anonymous
10/14/2025, 8:41:58 PM
No.106888403
>>106888428
>>106887890
>doesn't care if it takes two months
Uhuh. You will care. Enjoy your gold painted brick.
Anonymous
10/14/2025, 8:42:09 PM
No.106888410
>>106888124
What really happened to cog and mochi? They could of been on par with wan if they weren't so slow. Even ltxv kinda drifted off. Who knows, maybe they're all cooking behind the scenes.
>>106888403
Computers aren't run by hand crank any more grandpa, it can literally sit in the back of my office doing whatever I want freeing up my main workstation to do whatever I want. And it's a hilarious assertion, it taking two months really doesn't matter especially compared to what you do: nothing. Maybe someone that isn't you will finetune something lmao.
Anonymous
10/14/2025, 8:45:24 PM
No.106888435
>>106888448
>>106888291
>but if it was $1000 there be a 2 year waiting period to buy one.
Only because nvidia doesn't give a shit. Notice how Apple, as gay as they are, actually are able to meet demand for a product, even at launch?
Again, if it's not a datacenter GPU, nvidia does not really give a shit. The Spark is a "look what I can do" toy for twitter idiots, most of who got theirs for free or on loan.
Anonymous
10/14/2025, 8:46:36 PM
No.106888448
>>106888435
>Apple
too bad it can't do anything except specific inference workflows lmao, at least a Spark can run diffusers
Anonymous
10/14/2025, 8:48:57 PM
No.106888472
>>106888483
>>106888495
>>106888428
Sounds like you never trained a LoRA before. You'll be in for a surpsrise when two months later you try it and realize it's overbaked, underbaked, you forgot a key setting, etc... and now you have to wait another two months.
Just return the fucking thing and buy a 5000 Pro, or a pair of Quadro 8000, which BTW will be faster than the Spark, since they have much faster memory.
Anonymous
10/14/2025, 8:50:05 PM
No.106888483
>>106888472
Apparently you have never trained a LoRA because you think it takes 2 months to train one. Even if it was 1/4 the speed of a 4090, it's still literally several days, not several months.
Anonymous
10/14/2025, 8:50:50 PM
No.106888489
>>106888718
Question, how hard is video gen, ComfyUI, WAN to do/learn?
I've been screwing around for a while using Forge but haven't tried Comfy. Can i keep using Stability Matrix?
Anonymous
10/14/2025, 8:51:16 PM
No.106888495
>>106888522
>>106888472
Let anon bask in their stupidity. Some people need to learn the hard way.
Anonymous
10/14/2025, 8:52:51 PM
No.106888522
>>106888495
As we all know a LoRA is like baking a cake and there's no way to test its performance mid training, literally impossible, you just train for 2 months straight and cross your fingers. Every day it's more and more obvious why the LoRA and finetune ecosystem has dried up.
Anonymous
10/14/2025, 8:53:04 PM
No.106888524
>>106888642
>>106888428
Sorry I totally missed the part where you hilariously said you were going to finetune a model. Yes, you do that on your Spark, everyone else is idiots for using an H100 cluster.
https://huggingface.co/quarterturn/models
Now post your huggingface page.
Anonymous
10/14/2025, 8:55:12 PM
No.106888546
>>106888565
I can't believe 1.5 is still better than 2.2 and there is no 3.0 after all this time.
ALso is this shit finally useful for making comic books or it's just as useless?
Anonymous
10/14/2025, 8:55:52 PM
No.106888555
>>106888714
Anonymous
10/14/2025, 8:56:41 PM
No.106888565
>>106888546
>I can't believe 1.5 is still better than 2.2 and there is no 3.0 after all this time.
>1.5
you mean wan 2.1?
Anonymous
10/14/2025, 8:57:41 PM
No.106888581
Anonymous
10/14/2025, 9:03:02 PM
No.106888642
>>106888524
Sorry, you’ll need to post a picture of your face next to your monitor showing that page with a piece of paper that says “I’m not doing stolen valor.”
>everyone else is idiots for using an H100 cluster.
That’s not my assertion, but that’s par for the course from someone like you arguing in bad faith.
Thanks for conceding that you can fine-tune a model on a Spark. Now you’ll have to concede that a Spark performs roughly like a 3090 which people do use to fine-tune models like Flux. In another thread you’d probably be calling that a “good GPU.”
The only person pretending a Spark is supposed to be an H100 is you, and that’s disingenuous. An H100 costs more than a dollar an hour to rent meaning if you mess up a run, you literally just burn money. My Spark doesn’t cease to exist after a bad fine-tune.
So then you have to move the goalposts to “wasted time,” as if that even matters. People spend time on hobbies all the time. This isn’t a hand-cranked computer it runs without me touching it. Are you going to tell someone growing plants they’re wasting time because a seedling died after three months? Are you a real person, or just incredibly sad?
Anonymous
10/14/2025, 9:09:44 PM
No.106888714
Anonymous
10/14/2025, 9:10:01 PM
No.106888718
>>106888489
i don't know anything about stability matrix but if you have half a brain and can read documentation you can pick it up in a few weeks
Anonymous
10/14/2025, 9:10:13 PM
No.106888720
>>106889197
>>106888330
Didn't they already use VL to auto-caption their training set for Qwen-Image?
Actually on that note has anybody outside of Alibaba tried using VL for captioning?
Anonymous
10/14/2025, 9:13:02 PM
No.106888753
>>106888199
you're just increasing the steps which doesn't change the hardware requirements, it just takes longer
Anonymous
10/14/2025, 9:17:09 PM
No.106888797
>>106887711
I was only getting videos full of artifacts at four steps, but yeah, I'm sure they'll figure it out.
Anonymous
10/14/2025, 9:17:48 PM
No.106888807
>>106888835
>>106888539
Almost completed Saika dataset earlier. Same difficulty as with k-pop stars. Every photo is photoshopped to hell so loras produce alien face mongoloids
>>106888807
oh hi! are you the hailey rose trainer?? what else did you cook up my man!
>>106884374 (OP)
Please can anyone help, I'm trying to generate batches of images with the same seed for the wildcard generator and also the ksampler.
This so far seems impossible. (generate every n images change the seed) Even with a counter node I can't do it because you can't reset the counter.
You also cant generate a seed with a node then use that seed for the ksampler or the wildcard processor as when you change it it doesn't register until you start and stop generating.
Anonymous
10/14/2025, 9:32:05 PM
No.106888977
>>106889027
>>106888835
nah different guy
Anonymous
10/14/2025, 9:37:33 PM
No.106889038
>>106889148
>>106888911
Have you tried the LatentBatchSeedBehavior node?
Anonymous
10/14/2025, 9:38:02 PM
No.106889045
>>106889148
>>106888911
post workflow
Anonymous
10/14/2025, 9:42:39 PM
No.106889089
>>106889148
>>106888911
Use code and the local API. Not everything is a nail to be screwed.
Anonymous
10/14/2025, 9:48:04 PM
No.106889131
>>106889164
Anonymous
10/14/2025, 9:50:40 PM
No.106889148
>>106889163
>>106889198
>>106889038
>>106889045
>>106889089
literally just how can i change the seed and thereby the wildcard generated every n generations on impactwildcard processor.
Why did they invent this "programming" language without loops...
Anonymous
10/14/2025, 9:50:48 PM
No.106889149
>>106888428
2 months for a cooked lora, meanwhile your electricity bill jumps 20%
Anonymous
10/14/2025, 9:51:56 PM
No.106889163
>>106889148
>guys how can I programmically do something extremely specific without programming
Anonymous
10/14/2025, 9:52:09 PM
No.106889164
Anonymous
10/14/2025, 9:53:35 PM
No.106889184
Anonymous
10/14/2025, 9:53:45 PM
No.106889188
Where can I adjust extra parameters in OneTrainer.. betas and such?
Anonymous
10/14/2025, 9:54:29 PM
No.106889197
>>106888720
I sure hope not, this shit is ass
Anonymous
10/14/2025, 9:54:31 PM
No.106889198
>>106889148
i dont know that meme node, just use fixed seed, queue X gens, change seed, queue more?
Anonymous
10/14/2025, 9:54:35 PM
No.106889200
>>106889237
>>106889027
ooh i like this
Anonymous
10/14/2025, 9:57:02 PM
No.106889221
>>106889027
Damn, this is good.
Anonymous
10/14/2025, 9:57:08 PM
No.106889222
Anonymous
10/14/2025, 9:59:04 PM
No.106889237
Can we gen our way out of Chinese Communist engineering dominance?
Anonymous
10/14/2025, 10:13:58 PM
No.106889373
>>106889518
>>106889266
>Chinese Communist engineering dominance
>dominance
what dominance? Sora 2 destroyed everything
Anonymous
10/14/2025, 10:14:20 PM
No.106889379
>>106889266
the US is cooked, might as well start learning Mandarin right now to get ahead of the curve
Anonymous
10/14/2025, 10:16:32 PM
No.106889400
>>106889266
The way NVIDIA is operating and the degree to which the US economy is dependant on it means we are begging for China to gain complete leverage over us.
Anonymous
10/14/2025, 10:22:14 PM
No.106889461
>>106889480
>>106889266
move Taiwan to Hawaii, just uproot the entire island
Anonymous
10/14/2025, 10:24:50 PM
No.106889480
>>106889491
>>106889461
>tsmc gets killed by a volcano or tsunami
epic
Anonymous
10/14/2025, 10:26:29 PM
No.106889491
>>106889480
the alternative is move outside of Cali but then it would just be gay
Anonymous
10/14/2025, 10:28:22 PM
No.106889504
>>106888835
my man. did this film style lora for WAN
Anonymous
10/14/2025, 10:29:33 PM
No.106889516
>>106888904
tommy king body style lora
Anonymous
10/14/2025, 10:29:54 PM
No.106889518
>>106889533
>>106889554
>>106889373
>Sora 2 destroyed everything
only for a short time. now, sora 2 is as boring as the first. because of censorship
China in charge:
>snakeoil paper spam
>slop datasets
>bench/bloatmaxxing
US in charge:
>censorship retardation
>saasfagging
>walled gardens
we lose no matter what. slavs should save us because they don't give a fuck
Anonymous
10/14/2025, 10:31:30 PM
No.106889533
>>106889554
>>106889518
yeah it's completely cucked, but you can make state of the art "cat working at mcdonalds" videos i guess
Anonymous
10/14/2025, 10:31:45 PM
No.106889538
>>106889556
>>106889520
>slavs should save us because they don't give a fuck
but slaves are fucking retarded (djokovic is still my goat though)
Anonymous
10/14/2025, 10:31:48 PM
No.106889541
>>106889567
Anonymous
10/14/2025, 10:32:14 PM
No.106889551
Anonymous
10/14/2025, 10:32:29 PM
No.106889552
flux custom character lora (merging actresses).
>>106889027
this is dope as hell btw
Anonymous
10/14/2025, 10:32:46 PM
No.106889554
>>106889975
>>106889518
>>106889533
you can still make edgy shit though
>>>/wsg/5999084
>>>/wsg/5999085
>>>/wsg/5999088
Anonymous
10/14/2025, 10:32:52 PM
No.106889556
>>106889538
>slaves are fucking retarded
ever hear of retard strength? also all the papers that actually matter have slavs credited. they are the only ones not chasing bullshit
Anonymous
10/14/2025, 10:33:53 PM
No.106889567
>>106889541
tired of you spamming this shit. fuck comfyui. migu should not be advertising such slop
Anonymous
10/14/2025, 10:39:54 PM
No.106889623
>>106889685
Anonymous
10/14/2025, 10:46:46 PM
No.106889683
Anonymous
10/14/2025, 10:46:57 PM
No.106889685
>>106889766
>>106889623
i don't get it
Anonymous
10/14/2025, 10:52:45 PM
No.106889734
>>106889736
>>106889753
>>106889520
https://github.com/ai-forever/Kandinsky-5
some russian bank is making a video
model and they claim it outperforms wan
Anonymous
10/14/2025, 10:53:12 PM
No.106889736
Anonymous
10/14/2025, 10:55:03 PM
No.106889753
>>106889734
>2B parameters
>It outperforms larger Wan models (5B and 14B)
Riiiiiiight......
Anonymous
10/14/2025, 10:55:20 PM
No.106889758
>>106889807
>Kijai
[+4] 19 points 10 hours ago
I haven't really tested that much lately, I don't like the 2.2 Lightning LoRAs personally as they affect the results aesthetically (everything gets brighter), so for me the old 2.1 Lightx2v at higher strength is still the go-to.
A new somewhat interesting option is Nvidia's rCM distillation, which I also extracted as a LoRA:
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM
It's for 2.1, so for 2.2 it needs to be used at higher strength, but it seems to have more/better motion and also bigger changes to the output than lightx2v, granted we may not have the exact scheduler they use implemented yet.
some have said this one for low noise lora with the new 2.2 one for high is a good combo, gonna give it a try.
Anonymous
10/14/2025, 10:56:11 PM
No.106889766
>>106889685
That's perfectly fine
Anonymous
10/14/2025, 10:59:32 PM
No.106889807
>>106889831
>>106889849
>>106889758
>It's for 2.1, so for 2.2 it needs to be used at higher strength
a shame he didn't say the strength value to use, I'm too lazy to test that out and find the good spot only to find out it's inferior or some shit
Anonymous
10/14/2025, 11:00:39 PM
No.106889815
Anonymous
10/14/2025, 11:01:43 PM
No.106889831
>>106889807
>I've tried all of these in a few combos in the past hour on my 5090: new "moe distill i2v" that dropped earlier today, your MoE 2.2 i2v high you linked above, nvidia rcm, original 2.2 lightning i2v, 2.1 lightning i2v...
My best results by far so far are the version of the 2.2 i2v MoE distill lightning lora HIGH you linked above in high, and the nVidia rcm rank148 in low.
It's even better if you bump up the steps to like double, but that goes for all of these with motion...
gonna try the low and 2.2 kijai lora at 1 str first just to see what it does. otherwise 2.1 low works fine.
Anonymous
10/14/2025, 11:03:38 PM
No.106889849
>>106889807
the low model is basically the same as 2.1 so you don't need to increase the strength for low
Anonymous
10/14/2025, 11:11:31 PM
No.106889915
>>106889952
>>106890017
test prompt: 2.2 kijai lora (high) 1 str, nVidia rcm rank148 lora 1 str for low
the camera pans out and the man shakes hands with an anime style Hatsune Miku.
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/Wan22_Lightx2v
works, need to test other prompts though
scabPICKER
10/14/2025, 11:15:31 PM
No.106889950
Anonymous
10/14/2025, 11:15:49 PM
No.106889952
>>106889999
>>106889915
the anime girl opens a white pizza box and eats a slice of pizza.
looks smooth to me. using rife interpolation node (2x).
Anonymous
10/14/2025, 11:17:05 PM
No.106889965
>>106889520
china is just racing to get ahead, but they have momentum and absolute staying power
the us is only good at short bursts due to the shit money cycle and short-term ROI requirements for everything
Anonymous
10/14/2025, 11:17:34 PM
No.106889969
>>106890091
Anonymous
10/14/2025, 11:18:08 PM
No.106889975
>>106889990
>>106889554
yeah real edgy watermarked shit
Anonymous
10/14/2025, 11:18:48 PM
No.106889982
>>106890144
some experiments with wan speed up loras. same seed. i wanted to see what differences come up when these loras are in place as i use them a lot. i think the difference is less dramatic when the prompt isn't complex so i tried to put a twist on it. there's no other loras, or upscale applied.
prompt:
>a dslr photograph of a 32yo blonde woman, a demon, a succubus, her eyes are red, and her skin is pitch black, lined with veins, standing in a graffiti filled alleyway. She wears a black thong with a short white crop top, with a relaxed yet deliberate stance. she has a sexy presence, strong cinematic lighting.
res_2s/beta57 for both.
Anonymous
10/14/2025, 11:20:14 PM
No.106889990
>>106890010
>>106889975
>moves the goalpost
you can remove the watermark if you go for the pro subscription mode (yeah I know that sucks but it's possible though), and there's watermark removers on the internet and they work fine as well
Anonymous
10/14/2025, 11:21:51 PM
No.106889999
>>106890011
>>106889952
the anime girl opens a mcdonalds paper bag and eats a mcdonalds cheeseburger.
yeah, this new combo works pretty well. still need to test more.
Anonymous
10/14/2025, 11:22:54 PM
No.106890010
>>106890055
>>106889990
>indian guy screaming at computer
whoa crazy stuff
Anonymous
10/14/2025, 11:22:54 PM
No.106890011
>>106890038
>>106890569
>>106889999
helps if I add the video.
Anonymous
10/14/2025, 11:23:19 PM
No.106890017
prompt:
>3dcg, a blue haired 19yo woman in a sailor costume sits on an inflatable whale floating in space
so we get a lack of understanding of some concepts with short prompts
>>106889915
thanks for this. gonna do some more testing after these
Anonymous
10/14/2025, 11:24:58 PM
No.106890038
>>106890069
>>106890011
the anime girl runs to the right very fast out a door in the white room, and closes it.
rife vfi for interpolation (2x, so 32fps), seems faster than film vfi but that's better quality, this is for quicker gens.
Anonymous
10/14/2025, 11:26:05 PM
No.106890050
I don't get this Context Windows (Manual) node. seems like it does the exact same shit whether or not I have it enabled
Anonymous
10/14/2025, 11:27:02 PM
No.106890055
>>106890010
the computer explodes though! and the epstein files! and the heckin vegetables jokes makin fun of disabled people though!
Anonymous
10/14/2025, 11:28:19 PM
No.106890069
>>106890154
>>106890038
the anime girl gets in a teal colored convertible car and drives away out of the white room, through a garage door.
okay this combo is really good imo. 1 strength for both, base wan 2.2 i2v template in comfy, with the two loras added (2.2 kijai for high noise, nvidia one by kijai for low noise).
Anonymous
10/14/2025, 11:29:55 PM
No.106890091
Anonymous
10/14/2025, 11:30:28 PM
No.106890096
prompt:
>In the still frame, a lone, ancient tree stands tall in the center of a misty marsh. Its twisted branches reach out against a cloudy, overcast sky, while the soft, reflective waters surround its base, creating a mirror-like surface. The muted tones of greens and grays evoke a quiet, haunting atmosphere, with wisps of fog drifting through the scene, emphasizing the solitude and resilience of the tree amid the vast, tranquil expanse of the marsh.
Anonymous
10/14/2025, 11:35:25 PM
No.106890133
>>106890163
last one. a full prompt for our 3DCG to see if that makes a difference.
prompt:
>In a surreal, otherworldly scene rendered in stunning 3D computer graphics, a 19-year-old woman with vibrant blue hair sits gracefully atop an enormous inflatable whale. She is dressed in a classic sailor costume, complete with a navy blue and white striped top, a sailor collar, and a small anchor emblem. The inflatable whale, with its glossy, smooth surface and cheerful expression, floats effortlessly through the vast emptiness of space. Surrounding her, the cosmos stretches infinitely—stars shimmer softly in the distance, and faint nebulae cast a gentle glow, creating a mesmerizing contrast between the playful innocence of her attire and the boundless mystery of the universe. The scene exudes a whimsical, dreamlike quality, blending childhood nostalgia with cosmic wonder in a visually striking, cinematic tableau.
all same seed: 671162467703350
scabPICKER
10/14/2025, 11:36:48 PM
No.106890144
>>106890172
>>106889982
What's the process of just genning one image like, on Wan?
Anonymous
10/14/2025, 11:37:45 PM
No.106890154
>>106890167
>>106890069
the man runs out the door as the camera tracks him, down the street of New York at night.
scabPICKER
10/14/2025, 11:38:11 PM
No.106890163
>>106890172
>>106890175
>>106890133
Also, why does a lora speed things up? That's surprising to me, all the loras I ever used at best were the same speed.
scabPICKER
10/14/2025, 11:39:12 PM
No.106890167
>>106890175
>>106890154
>he turns into another person
why can't ai into likenesses?
Anonymous
10/14/2025, 11:39:59 PM
No.106890172
>>106890390
>>106890163
they are speed up loras, they let you generate an image with fewer steps, but as you can see they skip details and in some cases prompt adherence.
>>106890144
set frames to 1 instead of 81 or whatever
Anonymous
10/14/2025, 11:40:15 PM
No.106890175
Anonymous
10/14/2025, 11:41:25 PM
No.106890188
>>106890226
>>106890358
kek
Anonymous
10/14/2025, 11:44:12 PM
No.106890221
>>106889266
fuck off glowies, no one likes you
Anonymous
10/14/2025, 11:44:37 PM
No.106890226
>>106890188
the person in the blue coat throws a molotov at the van in front of him, causing it to explode into flames.
So what's the peak UI atm? Is it still Comfy or did another come on top?
scabPICKER
10/14/2025, 11:57:34 PM
No.106890358
Anonymous
10/15/2025, 12:00:28 AM
No.106890389
>>106890242
comfy since day 1, it can't be topped due to the flexible customizable nature of the program as well having only the smartest people in the world contributing to it's source code. giving all of that to us for free? is... man..., the guy responsible needs a nobel peace prize.
scabPICKER
10/15/2025, 12:00:28 AM
No.106890390
>>106890172
They both added Earth. Or at least a planet.
Anonymous
10/15/2025, 12:01:28 AM
No.106890399
the large anime girl on the billboard waves hello, as cars drive by.
2.2 kijai lora (new) and 2.1 i2v rank64 lightx2v lora (old, but works fine)
the cars somehow didnt crash!
scabPICKER
10/15/2025, 12:01:29 AM
No.106890400
>>106890491
>>106890242
For me, it's stable-diffusion.cpp. But it has a loooong ways to go to be imo really past alpha. Including incomplete parameter validation bugs lol
scabPICKER
10/15/2025, 12:05:25 AM
No.106890427
scabPICKER
10/15/2025, 12:07:14 AM
No.106890443
Anonymous
10/15/2025, 12:08:54 AM
No.106890456
>>106890242
what model/lora did you use for that goblin? mine always come out with normal faces/noses
Anonymous
10/15/2025, 12:11:19 AM
No.106890480
>>106890501
>>106890544
the anime girl is playing her guitar on stage.
2.2 kijai + older 2.1 i2v lora
Anonymous
10/15/2025, 12:12:47 AM
No.106890491
>>106890400
it would help if more people contributed. it's just four guys doing it
Anonymous
10/15/2025, 12:14:04 AM
No.106890501
>>106890555
>>106890480
Are you using 4 steps or 8? I keep getting grainy outputs with 4 no matter what I do.
Anonymous
10/15/2025, 12:20:33 AM
No.106890537
>>106890575
>>106890615
why can't i local diffuse music yet
Anonymous
10/15/2025, 12:21:06 AM
No.106890544
>>106890619
>>106890480
this time with the rCM nvidia lora that kijai posted for low:
all tested with 1 str for each lora
might be better, need to test more though.
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM
Anonymous
10/15/2025, 12:22:08 AM
No.106890555
>>106890501
default i2v wan 2.2 template in comfy, should be 6 steps (3/3 for high/low).
Anonymous
10/15/2025, 12:22:23 AM
No.106890558
Anonymous
10/15/2025, 12:23:42 AM
No.106890569
>>106890582
>>106890592
>>106890011
fuck you, now I'm craving that literal poison
scabPICKER
10/15/2025, 12:24:15 AM
No.106890575
>>106890615
>>106890537
There's Ace Step. It's pretty lowfi
Anonymous
10/15/2025, 12:24:42 AM
No.106890582
>>106890714
>>106890569
it tastes like dogshit, fuck is wrong with you
Anonymous
10/15/2025, 12:25:17 AM
No.106890592
>>106890714
>>106890569
>poison
oxygen is poison, yet we consume this shit constantly, I don't see you complaining on that!
haven't genned in a year but what the fuck happened to comfyui? it's so fucking shit now
Anonymous
10/15/2025, 12:26:44 AM
No.106890615
>>106891582
>>106890537
>>106890575
I genuinely don't understand why there is no big "midi first" music diffusion model. Wouldn't that be the cleanest possible dataset combined with the most utility for integration into existing music production workflows? It's not like midi files are even heavy, so you think the model would be easy to train and the data easy to tag.
Anonymous
10/15/2025, 12:27:09 AM
No.106890619
>>106890655
>>106890544
the anime girl puts down her guitar and starts playing the drums.
kek, with 2.2 kijai and 2.1 rCM (low), 1 str.
Anonymous
10/15/2025, 12:27:41 AM
No.106890624
>>106890641
Anonymous
10/15/2025, 12:30:09 AM
No.106890641
>>106890624
butthurt shill
Anonymous
10/15/2025, 12:31:46 AM
No.106890655
>>106890619
the anime girl transforms into hatsune miku, with a microphone.
I like this new combo, still need to test other stuff though.
Anonymous
10/15/2025, 12:33:17 AM
No.106890668
>>106890732
>>106891036
>>106890609
sad. where are we moving to now? sdcpp like anon linked? seems like a good idea since all this python shit just sucks for brainlets
Anonymous
10/15/2025, 12:37:41 AM
No.106890703
>>106890726
>>106890609
for once it's a jew that made the great noticing kek
Anonymous
10/15/2025, 12:39:11 AM
No.106890714
>>106890582
Its the food equivalent of "it hurts so good"
>>106890592
Not all things are created equal. Oxygen is both necessary and damaging to our biology, with a diet rich in antioxidants we have systems to mitigate the damage. Constant consumption of micky D's on the other hand is a death sentence
Anonymous
10/15/2025, 12:39:40 AM
No.106890720
the anime girl transforms her guitar into a black pistol, and fires it at the camera.
smooth bocchi.
Anonymous
10/15/2025, 12:40:24 AM
No.106890726
>>106890703
I still want to punch this faggot's face
Anonymous
10/15/2025, 12:41:09 AM
No.106890732
>>106890668
The language used to set up all of the CUDA machinery really doesn't matter that much. Python and PyTorch are ass but they're easy to iterate with and that's all that matters.
Anonymous
10/15/2025, 12:43:26 AM
No.106890751
>>106890932
>>106891591
the men who are sitting get up, kick their chairs, and walk off camera to the left.
no kick but pretty good.
In short, what does it take (hardware, etc) to generate high quality videos today?
Anonymous
10/15/2025, 12:47:51 AM
No.106890796
>>106890847
>>106890932
>when AI companies try to charge you $100 per prompt
Anonymous
10/15/2025, 12:49:03 AM
No.106890810
>>106890832
>>106890794
nobody here knows because we all have to quant vid models. rent a h100 perhaps?
Anonymous
10/15/2025, 12:50:36 AM
No.106890832
>>106890848
>>106890794
>>106890810
I run it at full precision and the quality doesn't really seem to be any better than what I usually see here.
Anonymous
10/15/2025, 12:52:38 AM
No.106890847
>>106890932
Anonymous
10/15/2025, 12:52:41 AM
No.106890848
>>106890885
Anonymous
10/15/2025, 12:54:56 AM
No.106890865
>>106890901
>>106890794
nothing because wan only outputs 5, max 8 seconds
Anonymous
10/15/2025, 12:56:38 AM
No.106890884
>>106890910
>>106891079
the japanese girl jumps up and down.
with rCM for low, 2.2 kijai high:
Anonymous
10/15/2025, 12:56:48 AM
No.106890885
>>106890893
>>106890848
Sure, give me a prompt and I'll run it if I have time.
Anonymous
10/15/2025, 12:57:35 AM
No.106890893
>>106891035
>>106890885
prompt: anything you want, idgaf. just post something you made
Anonymous
10/15/2025, 12:58:13 AM
No.106890901
Anonymous
10/15/2025, 12:59:06 AM
No.106890910
>>106890941
>>106890884
and this time with 2.1 i2v lightx2v distil rank64:
Anonymous
10/15/2025, 1:01:25 AM
No.106890932
>>106890751
>>106890796
>>106890847
your prompts are boring, you should make good stuff like me instead
Anonymous
10/15/2025, 1:02:21 AM
No.106890941
>>106890957
>>106890910
another with 2.1 i2v low:
I think the rCM one works a bit better, more motion/physics.
Anonymous
10/15/2025, 1:04:17 AM
No.106890957
>>106891025
>>106890941
and this is with rCM low. it seems to yield better motion in general.
Anonymous
10/15/2025, 1:11:43 AM
No.106891025
>>106890957
seems to work, diff image:
Anonymous
10/15/2025, 1:12:33 AM
No.106891035
>>106891061
Anonymous
10/15/2025, 1:12:48 AM
No.106891036
>>106891056
>>106890668
>>106890595
>>106890609
samefag. so obvious.
anon hasn't "genned in a year" and suddenly mentions sdcpp as if that is something anyone remotely uses.
fuck off. your tactics are getting old.
Anonymous
10/15/2025, 1:14:07 AM
No.106891049
>>106891059
>>106891083
NetaYume Lumina officially supported in ComfyUI! ComfyUI remains as the leading UI for cutting edge AI!
https://x.com/ComfyUI/status/1978127680886521869
Anonymous
10/15/2025, 1:14:58 AM
No.106891056
>>106891073
Anonymous
10/15/2025, 1:15:31 AM
No.106891059
>>106891075
>>106891096
>>106891049
this shit any good?
Anonymous
10/15/2025, 1:15:41 AM
No.106891061
>>106891084
>>106891035
wow, your settings suck. didn;t even bother to mention speed or hardware. fuck you anon and do the second pass like a normal human bean
Anonymous
10/15/2025, 1:15:45 AM
No.106891063
>>106891077
okay boobs are nice but lets try something more fun.
the anime girl snaps her finger, and the black man on the floor disappears.
Anonymous
10/15/2025, 1:16:31 AM
No.106891073
>>106891088
>>106891056
anyone can inspect element or use vpn/proxies to dual post anon.
Anonymous
10/15/2025, 1:16:41 AM
No.106891075
>>106891059
no it's severely undercooked and the authors ran out of money
Anonymous
10/15/2025, 1:16:47 AM
No.106891077
>>106891078
>>106891063
he became a dolphin or what? lool
Anonymous
10/15/2025, 1:17:09 AM
No.106891078
>>106891077
too much fent.
Anonymous
10/15/2025, 1:17:18 AM
No.106891079
Anonymous
10/15/2025, 1:17:36 AM
No.106891081
>>106886940
fucking love this, well done
Anonymous
10/15/2025, 1:17:42 AM
No.106891083
>>106891049
nobody cares. tell the authors to add it to sdcpp
Anonymous
10/15/2025, 1:17:57 AM
No.106891084
>>106891061
The settings are all stock from the hf repo. RTX 6000 BW. Feel free to suggest something if you want to see it. I'm not too interested in video.
Anonymous
10/15/2025, 1:18:24 AM
No.106891088
>>106891095
>>106891114
>>106891073
so how can I prove I'm not a samefag? that's the funny thing about this, you claim someone is a samefag while knowing he has no way to prove his good faith, that's delightfully devilish, Seymour
Anonymous
10/15/2025, 1:19:00 AM
No.106891095
>>106891088
but it's so obvious except when asked for the burden of proof!
Anonymous
10/15/2025, 1:19:01 AM
No.106891096
>>106892547
>>106891059
no, that's why no here uses it.
the ONLY models that matter currently for local are: XL(Image gen for anime), WAN(video gen), Qwen(Editing) & Chroma(Image gen realism).
Anonymous
10/15/2025, 1:19:34 AM
No.106891104
>>106887652
thank you for the lora
Anonymous
10/15/2025, 1:20:46 AM
No.106891114
>>106891118
>>106891129
>>106891088
>shits on comfy as usual then immediately brings up a meme alternative
anon, we aren't stupid. I can also dual post. Watch, I will reply to this post within 5 seconds of posting, which is impossible even with a pass.
Check it
Anonymous
10/15/2025, 1:21:07 AM
No.106891118
>>106891129
Anonymous
10/15/2025, 1:21:38 AM
No.106891123
Anonymous
10/15/2025, 1:22:15 AM
No.106891129
>>106891114
>>106891118
>it is possible to samefag therefore everyone samefag
if only life was this simple
Anonymous
10/15/2025, 1:25:57 AM
No.106891165
the anime girl stands up and starts dancing beside the cop car.
this time: 2.2 kijai lora high, 2.1 i2v distilled rank64 low.
Anonymous
10/15/2025, 2:10:17 AM
No.106891536
btw if you update comfyui it insta-terminates video gens now
scabPICKER
10/15/2025, 2:15:44 AM
No.106891582
>>106890615
>"midi first" music diffusion model.
idk, diffusion has mostly not been used for this type of data. But someone says there is a language model that uses diffusion, so idk, maybe.
I think *we* have the potential to train an llm to create um...
idk, what are rosegarden files called? I hate them, but I think ai could make those really well, and they can be turned into midi.
btw, there are - out there - commercial ai that seems to be basically "techno" midi. idk what they call it, edm? they seem only eh soso - but maybe I did it wrong trying them out.
scabPICKER
10/15/2025, 2:16:47 AM
No.106891591
>>106890751
That's actually disturbing, it looks pretty real. The girl behind is perfect, except for the deformed hand.
scabPICKER
10/15/2025, 2:20:05 AM
No.106891628
AAAAHAHAHAHAHAHAHAHAH
https://github.com/leejet/stable-diffusion.cpp/issues/396
>closed
>not fixed
stable-diffusion.cpp does NOT use clip_l.
Anonymous
10/15/2025, 4:22:16 AM
No.106892547
>>106891096
lmao chroma sucks