
Thread 106910887

344 posts 224 images /g/
Anonymous No.106910887 [Report] >>106911317 >>106911562 >>106912158 >>106913522
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106904218

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106910893 [Report]
>mfw
Anonymous No.106910897 [Report] >>106910939 >>106910980 >>106911030 >>106911034
blessed bred
Anonymous No.106910902 [Report]
Anonymous No.106910939 [Report]
>>106910897
simple and nice
Anonymous No.106910980 [Report]
>>106910897
off to a good start
Anonymous No.106911019 [Report]
Blessed thread of frenship
Anonymous No.106911025 [Report]
Blessed thread of frenship
Anonymous No.106911030 [Report] >>106911158
>>106910897
strangely hypnotic
Anonymous No.106911034 [Report] >>106911176
>>106910897
Anonymous No.106911036 [Report] >>106911260
comfy shoudl be dragged out on the street and shot
Anonymous No.106911048 [Report] >>106911177 >>106911322
is 12GB VRAM and 32GB RAM enough to train chroma loras or will my computer shit itself
Anonymous No.106911156 [Report]
Which qwen image edit lightx2v lora is recommended?
Anonymous No.106911158 [Report]
>>106911030
only if you're wh*te
Anonymous No.106911168 [Report]
Is there a bypass for Lora Manager license key?
Anonymous No.106911176 [Report]
>>106911034
Anonymous No.106911177 [Report]
>>106911048
beg ai-toolkit dev to tell you how to setup his project with ramtorch for chroma
Anonymous No.106911208 [Report]
Anonymous No.106911217 [Report] >>106911230
That faggot comfy changed something and now my wan previews aren't animating, despite me setting it TAESD. It's just a static fucking image. Anyone know how to fix it?
Anonymous No.106911230 [Report] >>106911257
>>106911217
Display animated previews when sampling in options. It's always been like that. You might've pulled and it reset or something
Anonymous No.106911257 [Report]
>>106911230
I set it to auto-pull on launch and it's never disabled it before, but you're right. What a cunt.
Anonymous No.106911260 [Report]
>>106911036
>The size of tensor a (768) must match the size of tensor b (128) at non-singleton dimension 2
Yeah, I'm thinking you're right. This error is literally random now.
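For context on the quoted error: it is PyTorch's broadcasting check failing. Two tensors can only be combined elementwise when every dimension either matches or is 1 ("singleton"). A minimal pure-Python sketch of that rule follows; the shapes are made up for illustration, and only the 768 vs 128 mismatch at dimension 2 comes from the error message.

```python
def first_broadcast_conflict(shape_a, shape_b):
    """Return the index of the first dimension that cannot broadcast, or None.

    Simplified sketch: assumes both shapes have the same number of dimensions.
    """
    if len(shape_a) != len(shape_b):
        raise ValueError("this sketch only handles shapes of equal rank")
    for dim, (a, b) in enumerate(zip(shape_a, shape_b)):
        # Compatible when sizes match or either side is a singleton (size 1).
        if a != b and a != 1 and b != 1:
            return dim
    return None


# Hypothetical shapes; only the 768 vs 128 clash at dimension 2 is from the post.
print(first_broadcast_conflict((1, 77, 768), (1, 77, 128)))
print(first_broadcast_conflict((1, 77, 768), (1, 77, 1)))
```

In practice a mismatch like this often points to parts from different model families being wired together (a text encoder, LoRA, or VAE meant for another architecture), though the message alone doesn't say which one.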
Anonymous No.106911304 [Report]
>he pulled
Anonymous No.106911317 [Report] >>106911334 >>106911429
>>106910887 (OP)
extremely based collage
Anonymous No.106911322 [Report] >>106911657
>>106911048
I train on 12GB/48GB with onetrainer and on loading it maxes my ram but idk if it just takes all available or if I'm at the threshold.
Anonymous No.106911334 [Report] >>106911366
>>106911317
>1girl x10
yeah bro extremely based bro, never seen shit like this before
Anonymous No.106911355 [Report]
Anonymous No.106911366 [Report]
>>106911334
You fucking fags have whole discord just for you already. Fuck off.
Anonymous No.106911368 [Report] >>106913616
What lightx2v's are best for Wan 2.2? I've tried a couple, but I feel like motion is worse than 2.1 with its own lightx2v LoRA.

This was recommended by some redditor, but same issue, motion seems stiff :

High - Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16

Low - Wan_2_1_T2V_14B_rCM_lora_average_rank_148_bf16
Anonymous No.106911411 [Report] >>106911427
>my gens didnt make the collage
shit thread, reported, saged, contacted hiroshimoot, spammed the irc and sent an anonymous report to the FBI
Anonymous No.106911412 [Report]
Anonymous No.106911427 [Report]
>>106911411
kek
Anonymous No.106911429 [Report]
>>106911317
not with catjak tranny in it
Anonymous No.106911438 [Report]
Anonymous No.106911448 [Report]
Anonymous No.106911457 [Report] >>106911491 >>106913967
Anonymous No.106911491 [Report]
>>106911457
that's a sexo from me
Anonymous No.106911556 [Report] >>106911577 >>106911579 >>106911608
is there any way I can change the value of the width/height fields for all at once instead of having to go through 1 by 1
Anonymous No.106911562 [Report]
>>106910887 (OP)
>snubbed again in favor of some absolute slop
This baker fucking sucks
Anonymous No.106911577 [Report] >>106911600
>>106911556
Integer node -> connect to all of them.
Anonymous No.106911579 [Report] >>106911600
>>106911556
Anonymous No.106911600 [Report] >>106911610
>>106911577
>>106911579
>you can just do that
Anonymous No.106911608 [Report] >>106911612
>>106911556
but why do this anyways?
Anonymous No.106911610 [Report]
>>106911600
Now imagine doing that in other UIs.
Anonymous No.106911612 [Report] >>106911615
>>106911608
sdxl batch generation
I can sense I'm about to get shit on
Anonymous No.106911615 [Report] >>106911620
>>106911612
>batch_size 1
nigga bffr
Anonymous No.106911620 [Report]
>>106911615
tried it with just one and playing with batch size, destroys the result
Anonymous No.106911653 [Report]
Anonymous No.106911657 [Report] >>106911671 >>106911692
>>106911322
what do you use to set the captions for a chroma dataset?
Anonymous No.106911671 [Report]
>>106911657
taggui with joycaption. Need to load the 4bit version so I don't oom. Needs QC afterwards.
Anonymous No.106911692 [Report] >>106911709
>>106911657
some people use gemini with great success
Anonymous No.106911706 [Report] >>106911714 >>106911729 >>106911764 >>106912122
wan 2.2 anons, use this lora setup with the new kijai 2.2 lora, works really well

and shift 8:

4 steps, works well
Anonymous No.106911709 [Report] >>106911766 >>106911783
>>106911692
Doesn't gemini have a daily limit? How do I batch 40-50 pics?
Anonymous No.106911714 [Report] >>106911733
>>106911706
the anime girl on the large advertisement waves hello, as people walk by on the streets of Tokyo.
Anonymous No.106911729 [Report] >>106911755
>>106911706
>stole it from reddit
kek
Anonymous No.106911733 [Report]
>>106911714
too bad the city and people all look like plastic
Anonymous No.106911746 [Report] >>106911767 >>106911768
>>106910662
Anonymous No.106911755 [Report] >>106911799
>>106911729
well, it is a good combo so why not link the node setup

also template shift was 5 default, 8 seems to help motion too.
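A rough way to see what going from shift 5 to shift 8 does, assuming the time-shift formula commonly used for flow-matching models (the node in the linked workflow may differ in details, and the real scheduler is not necessarily evenly spaced): a higher shift pushes every step toward the high-noise end, so more of the 4 steps are spent on overall layout and motion.

```python
def shift_sigma(sigma, shift):
    """Warp a noise level in [0, 1] toward the high-noise end.

    Assumed formula: shift * sigma / (1 + (shift - 1) * sigma).
    """
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps, shift):
    """Evenly spaced noise levels from 1 down to 0, then shifted."""
    base = [1.0 - i / steps for i in range(steps + 1)]
    return [round(shift_sigma(s, shift), 3) for s in base]


for shift in (5.0, 8.0):
    print(shift, shifted_schedule(4, shift))
```

With shift 1 the schedule is unchanged; the endpoints (1 and 0) stay fixed for any shift, and only the levels in between move.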
Anonymous No.106911764 [Report]
>>106911706
Never tried mixing the light loras like that. If it works, cool.
Anonymous No.106911766 [Report]
>>106911709
>Doesn't gemini have a daily limit? How do I batch 40-50 pics?
As far as I know people who use it pay for it. No clue how they feed it a batch of images, but I'd guess it's just asking Grok to code a Python script for it
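For the batching part, a script along these lines would do it. This is a minimal sketch, not anyone's actual tool: `caption_fn` is a placeholder for whatever captioner you use (an API client, a local model), and `dummy_caption` only exists so the example runs without a model or API key. It writes one .txt caption next to each image, which is the layout most trainers expect.

```python
from pathlib import Path

IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}


def caption_folder(folder, caption_fn, overwrite=False):
    """Write a .txt caption next to every image in `folder`.

    `caption_fn` is a placeholder: any callable that takes an image Path
    and returns a caption string.
    """
    written = []
    for image in sorted(Path(folder).iterdir()):
        if image.suffix.lower() not in IMAGE_EXTS:
            continue
        sidecar = image.with_suffix(".txt")
        if sidecar.exists() and not overwrite:
            continue  # keep captions that were already written or hand-edited
        sidecar.write_text(caption_fn(image).strip() + "\n", encoding="utf-8")
        written.append(sidecar.name)
    return written


def dummy_caption(image_path):
    # Stand-in captioner so the sketch runs without any model or API key.
    return f"placeholder caption for {image_path.name}"
```

Usage would be something like `caption_folder("dataset", dummy_caption)`. If the captioner is a rate-limited API, adding a `time.sleep` inside `caption_fn` is the simplest way to stay under the limit; rerunning the script skips images that already have a caption.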
Anonymous No.106911767 [Report]
>>106911746
sdxl really doesn't believe in prompt adherence unless you cajole it like an autistic who speaks in google search
Anonymous No.106911768 [Report]
>>106911746
>everything is wrong
lol
Anonymous No.106911783 [Report]
>>106911709
If you've only got a small dataset, just use joycaption. You'll have to go through and do some cleanup afterwards, but that's standard for local tagging models.
Anonymous No.106911799 [Report] >>106911816
>>106911755
nothing wrong with just linking to the reddit post that has all the info and workflow already
Anonymous No.106911816 [Report] >>106911839 >>106911896
the pink hair anime character is standing on a car drifting around the street of Tokyo at night. Smoke emits from the tires as it drifts.

with new setup (shift 8, from 5, + loras)
>>106911799
https://pastebin.com/g19a5seP

cant link site thinks it's spam.

this is WAY smoother than before. it has rife VFI interpolation but still, much better motion. the new kijai lora works very well for the high noise pass.
Anonymous No.106911839 [Report] >>106911855
>>106911816
also, qwen edit is great for making wan 2.2 i2v source content.
Anonymous No.106911855 [Report] >>106911866 >>106912082
>>106911839
>also, qwen edit is great for making wan 2.2 i2v source content.
is it when you put end image there as well so it knows what's supposed to happen?
Anonymous No.106911858 [Report] >>106911863
the pink hair anime character is standing on a car which drives over a ramp and flies high into the sky in Tokyo.

holy shit, that escalated fast.
Anonymous No.106911863 [Report]
>>106911858
rocket league double jump
Anonymous No.106911866 [Report]
>>106911855
you can do that for the first/last wan one, havent messed with that too much though just regular i2v.
Anonymous No.106911896 [Report]
>>106911816
>cant link site thinks it's spam.
https://www.reddit.com/r/StableDiffusion/comments/1o8exnu/
weird never had issues myself

https://www.reddit.com/r/StableDiffusion/comments/1o7r7sb/
https://www.reddit.com/r/StableDiffusion/comments/1o8662h/
if anybody is interested in playing with animate more these look interesting, even has a cunny showcase
Anonymous No.106911897 [Report]
Anonymous No.106911906 [Report]
Anonymous No.106911916 [Report]
Anonymous No.106911972 [Report]
yeah, the lora combo + higher shift (8) seems to be a winner. this will do till wan 2.5 if it comes out.
Anonymous No.106911988 [Report]
I get a bigger buzz seeing my influence in other gens, rather than direct acknowledgement
*cough* more leather *cough*
Anonymous No.106912019 [Report] >>106912023 >>106914066
the blonde anime girl drinks her tea as people outside her car walk by.

yeah this combo is definitely a big improvement, the new 2.2 kijai lora + the 2.1 combo seems much better than the old setup.
Anonymous No.106912020 [Report]
Anonymous No.106912023 [Report] >>106912035
>>106912019
what card do you gen on?
Anonymous No.106912035 [Report]
>>106912023
4080 (16gb)

wan works fine on almost anything though, 8-12gb works too
Anonymous No.106912037 [Report] >>106912040 >>106912054 >>106912068 >>106912078
retard here new to this. Got Wan2.2 on Comfy running and was recommended to use a dictionary autocorrect spellchecker and wildcards. What the fuck does any of that mean/do
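On the wildcards half of the question: a wildcard is a placeholder in the prompt that gets swapped for a random entry from a list on every gen, so one prompt produces varied results. A minimal sketch of the idea, assuming the `__name__` token style used by common wildcard extensions (the exact syntax and file layout depend on the node pack, and the lists here are made up):

```python
import random
import re

# Assumed token style: a lowercase name wrapped in double underscores.
WILDCARD = re.compile(r"__([a-z0-9]+)__")


def expand_wildcards(prompt, wildcards, rng=random):
    """Replace each __name__ token with a random entry from wildcards[name]."""
    def pick(match):
        options = wildcards.get(match.group(1))
        if not options:
            return match.group(0)  # leave unknown wildcards untouched
        return rng.choice(options)

    return WILDCARD.sub(pick, prompt)


# Hypothetical lists; real setups usually load these from text files.
wildcards = {
    "hair": ["pink hair", "blonde hair", "blue hair"],
    "place": ["a Tokyo street at night", "a classroom"],
}
print(expand_wildcards("anime girl with __hair__ standing in __place__", wildcards))
```

Each run prints a different combination. The spellchecker part of the recommendation is separate and isn't covered here.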
Anonymous No.106912040 [Report]
>>106912037
>hey claude
>retard here new to comfyui. Got Wan2.2 on Comfy running and was recommended to use a dictionary autocorrect spellchecker and wildcards. What the fuck does any of that mean/do
Anonymous No.106912054 [Report] >>106912075
>>106912037
ask grok
Anonymous No.106912067 [Report]
Is the turbo contrarian trolling on /h/ all just one dude? I kinda suspect it is at this point, he just aggressively disagrees about everything almost no matter what it is in a pretty distinct way
Anonymous No.106912068 [Report]
>>106912037
>recommended to use a dictionary autocorrect spellchecker and wildcards
wut. what are you trying to do? if you're just doing i2v then prompting is very simple
Anonymous No.106912075 [Report] >>106912087 >>106912170
>>106912054
https://files.catbox.moe/5zjzyd.webm
Anonymous No.106912078 [Report]
>>106912037
>was recommended to use a dictionary autocorrect spellchecker
Huh? Are you a special needs person or retarded? Just type normally into the fucking box and gen, what the fuck man.
Anonymous No.106912082 [Report] >>106912091
>>106911855
the man puts on a blue hat

cool, it works
Anonymous No.106912087 [Report]
>>106912075
saar...you must ask grok...redeem the grok
Anonymous No.106912091 [Report] >>106912572
>>106912082
he should hold up a sign after which says "IT JUST WORKS"
Anonymous No.106912103 [Report]
do i have to close forge everytime i want to delete a lora i used but didn't like? i can delete loras after using them in comfy
Anonymous No.106912122 [Report] >>106912136
>>106911706
Where can I find this file? For exact comparison.
Anonymous No.106912128 [Report]
wow, this combo is so much smoother. and I went from 6 steps to 4 (default).

two FBI agents arrest the man and take him away, off screen to the right.
Anonymous No.106912136 [Report] >>106912153 >>106912180 >>106912290
>>106912122
should be this:

https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-I2V-A14B-4steps-lora-rank64-Seko-V1
Anonymous No.106912153 [Report]
>>106912136
the low noise one (for wan 2.2 low)

the other one is the default wan 2.1 lightx2v lora.
Anonymous No.106912158 [Report]
>>106910887 (OP)
Based hand crafted collage
Anonymous No.106912159 [Report]
Anonymous No.106912170 [Report]
>>106912075
>https://files.catbox.moe/5zjzyd.webm
grok is this true?
Anonymous No.106912174 [Report]
neta is so fucking dogshit, what a waste
Anonymous No.106912180 [Report]
>>106912136
also I think the shortened link was considered spam.

https://www.reddit.com/r/StableDiffusion/comments/1o8exnu/zero_cherrypicking_crazy_motion_with_new_wan22/

this works well. (workflow in post)
Anonymous No.106912192 [Report]
I wish i had gork
Anonymous No.106912204 [Report] >>106912227 >>106912235
Any linux wizards here? thinking of going full linux or linux with virtual machine with an rtx, what is best recommended for running comfy? Won't be trying dual boot shit again (too many issues)
Anonymous No.106912227 [Report]
>>106912204
virtual machine is worst recommended
Anonymous No.106912235 [Report] >>106912323
>>106912204
for a while i just grabbed a 1tb ssd, made it a bootable ubuntu install and ran comfy in there. when i wanted/needed windows back i just removed the usb cable and rebooted. linux was slightly more performant though so i still use both
Anonymous No.106912289 [Report] >>106912963
>https://xcancel.com/__TheBen/status/1829554120270987740#m
>two layers at 640px

why don't we hear more about this?
does anyone know other lora hacks?
Anonymous No.106912290 [Report] >>106912331
>>106912136
is that the same as the original 2.2 light lora? my fucking head is spinning from all these versions
Anonymous No.106912296 [Report] >>106912304 >>106912327
Anonymous No.106912304 [Report]
>>106912296
Anonymous No.106912319 [Report] >>106912341
the man puts on a wizard hat and casts a frost spell, making the desk turn to ice.

kek idk why it did a transition
Anonymous No.106912323 [Report] >>106912367 >>106912646
>>106912235
Wait..so I can just unplug windows and plug in linux without dual boot fuckery? The idea was to go full linux but if I can just do that, then that would save a world of headache. I just wanna keep win 10 as all of my software works and doesn't require updating
Anonymous No.106912327 [Report]
>>106912296
Anonymous No.106912331 [Report]
>>106912290
there is a new version, kijai fixed it

https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/Wan22_Lightx2v
Anonymous No.106912341 [Report] >>106912613
>>106912319
the man puts on a wizard hat and points in the air, causing a large block of ice to form on his desk.
Anonymous No.106912342 [Report] >>106912352
Anonymous No.106912352 [Report] >>106912365 >>106912531
>>106912342
Look at you fag forgot to change filename?
Anonymous No.106912365 [Report]
>>106912352
what
Anonymous No.106912366 [Report] >>106912548 >>106913355
>>106910389
>I never got base Chroma to produce anything remotely close to this

Nta but you'd be surprised what Chroma can do when you prompt it slightly differently. Like I didn't think it could do gyarus, and it doesn't when you just prompt for it, but actually it turns out that simple change of prompt and enhanced description is all it takes to get gyarus.
Anonymous No.106912367 [Report] >>106912795
>>106912323
yeah i have my boot order to boot off the disk if it's present, if not just regular ass windows. i wanted the same set up since i use this machine for gaming, and the ubuntu for genning. plus who knows what the fuck i'm downloading with these models
Anonymous No.106912371 [Report] >>106912458
the anime girl is typing at her computer.
Anonymous No.106912395 [Report]
Anonymous No.106912417 [Report] >>106912483
Idle
Attack 1
Attack 2
Run
Guard
Evade
Taking Damage
At Low HP
>Incapacitated
Triumph
Flourish

1st test: shift 8, 4 high/4 low steps. No net benefit from the complicated wan22 mix. The spark is missing when weapons collide. The fog in the background was unnecessarily denoised. Blood is already on the floor before the girl falls. Freaking reddit should post a side-by-side comparison before screaming from the rooftops about their breakthroughs.
Anonymous No.106912458 [Report]
>>106912371
the anime girl stands up and walks out of the computer lab.
Anonymous No.106912483 [Report] >>106912593
>>106912417
>should post a side-by-side comparison before screaming from the rooftops about their breakthroughs.
very rarely happens even here
Anonymous No.106912492 [Report] >>106912524 >>106912545 >>106912993
kek

the man opens the bag of ONIONS potato chips and eats one.

the motion is OBJECTIVELY better with the new lora + combo.
Anonymous No.106912514 [Report]
Anonymous No.106912518 [Report]
I'm against it.
Anonymous No.106912524 [Report]
>>106912492
I wanted lemon lime!
Anonymous No.106912526 [Report]
whens that faggot going to post his coveted ani shota collection?
Anonymous No.106912531 [Report]
>>106912352
What is bro hallucinating about?
Anonymous No.106912539 [Report] >>106912577
Anonymous No.106912542 [Report]
Anonymous No.106912545 [Report] >>106912552
>>106912492
yea, good result. what is the "combo"?
Anonymous No.106912548 [Report] >>106912636
>>106912366
prompt? and what chroma version
Anonymous No.106912552 [Report] >>106912592
>>106912545
https://www.reddit.com/r/StableDiffusion/comments/1o8exnu/zero_cherrypicking_crazy_motion_with_new_wan22/

workflow: https://pastebin.com/g19a5seP
Anonymous No.106912556 [Report]
Anonymous No.106912572 [Report] >>106912633
>>106912091
Anonymous No.106912577 [Report]
>>106912539
the pink hair anime girl puts down her guitar and starts playing the drums on stage.

smooth transition desu
Anonymous No.106912580 [Report] >>106912590 >>106912609
the noob TE schizo is back with more useless bullshit that doesn't do anything
https://redlib.catsarch.com/r/StableDiffusion/comments/1o7nnc1/clips_can_understand_well_beyond_77_tokens/

why do these "people" go to all the effort on their model training exercises only to provide nothing of practical value?

>>106909976
there are some startup flags that apparently can mitigate this, search comfyui github issues on AMD. I have yet to do a deep dive and test to figure out what works. also there's a way to turn off the bullshit compiling phase that SDXL models go through whenever you gen at a new res
Anonymous No.106912590 [Report]
>>106912580
>why do these "people" go to all the effort on their model training exercises only to provide nothing of practical value?
autism is a hell of a drug
Anonymous No.106912592 [Report]
>>106912552
thanks, will have a look
Anonymous No.106912593 [Report]
>>106912483
niggers always do this. they go WHOA CHECK THIS OUT, post one comparison then never talk about it again
Anonymous No.106912609 [Report]
>>106912580
>clip l
I sleep
Anonymous No.106912611 [Report]
the pink hair anime girl is running around the stage while playing her guitar.
Anonymous No.106912613 [Report] >>106912627
>>106912341

Does this only work well with img2vid? and not txt2vid?
Anonymous No.106912616 [Report]
Anonymous No.106912617 [Report]
>still no updated I2V 2.2 lightning lora
Anonymous No.106912627 [Report]
>>106912613
t2v should work fine but the workflow/setup in this case is with i2v loras. not sure there is a new 2.2 t2v update
Anonymous No.106912628 [Report] >>106912705
does anyone know how to prompt picrel? - https://files.catbox.moe/rwgu80.jpeg

not so much concerned about the magazine cover style, more about the pose with chair and the sparkling water and lighting.

it looks like derp photoshop
Anonymous No.106912633 [Report]
>>106912572
based
Anonymous No.106912636 [Report] >>106912643
>>106912548
HD Flash
https://files.catbox.moe/dlksp4.png

Btw there's also a way to get stylized/filtered images with Chroma (pic rel).
Anonymous No.106912638 [Report] >>106913855
Anonymous No.106912641 [Report]
Less coherence with this reddit combo. Imagine my disappointment, my day ruined.
Anonymous No.106912643 [Report] >>106912737
>>106912636
>Btw there's also a way to get stylized/filtered images with Chroma
ok. prompt?
Anonymous No.106912646 [Report] >>106912795
>>106912323
i keep a portable nvme with windows on it for when i want to be a gaymer. you dont have to dual boot
Anonymous No.106912705 [Report]
>>106912628
thanks, so maybe try that moe img lora, but with light text loras. Since I cant find a text version of that moe lora.
Anonymous No.106912709 [Report]
Anonymous No.106912737 [Report] >>106912774
>>106912643
https://files.catbox.moe/k7p6ku.png
Anonymous No.106912774 [Report] >>106913057
>>106912737
Cool
Anonymous No.106912795 [Report]
>>106912646
>>106912367

Nice, this seems like the best option.
Anonymous No.106912908 [Report]
Anonymous No.106912963 [Report] >>106914864 >>106914883
>>106912289
>does anyone know other lora hacks?
lora+ actually works
Anonymous No.106912977 [Report] >>106913029
the anime girl stands up and starts dancing in the Japanese classroom.
Anonymous No.106912987 [Report] >>106913039 >>106913060 >>106913081 >>106913101 >>106913134 >>106913487 >>106913780 >>106915296
Comfy is a PROUD partner of NVIDIA, getting EXCLUSIVE access to NVIDIA's products for making ComfyUI truly great!
Anonymous No.106912993 [Report]
>>106912492
I like the OG better
Anonymous No.106913029 [Report]
>>106912977
The chalkboard eraser was *completely gone* for 32 frames and the model still remembered it perfectly!
Anonymous No.106913032 [Report]
Any lora or prompt to type to get a digicam kind of look on Chroma?
Anonymous No.106913039 [Report] >>106913050
>>106912987
wtf?? give me the sauce anon lmao
Anonymous No.106913050 [Report] >>106913074
>>106913039
https://x.com/ComfyUI/status/1978529150798569531
Anonymous No.106913057 [Report] >>106915302
>>106912774
Thanks
Anonymous No.106913060 [Report]
>>106912987
>for making ComfyUI truly great
great in what? more api? lmaooooo
Anonymous No.106913068 [Report] >>106913269 >>106913310
i wish chroma wasn't so FUCKING SLOW WHAT THE FUCK
Anonymous No.106913074 [Report] >>106913093
>>106913050
didn't know will.i.am was a nerd, based lol
https://www.youtube.com/watch?v=WpYeekQkAdc
Anonymous No.106913081 [Report] >>106913095 >>106913115
>>106912987

Why are they shilling Spark? That thing is so underpowered.
Anonymous No.106913093 [Report]
>>106913074
william is a larping aliexpress merchant
Anonymous No.106913095 [Report]
>>106913081
Because NVIDIA is paying a bunch of AI relevant companies to shill it.
Anonymous No.106913101 [Report] >>106913138
>>106912987
>Comfy: "No I will not implement HunyuanImage 3.0 it's a bloated product"
based
>Also comfy: "Yass qween, Nvdia DGMeme is the future!"
cringe...
Anonymous No.106913115 [Report] >>106913139 >>106913186
>>106913081
what is the use case again? certainly not image diffusion. it doesn't even have that much memory for LLMs.
Anonymous No.106913116 [Report]
the anime girl stands up and shakes hands with hatsune miku.

I like the new lora + combo, and im using 4 steps instead of 6 now, still good outputs:
Anonymous No.106913133 [Report] >>106913136
I was pretty involved in SD in the early days. What models do you guys use these days? Everything still seems to be based on 1.5. Has Flux taken over SD?
Anonymous No.106913134 [Report] >>106913158
>>106912987
the comfy curse. anything he endorses is shit. all started with sd 3.0
Anonymous No.106913136 [Report]
>>106913133
anime: noobai/illustrious, for anime I use wai v15

realism: qwen, qwen edit

video: wan 2.2 + lightx2v
Anonymous No.106913138 [Report] >>106913427
>>106913101
he barely said shit for like a 5 sec clip and got a dgx spark for free to fuck around with. i'm jelly
Anonymous No.106913139 [Report] >>106914181
>>106913115
for using large text based models like deepseek r1 or Ollama. it is not for image/video gen. no, it is not even for training or finetuning either.
Anonymous No.106913158 [Report] >>106913165
>>106913134
>the comfy curse. anything he endorses is shit. all started with sd 3.0
Did he really endorse it? I can't believe it
Anonymous No.106913165 [Report] >>106913176 >>106913185
>>106913158
for months he was saying it's the best model ever
Anonymous No.106913176 [Report] >>106913180 >>106913185
>>106913165
I don't believe you
Anonymous No.106913180 [Report] >>106913215
>>106913176
then you have a year of lore to catch up on
Anonymous No.106913185 [Report] >>106913215
>>106913165
>>106913176
he said that because he was an employee of StabilityAI, then he realized he was selling his soul to the wrong company, now it's all right, he's selling his soul to API nodes, that's much better
Anonymous No.106913186 [Report]
>>106913115
from reddit;
>DGX Spark is a dev kit for GB300. So if you’re developing a high performance software and can’t afford to buy/rent GB300 for development, you can buy DGX Spark and test your code there.

>DGX Spark is not for local LLM inference.
> If you buy one, do not use it for LLM inference, that's dumb.

https://www.reddit.com/r/LocalLLaMA/comments/1o69vm5/whats_the_point_of_a_dgx_spark_for_inference_if_a/

Basically the anon 2 days ago that bought it thinking it was going to make WAN loras was stupid as fuck.
Anonymous No.106913208 [Report] >>106913230
https://github.com/comfyanonymous/ComfyUI/pull/10373
>Workaround for nvidia issue where VAE uses 3x more memory on torch 2.9
wtf, did anyone switch to torch 2.9?
Anonymous No.106913215 [Report] >>106913268
>>106913180
>>106913185
This has to be bullshit
Anonymous No.106913230 [Report]
>>106913208
if you update to the latest version, it will put you on torch 2.9, which breaks xformers btw. there's an alternative updated version that still uses 2.8 though.
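To check which torch you actually ended up on after an update, something like this works, run with the same Python environment ComfyUI uses. It is a small sketch using only the standard library; it reports versions and does not fix anything.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


for name in ("torch", "xformers"):
    print(name, installed_version(name))
```

If torch shows 2.9 and xformers is listed, that is the combination the post above says is broken.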
Anonymous No.106913238 [Report]
>Kijai
Anonymous No.106913268 [Report] >>106913277
>>106913215
it is true though, that's why he left StabilityAi, he couldn't accept lying this much about such a mid product like SD3 medium
Anonymous No.106913269 [Report]
>>106913068
Chroma HD Flash is all you need to speed it up. Though ideally nunchaku Chroma would be out by now, any day now...
Anonymous No.106913275 [Report]
Is Chroma really better than Flux?
Anonymous No.106913277 [Report]
>>106913268
>that's why he left StabilityAi
he left because robin left. grift chink scooped him up to slap new chains on him now he shills api nodes and shitty hardware
Anonymous No.106913282 [Report] >>106913342
lightly technical question:
I'm using comfy, and almost every lora I've downloaded has the trigger words baked into the file, so that the Lora Info node can read them and let me just copy them into the prompt
however, one lora I've downloaded hasn't done this, and the output from the Lora Info node is completely blank, presumably because the author is a retard
does anyone know of a node that can save notes attached to a lora, even when that lora isn't loaded, or failing that can I edit the lora itself so that the trigger words (which I can get from the civitai page) show up in the Lora Info node?
and before you ask, no I can't get comfy to just pull the info from civitai itself, I just get 500 errors
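One way to see what is actually baked into a LoRA: a .safetensors file starts with an 8-byte little-endian length, followed by a JSON header, and free-form string metadata lives under the `__metadata__` key of that header. The sketch below only reads it. Which keys the Lora Info node looks for isn't stated in the thread, so treat any key name as something to check rather than a given.

```python
import json
import struct


def read_safetensors_metadata(path):
    """Return the free-form string metadata stored in a .safetensors header."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))  # little-endian u64
        header = json.loads(f.read(header_len).decode("utf-8"))
    # Files saved without metadata simply lack this key.
    return header.get("__metadata__", {})
```

Usage would be `print(sorted(read_safetensors_metadata("my_lora.safetensors")))` with your own file path. If that comes back empty, the file carries no metadata at all, which would explain a blank Lora Info output. Keeping the trigger words from the civitai page in a text file next to the LoRA is the low-risk fallback compared to rewriting the file.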
Anonymous No.106913299 [Report]
can you clean vram on forge?
Anonymous No.106913301 [Report] >>106913309
So I'm going to try image gen which I stopped shortly after the release of ComfyUI. Is there a go-to UI for retards where I can get started fairly easily and dive into details later?
Anonymous No.106913309 [Report]
>>106913301
ComfyUI
Anonymous No.106913310 [Report] >>106913317 >>106913329 >>106913336 >>106914041
>>106913068
Anonymous No.106913317 [Report] >>106913328 >>106913341
>>106913310
I don't get
Anonymous No.106913328 [Report]
>>106913317
open your manager and look for it retard
Anonymous No.106913329 [Report] >>106913395
>>106913310
post workflow
Anonymous No.106913336 [Report]
>>106913310
post skin color
Anonymous No.106913340 [Report] >>106913353
comfy nooooooooooo
Anonymous No.106913341 [Report] >>106913351
>>106913317
you just skip the first 30% of the gen and get straight to the good stuff, ez
Anonymous No.106913342 [Report] >>106913421
>>106913282
Don't know what you're talking about. Try power lora loader and check what tokens were trained.
Anonymous No.106913351 [Report]
>>106913341
No, it's the opposite. It doesn't skip the first 30% because they are most important. And the rest it skips every 1 step.
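The behaviour described above can be sketched as a per-step plan: run the model on every step for the first 30%, then on every other step after that. The node itself isn't named in the post, so this only illustrates the description; the function name, the rounding, and whether the first step after warmup is run or skipped are all assumptions.

```python
import math


def steps_to_compute(total_steps, warmup_fraction=0.3):
    """Return a list of booleans: True where the model is actually run."""
    warmup = math.ceil(total_steps * warmup_fraction)
    plan = []
    for step in range(total_steps):
        if step < warmup:
            plan.append(True)  # early steps always run
        else:
            # After warmup, alternate: run one step, skip the next.
            plan.append((step - warmup) % 2 == 0)
    return plan


plan = steps_to_compute(20)
print(sum(plan), "of", len(plan), "steps computed")
```

The saving comes entirely from the later steps, which is why it costs less quality than skipping early ones would.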
Anonymous No.106913353 [Report]
>>106913340
lmao
Anonymous No.106913355 [Report] >>106913432
>>106912366
That looks slopped and nowhere close to what that other anon did (the anon who refused to share his catbox)
But I get your intention was just to make a big titty 'schoolgirl'
Anonymous No.106913386 [Report] >>106913399
is there a single reason to use pony?
Anonymous No.106913390 [Report] >>106913428
the blue hair anime girl gives the red hair anime girl a wrapped gift.
Anonymous No.106913395 [Report] >>106913420
>>106913329
https://files.catbox.moe/9e1twx.png
Anonymous No.106913399 [Report]
>>106913386
Horse too expensive, mule too stubborn
Anonymous No.106913420 [Report]
>>106913395
sankyu, neuro enjoyer
Anonymous No.106913421 [Report]
>>106913342
I don't need the trained tokens, and I can already get those with pythongs lora loader anyway
Anonymous No.106913427 [Report] >>106913468
>>106913138
wow, he used it for 5 mins then just uses his 5090
Anonymous No.106913428 [Report] >>106913446
>>106913390
What is the point of spamming garbage from this scene every other thread? Those "tests" lost their novelty and are not interesting at all.
Are you the same guy who used to spam Miku edits with Kontext and Qwen Edit?
Anonymous No.106913432 [Report] >>106913438 >>106913457
>>106913355
I'm not prompting for that though. It's clearly not impossible with a model as good as Chroma, so not sure why you're doubting him.
Anonymous No.106913438 [Report] >>106913466
>>106913432
If you ran it for 30 steps, it would look way better. Or used base + flash lora
Anonymous No.106913446 [Report] >>106913507
>>106913428
it's just a test nogenner
Anonymous No.106913457 [Report] >>106914102
>>106913432
Anon, I have been telling you since the last thread: your gens are not as good as you think. Pay attention to the walls, skies etc, there are noticeable artifacts, and the subjects have a weird smudge in their skin
Anonymous No.106913466 [Report] >>106913479
>>106913438
All my images are only 8-9 steps with Heun/beta, CFG 1. I'm on Chroma HD Flash so no need for that many steps and it pretty much one shots it most of the time. I know many of my prompts would take a bunch of tries if not impossible on the full version (HD Flash is closer to convergence).
Anonymous No.106913468 [Report]
>>106913427
worth it
Anonymous No.106913479 [Report]
>>106913466
>8-9
That's why it looks so ass retard
Anonymous No.106913485 [Report]
post a single gen from chroma that looks good
Anonymous No.106913487 [Report] >>106913493
>>106912987
his name is literally "ComfyAnonymous"?
wat
Anonymous No.106913493 [Report] >>106913510
>>106913487
I guess he doesn't want to say his real name
Anonymous No.106913494 [Report]
Anonymous No.106913503 [Report] >>106913633 >>106913658 >>106913670
comfy yes
Anonymous No.106913507 [Report] >>106913531
>>106913446
A tranime character giving a gift to another, wow, so interesting and original
What is the purpose of sharing those uninteresting tests every thread?
It was interesting early on (new model releases with significant differences), now they are just a waste of space
Anonymous No.106913510 [Report] >>106913564
>>106913493
hm okay
Anonymous No.106913522 [Report] >>106913535 >>106913546 >>106913556 >>106914041
>>106910887 (OP)
Yo, What's the minimum hardware requirement to start with this shit?
Anonymous No.106913531 [Report]
>>106913507
the guy is clearly autistic and slow in the head and has no idea how to read the room. go easy on him. in real life you also dont walk up to any mentally ill retard and ask him why hes doing the retarded shit that hes doing. hes retarded. convincing him to act like a normal person is going to be impossible, like trying to get a downie to play a normal human. doesnt work.
so chill out, just ignore the guy if you dont like his posts.
Anonymous No.106913535 [Report]
>>106913522
All of it
Anonymous No.106913546 [Report]
>>106913522
what shit, retard? requirements vary greatly based on what model you want to use
Anonymous No.106913551 [Report]
when is the ani shota collection supposed to drop bros
Anonymous No.106913556 [Report]
>>106913522
>Yo
ey YO yiki yoYO that was a pretty broad question yokoYO
Anonymous No.106913564 [Report]
>>106913510
lmao, this is amazing
Anonymous No.106913616 [Report]
>>106911368
Use that but add a first step in high without the lora.
Anonymous No.106913633 [Report]
>>106913503
Kek
Anonymous No.106913638 [Report]
Anonymous No.106913658 [Report]
>>106913503
ahahah, it's been a while since I've seen such kino in this place
Anonymous No.106913670 [Report]
>>106913503
make ani hit him with a shovel
Anonymous No.106913675 [Report]
Anonymous No.106913721 [Report]
Lmao I've never had this kind of failgen
Anonymous No.106913773 [Report]
love qwen edit
Anonymous No.106913780 [Report] >>106913786 >>106913925
>>106912987
Thats why I use AniStudio
Anonymous No.106913786 [Report] >>106913831
>>106913780
that mspaint lora is putting in work!
Anonymous No.106913804 [Report] >>106913811 >>106913818 >>106913931
https://www.youtube.com/watch?v=qGe_fq68x-Q
Westsisters? Our response?
Anonymous No.106913811 [Report]
>>106913804
no CUDA
Anonymous No.106913818 [Report] >>106913827
>>106913804
>no fan
it's obviously shit. didn't click
Anonymous No.106913827 [Report]
>>106913818
is this the /g/ version of
>no tail
Anonymous No.106913831 [Report]
>>106913786
mspaint is local, and my hand movements while drawing are my tags
Anonymous No.106913855 [Report]
>>106912638
Anonymous No.106913869 [Report]
>wan i2v models and wf
>forgot to connect load image node to WanFirstLastFrameToVideo node so no image provided
>still get a scene of what I prompted like t2v
Anonymous No.106913894 [Report]
Anonymous No.106913908 [Report]
Anonymous No.106913925 [Report] >>106913933
>>106913780
When are we getting a good AI tool not designed by neckbeards and trannies?
It seems all non-autistic dudes that write good software go on the SaaS route

Hopefully the llm vibe coding culture lowers the barrier to write sane software
Anonymous No.106913928 [Report] >>106913948
Anonymous No.106913931 [Report]
>>106913804
LPDDR4x. It's probably slower than a DGX Spark at full compatibility
Anonymous No.106913933 [Report] >>106913951
>>106913925
do people "vibe code" with chatgpt or do they use open source llms with lm studio and so on?

I imagine an unrestricted model would be better than censorshipAI
Anonymous No.106913948 [Report] >>106913984
>>106913928
I will note that qwen edit is better with text, before kontext was better on that front. now qwen is overall better on all fronts.
Anonymous No.106913951 [Report] >>106913958 >>106913972
>>106913933
local LLMs are a joke and are only useful for degenerate purposes (ERP and writing erotica), but don't let anyone know that
Even the high end open weights LLMs (the ones that no one can run on local hardware unless you own an enterprise grade cluster at home) underperform vs the proprietary API-only ones
Anonymous No.106913958 [Report] >>106913985 >>106913990
>>106913951
so for making an app/game lets say, grok or chatgpt? I figure grok might be better cause openAI love censoring shit
Anonymous No.106913967 [Report]
>>106911457
Anonymous No.106913972 [Report]
>>106913951
>local LLMs are a joke and are only useful for degenerate purposes (ERP and write erotica), but don't let anyone know that
Painfully dumb take, hurts to read it. GJ anon.
Anonymous No.106913973 [Report]
Anonymous No.106913984 [Report]
>>106913948
kontext is better at safety
Anonymous No.106913985 [Report]
>>106913958
"censorship" doesn't matter that much when it comes to writing software (which those "censored" LLMs do well), it's just a retarded narrative parroted in AI circles where people only use AI to jerk off
Anonymous No.106913990 [Report] >>106915617
>>106913958
Unfortunately Claude is really good, but it's expensive enough and the company is horrible enough that you aren't going to want to pay them unless you're really desperate. I found Deepseek pretty serviceable, but sometimes it gets really dumb. I've been hearing great things about GLM 4.6 but I haven't used it for coding yet. Supposedly the new Gemini blows everything out of the water but I don't think it is actually out yet.
Anonymous No.106913998 [Report]
yeah. the new qwen edit is much better at text.

you heard her!
Anonymous No.106914027 [Report] >>106914032 >>106914056 >>106914574
What's the best method to run ComfyUI?
- Desktop?
- Portable?
- Stability Matrix?
Anonymous No.106914031 [Report] >>106914904
Anonymous No.106914032 [Report]
>>106914027
in a venv on a dedicated linux machine
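
A rough sketch of the venv route using Python's own `venv` module. The helper names `make_venv` and `launch_commands` are made up for illustration, and the clone location is an assumption. It only relies on the ComfyUI repo shipping a `requirements.txt` and a `main.py`; installing the right torch build for your GPU is left out.

```python
import sys
import venv
from pathlib import Path


def make_venv(env_dir: Path, with_pip: bool = True) -> Path:
    """Create a virtual environment and return the path to its interpreter."""
    venv.create(env_dir, with_pip=with_pip)
    # Windows puts the interpreter in Scripts/, everything else in bin/.
    if sys.platform == "win32":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def launch_commands(venv_python: Path, comfy_dir: Path) -> list[list[str]]:
    """Build the two commands: install requirements, then start ComfyUI."""
    return [
        [str(venv_python), "-m", "pip", "install", "-r",
         str(comfy_dir / "requirements.txt")],
        [str(venv_python), str(comfy_dir / "main.py")],
    ]


# Example use (assumed clone location, not run here):
#   comfy = Path.home() / "ComfyUI"
#   py = make_venv(comfy / ".venv")
#   for cmd in launch_commands(py, comfy):
#       subprocess.run(cmd, check=True)   # needs `import subprocess`
```

Calling the venv's interpreter by full path means nothing has to be "activated", so the system Python stays untouched.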
Anonymous No.106914041 [Report]
>>106913310
Enjoy your grain
>>106913522
You can run SD1.5 on a raspberry pi
Anonymous No.106914047 [Report]
very inorganic
Anonymous No.106914056 [Report]
>>106914027
Portable. I've heard desktop has some issues.
Anonymous No.106914066 [Report] >>106914103
>>106912019
What are your times like? I've got 4080 FE and 6 seconds takes me a solid 9 minutes for 480p using the fast rentry setup T2V. I2V is much faster though.
Anonymous No.106914102 [Report]
>>106913457
Not me you were talking to, I'm purposely prompting for grainy images and I can swap the style out if I want to.
Anonymous No.106914103 [Report] >>106914111
>>106914066
with lightx2v loras and 4 steps (2/2) it's like 100-120 seconds with interpolation

dont do wan without the loras or it takes forever. quality can still be very good with them, before loras gens would take like 10 to 15 min.
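
Back-of-the-envelope for the numbers above, as a small sketch. It only uses the figures from this post (10 to 15 min before, 100 to 120 s with the loras); `speedup_range` is a made-up helper, and real times will depend on GPU, resolution and frame count.

```python
def speedup_range(before_s, after_s):
    """Return (min, max) speedup given (low, high) time ranges in seconds."""
    before_lo, before_hi = before_s
    after_lo, after_hi = after_s
    # Worst case: fastest "before" against slowest "after", and vice versa.
    return before_lo / after_hi, before_hi / after_lo


without_loras = (10 * 60, 15 * 60)  # 10 to 15 minutes, in seconds
with_loras = (100, 120)             # 4 steps (2/2) with lightx2v loras

lo, hi = speedup_range(without_loras, with_loras)
print(f"{lo:.1f}x to {hi:.1f}x faster")  # prints "5.0x to 9.0x faster"
```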
Anonymous No.106914111 [Report]
>>106914103
also, use the wan 2.2 i2v template workflow in comfy, it works well and has the lora setup as well I believe.
Anonymous No.106914116 [Report] >>106914204 >>106914641
Retard here, been mesmerized by this for a while. Love wardrobe malfunction, and qt nihons. I want to wallow in depression.

What do I need to gen something like this and can we actually get nipples and genitals?
Anonymous No.106914181 [Report] >>106914215
>>106913139
bro you arent even close to fitting deepseek on that
Anonymous No.106914204 [Report]
>>106914116
These models are not in a state yet where they can do specific fetish out of the box even if you prompt really hard for it, they heavily rely on Loras
So unless you train the lora yourself or someone already did it, you shouldn't expect to get what you want easily
Anonymous No.106914215 [Report] >>106914243
>>106914181
You could fit several ollama deepseek R1s.
Anonymous No.106914243 [Report] >>106914296
>>106914215
you could fit several of my nuts in your mouth, but that does not mean they are the real deepseek
Anonymous No.106914296 [Report] >>106914314 >>106914316
>>106914243
The Ollama devs singlehandedly wrote the code that made local LLMs possible so I'm going to trust them over you on this one.

I know, I'm laying it on too thick.
Anonymous No.106914314 [Report] >>106914340
>>106914296
>The Ollama devs singlehandedly wrote the code that made local LLMs possible
You mean the grifters who just forked llama.cpp, never gave it credit, and made a flashy normie-baiting product (which is essentially only a wrapper/interface with repo) out of it?
Anonymous No.106914316 [Report]
>>106914296
the experts of ldg always manage to surprise me so im never sure
Anonymous No.106914340 [Report] >>106914380
>>106914314
Yeah, but now they're working on rewriting the codebase after the fact so they can take it private. Then nobody can say they don't deserve it!
Anonymous No.106914341 [Report] >>106914391 >>106914443 >>106914479 >>106914574
Any recommendations for music generation?
Anonymous No.106914380 [Report]
>>106914340
they will fade into obscurity when LlamaBarn gets released
Anonymous No.106914391 [Report]
>>106914341
Honestly just stick to SaaS for that (Suno, Udio). All open-source musicgen models are shit
Anonymous No.106914443 [Report] >>106914462
>>106914341
YuE afaik

>>106914391
why don't you let anon decide that for themself
Anonymous No.106914462 [Report] >>106914701
>>106914443
>YuE afaik
Yeah... If you can afford waiting 10~20 minutes to get so-so outputs that skip entire verses, sure
Anonymous No.106914479 [Report] >>106914628
>>106914341
udio is so far ahead local it would be like having sd1.4 local compared to current novelai on the saas front
Anonymous No.106914574 [Report]
>>106914341
songbloom, but it's not that great

>>106914027
i'd use stabilitymatrix or portable
Anonymous No.106914592 [Report]
Anonymous No.106914628 [Report]
>>106914479
I agree. We'll catch up eventually, there's no way Chinks are sitting on this goldmine that is AI music gen. Someone has to be cooking something good.
Anonymous No.106914641 [Report] >>106914664
>>106914116
Anonymous No.106914648 [Report]
Anonymous No.106914661 [Report] >>106914665
I just want a closed source competitor to ComfyUI
Anonymous No.106914664 [Report]
>>106914641
Not bad
Anonymous No.106914665 [Report]
>>106914661
Comfy?
Anonymous No.106914674 [Report] >>106914710 >>106914726 >>106914850 >>106915044
Damn, this mmaudio nsfw finetune actually works.
(nsfw) https://files.catbox.moe/nahift.mp4
https://huggingface.co/phazei/NSFW_MMaudio
Anonymous No.106914701 [Report] >>106914717 >>106914743
>>106914462
YuE is still best for composition.

https://map-yue.github.io/music/%E5%AE%8C%E7%92%A7%E3%81%AA%E9%96%A2%E4%BF%82.mp3

I have posted this many times before, feel free to read the lyrics here under "English + Japanese + Korean Code Switching Kpop"
https://map-yue.github.io/

It pretty much nails every language.

It's behind closed source but this is a bad Udio output:
https://www.udio.com/songs/79crys6WpDoA1FQUswzuWK

No, ACE-Step cannot do anything like this. All ACE-Step seems to be good at is some Chinese rapping music. Regardless of how good their sound quality may be compared to YuE, their composition is not at the same level. There is a massive difference in dataset quality used to train both models. The same can be said about Songbloom (even more, and that model can't even be prompted without a sample). If you want to compare models genning from samples, look at this:
https://x.com/cocktailpeanut/status/1886456240156348674

Can Songbloom do this level of quality with the instruments? No.
Anonymous No.106914708 [Report]
Anonymous No.106914710 [Report] >>106914720
>>106914674
Neat. I can't believe I'm saying this.. does MMAudio have comfyUI integration? Since Wan22 is already there, might as well add MMAudio.
Anonymous No.106914717 [Report]
>>106914701
Retard
Anonymous No.106914720 [Report] >>106914730
>>106914710
https://github.com/kijai/ComfyUI-MMAudio/tree/main
Anonymous No.106914726 [Report] >>106914735
>>106914674
How long does this take to run inference? Being 2B it should be fast but doesn't hurt to ask.
Maybe I should give it a try.
Anonymous No.106914730 [Report]
>>106914720

Kijaigod... I kneel.
Anonymous No.106914735 [Report]
>>106914726
like ten seconds
Anonymous No.106914743 [Report]
>>106914701
>It's behind closed source but this is a bad Udio output

And I meant to say, that YuE output is very similar in quality to the bad Udio output (slightly worse in terms of sound quality, but still).
Anonymous No.106914802 [Report]
can wanimate be run without speed loras with the wrapper and KJ scaled model, or do i need the native workflow and model?
for some reason, when i try running without the speed lora even with 40 steps, it still comes out looking slopped compared to using speed lora and I can't figure out why
Anonymous No.106914812 [Report]
Anonymous No.106914830 [Report]
Anonymous No.106914850 [Report]
>>106914674
Can it do sfx too?
Like bed creaking, etc?
Anonymous No.106914864 [Report] >>106914883
>>106912963
this ain't bad for a 1hr train
Anonymous No.106914879 [Report]
which video upscaling model would you recommend that i could run locally? only need a 2x upscale at most
Anonymous No.106914883 [Report] >>106914959
>>106912963
>>106914864
Is it in any trainers?
Anonymous No.106914904 [Report]
>>106914031
good gen
Anonymous No.106914919 [Report]
Anonymous No.106914959 [Report]
>>106914883
i don't think lora+ is but as far as the method from TheBen goes, it's available in any trainer that lets you specify layers. the lora is indeed 9MB
Anonymous No.106914995 [Report]
Anonymous No.106914998 [Report] >>106915026
bottom heavy lora, 10 images from (/s/thread/22290821)
Anonymous No.106915006 [Report] >>106915008 >>106915029
homie got weird titty but she kinda fine
Anonymous No.106915008 [Report] >>106915035
>>106915006
what model, tho?
Anonymous No.106915026 [Report]
>>106914998
Upload please.
Anonymous No.106915029 [Report]
>>106915006
now that's a proper ass
Anonymous No.106915035 [Report] >>106915040 >>106915050
>>106915008
flux, going to run these with chroma soon to compare
https://files.catbox.moe/02kzlc.png
Anonymous No.106915040 [Report]
>>106915035
ty, king <3
Anonymous No.106915044 [Report]
>>106914674
I should mention, videos have to be 24 fps at least.
Anonymous No.106915050 [Report]
>>106915035
Which Flux? Krea is best.
Anonymous No.106915055 [Report]
flux cant do nsfw without a billion loras though
Anonymous No.106915086 [Report]
bak?
Anonymous No.106915096 [Report]
bak to the past
Anonymous No.106915103 [Report]
new
>>106915102
>>106915102
>>106915102
>>106915102
Anonymous No.106915296 [Report]
>>106912987
why the long face at 0:13? is it because he's shilling a piece of shit?
Anonymous No.106915302 [Report]
>>106913057
i don't know why i find these psycho asian chicks so hot.. never imagined i would
Anonymous No.106915617 [Report]
>>106913990
>and the company is horrible enough
qrd?