
Thread 106259679

318 posts 232 images /g/
Anonymous No.106259679 >>106259712 >>106261207 >>106262011 >>106262044
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106255294

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106259712 >>106259779
>>106259679 (OP)
first post
Anonymous No.106259719 >>106259748
>>106259681
he did, look at the archive newfag. he was acting like his UI was the only way to test it despite another anon showing off the nudity the day before
Anonymous No.106259734
>>106259637
monitor your vram usage. 100% sure I was able to work with sdxl with an 8gb vram card. slow, painful, but it did work. and on a1111, back then.
Anonymous No.106259738 >>106259750
I shouldn't let the normalfag piss filter rhetoric get to me but I do. Why are they so retarded.
Anonymous No.106259748 >>106259760
>>106259719
>he was acting like his UI was the only way to test it
Pretty sure that guy was just using the command line to generate it.
Anonymous No.106259750 >>106259759
>>106259738
not an issue if you don't use o1
Anonymous No.106259757 >>106259767 >>106259844 >>106260181
Anonymous No.106259759
>>106259750
Exactly. And they refuse to acknowledge it.
Anonymous No.106259760
>>106259748
ok? that anon let us know more about the model than fennec girl in the gutter covered in mud over and over again. he was floating how varied the model was when all his images looked pretty much the same
Anonymous No.106259767
>>106259757
>My beautiful daughter's first day as a nurse. I'm so proud of her.
Anonymous No.106259779 >>106259791 >>106259855
>>106259712
this reminds me, what's with the new anime diffusion thread (or whatever the fuck it's called)? I mean, noob is still giving, so yeah, sure, still..
Anonymous No.106259791 >>106259872
>>106259779
nai posters have somewhere to dump their shit too now. idrc, it just accelerates things towards an /ai/ board
Anonymous No.106259796 >>106259872
evenin
Anonymous No.106259813 >>106259815
Anonymous No.106259815
>>106259813
kek
Anonymous No.106259844
>>106259757
TJD
Anonymous No.106259855
>>106259779
No one knows what their deal is.
Anonymous No.106259872 >>106260583 >>106260609
>>106259796
freckles woooo. sdxl shoe fuckup wooo. ass woooo. evening.
>>106259791
I see. I mean we had this discussion a few times but animated / non animated/actual fucking images would make sense imo
Anonymous No.106259876
comfy should be dragged out on the street and shot
Anonymous No.106259894 >>106259951 >>106260233
https://civitai.com/models/1802623?modelVersionId=2039975 Would the 480p of this actually work on 8GB VRAM like it claims?
Anonymous No.106259951 >>106259975 >>106260233
>>106259894
there is really only one way to find out anon. probably need a decent ram buffer tho, how much you got?
Anonymous No.106259975
>>106259951
32GB. I'm almost done downloading it so I'm gonna give it a go.
Anonymous No.106260025 >>106260033
Anonymous No.106260033
>>106260025
Heartwarming
Anonymous No.106260069 >>106260089 >>106260110 >>106260152 >>106263846 >>106263894 >>106264658 >>106264710
I wonder what anon posting intentional uncanny valley was banned for. anyways, just want to apologize I went on a detour because I was bored. getting rid of the tables for the params. just too much trying to make it work when it's not really needed at all
Anonymous No.106260089 >>106260103
>>106260069
save the fucking param load state on startup first, that is more annoying
Anonymous No.106260103
>>106260089
true. I'll do that first then
Anonymous No.106260110
>>106260069
AHHHHHH
I SEE A NIPPLE
I SEE TWO NIPPLES
MY VIRGIN EYES NOOOOOO
Anonymous No.106260142
Anonymous No.106260152 >>106260184
>>106260069
Retard here; would it be nontrivial (as in not needing to recompile) to, for example, change the latent resolution setting to function as an aspect ratio instead of what it is now? I'm imagining setting it to something like 16:9, or 2.3, and then another box for scaling the length and width.

This is just one of my many ideas.
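
The aspect-ratio idea above comes down to a little arithmetic. A minimal sketch, not taken from the UI being discussed: the function name `dims_from_aspect`, the one-megapixel budget, and the snap-to-16 rule are all assumptions made for illustration.

```python
import math


def dims_from_aspect(aspect, megapixels=1.0, multiple=16):
    """Return (width, height) close to a pixel budget for a given aspect ratio.

    aspect is width / height, e.g. 16 / 9 or 2.3.
    megapixels and multiple are illustrative defaults, not values from any real UI.
    """
    target_pixels = megapixels * 1024 * 1024
    # Solve width * height = target_pixels with width = aspect * height.
    height = math.sqrt(target_pixels / aspect)
    width = height * aspect

    def snap(value):
        # Round to the nearest multiple so the latent grid divides evenly.
        return max(multiple, int(round(value / multiple)) * multiple)

    return snap(width), snap(height)


print(dims_from_aspect(16 / 9))  # (1360, 768)
print(dims_from_aspect(1.0))     # (1024, 1024)
```

A second "scale" box would then only need to change `megapixels`, leaving the ratio untouched.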
Anonymous No.106260153
Anonymous No.106260181
>>106259757
god damn this is some really good shie..
Anonymous No.106260184 >>106260222
>>106260152
yes but I have to focus on logistics. you would need to recompile but I separated the project into three separate libs and all external libs are statically linked so I skip rebuilding them too. the only pain in the ass is the first build and waiting for Conan to fetch all the libs not included directly
Anonymous No.106260222
>>106260184
that doesn't sound too bad. let the thread know when the next update hits and I might just pr some things
Anonymous No.106260233 >>106260339 >>106262315
>>106259894
>>106259951
Oh fuck it works.
Doesn't look as good as the one I stole the prompt from https://civitai.com/posts/20620021
but I can work with this.
Anonymous No.106260283
so if you have 16gb, can you run qwen at all? what quants/files?
Anonymous No.106260309 >>106260338 >>106260350 >>106260366 >>106260410
What's the verdict on Chroma versions, we've had enough time. I am on v49 but still haven't discarded v48.
Anonymous No.106260338
>>106260309
48 for "old" chroma. The versions above kinda suck for 3dpd. I'm keeping them until the mentioned retrain emerges
Anonymous No.106260339 >>106260369
>>106260233
w-work w..what anon?
Anonymous No.106260350
>>106260309
this so much!
Anonymous No.106260366 >>106260388
>>106260309
I'm on the 49 + 18 steps OSS + lora train right now
Anonymous No.106260369 >>106260400
>>106260339
Video gen you goof.
Anonymous No.106260388
>>106260366
>A. Botez
Sir please, the needful
Anonymous No.106260400
>>106260369
blessed digits
Anonymous No.106260410
>>106260309
I tested training a photorealism lora on v48 and v49, and v49 was better for that; no idea how it does with art styles though.

If you're just prompting from the model, no idea, I'm all into lora training.

That said, lodestones is apparently re-training from v48, so perhaps wait for whatever that ends up being.
Anonymous No.106260476 >>106260595
Friendly reminder that /ldg/ is a Chroma-friendly general. If you are AntiChromaSchizo, please don't come here or post negative things about Chroma anymore!
Anonymous No.106260494
https://www.reddit.com/r/StableDiffusion/comments/1mm4l00/qwenimage_has_been_distilled_to_run_in_8steps/

has anyone tried the distilled versions?
Anonymous No.106260583
>>106259872
>there is a plate of food next to her
she really likes limes
Anonymous No.106260595
>>106260476
didn't vote, doesn't count
Anonymous No.106260609
>>106259872
>there is a plate of food next to her
she really likes limes
Anonymous No.106260616
Doesn't know Marge Simpson hair, it's over!
Anonymous No.106260632 >>106260684
https://www.reddit.com/r/StableDiffusion/comments/1mlt803/lightx2v_team_relased_8step_lora_for_qwen_image/
Anonymous No.106260684
>>106260632
they should release a lora wan 2.2 that isn't complete shit
Anonymous No.106260695 >>106260795
the man in a black trenchcoat on the right of the image jumps into a swimming pool at the beach on a summer day. he is wearing black sunglasses.
Anonymous No.106260790
res_3s is my new favourite friend
Anonymous No.106260795
>>106260695
fun in the sun:
Anonymous No.106260871
is EBsynth still the best for rotoscoping? I'm trying to animate something and hate making too many keyframes.
Anonymous No.106260876
Anonymous No.106260918 >>106261038
Is there a way to run qwen with sage turned on or do I have to keep changing launch parameters?
Anonymous No.106261002 >>106261106
>discovered chroma
>using v37 and loving it
>downloading v10

what am I in for?
Anonymous No.106261038
>>106260918
you can have a separate .bat file
Anonymous No.106261054
the two men bungie jump off the building, over a mountain.

silly prompts for testing are best prompts
Anonymous No.106261106
>>106261002
Going back in time, Jurassic era
Anonymous No.106261174 >>106261259 >>106261316 >>106261399
https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main

https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF/tree/main

gguf + the lightning lora at 8 steps works, using this workflow:

https://huggingface.co/datasets/theaidealab/workflows/blob/main/qwen_image_distill_gguf.json

Q5 gguf. now I wanna see if Q8 works fine with multigpu. what is default qwen image size btw?
Anonymous No.106261186
imposter
Anonymous No.106261207
>>106259679 (OP)
>niggershit and shartcuck cancer in the OP
Kill yourself, subhuman piece of trash.
Anonymous No.106261212
time for some weather lady news anchor slop

the text is supposed to say "huge eruption soon"
Anonymous No.106261215
oh it can actually do 土下座 (dogeza)
Anonymous No.106261219 >>106261241 >>106261246
when will local be as good as online generation
Anonymous No.106261241 >>106261270
>>106261219
when chinks stop training on synthslop
Anonymous No.106261246 >>106261308
>>106261219
it is, but you are using quants because you are too poor to have the necessary vram to run it
Anonymous No.106261259 >>106261269 >>106261399
>>106261174
neat, works fine, 30-40s. no issues with GGUF Q5. will try Q8 when it's done.
Anonymous No.106261269 >>106261286
>>106261259
>30-40s
no thanks
Anonymous No.106261270 >>106261281
>>106261241
I wish we could harness the power of friendship to train the ultimate model using all computers from around the world.
Anonymous No.106261274
Is there still no reliable anti-talk conditioning for wan2.2? I don't get why these smelly chinks are so obsessed with making sure anime characters are constantly talking.
Anonymous No.106261281
>>106261270
I don't simply because Indians exist
Anonymous No.106261286 >>106261325 >>106261399
>>106261269
30 secs at that initial res is very fast, usually sdxl is 1024x1024

this was 37 secs, and it has yet to fuck up text on a sign.
Anonymous No.106261308
>>106261246
>it is
Anonymous No.106261316
>>106261174
>what is default qwen image size btw
1328x1328 IIRC
Anonymous No.106261325 >>106261343
>>106261286
revised:

cute anime girl Miku Hatsune wearing oversized clothes summer uniform long blue maxi skirt standing on a sunny beach. she is holding a sign with her right hand that says "Miku posts or ELSE" written in cursive, with a chibi Miku Hatsune on the sign. her left hand is holding a silver pistol, pointed at the camera.

yep, I like qwen already.
Anonymous No.106261343 >>106261355
>>106261325
better. never fucking advertise comfart UI like that. comfy's prompts are low quality shit
Anonymous No.106261355 >>106261379
>>106261343
fox girls are fine. at least it's a cute fox girl and not something like a troon in a rainbow shirt.
Anonymous No.106261362 >>106261389 >>106261403
what should I use for wan2.2 videos, comfy?
Anonymous No.106261379
>>106261355
cumfart 's foxgirl's outfit is normally pink and blue, the color of the trans flag. it's literally troon encoded
Anonymous No.106261384 >>106261473
uhhh...so for local diffusion you need gpu power while llms can comfortably use cpu/ram instead, right?
Anonymous No.106261389
>>106261362
https://github.com/deepbeepmeep/Wan2GP
if you value your time instead of troubleshooting retardation a lot
Anonymous No.106261395 >>106261423
cute anime girl Miku Hatsune wearing a white racing suit with "Racing Miku" on the front. She is holding a large umbrella, with a chibi Miku Hatsune on the fabric.

CUTE!
Anonymous No.106261399 >>106261421 >>106261436
>>106261174
>>106261259
>>106261286

This node based workflow is unironically retarded.

Why couldn't you look at how DAWs do it? You can copy the UI from any modern DAW.
FL Studio and even hardware mixers like x32 figured this out ages ago with routing matrices and numbered channels.
You can manage hundreds of connections without creating a single visual spaghetti noodle.
Instead we get this garbage.
Pic related
Anonymous No.106261403
>>106261362
Is there anything else that supports Wan 2.2 ?
Anonymous No.106261421
>>106261399
that isn't comfy and you'd probably have an easier time asking ani for it. he has a video sequencer that could easily fit the bill
Anonymous No.106261423
>>106261395
this is very good as a flux alternative, might be even better (well, it's 20B, it should be)
Anonymous No.106261436
>>106261399
hey bud you're seething at the kontext autist
Anonymous No.106261445 >>106261454
WHERE IS COMFY!!!
Anonymous No.106261453
Does qwen have its own lora loader node? The lightning lora doesn't seem to work with the normal loader
Anonymous No.106261454
>>106261445
having sex with her Asian ladyboy
Anonymous No.106261473
>>106261384
yes
but also there's this
https://github.com/leejet/stable-diffusion.cpp
but it's a meme
Anonymous No.106261497
even detailed wheels, neat

cute anime girl Miku Hatsune wearing a white racing suit with "Racing Miku" on the front. She is holding a large umbrella, with a chibi Miku Hatsune on the fabric.
Anonymous No.106261607
gainsmaxing
Anonymous No.106261619 >>106261633
Anonymous No.106261633 >>106261640
>>106261619
i went a good few hours without remembering niggers exist and you just had to ruin it
Anonymous No.106261640
>>106261633
remember you arent one and all is well
Anonymous No.106261659 >>106261715 >>106261848
A four panel comic. In the first panel Miku Hatsune says "why is there so much crime?" while sitting at a news desk. In the second panel, there is a picture of Chicago on fire. In the third panel, Miku says "how did this happen?". In the fourth panel a cartoonish black man is visible saying "sheeeeeeet!" while standing in the streets of chicago.

this is pretty good. very good coherence.
Anonymous No.106261667
zzzzzzzzzzzzzzzzzz
Anonymous No.106261715 >>106261722
>>106261659
Chicago should be prompted to be burning in the last panel, but yes, it does well. Chinks keeps improving local, will western big tech ever step up ?

Meanwhile ClosedAI rolls out a disaster GPT-5, Sam Altman clearly needs another 500 billion
Anonymous No.106261722 >>106261758
>>106261715
Elon's desire to crush Sam will advance AI more than anything
Anonymous No.106261745 >>106261794 >>106261848
A four panel comic. In the first panel Miku Hatsune says "would you like to solve the puzzle?" while sitting at a desk on a game show like wheel of fortune. In the second panel, an anime girl with red curly hair is holding a sign saying "N". In the third panel, Miku Hatsune says "you win!". In the fourth panel the anime girl with red curly hair celebrates with Miku Hatsune as dollar bills rain from the sky.

it doesn't know Teto without a lora so you get knockoff Teto.
Anonymous No.106261758 >>106261769
>>106261722
Nothing like good old competition. Imagine the hardware we would've had by now if njewdia actually someone to compete with
Anonymous No.106261769 >>106261784 >>106262020
>>106261758
it's wild how AMD make godlike CPUs but can't be as competitive with the GPU side. Their CPUs basically killed Intel at this point.
Anonymous No.106261784
>>106261769
they just don't fucking care about rocm at all. it's actually fucking insane
Anonymous No.106261794 >>106261848
>>106261745
pretty good
Anonymous No.106261848 >>106261861 >>106261878
>>106261794
>>106261745
>>106261659
what model is this?
Anonymous No.106261861
>>106261848
nta, but qwen i'm assuming
Anonymous No.106261878 >>106261896 >>106261910 >>106262002 >>106262105 >>106265220
A World War 2 propaganda style poster with Miku Hatsune. The sign says "if you don't listen to Miku, you listen to HITLER!". Miku is wearing a military uniform and helmet.

very good

>>106261848
qwen distilled (faster, more vram friendly)

https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF/tree/main

using q5, and use the 8 step lora:

https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main
Anonymous No.106261896
>>106261878
this workflow:

https://huggingface.co/datasets/theaidealab/workflows/blob/main/qwen_image_distill_gguf.json

but attach the lora loader if it's not there or gguf loader
Anonymous No.106261910
>>106261878
damn the chinks cooked yet again, preciate the info anon
Anonymous No.106261940 >>106262008 >>106262105
a painting of Miku Hatsune in an art gallery, done in the style of the artist Van Gogh.

neat
Anonymous No.106261955
ar rook same
Anonymous No.106262002 >>106262048
>>106261878
it's so over for 12GB vramlets
Anonymous No.106262008
>>106261940
picasso:

pretty cool
Anonymous No.106262011 >>106262044 >>106263899
>>106259679 (OP)
>file deleted
Anonymous No.106262020 >>106262038
>>106261769
It's weird, as in suspiciously weird

Nvidia has been pumping billions into its AI framework and optimizing its cards for AI for ~a decade

AMD, which has strong GPUs, just sits there twiddling its thumbs. AI explodes, AMD keeps twiddling its thumbs for three years, then finally starts to realize it needs to be a participant in the AI space

Like WTF ?
Anonymous No.106262038
>>106262020
jensen and lisa are cousins, could be a "gentleman's agreement". who knows.
Anonymous No.106262044 >>106262105 >>106262213
>>106262011
>>106259679 (OP)
but why,
Anonymous No.106262048 >>106262062
>>106262002
Ehh, if anything with these quantizized weights they're so back
Anonymous No.106262062 >>106262112
>>106262048
I doubt q2 will be that good
Anonymous No.106262072
hm, q8 qwen distilled is 21gb but works fine without multigpu. I have 16gb vram. I expected to need the node with virtual vram (gguf multigpu).

Q8 distilled sample, 29s with the 8 step lora
Anonymous No.106262073 >>106262105 >>106262213
why did the collage get deleted
Anonymous No.106262105 >>106262126 >>106262167
>>106261878
>>106261940
These are a bit yellow.

>>106262044
>>106262073
It had racist and transphobic image, ch*d.
Anonymous No.106262112 >>106262162
>>106262062
With offloading they can run q4 at the very least
Anonymous No.106262126 >>106262675
>>106262105
>It had racist and transphobic image, ch*d.
I thought I was on 4chan, not reddit...
Anonymous No.106262129
Can the issue of models almost always fucking up zero shots of eyes, hands, and feet be solved by going to bigger models?
For example, is qwen better at this than flux which would be itself better than sdxl?
Anonymous No.106262136 >>106262213
literally all of my gens are racist and transphobic because im white and they're brown so it's raceplay and the women are so beautiful it makes ywnbaw's kill themselves but i dont see what that has to do with the collage being deleted
Anonymous No.106262160
too much lust provoking gens. take heed baker, bake with your heart not your dong.
Anonymous No.106262162
>>106262112
it's weird, q8 distill qwen image is 21gb. I have 16gb VRAM. But it works just fine, my last gen was 28 seconds. I expected to need the virtual vram node/gguf but...no issue.
Anonymous No.106262167 >>106262456
>>106262105
> racist and transphobic image, ch*d
Anonymous No.106262179 >>106262199
q8 version of the previous racing miku prompt:

got the actual logo this time.
Anonymous No.106262199
>>106262179
Anonymous No.106262200 >>106262223 >>106262224 >>106262225
please somebody give me a (you)!
Anonymous No.106262213
>>106262044
>>106262073
>>106262136
i imagine some report OP simply as a matter of course because they do not like the fact that this general exists, mayhaps
Anonymous No.106262223
oh, retried the comic with the q8.
>>106262200
(you)
training wizza No.106262224
maybe it's a bit of a stretch but when using last_frame to do a second part of a video, is it possible for the next segment to have some memory or consistency?

>>106262200
Have one
Anonymous No.106262225 >>106262701
>>106262200
checked
Anonymous No.106262280 >>106262336 >>106262384 >>106263016
I'm tired of swapping resolution to go from landscape to portrait and vice versa.
Is there a convenient node with a button to swap them out?
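
If no existing node pack fits, a swap toggle is small enough to write by hand. A minimal sketch, assuming ComfyUI's usual custom-node conventions (`INPUT_TYPES`, `RETURN_TYPES`, `FUNCTION`, `NODE_CLASS_MAPPINGS`); the class name `SwapResolution`, the category, and the default sizes are made up for illustration and do not correspond to any published node.

```python
class SwapResolution:
    """Hypothetical ComfyUI node: passes width/height through, optionally swapped."""

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                "width": ("INT", {"default": 832, "min": 16, "max": 8192, "step": 8}),
                "height": ("INT", {"default": 1216, "min": 16, "max": 8192, "step": 8}),
                # Toggle shown in the node UI; True flips portrait <-> landscape.
                "swap": ("BOOLEAN", {"default": False}),
            }
        }

    RETURN_TYPES = ("INT", "INT")
    RETURN_NAMES = ("width", "height")
    FUNCTION = "run"
    CATEGORY = "utils"

    def run(self, width, height, swap):
        if swap:
            width, height = height, width
        return (width, height)


# ComfyUI discovers nodes through this mapping when the file sits in custom_nodes/.
NODE_CLASS_MAPPINGS = {"SwapResolution": SwapResolution}
```

The two outputs would be wired into the width and height inputs of whatever empty-latent node the workflow already uses.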
Anonymous No.106262315 >>106262389
>>106260233
>A sexy woman on the beach is running into the direction of the moving camera, smiling, (her breasts are bouncing:1.2)
Does the "1.2" even do something for wan?
Anonymous No.106262336 >>106262373
>>106262280
Anonymous No.106262367 >>106262371 >>106262409 >>106262494 >>106263594
Currently baking, shame I can't train 1024 or even 768 on my 16GB AMD card. Went with Prodigy, might be a mistake. Chroma v38.
Probably too many stills of Dafoe and Pattinson (3 of both out of 31).
Anonymous No.106262371
>>106262367
BASED
Anonymous No.106262373
>>106262336
Yeah I guess that works lol
Anonymous No.106262384 >>106262470
>>106262280
D2 nodes: D2 Size Slector (no, it's not a typo).
Anonymous No.106262386
Anonymous No.106262389 >>106262475
>>106262315
Anonymous No.106262398
Anons, I seek enlightenment. On this blessed day by the Pope, it's August 14th, 2025, please tell me the chroma meta.
Anonymous No.106262409
>>106262367
>Probably too many stills of Dafoe and Pattinson (3 of both out of 31).
Doesn't sound like too many, particularly if you identified their characters in the captions so you can omit them from your prompts when you don't want them.
Anonymous No.106262456
>>106262167
sorry bud, this is a designated no fun zone™
Anonymous No.106262470 >>106263346 >>106263446
>>106262384
Nice, thanks anon.
Anonymous No.106262475
>>106262389
Got it, so it's useless.
Anonymous No.106262480 >>106262522 >>106262529
there we go. even with a not very optimal prompt.

A magazine on a wooden table called "1GIRLS". Miku Hatsune is sitting at a computer with a chibi Miku Hatsune on the screen, and is typing on the cover of the magazine. A headline saying "how to generate your very own Miku!". At the bottom of the magazine is an ad for Nvidia GPUs, with the text "5090, only $10000 plus tip!".

qwen q8 distilled + 8 step lightx2v lora
Anonymous No.106262494 >>106262614
>>106262367
>Currently baking, shame I can't train 1024 or even 768 on my 16GB AMD card.
With offloading to ram you should be able to do batch 1 768 and likely 1024 on 16gb vram without too much slowdown, what trainer are you using ?

>Went with Prodigy
This is kind of vram hungry, but even so it should be doable with offloading.

You could go with adamw with 1e-4 lr if you want to skip Prodigy
Anonymous No.106262522
>>106262480
Yeah, Qwen is indeed a huge step closer to SAAS image generation quality on local, and you can train loras to further improve it.
Anonymous No.106262529 >>106262556 >>106262557
>>106262480

26 seconds (distill + lora, 8 steps)

apparently the model is "slow" with the regular version but the distill + lora works great. open source always wins.
Anonymous No.106262556
>>106262529
interesting thing about the mikus is it doesn't do the 01 by default. like the bars there. but that is easily fixed with "Miku has "01" in red font on her arm.".
Anonymous No.106262557 >>106262577
>>106262529
>26 seconds
On what hardware ?
Anonymous No.106262577 >>106262612
>>106262557
4080 (16gb)

the distilled qwen q8 + the lora is what makes it so fast compared to the regular model (which people said was slower than flux)

https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main

https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF/tree/main

q8 distill, 8 step bf16 lora
Anonymous No.106262581
Anonymous No.106262612
>>106262577
*also the workflow has sageattn working and fp16 optimizations working

https://huggingface.co/datasets/theaidealab/workflows/tree/main

https://huggingface.co/datasets/theaidealab/workflows/blob/main/qwen_image_distill_gguf.json

workflow there

also this to install sage/triton fast:

https://civitai.com/articles/12851/easy-installation-triton-and-sageattention
Anonymous No.106262614 >>106262632 >>106262746
>>106262494
>With offloading to ram you should be able to do batch 1 768 and likely 1024 on 16gb vram without too much slowdown, what trainer are you using ?
diffusion-pipe. I have flash-attention installed and it works fine with comfy but I don't know if d-p is using it. 512 is taking 13.3GB VRAM right now.
But it's probably a ROCM thing.
Anonymous No.106262632
>>106262614
and yes i'm offloading the unet
Anonymous No.106262672 >>106262688
any creative upscaler model
Anonymous No.106262675
>>106262126
4chan was lost to trannies and redditoids sometime after 2015

can't even say the n or speak about (((them))) without getting banned nowadays
Anonymous No.106262688 >>106262868
>>106262672
what does "creative upscaler" mean?
Anonymous No.106262701 >>106263334
>>106262225
That's a clean and coherent gen
Anonymous No.106262725 >>106262784
there, now it's a proper ad.
Anonymous No.106262746
>>106262614
Increase the block_swap parameter in your config. It will make training slower, but the increase in resolution will be the major slowdown: 1024x1024 has four times the pixels of 512x512.
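
The resolution cost above is easy to check by hand, since what matters is pixel count rather than side length. A quick sketch with a hypothetical helper name; how closely training time and VRAM follow this ratio depends on the trainer and model, so treat it as a rough lower bound rather than a prediction.

```python
def pixel_ratio(base_side, new_side):
    """How many times more pixels a square image of new_side has than one of base_side."""
    return (new_side * new_side) / (base_side * base_side)


for side in (512, 768, 1024):
    print(side, pixel_ratio(512, side))
# 512 1.0
# 768 2.25
# 1024 4.0
```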
Anonymous No.106262765
Anonymous No.106262784
>>106262725
ah right, it forgot the number on the arm.

added: On her arm is the text "01" in red text.
Anonymous No.106262808
migu painting migu in an art studio
Anonymous No.106262864 >>106262894
one more migu for now:

anime style Miku Hatsune on the cover of a 1930 Marvel comics comic book. The comic has the headline "Miku tours the city!" in stylish white text. Miku is wearing a yellow dress and standing beside a Ford car in New York City. On her arm is the text "01" in red text.
Anonymous No.106262868
>>106262688
Upscale your video and automatically interpret/add details so it looks like native resolution.
https://www.topazlabs.com/astra
Anonymous No.106262875 >>106263148 >>106264473
Noooo, don't go into the lake that turns your foot weird briefly!
ACK-
>Gotta lower the bitrate so I can fit the filesize now
Hiroshimoot... please.
Anonymous No.106262894 >>106263042
>>106262864
>one more migu for now:
Yeah, I mean these are nice, but we get that it can generate Mihu very well, how about some other artstyles, characters, to show off its capabilities
Anonymous No.106263000
vibe killed
Anonymous No.106263016 >>106263446
>>106262280
this one supports all native sdxl resolutions so you dont have to manually pick width/height.
Anonymous No.106263042 >>106263059 >>106263080
>>106262894
a book showing how to draw Miku Hatsune step by step, in black and white.

this is pretty neat imo
Anonymous No.106263059
>>106263042
>"a book explaining the meaning of life"
Anonymous No.106263070
When training a lora, if I select multiple resolutions, ie, 512/1024/etc, would that increase my vram usage?
Anonymous No.106263080 >>106263089
>>106263042
Miku Hatsune in a kitchen cooking a meal, a cooking pot is filled with green leek vegetables. Miku is wearing a chef's outfit. On her arm is the text "01" in red text.
Anonymous No.106263086 >>106263334
> A small toy truck with big tires center frame on a frozen lake in antarctica. Wide angle shot. It is snowing out and snow gently piles onto the ice in various spots.
Anonymous No.106263088
>for now
Anonymous No.106263089 >>106263101
>>106263080
even better!
Anonymous No.106263101
>>106263089
honestly pretty good
Anonymous No.106263135
why am I getting random confetti falling around when using wan 2.2?
Anonymous No.106263139 >>106263164
a blueprint on Miku Hatsune, with technical details, on a work bench.

neat
Anonymous No.106263148
>>106262875
This scene is quite beautiful
Anonymous No.106263164 >>106263182
>>106263139
see if it can do 3d mesh wireframes
Anonymous No.106263170 >>106263296
a plush doll version of Miku Hatsune sitting on a pedestal in a museum.

cute!
Anonymous No.106263182 >>106263191 >>106263192
>>106263164
a 3d mesh wireframe of Miku Hatsune.

first attempt
Anonymous No.106263191 >>106263266 >>106263578
>>106263182
generate a 3d mesh wireframe of Miku Hatsune. show only the 3d mesh wireframe.
Anonymous No.106263192 >>106263222
>>106263182
wow honestly impressed already, try for a transparent wireframe so it layers
Anonymous No.106263222 >>106263266 >>106263274 >>106263578 >>106263588
>>106263192
tried: generate a transparent 3d mesh wireframe of Miku Hatsune

this + wan rotation and you have a blender wireframe or whatever
Anonymous No.106263239
miku hatsune made out of wood:
Anonymous No.106263258
oh neat, it can do ice as well.

an ice sculpture of miku hatsune in the middle of an art exhibit
Anonymous No.106263266
>>106263222
>>106263191
Is this slopmodel incapable of NOT doing the thick outline?
Anonymous No.106263274 >>106263588
>>106263222
>checked
wow this looks great, surprised that worked lol, thanks for giving it a shot anon
Anonymous No.106263281 >>106263299 >>106263304 >>106263568
lora trainer anon here
What lora should I train next? Should I train something for Qwen-Image or Chroma? I am the same guy who trained the digital photo lora for it before
The last time I tried, I found that Qwen Lightning is not compatible with other loras, even after merging
I am not a super fan of Qwen because it's slow, hard to unslop, harder to train loras for, and from a personal point of view most Chroma gens are "good enough" for me
I have a friend who trained a qwen-image lora using nf4 and he could train it on a single 24gb card faster than I could, so I wonder if I should try again
Anonymous No.106263296 >>106263308
>>106263170
Was living under a rock for the past couple months, which model is that and tl;dr?
Anonymous No.106263299 >>106263413
>>106263281
Would probably wait for chroma loras to see how the retrain of the last two epochs goes.
Anonymous No.106263304 >>106263413 >>106263515
>>106263281
hey that photo lora was sweet, was it that vhs one? do you happen to still have the catbox for it?
Anonymous No.106263308 >>106263438
>>106263296
it's qwen but the distilled model (q8) and the same people that made the good wan lora (lightx2v) have one for that as well, so you get good outputs in 8 steps.
Anonymous No.106263334
>>106263086

>>106262701
ty, wan is insane at realism
Anonymous No.106263346
>>106262470
>preset: custom
How does that work? Can I create my own list of resolutions so I can use separate ones for sdxl/flux and others for wan?
Anonymous No.106263362 >>106263372
>horrifying monster runs at the camera fast inside of a dark yellow liminal empty building
kekd
Anonymous No.106263372 >>106263524
>>106263362
try "very fast monster runs at the camera at incredibly high speed"
Anonymous No.106263413
>>106263299
Yeah, the model is still producing bad hands and limbs quite often, but I won't completely blame it because I use the distilled model, so I don't know how common those problems are on the base model

But something tells me Lodestone won't "fix" the model and someone will have to do a human anatomy high rank lora to fix bad hands and limbs... That model will probably survive on merges, lol

>>106263304
Those were two different loras (one was trained on high end 2000s digital cameras and the other on shitty VHS stills I could find; good ones are surprisingly difficult to find with a basic google search)

I don't have the catboxes right now, but the VHS one was trained on v48 and the digital camera one was trained on v50, I might have to retrain both anyway since lodestone is retraining Chroma
Anonymous No.106263438
>>106263308
Thank you anon
Anonymous No.106263441 >>106263451 >>106263452 >>106263574
1500 steps (50 epochs). Stopped for a while because I had to test it and make sure I can actually resume training from saved state. Prodigy is uh let's say strong. This here is a "a man is sitting at a desk smoking a cigarette" or some such prompt. Next post will be "l1ghth0us3, donald trump" (i only captioned a trigger word).

I'm continuing to 3000 for some reason. Here's the 1500 abortion: https://files.catbox.moe/8zgz4t.safetensors
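
For anyone following the numbers, here is a rough sketch of the step/epoch arithmetic. It assumes steps per epoch stays constant across the run, and the function name is made up for illustration, not taken from any trainer:

```python
def epochs_at(steps_done, epochs_done, target_steps):
    """Estimate how many epochs a target step count corresponds to.

    Assumes a constant number of steps per epoch (same dataset,
    batch size and repeats for the whole run).
    """
    steps_per_epoch = steps_done / epochs_done  # 1500 / 50 = 30 for this run
    return target_steps / steps_per_epoch


if __name__ == "__main__":
    # 1500 steps took 50 epochs, so 3000 steps lands at epoch 100.
    print(epochs_at(1500, 50, 3000))
```

So continuing to 3000 steps roughly doubles the epoch count if nothing else about the run changes.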
Anonymous No.106263443 >>106263491
Anonymous No.106263446 >>106263468
>>106263016
>>106262470
is there one like this that works for wan/video?
Anonymous No.106263451
>>106263441
Anonymous No.106263452 >>106263464
>>106263441
You forgot to say which base model is that, anon
Anonymous No.106263464
>>106263452
KKKhroma v48
Anonymous No.106263468 >>106263613
>>106263446
It has width/height outputs, just connect them to a regular Empty Latent Video node
Anonymous No.106263491 >>106263505 >>106263595
>>106263443
Miku Hatsune in a game like Super Mario World by Nintendo. The text "Super Miku World" is visible. She is jumping and is holding a green leek vegetable, and is smiling. On her arm is the text "01" in red text.
Anonymous No.106263505 >>106263845
>>106263491
also, without q8 distill (or any gguf distill) and the lightx2v lora, this is supposed to be much slower than flux. but at 8 steps i'm getting 25-30s gens.

also interesting, I have 16gb vram and q8 distill qwen is 21gb. but it's not having any issue loading or generating despite that. no need for multigpu + virtual vram stuff.

it just works, which is nice.
Anonymous No.106263515 >>106263628
>>106263304
nta, but i gotchu https://files.catbox.moe/xuzs67.safetensors
Anonymous No.106263524
>>106263372
>tfw a business man running towards me is what the chiners deem as a "monster"
Anonymous No.106263568 >>106263686
>>106263281
Dario Argento lora for Chroma. Or 80's scifi like Terminator and Robocop
Anonymous No.106263574 >>106263594 >>106263623
>>106263441
What style is that and what's the trigger?
Anonymous No.106263578 >>106263599
>>106263191
>>106263222
>this + wan rotation and you have a blender wireframe or whatever
That's honestly interesting. If you could get a lora that makes decently lowpoly characters with that it could be a handy tool for modeling. Something like
>artist draws (or sloppa generates) A/T-pose character
>img2img it with a wireframe
>i2v turnaround to create a modeling reference
Anonymous No.106263588
>>106263222
>>106263274
It's both amazing and dystopian as fuck how The Running Man predicted deepfakes and the propaganda they bring.
Anonymous No.106263594 >>106263623
>>106263574
Trying to make a lora out of "The Lighthouse" >>106262367
trigger is l1ghth0us3
Anonymous No.106263595
>>106263491
Miku Hatsune on the cover of a newspaper titled "LDG NEWS". Miku is looking at a computer. The headline reads "popular vocaloid appears in stable diffusion image!". A headline below that reads "1girl generation up 10000%!" with an image of an nvidia GPU beside the text.
Anonymous No.106263599
>>106263578
the topology isn't perfect which is the downside
Anonymous No.106263613
>>106263468
oh I'm dumb I didn't realize you can plug them in since they didn't have the dot visible before hovering over there
Anonymous No.106263623
>>106263594
>>106263574
Also that photo of a vaguely Pattinson looking guy was without the trigger and <0.90 strength. I might have to redo this with adamw.
Anonymous No.106263628 >>106264008
>>106263515
based thanks anon
Anonymous No.106263642
can i have a fatty migu please
Anonymous No.106263649 >>106263684
comfygods, how do I make more advanced workflows?
I've made a workflow that takes the image from a more capable NSFW model and dumps it into WAN

However, I want the image generation and video generation to occur as separate blocks.
Basically, I should be able to trigger a batch of 20 images until I find one that I like, then I should be able to select an image and change the video tokens and run video generation until I get a video generation that I like
Anonymous No.106263658 >>106263691
>trigger word
>for a single style lora
Anonymous No.106263684
>>106263649
>comfygods
more like comfy cucks lmao
Anonymous No.106263686 >>106263706 >>106263726 >>106263805 >>106263848 >>106263871 >>106264049
>>106263568
I trained an 80s Dark Fantasy one, but I haven't released it

It's interesting
Anonymous No.106263691 >>106263758
>>106263658
Look, I don't know how to train these things. I'm just semi-following these guides on Civitai written by amateurs who don't know either.
Anonymous No.106263695
Anonymous No.106263706
>>106263686
momma
Anonymous No.106263726 >>106263812
>>106263686
Goddamn. Gimmie more.
Anonymous No.106263736 >>106263832
Anonymous No.106263758 >>106263988
>>106263691
i understand. just omit it for single style loras. the user shouldnt have to both add the lora and then prompt an activation word for it to function in my humble opinion. doubling the work for nothing.
Anonymous No.106263794
Anonymous No.106263805
>>106263686
Is there a site with celebrities loras for chroma?
Anonymous No.106263812 >>106263848 >>106263851 >>106265420
>>106263726
Sure
Anonymous No.106263814
Anonymous No.106263832
>>106263736
>reddit screenshot
Anonymous No.106263845
>>106263505
I'm running the full flux1-dev model on a 4070. My RAM usage is 45-50GB though. lol
Anonymous No.106263846
>>106260069
Kys
Anonymous No.106263848
>>106263686
>>106263812
AI rocks
Anonymous No.106263850
Since we are posting bosoms and since this has no nipples I guess it's okay. From failed lora experiments.
Anonymous No.106263851
>>106263812
Anonymous No.106263871 >>106263910
>>106263686
Anonymous No.106263894 >>106263917
>>106260069
just letting you know no one will ever use this shitty wrapper
Anonymous No.106263899
>>106262011
That's how you know it was a good collage.
Anonymous No.106263910
>>106263871
kino
someday we will do our own films using old cinematography aesthetics with practical-effects-like stuff instead of modernslop with grey filter and shaky cameras
as I was building the dataset for this lora, I noticed how "painterly" some shots were in these older movies
Anonymous No.106263917 >>106263960
>>106263894
it looks kinda cool and if I can make games with it then I'll probably give it a shot
Anonymous No.106263960
>>106263917
I'm actually wondering if it can just be an extended editor for renpy so I can make vns as lazily as possible
Anonymous No.106263984 >>106264000
Hi everyone, im looking for a Penis and Cum detection model for facedetailer or similar so i can faceswap but mask out the penis and cum.
would be very grateful for some information, the clearnet seems wiped "clear" lmao.

If you look at the most recent deepfakes theyre amazing, penis actually goes into mouth, cum is detected reliably and masked out of the swap. how is this done? I have managed to build my own reactor nsfw workflow but the penis detection is hit or miss and you can forget cumshots.
Anonymous No.106263988
>>106263758
Okay, I get that. When I download loras I even name them with their trigger_words to keep track of that.
But what's the correct procedure for style loras? No captions at all? (That seems the best option to me.) For Chroma but I guess in general too.
Anonymous No.106263992
jugs theme huh
Anonymous No.106264000 >>106264031
>>106263984
on some huggingface somewhere. can't remember the account. training a detector model is pretty easy though but you need a bunch of images with penis and cum
Anonymous No.106264006 >>106264029
pool filled with jello, technically true?
Anonymous No.106264008
>>106263628
and here's the 2000s photography lora https://files.catbox.moe/hn8034.safetensors
Anonymous No.106264029
>>106264006
uhh, gelatinous red water?
Anonymous No.106264030 >>106264048
>people talking about loras
>me following the discussion
>nothing is shared
>waste of time...
Anonymous No.106264031
>>106264000
of course, my problem is mainly that searching for it seems impossible on any search engine ive tried. it seems they censor that on purpose
Anonymous No.106264032
Local Jugs General
Anonymous No.106264043
mud worked first try for qwen

A sexy brunette woman wearing a bikini with very large breasts is in a pool filled with mud at the beach. She is rolling around the mud on her hands and knees, and smiles.

YES IT IS MUD it isnt india.
Anonymous No.106264048
>>106264030
>ctrl+f safetensors
>3 matches
Anonymous No.106264049 >>106264128 >>106264144
>>106263686
Is it a lewd one or did you simply use it to gen big booba witch?
Anonymous No.106264077 >>106264116
A 200 foot tall sexy brunette woman with big breasts and a slim waist is standing at the beach. She is leaning over and looking down at a tiny man who is pointing up.

actually worked. tried gigantic and got an architect size woman.
Anonymous No.106264093
You are not entitled to my LoRAs.
Anonymous No.106264106 >>106264116 >>106264119
attack on 1girls:
Anonymous No.106264116
>>106264077
>>106264106
well its more like the guy is tiny
Anonymous No.106264119 >>106264171
>>106264106
better:
Anonymous No.106264128
>>106264049
I just prompted, there are no "big boobs" in the data
Anonymous No.106264144
>>106264049
NTA, but for Chroma "has large breasts" means absurd hanging milkers with 1mm separation from implied areolas
Anonymous No.106264171
>>106264119
one more
Anonymous No.106264185
on Chroma v48, the prompt "large sagging breasts" had an amazing effect. It produced huge naturals, without prompting for "natural breasts", which has a weaker effect and still looks like a boobjob
On v49/v50 it produces literal granny-like tits if you prompt for that, so it is a downgrade in that regard
Anonymous No.106264244 >>106264511
The D in ldg stands for DD
Anonymous No.106264249
Anonymous No.106264269 >>106264452
Aaaaand he's gone.
Anonymous No.106264273 >>106264354 >>106264638
Anonymous No.106264318
Anonymous No.106264321
i fucked up my catbox album upload script so here have a collection of upskirts from a couple days ago
https://catbox.moe/c/jn2w2z
Anonymous No.106264354 >>106264437
>>106264273
That topology is ass.
Anonymous No.106264399 >>106264638
lol hat just despawns
Anonymous No.106264423 >>106264450 >>106264622
>python 3.13 is supported but using 3.12 is recommended because some custom nodes and their dependencies might not support it yet.
Is anyone rocking Comfy on 3.13 with lots of extensions? Is this warning overly cautious or should I heed it?
I have been using it on 3.12 for a while.
I guess worst case scenario I can just delete venv and reinstall 3.12, but maybe someone will save me time.
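
A quick way to see which side of that warning a given venv is on is to run something like the sketch below with that venv's interpreter. The (3, 12) target comes from the warning quoted above; the function name and return labels are just illustrative:

```python
import sys

RECOMMENDED = (3, 12)  # taken from the quoted warning, not from any official API


def python_status(version=None, recommended=RECOMMENDED):
    """Classify a (major, minor) Python version relative to the recommended one."""
    if version is None:
        version = sys.version_info[:2]  # the interpreter running this script
    version = tuple(version[:2])
    if version == recommended:
        return "recommended"
    if version > recommended:
        return "newer"  # custom nodes and their dependencies may not support it yet
    return "older"


if __name__ == "__main__":
    print(sys.version.split()[0], python_status())
```

It only reports the interpreter version. It says nothing about whether a particular custom node actually installs on it.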
Anonymous No.106264437 >>106264502
>>106264354
you're still losing your to it
Anonymous No.106264450 >>106264476
>>106264423
I'm on 3.13 with all custom nodes I'd need, except nunchaku but apparently they're working on it.
Funnily enough I tried to add another 3.12 venv today and that totally fucked up a lot of nodes, so I'll just wait.
No idea how the 3.12->3.13 migration would go, though.
Anonymous No.106264452 >>106264472
>>106264269
I have been listening to Melissa every once in a while for many years now.
No other song stuck with me as much, but great band.
Anonymous No.106264472
>>106264452
That's Come to the Sabbath for me.
Anonymous No.106264473
>>106262875
that's actually crazy how it got the cloth floating on the water
Anonymous No.106264476
>>106264450
Oh I am doing a separate install on the same system. Not exactly migrating.
I am trying to switch from a normal Comfy install to docker based one for security.
Anonymous No.106264488 >>106264500
I have been out of the AI loop for 2 or so years. is something like Midjourney still levels above local models? I've seen some extremely impressive videos generated from there and I'm looking to see if it's possible to get to that level locally
Anonymous No.106264500
>>106264488
Man, shut up. Use your eyes.
Anonymous No.106264502
>>106264437
>prompt: youre still losing your to it
Anonymous No.106264511
>>106264244
you have a catbox or something? I need more
Anonymous No.106264527 >>106264577
>generate a batch of 40 images
>accidentally shove it into wan
whoops lol
Reminds me of accidentally deleting keyframes
Anonymous No.106264577
>>106264527
neat
Anonymous No.106264622
>>106264423
I had to use pyenv to downgrade and even then it only barely works some of the time. I fucking hate juggling python environments and whoever is responsible for this task fit only for beasts of burden should be drug out onto the street and shot.
Anonymous No.106264638
>>106264273
>>106264399
cool idea desu
Anonymous No.106264658 >>106264676
>>106260069
>waiNSFW
Anonymous No.106264667
Anonymous No.106264668
Anonymous No.106264676
>>106264658
haha yeah... what a bad model... I totally don't use it ;_;
Anonymous No.106264697 >>106264792
Is there a tutorial or rentry for Chroma? I'm new to this, but I've been reading your threads and it seems like Chroma has a ton of feedback here. Can someone share a rentry or a tutorial?
Anonymous No.106264707
>>106264704
>>106264704
>>106264704
>>106264704
Anonymous No.106264710
>>106260069
Ani my beloved??? Please save me from Comfy!
Anonymous No.106264792
>>106264697
I believe chroma has some example workflow in their huggingface, you can start with that.
A little extra effort is necessary to get better output with it.
Anonymous No.106265220
>>106261878
i can do both
Anonymous No.106265420
>>106263812
proompt plox