← Home ← Back to /g/

Thread 107135438

314 posts 228 images /g/
Anonymous No.107135438 [Report] >>107135579 >>107136119 >>107140193
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107123435

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.107135458 [Report]
>ran took everything from me
Anonymous No.107135470 [Report] >>107135499 >>107135630 >>107135640 >>107136995
Is 9800X3D + 5070Ti good for local AI? Or should I just go for 5090?
Anonymous No.107135474 [Report] >>107140172 >>107140305
Anonymous No.107135485 [Report]
>Comfy must be dragged into the streets and shot
Anonymous No.107135498 [Report] >>107135511
Blessed thread of frenship
Anonymous No.107135499 [Report]
>>107135470
AMD is the best card for AI applications.
Anonymous No.107135511 [Report]
>>107135498
thread schizo with Stockholm syndrome
Anonymous No.107135515 [Report] >>107135818
Post moar Jebby Nicholsman.
Anonymous No.107135520 [Report] >>107135531
what makes LDG the best diffusion thread on /g/ and 4chan?
Anonymous No.107135531 [Report] >>107135604
>>107135520
no schizos except that one guy
Anonymous No.107135550 [Report]
Anonymous No.107135563 [Report] >>107135572 >>107135703
What's the best UI for advanced inpainting?
Anonymous No.107135572 [Report] >>107135707
>>107135563
anistudio
Anonymous No.107135579 [Report] >>107135606 >>107135775
>>107135438 (OP)
holy fuck, look at all those 1girl prompts! immaculate creativity from fat people
Anonymous No.107135604 [Report]
>>107135531
you?
Anonymous No.107135606 [Report]
>>107135579
nogen
Anonymous No.107135630 [Report]
>>107135470
If you can afford it yes.
Anonymous No.107135640 [Report] >>107135724
>>107135470
you provided no information on what you're planning on doing, you stupid faggot. im inclined to say yes because im assuming you're just going to prompt anime waifus with forge like a pleb.
Anonymous No.107135703 [Report]
>>107135563
trained on Krea, the results are better. it's just got a really low learning rate already (3e-5), and i possibly have to go lower
Anonymous No.107135707 [Report]
>>107135572
I only use 2.2 with lightning and at low cfg it doesn't listen that well, so I try to keep it a little vague. Most important thing is to prompt anything you think might get hidden, like if eyes close and you don't say "blue eyes", you might get green eyes when they open, stuff like that.
Anonymous No.107135713 [Report] >>107135732 >>107135853
oh brother
Anonymous No.107135723 [Report]
Anonymous No.107135724 [Report] >>107137099
>>107135640
wan2gp gets the job done but deepbeepmeep is too fucking slow with the updates and is obsessed with vace and multitalk shit. He needs to add more samplers, upscalers and schedule types and stop with the vace and wan animate crap. image generation settings are too barebones.
Anonymous No.107135732 [Report]
>>107135713
Post your hand
Anonymous No.107135775 [Report] >>107135781
>>107135579
Female form upsets the tranny, reminder of what he will never be.
Anonymous No.107135781 [Report]
>>107135775
no need, we already know you're nonwhite
Anonymous No.107135818 [Report] >>107135829 >>107136000
>>107135515
Use it wisely!
Anonymous No.107135829 [Report]
>>107135818
I noticed no change at all, I'm using the same model as before.
Anonymous No.107135853 [Report]
>>107135713
I wonder what upset him this time and caused him to spin up the "grab posts from old threads" script
Anonymous No.107135869 [Report] >>107135943
the thing would not converge, god damn it
Anonymous No.107135943 [Report] >>107135956
>>107135869
elaborate
Anonymous No.107135956 [Report] >>107135981
>>107135943
huge dataset to help with anatomy, didn't let train long enough
Anonymous No.107135981 [Report] >>107136040
>>107135956
so just continue training, whats the issue
Anonymous No.107136000 [Report] >>107137763
>>107135818
Oh Jebby... I'll be in my bunk.
Anonymous No.107136023 [Report]
Anonymous No.107136040 [Report] >>107136114
>>107135981
well i would much rather express myself with this once in a lifetime art than watch terminal window
Anonymous No.107136114 [Report]
>>107136040
train overnight, use old gpu to gen with the latest checkpoint if you have it
great shape of the big soft tits btw, do post the lora when done
Anonymous No.107136119 [Report] >>107136149 >>107136150 >>107136159
>>107135438 (OP)
Where do you guys stay up to date with upcoming (local) models and technology, and research papers and what not
Anonymous No.107136149 [Report]
>>107136119
Given this place is done, https://www.reddit.com/r/StableDiffusion
Anonymous No.107136150 [Report]
>>107136119
Ancestral blood memory, everything is known before it's released.
Anonymous No.107136159 [Report]
>>107136119
Right here, of course
Anonymous No.107136182 [Report]
basemodel photo 1girl prompting:
>prompt box feels like a "mushy" and unresponsive input
>initial gens are frustratingly meh
>the active experience of prompting and receiving your gens live is boring
>sorting through 500 gens one of them will actually touch your heart

booru model anime 1girl prompting:
>prompt box feels ultra-responsive and powerful
>initial gens are high-quality and exciting
>the active experience of prompting is fun and engaging like a video game
>sorting through 500 gens is a laborious fruitless chore that yields frustratingly little
Anonymous No.107136250 [Report] >>107136256 >>107136270 >>107136572
recommend me your favorite illustrious/noob model and post a gen with it if possible
Anonymous No.107136256 [Report] >>107136264
>>107136250
base
Anonymous No.107136264 [Report]
>>107136256
d
Anonymous No.107136270 [Report]
>>107136250
Anonymous No.107136298 [Report]
>cancermerge
>abysmally short prompt
We don't do that here
Anonymous No.107136476 [Report] >>107136504 >>107136572
will i get better lora results if i train it on the specific checkpoint i use? or would any pony checkpoint work with any pony model
Anonymous No.107136504 [Report] >>107137252
>>107136476
>will i get better lora results if i train it on the specific checkpoint i use
obv
Anonymous No.107136572 [Report] >>107137252
>>107136250
Noobai Rectified Flow Test 486k
I mean it's not radically different from base Noob and less stable due to being undertrained for what's it is supposed to be, but I like it.
>>107136476
>will i get better lora results if i train it on the specific checkpoint i use?
Yes. Less compatibility for others is the only drawback if you decide to share it.
>or would any pony checkpoint work with any pony model
Some shitmixes don't like certain loras.
Anonymous No.107136666 [Report] >>107136828 >>107136853
Anonymous No.107136828 [Report] >>107137234
>>107136666
nice quads. also, catbox plz.
Anonymous No.107136853 [Report]
>>107136666
SEX
Anonymous No.107136948 [Report]
Man, there's so many diffusion generals on /trash/ lol, never noticed that until today. They even have their own literal /sdg/ for some reason
Anonymous No.107136995 [Report]
>>107135470
GPU costs especially are driven very much by demand for AI at the moment. 5090 is better.
Anonymous No.107137033 [Report] >>107137061 >>107137613 >>107141712 >>107141748 >>107141887 >>107142004 >>107142017 >>107142021 >>107142035
>For those who've been following Pony model development closely, it's no surprise that I don't like LoRAs, nor am I a big fan of ControlNets. Such tech, while useful, has always felt like a hack to me, so I've been very happy to see the rise of editing models. Want to use pose control? Just provide an image of the pose. Looking for a particular style? Why not use a few sample images to instruct the model how to draw things?

>We've planned an editing model for a long time and originally called it PomniGen, as we expected to use OmniGen (and I like this name too much to drop it), so we'll keep it. It's actually a QWEN/QWEN Editing alternative. We're cleaning up our own extensive Pony-flavored editing dataset and are excited to see how well it performs on various character-focused tasks.

>I also promise we'll be sharing ongoing checkpoints instead of waiting for a fully trained model this time!

Odds of this:
a) Not being complete dogshit
b) Not have some cucked censorship built in (As in to prevent "nudify" use or whatever)
after V7?
Anonymous No.107137061 [Report] >>107137090
>>107137033
Unironically zero reason to be interested in this at all.
Anonymous No.107137087 [Report]
Anonymous No.107137090 [Report]
>>107137061
Well I am interested in a Qwen Image Edit that knows NSFW out of the box? Not saying he will pull it off of course.
Anonymous No.107137099 [Report] >>107137163
>>107135724
why is the bot still active? man these threads are soo dead.
Anonymous No.107137163 [Report] >>107137308
>>107137099
>man these threads are soo dead.
its funny how much slower it feels when ldg is the fourth most active /g/ thread and not the first but also pcbg is the most with only one post every two minutes so the board itself is slow right now
Anonymous No.107137170 [Report]
Anonymous No.107137234 [Report] >>107137419
>>107136828
too lazy to upload lora
Anonymous No.107137252 [Report] >>107137378 >>107137461
>>107136504
>>107136572
are there any good guides to lora training? what happens if i do not caption the images and only add a caption for "my_prompt" or something? i wanted to make a realistic version of something from a cartoon, so i took my cartoon images and trained it on a realistic checkpoint but with the lora the checkpoint just makes cartoon images
Anonymous No.107137279 [Report]
Anonymous No.107137308 [Report] >>107137404
>>107137163
>ldg is the fourth most active /g/ thread
so long as you keep the bot running
Anonymous No.107137338 [Report]
Anonymous No.107137378 [Report] >>107137410
>>107137252
>are there any good guides to lora training?
Valstrix's civit guide is as good as it gets. Most guides are useless slop.
> what happens if i do not caption
Well it's possible to train loras without captions but it's not ideal on most cases.
>only add a caption for "my_prompt"
You risk AI learning irrelevant noise in the dataset. Captioning is:
trigger word + broad description of wtf AI is supposed to be looking at in the image + details you do not want AI to learn
>wanted to make a realistic version of something from a cartoon, so i took my cartoon images and trained it on a realistic checkpoint but with the lora the checkpoint just makes cartoon images
Your best bet is curating a dataset of that character/thing drawn in wide variety of styles and hope that AI learns to separate style from substance.
A realism based model might be better for this task.
Anonymous No.107137404 [Report]
>>107137308
? there are maybe 6 of them from two hours ago
are you saying the anon botting wants to make it look like ldg is active and not just disrupt anon posting? kek ldg was very active before he started anyway
Anonymous No.107137410 [Report]
>>107137378
>Valstrix's civit guide
thanks ill read this
Anonymous No.107137419 [Report] >>107137457
>>107137234
just drag and drop into gofile.io no account needed
Anonymous No.107137423 [Report]
Anonymous No.107137457 [Report] >>107137491
>>107137419
https://gofile.io/d/SdFhQh
Anonymous No.107137461 [Report]
>>107137252
You should really switch to illustrious or noob instead of pony for XL models anyway desu
Anonymous No.107137491 [Report]
>>107137457
Basado
Anonymous No.107137541 [Report] >>107137584
Anonymous No.107137584 [Report] >>107137787
>>107137541 why?
You seem obsessed.
Anonymous No.107137613 [Report]
>>107137033
ipadapter, controlnets and loras > gay edit models. this is only an excuse to bloatmaxx to a point nobody is able to run it conveniently
Anonymous No.107137633 [Report]
Do anyone here have perfected the art form of generating high-fidelity synthetic data from shitty source pics/frames to fill dataset gaps for a peak quality person lora?
What are your main techniques and models used?

I feel like if I master upscaling/denoising I can manage some professional tier lora, just couple it with some inpainting and qwen edit fuckery. But doing the first part, that is just turning shitty pics into something highres and detailed without straying far from the source material seems like a challenge already.
Anonymous No.107137763 [Report]
>>107136000
Anonymous No.107137787 [Report]
>>107137584
>You seem obsessed.
I've never posted an image mentioning BBC in this thread ever before ever so not really
Anonymous No.107137800 [Report]
not as obsessed as the ani stalker schizo
Anonymous No.107137811 [Report] >>107140172
Anonymous No.107137877 [Report]
Anonymous No.107137894 [Report] >>107137911 >>107137925 >>107137945 >>107138034 >>107138096 >>107138524 >>107140271
Anonymous No.107137911 [Report]
>>107137894
boaring
Anonymous No.107137925 [Report] >>107138002
>>107137894
lmao what the fuck is rfh making comfyui edits. HAHAHAHA
Anonymous No.107137945 [Report]
>>107137894
this is just a normal meme you changed the filename of bleh!
Anonymous No.107137966 [Report]
Anonymous No.107138002 [Report] >>107138014
>>107137925
comfyui is basically stolen valor webslop anyways
Anonymous No.107138003 [Report]
Anonymous No.107138014 [Report] >>107138027
>>107138002
What do you mean?
Anonymous No.107138027 [Report] >>107138782
>>107138014
it's just slightly changed diffusers code and it takes credit for a lot of other people's achievements when all it is is a shitty node framework made in shitty python.
Anonymous No.107138034 [Report]
>>107137894
the zoomer stare
Anonymous No.107138083 [Report] >>107138086
Anonymous No.107138086 [Report] >>107138106
>>107138083
Very cool
Anonymous No.107138096 [Report]
>>107137894
Give her tits
Anonymous No.107138106 [Report]
>>107138086
thx. going for that surreal scfi feel.
Anonymous No.107138146 [Report]
Anonymous No.107138154 [Report]
Anonymous No.107138208 [Report]
Anonymous No.107138276 [Report]
Anonymous No.107138288 [Report]
Anonymous No.107138392 [Report]
Anonymous No.107138419 [Report]
Anonymous No.107138451 [Report]
Anonymous No.107138467 [Report]
Anonymous No.107138489 [Report] >>107138529
Anonymous No.107138524 [Report]
>>107137894
Based
Anonymous No.107138525 [Report] >>107138783
Anonymous No.107138529 [Report]
>>107138489
heh, nice ones
Anonymous No.107138552 [Report]
gm /sdg/
Anonymous No.107138574 [Report]
Anonymous No.107138607 [Report]
Anonymous No.107138628 [Report]
Anonymous No.107138647 [Report]
Anonymous No.107138669 [Report]
Anonymous No.107138713 [Report]
Anonymous No.107138748 [Report]
Anonymous No.107138782 [Report]
>>107138027
I feel ya, but isn't that the nature of open source? Shit gets swiped and re-cobbled together in forks?

Also finally got wan 2.2 working well-ish locally. These t2v outputs are freaky. Had to convert to webm and lose a bit of quality due to size.
Anonymous No.107138783 [Report] >>107138809 >>107138860
>>107138525
it reminds me a bit of the retro anime style stuff people used to do with dalle3
Anonymous No.107138809 [Report] >>107138851
>>107138783
>retro anime style
you mean best style
Anonymous No.107138848 [Report] >>107138920
Anonymous No.107138851 [Report] >>107138870
>>107138809
and this one is just slop
Anonymous No.107138860 [Report] >>107138872
>>107138783
I was playing around with flux dev again, there is a really fun retro anime lora
Anonymous No.107138863 [Report] >>107138920
Anonymous No.107138870 [Report]
>>107138851
thanks, I try
Anonymous No.107138872 [Report] >>107138909
>>107138860
If it's a lora then more than likely it had dalle3 stuff in it
Anonymous No.107138881 [Report] >>107138898
wahoo bing bing
Anonymous No.107138886 [Report] >>107138917 >>107138920 >>107139400 >>107141625
Anonymous No.107138898 [Report]
>>107138881
Hah, that's fun. I have so many pleasant memories playing that shit on the N64.
Anonymous No.107138908 [Report] >>107138920
Anonymous No.107138909 [Report] >>107139019
>>107138872
I want to say it is more MJ than dalle, fun lora either way.
Anonymous No.107138917 [Report]
>>107138886
quality content
Anonymous No.107138920 [Report]
>>107138848
>>107138863
>>107138886
>>107138908
now do a dark white queen smoking a Newport
Anonymous No.107138930 [Report]
Anonymous No.107138992 [Report]
Anonymous No.107139019 [Report] >>107139058
>>107138909
from that pic it looks like dalle3 because of the high color contrast and the use of wide angle (dalle loves wide angle compositions)
Anonymous No.107139058 [Report]
>>107139019
>high color contrast
meant saturated colors
Anonymous No.107139299 [Report] >>107139373 >>107140207
I think I like making loras more than using them
sorta similar thing with putting cfw on consoles, I do that then never play them. What does it mean?
Anonymous No.107139373 [Report]
>>107139299
sounds like those people who enjoy shopping for things more than they enjoy the things. in that case what your dopamine circuits are after is the novelty.
that or aut*sm. or both
Anonymous No.107139394 [Report] >>107139442
wtf, this thread is so slow.
did local chads figure out how to gen IRL?
Anonymous No.107139400 [Report]
>>107138886
nice
Anonymous No.107139442 [Report] >>107139483
>>107139394
4chan posting alone on a Friday night? gosh your pathetic
Anonymous No.107139483 [Report] >>107141497
>>107139442
Your patheticism is my passion
Anonymous No.107139528 [Report] >>107139966
Anonymous No.107139613 [Report] >>107139764 >>107139944
As an offloading device, does cuda/tflops matter?
Anonymous No.107139764 [Report]
>>107139613
Yes
Anonymous No.107139886 [Report]
Anonymous No.107139944 [Report]
>>107139613
If you're considering Intel for something other than LLMs, don't.
Anonymous No.107139966 [Report] >>107139972
>>107139528
I like this one.
Anonymous No.107139972 [Report]
>>107139966
i dont
Anonymous No.107140048 [Report] >>107141899
Anonymous No.107140114 [Report]
>finally got a good gen yesterday before heading to bed
>wake up and see that seedvr2 released

Nice.
Anonymous No.107140168 [Report] >>107141899
Anonymous No.107140172 [Report]
>>107135474
gud i liek, free palestine

>>107137811
also gud, paints
Anonymous No.107140193 [Report]
>>107135438 (OP)
I'm liking Chroma and my Chroma LoRA so far
Anonymous No.107140207 [Report]
>>107139299
You sound like me. I've spent the past few months focusing on LLM training but now I think I'm gonna focus on my original passion that got me into AI on the first place


https://civitai.com/user/AI_Art_Factory
Anonymous No.107140244 [Report]
has there been any attempts in making ultimate realistic amateur cosplay model by merging illustrious and bigasp together?
Anonymous No.107140271 [Report]
>>107137894
is this real
Anonymous No.107140301 [Report] >>107140318 >>107140378
Anonymous No.107140305 [Report] >>107141434
>>107135474
is this even AI? looks too good to be fake
Anonymous No.107140318 [Report] >>107140322 >>107140330
>>107140301
this is stupid
Anonymous No.107140322 [Report]
>>107140318
and ure gay
Anonymous No.107140323 [Report] >>107140378
Anonymous No.107140330 [Report]
>>107140318
it's dora the explora
faggot
Anonymous No.107140339 [Report]
Dora the dumptruck
Anonymous No.107140377 [Report] >>107142197
Anonymous No.107140378 [Report]
>>107140323
>>107140301
man the qwen sameface. still better than buttchins
Anonymous No.107140420 [Report]
Anonymous No.107140441 [Report]
Anonymous No.107140448 [Report]
Anonymous No.107140833 [Report] >>107141911
What setup of nodes do I need to fetch the frame count of a video in comfy? Can it then also be calculated to show the amount of frames needed for a set amount of batches?
So if a video has 150frames, it automatically splits it into the number of batches you want, so 3 for example, it then calculates 50frames for each batch.
Anonymous No.107140888 [Report] >>107141258 >>107141445
was it civit that banned a certain underwear because it apparently makes people think about bodily fluids? or was that a fever dream
Anonymous No.107140983 [Report]
Damn, seedrv2 really doesn't like anime huh. Getting massive stylechange, like it's adding an emboss filter.
Anonymous No.107141056 [Report] >>107141486
>256p tilesize gives me 24% vram usage
>double the size and oom
Anonymous No.107141061 [Report]
quadratic'd
Anonymous No.107141258 [Report]
>>107140888
Could be true
Anonymous No.107141434 [Report]
>>107140305
ofc its AI heh
Anonymous No.107141445 [Report]
>>107140888
kek
Anonymous No.107141486 [Report]
>>107141056
>how do pixels work
thanks for outing yourself as a retard
Anonymous No.107141497 [Report] >>107141506 >>107141521
Input: >>107139483
Output picrel
https://github.com/CSU-JPG/VCode
https://huggingface.co/spaces/CSU-JPG/VCode
Anonymous No.107141506 [Report]
>>107141497
/sdg/ is that way
Anonymous No.107141521 [Report] >>107141625
>>107141497
what the fuck is this garbage? literally using LLMs lmao, you dont need a fucking project to achieve this.
fucking makjing PAPERS out of this stupid fucking garbage
Anonymous No.107141625 [Report]
Input: >>107138886
Output picrel

>>107141521
Fun little SVGs
Anonymous No.107141712 [Report]
>>107137033
Give me a few thousand bucks and I'll fix it. I've got enough datasets for everything.
Anonymous No.107141748 [Report]
>>107137033
Is this the guy who took out artist tags from Pony?
Useless douchebag.
Anonymous No.107141887 [Report]
>>107137033
I think he got lucky with the sdxl pony model. I find it funny that he hates Loras even tho that's the only thing that made Pony as popular is it's now. I don't think being able to do style transfers with few images can replace a well trained lora for style/aesthetic.
Anonymous No.107141899 [Report]
>>107140168
>>107140048
doom's hellscape
balmora
city17
de_dust2
Anonymous No.107141911 [Report]
>>107140833
What video?
Anonymous No.107141971 [Report] >>107142591
https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-Seko-V2.0
seems like they improved on the t2v lightning lora again
Anonymous No.107142004 [Report]
>>107137033
What a dumbass. Controlnets and loras are for people who actually want to use this for something practical. Not everyone uses it as just a dopamine slot machine.
Anonymous No.107142017 [Report]
>>107137033
great message, wrong messenger, he won't make that revolutionary edit model, he's not up to the task
Anonymous No.107142021 [Report]
>>107137033
>I don't like LoRAs
says the guy removing the artist tags on his base models so that people are forced to make artist loras to compensate btw
Anonymous No.107142035 [Report] >>107142341 >>107142343
>>107137033
>as we expected to use OmniGen (and I like this name too much to drop it), so we'll keep it. It's actually a QWEN/QWEN Editing alternative.
is he retarded? why not finetuning Qwen Image Edit instead? it's the best edit model and has the apache 2.0 licence
Anonymous No.107142197 [Report]
>>107140377
wtf is this, light 1030?
Anonymous No.107142341 [Report]
>>107142035
> Qwen Image Edit
cursed model or weights
Anonymous No.107142343 [Report]
>>107142035
>is he retarded?
well he made pony v7, and he's a ponyfag
Anonymous No.107142380 [Report] >>107142516
https://files.catbox.moe/8z9vdv.png
Anonymous No.107142396 [Report]
https://files.catbox.moe/grw9xb.png
Anonymous No.107142405 [Report]
https://files.catbox.moe/egb2ik.png
Anonymous No.107142417 [Report] >>107142619
https://files.catbox.moe/f2o9m8.png
Anonymous No.107142427 [Report] >>107142435 >>107142452
>now the bot uploads gens with catbox
How does it even do that?
Anonymous No.107142435 [Report] >>107142477
https://files.catbox.moe/7fkvtn.png

>>107142427
I'm not a bot, I'm spamming for the love of the game
Anonymous No.107142452 [Report] >>107142477
https://files.catbox.moe/frpya4.png

>>107142427
last one for now; this one's for you, because your epic ;)
Anonymous No.107142477 [Report] >>107142484
>>107142435
A manual spammer?
>>107142452
T-thanks *blushes*
Anonymous No.107142484 [Report]
>>107142477
love me ai gens, simple as
Anonymous No.107142516 [Report] >>107142611 >>107142614 >>107142711
>>107142380
Nice gens. Thought they were qwen + realism lora before catbox.
Is the spark finetune of Chroma much different in terms of quality, or would you attribute it mostly to your extensive post-processing?
Anonymous No.107142591 [Report]
>>107141971
>https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-Seko-V2.0
>seems like they improved on the t2v lightning lora again
I have an excuse to generate and share voluptuous brown women again let's go
Anonymous No.107142611 [Report] >>107142761
>>107142516
Spark avoids the "AI Slop" look that I abhor (plastic skin, etc). Love that checkpoint, seems that other workflows also get similar results

https://files.catbox.moe/5hcove.png
Anonymous No.107142614 [Report] >>107142674
>>107142516
oh wait
I spent so much time genning anime coom that I am completely out of the loop when it comes to new shit
are models actually good at generating thots now? what did you use for this? can it do nudity?
Anonymous No.107142619 [Report] >>107142654
>>107142417
This is great. I like how there's consistency between those spacecrafts. Are they from some tv show?
Anonymous No.107142654 [Report] >>107142688
>>107142619
nope. WAN just "got" that it is supposed to be the same model of spaceships

https://files.catbox.moe/lhf9on.png
Anonymous No.107142674 [Report] >>107142679 >>107142786
>>107142614
I will only say that this is a lora trained on chroma-hd with diffusion-pipe and inferenced on chroma-hd-flash 18 steps unipc/simple
Anonymous No.107142679 [Report]
>>107142674
pic
Anonymous No.107142688 [Report] >>107142711
>>107142654
>WAN just "got" that it is supposed to be the same model of spaceships
AI is best when it's generalizing. The best uses of AI art is for combining concepts (the more juxtaposed, the more kino)

Oh and
>>107132402
>RTX 50 SUPER SERIES CANCELLED - THERE'S NO 3GB VRAM FOR IT
"Wait for the 5070ti super" fags btfo. I'm so happy I got my 5070ti at MSRP
Anonymous No.107142711 [Report] >>107142718 >>107142761 >>107143104
>>107142516
Needless to say: Spark Chroma is the one I use to gen porn, mostly. I heard that you can load Flux Loras into Chroma models/checkpoints, but I haven't tested that yet

WAN is the one I love using for "digital photo" look, and when you need anatomical precision (WAN is the best one for correct anatomy)

>>107142688
Agree 100%

https://files.catbox.moe/ehv5ro.png
Anonymous No.107142718 [Report] >>107142745 >>107142792 >>107142848
>>107142711
> Spark Chroma is the one I use to gen porn
Any complex examples?
Anonymous No.107142745 [Report] >>107142803 >>107142823 >>107143104
>>107142718
I can't show most of them here (my tastes are a bit niche)

This is one of the most artistic/safe-ish ones I genned

https://files.catbox.moe/u8ysoa.png
Anonymous No.107142761 [Report] >>107142808
>>107142611
Cool will try. Interesting that it's trained on a single 4090

>>107142711
>WAN is the one I love using for "digital photo" look
Yeah a sharp photo. Cellphone slop on Chroma-HD(-Flash) all day
Anonymous No.107142770 [Report] >>107142818
>tfw youve found the perfect combo of light loras for motion
>tfw I need a different combo one for each image
Anonymous No.107142786 [Report] >>107142826
>>107142674
Model name is enough, thanks. I have pretty much zero idea about local models past sdxl (noob) and base flux.
Anonymous No.107142792 [Report]
>>107142718
trained on Krea, the results are better. it's just got a really low learning rate already (3e-5), and i possibly have to go lower
Anonymous No.107142803 [Report] >>107142848
>>107142745
I only use 2.2 with lightning and at low cfg it doesn't listen that well, so I try to keep it a little vague. Most important thing is to prompt anything you think might get hidden, like if eyes close and you don't say "blue eyes", you might get green eyes when they open, stuff like that.
Anonymous No.107142808 [Report] >>107143793
>>107142761
me posting against the troonku obsession of a generic tranime girl you commited your whole identity around spamming is proof enough that i dont have 80 iq retard brain
Anonymous No.107142818 [Report]
>>107142770
he just wants money. if a company offered for 200 mil he'd do it
Anonymous No.107142823 [Report] >>107142835 >>107142848
>>107142745
you're not a bloody nonce are ya?
Anonymous No.107142826 [Report]
>>107142786
honestly this is better than without the lora. i think you need to lower the strength because that lora jiggle is so unrealistic
Anonymous No.107142835 [Report]
>>107142823
either way, the moment they sell out is the moment another ui will take their place. it's as simple as that. there are plenty of devs waiting for comfy to die anyway so there will be alternatives.
Anonymous No.107142848 [Report] >>107142855 >>107143104
>>107142718
I'll just say this: Chroma models are the only ones that not only can generate porn out of the box, but it's the only one that can generate males with correct genitalia

>>107142803
I actually disable the lighting lora when I need more artistic photos, there's a lora for better lighting (confusingly also called wan lighting) that I leave on as a default

>>107142823
hehehe *laughs nervously* me? no, no of course not <.<

https://files.catbox.moe/bocylz.png
Anonymous No.107142850 [Report] >>107142856 >>107142862 >>107143546 >>107144737
What does the booru tag "lother" mean? I got it from an image interrogated with wd-eva02-large-tagger-v3. Googling it turns up nothing.
Anonymous No.107142855 [Report]
>>107142848
comfy is not the majority shareholder. the grift chink is. anything comfy says about company direction is not in his control
Anonymous No.107142856 [Report] >>107142881
>>107142850
Wait nevermind it was "1other", I misread the 1.
Anonymous No.107142862 [Report]
>>107142850
as much as I don't"t want to believe that it's exactly the kind of thing to expect in a year or two. we need something else
Anonymous No.107142881 [Report]
>>107142856
ncels who ai image gen
Anonymous No.107142940 [Report] >>107142952 >>107142953 >>107142978 >>107143002 >>107143346 >>107143426
Tired of the pauses between high noise then low noise. The pauses can add up to an additional 2 - 3 minutes. Is there not a way or a node that just does the whole thing in one go?
Anonymous No.107142952 [Report] >>107142991 >>107142995
>>107142940
load both models at once
you got the vram for that?
Anonymous No.107142953 [Report] >>107142991 >>107143010
>>107142940
>pauses can add up to an additional 2 - 3 minutes
get more ram or an ssd so you arent reading the models from the hdd into your 16gb ram?
Anonymous No.107142978 [Report]
>>107142940
What scheduler is it meant to be used with? Fails to denoise correctly with DDIM uniform (shows large influence of input image with 1.0 denoising)
Anonymous No.107142991 [Report] >>107143013 >>107143019
>>107142952
Not yet, only a 4070tis

>>107142953
Already have an ssd and 32gb ram. Kinda fucked until I can upgrade, typical ram prices would double as of late
Anonymous No.107142995 [Report]
>>107142952
its qwen with the lora to turn drawings into cosplay
Anonymous No.107143002 [Report] >>107143027 >>107143148
>>107142940
yeah, I hope we'll get a replacement to wan 2.2 at some point, this shit stinks
Anonymous No.107143010 [Report]
>>107142953
maybe as a big breast lover I'm just that unsophisticated but I like women with R-cups, wouldn't you want her to look more like a rare occurance?
Anonymous No.107143013 [Report]
>>107142991
stan, y u so mad, try to understand, that i do want u as a fan
Anonymous No.107143019 [Report] >>107143029 >>107143148
>>107142991
that'd be it, can't fit 50gb of models into 32 so it's loading from disk before ram
it's slow for me too but not 2-3 minues, maybe 40 seconds
Anonymous No.107143027 [Report]
>>107143002
Enjoy your slomo, reduced prompt adherence and deadened motion then, I guess.
Anonymous No.107143029 [Report]
>>107143019
The only "fix" is to disable lightxv's lora for the high noise phase. 6 steps high noise, 3.5cfg. 4 steps low noise, lighx2v, 1cfg. No slomo.
Anonymous No.107143104 [Report] >>107143112
>>107142848
>>107142745
>>107142711
are all these wan gens anon?
Anonymous No.107143112 [Report] >>107143120
>>107143104
the flamethrower and the warship gens are, the statue one is Spark Chroma
Anonymous No.107143120 [Report] >>107143146
>>107143112
can you share a decent chroma gen catbox? off all the models I cant get chroma to work right. Is the statue gen a good wf for chroma?
Anonymous No.107143146 [Report] >>107143153 >>107143225
>>107143120
It is. Here is it again. I'm not sure if the statue one uses a more simplified one, with removed nodes that I don't use

https://files.catbox.moe/qh0p74.png
Anonymous No.107143148 [Report]
>>107143002
Hopefully. Wonder if 2.5 is high and low too, hopefully we'll get a local version of that or 3.0 in the future

>>107143019
I use Q6 GGUF https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF/tree/main/LowNoise, that shouldn't be anymore than 24gb
Anonymous No.107143153 [Report] >>107143225
>>107143146
thnks will get back with a gen!
Anonymous No.107143225 [Report] >>107143249
>>107143146
>>107143153
its nice! but the facedetailer completely ruined it
Anonymous No.107143249 [Report] >>107143292
>>107143225
It does that sometimes. It saves an image for every step, so you can pick and choose the best one
Anonymous No.107143292 [Report]
>>107143249
yeah this is hiresfix output
Anonymous No.107143346 [Report] >>107143625
>>107142940
>loading off an HDD
Your problem. Takes 12 seconds for me
Anonymous No.107143426 [Report] >>107143485 >>107143625
>>107142940
You can cope with Phroot's all in one model.
Anonymous No.107143485 [Report]
>>107143426
It's not that bad. I made some quick placeholder idle animations with it.
Anonymous No.107143546 [Report]
>>107142850
best part about AI is it generalizes the art style well always to new stuff and colors not in the reference image
Anonymous No.107143620 [Report] >>107143669 >>107143753
it appears that Seko 2.0 lightx2v has fixed its slow motion problems

left: WAN2.2-Lightning_T2V-v1.1-A14B-4steps-lora_{HIGH/LOW}_fp16.safetensors

right: Wan2.2-T2V-A14B-4steps-lora-rank64-Seko-V2.0{HIGH/LOW}.safetensors

better prompt adherence/training data as well? it followed "glimpse of pink thong panties" this time around
Anonymous No.107143625 [Report]
>>107143346
>he thinks I use hdds

lol, lmao, probaby bot post

>>107143426

phr00ts models are good however, every 2 or 3 gens it automatically offloads ALL of the models and takes like 12 minutes to load again (no other model does this apart from phr00ts).
Anonymous No.107143669 [Report]
>>107143620
Gonna wait for i2v but looks good. Stacking various 2.1 and 2.2 light/lightning loras produces some interesting results
Anonymous No.107143753 [Report]
>>107143620
>Why Everyone Is So Mean 2 me </3
Anonymous No.107143793 [Report] >>107144098 >>107144161
>>107142808
10/10 sperg. Would post again
Anonymous No.107143884 [Report] >>107143895 >>107143910 >>107144080
Visited /sdg/ as incognito today, found these:

HamsterAnon (he only posts hamsters):
>>107140647

Lumi (xe only posts this catgirl):
>>107140950

LandscapeAnon (only posts landscapes .mp4 which is based):
>>107141783

SubwayAnon (he only generates memes with food brands):
>>107140856

Debo:
>107142901

KreaSlopper:
>107140589

Which one do you think Debo samefags with?
Anonymous No.107143895 [Report]
>>107143884
Gm. Please stay in your thread, ty.
Anonymous No.107143910 [Report]
>>107143884
>HamsterAnon
That's definitely a Quokka
Anonymous No.107144080 [Report]
>>107143884
nobody fucking cares about your retarded drama you fucking nigger, go back to the cesspoll that is /sdg/ and stop shitting up this thread, fucking fag
Anonymous No.107144083 [Report]
>looks good
it appears to be strictly better than the previous lightning lora for 2.2. much less issues with motion (not perfectly fixed but at least 80% better) and it is visibly more aligned to ethnic features and skin tone etc as well
Anonymous No.107144098 [Report] >>107144161 >>107144258
>>107143793

Mind catboxing a gen? Made a lora of a MILF I know and need this type of quality out of chroma with the lora
Anonymous No.107144161 [Report] >>107144258
>>107144098
>>107143793
same pls
Anonymous No.107144258 [Report] >>107144266 >>107144421
>>107144098
>>107144161
https://files.catbox.moe/sqjhot.png
Anonymous No.107144266 [Report] >>107144916
>>107144258
make a erika kirk lora haha
Anonymous No.107144421 [Report] >>107144442 >>107144916
>>107144258
Anonymous No.107144434 [Report] >>107144515 >>107144531
what would happen if i merged sdxl with a pony checkpoint then merged an illustrious? would it just be a mess? they all do certain things well and have knowledge of concepts i am looking for, im trying to design a good work flow. or maybe it would be better to gen the concept i want with one checkpoint then make a lora for another checkpoint? but lora making is hard i havnt figured out how to make a good one yet
Anonymous No.107144442 [Report]
>>107144421
>Heart floats up to his hand
Comical.
Anonymous No.107144493 [Report] >>107144531
Anonymous No.107144497 [Report]
Oof..
Anonymous No.107144515 [Report] >>107144610
>>107144434
>what would happen if i merged sdxl with a pony checkpoint then merged an illustrious?
You'd feel a sudden urge to upload it on civitai as a "trained model" under "early access".
Anonymous No.107144531 [Report] >>107144535
>>107144434
>if i merged sdxl with a pony checkpoint then merged an illustrious
You'd get a terrible model that doesn't work. I think you can merge some specific layers to get some likeness, but that's it

>>107144493
Thick legs, she's built like a tank
Anonymous No.107144535 [Report]
>>107144531
THICK THIGHS SAVE LIVES
Anonymous No.107144544 [Report] >>107144587
Finally pulled on ComfyUI (and custom nodes) for the first time since July as I was looking to experiment with video stuff and now every one of my workflows is broken because the impact pack whitelist is not letting my .pt files through. I have added them by "just the filename" as it says in the documentation but also tried the full paths, to no avail. It sees the whitelist and loads them, as i can see the

[Impact Pack/Subpack] Loaded 4 model(s) from whitelist:

But I still get the error popup when I call UltralyticsDetectorProvider. By the way, is not ComfyUI about the most irritating error handling possible? Just this popup window with a massive python backtrace?
Anonymous No.107144587 [Report] >>107144648
>>107144544
imma be real with you dude if you just pay the claude jew and give them a dollar a day you can basically just get an AI to figure out your entire problems for you at this point assuming you're a programmer and can understand how to set all that up and understand what claude says back to you, like itll keep reading source code and opening web pages and taking screenshots of your desktop and stuff until it figures out the issue
Anonymous No.107144610 [Report]
>>107144515
kek
Anonymous No.107144617 [Report] >>107144733
How have they not officially released sage attention 3 yet?
Anonymous No.107144648 [Report] >>107144718 >>107144733 >>107144797
>>107144587
I couldn't even get claude to generate a damn shell script to delete every image on a folder over 30 days old without going through six or seven versions and a troubleshooting session. I am not about to let it monkey around randomly on my computer.

At any rate I found the issue. Needed to update ultralytics python package.
Anonymous No.107144718 [Report] >>107144776
>>107144648
were you using Claude Haiku?
Anonymous No.107144733 [Report] >>107144776
>>107144617
its destructive compared to sage attention 2 so who cares

local needs a new base model and cheaper compute (buy all the memory you will need until 2029 sooner rather than later. nvme, ram, vram everything. all prices are going up and all manufacturing capacity is booked)

>>107144648
oh ok. python dependency management is the antichrist
Anonymous No.107144737 [Report] >>107144760
>>107142850
https://danbooru.donmai.us/wiki_pages/1other
Anonymous No.107144760 [Report] >>107144780
>>107144737
looks like a relatively worthless tag imo, i was expecting humanoids/robots for the examples
Anonymous No.107144776 [Report]
>>107144718
Sonnet 4.5 with thinking enabled.
On the plus side it wrote the systemctl timer and service files just fine.
>>107144733
Not bad, I like how the bow stayed intact.
Anonymous No.107144780 [Report] >>107144951
>>107144760
>humanoids/robots for the examples
I use it for anything that's not a regular 1boy or 1girl desu like monsters, beasts, and ghosts. It might work for robots.
Anonymous No.107144797 [Report]
>>107144648
I cycle code around Claude, ChatGPT, Kimi2 and Grok. It's actually pretty fun to take something simple and use their research models to make separate versions with commentary. I got wildly different versions of simple image cropping program.
Anonymous No.107144916 [Report] >>107144951
>>107144266
no u

>>107144421
>36 prior convictions of stealing breakfast
Why is this monster on the streets?
Anonymous No.107144951 [Report]
>>107144780
>I use it for anything that's not a regular 1boy or 1girl desu like monsters, beasts, and ghosts
ok but like do you NEED it? i refuse to believe if you prompt everything for a monster like (horns) etc it's going to be able to figure it out.

i guess it might be useful in theory to distinguish who should have 1other traits in a gen like (1other, 1girl, horns) but i bet it doesn't even work like that/is trained like that

>>107144916
>Why is this monster on the streets?
well technically he's in a park
Anonymous No.107145002 [Report] >>107145014 >>107145092 >>107145119 >>107145149 >>107145173
made a simple Wan 2.2 T2I workflow. Anything missing/wrong? The low noise part takes over 11 mins for some reason, high noise only needs around one minute
Anonymous No.107145014 [Report]
>>107145002
the result
Anonymous No.107145092 [Report] >>107145326
>>107145002
>fp8 clip
>non-1280x720p res
Anonymous No.107145119 [Report] >>107145326
>>107145002
I'm 99% sure the last time I did wan t2i I used a single packaged sft i.e. no separate low and high
I can't remember where I found it
Anonymous No.107145149 [Report] >>107145195 >>107145326
>>107145002
The sampler setup is awful. You need a chain sampler node that picks up leftover noise. Not denoise another seed, at full strength. First pass has no purpose here and the low denoising model can't cope too well with high denoising steps.
Do you have 5000 series? Use Q8 umt5.
Dont exceed 720x1280 resolution.
I think some model sampling value like 5 is preferred, for both, but I haven't personally experimented much.
I don't know how good euler beta is with this model.
Anonymous No.107145173 [Report] >>107145326 >>107145361
>>107145002
why are you doing 10 steps of both high and low for lighting loras
Anonymous No.107145195 [Report] >>107145252 >>107145284
>>107145149
>Do you have 5000 series? Use Q8 umt5.
never quant the text encoder. fp16 t5 with --fast > Q8
Anonymous No.107145252 [Report] >>107145359
>>107145195
--fast fucks up the quality for image gen, its ok for wan
Anonymous No.107145284 [Report] >>107145359
>>107145195
Fast will rape it more than Q8 lol.
I agree in principle with not quantizing the text encoder but umt5 is cancer to run if you don't have a lot VRAM and system RAM.
Anonymous No.107145326 [Report] >>107145397
>>107145092
ok switched to fp16
>>107145149
will try with ClownsharkChainsampler. I have that one from https://civitai.com/models/2106471 but that workflow had like 4 passes for some reason which is why I tried to make a simpler one. And no, I have a 3060
>>107145119
someone posted his workflow here (pic related) but it was using wan 2.1 and switching to 2.2 caused artifacts
>>107145173
because I have no idea what I'm doing
Anonymous No.107145359 [Report] >>107145441
>>107145252
yeah on a fp16 image model, not relevant for a text encoder
and this is in reference to wan t2i so your advice cancels itself out

>>107145284
>Fast will rape it more than Q8 lol.
there is no way fp32 -> fp16 (of just the accumulation operations) is more destructive than fp16 -> Q8. prove this shit. it's a text encoder show me the perplexities right now if you're willing to make a claim this un-intuitive
Anonymous No.107145361 [Report]
>>107145173
brap
Anonymous No.107145385 [Report]
fresh

>>107145378
>>107145378
>>107145378

fresh
Anonymous No.107145397 [Report]
>>107145326
>And no, I have a 3060
Also 3060 here
You want either fp16 if you can bear it or Q8.
Also you want the sampler setup to look something like this.
Anonymous No.107145441 [Report]
>>107145359
>there is no way fp32 -> fp16 (of just the accumulation operations) is more destructive than fp16 -> Q8. prove this shit.
Midwit take.
Not every part of the model has the same importance.
The FP32 parts are kept at FP32 because they are most sensitive to precision.
Fast mashes them into FP16, which in turn rapes coherency.
While Q8 (without fast) keeps them at FP32 and only quantizies less important parts.
The result is better quality at lower size.
>it's a text encoder show me the perplexities right now if you're willing to make a claim this un-intuitive
This is based on intuition and my previous experiments.
Feel free to provide sufficient counter examples.
Anonymous No.107145647 [Report] >>107145683
is wan q8 better or fp8? considering 16gb vram.
Anonymous No.107145683 [Report]
>>107145647
Q8 has better quality but fp8 will run faster on 5000 series