← Home ← Back to /g/

Thread 106589837

331 posts 182 images /g/
Anonymous No.106589837 [Report] >>106589848 >>106589862 >>106589978 >>106590027
/ldg/ - Local Diffusion General
I'm Getting Litty Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106585705

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106589848 [Report] >>106589855
>>106589837 (OP)
you didn't add anistudio to the OP
Anonymous No.106589855 [Report] >>106589862
>>106589848
correct
Anonymous No.106589862 [Report] >>106590129
>>106589837 (OP)
faggot OP forgot this
https://github.com/FizzleDorf/AniStudio/releases/tag/pre-release

>>106589855
>t. faggot
Anonymous No.106589866 [Report] >>106589958
Blessed thread of frenship
Anonymous No.106589872 [Report] >>106591309
>nunchaku qwen image models released
>deepcompressor last updated 6 months ago
>no quantization code in deepcompressor so you can't quantize your own merged models
they are going closed source
Anonymous No.106589911 [Report] >>106589916 >>106589927 >>106589948 >>106589966 >>106589974 >>106590046
SaaS is here to stay.
>>106589823
The only difference between you and me is how far my GPU sits from my setup.
I'm running local just like you, images save to my local drive in the same folder as my API nodes and local UI workflow.

Or are you telling me you're somehow generating with autonomous electricity and ibterbet access while staying local, retard?
Anonymous No.106589916 [Report]
>>106589911
ani is saving local. saas lost
Anonymous No.106589927 [Report]
>>106589911
>The only difference between you and me
kek, you are using the fisher price version of AI generation, you are like someone who thinks Facebook is the internet

go back to /saasdg/ and wallow in your misery
Anonymous No.106589944 [Report] >>106589965
Doe lora files contain metadata? And if yes, is there a way to strip it?
Anonymous No.106589948 [Report]
>>106589911
>The only difference between you and me is how far my GPU sits from my setup.
Anonymous No.106589958 [Report] >>106590063
>>106589866
>jumpscares you
Anonymous No.106589965 [Report]
>>106589944
Doubt it, since that would have to be placed there by the trainer program, for what purpose ?
Anonymous No.106589966 [Report]
>>106589911
>how far my GPU sits from my setup
>my GPU
LOL
Anonymous No.106589974 [Report]
>>106589911
>The only difference between you and me is
kek, you litterally have a jew standing behind you saying 'no goy, you can't generate that, you can only generate this'

(((SAAS))) shill begone
Anonymous No.106589978 [Report] >>106589984 >>106589986 >>106589998 >>106590013 >>106593336
>>106589837 (OP)
What is the alternative to buying a GPU to run AI models? Surely all those server farms aren't built off GPUs right?
Anonymous No.106589984 [Report]
>>106589978
I think they call them accelerator cards in the enterprise market, but yes they're graphics cards
Anonymous No.106589986 [Report]
>>106589978
nah you're right. they use pixie dust and unicorn farts
Anonymous No.106589987 [Report] >>106590025 >>106590049
/ldg/ bross welcome SaaS users too!
Share your gens, don't be shy about your outputs here!
We're all working with generative models, local or cloud based doesn't matter.

/ldg/ accepts everyone, post your creations and let's see what you've been making!!

Pic related generated in my local pc with local nodes with my local billing adress, same as you.
Anonymous No.106589995 [Report] >>106590071
*yawn*
Anonymous No.106589998 [Report]
>>106589978
>Surely all those server farms aren't built off GPUs right?
NVidia is swimming in endless money, surely those server farms aren't using GPU's, right ?
Anonymous No.106590010 [Report] >>106590019
Qwen SRPO waiting room:
https://github.com/Tencent-Hunyuan/SRPO

Waiting for a hero to try this on Qwen
Anonymous No.106590013 [Report]
>>106589978
They're not built off gaming gpus, no. That's why nvidia can charge an arm and a leg for tripling your power bill, they don't care about you because gaymers don't financially matter (anymore)
Anonymous No.106590019 [Report] >>106590038 >>106590056
>>106590010
It was tried on Flux and results weren't that good. Flux is more realistic than Qwen by default. It is over.
Anonymous No.106590025 [Report]
>>106589987
big if true
Anonymous No.106590027 [Report] >>106590040
>>106589837 (OP)
>previous thread
>ctrl + f "litty"
>zero results
what did OP mean by this
Anonymous No.106590038 [Report] >>106590058
>>106590019
>Flux is more realistic than Qwen by default
lel
Anonymous No.106590040 [Report] >>106590052 >>106590053
>>106590027
This is a zoomer thread now, he's bussin on god no cap fr fr on ohio skibidi rizzler's gyatt
Anonymous No.106590046 [Report]
>>106589911
KYS shizo
Anonymous No.106590049 [Report]
>>106589987
>the true state of local
Anonymous No.106590052 [Report] >>106590095 >>106590113
>>106590040
fr fr ong no cappin' bruhski we
uh, fuck i don't know more zoomer lingo
Anonymous No.106590053 [Report]
>>106590040
O.o
Anonymous No.106590056 [Report]
>>106590019
Even if it doesn't completely unslop the model, lowering the bias towards 4o garbage would already be a victory, don't you agree?
Anonymous No.106590058 [Report] >>106590078
>>106590038
It's the truth. Krea was no accident. It is based off an non-distilled Flux model. Chroma was no accident, similar. Qwen is not even distilled and it's already slopped.
Anonymous No.106590063 [Report] >>106590073 >>106590110 >>106590464
>>106589958
gib catbox for godzilla tits miku
Anonymous No.106590065 [Report] >>106590086 >>106590096
Anonymous No.106590071 [Report] >>106590825
>>106589995
Imagine yawning at objective truth.
Imagine yawning when your "local" models were literally trained on LAION scraped by German academics using cloud compute clusters, then distributed by Stability AI (a company).
Imagine yawning when your loras were trained on datasets someone else curated and uploaded to HuggingFace (owned by investors).
Imagone yawning when your "local" inference is running on hardware designed by NVIDIA/AMD and drivers maintained by corpos.

Sure, keep yawning about how clicking download on civitai makes you some kind of digital freedom fighter while SaaS chads just cut out the middleman. At least we are honest about using corpo services instead of larping as tech libertarians.
Anonymous No.106590073 [Report] >>106590110
>>106590063
bro dumped like a dozen catboxes recently FAGGOT GO LOOK FOR THEM IN THR PREVIOUS THREADS
Anonymous No.106590075 [Report] >>106590138
*yawn*
Anonymous No.106590077 [Report]
>106590071
holy cloudkek cope
Anonymous No.106590078 [Report]
>>106590058
Krea attempted to specifically remove Flux slop, and even that failed.

Flux is THE slop model, nothing else comes close.
Anonymous No.106590086 [Report]
>>106590065
i should call her...
Anonymous No.106590090 [Report] >>106590104 >>106593400
Decided to try out flux and apparently my 4090 is not enough lol
Anonymous No.106590095 [Report] >>106590110
>>106590052
https://en.wikipedia.org/wiki/Glossary_of_2020s_slang
some i remember being used when i was a kid and probably before that
quite a few are p funny
Anonymous No.106590096 [Report]
>>106590065
Why are you posting pics of my waifu
Anonymous No.106590104 [Report]
>>106590090
that's one way to out yourself as a total retard
Anonymous No.106590110 [Report] >>106590135 >>106590139
>>106590063
that just made my dih leak a fluid, sure bud
https://files.catbox.moe/w7lnm4.png
though the background is a bit fried, i think i busted the cfg scale in this one though i forgor

>>106590073
i appreciate the enthusiasm but its fiiine

>>106590095
>99% of this shit is just black/urban simplifications of already simple words
shits the way the cookie crumbles i guess, personal favorite has to go to "ligma" and its variations though honestly.
Anonymous No.106590113 [Report] >>106590139
>>106590052
Model/catbox? I'm trying to improve my proompting when it comes to realistic gens
Anonymous No.106590128 [Report] >>106590175
Anonymous No.106590129 [Report] >>106590182
>>106589862
>Commercial License
>If you prefer to use this project under a commercial license, please contact us at [your-email@example.com]

kek
Anonymous No.106590135 [Report] >>106590146
>>106590110
>https://files.catbox.moe/w7lnm4.png
no i wanted the catbox of miku, unless that wasn't you
Anonymous No.106590138 [Report]
>>106590075
Aother yawn from "I downloaded someone else's model" anon?
Let me guess, you're running it on Windows too? Microsoft thanks you for your "local" rebellion.
While you yawn your GPU drivers getting updates from NVIDIA servers?
While you yawn your Python packages from PyPI?
Your model checkpoints from Google Drive links posted by randoms?

Please, tell me about your sovereignty while your entire stack depends on corpo infrastructure you have no control over.

Yawn harder,

SaaS brothers welcome! We are /ldg/!
Anonymous No.106590139 [Report] >>106590183
>>106590113
this catbox >>106590110
same model, same prompt, just cfg i'd float around 3.5-4.5 depending on the lora,
though honestly if you have a gpu with more than 8gb of vram, i'd look into doing a refiner pass with an already realistic sdxl model
the second i pick up the 16gb card i'm looking at, i'm dedicating entire weekends to trying that out. nova animal is good but, it has its weaknesses.
Anonymous No.106590146 [Report] >>106590233
>>106590135
>no i wanted the catbox of miku, unless that wasn't you
oh i thought by the way you worded it, you wanted to TRADE for the catbox of my image.
periods and commas are important my m8. pretty sure that guy just used wan and asked for some titty jiggles, it's not difficult to pull off.
Anonymous No.106590150 [Report]
>>106588315
sexo with jenny
Anonymous No.106590175 [Report] >>106590190
>>106590128
Qwen does an okay job with the prompt. The model is superior to Seedream 4.
Anonymous No.106590182 [Report]
>>106590129
where does it say that?
Anonymous No.106590183 [Report] >>106590188
>>106590139
Thanks
>more than 8gb vram
I do, but aren't there options to unload the base model if you're using a refiner? After all, the base model is not needed during the refiner pass
Anonymous No.106590188 [Report]
>>106590183
reforge as far as i can tell, fully unloads the first model then loads the second, then when you do a new model it does the process over again.
very slow on my near 10 year old card, i'm sure its near instantaneous anything rtx 3000 and newer.
with comfyui there's specific nodes you're supposed to use for better memory management, like unloading models and clearing memory. some wan workflows use that for vid2vid passes.
>sorry i can't be of more help i haven't touched anything like this in months again due to aging gpu
Anonymous No.106590190 [Report] >>106590208
>>106590175
sounds like a prompt skill issue
Anonymous No.106590199 [Report]
Anonymous No.106590206 [Report] >>106590345
out of curiosity, there's no way to monetize genning an animation from an image right? since i assume there's a lot of legal trouble even if its extremely private


like theres i2v websites but if you were a named individual on like patreon or something, you'd probably get taken down and sued fast.
Anonymous No.106590208 [Report]
>>106590190
The prompt is as straightforward as you can get. Works on Chroma. Works on actual decent SaaS models. It doesn't get any better than that.
Anonymous No.106590233 [Report] >>106590255
>>106590146
noooo i'm the one who animated it
Anonymous No.106590248 [Report] >>106590255 >>106590257 >>106590463
>see insane outputs on civitai model page
>"dude, why do my outputs always look like ass but everyone else's are fire?"
>download image
>import into comfyui
>at least 30 nodes with multiple groups for upscaling, skin enhancers, hand fixers, face detailers and so on
Do... do you guys actually do all that? Am I just not autistic enough?
Anonymous No.106590249 [Report]
javascript:quote('106589609')
Escape from Tokyo with Snakette Pliskenawa
Anonymous No.106590255 [Report] >>106590289
>>106590233
holy fucking booba animation dude

>>106590248
you're not autistic enough for the autism club
Anonymous No.106590257 [Report] >>106590289
>>106590248
>Am I just not autistic enough?
correct
Anonymous No.106590263 [Report] >>106590287
>106590249
Anonymous No.106590287 [Report]
>>106590263
that's how chatgpt said I should do it man
Anonymous No.106590289 [Report] >>106590294 >>106590359 >>106590371 >>106590428 >>106590500
>>106590255
>>106590257
Okay then, what do you guys recommend? Are upscalers needed when the default resolution range of sdxl models (so like 768x1280 for example) is fine for me? What about the other groups I mentioned?
Anonymous No.106590294 [Report] >>106590313
>>106590289
If you have to ask, you'll never make it.
Anonymous No.106590296 [Report]
>>106589609
Escape from Tokyo with Snakette Pliskenawa
Anonymous No.106590313 [Report]
>>106590294
How dare I ask for advice instead of wasting hours experimenting with what works and what's needed, shame on me
Anonymous No.106590345 [Report]
>>106590206
Where there's a demand, there's a way, but discussion falls outside of the scope of this site.
Anonymous No.106590359 [Report] >>106590371 >>106590405
>>106590289
>(so like 768x1280 for example) is fine for me?
90% of the wow factor for a gen is how high the res is desu
Anonymous No.106590371 [Report] >>106590405
>>106590289
>>106590359
also its, im pretty sure, 1216x832, not 1280x768 for some reason.
Anonymous No.106590405 [Report] >>106590419 >>106590429 >>106593141
>>106590359
I don't need 8k images when I'm trying to generate some decent-looking smut, especially since I'm on a 1080p monitor
>>106590371
As long as the total amount of pixels is the same and the dimensions are a power of 64
Anonymous No.106590419 [Report] >>106590588
>>106590405
Upscaling+refiner second pass is good for ironing out minor errors and artifacts. And just 1.25x is fine really.
Anonymous No.106590428 [Report] >>106590588
>>106590289
>Are upscalers needed
you can upscale if you want, adds more detail and is a easy way to fix shit. usually the way people usually inpaint is by cropping a square around their masked area -> upscaling or downscaling it to their desired resolution (either 1024x or higher) -> denoising the masked area. -> stitching the masked area to the original image (this is done automatically using the 'masked only' option in a1111-like guis or the crop and stitch node in comfyui). upscaling the original image gives you more pixels to work with and is less lossy. optional though if you know what you are doing
>hand fixers, face detailers
i recommend you skip this stuff and stick to doing it yourself manually
Anonymous No.106590429 [Report] >>106590436 >>106590588
>>106590405
>I don't need 8k images
It's not about being 8k. Doing at least a single second pass amplifies how "good" an image looks.
>especially since I'm on a 1080p monitor
That's the thing about "highresfix", it adds details beyond what we think of as being just a higher resolution. I'm on a 480p screen and I gen images that are at least double if not triple the base res.
Anonymous No.106590436 [Report] >>106590448
>>106590429
>I'm on a 480p screen
crt monitor? or just those TFT dell monitors from like 2006?
Anonymous No.106590448 [Report] >>106590452 >>106590457
>>106590436
It's a MAG 321UP, I just like 480P.
Anonymous No.106590452 [Report] >>106590465
>>106590448
Why would you pretend to be me though.
Anonymous No.106590457 [Report]
>>106590448
>4k ultrawide monitor
>uses it in 480p
Anonymous No.106590463 [Report]
>>106590248
>do i have to do all those things that make a gen good to make my gens look good?
lots of workflows can be trimmed due to bypassed or disconnected nodes but in general yes
Anonymous No.106590464 [Report] >>106590738
>>106590063
back from the gym, here at catbox, i used wan2gp
https://files.catbox.moe/5s6c7t.mp4
https://files.catbox.moe/4yulk1.png
Anonymous No.106590465 [Report]
>>106590452
i like to add to the narrative
Anonymous No.106590500 [Report]
>>106590289
look into tiled diffusion
Anonymous No.106590547 [Report] >>106590560
Anonymous No.106590560 [Report] >>106592825 >>106593647
>>106590547
I think people underestimate Wan for being able to deslop poses and composition. Like you can use wan to have the character move to a place with a more appealing background or take a position or look in a direction that makes the piece unique and different from other things you might see from SDXL outputs.
Anonymous No.106590563 [Report]
Anonymous No.106590568 [Report]
But can I use booru tags in Wan? Huh? Yeah, thought so.
Anonymous No.106590577 [Report] >>106590627 >>106590656 >>106590675 >>106590715
>2.2 vace model is over 32gb

How the hell am I supposed to use this?
Anonymous No.106590588 [Report] >>106590595
>>106590419
>>106590429
Fair enough, I'll give it a shot then. Do you use latent upscaling or a dedicated upscaler model?
>>106590428
Nice write-up, thank you
Anonymous No.106590595 [Report] >>106590640 >>106590703
>>106590588
I use latent upscale. Also if you modify the prompt in the conditioning hook going to the second sampler you can even do minor edits to the original pic with high enough denoise.
Anonymous No.106590627 [Report]
>>106590577
get a job, hombre
Anonymous No.106590640 [Report] >>106590723
>>106590595
How low denoise do you typically set for second pass if you're just looking to upscale / fix small defects ?
Anonymous No.106590656 [Report]
>>106590577
Not worth it
Anonymous No.106590675 [Report]
>>106590577
did they release the 2.2 vace model or are you talking about "vace fun". i thought that just plugs into the existing t2v model like a lora
Anonymous No.106590703 [Report]
>>106590595
Alright, cool, time to hopefully elevate my 1girl experience to the next level
Anonymous No.106590715 [Report]
>>106590577
you mean the vace-fun version? real vace 2.2 is not out yet
Anonymous No.106590723 [Report] >>106590746
>>106590640
Idk if all models share this but on chroma 0.65-0.70 for plain upscale and 0.75+ it starts reiterating things and chaning stuff
Anonymous No.106590738 [Report] >>106591211
>>106590464
thank the Lord in Heaven

btw 4 days late lol
>>106556008
>how do I prompt for blow kiss but without the hand movement?
>the woman stretches her arms out in front of her as if to give a hug. the camera zooms in on her face. her lips fill the entire screen and she kisses the camera with her lips
that at least keeps them from blowing a kiss with their hands
Anonymous No.106590746 [Report]
>>106590723
Thanks man
Anonymous No.106590749 [Report] >>106590784 >>106590803
Anonymous No.106590784 [Report] >>106590827 >>106590888
>>106590749
The cigarettes are better
Anonymous No.106590803 [Report] >>106590821 >>106590888
>>106590749
the surprise dick in the mouth are better
Anonymous No.106590821 [Report]
>>106590803
Please Furkan, enough with that
Anonymous No.106590825 [Report] >>106590843
>>106590071
>larping as tech libertarians.
Keep crying pajeet, the only reason you are seething is cause you don't have the skill and the hardware, go back to your general /aids/ and /dale/, here is not for brown skin poorfags like you
Anonymous No.106590827 [Report]
>>106590784
Fuck off this is the zoomer thread
Anonymous No.106590843 [Report]
>>106590825
inb4 lefties claiming local ai generation is white supremacy
Anonymous No.106590888 [Report] >>106590915
>>106590784
>>106590803
Impossible to please I swear.
Anonymous No.106590915 [Report]
>>106590888
Now that's more like it
Anonymous No.106590956 [Report]
Anonymous No.106591035 [Report] >>106591123
Anonymous No.106591072 [Report] >>106591123
Anonymous No.106591089 [Report] >>106591098 >>106591124
can chroma be prompted to do anime or cartoon illustration?
Anonymous No.106591098 [Report]
>>106591089
yes of course but the style won't be consistent between seeds
Anonymous No.106591111 [Report] >>106591123 >>106591474
Anonymous No.106591123 [Report] >>106591138 >>106591147 >>106591155
>>106591035
>>106591072
>>106591111
model? It's not cloudshit / seedream, isn't it?
Anonymous No.106591124 [Report]
>>106591089
yes, pretty extensively.

if you want to prompt specific characters with their standard outfits tho you'll usually find that easier on illustrious or noobai
Anonymous No.106591138 [Report]
>>106591123
worse than that it's qwen but least it's not chroma
Anonymous No.106591147 [Report]
>>106591123
mindbroken
what a great troll that was
Anonymous No.106591155 [Report]
>>106591123
It's just qwen with an awful whinnie the pooh lora I cam across while looking for porn and thought "why not?"
Anonymous No.106591166 [Report]
20Loras No.106591171 [Report] >>106591172 >>106591188 >>106591196 >>106591222
Still suffering from horrible colorshifts.
Makes the whole video genning useless really.
Anonymous No.106591172 [Report] >>106591212
>>106591171
Based on everything else here, this seems to be an entirely you issue.
Anonymous No.106591177 [Report] >>106591189
I heard sd3 launched and came to see if it's any good
Anonymous No.106591188 [Report] >>106591212
>>106591171
2.2 fast loras caused this last time I tried
Anonymous No.106591189 [Report] >>106591201
>>106591177
Yeah and Trump became president AFTER like two assassination attempts, bitcoin passed 100k, Thailand and Burma had a brief conflict. Putin Visited the US. Biden has colon cancer. And SD3 is shit.
Anonymous No.106591193 [Report]
go to bed debo
Anonymous No.106591196 [Report] >>106591212
>>106591171
how are you specifically struggling so much with video gen. it's the easiest shit in the world
Anonymous No.106591201 [Report]
>>106591189
welp, back to cave I guess
Anonymous No.106591211 [Report] >>106591229
>>106590738
>kisses the camera
including this has a 50/50 chance of wan spawning a camera and having her kiss that in my experience
20Loras No.106591212 [Report] >>106591220 >>106591227
>>106591172
>>106591188
>>106591196
The lightx2v loras introduce motion to the gen, it seems impossible to not use them.

The others aren't using first frame last frame loops.
Anonymous No.106591220 [Report] >>106591283
>>106591212
I really don't understand how they come out so bad on your end. Maybe switch to Kijai's workflows? They are retard proof.
Anonymous No.106591222 [Report] >>106591283
>>106591171
some people can't be helped
Anonymous No.106591227 [Report] >>106591283
>>106591212
catbox your workflow and i will tell you how you are fucking up
Anonymous No.106591229 [Report] >>106591235
>>106591211
has not happened to me once
Anonymous No.106591235 [Report] >>106591251 >>106591328
>>106591229
happens to me almost every time i prompt something involving a camera. most recently i tried prompting for handheld camera movement and it just spawned a camera in
Anonymous No.106591251 [Report] >>106591277
>>106591235
It only ever happened to me once on a throwaway gen and it's because I specifically prompted ronald to "bat the camera away with his hand" which it interpreted as a camera appearing to be pushed away.
Anonymous No.106591277 [Report]
>>106591251
i love that gen. still cracks me up
20Loras No.106591283 [Report] >>106591295 >>106591310
>>106591220
>>106591222
Guess I'm retarded, works fine with kijai.

>>106591227
https://files.catbox.moe/2u1ndd.mp4
Anonymous No.106591295 [Report] >>106591371
>>106591283
I just use kij these days desu. Something about comfy workflows that always come out wrong.
Which kind of makes sense since Kij is literally just the script wan gives you to run the model and all kij is doing is making nodes in comfy that can interact with that script.
Anonymous No.106591305 [Report]
ugh bros my 90s candid amateur out of focus grainy gens are so SOVLFUL!!! chromabros we're so back!!!!!!!!
Anonymous No.106591309 [Report]
>>106589872
bro just merge the attention layers manually DUH
Anonymous No.106591310 [Report] >>106591371
>>106591283
okay yeah nvm aside from the questionable prompting im not sure there's any glaring issues lol. comfy implementation simply might be worse
Anonymous No.106591328 [Report] >>106591350
>>106591235
i don't know what you're doing wrong, using a retarded allinone model, or using a <8 cope quant but it just does not happen to me
Anonymous No.106591341 [Report] >>106591352 >>106591626
It seems that when i use more than 3 regions in regional prompter (forge UI), it breaks and only takes the first region into account. any idea what's the problem? I'm sure I used at least 4 regions in the past.
Anonymous No.106591350 [Report]
>>106591328
nope im just using Q8 wan + 2.1 lightx2v.
Anonymous No.106591352 [Report] >>106591626
>>106591341
I can't help myself, the problem is you aren't using comfy UI.
I think my head would have exploded if I didn't at least type that. Sorry.
Anonymous No.106591363 [Report]
20Loras No.106591371 [Report] >>106591394
>>106591295
>>106591310
I should have listened to my own notes. Now that I've swapped back and forth between two different 'Save Video' nodes, the color shift is gone. There has to be some shit going on that glitches it out.
20Loras No.106591394 [Report]
>>106591371
Flawless loops now, with lightx2v, REEEE
Anonymous No.106591442 [Report] >>106591458 >>106591470 >>106591567 >>106591997
https://files.catbox.moe/2mxzj8.png
https://files.catbox.moe/mfy6zj.mp4
Anonymous No.106591458 [Report] >>106591550
>>106591442
The consistency is impressive, some small artifacts in the bodypaint but overall it looks like in game footage
Anonymous No.106591470 [Report] >>106591550
>>106591442
ummm metadata?
Anonymous No.106591474 [Report] >>106591489
>>106591111
If only the bears weren't such bearlets.
Anonymous No.106591489 [Report]
>>106591474
Whinnie the pooh is a canonical bearlet. It can't be helped.
20Loras No.106591496 [Report] >>106591558
Is it possible to save each new gen into it's own new folder? I realized with png, I can get rid of even more distortion in a loop.
Anonymous No.106591550 [Report] >>106591588
>>106591458
this shit to me a whole 45 minutes to get it decently right. very frustrating trying to do convenient censorship just right enough to avoid triggering jannies with the ban hammer.
this is the original lora if curious.
https://civitai.com/models/1714926/tomb-raider-lara-croft-survivor
>>106591470
drag the image to png info on forge anon. video metadata is in its comments. drag it to wan2gp.
Anonymous No.106591558 [Report] >>106591873
>>106591496
filename_prefix can do https://blenderneko.github.io/ComfyUI-docs/Interface/SaveFileFormatting/ and also you can do folder/subfolder/[...]/filename
Anonymous No.106591567 [Report] >>106591997 >>106592093
>>106591442
*steals your workflow.*
Anonymous No.106591588 [Report]
>>106591550
Thumbs up!
Anonymous No.106591626 [Report]
>>106591341
>>106591352
ok it works in a1111 so it's a problem with forge. using comfy is out of the question so i guess i'll try reforge
Anonymous No.106591647 [Report] >>106591652
What samplers do you all use for wan? I've been using res_multistep and while good it makes gens look sorta plasticy
Anonymous No.106591652 [Report] >>106591658
>>106591647
>it makes gens look sorta plasticy
That would be the 4step LoRAs
Anonymous No.106591658 [Report]
>>106591652
nunchaku wan never ever
Anonymous No.106591722 [Report] >>106591725 >>106591740 >>106592284
https://huggingface.co/bytedance-research/UMO
bytedance has released the full model of UMO (not just the lora)
Anonymous No.106591725 [Report]
>>106591722
@grok what is this
Anonymous No.106591731 [Report] >>106591747 >>106591806 >>106591851 >>106591867 >>106591896 >>106591974
https://s2guidance.github.io/
Babe wake up, it's time for Alibaba to go for a new cope, "The next replacement of CFG(TM)1!1!!!1!"
Anonymous No.106591740 [Report] >>106591759
>>106591722
I highly doubt ByteDance is going to give local a model like Seedream.
It's either going to be Wan or Qwen that does it. Probably Wan since they don't go for slop evals.
Anonymous No.106591747 [Report]
>>106591731
>THING is all you need
Whenever I see this I assume it's trash.
Anonymous No.106591759 [Report] >>106591764 >>106591770
>>106591740
idk alibaba has started to keep some things behind closed doors lately
Anonymous No.106591764 [Report]
>>106591759
>lately
Anonymous No.106591770 [Report] >>106591771
>>106591759
>alibaba has started to keep some things behind closed doors lately
what happened?
Anonymous No.106591771 [Report] >>106591780
>>106591770
Qwen has started keeping certain models api-only
Anonymous No.106591780 [Report]
>>106591771
>it's begining
it was bound to happen, they're starting to get models that can be serious rivals to the best API ones, no way they're gonna release SOTA models, no one is doing that, ever
Anonymous No.106591795 [Report] >>106591845
Anonymous No.106591806 [Report]
>>106591731
interesting paper, but id like all these bold claims to be backed up by actual code.
Anonymous No.106591810 [Report] >>106592249 >>106592503
https://xcancel.com/bdsqlsz/status/1967431792992129065#m
His English is rough, but if I understand correctly, in a week we will have a new editing model and another video model.
Anonymous No.106591845 [Report] >>106591855
>>106591795
that's why europe got rid of most of the bears
Anonymous No.106591851 [Report]
>>106591731
that's funny, all the other replacements cope of CFG were actually making it worse than CFG itself
Anonymous No.106591855 [Report] >>106591904
>>106591845
Did they get nostalgic and decide to mass reintroduce wild creatures to assault the local population?
Anonymous No.106591867 [Report]
>>106591731
I want to say "nothingburger" but since it's from Alibaba I want to believe, so far they showed that they are a serious company.
20Loras No.106591873 [Report] >>106591910 >>106591922
>>106591558
It's saving to a folder, but each subsequent gen goes into the same folder. For png renders that's going to be a mess, hence a new folder for each time you gen.
I'm a complete beginner so I can't make sense of that page.
Anonymous No.106591896 [Report] >>106591911
>>106591731
>We use a De-distilled version of Flux Labs (2024) in our experiments.
excuse me? how did they get that? I doubt they collaborated with BFL so I guess they used this model?
https://huggingface.co/nyanko7/flux-dev-de-distill
Anonymous No.106591904 [Report]
>>106591855
italy and romania and probably others had a number of bear situations where people died without having done anything particularly stupid, yes
Anonymous No.106591910 [Report] >>106591948
>>106591873
Change your "filename_prefix" field from AnimateDiff to %date:yyMMdd-hhmmss%/AnimateDiff
Anonymous No.106591911 [Report] >>106591914
>>106591896
>how did they get that?
Seems they give it out to anyone so long as you're not a filthy local shitter.
Anonymous No.106591914 [Report]
>>106591911
>as you're not a filthy local shitter.
that's why you don't have it either :(
Anonymous No.106591922 [Report] >>106591948
>>106591873
the page explains that you can use %node_name.widget_name% or %date:FORMAT% to define the foldername it goes to.

use that for a foldername with the date and seed for example
20Loras No.106591948 [Report] >>106591989
>>106591910
Hell yeah, that worked. I bring you raunchy frieren and orc wip as thanks: https://files.catbox.moe/ze7qky.mp4

>>106591922
Ah so the date format decides each new folder, because it's counted in seconds. If I were to keep it just to the days, everything I gen goes into that one folder for the day?
Anonymous No.106591969 [Report] >>106592064
Anonymous No.106591974 [Report] >>106591982
>>106591731
>https://s2guidance.github.io/
The outputs look a little deep fried to me desu.
Anonymous No.106591982 [Report] >>106591992
>>106591974
it does, everytime a "replacement" of CFG comes in, it's always some ultra slopped, ultra fried shit (but it follows the prompt better though !!!!)
Anonymous No.106591989 [Report] >>106592003
>>106591948
>Ah so the date format decides each new folder, because it's counted in seconds. If I were to keep it just to the days, everything I gen goes into that one folder for the day?
yes. and as it says with %node_name.widget_name% you could also use any other information from any other node, such as a seed or other random number from a random number generating node
Anonymous No.106591992 [Report]
>>106591982
>Makes the image better by destroying it.

Can't wait to play with it for a few hours and never use it again.
Anonymous No.106591997 [Report] >>106592013 >>106592039 >>106592046 >>106592093 >>106592253
>>106591442
>>106591567
that's a good idea
20Loras No.106592003 [Report] >>106592131
>>106591989
The widget name would be the parameter inside of that named node, for example %Ksampler:noise_seed%?
Anonymous No.106592013 [Report]
>>106591997
>webm
That's quite the glowup of her.
Anonymous No.106592039 [Report] >>106592237
>>106591997
Why is she so captivating?
Anonymous No.106592046 [Report] >>106592058
>>106591997
She could have a future in porn
Anonymous No.106592058 [Report]
>>106592046
Wasn't her father like very high up at Goldman Sachs?
She never ever has to worry about money. Ever.
Anonymous No.106592064 [Report] >>106592073 >>106592093
>>106591969
Anonymous No.106592073 [Report]
>>106592064
>Bomb expert, dual bomb expert to be exact
Anonymous No.106592086 [Report] >>106592107
Spent some time with SRPO
Terrible prompt comprehension and medium knowledge compared to Chroma or even Qwen. Half the time it just ignores a chunk of my prompt. Still slops hands regularly. Can only decently do 3d, but is completely incapable of good nsfw. Beyond faster generation, i honestly don't understand what the fuck is even the point of that finetune is or why it has been shilled around lately.
Pic related is how SRPO understands OIL PAINTING.
Anonymous No.106592093 [Report] >>106592115 >>106592116
>>106592064
>>106591997
>>106591567
you guys are getting it all wrong, i trying tooth and nails to get the camera to do a 360 degree orbit around the subject and not the subject doing a 360 body spinning in from of a static camera. This shit is pissing me off to no end.
Anonymous No.106592107 [Report] >>106592114
>>106592086
SRPO seems to be a good method to unslop renders, but doing it on Flux was a retarded move, you can't save Flux it's obvious at this point, can't wait to see them try on qwen image though
Anonymous No.106592114 [Report] >>106592118
>>106592107
Wasn't the SRPO guys from Alibaba, if so why didn't they do it on Qwen to begin with ?
Anonymous No.106592115 [Report]
>>106592093
damn I forgot about her game, should I buy some lube to play it?
Anonymous No.106592116 [Report] >>106592282
>>106592093
what's your prompt for this? it does seem to ignore "camera orbiting around character" prompt a lot
Anonymous No.106592118 [Report] >>106592162
>>106592114
no, SRPO was invented by Tencent
Anonymous No.106592131 [Report]
>>106592003
i believe it is %KSampler.noise_seed% but you got the concept right.

i'm using another node than the vanilla KSampler so I can't definitely check it
Anonymous No.106592162 [Report]
>>106592118
Ahh, that makes more sense then
Anonymous No.106592179 [Report] >>106592210
Is it possible to pass an existing video into an img2vid generation with Wan2.2? I try doing it, but it fucks with the colors in a way that regular img2vid generations don't do.
Anonymous No.106592210 [Report] >>106592267
>>106592179
What are you trying to achieve exactly?
Anonymous No.106592214 [Report]
Anonymous No.106592237 [Report]
>>106592039
Because you're white
Anonymous No.106592249 [Report] >>106592256 >>106592262
>>106591810
Why so much focus on video and imagen now? Not complaining of course.
Anonymous No.106592253 [Report] >>106592264
>>106591997
Can you post the starting pic?
Anonymous No.106592256 [Report] >>106592284
>>106592249
They already have a niche cut out in the llm space. Now they are trying to cut out one in the image gen space. They throw us the failed attempts along the way.
Anonymous No.106592262 [Report] >>106592266
>>106592249
Whatever tech is easiest to improve on for the return will be the focus. LLMs are now in an incremental codemaxxing era while there's a lot to improve on in video gen
Anonymous No.106592264 [Report] >>106592515
>>106592253
Anonymous No.106592266 [Report]
>>106592262
>LLMs are now in an incremental codemaxxing era
I hate this. I just want one where its reward training is extracting semen from balls.
Anonymous No.106592267 [Report] >>106592272 >>106592390
>>106592210
I've got a video with glitchy eyes, and I'd like to pass it through the refiner to see if it gets better. It does get better, but it fucks with the colors.
Anonymous No.106592272 [Report] >>106592288
>>106592267
You can't just pass it through T2V at a very low denoise?
Anonymous No.106592282 [Report] >>106592811
>>106592116
prompt "The camera is orbiting 360 degrees around the woman's showing the viewer her side, back, other side of body before completing the full rotation back to her starting position. The 360 degree camera orbit around the woman's body is fast and smooth. The lighting is cinematic and dramatic, with soft shadows and realistic detail." good luck getting this work, frankly after 6 hours straight with multiple failures, i give up.
here a catbox for a spicy gen attempt.
https://files.catbox.moe/764uam.mp4
Anonymous No.106592283 [Report]
After trying video gen for a couple days. My respect for blacked content has increased. It's impossible to get it working right without a shit ton of loras and different strengths etc etc. for regular sex it's so easy just plug in and go. I wish black girls were more attractive then because I just want contrasting content during sex.
Anonymous No.106592284 [Report] >>106592323
>>106592256
>They throw us the failed attempts along the way.
feelsbad being a local fag, because this is true, look at bytedance, they failed with UMO so they gave it to us, and once they struck gold with seedream 4.0 they kept it for themselves >>106591722
Anonymous No.106592288 [Report] >>106592314
>>106592272
I've tried that, but it changes the rest of the image too much, even at extremely low deneoise, ie. 0.05.
Anonymous No.106592306 [Report] >>106592452
Anonymous No.106592308 [Report]
Anonymous No.106592314 [Report] >>106592344
>>106592288
Okay, and you tried i2v but with the first frame of the video as the input and vae encoded the broken video into the low noise sampler? Did you add noise to the samples?
Anonymous No.106592323 [Report] >>106592333 >>106592368
>>106592284
Genuinely curious as to what makes seed dream so good. I'm not sure Ive even seen an output from it.
Anonymous No.106592327 [Report] >>106592459 >>106592466
Image models from 2021 (sovlMaxxing) vs image models from 2025 (slomMaxxing)
Anonymous No.106592333 [Report] >>106592353
>>106592323
>Genuinely curious as to what makes seed dream so good.
they released the paper
https://xcancel.com/bdsqlsz/status/1966034419183124527#m
https://arxiv.org/abs/2509.08826
Anonymous No.106592340 [Report] >>106592465
Anonymous No.106592344 [Report]
>>106592314
Yeah, and that fucks with the colors, unfortunately.
Anonymous No.106592346 [Report]
/adt/ got deleted
Anonymous No.106592353 [Report]
>>106592333
Yeah, but I've never seen an output from it.
Anonymous No.106592368 [Report] >>106592383
>>106592323
>I'm not sure Ive even seen an output from it.
>>106577845
>>106578184
>>106576677
>>106576615
Anonymous No.106592383 [Report]
>>106592368
Hmm well, they are pretty sharp.
Anonymous No.106592390 [Report] >>106592402 >>106592412 >>106592453 >>106592619
>>106592267
read this, you can use a segment model to mask a face -> crop around it and upscale before sending it to the low model for a 1 step i2i and paste it back in
>https://www.notion.so/bedovyy/WanFaceDetailer-261ce80b3952805f8aaefb1cdb90ec04
Anonymous No.106592402 [Report]
>>106592390
I'll look into it. Thanks for sticking around and giving me a way forward, whether it pans out or not.
Anonymous No.106592408 [Report] >>106592415 >>106592446 >>106592463 >>106592486
Can someone help me figure out what to try next?
https://files.catbox.moe/fal8ln.mp4
Just want the dude to be black. and unfortunately neg prompts don't work for me and this setup.
Anonymous No.106592412 [Report]
>>106592390
That is interesting.
Anonymous No.106592415 [Report]
>>106592408
>Just want the dude to be black
Anonymous No.106592446 [Report] >>106592571
>>106592408
pajeet you gotta master the english language before you can master prompting, so things like negro, basketball american, breathing simulator 9000 all evoke images of 'black skinned' people in the ai's mind
Anonymous No.106592452 [Report]
>>106592306
amazing consistency
Anonymous No.106592453 [Report]
>>106592390
I wonder how this actually works for non-anime stuff.
Anonymous No.106592457 [Report]
Anonymous No.106592459 [Report] >>106592484
>>106592327
>Image models from 2021
there was none
Anonymous No.106592463 [Report]
>>106592408
>Just want the dude to be black.
a BBC enjoyer I see
https://www.youtube.com/watch?v=oNsNjMuevXw
Anonymous No.106592465 [Report]
>>106592340
That flame, else it's really good
Anonymous No.106592466 [Report] >>106592472 >>106592476 >>106592480 >>106592491 >>106592537 >>106592600 >>106592603
>>106592327
What's the actual difference here, beyond the tan and him getting older?
Anonymous No.106592472 [Report] >>106592476
>>106592466
A man oblivious to the concept of plastic surgery.
Anonymous No.106592476 [Report] >>106592509
>>106592472
>>106592466
he was in a motorcycle accident and needed facial reconstruction.
Anonymous No.106592480 [Report] >>106592491 >>106592499
>>106592466
You can't see his cheeks, lips and chin have been bogged to hell and back?
Is this that face blindness austists talk about?
Anonymous No.106592484 [Report]
>>106592459
2021 was the year this paper predicted that the future of image models would be diffusion models.
https://youtu.be/W-O7AZNzbzQ?t=3235
https://arxiv.org/abs/2105.05233
Anonymous No.106592486 [Report] >>106592501
>>106592408
Wan BLACKED lora when?
Anonymous No.106592491 [Report]
>>106592466
>>106592480
>You can't see his cheeks, lips and chin have been bogged to hell and back?
>Is this that face blindness austists talk about?
those are the same "people" who see no problem with Chroma btw
Anonymous No.106592497 [Report]
>106592491
obsessed
Anonymous No.106592499 [Report]
>>106592480
I can see there being lip filler in 2025, but he already looks bogged in 2021. I don't see much difference in the other features that couldn't be explained by aging, a wider smile, and weight gain.
Anonymous No.106592501 [Report]
>>106592486
tried one and it made weird 4 legged, cock and pussy monsters fucking.
Anonymous No.106592503 [Report] >>106592522
>>106591810
Surely they aren't releasing inferior model to what we have. If they ain't on par with at least Qwen edit and Wan 2.2, nobody is gonna use them, kinda like omnigen 2, Hunyuan i2v and so on
Anonymous No.106592509 [Report] >>106592613
>>106592476
>he was in a motorcycle accident and needed facial reconstruction.
it was in 2013 though, he looked fine before the 2020's, he started looking like bog way after that
Anonymous No.106592515 [Report]
>>106592264
Thanks
https://files.catbox.moe/a9w5zo.mp4
Anonymous No.106592522 [Report]
>>106592503
they don't care that they don't compete with local SOTA, they just want some good boi points and be treated like the "nice guy company", optics are important, especially for investors
Anonymous No.106592529 [Report]
>image2image
>staring images is a white man on the button of the woman.
>10 lora setup
perhaps you sittings and start image is bad to begin with.
Anonymous No.106592537 [Report]
>>106592466
>What's the actual difference here
Anonymous No.106592571 [Report] >>106592605
>>106592446
can't believe basketball player worked...
almost there but the penis is all wrong
https://files.catbox.moe/mus9ye.mp4
Anonymous No.106592600 [Report]
>>106592466
Left is normal, right is someone using img2img on his face
Anonymous No.106592603 [Report]
>>106592466
left is training a model with real data, right is training the same model with synthetic data
Anonymous No.106592605 [Report] >>106592695 >>106593159
>>106592571
try using "bestiality" next time in the prompt to get that skin
Anonymous No.106592613 [Report] >>106592624
>>106592509
>in an AI thread
>can't tell when something is obviously using AI and/or photoshop
grim
Anonymous No.106592618 [Report]
>>106588114
Can I get a box for this?
Anonymous No.106592619 [Report]
>>106592390
I can't be assed to download all of the segmentation models for this right now, but as far as I can tell, it just takes an input video. Segments and boxes the faces, upscales them and denoises them at a higher detail the pastes them back over the video, right?

I think the most interesting takeaway is that he uses causevid for the lowpass.
Anonymous No.106592624 [Report] >>106592635
>>106592613
unfortunately, that's the real face of Zac nowdays, what a waste...
Anonymous No.106592635 [Report]
>>106592624
Did you train the lora on Mickey Rourke's face
Anonymous No.106592695 [Report]
>>106592605
K i'll try it next.
Hell if dog knots is easier than black guys I'll go that route instead
Anonymous No.106592739 [Report]
Anonymous No.106592811 [Report] >>106592971
>>106592282
you have to be autistic with prompting with wan
mention all four angles step by step, mention how the background moves step by step
Anonymous No.106592825 [Report] >>106593319
>>106590560
That's how I keep consistency for characters in my project. I started with one good image and then do gens with her changing positions then use that as a starting image, it's not perfect but better than the gacha of trying to gen the exact same outfit/appearance again. Sometimes I'll run the final image through img2img in an image model or to refine it. The main drawback is the resolution is a bit low so if the character isn't close up close you can lose some details.
Anonymous No.106592886 [Report] >>106592889 >>106592899
what lora can i use in wan2.2 to pull their top down and expose breasts?
Anonymous No.106592889 [Report] >>106592910 >>106593159
>>106592886
>The woman pulls down her shirt, exposing her breasts

No lora needed.
Anonymous No.106592899 [Report]
>>106592886
FLF
Anonymous No.106592910 [Report]
>>106592889
i had
>she exposes her large breasts at the start
and nothing. I'll try this
Anonymous No.106592969 [Report]
https://files.catbox.moe/2tyce3.png
Anonymous No.106592971 [Report] >>106592985 >>106592997
>>106592811
Just turned off my pc due too hearing weird noises with my 5090 and i gotta sleep. Please Post the entire prompt positive and negative prompts. I suffered seven hours straight with multiple failured gens and feel very burnt out. Help an anon out please :'). Please do a 360 orbit of pic related.
Anonymous No.106592985 [Report] >>106592997 >>106593032
>>106592971
prompt
>The camera is orbiting 360 degrees around the girl showing the viewer her left side, the background moves showing the left side of the room, wall. Then the camera continues to rotate showing the viewer her back, the background moves showing the back side of the room, audiences. Then the camera continues to rotate showing the viewer her right side, the background moves showing the right side of the room, wall. Then the camera continues to rotate showing the viewer her front, the background moves showing the front side of the room.
Anonymous No.106592996 [Report]
https://files.catbox.moe/gzd7ba.png
Anonymous No.106592997 [Report] >>106593032
>>106592971
>>106592985
also use first and last frame and set the two frames to the same image
Anonymous No.106593009 [Report]
https://files.catbox.moe/v5c1s5.png
Anonymous No.106593015 [Report]
lmk if such type of gore is allowed here, idk how to spoiler images https://files.catbox.moe/n9fkqr.png
Anonymous No.106593032 [Report]
>>106592985
>>106592997
Will test this out later, thank you very much anon.
Anonymous No.106593040 [Report]
https://files.catbox.moe/kpdgky.png

Catpcha:YGANG
Anonymous No.106593138 [Report]
>Chroma-DC-2K-T2-SL4
These niggas will train anything except the qwen text encoder for it. Someone stop them
Anonymous No.106593141 [Report]
>>106590405
>As long as the total amount of pixels is the same and the dimensions are a power of 64
I'm pretty sure SDXL isn't trained on 1280x768. It means a lot what specific resolutions it's trained on. SDXL is trained on multiple different resolutions. within the 2048 pixel plane meaning 1024x1024 and 1216x832 and some others i believe; you can look it up.
You're going to get the best results if you stick to the exact dimensions it was trained on.
It's the same way where you get very bad results with SD1.5 if you make it any other dimension than 512x512.
The training dimensions are kind of hardcoded into the model, and when you don't follow them, you kind of warp the vector space and it's associations with the pixel space.
You can generate at the native resolutions, and then upscale to a high resolution and then you can downsize and crop the images later. That's the way you'll get the best qualitative outcome with the current models.
Anonymous No.106593149 [Report] >>106593334 >>106593338 >>106593362
guys which joycaption nodes to use?
Anonymous No.106593159 [Report]
>>106592889
>>106592605
Thanks! it works way better than what I had originally. still need to work on the legs, either the guy has legs coming out of his hips to the side, or the girl is missing the bottom half of her legs
https://files.catbox.moe/ke3kv9.mp4
Anonymous No.106593201 [Report] >>106593211 >>106593349 >>106593371
>heh I haven't updated comfy in weeks
>update
>now all my gens come out a blurry mess
I'm fed up
Anonymous No.106593211 [Report] >>106593226
>>106593201
turn off fast optimizations retard
Anonymous No.106593226 [Report] >>106593248
>>106593211
There are no fast optimizations, retard.
Anonymous No.106593248 [Report]
>>106593226
Turn off your eyeballs

https://files.catbox.moe/ct8jyn.png
Anonymous No.106593279 [Report] >>106593331
If i want to continue a video, do i use a different prompt? I used the same prompt as the initial and it just. slowed down and barely moved.
Anonymous No.106593312 [Report]
I enjoy Chroma.
Anonymous No.106593318 [Report]
i enjoi haveng extra chromasomee..
Anonymous No.106593319 [Report] >>106593678 >>106593733
>>106592825
do you think ani is on the right track for making a game engine with diffusion mechanics built in? you are the only anon I know that's making a game
Anonymous No.106593331 [Report]
>>106593279
ymmv, generally the same prompt shouldn't do LESS on average

maybe you have too many speedup things eabled that interfere with motion, or not enough steps, or maybe you want to try the hps/mps reward or movement lora, or other things
Anonymous No.106593334 [Report] >>106593426
>>106593149
>joycaption
>nodes
lol ur gay
Anonymous No.106593336 [Report]
>>106589978
>Surely all those server farms aren't built off GPUs right?
Yes they are, but it's not the same GPU's you can buy to put in your PC. If you watch some of the nVidia presentations, you'll see that they're super huge, and they're making them bigger and bigger each time.
nVidias market is no longer really focused on consumer GPU's. They're more so in the business of designing custom systems for big businesses that need datacenters and software solutions for training and analyzing all kinds of stuff with AI.
Anonymous No.106593338 [Report] >>106593426
>>106593149
Just get taggui
Anonymous No.106593343 [Report] >>106593349
Specs: 32GB RAM, 12GB GPU, i12900k Running ComfyUI with SDXL realisticslop

Switched from Forge to Comfy recently.
Two questions:

Is Comfy actually faster for gen/checkpoint loading than Forge or just me?

Anyone else notice Comfy outputs seem slightly softer/less sharp/quality?
Anonymous No.106593349 [Report] >>106593359
>>106593343
this anon just mentioned it >>106593201
Anonymous No.106593359 [Report]
>>106593349
I have to move back to Forge?
Anonymous No.106593362 [Report] >>106593426
>>106593149
probably decide between some of the most recently updated
Anonymous No.106593371 [Report]
>>106593201
>he pulled
Anonymous No.106593400 [Report]
>>106590090
You know you can use quantized versions right?
Anonymous No.106593426 [Report] >>106593517
>>106593334
I just love noodles, what can I say
>>106593338
id rather not have YET another conda enviro PLEASE
>>106593362
I wanted to go to a 'mostly' generic route using a generic LLAVA wrapper which uses llama-cpp-python, but the generic vision nodes were not updated recently, the other nodes have descriptions all written and chink and id rather xi not see what im captioning
Anonymous No.106593510 [Report] >>106593529
so different seeds can give you a completely fucked unusable gen? is there any way to know how bad the gens will be? I did 2 gens and one had a random mystery guy added behind the girl and the other was perfect.
Anonymous No.106593517 [Report]
>>106593426
xi has a "few" chinese that can write english - but the chinese sometimes write in their language.

just ignore it as long as you can use it.
Anonymous No.106593529 [Report] >>106593537
>>106593510
depending on model and prompt sure

you can't know in advance unless you use a very special model type but you can generate preiews with like TAESD on each step as it's crunching the tensors... on most models at least
Anonymous No.106593537 [Report] >>106593680
>>106593529
yeah i have previews on but by the time i can tell if it's fucked or not it's too late to abort,
Anonymous No.106593548 [Report]
Qwen SRPO when.
Anonymous No.106593647 [Report]
>>106590560
>my boy discovering that wan is the best edit model
Anonymous No.106593670 [Report]
new
>>106593668
>>106593668
>>106593668
>>106593668
Anonymous No.106593678 [Report]
>>106593319
I don't know the specifics of what he's doing, is it just allowing the devs to create a prompt along with parameters to generate images in game? I think using image/video gen at runtime will be pretty common eventually so starting something like that for games to come out a few years from now is probably a good idea. I don't think it's super viable right now because it's too slow or just not possible on the average consumer's hardware. Players won't like waiting a minute for a scene to generate. There's also the issues with discontinuity but maybe it will be solved in future models or players just won't care.
Anonymous No.106593680 [Report]
>>106593537
i don't see how there could be a way that lets you see it even earlier
Anonymous No.106593733 [Report]
>>106593319
i think trani (read, you) should kill himself immediately