← Home ← Back to /g/

Thread 105782211

315 posts 136 images /g/
Anonymous No.105782211 >>105783876
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105776972

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105782220 >>105782239
>mfw
Anonymous No.105782232
Blessed thread of frenship
Anonymous No.105782233 >>105782248 >>105782303
>>105782166
Do you need to do the math on the likelihood 1 token will appear in 25 images with a random dropout? That's also ignoring the absurdity that prompts aren't just random hodgepodges of words, we are trying write instructions. It's like having a radio and randomly adding static to different parts of the clip and then asking someone to learn English or even better, what they're talking about and even more specifically that Mary's dress is red and she ordered blue. Caption dropout *might* work for tag soups but they're not training on tag soups, they're doing full paragraph + tag soup + metadata. You don't just randomly do caption dropouts without side effects or now the obvious fact: IT IS NOT LEARNING ARTISTS and as I already pointed out, this is a logical outcome.
Anonymous No.105782239
>>105782220
>retarded drama on the previous thread
>debo is here
that explains a lot
Anonymous No.105782240 >>105782255 >>105782282 >>105782296
I fucking HATE the sun
Anonymous No.105782248 >>105782263 >>105782264
>>105782233
it knows a few furry artists really well at least, might be more about the artist you are wanting just not making it into the 5 million image budger
Anonymous No.105782255
>>105782240
https://www.youtube.com/watch?v=c1HSeVMz_yw
Anonymous No.105782257
>optimization comes out
>gets forgotten
Anonymous No.105782263
>>105782248
>budger
budget
Anonymous No.105782264 >>105782273 >>105782287
>>105782248
>it knows a few furry artists really well at least
like who? I wanna test that out, cool that it starts to know some artists at last
Anonymous No.105782273
>>105782264
dimwitdog
Anonymous No.105782282 >>105782296
>>105782240
https://www.youtube.com/watch?v=SHnt6zYURvk
Anonymous No.105782286
Can anyone who uses Sage try to compare imagegen speeds when using a normal model loader and and when using the MultiGPU loader with RAM set to 0?
Anonymous No.105782287
>>105782264
fluff-kevlar also somewhat works atm
Anonymous No.105782290 >>105782302 >>105782310 >>105782394
Returning to Chroma and the new AE:

>Lodestone:
>I'm experimenting on calibrating Chroma v38 checkpoint to my newer AE where it has compression and general EQ.
>the AE has different compression level F16C64 vs F32C64 and it has nifty properties where the latent has clear frequency separation. it's going to be interesting!
Here's hoping this won't just be a waste of time and force a rollback, as I understand it this is not used in any of the released checkpoints, but perhaps in v42 ?
Anonymous No.105782296
>>105782240
>>105782282
https://www.youtube.com/watch?v=8p3_CWjRkHY
Anonymous No.105782302 >>105782349 >>105782394
>>105782290
>v38
dont see why it would end up being a rollback, he is testing on a already released checkpoint
Anonymous No.105782303
>>105782233
>"A young woman named Mary stands in front of a boutique. She's wearing a long red dress with lace trim. Her expression suggests disappointment. A tag on the dress reads 'ordered: blue'. The sidewalk reflects soft afternoon light. Metadata: artist: McCumface3999, style: oil painting, location: downtown street, lighting: natural, color scheme: warm tones, mood: contemplative, keywords: woman, red dress, order mistake, fashion, lace."

=

>*"A young woman ***** Mary stands in front ***** a boutique. She's ***** a long ***** dress with ***** trim. Her ***** suggests *****. A ***** on the ***** reads 'ordered: *****'. The sidewalk ***** soft afternoon *****. Metadata: artist: *****, style: ***** painting, location: ***** street, lighting: *****, color scheme: ***** tones, mood: *****, keywords: *****, red dress, ***** mistake, fashion, ****."

Hope we didn't drop out anything important that epoch.
Anonymous No.105782310 >>105782322
>>105782290
that looks bad for the 3 of them lol, but what would Chroma gains if the AE gets improved?
Anonymous No.105782317
Anonymous No.105782322 >>105782348
>>105782310
https://x.com/LodestoneE621/status/1939504778805191097
Anonymous No.105782334 >>105782346 >>105782348
and maybe this is related?
https://x.com/LodestoneE621/status/1938734989434351796
Anonymous No.105782339
For the anons trying the Wan nsfw finetune/loras, teacache's settings and the light2x lora don't work well on it anymore, as the finetune strayed too far from the base model
Tried the extracted lora for i2v with different samplers, steps, cfg, but nah, still blurred, most likely because the t2v model strayed too far from the i2v model. I didn't try without teacache though, but it would be a pain to do so
Anonymous No.105782346
>>105782334
>this will scale and shift the default CFG strength 5 into 1.
So, guidance distillation?
Anonymous No.105782348 >>105782354
>>105782322
>>105782334
Why don't you use xcanceled?
Anonymous No.105782349
>>105782302
If he implements this in upcoming releases and it turns out it's not so good after a few epochs of use, that would mean a rollback.
Anonymous No.105782354 >>105782360 >>105782367 >>105782388 >>105783390
>>105782348
cause im not a weirdo
Anonymous No.105782360 >>105782362
>>105782354
Sorry Mister but this is a discord screenshot.
Anonymous No.105782362 >>105782378
>>105782360
still not as strange as using a "service" whos entire point is to protest something dumb for stupid reasons
Anonymous No.105782364
is this the discord screenshot comments thread?
Anonymous No.105782367
>>105782354
>muh fast AE that keeps the quality
I've seen that somewhere...
https://nvlabs.github.io/Sana/
Anonymous No.105782378 >>105782395 >>105782400
>>105782362
You are the perfect example of the people who argue here about nothing and in-fight each other.
Less than 70 IQ with no technical knowledge whatsoever.
Anonymous No.105782388 >>105782408
>>105782354
imo he's too obsessed with the speed, first with the "fast" thing that makes the model run on lower steps, now this, can't he just focus on getting a model that doesn't have frankenstein anatomy first, we'll think about the speed later
Anonymous No.105782394 >>105782441 >>105782514
>>105782290
>>105782302
this nigga has good intentions but he has fucking squirrel level adhd, instead of just training chroma to v50 and going from there, he's doing distillation, detail experiments, training his own vae and random spaghetti merging 7 checkpoints into one at the same time and releasing another 5 parallel branches
Anonymous No.105782395
>>105782378
go cry on blue cry, bet you use that dying hugbox full of pedos as well
Anonymous No.105782396 >>105782487
>i miss anim-anon
Anonymous No.105782398 >>105784173
Anonymous No.105782400 >>105782416
>>105782378
>You are the perfect example of the people who argue here about nothing and in-fight each other.
says the guy who argues here about nothing
Anonymous No.105782401 >>105782465
Is there a local AI solution similar to Eleven Labs? I want to transcribe audio books without paying 100 a month
Anonymous No.105782408 >>105782424 >>105782448 >>105782463
>>105782388
its all separate anyways
Anonymous No.105782416 >>105782426 >>105782432 >>105782480
>>105782400
What do you mean? I only wanted to recommend xcanceled instead of direct links of twitter or discord. Seems like you are pretty hostile.
Anonymous No.105782424
>>105782408
"fast" is some schizo distilled bullshit that has been merged into the main model since v29 or so and it introduces flux plastic slopskin back
Anonymous No.105782425
>>105773833
catbox por favor?
Anonymous No.105782426 >>105782452
>>105782416
>xcanceled
stop with this meme
Anonymous No.105782432
>>105782416
>I only wanted to recommend
request denied, if you can't accept a no, you're starting to argue here about nothing, you don't want to sound like a low IQ don't you anon?
Anonymous No.105782433 >>105782440
i had respect for you but now i'm worried. you're spending 16 hours on 4chan every single day.
Anonymous No.105782440
>>105782433
never meet your heroes
Anonymous No.105782441
>>105782394
Yeah, as much as I like Chroma I have to agree.

Like who the fuck decides to switch to his own experimental AE at 41 of estimated 50 epochs, this guy sure don't like smooth sailing.

That said, if this works well and is an undeniable improvement, he will be hailed as a genius, if not...
Anonymous No.105782448
>>105782408
>its all separate anyways
if only he did the same for fast, I don't mind his experiments, but at some point he changed the whole training process at v29.5, for the worse
Anonymous No.105782449
Please stop fighting and gen stupid poop!
Anonymous No.105782452 >>105782466
>>105782426
you are crying because you cannot use your centralized accounts?
Anonymous No.105782463 >>105782473
>>105782408
Ok, THAT makes sense.

Just how many branches is he training now ? Who is funding all this, furries ?
Anonymous No.105782465
>>105782401
sparktts
Anonymous No.105782466 >>105782472
>>105782452
you are using a website who's entire purpose is crying about another
Anonymous No.105782472 >>105782480
>>105782466
?
Anonymous No.105782473 >>105782489
>>105782463
>Just how many branches is he training now ?
all those useless branches when he could just focus all his gpus to a single normal one and get this shit done faster, sigh...
Anonymous No.105782480 >>105782498
>>105782472
>>105782416
Anonymous No.105782487
>>105782396
he is in the hyperbolic time chamber training on the highest gravity settings. be patient, let him cook
Anonymous No.105782489
>>105782473
I think he already has everything calibrated for however many gpu's he's using per instance
Anonymous No.105782498
>>105782480
Anonymous No.105782514
>>105782394
>this nigga has good intentions but he has fucking squirrel level adhd
Idk if he's brave or just crazy, he said he already spent 100k on this project, and yet with all those stakes he's still doing some weird experiments as if the only bad consequence would be a broken glass or something lol
Anonymous No.105782524 >>105782536 >>105782543 >>105782548 >>105782555 >>105782563 >>105782576 >>105783278 >>105783642
chroma bros..? 50 steps btw
Anonymous No.105782528
>fr fr twitter and netflix are cool
/ldg/ moments
Anonymous No.105782536
>>105782524
trust the plan anon, he knows what he's doing, the "fast" poison is just kicking in... or something...
Anonymous No.105782543
>>105782524
settings?
Anonymous No.105782548 >>105782567 >>105782586
>>105782524
I have a simple question, I never tested Schnell, were the hands that bad on that one?
Anonymous No.105782550
okay time for Radial Attention NOW
Anonymous No.105782555 >>105782595
>>105782524
Chroma will never get the small details right. He admitted his database is relatively small and training time is also an issue.
Anonymous No.105782563
>>105782524
Anyone can cherry-pick a gen that looks bad across multiple epochs, it proves jack shit.
Anonymous No.105782567
>>105782548
they are not that bad on either, he is using shitty settings or something
Anonymous No.105782576 >>105782624 >>105782675
>>105782524
grab a workflow from here
https://civitai.com/models/1330309/chroma
Anonymous No.105782586
>>105782548
just try it? it's a nice model. hands do have a chance to come out wrong, it's certainly not always tho.
Anonymous No.105782592
Anonymous No.105782595 >>105782609 >>105782633
>>105782555
Only recently was a branch bumped from 512 to 1024, fine detail like fingers will likely look bad on zoomed out people even at epoch 50.

It can most likely be fixed with further fine-tuning, perhaps even loras.
Anonymous No.105782609
>>105782595
It's a matter of further training.
Anonymous No.105782612
i'm so fucking bad at proooompting
i hate myself
Anonymous No.105782624
>>105782576
>scroll down
>first image is two zesty niggas hiding the sausage
Anon...
Anonymous No.105782633
>>105782595
What I do remember is that when SDXL 0.9 was published it was already pretty good. SDXL 1.0 was worse in some ways.
Anonymous No.105782641
Chroma with NAG works well for me for real life stuff, but for illustrations it keeps giving me worse results than regular neg.
Anonymous No.105782651
Moth girls!
Anonymous No.105782675 >>105782688
>>105782576
it's literally that workflow, but the person is not zoomed in portrait 1girl like most gens but a guy dancing in front of a store
i'm not mad, the guy spent like 50k training this and is giving it away for free, but let's not pretend it is what it isn't
Anonymous No.105782683
Anonymous No.105782688
>>105782675
>but let's not pretend it is what it isn't
that's too hard anon, you have to pretend that this model has no issue and that everything he's done to the model was the right choice!
Anonymous No.105782733 >>105782740 >>105782755 >>105782791 >>105783661 >>105783685
>try to generate oil painting of a woman
>the background and clothing look like an oil painting, but the woman's face and cleavage look realistic because the model was over-trained on photographs of womens' faces and boobs
what's the solution here?
yes I already tried (oil painting:2.0)
Anonymous No.105782734
>using sage
Why do my gens get worse it/s as it progresses?
Anonymous No.105782740
>>105782733
welcome to 1girl slop hell
Anonymous No.105782755
>>105782733
Looks like a Flux thing.
Try
>oil paint medium, impasto technique, oil painting by some artist, glazing,
Anonymous No.105782763
I for one like chroma, but I'm the target audience
Anonymous No.105782777 >>105782798 >>105783939
woops, meant to catbox

I for one like chroma, but I'm the target audience

https://files.catbox.moe/1rg68z.png
Anonymous No.105782780 >>105782790
Anonymous No.105782787 >>105782830
Anonymous No.105782790
>>105782780
aww man not my strawberry preserves
Anonymous No.105782791
>>105782733
did you try an oil painting lora and increasing its strength
Anonymous No.105782798
>>105782777
what a slut
Anonymous No.105782800 >>105782803 >>105782889 >>105782933
How do I:
>completely nuke all traces of python
>completely nuke all traces of comfy
I might need to reinstall this entire thing and get portable since this shit is falling apart
Anonymous No.105782803 >>105782837
>>105782800
Do you know what directory is?
Anonymous No.105782830 >>105782848
>>105782787
ani?
Anonymous No.105782837 >>105782882
>>105782803
I wish this was in one directory and not buried in several hidden folders.
Anonymous No.105782848
>>105782830
Bit busy now :/
Anonymous No.105782882
>>105782837
What about using an uninstaller? You clicked Python.exe installer and now you need to click the uninstaller.
Anonymous No.105782887 >>105782899
Anonymous No.105782889 >>105782908 >>105782910
>>105782800
you are in for a bad time regardless of how you install python
Anonymous No.105782890
Hello, I'm the guy trying to get 2D loops working. I thought I had it working with a Color Match node, but it was a placebo effect.
Now I'm looking at https://github.com/kazeyori/ComfyUI-QuickImageSequenceProcess to remove the starting/end frames to just bypass the unwanted flashing effect, but unfortunately it doesn't work as stated and you can't define a negative number to remove frames.
Back to the drawing board...
Anonymous No.105782899
>>105782887
how did you get it to work? anons were saying it crashes all the time
Anonymous No.105782908
>>105782889
This is why people like you should only be using smartphones.
Anonymous No.105782910 >>105782917
>>105782889
I know but I installed too many random python packages and custom nodes and now it's all fucky
Anonymous No.105782917 >>105782935 >>105782962
>>105782910
the exact same thing happens with the portable version. Just delete the venv and all the shitty custom nodes you don't use
Anonymous No.105782931
I'm unironically excited again that we soon might be free from the webshit menace. C chads, rise up
Anonymous No.105782933 >>105782941
>>105782800
which OS are you on? how did you install python and comfy cause there's like a million ways
desu Comfy should've just embraced uv as the idiomatic way, now you've got an army of tech illiterates whose installs are split between native python installations, python venvs, conda, and those weird zipped portable or one click installer shit
good luck troubleshooting all of that at once
Anonymous No.105782935
>>105782917
ok i deleted it and now it says something about missing program files folder?
Anonymous No.105782941
>>105782933
The standalone.
Anonymous No.105782962 >>105782967
>>105782917
Here's hoping he installed the random python packages inside the venv and not system wide...
Anonymous No.105782967
>>105782962
Yeah about that...
Anonymous No.105782975 >>105782983
Anonymous No.105782983 >>105783004 >>105784263
>>105782975
Samefagging pedo
Anonymous No.105783004 >>105783029
>>105782983
I am posting gens
Anonymous No.105783021
Anonymous No.105783029 >>105783173 >>105783334 >>105784198
>>105783004
have you considered therapy instead of gens?
Anonymous No.105783119
Anonymous No.105783173 >>105783554 >>105784198
>>105783029
wow you are a meanie :(
Anonymous No.105783199 >>105783206 >>105783250
Anonymous No.105783206
>>105783199
kek
Anonymous No.105783233
Can I delete the pip and uv cache? There's no system shit inside, right?
Anonymous No.105783245
Anonymous No.105783250
>>105783199
i LULEd
Anonymous No.105783278
>>105782524
doomGODS always win
Anonymous No.105783317 >>105783334
hands are hard
Anonymous No.105783334
>>105783317
>>105783029
Anonymous No.105783390 >>105783393 >>105783733
>>105782354
The throughput of the 16 channel VAE is like 100ms. It's not noticeable during generation.
Anonymous No.105783393 >>105783419
>>105783390
level of compression seems to be the goal
Anonymous No.105783419 >>105783441
>>105783393
He might as well train a new model on top of 1.6B Sana then it'd be quicker.
Anonymous No.105783441 >>105783448
>>105783419
he apparently found a better way
Anonymous No.105783448 >>105783468
>>105783441
None of his schemes proved to be that good, even his custom model is inferior to Flex which actually is trainable on a 24 GB GPU and has half the inference time.
Anonymous No.105783468 >>105783476
>>105783448
>Flex
tried it, seemed shit in comparison on top of being censored
Anonymous No.105783476 >>105783500
>>105783468
It's just a smaller Flux with CFG, there is nothing special to it outside of being something someone should be spending $100k to finetune and not a shitty extremely slow model we call Chroma. Flex is just, foundationally, faster than Chroma and it also has CFG.
Anonymous No.105783479
Anonymous No.105783500 >>105783513
>>105783476
I can't say I care about a slightly faster flux since nunchucku already exists anyways
Anonymous No.105783504 >>105783521
Any Chinese danbooru artists? Or any that aren't Japanese or Western Cartoon?
Anonymous No.105783513 >>105783522
>>105783500
Well you should care because that also applies to training speed and cost and I assume you're a desperate coomer given you talk about Chroma.
Anonymous No.105783521 >>105783562
>>105783504
Ching BiaoXing is a great one.
Anonymous No.105783522 >>105783530
>>105783513
you know it would cost about as much still right? Retraining a model on anatomy is a huge deal
Anonymous No.105783530 >>105783560
>>105783522
Flex is at least 30% faster than Chroma, so that's 30% faster per step buddy. That would mean *checks notes* we'd be past Epoch 50 now. TURNS OUT THAT MATTERS :D
Anonymous No.105783536 >>105783551 >>105783664
Why the FUCK does portable not come with node manager??
Anonymous No.105783547
>using the portable version
Anonymous No.105783551
>>105783536
if you weighed it down with too much stuff it wouldn't be portable now would it
Anonymous No.105783554
>>105783173
Aww, you're so cute and innocent, you butt hurt silly boy! :3

I just want to eat you up! ;p
Anonymous No.105783560 >>105783569 >>105783579 >>105783939
>>105783530
can it do this though?
https://files.catbox.moe/vmda0y.png

also besides nsfw stuff flux is way too locked to a few styles
Anonymous No.105783562
>>105783521
I can't find that one on danbooru :(
Anonymous No.105783569 >>105783575 >>105783597
>>105783560
Yes if he trained using Flex instead of his frakenmodel that is 30% slower?
Anonymous No.105783575 >>105783581
>>105783569
didn't flex release afterwards?
Anonymous No.105783579
>>105783560
y-you want that?
Anonymous No.105783581 >>105783647
>>105783575
Flex is 5 months now so they have similar timing.
Anonymous No.105783595 >>105784154
has anyone had any success with ltx-video?
Anonymous No.105783597 >>105783671
>>105783569
from his discord the smaller size comes at a cost
>flex is just removing some transformer block
>while chroma is not
chroma retain 19 MMDiT and 38 DiT blocks
while flex prune the MMDiT down to 5 IIRC
so chroma the model depth is not compromized
Anonymous No.105783642 >>105783645 >>105783651
>>105782524
> 50 steps

ah, so you're retarded. got it.
Anonymous No.105783645 >>105783651
>>105783642
that anon is right, you need at least 69420 steps to get decent hands
Anonymous No.105783647 >>105783681
>>105783581
You do know ostris just undistilled it right? lodestone undistilled it and is now retraining anatomy pretty much entirely, and a side effect is it also broke flux's style bias

That takes quite a lot more time to do
Anonymous No.105783651
>>105783642
>>105783645
i get pretty good hands at 30-35 on photos
Anonymous No.105783661
>>105782733
>because the model was over-trained on photographs of womens' faces and boobs
use a better model keke
Anonymous No.105783664
>>105783536
What stops you from using
>.\python_embeded\python.exe -m pip install -r .\ComfyUI\custom_nodes\SOMENODE\requirements.txt
It's not that hard.
Anonymous No.105783671 >>105783680 >>105783694
>>105783597
The pruned layers in Flex contributed little to nothing to final output, Flux is an overly bloated model so made his model 30% slower for no reason.
Anonymous No.105783680 >>105783693
>>105783671
lodestone said he removed all redundant layers and that removing any more started to have a negative effect on it
Anonymous No.105783681 >>105783697
>>105783647
Flex is 8B parameters and undistilled, so yes it's basically Chroma except 30% faster and would've been a better starting model and he would've saved 30% on his training and maybe wouldn't have had to resort in his gimmick training strategy to begin with.
Anonymous No.105783685 >>105783756
>>105782733
>>105717894
welcome to local models. redeem the mogao if you want to unlock painting styles
Anonymous No.105783689
Anonymous No.105783693
>>105783680
Yes, you prune and then train a little to bring back capabilities, the process is well documented and again, Flex did a very good job at it an unlike Chroma is actually fast and requires less VRAM and compute to train. Chroma is just an example of hubris costing money.
Anonymous No.105783694
>>105783671
chroma is slower because it re-introduces CFG/negative prompt, something that flux lacks. flux would be even slower if it had this
Anonymous No.105783697 >>105783704 >>105783732
>>105783681
cept flex came out just recently, is not nearly the same scale nor does it have the same goal... flex still is locked to flux style / subject bias. Good luck doing anything not person standing or the 3 styles it defaults to
Anonymous No.105783704 >>105783718 >>105783728
>>105783697
flex was trained on flux outputs so it's just doubling down on fluxslop. i don't know why people believe flux needs to be de-distilled in the first place. flux dev is perfectly trainable with both finetunes and loras
Anonymous No.105783718 >>105783739
>>105783704
cause a tune big enough to introduce nsfw elements costs tons in compute, illustrious cost like several hundred thousand and it was sdxl, can't expect people wanting to entirely foot that kind of bill out of charity
Anonymous No.105783728 >>105783735
>>105783704
>flux dev is perfectly trainable with both finetunes and loras
must be why there're so many amazing nsfw loras and finetunes of dev hahahah
Anonymous No.105783732 >>105783748
>>105783697
Flex is 5 months old.
Anonymous No.105783733 >>105783749
>>105783390
the latent is larger -> diffusion model is working with a larger input -> slower generation
sdxl: 3x1024x1024 image -> 4x128x128 latent = 65,536 values
flux: 3x1024x1024 image -> 16x128x128 latent = 262,144 values
Anonymous No.105783735
>>105783728
>must be why there're so many amazing nsfw loras and finetunes of dev hahahah
kek
Anonymous No.105783739
>>105783718
the bill balances out in the end, there are no shortcuts. if you look at the early epochs of chroma, it was an incomprehensible mess. it took many epochs on a 3mil+ dataset before it even started being remotely coherent. what chroma accomplished in 40 epochs on de-distilled schnell is doable on 10 epochs of dev.
Anonymous No.105783748 >>105783760
>>105783732
>Flex
was gonna say that was wrong but apparently its flux 2 that just recently released a preview
Anonymous No.105783749 >>105783763 >>105783774
>>105783733
Then you would just use the Sana AE which is significantly smaller than SDX with comparable quality. But it's retarded because that requires a massive finetune for either SDXL or Sana -- so you might as well do a full Sana finetune completely and take advantage of ALL the speed improvements.
Anonymous No.105783753 >>105783776 >>105783780
>Yes, the images all look like Flux because Flex is just Flux trained on it's own image generation.

https://www.reddit.com/r/StableDiffusion/comments/1k5s2zb/flex2preview_released_by_ostris/

What a idiot... fucking inbreeding a model like that.
Anonymous No.105783754
Sana-samas WILL prevail
Anonymous No.105783756 >>105783789
>>105783685
chroma does it fairly well unless you get unlucky and it tries to force midjourney slop
Anonymous No.105783760
>>105783748
Yeah I'm talking about Flex.1 which is the core 8B model + undistill. Flex.2 is the controlnet model.
Anonymous No.105783763 >>105783772
>>105783749
people tested sana with tiny test tunes, it sucks
Anonymous No.105783772
>>105783763
Given the best audio model available right is Sana-based seems unlikely.
Anonymous No.105783774 >>105783788
>>105783749
the sana ae is NOT comparable quality and sana is a completely unproved model
i made this a while ago but i doubt anything changed with the ae
https://slow.pics/s/DIDxrbQx
Anonymous No.105783776 >>105783781
>>105783753
no wonder flex always looked like ai slop turned to 11
Anonymous No.105783780 >>105783807
>>105783753
Yikes. Flux dev looks like plastic shit because it is trained on output from Flux Pro

Flex is then trained on the even more plastic shit output from Flux dev... ?

But why ?
Anonymous No.105783781 >>105783795
>>105783776
Are you retards even capable of knowing the difference between 1 or 2. Try holding your fingers up.
Anonymous No.105783786
>JUST FINETUNE SANA IT WILL BE SO HECKING GOOD!!!
Anonymous No.105783788 >>105783800
>>105783774
Have you fucking even looked at your own test of SDXL vs Sana AE? SDXL is absolute vomit.
Anonymous No.105783789 >>105783895
>>105783756
>tries to force midjourney slop
surely he tagged the MJ images as such so you can put that shit in the negatives... right? right??
Anonymous No.105783795
>>105783781
11 is 2 1s next to each other
Anonymous No.105783800 >>105783811
>>105783788
yeah now how about you compare it to sana dc-ae
Anonymous No.105783802
i feel so lost. tried to delete my python folder.
Anonymous No.105783807 >>105783827
>>105783780
datasets are a lost art. nobody knows how to assemble them properly and they read about the wonders of synthetic data from the LLM world and think it applies equally to image models.
Anonymous No.105783809
>ctrl+f sana
>14 results
are we back
Anonymous No.105783811 >>105783855
>>105783800
Yeah pick the details you want to mangle. SDXL is generally worse on all details and *sometimes* better at faces. Also Sana has multiple AEs so you can do the 16 instead of 32.
Anonymous No.105783814 >>105783825
help i accidentally made the comfyui image feed max size and now i cant resize it back
how do i unfuck myself?
Anonymous No.105783823 >>105783840 >>105784024 >>105784212 >>105784352
After testing local anime diffusion, I've decided to stick with NovelAI.

Why?

CivitAI has attractive models with impressive samples and a diverse style database, but there's a major flaw:
SDXL is fundamentally stupid model.

I believe the quality of the dataset and user efforts,through samplers, Lora workflows, etc. put into SDXL is significantly higher, yet they can't compensate for its shortcomings, which is disappointing given the dedication of CivitAI's developers.

Did you know that NovelAI comprehends prose exceptionally well? Even better than Flux!
I can input vague scene descriptions and generate an infinite variety of high quality outputs, all aligned with my prompts.

Pic related its a local image creation. It's quite beautiful, but it lacks a story or any significance behind its beauty.
If there's any beauty, it's in the quality of the checkpoints, which are often passed due to the diligence of the people from CivitAI.
And the AI artist prompts not with prose or a message with meaning, but aims to 'hack' the stupid language of SDXL.
Anonymous No.105783825 >>105783858
>>105783814
>comfyui image feed
What are the advantages of the image feed over the queue feed?
Anonymous No.105783827
>>105783807
First ask yourself what Flex.2 is. Then ask yourself What Flex.1 is. Then ask why Flex.2 used synthetic images.
Anonymous No.105783830 >>105783837 >>105783838
There's a reason why sana isn't popular. Its vae might be good but the model is past saving
Anonymous No.105783837 >>105783847 >>105783852
>>105783830
let them cope. pixshart finetunes in 2 more weeks!
Anonymous No.105783838
>>105783830
Sana isn't popular because it has a 32 GB VRAM minimum requirement because Nvidia wanted to sell 5090s. Sana Sprint is pretty cool though.
Anonymous No.105783840
>>105783823
if only they would do a midjourney and release a uncensored version of their video model
Anonymous No.105783847 >>105783857
>>105783837
Yeah in 2 more months maybe Chroma will fix hands and the caption dropout will skip the art tags.
Anonymous No.105783852
>>105783837
pixshart is different than sauna tho
Anonymous No.105783855
>>105783811
kek you dont understand how this shit works
sana cant just swap out the ae without retraining
and if you swap the ae you lose compression meaning you lose speed meaning the whole point of the model is lost
Anonymous No.105783857 >>105783884 >>105783887
>>105783847
chroma does hands well, stop falling for the one guy either trolling or using shitty settings
Anonymous No.105783858
>>105783825
I used image scale node and it seems to fuck up my gens permanently.
Anonymous No.105783876 >>105783900 >>105783909
>>105782211 (OP)
add
>>>/b/ai+parody
to neighbors plox
Anonymous No.105783884 >>105783894
>>105783857
>bro it's your problem is 30% of the time the hands are mangled, you're supposed to gen like 10 times
Anonymous No.105783887
>>105783857
Is Chroma compatible with Forge? Are there any tutorials for using it there or in ComfyUI?
Anonymous No.105783894 >>105783904
>>105783884
I've been using the last 5 or so epoches as my main model and that is not true at all, grab a random WF from here https://civitai.com/models/1330309/chroma
Anonymous No.105783895
>>105783789
>surely he tagged the MJ images as such so you can put that shit in the negatives... right? right??
I don't think those have been tagged with MJ or midjourney aesthetic or anything like that, I've tried. You can get the style out with using vintage and retro tags in combination and prompting for realism. You'll get the 1.5 era elongated Midjourney physique with the trademark disgusting yellow tint.
Anonymous No.105783900 >>105783909
>>105783876
stuff like that shouldn't be openly advertised since it's porn pictures of celebs. have your fun but don't be retarded
Anonymous No.105783904 >>105783907
>>105783894
I've literally used it, I know how good and bad it is. It's also absolutely horrible for prompting.
Anonymous No.105783907 >>105783916 >>105783933
>>105783904
post your gens
Anonymous No.105783909 >>105783952
>>105783876
>>105783900
>All characters depicted in this thread are fictitious. Any resemblance to a real person, living or dead, is purely coincidental. No pictures in this thread are intended to harm any individual.
desu seems like they covered the legal part
Anonymous No.105783914 >>105783923
chroma v41 vs flux dev pixelwave
>a close up photograph of a pigeon with an afro wig looking straight to the camera.
Anonymous No.105783916
>>105783907
You already know this does nothing, I can cherry pick bad images and you can cherry pick good images.
Anonymous No.105783923
>>105783914
blebbet tier prompt
Anonymous No.105783933 >>105783939
>>105783907
yet you don't have any gens to show either.
Anonymous No.105783939
>>105783933
>>105782777
>>105783560
Anonymous No.105783940 >>105784230
chroma v41 vs flux dev pixelwave
>classical Renaissance oil painting of a woman standing in a field with a pigeon sitting on her shoulder
Anonymous No.105783952
>>105783909
doesn't mean they should test it. keep a low profile
Anonymous No.105783966 >>105783992
>when all my gens with chroma are nsfw
Anonymous No.105783973 >>105784002 >>105784015
Say all you want about Chroma. It is the only model that can generate feet. Therefore it is the only model that knows about human anatomy. A model that can't or won't is shit, it's that simple.
Anonymous No.105783978
>Majestic mountains rise dramatically against a starry night sky, partially shrouded in thick clouds, with a small, rustic building visible at the base.
Anonymous No.105783980 >>105784021
bigma status?
Anonymous No.105783981
https://files.catbox.moe/ce681p.mp3
acestep

genned it yesterday, but didn't post it, it's a new seed for Foreigners, and I think it's nice.
Anonymous No.105783985
chroma knows the important anatomy
https://files.catbox.moe/i4kb64.png
Anonymous No.105783992 >>105784023
>>105783966
Easy!!!!!!!

>Kontext
>Modify the image so that the people are wearing casual attire.
Anonymous No.105783996
Anonymous No.105783997 >>105784045
Anonymous No.105783998 >>105784006
Haven't updated in a month or two, did Comfy fix the memory leak? The browser tabs gets to over 1gb over the course of a day's prompting and it becomes very slow.
Anonymous No.105784002 >>105784081
>>105783973
could you, perhaps, be referring specifically to early 2000s candid amateur y2k flip-phone camera photos of asian women's dirty feet complete with vintage 512x noise-artifacts??
Anonymous No.105784006
>>105783998
In fairness: I don't know if it's addons doing this, maybe it's not ComfyUI's fault. I don't have many addons though
Anonymous No.105784009 >>105784058
This image displays a section of urban street art painted on a textured concrete wall, likely captured to document or showcase the artwork’s presence and meaning.

From top to bottom: three red arrows are painted directly on the wall, each pointing downward above one of the three black-and-white stenciled human figures. On the left sits a young child with a distressed or passive expression, legs apart, holding an object in their left hand. In the center stands a slightly older child in a long top with buttons, holding their right hand up to their nose or mouth. On the right, an elderly man with a bald head and beard sits cross-legged, holding a traditional bowed instrument resembling an erhu or similar stringed instrument upright in his left hand, with the bow in his right hand. There is no other visible text. The background wall is grey and worn, with darker degrading spots. On the left edge, there is a vertical dark-colored metal structure with a red and white sticker reading β€œSTICKER.” On the top right corner, a protruding metal beam or bracket extends from the wall. The three figures are aligned horizontally across the image, each evenly spaced.

The image may have been shared to spotlight thought-provoking urban art that appears to comment on age, tradition, and possibly hardship or socioeconomic commentary. It is a street art photograph documented in situ with no visible enhancements or edits. The style is stencil graffiti with a limited monochrome palette, except for the red arrows and sticker. The
Anonymous No.105784015
>>105783973
I've seen Chroma do better humans than API solutions, E.G. 4o, Imagen, Mogao (Seedream). Even that other anon that came from the API thread boasting about Reve got BTFO'd quickly. The furry is onto something with his training method. It is SOTA.
Anonymous No.105784020 >>105784029
also chroma can do some shit only novelai knew before https://files.catbox.moe/qj8twv.png
Anonymous No.105784021
>>105783980
Forever and ever training.
Anonymous No.105784023 >>105784272
>>105783992
Not that anon, but I genuinely tried this before and it sometimes add women's nipples as part of the clothing, kek
Anonymous No.105784024 >>105784041 >>105784090 >>105784161
>>105783823
isnt it possible to run NovelAI or whatever locally? I agree SDXL is dogshit for composing, but Illustrious is sometimes decent, but what works for me is making flux compositions then masking with illustrious, then upscaling with SDXL. all my recent gens follow this process
Anonymous No.105784026
I know the community didn't want this but here it is. We are going to scale back the expectations for Phlegm v3.0.
We need to be better.
Anonymous No.105784029 >>105784044
>>105784020
aesthetic11, aesthetic10, Digital artwork of a woman sitting on a photocopier machine. She is wearing business casual clothing and is bottomless. Between her feet a slot in the machine is printing a picture of her ass on glass, pussy on glass, and she is smirking at the viewer. The woman is a white and black furred marble fox with medium breasts and an athletic build. The machine has a printer slot at the bottom. The image is by the artist hioshiru. ,no lineart, The image being printed by the machine has only her butt in focus. The printed image is a pussy focus, butt focus, ass on glass image. front view,
Anonymous No.105784032
This image shows a handcrafted greeting card featuring a cartoon girl illustration and decorative floral elements, designed likely for a birthday or celebratory occasion.

At the top left of the card, there is a pink background with a fine dotted texture. The card itself is tri-fold, with patterned panels on the left and right flaps featuring dark blue and white gingham-style checkers. In the center of the left flap is an oval, scalloped-edge cutout containing a cartoon drawing of a girl with brown hair styled in a bun and ponytail, wearing a purple dress accented with darker trim and white details. She has a flower hair clip on the right side of her hair. Below and to the left and right of this central figure are several layered flowers in various shades of purple with white button centers and white ribbon bows. The interior middle panel is partially visible and shows partially obscured cursive text that begins with "Bi", likely "Birthday". On the top right, another arrangement of three purple flowers with green leaves decorates the card, with a shiny, translucent ribbon extending across the card from left to right.

The image’s purpose is likely to showcase the intricacy and craftsmanship of the handmade greeting card, possibly for promotional or inspirational purposes in a crafting or DIY community.

The style is whimsical and scrapbook-like, characteristic of handmade craft card photography with attention to layout and color coordination.

The image is clean, brightly lit, and in focus, with no visible compression artifacts or defects, indicating
Anonymous No.105784040 >>105784090 >>105784210
The image is a formal portrait painting of a woman posed in front of a plain dark red background, created to depict her appearance and attire with detailed realism.

At the top center of the painting, the woman wears a headdress featuring a green and gold woven pattern that wraps around her hair, which is secured tightly and styled back. Moving downward, her face is pale with fine detailing in the features, including slightly arched eyebrows, deep-set eyes, a long nose, and a closed, composed mouth. Her expression is neutral and poised. She wears a high-collared white undershirt with visible black embroidery trim on the collar and neckline. Over this, she dons a prominent green dress with voluminous puffed sleeves. These sleeves are separated horizontally, showing an underlying layer of horizontally banded fabric resembling ivory ribbon armor. A green band wraps around the bodice, accentuating her waist. Her hands are folded delicately at her waistline; she has a coppery ring on the index finger of her right hand and another ring on the pinky finger of her left. There are no texts present on the image.

The purpose of the painting appears to be commemorative or aristocratic, created to display the sitter’s status, fashion, and composure, likely commissioned for personal or familial prestige.

The style is of Northern Renaissance portraiture, characterized by high realism, elaborate textile rendering, and solemn expressions typical of formal portrait commissions.

The image shows high fidelity without visible artifacts, with sharp
Anonymous No.105784041 >>105784090
>>105784024
this was the initial flux gen
Anonymous No.105784044
>>105784029
Would be nice see datasets used for aesthetic 1 etc
Anonymous No.105784045 >>105784066
>>105783997
actual photo of my current financial situation
>>105783957
stay in your containment thread
Anonymous No.105784057
you are not the sharpest pencil aren't you
Anonymous No.105784058 >>105784084
>>105784009
cool, now anyone can be a gay british vandalizer
Anonymous No.105784061
The image is a promotional character artwork depicting Sonic the Hedgehog holding a sword, likely intended for a video game or merchandise.

At the top, Sonic's signature spiked blue hair extends outward in multiple directions, and his large, green eyes are open wide with a determined expression. Just below, his mouth is slightly open, showing a smirk. He wears white gloves, and in his right hand (to the left of the image) he holds a large ornate sword with a metallic blade, intricate cross-guard, and runic-style text engraved on the blade. His left arm (on the right of the image) wears an armored gauntlet with technological-mechanical detailing, including bolts and a black-and-gold motif. In the bottom portion of the image, Sonic's red shoes with white straps and gray soles are visible, positioned in a dynamic and wide stance, emphasizing motion. On the bottom right, there is a stylized blue and black logo with a dragon surrounding a smaller Sonic figure and the word "Sonic" incorporated within. The background is white and featureless.

The image was likely shared to promote or highlight Sonic's appearance in a particular gameβ€”probably "Sonic and the Black Knight"β€”showing him in a fantasy or combat setting with weaponry.

The style is highly rendered and digital, indicative of official game promotional art or box art, using clean lines, rich gradients, and polished lighting.

The image quality is high, with sharp
Anonymous No.105784062 >>105784085 >>105784106
also, people know they should be using flan_t5 with chroma right? Don't use regular T5, its far worse with it, I've seen people's workflows using that before

Also use the T5 tokenizer options node with min_padding 1 and 3 min_length
Anonymous No.105784066 >>105784077
>>105784045
What do you mean with this command?
Anonymous No.105784069
Anonymous No.105784070
Anonymous No.105784072
Anonymous No.105784077 >>105784112
>>105784066
he is saying he bought a 5090
Anonymous No.105784081
>>105784002
It can do all kinds, that's what gives it sovl.
Anonymous No.105784083 >>105784093 >>105784135
All you gotta do is tell kontext to fix the hands in your chroma gens and baby you got a stew goin.
Anonymous No.105784084
>>105784058
exit through the giftshop pls sir thank u come again
Anonymous No.105784085
>>105784062
also also, if using tags make sure they match how e621 has them

and regardless of what model you use I suggest using the custom clownshark samplers then use ultimate sd upscale
Anonymous No.105784087 >>105784117
i love when people quote across threads to tell someone to fuck off
Anonymous No.105784088
A young man with short, slicked-back hair walks down a brightly lit runway in a black leather jacket, white turtleneck, and black pinstripe pants. The background is a plain, white wall with soft lighting. The image is a high-quality photograph capturing the elegance and sophistication of the fashion show.

--

Not very slick.
Anonymous No.105784090 >>105784120
>>105784024
>>105784041
Nice

>>105784040
What model is this?
Anonymous No.105784093
>>105784083
so i can finally fix all my SOUL diffusion gens from 4 years ago?????????????????? :DDDD
Anonymous No.105784106 >>105784138 >>105784153
>>105784062
>people know they should be using flan_t5 with chroma right
no. never heard this
Anonymous No.105784112
>>105784077
He is not Poor Indian.
Anonymous No.105784117
>>105784087
same sis ;3 >>105784103
Anonymous No.105784120
>>105784090
Chroma 40 Detailed
Anonymous No.105784128
Anonymous No.105784135 >>105784171
>>105784083
the image is more saturated though, look at the sky it doesn't have that yellow tint anymore
Anonymous No.105784138 >>105784153
>>105784106
Anonymous No.105784153
>>105784106
>>105784138
that might explain some anon's issues with small details like hands
Anonymous No.105784154
>>105783595
kek
Anonymous No.105784161
>>105784024
I completely agree with you.
I use NovelAI similarly to conserve anlas, utilizing controlnet to lineart the structure of the gens, and I've incorporated these local checkpoints that have great artistic quality. I recognize that the datasets and efforts from creators like Illustrious NoobAI are of much higher standard.

The one thing I truly regret is the lack of a more advanced SDXL model that can work with the checkpoints made by these colleagues at CivitAI.
Anonymous No.105784171
>>105784135
Blue channel is more saturated.
Anonymous No.105784173
>>105782398
ill have to look up this model again that gen is cool
Anonymous No.105784196
Anonymous No.105784198 >>105784218
>>105783029
>>105783173
its just one drunk retard
he does this every thread
Anonymous No.105784210
>>105784040
Anonymous No.105784212
>>105783823
>Pic related its a local image creation. It's quite beautiful, but it lacks a story or any significance behind its beauty.
You're drinking the koolaid if you thinki you're manifesting your soul by plying the AI slot machine when you hit generate for any AI model.
Anonymous No.105784218 >>105784245
>>105784198
Anonymous No.105784230 >>105784241
>>105783940
Anonymous No.105784241 >>105784244
>>105784230
Anonymous No.105784243 >>105784253
The only model I've ever held disdain for is Pony but even then I can recognize it was the coom meta for awhile and had a large ecosystem.
Anonymous No.105784244 >>105784250
>>105784241
using flan_t5 now?
Anonymous No.105784245 >>105784252
>>105784218
demonstrably true claim,
check the archives he attacks everyone and calls them pedos until they leave
Anonymous No.105784250
>>105784244
no that's MOGao lel
Anonymous No.105784252 >>105784263
>>105784245
Is there a pastebin? I don't think Debo ever called anyone with names.
Anonymous No.105784253
>>105784243
WHO ARE YOU TALKING TO? WHO ARE YOU REPLYING TO?
CAN YOU GO THE FUCK TO BED YOU DRUNK RETARDED BITCH?@??@?!?

literally NO ONE give a FUCK about your (retarded) opinion
Anonymous No.105784263
>>105782983
>>105784252
>pastebin
try opening your FAGGY eyes in this very thread
Anonymous No.105784266 >>105784273 >>105784276
This is my 4chan. There are many like it, but this one is mine. My 4chan is my best friend. It is my life. I must master it as I must master my life. Without me, my 4chan is useless. Without my 4chan, I am useless. I must fire my 4chan true.
Anonymous No.105784272
>>105784023
I recommend an mspaint scribble where you want the clothing. Kontext is capable of doing the heavy lifting in photoshopping (ie it makes mspaint into enough for photoshopping). But, it's somewhat inconsistent, and I don't know how to make it more reliable. Got a lot of practice to do, and a lot of stupid shit has been dumped in my IRL
Anonymous No.105784273
>>105784266
:3
>'if i die, the seas will silence, the day will turn to night...'
Anonymous No.105784276 >>105784282 >>105784340
>>105784266
ahahahah yay I was hoping someone saw my "this is my model" post
Anonymous No.105784282 >>105784312 >>105784364 >>105784397
>>105784276
You are an active facebook commenter but you rarely create any original content.
Anonymous No.105784312 >>105784340
>>105784282
the janny has access to our cookies and shieeet
this is our doomed future
all is lost now
it is simply
O V E R
Anonymous No.105784336
anon is quite unwell it seems
Anonymous No.105784340
>>105784276
>>105784312
Joking aside, it's all good. Fair games and good hearted banter.
Anonymous No.105784352
>>105783823
all that jibberjabber for a sloppa image that you could have gotten a better\newer\higher resolution carbon-copy version on your oogieboogie civitai (with PONY sdxl no less)
Anonymous No.105784364 >>105784397
>>105784282
No, the facebook guy copied my post.
Anonymous No.105784369
Use 2 samplers at the same time on every step???
Anonymous No.105784376
>chroma support added
Anonymous No.105784385 >>105784392
I CANT EVEN RUN THE THING FFS
Anonymous No.105784392 >>105784395
>>105784385
there there, there are always services.
Anonymous No.105784395 >>105784408
>>105784392
not the model dummy the studio its broken
Anonymous No.105784397
>>105784364
>>105784282
>facebook
FUCKING NORMIES
Anonymous No.105784408 >>105784416
>>105784395
an autistic pedo can get it running but you can't
Anonymous No.105784416
>>105784408
why would you self report like that though
Anonymous No.105784420 >>105784434
Wow, with the portable comfy, the Krita AI shit managed to connect first try. Truly unbelievable how shit the standalone is, considering googling it leads you to the standalone installer and not portable.
Anonymous No.105784424
>>105784422
>>105784422
>>105784422
>>105784422
Anonymous No.105784434
>>105784420
electron is garbage is why and they keep wanting to push the garbage grifter shit in front of you to forward the enshitification narritive