Thread 106577883

323 posts 158 images /g/

Anonymous 9/14/2025, 12:13:08 AM No.106577883 >>106578232 >>106579138

/ldg/ - Local Diffusion General

highlights_g_106575437_1757801240_thumb.jpg.webm md5: 66140c9e... 🔍

Trying to Piss You Off Edition

Discussion and development of local image and video models and UI

Prev: >>106575437

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo

Anonymous 9/14/2025, 12:14:25 AM No.106577893 >>106577937 >>106578095

00201-1863256780.png md5: acc88df2... 🔍

I have stabilized a style most likely

Anonymous 9/14/2025, 12:14:48 AM No.106577897 >>106577991 >>106578602

1757736090814-7637c756-215c-4c18-ab17-10f46500cb5d2.jpg md5: fafcb978... 🔍

Anonymous 9/14/2025, 12:15:35 AM No.106577906 >>106577960

>>106577885
>You don't seem to understand what a base model is, it's a model made to have as little bias as possible and know of as many concepts as possible so it can be used as a BASE for further finetuning
can someone put the post of the anti chroma anon saying "you are here" and at the end it was, "we can save it with finetune though" or something like that

Anonymous 9/14/2025, 12:18:40 AM No.106577937

>>106577893
so just slopstyle?

Anonymous 9/14/2025, 12:18:53 AM No.106577941 >>106577967 >>106578630 >>106578662

https://www.alphaxiv.org/overview/2509.07295v1

Anonymous 9/14/2025, 12:19:11 AM No.106577943 >>106577955 >>106578006 >>106578014 >>106578022 >>106578095

1751741009725793.png md5: 197adfc9... 🔍

>>106577796
>NOOOO YOU CAN'T JUST TELL THE MODEL WHAT YOU WANT AND GET WHAT YOU WANT USING A HIGHLY CONCISE FORMAT!!! YOU HAVE TO BOOMERPROMPT!!!

>>106577809
I find chroma most useful for actual artistic stuff (which none of the chromaschizos can even conceive of, all they do is generate asian waifus and feet), it's just disappointing that it's so much weaker than it should be. The ROI on prompting effort is much lower than Noob or Qwen, but it is true it can do things no other local model can do at the moment.

>pic
The truth is that we're going to free ourselves from this shit by baking our own models from scratch, using techniques like those in https://huggingface.co/KBlueLeaf/HDM-xut-340M-anime. Architectural changes and training optimizations are going to make it possible to train fully unslopped, uncensored, DEBLOATED models with Qwen-level comprehension and far fewer parameters, with local or rented GPUs on a budget <$10k very soon. It may already be possible.

>>106577820
>>106577885
>Pony and Noob are large finetunes with TONS of sexual positions.
>Chroma is a base model, like SDXL, Flux, QWen, as such it's not specifically focused on anything but knows some of practically everything
This makes no sense whatsoever. It was trained on e621 data right? That data has sex position tags does it not?? So why doesn't chroma?

>>106577829
Nothing wrong with that, but it should have been trained on tags alongside/interchanged with the captions. Seems it wasn't.

Anonymous 9/14/2025, 12:19:32 AM No.106577948 >>106577983 >>106578346

>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
AAAAAAAAAAAAAAAAAAAAa

Anonymous 9/14/2025, 12:20:37 AM No.106577955

>>106577943
>another wall of text
what's wrong with you dude?

Anonymous 9/14/2025, 12:21:04 AM No.106577959 >>106578022

Once again there's never been a successful model that's retagged *booru images with NLP slop.

Anonymous 9/14/2025, 12:21:06 AM No.106577960 >>106577979

>>106577906
Desperately moving the goal post again

It was always presented as a base model, a de-distilled and de-slopped uncensored version of Flux Schnell, which is also a base model, just like SD1.5, SDXL, SD3, Flux, Qwen, Wan

And just like with all these models, if you want it to be really good for a specific concept, you need finetunes/loras

Anonymous 9/14/2025, 12:22:15 AM No.106577967 >>106578045

>>106577941
> A 1.5B-parameter model with RecA achieved state-of-the-art results on image generation benchmarks like GenEval (0.86) and DPGBench (87.21) with only 27 A100 GPU-hours
sounds impressive? they show 0 images though, wtf?

Anonymous 9/14/2025, 12:23:18 AM No.106577979 >>106578000

stop.png md5: b997c270... 🔍

>>106577960
>just 2 more finetunes bro
just let it go, it's over

Anonymous 9/14/2025, 12:23:31 AM No.106577983 >>106578324

>>106577948
> he can't matmul
ngmi

Anonymous 9/14/2025, 12:24:26 AM No.106577991 >>106577999

file.jpg md5: 3d850118... 🔍

>>106577897
giwtwm

>>106577878
thanks but it came out a turd

>>106577886
yeah forgot what the prompt was but i think it did have it in there

Anonymous 9/14/2025, 12:25:12 AM No.106577998 >>106578095

file.png md5: b7f0a7a5... 🔍

Anonymous 9/14/2025, 12:25:17 AM No.106577999

>>106577991
>it came out a turd
yeah, the proportions are all fucked up lol

Anonymous 9/14/2025, 12:25:23 AM No.106578000 >>106578006 >>106578020

>>106577979
He always post off topic images in the thread, same pattern no deviation and you people keep fucking replying to him

Anonymous 9/14/2025, 12:26:58 AM No.106578006 >>106578033 >>106578034

>>106578000
>same pattern no deviation
you mean the wall of texts? + the overusage of the world "shill"?
>>106577943
>>106577527
>>106577373

Anonymous 9/14/2025, 12:28:31 AM No.106578014 >>106578035

>>106577943
>This makes no sense whatsoever. It was trained on e621 data right? That data has sex position tags does it not?? So why doesn't chroma?
What part of focus don't you understand ?

If you train on a shit ton of images with no particular biases, then it will not learn particular biases as well as if you train on a shit ton of images with particular biases

It's not rocket science, the model learns through pattern recognition and repeats, if one training has 50k images of fetish X and 5 million images of other stuff, and the other training has 200k images of fetish X and 1 million images of other stuff, the latter model will learn fetish X much better

ポストカード !!FH+LSJVkIY9 9/14/2025, 12:29:42 AM No.106578020

>>106578000
:c

what do you mean "YOU people"???

Anonymous 9/14/2025, 12:29:43 AM No.106578022 >>106578095

1750492740684917.png md5: 34b05cde... 🔍

>>106577959
>>106577943
I'm not a baker, but here's what seems obvious to me:
Make three caption sets:
>Pure NL captions
>NL captions that are infused with tag keywords by telling the VLLM to make sure to include the image's tags in the NL description
>plain tags
Then when training, just include each image three times, one time for each version of the caption. Or concatenate the caption sets in a randomized order.

Why wouldn't this work? Why don't bakers do this?

Anonymous 9/14/2025, 12:29:46 AM No.106578024

local
chads
eatin
gud

Anonymous 9/14/2025, 12:30:04 AM No.106578026 >>106578074 >>106578104

there is no excuse for a model as big as chroma to not fit in every drop of knowledge from every booru out there with plenty of room to spare. it could fit 4 SDXLs in it. it's simply bad

Anonymous 9/14/2025, 12:30:08 AM No.106578027 >>106578065 >>106578072 >>106578096

still down to train an interesting lora to post here

>>106577496
as long as the artist style is not shit, sure

Anonymous 9/14/2025, 12:30:44 AM No.106578033

00203-1863256781-ad-before.jpg md5: 24e2ffbc... 🔍

>>106578006
Autistic people connect the problem is the one shilling for non free shit has no reason to be in this thread. He's just rattling the cage hoping to trigger people

Anonymous 9/14/2025, 12:31:11 AM No.106578034 >>106578054

>>106578006
>He always post off topic images in the thread
kek, never posted anything not AI generated here

>the overusage of the world "shill"?
can't be overuse when it's 100% correct

Anonymous 9/14/2025, 12:31:14 AM No.106578035 >>106578136

>>106578014
Yet chroma learned a ton of concepts that weren't in Schnell. One obvious example, it can do genitals. How is learning basic sex positions harder than learning genitals, which weren't in Schnell at all??

Anonymous 9/14/2025, 12:31:17 AM No.106578036 >>106579818

I can't believe there is still someone here desperately trying to convince people that Chroma is the future. It's done. It's shit.

Anonymous 9/14/2025, 12:31:24 AM No.106578039

G0wjL1vXEAAt-a_.jpg md5: c0d56535... 🔍

Anonymous 9/14/2025, 12:31:44 AM No.106578045 >>106578069

>>106577967
I shared the overview link. click Paper up at the top.

Anonymous 9/14/2025, 12:32:40 AM No.106578054

>>106578034
see, you can argue without making a wall of text, that's way more pleasing to the eyes, thank you

Anonymous 9/14/2025, 12:34:15 AM No.106578065 >>106578089 >>106578142

>>106578027
80s, 90s, 2000s movie / tv show styles are always appreciated, also not that hard to caption

Anonymous 9/14/2025, 12:34:42 AM No.106578069

1730803754554847.png md5: d7f4dd74... 🔍

>>106578045
oh ok, thanks
https://arxiv.org/pdf/2509.07295

Anonymous 9/14/2025, 12:34:57 AM No.106578072

>>106578027
im uploading a couple of datasets if you give me a sec ill post em

Anonymous 9/14/2025, 12:35:35 AM No.106578074 >>106578128

>>106578026
You are so retarded it's not even funny

Sad that you comment so much but know absolutely nothing about AI training

Anonymous 9/14/2025, 12:37:37 AM No.106578089

facts.png md5: 527a972a... 🔍

>>106578065
>90s, tv show styles are always appreciated

Anonymous 9/14/2025, 12:38:26 AM No.106578095 >>106578119

>>106577998
>>106577943
Please post in /adt/ we need you
>>106577893
you to also!
>>106578022
please post your good gens in /adt/!

Anonymous 9/14/2025, 12:38:35 AM No.106578096 >>106578142

>>106578027
https://www.mediafire.com/folder/enj83lxxnq1ih/datasets

Anonymous 9/14/2025, 12:39:46 AM No.106578104

WHERE ARE THEY.jpg md5: b7c14160... 🔍

>>106578026
>there is no excuse for a model as big as chroma to not fit in every drop of knowledge from every booru out there
he said on reddit that the artist tags were gonna be there, it didn't happen

Anonymous 9/14/2025, 12:39:59 AM No.106578107

rocketjeet is so desperate, it's pathetic

Anonymous 9/14/2025, 12:41:45 AM No.106578119 >>106578310

1742907517617710.jpg md5: 8114cef6... 🔍

>>106578095
/adt/ endorses API models, so no.

Anonymous 9/14/2025, 12:42:41 AM No.106578128 >>106578160

>>106578074
>nooo, you don't understand, SDXL (3.5b) had no issue getting all the booru tags, but Chroma (8.9b) just can't do itttttttt
or else this anon has 50 of IQ, or else we're dealing with lodestone there, there's no way he's not trolling right? I refuse to believe someone can be this dumb

Anonymous 9/14/2025, 12:43:41 AM No.106578136

>>106578035
NTA but just with that example almost every image depicting sex is gonna have genitals while a small subset might depict a particular sex act. Anyway I think a lot of chroma's issues are the unconventional way it was trained more than the dataset.

Anonymous 9/14/2025, 12:44:35 AM No.106578142 >>106578156 >>106578172

>>106578065
https://huggingface.co/silveroxides/Chroma-LoRA-Experiments/tree/main
has 1980s and 2000s lora that work

>>106578096
dataset usually means with captions but i'll try. they seem a little too abstract for even chroma but i guess we'll see

Anonymous 9/14/2025, 12:46:41 AM No.106578156 >>106578162

>>106578142
>dataset usually means with captions
i can upload mine if youd like but theyre really just shitty joycaption "danbooru-like" tags. i wouldnt think theyd do well with chroma

Anonymous 9/14/2025, 12:46:55 AM No.106578160 >>106578168

>>106578128
>SDXL (3.5b) had no issue getting all the booru tags
Just stop lying, they didn't get all the booru tags, the FINETUNES of SDXL did, because they were FOCUSED on booru content

This is so tiresome

Anonymous 9/14/2025, 12:47:32 AM No.106578162 >>106578219

>>106578156
i trained a booru only dataset and it worked fine. send em

Anonymous 9/14/2025, 12:48:15 AM No.106578168 >>106578183 >>106578489 >>106579247

>>106578160
Chroma doesn't know a single booru tag, and you find this normal? this motherfucker trained the model with booru images

Anonymous 9/14/2025, 12:48:49 AM No.106578172 >>106578539

>>106578142
>has 1980s and 2000s lora that work
I'm sure they work but how effective are they given how old they are with all the training that has happened since ?

I mean initially Flux loras worked well, but that quickly changed as training progressed and the underlying models diverged

Anonymous 9/14/2025, 12:49:46 AM No.106578176 >>106578218

lodestone booru.png md5: 60193dd6... 🔍

why do chromakeks not even understand their own model? the fact that it was trained on booru images/tags but hasn't learned any artist tags or characters is quite concerning. perhaps anti-chroma 'schizos' were right that it was trained wrong and schnell is a mess of a base model

Anonymous 9/14/2025, 12:50:20 AM No.106578183 >>106578209

>>106578168
>trained the model with booru images
Captioned by Gemini

Please stop being so goddamn retarded

Anonymous 9/14/2025, 12:50:22 AM No.106578184 >>106578223

1757803397314-5dfd793c-5c06-4405-ada6-b45eb731449e2.jpg md5: 92a3fe2c... 🔍

Anonymous 9/14/2025, 12:51:53 AM No.106578201

>seedream paid jeet force gets clowned on in the thread
>unprompted random seethe spam about local models and chroma in the next thread
ooooooooooooooooooooo im nooooticiiing

Anonymous 9/14/2025, 12:52:22 AM No.106578209 >>106578218 >>106578252 >>106578288

read retard.png md5: b047d55a... 🔍

>>106578183
you are so fucking retarded it's exhausting
https://www.reddit.com/r/StableDiffusion/comments/1j4biel/comment/mg81j11/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

Anonymous 9/14/2025, 12:52:28 AM No.106578210 >>106578244 >>106578267

>chroma is shit
>t-the seedream shills caused this!
LMAO!

Anonymous 9/14/2025, 12:53:12 AM No.106578218 >>106578593

>>106578176
>>106578209
ok, so the furry did train on tags? then what went wrong??

Anonymous 9/14/2025, 12:53:37 AM No.106578219 >>106578539

>>106578162
okay all are uploaded now, the garmash ones are fresh
>they seem a little too abstract for even chroma
i was surprised at how well noob took to them so i can only hope chroma will be absolute kino, if its anywhere as good as what i saw anon post when flux dev loras were new
looking forward to seeing yours

Anonymous 9/14/2025, 12:53:42 AM No.106578223

>>106578184
>visual representation of all the errors holding me down from training flux

y-you too..

Anonymous 9/14/2025, 12:53:48 AM No.106578224 >>106578275

chroma can't do artist tags:
>IT WASNT TRAINED LIKE THAT ITS A BASE MODEL NOT A BOORU FINETUNE!!
it was trained on booru art:
>WELL IT WAS RECAPTIONED TO BE MORE ACCURATE TO NATURAL LANGUAGE
he said himself he preserved the tags:
>SEEDREAM SHILL!!!!

Anonymous 9/14/2025, 12:54:41 AM No.106578232

>>106577883 (OP)
>Trying to Piss You Off Edition
Anons are still falling for it

Anonymous 9/14/2025, 12:56:27 AM No.106578244

>>106578210
This goy knows whats up

Anonymous 9/14/2025, 12:57:04 AM No.106578252 >>106578269

>>106578209
>noob and illustrious above pony because of "thousands of artist" tags
absolute bullshit

Anonymous 9/14/2025, 12:57:56 AM No.106578267 >>106578285

>>106578210
kek, I blame Comfy a little bit though, he didn't implement it right and his ego is to fragile to admit it
https://github.com/comfyanonymous/ComfyUI/pull/7965

Anonymous 9/14/2025, 12:58:22 AM No.106578269 >>106578305 >>106578326

>>106578252
do you still use the weird chink tagmine spreadsheet website kek does anon still update it?

ポストカード !!FH+LSJVkIY9 9/14/2025, 12:58:28 AM No.106578270 >>106578303 >>106578884

reani.jpg md5: 6c47ce93... 🔍

one eight four

i got banned so many times for showing just a BIT of panties, just to clear the air, to announce to the room so to speak :p

aaaaaaaaaaaaaaaaaaaanyways

Anonymous 9/14/2025, 12:58:30 AM No.106578271

>it was unironically the pony baker defending chroma's lack of artist tag this entire thread
hooollyyy shit that's pathetic

Anonymous 9/14/2025, 12:58:56 AM No.106578275 >>106578288

>>106578224
>he said himself he preserved the tags
Well he clearly didn't, or he preserved very selectively

Go ask him in the Discord channel or on reddit if you are considering suicide because your favorite booru tags aren't in Chroma

Anonymous 9/14/2025, 12:59:02 AM No.106578278 >>106578290 >>106578308 >>106578313 >>106578328 >>106578368

Speaking of chroma, Chroma radiance is now officially on ComfyUi
https://github.com/comfyanonymous/ComfyUI/pull/9682

Anonymous 9/14/2025, 12:59:42 AM No.106578285 >>106578336

>>106578267
>comfy can't fix the ~100 or so lines of code for chroma
>can implement 1200 lines for Seedream 4 overnight
maybe seedream DID cause this after all

Anonymous 9/14/2025, 1:00:02 AM No.106578288 >>106578325

>>106578275
>Well he clearly didn't
he promised he would've preserved them though ;-; >>106578209

Anonymous 9/14/2025, 1:00:14 AM No.106578290

>>106578278
how do we use it? i have the gguf already downloaded but it errors in nag workflow

Anonymous 9/14/2025, 1:00:52 AM No.106578298

>the absolute backpedaling
massive defeat for chromashills today

Anonymous 9/14/2025, 1:01:12 AM No.106578303 >>106578322

>>106578270
Didn't think the mods where that anal, particularly for stylistic stuff

I mean practically all of /a/ would be banned otherwise

Anonymous 9/14/2025, 1:01:23 AM No.106578305 >>106578326

>>106578269
yes, i still use it. I don't know if it's still being updated

Anonymous 9/14/2025, 1:01:50 AM No.106578308

>>106578278
Is it good doe?

Anonymous 9/14/2025, 1:01:52 AM No.106578310 >>106578779

>>106578119
But your electricity and internet are also pay as you go API SaaS services. Stop with the dumb ideology and post your exelent gens in /adt/.
We need you!

Anonymous 9/14/2025, 1:02:07 AM No.106578313

>>106578278
Is there quants available yet?

Anonymous 9/14/2025, 1:03:42 AM No.106578322

>>106578303
catbox next time bb <3
+post part of the gen cropped or somethin hehe

Anonymous 9/14/2025, 1:04:06 AM No.106578324 >>106578818

>>106577983
>mat1 and mat2 shapes cannot be multiplied (2x2304 and 2816x1280)
I hate image generation

Anonymous 9/14/2025, 1:04:14 AM No.106578325 >>106578339 >>106578400

>>106578288
He broke some promise made in some reddit post half a year ago, you should sue him for releasing this free model without your favorite booru tags!

Anonymous 9/14/2025, 1:04:15 AM No.106578326

>>106578269
>>106578305
I actually made a wildcards file with them in it so I can roll different pony styles with my seed.

Anonymous 9/14/2025, 1:04:28 AM No.106578328 >>106578349

>>106578278
and we train it ... how?
Chroma is shit without training it.

Anonymous 9/14/2025, 1:04:58 AM No.106578336

>>106578285
Shut the fuck up already chud

Anonymous 9/14/2025, 1:05:22 AM No.106578339 >>106578352

>>106578325
>in some reddit post
you mean the official chroma announcement post? he only made 2 posts, one where he announced chroma, and the 2nd one is when he finished it

Anonymous 9/14/2025, 1:06:03 AM No.106578346

1737270552163287.png md5: f645b6b8... 🔍

>>106577948
just do the calc manually bro

Anonymous 9/14/2025, 1:06:28 AM No.106578349

>>106578328
By ignoring it. Radiance is a meme for now. Also normal loras work on it anyway. So train on base and use later on any other chroma version.

Anonymous 9/14/2025, 1:06:48 AM No.106578352 >>106578369

>>106578339
OMG! He can't get away with this!

Anonymous 9/14/2025, 1:07:03 AM No.106578355 >>106578441

ComfyUI_14369.png md5: 75c5b61b... 🔍

>>106576546
>a whole dataset based on Flux gens
Yikes! I added about 20 synthetic images to my dataset and it slopped up my LoRA something fierce. Too bad they didn't share any prompts/settings for their images, I want to see how they were using Flux. It can either look really good with the right settings, or be very limited with just the basic options in use.

Anonymous 9/14/2025, 1:08:04 AM No.106578368

>>106578278
Has it finished training or is it still ongoing ?

Also what's with the Chroma 2k thing, what's that about ?

Anonymous 9/14/2025, 1:08:08 AM No.106578369

based.png md5: 6e20b256... 🔍

>>106578352
>He can't get away with this!
based

Anonymous 9/14/2025, 1:10:31 AM No.106578386 >>106579373

ComfyUI_00008_.png md5: 3bf06774... 🔍

Anonymous 9/14/2025, 1:11:54 AM No.106578400 >>106578482

hmmm.png md5: af2de968... 🔍

>>106578325
>He broke some promise made in some reddit post half a year ago
when will he refund the guys who supported him on ko-fi because they believed there will be artist tags though? that's false advertisment

Anonymous 9/14/2025, 1:14:52 AM No.106578429

God bless ComfyUI API nodes

Anonymous 9/14/2025, 1:15:45 AM No.106578437 >>106578476

00252-213288062-ad-before.jpg md5: abc76726... 🔍

Anonymous 9/14/2025, 1:16:11 AM No.106578441 >>106578606

>>106578355
The only legitimate use case is if there aren't enough real images of for example a specific object, that being said it's still not worth it due to the massive slopping

Anonymous 9/14/2025, 1:17:50 AM No.106578462

00246-213288060-ad-before.jpg md5: f73fce84... 🔍

Anonymous 9/14/2025, 1:19:40 AM No.106578475

00241-213288058.png md5: 5cc7ad7c... 🔍

Anonymous 9/14/2025, 1:19:49 AM No.106578476 >>106578481

>>106578437
It should read
>don't get mad
>get better
prompt adherence issue?

Anonymous 9/14/2025, 1:20:41 AM No.106578481

>>106578476
Seems that way, they can't all be winners plus this prompt is ass going to make something else

Anonymous 9/14/2025, 1:20:41 AM No.106578482 >>106578498

>>106578400
Perhaps if they ask for refunds

You didn't pay shit so why are you complaining ? Get a life

Anonymous 9/14/2025, 1:21:38 AM No.106578489 >>106578505 >>106578515

>>106578168
Point me to a single Flux-based model that can do artists.

Anonymous 9/14/2025, 1:21:57 AM No.106578493 >>106578499

Is there an "official" chroma 2k workflow. Using the original chroma offical workflow on it show awful artifacts when I got to higher resolution (2k by 1k for instance).

Anonymous 9/14/2025, 1:22:42 AM No.106578498 >>106578512 >>106578547

>>106578482
>You didn't pay shit
how do you know that? you saw it in your dream?

Anonymous 9/14/2025, 1:22:43 AM No.106578499 >>106578509 >>106578566

>>106578493
How many steps?

Anonymous 9/14/2025, 1:23:47 AM No.106578505 >>106578609 >>106578628

>>106578489
Are you assuming that flux can't learn artists?

Anonymous 9/14/2025, 1:24:02 AM No.106578509

>>106578499
Tried 26 and 50.

Anonymous 9/14/2025, 1:24:25 AM No.106578512 >>106578524

>>106578498
>saw it in your dream
dont fall for the subtle shill tactics, they are trying to plant a seed. stay vigilant, localbros

Anonymous 9/14/2025, 1:24:41 AM No.106578515 >>106578525 >>106578536

>>106578489
that's basically the problem with all modern models. They're all empty. If you wanna prompt something you have to train it yourself.
They're shit.

Anonymous 9/14/2025, 1:25:37 AM No.106578524

>>106578512
kek, that wasn't intended I swear!

Anonymous 9/14/2025, 1:25:43 AM No.106578525 >>106578542 >>106578579

>>106578515
Seedream just works

Anonymous 9/14/2025, 1:26:43 AM No.106578536

facts.png md5: 4954c498... 🔍

>>106578515
>that's basically the problem with all modern models. They're all empty. If you wanna prompt something you have to train it yourself.
>They're shit.
amen

Anonymous 9/14/2025, 1:27:01 AM No.106578539 >>106578607

file.png md5: 50650c29... 🔍

>>106578172
the loras work fine. i use them with flash and HD chroma.

only some flux loras work but most do indeed not work properly anymore even if they do influence the output.

>>106578219
shit hoster, see image. use gofile or catbox. the fact anyone still uses mediafire in 2025 is surprising

Anonymous 9/14/2025, 1:27:39 AM No.106578542

>>106578525
for your generic stock images sure, but let's see some art by greg rutkowski

Anonymous 9/14/2025, 1:28:17 AM No.106578547

>>106578498
>how do you know that?
lel

Anonymous 9/14/2025, 1:30:59 AM No.106578566

>>106578499
Never mind I think it was the lora I was using.

Anonymous 9/14/2025, 1:31:00 AM No.106578567 >>106578653

>>106577787
Imagine being so retarded you can't even understand the advantages of boomer prompting. It doesn't fit into your tiny head that the model that can only understand basic tags is a nightmare to customize to get what you actually want if you want to have any control over the output. SDXL/Noob and all booru-based models suffer from terrible prompt bleed and poor prompt following.

Anonymous 9/14/2025, 1:31:41 AM No.106578578

chromakek copeymelty

Anonymous 9/14/2025, 1:31:48 AM No.106578579

>>106578525
>Generic slop, three generic artstyles
>Seedream just works
I mean, it's slightly better than GPT piss filter and one pose, but that's a VERY low bar

Enjoy your fisher price toy

Anonymous 9/14/2025, 1:33:22 AM No.106578589 >>106578681

You xboxtards don't understand the brilliance of our PS3 goodness!

Anonymous 9/14/2025, 1:33:52 AM No.106578593

>>106578218
>furry
you answered your own question.

Anonymous 9/14/2025, 1:35:00 AM No.106578602

>>106577897
api is getting mighty based. this would take 10 hours on local

Anonymous 9/14/2025, 1:35:27 AM No.106578605 >>106578681

Sega does what nintendon't!

Anonymous 9/14/2025, 1:35:43 AM No.106578606

>>106578441
I do still use synthetic images in my dataset (mostly detailed close-ups), but they had to be extremely scrutinized before only the best of the best got further touched up in PS, and they're weighted much lower than the rest of the images. Just going full Flux sounds like a recipe for disaster.

Anonymous 9/14/2025, 1:35:46 AM No.106578607

>>106578539
please excuse my retardation
schauer: https://files.catbox.moe/1popxh.zip
garmash: https://files.catbox.moe/y52mpi.zip
shejtano: https://files.catbox.moe/y38p84.zip

Anonymous 9/14/2025, 1:36:00 AM No.106578609 >>106578628

>>106578505
Not without a massive Chroma style finetune. Obviously it would have to be magnitudes more. Its lack of artist knowledge is very hard baked into the model, and the same applies to Qwen etc... anything that's greater than 2B parameters is pretty hard to teach from scratch. SD 3.5 did it right in that it knew its artists from the start, though it had its drawbacks.

Anonymous 9/14/2025, 1:37:03 AM No.106578617

https://youtu.be/ppDnmrgtHBk

Anonymous 9/14/2025, 1:38:20 AM No.106578628

>>106578505
>>106578609
Anyways, it's not that Chroma, nor Flux, can't learn artists. But attention is overwhelmed by prompt following. Unless an architecture handles styles seperately, it will only learn styles through LoRAs and tunes that focus solely on those styles. Flux is already SOTA at styles that have been trained by the community, and obviously so is Chroma.

Anonymous 9/14/2025, 1:38:29 AM No.106578630

>>106577941
I'm not smart enough to understand any of the details, is this a big deal or something? you just linked this paper without elaborating further

Anonymous 9/14/2025, 1:38:38 AM No.106578631

well.jpg md5: 5b47fecd... 🔍

>my dad works at lodestones

Anonymous 9/14/2025, 1:38:46 AM No.106578632 >>106578684 >>106578686

how organic

Anonymous 9/14/2025, 1:39:23 AM No.106578637

you retards, my team is better than your team!

Anonymous 9/14/2025, 1:39:53 AM No.106578641

only one model generates at 4k though

Anonymous 9/14/2025, 1:41:54 AM No.106578651 >>106578678 >>106578693

/ldg/: 132 / 28 / 1
/adt/: 31/ 138 / 6

Anonymous 9/14/2025, 1:42:01 AM No.106578653 >>106578673 >>106578718

>>106578567
Neta Lumina handles plain tag prompts and has less concept bleed than chroma.

Anonymous 9/14/2025, 1:43:11 AM No.106578662 >>106578799 >>106579083

1751576523149611.png md5: 02ab9db4... 🔍

>>106577941
https://reconstruction-alignment.github.io/
>We introduce Reconstruction Alignment (RecA), a resource-efficient post-training method that leverages visual understanding encoder embeddings as dense "text prompts," providing rich supervision without captions. Concretely, RecA conditions a UMM on its own visual understanding embeddings and optimizes it to reconstruct the input image with a self-supervised reconstruction loss, thereby realigning understanding and generation.
that's impressive wtf

Anonymous 9/14/2025, 1:44:52 AM No.106578673 >>106578718

>>106578653
Yes, also it takes the quintuple to generate

Anonymous 9/14/2025, 1:45:21 AM No.106578678

>>106578651
if I just spammed every retarded seed variation I get I could fill this thread up with images too

Anonymous 9/14/2025, 1:45:45 AM No.106578681

>>106578589
>>106578605
They're more like
>Stadia is superior to owning a console

Anonymous 9/14/2025, 1:46:08 AM No.106578684

>>106578632
He's just going to do this until his caretaker takes his internet away

Anonymous 9/14/2025, 1:46:49 AM No.106578686

>>106578632
roaches never die

Anonymous 9/14/2025, 1:47:36 AM No.106578693 >>106578837

>>106578651
adt is 144 / 33 why would you lie about something so easily verifiable kek

Anonymous 9/14/2025, 1:49:53 AM No.106578713

I can't take your model seriously if you don't package it into an sft

Anonymous 9/14/2025, 1:50:52 AM No.106578718

>>106578653
>>106578673
The model is not based on Flux.

Anonymous 9/14/2025, 1:55:24 AM No.106578752 >>106578773

>leave for 2 weeks
>thread is still chromaseethers vs chromekekkers
no new releases lately?

Anonymous 9/14/2025, 1:59:00 AM No.106578773 >>106578784

>>106578752
i'm slopping hard atm with sneedream it's quite fun throwing artists/styles at it to see what it shits out

Anonymous 9/14/2025, 2:00:05 AM No.106578779

>>106578310
>But your electricity and internet are also pay as you go API SaaS services.
im adding this to my arsenal of truth nuk3s for when some faggot ITT gets mad at me suggesting renting a GPU

Anonymous 9/14/2025, 2:01:11 AM No.106578784 >>106578801 >>106578811 >>106578849

also nice to see civit finally added a chroma category
>>106578773
i thought seedream was saas? is there an open weights release?

Anonymous 9/14/2025, 2:03:34 AM No.106578799 >>106578817 >>106579083 >>106579113

>>106578662
>using visual understanding encoder embeddings as dense "text prompts," providing rich supervision without captions
This is fucking genius. Shit like this is EXACTLY what I mean when I say we have so much low hanging fruit and so many more optimizations to find

Anonymous 9/14/2025, 2:03:39 AM No.106578801

>>106578784
>i thought seedream was saas?
it is

Anonymous 9/14/2025, 2:03:39 AM No.106578802

Imagine having a life so empty you spend 14 hours a day in a general shilling some off topic service and arguing with anons in a thread for LOCAL image gen. It's almost as if the person doing this suffers from some sort of disability.

Anonymous 9/14/2025, 2:04:52 AM No.106578811

>>106578784
>i thought seedream was saas?
it is but it's fun to see what non local models can do. don't see the point in limiting yourself to one side of it unless you're just a gooner.

Anonymous 9/14/2025, 2:05:45 AM No.106578817 >>106578933

>>106578799
can you explain further, I'm sure it's a genius idea but I can't visualize it, what do they do exactly to get this improvement

Anonymous 9/14/2025, 2:05:46 AM No.106578818

>>106578324
>mat1 and mat2 shapes cannot be multiplied
>I hate image generation
you downloaded the wrong version of a model or text encoder

Anonymous 9/14/2025, 2:06:29 AM No.106578824

why are people talking about slopdream online here? just ban these trolls

Anonymous 9/14/2025, 2:08:44 AM No.106578837

>>106578693
it was obviously a joke. how do you have more images than posts

Anonymous 9/14/2025, 2:09:21 AM No.106578840

Am I the only one who felt actual anger when they saw they added chroma to civit?
I don't know why that model upsets me so much. But the feeling is real.

Anonymous 9/14/2025, 2:09:53 AM No.106578844

>>106578828
>now some are trolling hard with it
about pretending that chroma is the best model ever because it can do boobs and vagene?

Anonymous 9/14/2025, 2:10:47 AM No.106578849 >>106578962

>>106578784
>i thought seedream was saas? is there an open weights release?
its a shill campaign
anons discussed it in good faith when it first released, then some time passed, now some are trolling hard with it for some reason no idea why

Anonymous 9/14/2025, 2:14:56 AM No.106578876 >>106578901

0_00112__thumb.jpg.webm md5: f2861100... 🔍

WebM not supported

Good morning

Anonymous 9/14/2025, 2:15:59 AM No.106578884 >>106578979

>>106578270
yeah the difference is anons hate you

Anonymous 9/14/2025, 2:18:51 AM No.106578901 >>106578945 >>106578954

>>106578876
gm based /ss/ anon

Anonymous 9/14/2025, 2:22:52 AM No.106578933 >>106578945 >>106579041

>>106578817
>can you explain further, I'm sure it's a genius idea but I can't visualize it, what do they do exactly to get this improvement

Imagine ML models that can understand images but suck at generating them. This new method, Reconstruction Alignment (RecA), fixes that by using the model's own visual understanding as a super-dense prompt.

Instead of relying on crappy text captions that miss most image details, it makes the model reconstruct images using its own semantic understanding. The result appears to better image generation and editing, and it only takes 27 GPU hours to implement.

Anonymous 9/14/2025, 2:25:06 AM No.106578945 >>106578954

>>106578901
that's not him (i am he), all big tit Latina milfs made with WAN lightning just look the same lol

>>106578933
oh and important to note: this is only relevant for improving multimodal models, not pure text-to-image models

Anonymous 9/14/2025, 2:25:49 AM No.106578954 >>106578975 >>106579015

0_00115__thumb.jpg.webm md5: 256a2fe7... 🔍

WebM not supported

>>106578901
>>106578945
post more big titted latinas

Anonymous 9/14/2025, 2:26:50 AM No.106578962 >>106578986 >>106578990

>>106578849
It's just the new anti-Chroma trolling flavor. Before it was Qwen, now it's Seedream.

Anonymous 9/14/2025, 2:27:07 AM No.106578964

>noooo seedream doesn't belong here!
Seedream belongs here, I will clear up any misinformation and miscommunication.
The OP states "Discussion and development of local image and video models and UI". ComfyUI, as linked in the OP, by default has Seedream as one of the available options. When first opening ComfyUI you are greeted with a pop-up requesting you to sign up and add tokens. One of the models you can spend tokens on is Seedream. The fact that ComfyUI prompts you to use Seedream before ever mentioning SDXL or Chroma suggest that it's an integral part of ComfyUI, and by extension a core part of ComfyUI discussion.
>But the OP says LOCAL image!!
True, but it also says "and UI". Discussion of ComfyUI includes the discussion of any and all components of ComfyUI, including the locally run code in the comfy_api_nodes/nodes_bytedance.py file. This file is contained locally on my device, and runs locally in my ComfyUI installation. Seedream discussion fits this thread as it falls under discussion of ComfyUI.
Conclusion: You are allowed to post Seedream outputs as long as they are generated using ComfyUI API nodes.

Anonymous 9/14/2025, 2:28:07 AM No.106578973 >>106578985

Every post he makes is an admission to his suffering and losing

Anonymous 9/14/2025, 2:28:09 AM No.106578975

>>106578954
>post more big titted latinas
i'm in my "small girls" phase right now, so i'm posting on /b/ for obvious reasons. glad to see you're still around and passionate for video

Anonymous 9/14/2025, 2:28:37 AM No.106578979

>>106578884
>anons
Again it's only you

Anonymous 9/14/2025, 2:29:16 AM No.106578980

00140-2869718969.png md5: ed74ba8a... 🔍

Anonymous 9/14/2025, 2:29:43 AM No.106578985

>>106578973
It's really quite odd isn't it. Odd and sad.

Anonymous 9/14/2025, 2:29:44 AM No.106578986 >>106578997

>>106578962
Seedream is an API locked paypig model. Nano banana at least is free to try and prompt. Truly a wonder why they would pick the chinkshit model over the free one.

Anonymous 9/14/2025, 2:30:00 AM No.106578990

>>106578962
This

Neet with anti Chroma obsessive compulsive disorder just can't stop

Anonymous 9/14/2025, 2:31:09 AM No.106578997 >>106579020 >>106580181

>>106578986
talking about google anything is a fast road to being called an indian around these parts

Anonymous 9/14/2025, 2:31:49 AM No.106579004 >>106579016 >>106579074

Found an incredible artist but her style has evolved so much I'm afraid my vramlet model won't be able to handle her entire body of work at one time

Anonymous 9/14/2025, 2:32:40 AM No.106579015

>>106578954
SLOP SLOP SLOP
How many "Speed LoRAs" and "_fast" shortcuts did you use with your "Torch compiled" "fp8_scaled" model?

Anonymous 9/14/2025, 2:32:41 AM No.106579016 >>106579023

>>106579004
Show it to me and I might help you

Anonymous 9/14/2025, 2:32:54 AM No.106579020

ComfyUI_00093__thumb.jpg.webm md5: a316deb3... 🔍

WebM not supported

>>106578997
rightfully so

Anonymous 9/14/2025, 2:33:31 AM No.106579023 >>106579031

>>106579016
https://x.com/motkaambu/media

Anonymous 9/14/2025, 2:34:40 AM No.106579031 >>106579035

>>106579023
I'm not high enough for this shit
No

Anonymous 9/14/2025, 2:35:06 AM No.106579035

>>106579031
Kek

Anonymous 9/14/2025, 2:35:30 AM No.106579041 >>106579074

>>106578933
>Instead of relying on crappy text captions that miss most image details, it makes the model reconstruct images using its own semantic understanding.
so you don't need to captions images anymore?

Anonymous 9/14/2025, 2:39:40 AM No.106579070 >>106579080

comfy698.jpg md5: f2961825... 🔍

Anonymous 9/14/2025, 2:41:08 AM No.106579074

>>106579004
>Found an incredible artist but her style has evolved so much I'm afraid my vramlet model won't be able to handle her entire body of work at one time
And by artist I mean "girl on instagram" and by
"her style has evolved" I mean she's aged and her face/body changed

>>106579041
the abstract says "providing rich supervision without captions"
this is where my ML knowledge ends. if the only purpose of captions is indeed supervision, then yeah there's no longer a purpose

remember though that this just makes the text-to-image as good as its image-to-text. apparently image-to-text mogs text-to-image in pretty much every multimodal LLM so this is a very welcome change

Anonymous 9/14/2025, 2:42:20 AM No.106579080

>>106579070
model?

Anonymous 9/14/2025, 2:42:39 AM No.106579083 >>106579137

>>106578662
>>106578799
https://huggingface.co/collections/sanaka87/reca-68ad2176380355a3dcedc068
They published their models. Anyone want to try them for us?

Anonymous 9/14/2025, 2:47:40 AM No.106579110 >>106579267

comfy332.jpg md5: 2ef32255... 🔍

made a mistake and posted it in a dead threat, ups

Anonymous 9/14/2025, 2:48:02 AM No.106579113

>>106578799
>this is EXACTLY what I mean when I say we have so much low hanging fruit
I don't know who you are, but you stole that quote from me.

Anonymous 9/14/2025, 2:51:55 AM No.106579135 >>106579202 >>106579235 >>106579395

1756987484784285.jpg md5: 6db7e0e7... 🔍

Which model can I use to make backgrounds and scenery without a subject being the focus like all these noobai/illustrious 1girl models?

Anonymous 9/14/2025, 2:52:06 AM No.106579137 >>106579202 >>106579362

1739674824474746.png md5: eee03a8b... 🔍

>>106579083
>worse than Kontext
I'll pass, but this method seems promissing, I really believe we'll get Nano Banana's level with this shit

SPRO to unslop + this to make it good = Qwen Image Edit Ultimate <3

Anonymous 9/14/2025, 2:52:07 AM No.106579138 >>106579486

>>106577883 (OP)
Hey can I ask a really stupid question?
What UI do Image-To-Text app use?
I'm looking at gemini, qwen or joycaption, and I don't understand if their suppose to be run with Stable-Dif, Comfy, something else. Or they are just their own stand alone UI.

>I'm having problems installing so I'm trying to figure out if i'm doing something really basic wrong.

Anonymous 9/14/2025, 2:52:45 AM No.106579148

I know the micropenis pun model has taken a lot of QIE's thunder, but qwen image edit is still pretty good desu.

Anonymous 9/14/2025, 2:55:13 AM No.106579165

1757811118030-7efe926c-3055-4a4a-8f57-a2c468049b2a.jpg md5: 83a7e1d0... 🔍

Anonymous 9/14/2025, 2:59:45 AM No.106579202 >>106579213 >>106579249 >>106579265 >>106579362

>>106579135
use the "no humans" tag, put 1girl, 1boy etc in negatives. use NegPip instead of the default negatives implementation.

>>106579137
it beats Kontext on three of those columns though, and seems it might be less slopped/better at styles? Also they note in the paper that they could have probably gone further with Bagel, but basically say they ran out of money.

>SPRO to unslop + this to make it good = Qwen Image Edit Ultimate
Not sure if this would work for Qwen Image. It seems to be for multimodal models only. Or maybe there's a way to get it to work?

Anonymous 9/14/2025, 3:00:14 AM No.106579208 >>106579260 >>106579356

1727012159112235_thumb.jpg.webm md5: 2d197da8... 🔍

WebM not supported

Animating some frazetta paintings

Anonymous 9/14/2025, 3:01:27 AM No.106579213 >>106579362

>>106579202
>it beats Kontext on three of those columns though,
oh yeah I'm fucking blind, I should get some sleep lol

Anonymous 9/14/2025, 3:03:43 AM No.106579232 >>106579267

comfy1231.jpg md5: 90ad4bb8... 🔍

Anonymous 9/14/2025, 3:04:36 AM No.106579235 >>106579265

noob_naiXLVpred102d_custom.safetensors_00495_.png md5: b17460b3... 🔍

>>106579135
eg:
>scenery, no humans, an abandoned factory building, rust, grass, vines, flowers,sunlight, day, red sky, clouds,
>by takamura kazuhiro, by sushio, by sadamoto yoshiyuki , very awa, absurdres, best quality, masterpiece, ultra-detailed, amazing composition,watercolor $medium$,
>(jpeg artifacts, text, watermark, cropped, censored:-1)

Anonymous 9/14/2025, 3:05:06 AM No.106579237

1732338371098346_thumb.jpg.webm md5: bacff0da... 🔍

WebM not supported

Anonymous 9/14/2025, 3:05:33 AM No.106579240

00278-3250109216.png md5: 19a40077... 🔍

Anonymous 9/14/2025, 3:06:36 AM No.106579247 >>106579257

file.png md5: 0ecacb70... 🔍

>>106578168
>Chroma doesn't know a single booru tag
no, it knows many. what it does have trouble with is the artist tags that some people would like to use

Anonymous 9/14/2025, 3:06:53 AM No.106579249 >>106579263

>>106579202
>Not sure if this would work for Qwen Image. It seems to be for multimodal models only.
maybe it can work if you use the visual text encoder?

Anonymous 9/14/2025, 3:07:54 AM No.106579257 >>106579641

>>106579247
>no, it knows many.
show me some tags it can do (action tags and artist tags)

Anonymous 9/14/2025, 3:08:03 AM No.106579259 >>106579267

comfy12309.jpg md5: f82c9db6... 🔍

Anonymous 9/14/2025, 3:08:07 AM No.106579260

>>106579208
Based

Anonymous 9/14/2025, 3:08:38 AM No.106579263

>>106579249
I think you're right.

Anonymous 9/14/2025, 3:08:39 AM No.106579265

1745697272355017.jpg md5: 1c62e269... 🔍

>>106579202
>>106579235
Whenever I don't prompt for a 1girl and use "no humans" it seems the background quality is greatly reduced, while if a 1girl is there somewhere suddenly the details are pretty sharp and make more sense. Without it the backgrounds almost always have a 1-point perspective and look like early SDXL generic crap.

I think I need better prompts too, but thanks for the tips.

Anonymous 9/14/2025, 3:09:09 AM No.106579267

>>106579110
>>106579232
>>106579259
Some Flux LoRA?

Anonymous 9/14/2025, 3:22:26 AM No.106579356

>>106579208
moar

Anonymous 9/14/2025, 3:23:16 AM No.106579362 >>106579382 >>106579393

>>106579213
>>106579202
>>106579137
there's a demo
https://huggingface.co/spaces/sanaka87/BAGEL-RecA
so far, not seeing great style results. but the demo space might have bad configs

Anonymous 9/14/2025, 3:25:29 AM No.106579373

>>106578386
Shit gen
Shit asuka style
Grow up

Anonymous 9/14/2025, 3:26:35 AM No.106579382

>>106579362
>but the demo space might have bad configs
I've seen this cope enough times to know the model is dead on arrival.

Anonymous 9/14/2025, 3:27:22 AM No.106579388 >>106579398 >>106579453

My fellow /ldg/entlemen, I recently tried out NAI v4.5 to see how proprietary models are doing and the vibe transfer feature is actually pretty good. Now I want to replicate something similar using local models. I've a few questions for the sages of this thread
1. Is IPAdapter the same thing / good enough? If so, is there a good write-up anywhere? Haven't been able to find a lot
2. If not, does local even have anything similar?
3. If so, does anyone have a comfy workflow I can look at?
Yeah that's pretty much it, I just want vibe transfer at home

Anonymous 9/14/2025, 3:28:06 AM No.106579393 >>106579417 >>106579470

1750791885221213.jpg md5: e24fea07... 🔍

>>106579362
>https://huggingface.co/spaces/sanaka87/BAGEL-RecA
bagel is such a shit model though, if they do this shit on QIE then maybe that can be interesting

Anonymous 9/14/2025, 3:28:29 AM No.106579395 >>106579423

>>106579135
>>>/ldg/

Anonymous 9/14/2025, 3:29:05 AM No.106579398 >>106579459 >>106579459

>>106579388
all local one-shot style transfer options are shit, however style loras absolutely destroy vibe transfer.

Anonymous 9/14/2025, 3:30:56 AM No.106579417

1733082335851599.png md5: 2e8a87e5... 🔍

>>106579393
keeek

Anonymous 9/14/2025, 3:31:27 AM No.106579421 >>106579441

I hate Chroma
I hate Comfy
I hate Asuka fag
I hate SeeDream

Anonymous 9/14/2025, 3:31:42 AM No.106579423 >>106579429

>>106579395
Yes, I'm posting in it currently.

Anonymous 9/14/2025, 3:32:31 AM No.106579429 >>106579443

>>106579423
Landscape diffusion general

Anonymous 9/14/2025, 3:33:55 AM No.106579441

00148-3061477033.png md5: 6fec3425... 🔍

>>106579421
>made the list
wow, I am going to finally download chroma and then spin up some api nodes in celebration. cheers.

Anonymous 9/14/2025, 3:34:12 AM No.106579443 >>106579464

>>106579429
Long dead and they never posted any tips or helpful comments (I read the the threads in full).

Anonymous 9/14/2025, 3:35:18 AM No.106579453 >>106579459

>>106579388
IPAdapter is superior but you have to dial in the settings. The right settings for you, only you can find.

Anonymous 9/14/2025, 3:37:04 AM No.106579459 >>106579480

>>106579398
True, but vibe transfers doesn't just copy style. I'm just curious if someone has tried training a similar model pipeline that NAI is using with the information we've been given
>>106579453
Does IPA only copy the style or also the "essence" like VT does? That's what I'd like to see as you can just grab style loras otherwise like >>106579398 said

Anonymous 9/14/2025, 3:37:48 AM No.106579464

>>106579443
Didn't they post some nice hudson river school stuff ?

Anonymous 9/14/2025, 3:38:30 AM No.106579470

>>106579393
desu, qwen edit might not benefit as much because it's already so bloated and powerful. I'm more interested in this being a way to power up smaller/debloated models

Anonymous 9/14/2025, 3:40:02 AM No.106579478

I've just installed kohya ss but when I try to run it a cmd prompt just opens and then closes immediately. Anyone know the fix?

Anonymous 9/14/2025, 3:40:25 AM No.106579480 >>106579512

>>106579459
You'd have to define "essence". I used to throw a dozen or so images into it and it'd work quite well, really well in fact. The other anon is right in that no one really gives a shit about it anymore and will train a lora instead.
Desu 1.5's IPA is miles better than XL's the last time I used either.

Anonymous 9/14/2025, 3:41:16 AM No.106579486

>>106579138
ComfyUI, for example: https://github.com/1038lab/ComfyUI-JoyCaption

Anonymous 9/14/2025, 3:41:23 AM No.106579487 >>106579508

00320-83642732.jpg md5: d38671cc... 🔍

Anonymous 9/14/2025, 3:44:49 AM No.106579501 >>106579511

ComfyUI_00020_.png md5: b87fd523... 🔍

trying to make some new desktop backgrounds

Anonymous 9/14/2025, 3:45:31 AM No.106579508

>>106579487
Thanks, Ran...

Anonymous 9/14/2025, 3:45:52 AM No.106579511 >>106579694

>>106579501
nice one

Anonymous 9/14/2025, 3:45:57 AM No.106579512 >>106579531

>>106579480
>1.5 IPA is miles better than XL
Damn, well I'll see if I can find some workflows online and just try it out, thanks. Loras are great but if you have 3 or 4 of them, they have a tendency of deep-frying your image, not to mention that you (or someone else) has to spend a few hours cooking up a good one. Maybe NAI will open source their VT pipeline/models in a year or two...

Anonymous 9/14/2025, 3:46:27 AM No.106579515 >>106579649

Guys how do the models just KNOW asuka?

Anonymous 9/14/2025, 3:48:09 AM No.106579531

>>106579512
>Maybe NAI will open source their VT pipeline/models in a year or two...
its not really good enough to really care noob clears nu nai anyway other than text

Anonymous 9/14/2025, 3:48:27 AM No.106579533

>ran took everything from me

Anonymous 9/14/2025, 3:58:11 AM No.106579607

seedream stole my will to prompt locally...

Anonymous 9/14/2025, 4:00:26 AM No.106579621 >>106579724

I honest to god have no idea why you're all shitting yourselves over seed dream. What even makes it good?

Anonymous 9/14/2025, 4:00:33 AM No.106579622

244jt6-678755358.jpg md5: 279c1d16... 🔍

Ehemmmmmm

Anonymous 9/14/2025, 4:01:28 AM No.106579628 >>106579642

*yawn*

Anonymous 9/14/2025, 4:04:34 AM No.106579641

file.png md5: f3d58569... 🔍

>>106579257
> action
have "drawing (action)", one of the few *action* tags in the boorus as far as I can tell

>artist
I literally just said these are some of the tags it has trouble with.

The implication of it having learned many booru tags from the multiple boorus that went into this also is that indeed it hasn't learned all booru tags. Would be nice to have such a model. It didn't learn AS much though, just probably most of all models so far.

Anonymous 9/14/2025, 4:04:36 AM No.106579642

>>106579628
SaaS is evil but (you) download their leaked models as fast as (you) can and
(you) despices cloud services (you) download their models from hugging face that is litteraly a cloud service.

(you) are a joke, this is the real /sdg/

Anonymous 9/14/2025, 4:05:27 AM No.106579645

>schizos out

Anonymous 9/14/2025, 4:06:23 AM No.106579649 >>106579652

file.png md5: 1501c24b... 🔍

>>106579515
like miku and a few others she is all over the internets even if you don't ingest *booru or anime-leaning parts of social networks or whatever as training data specifically

Anonymous 9/14/2025, 4:07:08 AM No.106579652

>>106579649
>tranny hands

Anonymous 9/14/2025, 4:07:31 AM No.106579656 >>106579694 >>106579700 >>106579826

WanVideo2_2_I2V_00376_thumb.jpg.webm md5: abd7b2c8... 🔍

WebM not supported

How to stop yapping?

Anonymous 9/14/2025, 4:13:31 AM No.106579690 >>106579703 >>106579708

Best way to train a lora locally that isn't kohya? Kohya ss just crashes on start up for me

Anonymous 9/14/2025, 4:14:10 AM No.106579694 >>106579706

0_00084__thumb.jpg.webm md5: 9b2b01d0... 🔍

WebM not supported

>>106579511
thank you anon
>>106579656
give her a face mask

Anonymous 9/14/2025, 4:14:51 AM No.106579700

ComfyUI_00247__thumb.jpg.webm md5: 9fd66dc7... 🔍

WebM not supported

>>106579656
prompt more about facial expression

Anonymous 9/14/2025, 4:15:14 AM No.106579703 >>106579718

>>106579690
OneTrainer, literally just used it to train a lora
https://github.com/Nerogar/OneTrainer

Anonymous 9/14/2025, 4:15:26 AM No.106579704 >>106579712

00343-1768283568.jpg md5: 52f0ab71... 🔍

Anonymous 9/14/2025, 4:16:01 AM No.106579706

ComfyUI_00012_.png md5: 43b3e5df... 🔍

>>106579694
forgive the crap video it was just a quick example

Anonymous 9/14/2025, 4:16:28 AM No.106579708

>>106579690
>Kohya ss just crashes on start up for me
I'd look into that.

Anonymous 9/14/2025, 4:17:27 AM No.106579712 >>106579714

>>106579704
Whoa momma, very nice. Model?

Anonymous 9/14/2025, 4:18:05 AM No.106579714 >>106579728 >>106579811

>>106579712
ChromaHD with a self made lora

Anonymous 9/14/2025, 4:18:45 AM No.106579718

>>106579703
nice, ty

Anonymous 9/14/2025, 4:19:39 AM No.106579724

>>106579621
>What even makes it good?
you can gen 4k quality natively it seems, no upscaling. prompt adherence is also very good, they actually wrote a paper on their algorithm and why its so good

the aesthetic superiority is subjective as always

Anonymous 9/14/2025, 4:20:17 AM No.106579728

veeunus-spongebob.gif md5: 951be678... 🔍

>>106579714
>Chroma model

Anonymous 9/14/2025, 4:28:07 AM No.106579789 >>106580082

G0ws4dxXcAAL2Aw.jpg md5: d74ab837... 🔍

Anonymous 9/14/2025, 4:28:55 AM No.106579793

2loras_test__00023_.png md5: 22442ae0... 🔍

some day those signs will actually say something, hopefully by 2030

Anonymous 9/14/2025, 4:31:44 AM No.106579811 >>106579858

>>106579714
Is chroma "finished" already? I was under the impression that it's still training. Also, is it a style or character lora? How much vram/time does it take to train one?

Anonymous 9/14/2025, 4:32:39 AM No.106579818 >>106579995

00081-2784931192.jpg md5: f78f1a2a... 🔍

>>106578036
i tried the model and hated it. its just broken pos model that was obviously train wrong and is very finicky with its settings. I'll just stick to using to sdxl finetunes.

Anonymous 9/14/2025, 4:32:54 AM No.106579820

tall.jpg md5: c6bec9a1... 🔍

Anonymous 9/14/2025, 4:34:50 AM No.106579826

>>106579656
have her suck something

Anonymous 9/14/2025, 4:36:17 AM No.106579831

2loras_test__00030_.png md5: 632602ca... 🔍

Anonymous 9/14/2025, 4:40:55 AM No.106579858 >>106579868 >>106579875

>>106579811
HD is the finished version but they are still making different versions. I'm using a 5090 and it takes me 8 hours to train

Anonymous 9/14/2025, 4:41:22 AM No.106579860

00103-3456784782.jpg md5: 2ca3beaa... 🔍

Anonymous 9/14/2025, 4:43:38 AM No.106579868 >>106579901

>>106579858
8 hours on a 5090 is fucked, training an SDXL lora takes like 1-2 hours on my 3090, I shudder to think about how long Chroma would take. That's probably why there are only a handful of loras out on civitai

Anonymous 9/14/2025, 4:45:15 AM No.106579875

>>106579858
hey wait a second i didnt make this post. where did my post about --fast being still relevant for fp16 text encoders even when using Q8 go??

Anonymous 9/14/2025, 4:46:34 AM No.106579881 >>106579885 >>106580121

It's just like, why use Chroma when SDXL does the exact same thing but faster?

Anonymous 9/14/2025, 4:47:18 AM No.106579885 >>106579915

>>106579881
better prompt adherence and can do text

Anonymous 9/14/2025, 4:47:45 AM No.106579890

ok weird i guess spam filter discarded it, first time i had that happen on the evasion site

anyways, --fast matters if you use Q8 with a fp16 encoder

video not using --fast on Q8: 302 seconds exactly every time
video using --fast on Q8: give or take 250 seconds. --fast introduces a much larger time range that can vary between 240 to 280 seconds but its always faster than not using it

un-noticable difference to prompt adherence. nothing like going from fp16 to fp8 scaled for t5xxl

Anonymous 9/14/2025, 4:48:46 AM No.106579896

00348-1768283570-ad-before.jpg md5: 4c61013a... 🔍

Anonymous 9/14/2025, 4:49:36 AM No.106579898 >>106579911 >>106580119

file.png md5: 8f5f804d... 🔍

>Pruned Qwen Image in half (10B params). It needs a lot of training to be useful, so I decided to make it a pixel space model. Patching pixel space with 32x32 patches. Samples are the current 10B latent version and the pixel space version. Both will need a lot more training.
https://xcancel.com/ostrisai/status/1966987356357226612

Anonymous 9/14/2025, 4:51:31 AM No.106579901 >>106580088

>>106579868
>8 hours on a 5090 is fucked
Totally depends on the number of images and resolution

Have you ever trained at all ?

Anonymous 9/14/2025, 4:52:40 AM No.106579911

ComfyUI_00139__thumb.jpg.webm md5: f8cf1310... 🔍

WebM not supported

>>106579898
>xcancel

Anonymous 9/14/2025, 4:54:01 AM No.106579915

>>106579885
>better prompt adherence
Control nets exist and are extremely good. Most of the actual practical uses of Chroma do not even demonstrate the need for its prompt adherence over SDXL either. It's all just 1 girl stuff.

>can do text
Benchmaxxing useless shit.

Anonymous 9/14/2025, 4:54:20 AM No.106579919 >>106579925

00113-2287984840.jpg md5: 2dc8e0b5... 🔍

Anonymous 9/14/2025, 4:55:24 AM No.106579925 >>106579993

>>106579919
you should go back to doing lolis with bulges

Anonymous 9/14/2025, 5:05:59 AM No.106579990 >>106580004

>>>/b/939770328

Anonymous 9/14/2025, 5:06:25 AM No.106579993 >>106580004 >>106580009

00119-1035844509.jpg md5: da1348db... 🔍

>>106579925
not in the mood to get another 3 day vacation again. Even posting on /trash/ got me banned and /b/ is full of compressed low res slop gen. Not risking it. just got banned on /v/ and warning the other day on /aco/ for posting young lunafreya.

Anonymous 9/14/2025, 5:06:36 AM No.106579995

>>106579818
>w-w-w-Would
>Would
>Would
>Would

https://www.youtube.com/watch?v=fZVDNPODfeY

Anonymous 9/14/2025, 5:07:54 AM No.106580004

>>106579990
you should be banned for linking to /b/ in an AI thread and showing me adults instead of a sexy kid

>>106579993
>not in the mood to get another 3 day vacation again.
ok so just replace the boards 4chan org with k1w1 dot st in your url address bar but with an i instead of a 1 so its the name of the fruit/new zealander

Anonymous 9/14/2025, 5:08:39 AM No.106580009

>>106579993
fair enough bro cool gens and style tho

Anonymous 9/14/2025, 5:19:52 AM No.106580071

00122-3559916170.jpg md5: 19e68a63... 🔍

Anonymous 9/14/2025, 5:22:10 AM No.106580082 >>106580141

>>106579789
model?

Anonymous 9/14/2025, 5:22:41 AM No.106580088

>>106579901
>training an SDXL lora takes like 1-2 hours on my 3090
Yes. Yes I have. I don't know how many images other people use, but I haven't gone above 100 yet. I train at 1024 pixels and rank 32-64

Anonymous 9/14/2025, 5:26:10 AM No.106580108 >>106580195

i regret replying to 106579993 because its obvious he was trolling :/ talking about compressed slop when he posted a sub 200kb jpeg and when i gave him a solution to his only problem he didn't take it. so obviously he could have just left it at "not in the mood" but because he's a left winger he needs to invent problems :/

Anonymous 9/14/2025, 5:27:13 AM No.106580113

I like Chroma.

Anonymous 9/14/2025, 5:29:27 AM No.106580119

>>106579898
interesting attempt but i wonder if he can train it enough to confirm his setup is working or not (or retrain if it isn't)

Anonymous 9/14/2025, 5:29:55 AM No.106580121 >>106580136 >>106580190

>>106579881
i can do 2girls interacting with each other with chroma instead of just 1girl with sdxl+snake oil

Anonymous 9/14/2025, 5:32:21 AM No.106580136 >>106580147

file.png md5: 9c6617a2... 🔍

>>106580121
it works better.

but wan (1 frame for images), hidream, qwen and maybe others are probably actually quite at bit stronger yet. they unfortunately also have less interesting 2girls overall

Anonymous 9/14/2025, 5:33:07 AM No.106580141 >>106580160 >>106580251

>>106580082
seedream 4

Anonymous 9/14/2025, 5:34:03 AM No.106580147 >>106580228

>>106580136
i wish someone would give qwen the chroma treatment

Anonymous 9/14/2025, 5:36:27 AM No.106580160 >>106580205 >>106580215

>>106580141
you try this? https://www.reddit.com/r/Bard/comments/1nfl7tx/comment/ndxhrmc/
too much effort for me

Anonymous 9/14/2025, 5:38:28 AM No.106580177

So I want to finally try Qwen out: is Qwen + lightx2v 8step the go-to if I want to use LoRAs?

Anonymous 9/14/2025, 5:39:04 AM No.106580181

fbce8f3b-aef0-4969-a7fc-84259129e873.png md5: dac04508... 🔍

>>106578997
yeah, there is a reason for that anon. iChuds won

Anonymous 9/14/2025, 5:39:41 AM No.106580187 >>106580200 >>106580206

Alright, I really want to get something good out of Chroma, since Qwen is so slopped for photoreal nsfw at the moment, but goddamnit I just can't get a decent gen to save my life.

Yes it's a skill issue, yes I'm a fag. Can someone share a box of a decent Chroma-HD-Flash image or recommend some settings?

Anonymous 9/14/2025, 5:40:03 AM No.106580190 >>106580203

>>106580121
pure skill issue if you can't do 2girl with noob

Anonymous 9/14/2025, 5:40:57 AM No.106580195 >>106580222

00020-1100356185.png md5: fe21caf0... 🔍

>>106580108
not trolling, here is the catbox if it makes you happy.
https://files.catbox.moe/4o5oaq.png
https://files.catbox.moe/k7p27d.png
https://files.catbox.moe/pafows.png
https://files.catbox.moe/d2593f.jpg
https://files.catbox.moe/kjw6xv.png
https://files.catbox.moe/w7n9xh.png
https://files.catbox.moe/51xixp.png
https://files.catbox.moe/s2wuas.jpg
https://files.catbox.moe/0h5tgj.png
https://files.catbox.moe/knmk31.png

Anonymous 9/14/2025, 5:42:00 AM No.106580200

>>106580187
Whats the last prompt you used?

Anonymous 9/14/2025, 5:42:35 AM No.106580203

>>106580190
i burnt myself out on anime a long time ago

Anonymous 9/14/2025, 5:43:17 AM No.106580205 >>106580223

1757811556206-226a4546-2736-467a-a337-fb62d4b1e2d12.jpg md5: 24a74da8... 🔍

>>106580160
i haven't, though i have found a couple ways to scam infinite credits though such as LMArena and Yupp.

Anonymous 9/14/2025, 5:43:43 AM No.106580206

>>106580187
Flash tends to suck ass on its own and I get better results with Hyper-low-step at 1.00 and flash lora at 0.4

Anonymous 9/14/2025, 5:44:05 AM No.106580207

go back to the cloud threads and leave this one to the grown ups kek

Anonymous 9/14/2025, 5:44:50 AM No.106580210

>grown ups
>512x512
little baby boy soiled his diaper~~

Anonymous 9/14/2025, 5:45:48 AM No.106580215

>>106580160
uncensored chinese api sounds mighty tempting.. hard to resist the siren’s song

Anonymous 9/14/2025, 5:46:26 AM No.106580220

>>106580216
>>106580216
>>106580216
>>106580216

Anonymous 9/14/2025, 5:46:27 AM No.106580221

>>106580213
kys

Anonymous 9/14/2025, 5:46:38 AM No.106580222 >>106580238 >>106580252

>>106580195
>here is the catbox if it makes you happy
well i was trolling with my post so now i feel bad for making you put in that effort. i hope the other anon appreciates what you shared and replies thanks to you as well

Anonymous 9/14/2025, 5:47:02 AM No.106580223

>>106580205
>LMArena
yeah that's what i've been using tried yupp yesterday but didn't get too far into it

Anonymous 9/14/2025, 5:47:14 AM No.106580228 >>106580237

file.png md5: cfed0ec8... 🔍

>>106580147
would be nice but you need someone/some entity with even more money to spare than lodestone (who mostly funded chroma, donations only covered the smaller part of expenses until now)

Anonymous 9/14/2025, 5:48:04 AM No.106580233

christ

Anonymous 9/14/2025, 5:48:28 AM No.106580237 >>106580241 >>106580277

>>106580228
>who mostly funded chroma, donations only covered the smaller part of expenses until now
Is lodestones just ultrarich?

Anonymous 9/14/2025, 5:48:32 AM No.106580238

>>106580222
you started it not me i said what i wanted to say already lol

Anonymous 9/14/2025, 5:49:36 AM No.106580241 >>106580383

>>106580237
Furries seem to be rich, for some weird reason

Anonymous 9/14/2025, 5:50:42 AM No.106580251 >>106580278

>>106580141
Not local though.

Anonymous 9/14/2025, 5:51:03 AM No.106580252

>>106580222
it's alright, tonight i have the right amount energy to slop up some gens and effort post. going to keep cooking.

Anonymous 9/14/2025, 5:55:57 AM No.106580277

file.png md5: b11e8df7... 🔍

>>106580237
the wealth of the ultrarich is far more insane even if you just look at liquid assets they could easily expend on a sustained base.

unfortunately we're not getting them to drop good uncensored NSFW models on the public so far.

but he clearly isn't poor

Anonymous 9/14/2025, 5:56:10 AM No.106580278

>>106580251
He will attempt to discuss it here regardless because he is a faggot

Anonymous 9/14/2025, 6:15:12 AM No.106580383

>>106580241
>Furries seem to be rich, for some weird reason
there's just a lot of rich people out there bro. some of them are furries. 3.5% of men are pedos. are you a golem who thinks the forbes list of billionaires is all the billionaires on the planet?