Anonymous
9/14/2025, 12:13:08 AM
No.106577883
>>106578232
>>106579138
/ldg/ - Local Diffusion General
Anonymous
9/14/2025, 12:14:25 AM
No.106577893
>>106577937
>>106578095
I have stabilized a style most likely
Anonymous
9/14/2025, 12:14:48 AM
No.106577897
>>106577991
>>106578602
Anonymous
9/14/2025, 12:15:35 AM
No.106577906
>>106577960
>>106577885
>You don't seem to understand what a base model is, it's a model made to have as little bias as possible and know of as many concepts as possible so it can be used as a BASE for further finetuning
can someone put the post of the anti chroma anon saying "you are here" and at the end it was, "we can save it with finetune though" or something like that
Anonymous
9/14/2025, 12:18:40 AM
No.106577937
>>106577893
so just slopstyle?
>>106577796
>NOOOO YOU CAN'T JUST TELL THE MODEL WHAT YOU WANT AND GET WHAT YOU WANT USING A HIGHLY CONCISE FORMAT!!! YOU HAVE TO BOOMERPROMPT!!!
>>106577809
I find chroma most useful for actual artistic stuff (which none of the chromaschizos can even conceive of, all they do is generate asian waifus and feet), it's just disappointing that it's so much weaker than it should be. The ROI on prompting effort is much lower than Noob or Qwen, but it is true it can do things no other local model can do at the moment.
>pic
The truth is that we're going to free ourselves from this shit by baking our own models from scratch, using techniques like those in
https://huggingface.co/KBlueLeaf/HDM-xut-340M-anime. Architectural changes and training optimizations are going to make it possible to train fully unslopped, uncensored, DEBLOATED models with Qwen-level comprehension and far fewer parameters, with local or rented GPUs on a budget <$10k very soon. It may already be possible.
>>106577820
>>106577885
>Pony and Noob are large finetunes with TONS of sexual positions.
>Chroma is a base model, like SDXL, Flux, QWen, as such it's not specifically focused on anything but knows some of practically everything
This makes no sense whatsoever. It was trained on e621 data right? That data has sex position tags does it not?? So why doesn't chroma?
>>106577829
Nothing wrong with that, but it should have been trained on tags alongside/interchanged with the captions. Seems it wasn't.
Anonymous
9/14/2025, 12:19:32 AM
No.106577948
>>106577983
>>106578346
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
>mat1 and mat2 shapes cannot be multiplied
AAAAAAAAAAAAAAAAAAAAa
Anonymous
9/14/2025, 12:20:37 AM
No.106577955
>>106577943
>another wall of text
what's wrong with you dude?
Anonymous
9/14/2025, 12:21:04 AM
No.106577959
>>106578022
Once again there's never been a successful model that's retagged *booru images with NLP slop.
Anonymous
9/14/2025, 12:21:06 AM
No.106577960
>>106577979
>>106577906
Desperately moving the goal post again
It was always presented as a base model, a de-distilled and de-slopped uncensored version of Flux Schnell, which is also a base model, just like SD1.5, SDXL, SD3, Flux, Qwen, Wan
And just like with all these models, if you want it to be really good for a specific concept, you need finetunes/loras
Anonymous
9/14/2025, 12:22:15 AM
No.106577967
>>106578045
>>106577941
> A 1.5B-parameter model with RecA achieved state-of-the-art results on image generation benchmarks like GenEval (0.86) and DPGBench (87.21) with only 27 A100 GPU-hours
sounds impressive? they show 0 images though, wtf?
Anonymous
9/14/2025, 12:23:18 AM
No.106577979
>>106578000
>>106577960
>just 2 more finetunes bro
just let it go, it's over
Anonymous
9/14/2025, 12:23:31 AM
No.106577983
>>106578324
>>106577948
> he can't matmul
ngmi
Anonymous
9/14/2025, 12:24:26 AM
No.106577991
>>106577999
>>106577897
giwtwm
>>106577878
thanks but it came out a turd
>>106577886
yeah forgot what the prompt was but i think it did have it in there
Anonymous
9/14/2025, 12:25:12 AM
No.106577998
>>106578095
Anonymous
9/14/2025, 12:25:17 AM
No.106577999
>>106577991
>it came out a turd
yeah, the proportions are all fucked up lol
Anonymous
9/14/2025, 12:25:23 AM
No.106578000
>>106578006
>>106578020
>>106577979
He always posts off topic images in the thread, same pattern no deviation and you people keep fucking replying to him
Anonymous
9/14/2025, 12:26:58 AM
No.106578006
>>106578033
>>106578034
>>106578000
>same pattern no deviation
you mean the walls of text? + the overusage of the word "shill"?
>>106577943
>>106577527
>>106577373
Anonymous
9/14/2025, 12:28:31 AM
No.106578014
>>106578035
>>106577943
>This makes no sense whatsoever. It was trained on e621 data right? That data has sex position tags does it not?? So why doesn't chroma?
What part of focus don't you understand ?
If you train on a shit ton of images with no particular biases, then it will not learn particular biases as well as if you train on a shit ton of images with particular biases
It's not rocket science, the model learns through pattern recognition and repeats, if one training has 50k images of fetish X and 5 million images of other stuff, and the other training has 200k images of fetish X and 1 million images of other stuff, the latter model will learn fetish X much better
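To put numbers on the argument above, here is a quick back-of-the-envelope sketch. The image counts are the hypothetical ones from the post, not real dataset statistics for any model, and `concept_share` is a helper name made up for this example.

```python
# Share of training images that depict a given concept, using the
# hypothetical counts from the post above (not real dataset statistics).
def concept_share(concept_images: int, other_images: int) -> float:
    return concept_images / (concept_images + other_images)

broad = concept_share(50_000, 5_000_000)     # broad "base model" style mix
focused = concept_share(200_000, 1_000_000)  # focused finetune style mix

print(f"broad mix:   {broad:.2%} of samples")
print(f"focused mix: {focused:.2%} of samples")
print(f"focused mix shows the concept {focused / broad:.1f}x as often per sample")
```

Per-sample exposure is only one factor (captions, learning rate and training length matter too), but it shows why the same concept can land very differently in two runs.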
γγΉγγ«γΌγ
!!FH+LSJVkIY9
9/14/2025, 12:29:42 AM
No.106578020
>>106578000
:c
what do you mean "YOU people"???
Anonymous
9/14/2025, 12:29:43 AM
No.106578022
>>106578095
>>106577959
>>106577943
I'm not a baker, but here's what seems obvious to me:
Make three caption sets:
>Pure NL captions
>NL captions that are infused with tag keywords by telling the VLLM to make sure to include the image's tags in the NL description
>plain tags
Then when training, just include each image three times, one time for each version of the caption. Or concatenate the caption sets in a randomized order.
Why wouldn't this work? Why don't bakers do this?
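A minimal sketch of the idea above, assuming each training sample already carries the three caption variants. The field names (`nl`, `nl_with_tags`, `tags`) and the example sample are invented for illustration, and nothing here reflects how any existing trainer actually does it.

```python
import random

def pick_caption(sample: dict, rng: random.Random) -> str:
    """Pick one of the three caption styles for this training step."""
    mode = rng.choice(["nl", "nl_with_tags", "tags"])
    if mode == "tags":
        tags = list(sample["tags"])
        rng.shuffle(tags)  # shuffle so the tag order is not memorized
        return ", ".join(tags)
    return sample[mode]

def concat_captions(sample: dict, rng: random.Random) -> str:
    """Alternative: join all three styles in a randomized order."""
    parts = [sample["nl"], sample["nl_with_tags"], ", ".join(sample["tags"])]
    rng.shuffle(parts)
    return "\n".join(parts)

# Hypothetical sample, for illustration only.
sample = {
    "nl": "A knight standing in a snowy forest at dusk.",
    "nl_with_tags": "A knight in armor, solo, standing in a snowy forest at dusk.",
    "tags": ["knight", "armor", "solo", "snow", "forest", "dusk"],
}

rng = random.Random(0)
print(pick_caption(sample, rng))
print(concat_captions(sample, rng))
```

Picking one variant per step keeps the epoch length unchanged, while repeating each image three times triples it; which of the two trains better is an open question here.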
Anonymous
9/14/2025, 12:29:46 AM
No.106578024
local
chads
eatin
gud
Anonymous
9/14/2025, 12:30:04 AM
No.106578026
>>106578074
>>106578104
there is no excuse for a model as big as chroma to not fit in every drop of knowledge from every booru out there with plenty of room to spare. it could fit 4 SDXLs in it. it's simply bad
still down to train an interesting lora to post here
>>106577496
as long as the artist style is not shit, sure
Anonymous
9/14/2025, 12:30:44 AM
No.106578033
>>106578006
Autistic people connect. The problem is that the one shilling for non-free shit has no reason to be in this thread. He's just rattling the cage hoping to trigger people
Anonymous
9/14/2025, 12:31:11 AM
No.106578034
>>106578054
>>106578006
>He always posts off topic images in the thread
kek, never posted anything not AI generated here
>the overusage of the word "shill"?
can't be overuse when it's 100% correct
Anonymous
9/14/2025, 12:31:14 AM
No.106578035
>>106578136
>>106578014
Yet chroma learned a ton of concepts that weren't in Schnell. One obvious example, it can do genitals. How is learning basic sex positions harder than learning genitals, which weren't in Schnell at all??
Anonymous
9/14/2025, 12:31:17 AM
No.106578036
>>106579818
I can't believe there is still someone here desperately trying to convince people that Chroma is the future. It's done. It's shit.
Anonymous
9/14/2025, 12:31:24 AM
No.106578039
Anonymous
9/14/2025, 12:31:44 AM
No.106578045
>>106578069
>>106577967
I shared the overview link. click Paper up at the top.
Anonymous
9/14/2025, 12:32:40 AM
No.106578054
>>106578034
see, you can argue without making a wall of text, that's way more pleasing to the eyes, thank you
Anonymous
9/14/2025, 12:34:15 AM
No.106578065
>>106578089
>>106578142
>>106578027
80s, 90s, 2000s movie / tv show styles are always appreciated, also not that hard to caption
Anonymous
9/14/2025, 12:34:42 AM
No.106578069
Anonymous
9/14/2025, 12:34:57 AM
No.106578072
>>106578027
im uploading a couple of datasets if you give me a sec ill post em
Anonymous
9/14/2025, 12:35:35 AM
No.106578074
>>106578128
>>106578026
You are so retarded it's not even funny
Sad that you comment so much but know absolutely nothing about AI training
Anonymous
9/14/2025, 12:37:37 AM
No.106578089
>>106578065
>90s, tv show styles are always appreciated
Anonymous
9/14/2025, 12:38:26 AM
No.106578095
>>106578119
>>106577998
>>106577943
Please post in /adt/ we need you
>>106577893
you too!
>>106578022
please post your good gens in /adt/!
Anonymous
9/14/2025, 12:38:35 AM
No.106578096
>>106578142
Anonymous
9/14/2025, 12:39:46 AM
No.106578104
>>106578026
>there is no excuse for a model as big as chroma to not fit in every drop of knowledge from every booru out there
he said on reddit that the artist tags were gonna be there, it didn't happen
Anonymous
9/14/2025, 12:39:59 AM
No.106578107
rocketjeet is so desperate, it's pathetic
Anonymous
9/14/2025, 12:41:45 AM
No.106578119
>>106578310
>>106578095
/adt/ endorses API models, so no.
Anonymous
9/14/2025, 12:42:41 AM
No.106578128
>>106578160
>>106578074
>nooo, you don't understand, SDXL (3.5b) had no issue getting all the booru tags, but Chroma (8.9b) just can't do itttttttt
either this anon has an IQ of 50, or we're dealing with lodestone here. there's no way he's not trolling, right? I refuse to believe someone can be this dumb
Anonymous
9/14/2025, 12:43:41 AM
No.106578136
>>106578035
NTA but just with that example almost every image depicting sex is gonna have genitals while a small subset might depict a particular sex act. Anyway I think a lot of chroma's issues are the unconventional way it was trained more than the dataset.
Anonymous
9/14/2025, 12:44:35 AM
No.106578142
>>106578156
>>106578172
>>106578065
https://huggingface.co/silveroxides/Chroma-LoRA-Experiments/tree/main
has 1980s and 2000s lora that work
>>106578096
dataset usually means with captions but i'll try. they seem a little too abstract for even chroma but i guess we'll see
Anonymous
9/14/2025, 12:46:41 AM
No.106578156
>>106578162
>>106578142
>dataset usually means with captions
i can upload mine if youd like but theyre really just shitty joycaption "danbooru-like" tags. i wouldnt think theyd do well with chroma
Anonymous
9/14/2025, 12:46:55 AM
No.106578160
>>106578168
>>106578128
>SDXL (3.5b) had no issue getting all the booru tags
Just stop lying, they didn't get all the booru tags, the FINETUNES of SDXL did, because they were FOCUSED on booru content
This is so tiresome
Anonymous
9/14/2025, 12:47:32 AM
No.106578162
>>106578219
>>106578156
i trained a booru only dataset and it worked fine. send em
>>106578160
Chroma doesn't know a single booru tag, and you find this normal? this motherfucker trained the model with booru images
Anonymous
9/14/2025, 12:48:49 AM
No.106578172
>>106578539
>>106578142
>has 1980s and 2000s lora that work
I'm sure they work but how effective are they given how old they are with all the training that has happened since ?
I mean initially Flux loras worked well, but that quickly changed as training progressed and the underlying models diverged
Anonymous
9/14/2025, 12:49:46 AM
No.106578176
>>106578218
why do chromakeks not even understand their own model? the fact that it was trained on booru images/tags but hasn't learned any artist tags or characters is quite concerning. perhaps anti-chroma 'schizos' were right that it was trained wrong and schnell is a mess of a base model
Anonymous
9/14/2025, 12:50:20 AM
No.106578183
>>106578209
>>106578168
>trained the model with booru images
Captioned by Gemini
Please stop being so goddamn retarded
Anonymous
9/14/2025, 12:50:22 AM
No.106578184
>>106578223
Anonymous
9/14/2025, 12:51:53 AM
No.106578201
>seedream paid jeet force gets clowned on in the thread
>unprompted random seethe spam about local models and chroma in the next thread
ooooooooooooooooooooo im nooooticiiing
Anonymous
9/14/2025, 12:52:28 AM
No.106578210
>>106578244
>>106578267
>chroma is shit
>t-the seedream shills caused this!
LMAO!
Anonymous
9/14/2025, 12:53:12 AM
No.106578218
>>106578593
>>106578176
>>106578209
ok, so the furry did train on tags? then what went wrong??
Anonymous
9/14/2025, 12:53:37 AM
No.106578219
>>106578539
>>106578162
okay all are uploaded now, the garmash ones are fresh
>they seem a little too abstract for even chroma
i was surprised at how well noob took to them so i can only hope chroma will be absolute kino, if its anywhere as good as what i saw anon post when flux dev loras were new
looking forward to seeing yours
Anonymous
9/14/2025, 12:53:42 AM
No.106578223
>>106578184
>visual representation of all the errors holding me down from training flux
y-you too..
Anonymous
9/14/2025, 12:53:48 AM
No.106578224
>>106578275
chroma can't do artist tags:
>IT WASNT TRAINED LIKE THAT ITS A BASE MODEL NOT A BOORU FINETUNE!!
it was trained on booru art:
>WELL IT WAS RECAPTIONED TO BE MORE ACCURATE TO NATURAL LANGUAGE
he said himself he preserved the tags:
>SEEDREAM SHILL!!!!
Anonymous
9/14/2025, 12:54:41 AM
No.106578232
>>106577883 (OP)
>Trying to Piss You Off Edition
Anons are still falling for it
Anonymous
9/14/2025, 12:56:27 AM
No.106578244
>>106578210
This goy knows whats up
Anonymous
9/14/2025, 12:57:04 AM
No.106578252
>>106578269
>>106578209
>noob and illustrious above pony because of "thousands of artist" tags
absolute bullshit
Anonymous
9/14/2025, 12:57:56 AM
No.106578267
>>106578285
>>106578210
kek, I blame Comfy a little bit though, he didn't implement it right and his ego is too fragile to admit it
https://github.com/comfyanonymous/ComfyUI/pull/7965
Anonymous
9/14/2025, 12:58:22 AM
No.106578269
>>106578305
>>106578326
>>106578252
do you still use the weird chink tagmine spreadsheet website kek does anon still update it?
γγΉγγ«γΌγ
!!FH+LSJVkIY9
9/14/2025, 12:58:28 AM
No.106578270
>>106578303
>>106578884
one eight four
i got banned so many times for showing just a BIT of panties, just to clear the air, to announce to the room so to speak :p
aaaaaaaaaaaaaaaaaaaanyways
Anonymous
9/14/2025, 12:58:30 AM
No.106578271
>it was unironically the pony baker defending chroma's lack of artist tag this entire thread
hooollyyy shit that's pathetic
Anonymous
9/14/2025, 12:58:56 AM
No.106578275
>>106578288
>>106578224
>he said himself he preserved the tags
Well he clearly didn't, or he preserved very selectively
Go ask him in the Discord channel or on reddit if you are considering suicide because your favorite booru tags aren't in Chroma
Speaking of chroma, Chroma radiance is now officially on ComfyUi
https://github.com/comfyanonymous/ComfyUI/pull/9682
Anonymous
9/14/2025, 12:59:42 AM
No.106578285
>>106578336
>>106578267
>comfy can't fix the ~100 or so lines of code for chroma
>can implement 1200 lines for Seedream 4 overnight
maybe seedream DID cause this after all
Anonymous
9/14/2025, 1:00:02 AM
No.106578288
>>106578325
>>106578275
>Well he clearly didn't
he promised he would preserve them though ;-;
>>106578209
Anonymous
9/14/2025, 1:00:14 AM
No.106578290
>>106578278
how do we use it? i have the gguf already downloaded but it errors in nag workflow
Anonymous
9/14/2025, 1:00:52 AM
No.106578298
>the absolute backpedaling
massive defeat for chromashills today
Anonymous
9/14/2025, 1:01:12 AM
No.106578303
>>106578322
>>106578270
Didn't think the mods were that anal, particularly for stylistic stuff
I mean practically all of /a/ would be banned otherwise
Anonymous
9/14/2025, 1:01:23 AM
No.106578305
>>106578326
>>106578269
yes, i still use it. I don't know if it's still being updated
Anonymous
9/14/2025, 1:01:50 AM
No.106578308
>>106578278
Is it good doe?
Anonymous
9/14/2025, 1:01:52 AM
No.106578310
>>106578779
>>106578119
But your electricity and internet are also pay as you go API SaaS services. Stop with the dumb ideology and post your excellent gens in /adt/.
We need you!
Anonymous
9/14/2025, 1:02:07 AM
No.106578313
>>106578278
Are there quants available yet?
Anonymous
9/14/2025, 1:03:42 AM
No.106578322
>>106578303
catbox next time bb <3
+post part of the gen cropped or somethin hehe
Anonymous
9/14/2025, 1:04:06 AM
No.106578324
>>106578818
>>106577983
>mat1 and mat2 shapes cannot be multiplied (2x2304 and 2816x1280)
I hate image generation
Anonymous
9/14/2025, 1:04:14 AM
No.106578325
>>106578339
>>106578400
>>106578288
He broke some promise made in some reddit post half a year ago, you should sue him for releasing this free model without your favorite booru tags!
Anonymous
9/14/2025, 1:04:15 AM
No.106578326
>>106578269
>>106578305
I actually made a wildcards file with them in it so I can roll different pony styles with my seed.
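For anyone wondering how "rolling styles with the seed" works in principle, here is a rough sketch. It assumes a wildcard file is one entry per line and that the pick is driven by the generation seed. The function name and the placeholder style names are made up, and real wildcard extensions have their own syntax and selection logic.

```python
import random

def roll_wildcard(lines: list[str], seed: int) -> str:
    """Pick one wildcard entry deterministically from the seed."""
    entries = [ln.strip() for ln in lines]
    # Skip blank lines and comment lines.
    entries = [e for e in entries if e and not e.startswith("#")]
    return random.Random(seed).choice(entries)

# Placeholder entries; a real file would hold the actual style tags.
styles = ["# my styles", "style_a", "style_b", "", "style_c"]

seed = 1234
prompt = f"portrait of a knight, {roll_wildcard(styles, seed)}"
print(prompt)
```

Tying the pick to the seed means the same seed reproduces the same style along with the same noise, which is what makes a good roll repeatable.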
Anonymous
9/14/2025, 1:04:28 AM
No.106578328
>>106578349
>>106578278
and we train it ... how?
Chroma is shit without training it.
Anonymous
9/14/2025, 1:04:58 AM
No.106578336
>>106578285
Shut the fuck up already chud
Anonymous
9/14/2025, 1:05:22 AM
No.106578339
>>106578352
>>106578325
>in some reddit post
you mean the official chroma announcement post? he only made 2 posts, one where he announced chroma, and the 2nd one is when he finished it
Anonymous
9/14/2025, 1:06:03 AM
No.106578346
>>106577948
just do the calc manually bro
Anonymous
9/14/2025, 1:06:28 AM
No.106578349
>>106578328
By ignoring it. Radiance is a meme for now. Also normal loras work on it anyway. So train on base and use later on any other chroma version.
Anonymous
9/14/2025, 1:06:48 AM
No.106578352
>>106578369
>>106578339
OMG! He can't get away with this!
Anonymous
9/14/2025, 1:07:03 AM
No.106578355
>>106578441
>>106576546
>a whole dataset based on Flux gens
Yikes! I added about 20 synthetic images to my dataset and it slopped up my LoRA something fierce. Too bad they didn't share any prompts/settings for their images, I want to see how they were using Flux. It can either look really good with the right settings, or be very limited with just the basic options in use.
Anonymous
9/14/2025, 1:08:04 AM
No.106578368
>>106578278
Has it finished training or is it still ongoing ?
Also what's with the Chroma 2k thing, what's that about ?
Anonymous
9/14/2025, 1:08:08 AM
No.106578369
>>106578352
>He can't get away with this!
based
Anonymous
9/14/2025, 1:10:31 AM
No.106578386
>>106579373
Anonymous
9/14/2025, 1:11:54 AM
No.106578400
>>106578482
>>106578325
>He broke some promise made in some reddit post half a year ago
when will he refund the guys who supported him on ko-fi because they believed there would be artist tags though? that's false advertising
Anonymous
9/14/2025, 1:14:52 AM
No.106578429
God bless ComfyUI API nodes
Anonymous
9/14/2025, 1:15:45 AM
No.106578437
>>106578476
Anonymous
9/14/2025, 1:16:11 AM
No.106578441
>>106578606
>>106578355
The only legitimate use case is when there aren't enough real images of, for example, a specific object. That being said, it's still not worth it due to the massive slopping
Anonymous
9/14/2025, 1:17:50 AM
No.106578462
Anonymous
9/14/2025, 1:19:40 AM
No.106578475
Anonymous
9/14/2025, 1:19:49 AM
No.106578476
>>106578481
>>106578437
It should read
>don't get mad
>get better
prompt adherence issue?
Anonymous
9/14/2025, 1:20:41 AM
No.106578481
>>106578476
Seems that way. They can't all be winners, plus this prompt is ass, going to make something else
Anonymous
9/14/2025, 1:20:41 AM
No.106578482
>>106578498
>>106578400
Perhaps if they ask for refunds
You didn't pay shit so why are you complaining ? Get a life
Anonymous
9/14/2025, 1:21:38 AM
No.106578489
>>106578505
>>106578515
>>106578168
Point me to a single Flux-based model that can do artists.
Anonymous
9/14/2025, 1:21:57 AM
No.106578493
>>106578499
Is there an "official" chroma 2k workflow? Using the original official chroma workflow on it shows awful artifacts when I go to higher resolutions (2k by 1k for instance).
Anonymous
9/14/2025, 1:22:42 AM
No.106578498
>>106578512
>>106578547
>>106578482
>You didn't pay shit
how do you know that? you saw it in your dream?
Anonymous
9/14/2025, 1:22:43 AM
No.106578499
>>106578509
>>106578566
>>106578493
How many steps?
Anonymous
9/14/2025, 1:23:47 AM
No.106578505
>>106578609
>>106578628
>>106578489
Are you assuming that flux can't learn artists?
Anonymous
9/14/2025, 1:24:02 AM
No.106578509
>>106578499
Tried 26 and 50.
Anonymous
9/14/2025, 1:24:25 AM
No.106578512
>>106578524
>>106578498
>saw it in your dream
dont fall for the subtle shill tactics, they are trying to plant a seed. stay vigilant, localbros
Anonymous
9/14/2025, 1:24:41 AM
No.106578515
>>106578525
>>106578536
>>106578489
that's basically the problem with all modern models. They're all empty. If you wanna prompt something you have to train it yourself.
They're shit.
Anonymous
9/14/2025, 1:25:37 AM
No.106578524
>>106578512
kek, that wasn't intended I swear!
Anonymous
9/14/2025, 1:25:43 AM
No.106578525
>>106578542
>>106578579
>>106578515
Seedream just works
Anonymous
9/14/2025, 1:26:43 AM
No.106578536
>>106578515
>that's basically the problem with all modern models. They're all empty. If you wanna prompt something you have to train it yourself.
>They're shit.
amen
Anonymous
9/14/2025, 1:27:01 AM
No.106578539
>>106578607
>>106578172
the loras work fine. i use them with flash and HD chroma.
only some flux loras work but most do indeed not work properly anymore even if they do influence the output.
>>106578219
shit hoster, see image. use gofile or catbox. the fact anyone still uses mediafire in 2025 is surprising
Anonymous
9/14/2025, 1:27:39 AM
No.106578542
>>106578525
for your generic stock images sure, but let's see some art by greg rutkowski
Anonymous
9/14/2025, 1:28:17 AM
No.106578547
>>106578498
>how do you know that?
lel
Anonymous
9/14/2025, 1:30:59 AM
No.106578566
>>106578499
Never mind I think it was the lora I was using.
Anonymous
9/14/2025, 1:31:00 AM
No.106578567
>>106578653
>>106577787
Imagine being so retarded you can't even understand the advantages of boomer prompting. It doesn't fit into your tiny head that the model that can only understand basic tags is a nightmare to customize to get what you actually want if you want to have any control over the output. SDXL/Noob and all booru-based models suffer from terrible prompt bleed and poor prompt following.
Anonymous
9/14/2025, 1:31:41 AM
No.106578578
chromakek copeymelty
Anonymous
9/14/2025, 1:31:48 AM
No.106578579
>>106578525
>Generic slop, three generic artstyles
>Seedream just works
I mean, it's slightly better than GPT piss filter and one pose, but that's a VERY low bar
Enjoy your fisher price toy
Anonymous
9/14/2025, 1:33:22 AM
No.106578589
>>106578681
You xboxtards don't understand the brilliance of our PS3 goodness!
Anonymous
9/14/2025, 1:33:52 AM
No.106578593
>>106578218
>furry
you answered your own question.
Anonymous
9/14/2025, 1:35:00 AM
No.106578602
>>106577897
api is getting mighty based. this would take 10 hours on local
Anonymous
9/14/2025, 1:35:27 AM
No.106578605
>>106578681
Sega does what nintendon't!
Anonymous
9/14/2025, 1:35:43 AM
No.106578606
>>106578441
I do still use synthetic images in my dataset (mostly detailed close-ups), but they had to be extremely scrutinized before only the best of the best got further touched up in PS, and they're weighted much lower than the rest of the images. Just going full Flux sounds like a recipe for disaster.
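Weighting synthetic images lower, as described above, can be sketched as weighted sampling. The file names and the 0.25 weight are invented for the example; actual trainers usually expose this as per-folder repeats or similar settings rather than this exact code.

```python
import random

# Hypothetical dataset entries; weights are illustrative, not recommendations.
dataset = [
    {"path": "real_001.png", "weight": 1.0},
    {"path": "real_002.png", "weight": 1.0},
    {"path": "synthetic_closeup_001.png", "weight": 0.25},
    {"path": "synthetic_closeup_002.png", "weight": 0.25},
]

def sample_batch(items: list[dict], k: int, rng: random.Random) -> list[dict]:
    """Draw k items with probability proportional to each item's weight."""
    weights = [item["weight"] for item in items]
    return rng.choices(items, weights=weights, k=k)

rng = random.Random(0)
draws = sample_batch(dataset, 10_000, rng)
synthetic = sum(d["path"].startswith("synthetic") for d in draws)
print(f"synthetic share of draws: {synthetic / len(draws):.1%}")
```

With these weights the synthetic images make up half the files but only about a fifth of what gets sampled.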
Anonymous
9/14/2025, 1:35:46 AM
No.106578607
Anonymous
9/14/2025, 1:36:00 AM
No.106578609
>>106578628
>>106578505
Not without a massive Chroma style finetune. Obviously it would have to be magnitudes more. Its lack of artist knowledge is very hard baked into the model, and the same applies to Qwen etc... anything that's greater than 2B parameters is pretty hard to teach from scratch. SD 3.5 did it right in that it knew its artists from the start, though it had its drawbacks.
Anonymous
9/14/2025, 1:37:03 AM
No.106578617
Anonymous
9/14/2025, 1:38:20 AM
No.106578628
>>106578505
>>106578609
Anyways, it's not that Chroma, nor Flux, can't learn artists. But attention is overwhelmed by prompt following. Unless an architecture handles styles separately, it will only learn styles through LoRAs and tunes that focus solely on those styles. Flux is already SOTA at styles that have been trained by the community, and obviously so is Chroma.
Anonymous
9/14/2025, 1:38:29 AM
No.106578630
>>106577941
I'm not smart enough to understand any of the details, is this a big deal or something? you just linked this paper without elaborating further
Anonymous
9/14/2025, 1:38:38 AM
No.106578631
>my dad works at lodestones
Anonymous
9/14/2025, 1:38:46 AM
No.106578632
>>106578684
>>106578686
how organic
Anonymous
9/14/2025, 1:39:23 AM
No.106578637
you retards, my team is better than your team!
Anonymous
9/14/2025, 1:39:53 AM
No.106578641
only one model generates at 4k though
Anonymous
9/14/2025, 1:41:54 AM
No.106578651
>>106578678
>>106578693
/ldg/: 132 / 28 / 1
/adt/: 31/ 138 / 6
Anonymous
9/14/2025, 1:42:01 AM
No.106578653
>>106578673
>>106578718
>>106578567
Neta Lumina handles plain tag prompts and has less concept bleed than chroma.
Anonymous
9/14/2025, 1:43:11 AM
No.106578662
>>106578799
>>106579083
>>106577941
https://reconstruction-alignment.github.io/
>We introduce Reconstruction Alignment (RecA), a resource-efficient post-training method that leverages visual understanding encoder embeddings as dense "text prompts," providing rich supervision without captions. Concretely, RecA conditions a UMM on its own visual understanding embeddings and optimizes it to reconstruct the input image with a self-supervised reconstruction loss, thereby realigning understanding and generation.
that's impressive wtf
Anonymous
9/14/2025, 1:44:52 AM
No.106578673
>>106578718
>>106578653
Yes, but it also takes five times as long to generate
Anonymous
9/14/2025, 1:45:21 AM
No.106578678
>>106578651
if I just spammed every retarded seed variation I get I could fill this thread up with images too
Anonymous
9/14/2025, 1:45:45 AM
No.106578681
>>106578589
>>106578605
They're more like
>Stadia is superior to owning a console
Anonymous
9/14/2025, 1:46:08 AM
No.106578684
>>106578632
He's just going to do this until his caretaker takes his internet away
Anonymous
9/14/2025, 1:46:49 AM
No.106578686
>>106578632
roaches never die
Anonymous
9/14/2025, 1:47:36 AM
No.106578693
>>106578837
>>106578651
adt is 144 / 33 why would you lie about something so easily verifiable kek
Anonymous
9/14/2025, 1:49:53 AM
No.106578713
I can't take your model seriously if you don't package it into an sft
Anonymous
9/14/2025, 1:50:52 AM
No.106578718
>>106578653
>>106578673
The model is not based on Flux.
Anonymous
9/14/2025, 1:55:24 AM
No.106578752
>>106578773
>leave for 2 weeks
>thread is still chromaseethers vs chromekekkers
no new releases lately?
Anonymous
9/14/2025, 1:59:00 AM
No.106578773
>>106578784
>>106578752
i'm slopping hard atm with sneedream it's quite fun throwing artists/styles at it to see what it shits out
Anonymous
9/14/2025, 2:00:05 AM
No.106578779
>>106578310
>But your electricity and internet are also pay as you go API SaaS services.
im adding this to my arsenal of truth nuk3s for when some faggot ITT gets mad at me suggesting renting a GPU
also nice to see civit finally added a chroma category
>>106578773
i thought seedream was saas? is there an open weights release?
>>106578662
>using visual understanding encoder embeddings as dense "text prompts," providing rich supervision without captions
This is fucking genius. Shit like this is EXACTLY what I mean when I say we have so much low hanging fruit and so many more optimizations to find
Anonymous
9/14/2025, 2:03:39 AM
No.106578801
>>106578784
>i thought seedream was saas?
it is
Anonymous
9/14/2025, 2:03:39 AM
No.106578802
Imagine having a life so empty you spend 14 hours a day in a general shilling some off topic service and arguing with anons in a thread for LOCAL image gen. It's almost as if the person doing this suffers from some sort of disability.
Anonymous
9/14/2025, 2:04:52 AM
No.106578811
>>106578784
>i thought seedream was saas?
it is but it's fun to see what non local models can do. don't see the point in limiting yourself to one side of it unless you're just a gooner.
Anonymous
9/14/2025, 2:05:45 AM
No.106578817
>>106578933
>>106578799
can you explain further, I'm sure it's a genius idea but I can't visualize it, what do they do exactly to get this improvement
Anonymous
9/14/2025, 2:05:46 AM
No.106578818
>>106578324
>mat1 and mat2 shapes cannot be multiplied
>I hate image generation
you downloaded the wrong version of a model or text encoder
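What the error is actually saying: a matrix product needs the inner dimensions to match, and here the input has 2304 features per row while the layer's weight expects 2816. A plain-Python sketch of that check (no PyTorch needed; `matmul_shape` is a made-up helper that only mimics the message):

```python
def matmul_shape(a: tuple, b: tuple) -> tuple:
    """Return the result shape of a 2-D matrix product, or raise on mismatch."""
    (m, k1), (k2, n) = a, b
    if k1 != k2:
        raise ValueError(
            f"mat1 and mat2 shapes cannot be multiplied ({m}x{k1} and {k2}x{n})"
        )
    return (m, n)

# Matching inner dimensions: works.
print(matmul_shape((2, 2816), (2816, 1280)))

# The shapes from the error above: 2304 != 2816, so it fails.
try:
    matmul_shape((2, 2304), (2816, 1280))
except ValueError as err:
    print(err)
```

So whatever produces the 2304-wide input does not match what the loaded checkpoint was built for, which fits the mismatched model or text encoder explanation above.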
Anonymous
9/14/2025, 2:06:29 AM
No.106578824
why are people talking about slopdream online here? just ban these trolls
Anonymous
9/14/2025, 2:08:44 AM
No.106578837
>>106578693
it was obviously a joke. how do you have more images than posts
Anonymous
9/14/2025, 2:09:21 AM
No.106578840
Am I the only one who felt actual anger when they saw they added chroma to civit?
I don't know why that model upsets me so much. But the feeling is real.
Anonymous
9/14/2025, 2:09:53 AM
No.106578844
>>106578828
>now some are trolling hard with it
about pretending that chroma is the best model ever because it can do boobs and vagene?
Anonymous
9/14/2025, 2:10:47 AM
No.106578849
>>106578962
>>106578784
>i thought seedream was saas? is there an open weights release?
its a shill campaign
anons discussed it in good faith when it first released, then some time passed, now some are trolling hard with it for some reason no idea why
Anonymous
9/14/2025, 2:14:56 AM
No.106578876
>>106578901
Good morning
Anonymous
9/14/2025, 2:15:59 AM
No.106578884
>>106578979
>>106578270
yeah the difference is anons hate you
Anonymous
9/14/2025, 2:18:51 AM
No.106578901
>>106578945
>>106578954
>>106578876
gm based /ss/ anon
Anonymous
9/14/2025, 2:22:52 AM
No.106578933
>>106578945
>>106579041
>>106578817
>can you explain further, I'm sure it's a genius idea but I can't visualize it, what do they do exactly to get this improvement
Imagine ML models that can understand images but suck at generating them. This new method, Reconstruction Alignment (RecA), fixes that by using the model's own visual understanding as a super-dense prompt.
Instead of relying on crappy text captions that miss most image details, it makes the model reconstruct images using its own semantic understanding. The result appears to be better image generation and editing, and it only takes 27 GPU hours to implement.
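A toy illustration of the general idea only, not RecA itself: a frozen "understanding" encoder turns each input into an embedding, and a trainable "generator" learns to rebuild the input from that embedding with a reconstruction loss, with no captions involved. Everything here (the 2-D "images", the linear encoder and generator, the learning rate) is an assumption made up for the sketch.

```python
import random

def encode(x):
    """Frozen 'understanding' encoder: a fixed, hand-picked linear map."""
    return [x[0] + x[1], x[0] - x[1]]

def generate(W, e):
    """Trainable 'generator': a 2x2 linear map from embedding back to input."""
    return [W[0][0] * e[0] + W[0][1] * e[1],
            W[1][0] * e[0] + W[1][1] * e[1]]

def recon_loss(W, data):
    """Mean squared reconstruction error over the dataset."""
    total = 0.0
    for x in data:
        x_hat = generate(W, encode(x))
        total += sum((h - t) ** 2 for h, t in zip(x_hat, x))
    return total / len(data)

rng = random.Random(0)
data = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(64)]

W = [[0.0, 0.0], [0.0, 0.0]]  # generator starts out knowing nothing
loss_before = recon_loss(W, data)

lr = 0.05
for _ in range(200):
    for x in data:
        e = encode(x)            # condition on the model's own embedding
        x_hat = generate(W, e)   # try to reconstruct the input
        for i in range(2):
            err = x_hat[i] - x[i]
            for j in range(2):
                W[i][j] -= lr * 2 * err * e[j]  # gradient step on squared error

loss_after = recon_loss(W, data)
print(f"loss before: {loss_before:.4f}, after: {loss_after:.6f}")
```

The only supervision signal is the input itself, which is the part the post describes as replacing text captions.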
Anonymous
9/14/2025, 2:25:06 AM
No.106578945
>>106578954
>>106578901
that's not him (i am he), all big tit Latina milfs made with WAN lightning just look the same lol
>>106578933
oh and important to note: this is only relevant for improving multimodal models, not pure text-to-image models
Anonymous
9/14/2025, 2:25:49 AM
No.106578954
>>106578975
>>106579015
>>106578901
>>106578945
post more big titted latinas
Anonymous
9/14/2025, 2:26:50 AM
No.106578962
>>106578986
>>106578990
>>106578849
It's just the new anti-Chroma trolling flavor. Before it was Qwen, now it's Seedream.
Anonymous
9/14/2025, 2:27:07 AM
No.106578964
>noooo seedream doesn't belong here!
Seedream belongs here, I will clear up any misinformation and miscommunication.
The OP states "Discussion and development of local image and video models and UI". ComfyUI, as linked in the OP, by default has Seedream as one of the available options. When first opening ComfyUI you are greeted with a pop-up requesting you to sign up and add tokens. One of the models you can spend tokens on is Seedream. The fact that ComfyUI prompts you to use Seedream before ever mentioning SDXL or Chroma suggests that it's an integral part of ComfyUI, and by extension a core part of ComfyUI discussion.
>But the OP says LOCAL image!!
True, but it also says "and UI". Discussion of ComfyUI includes the discussion of any and all components of ComfyUI, including the locally run code in the comfy_api_nodes/nodes_bytedance.py file. This file is contained locally on my device, and runs locally in my ComfyUI installation. Seedream discussion fits this thread as it falls under discussion of ComfyUI.
Conclusion: You are allowed to post Seedream outputs as long as they are generated using ComfyUI API nodes.
Anonymous
9/14/2025, 2:28:07 AM
No.106578973
>>106578985
Every post he makes is an admission to his suffering and losing
Anonymous
9/14/2025, 2:28:09 AM
No.106578975
>>106578954
>post more big titted latinas
i'm in my "small girls" phase right now, so i'm posting on /b/ for obvious reasons. glad to see you're still around and passionate for video
Anonymous
9/14/2025, 2:28:37 AM
No.106578979
>>106578884
>anons
Again it's only you
Anonymous
9/14/2025, 2:29:16 AM
No.106578980
Anonymous
9/14/2025, 2:29:43 AM
No.106578985
>>106578973
It's really quite odd isn't it. Odd and sad.
Anonymous
9/14/2025, 2:29:44 AM
No.106578986
>>106578997
>>106578962
Seedream is an API locked paypig model. Nano banana at least is free to try and prompt. Truly a wonder why they would pick the chinkshit model over the free one.
Anonymous
9/14/2025, 2:30:00 AM
No.106578990
>>106578962
This
Neet with anti Chroma obsessive compulsive disorder just can't stop
Anonymous
9/14/2025, 2:31:09 AM
No.106578997
>>106579020
>>106580181
>>106578986
talking about google anything is a fast road to being called an indian around these parts
Anonymous
9/14/2025, 2:31:49 AM
No.106579004
>>106579016
>>106579074
Found an incredible artist but her style has evolved so much I'm afraid my vramlet model won't be able to handle her entire body of work at one time
Anonymous
9/14/2025, 2:32:40 AM
No.106579015
>>106578954
SLOP SLOP SLOP
How many "Speed LoRAs" and "_fast" shortcuts did you use with your "Torch compiled" "fp8_scaled" model?
Anonymous
9/14/2025, 2:32:41 AM
No.106579016
>>106579023
>>106579004
Show it to me and I might help you
Anonymous
9/14/2025, 2:32:54 AM
No.106579020
>>106578997
rightfully so
Anonymous
9/14/2025, 2:33:31 AM
No.106579023
>>106579031
Anonymous
9/14/2025, 2:34:40 AM
No.106579031
>>106579035
>>106579023
I'm not high enough for this shit
No
Anonymous
9/14/2025, 2:35:06 AM
No.106579035
Anonymous
9/14/2025, 2:35:30 AM
No.106579041
>>106579074
>>106578933
>Instead of relying on crappy text captions that miss most image details, it makes the model reconstruct images using its own semantic understanding.
so you don't need to caption images anymore?
Anonymous
9/14/2025, 2:39:40 AM
No.106579070
>>106579080
Anonymous
9/14/2025, 2:41:08 AM
No.106579074
>>106579004
>Found an incredible artist but her style has evolved so much I'm afraid my vramlet model won't be able to handle her entire body of work at one time
And by artist I mean "girl on instagram" and by
"her style has evolved" I mean she's aged and her face/body changed
>>106579041
the abstract says "providing rich supervision without captions"
this is where my ML knowledge ends. if the only purpose of captions is indeed supervision, then yeah there's no longer a purpose
remember though that this just makes the text-to-image as good as its image-to-text. apparently image-to-text mogs text-to-image in pretty much every multimodal LLM so this is a very welcome change
Anonymous
9/14/2025, 2:42:20 AM
No.106579080
Anonymous
9/14/2025, 2:42:39 AM
No.106579083
>>106579137
Anonymous
9/14/2025, 2:47:40 AM
No.106579110
>>106579267
made a mistake and posted it in a dead thread, oops
Anonymous
9/14/2025, 2:48:02 AM
No.106579113
>>106578799
>this is EXACTLY what I mean when I say we have so much low hanging fruit
I don't know who you are, but you stole that quote from me.
Which model can I use to make backgrounds and scenery without a subject being the focus like all these noobai/illustrious 1girl models?
Anonymous
9/14/2025, 2:52:06 AM
No.106579137
>>106579202
>>106579362
>>106579083
>worse than Kontext
I'll pass, but this method seems promising, I really believe we'll get Nano Banana's level with this shit
SPRO to unslop + this to make it good = Qwen Image Edit Ultimate <3
Anonymous
9/14/2025, 2:52:07 AM
No.106579138
>>106579486
>>106577883 (OP)
Hey can I ask a really stupid question?
What UI do Image-To-Text apps use?
I'm looking at gemini, qwen or joycaption, and I don't understand if they're supposed to be run with Stable-Dif, Comfy, or something else. Or are they just their own standalone UI?
>I'm having problems installing so I'm trying to figure out if i'm doing something really basic wrong.
Anonymous
9/14/2025, 2:52:45 AM
No.106579148
I know the micropenis pun model has taken a lot of QIE's thunder, but qwen image edit is still pretty good desu.
Anonymous
9/14/2025, 2:55:13 AM
No.106579165
>>106579135
use the "no humans" tag, put 1girl, 1boy etc in negatives. use NegPip instead of the default negatives implementation.
>>106579137
it beats Kontext on three of those columns though, and seems it might be less slopped/better at styles? Also they note in the paper that they could have probably gone further with Bagel, but basically say they ran out of money.
>SPRO to unslop + this to make it good = Qwen Image Edit Ultimate
Not sure if this would work for Qwen Image. It seems to be for multimodal models only. Or maybe there's a way to get it to work?
Anonymous
9/14/2025, 3:00:14 AM
No.106579208
>>106579260
>>106579356
Animating some frazetta paintings
Anonymous
9/14/2025, 3:01:27 AM
No.106579213
>>106579362
>>106579202
>it beats Kontext on three of those columns though,
oh yeah I'm fucking blind, I should get some sleep lol
Anonymous
9/14/2025, 3:03:43 AM
No.106579232
>>106579267
Anonymous
9/14/2025, 3:04:36 AM
No.106579235
>>106579265
>>106579135
eg:
>scenery, no humans, an abandoned factory building, rust, grass, vines, flowers,sunlight, day, red sky, clouds,
>by takamura kazuhiro, by sushio, by sadamoto yoshiyuki , very awa, absurdres, best quality, masterpiece, ultra-detailed, amazing composition,watercolor \(medium\),
>(jpeg artifacts, text, watermark, cropped, censored:-1)
Anonymous
9/14/2025, 3:05:06 AM
No.106579237
Anonymous
9/14/2025, 3:05:33 AM
No.106579240
Anonymous
9/14/2025, 3:06:36 AM
No.106579247
>>106579257
>>106578168
>Chroma doesn't know a single booru tag
no, it knows many. what it does have trouble with is the artist tags that some people would like to use
Anonymous
9/14/2025, 3:06:53 AM
No.106579249
>>106579263
>>106579202
>Not sure if this would work for Qwen Image. It seems to be for multimodal models only.
maybe it can work if you use the visual text encoder?
Anonymous
9/14/2025, 3:07:54 AM
No.106579257
>>106579641
>>106579247
>no, it knows many.
show me some tags it can do (action tags and artist tags)
Anonymous
9/14/2025, 3:08:03 AM
No.106579259
>>106579267
Anonymous
9/14/2025, 3:08:07 AM
No.106579260
Anonymous
9/14/2025, 3:08:38 AM
No.106579263
>>106579249
I think you're right.
Anonymous
9/14/2025, 3:08:39 AM
No.106579265
>>106579202
>>106579235
Whenever I don't prompt for a 1girl and use "no humans" it seems the background quality is greatly reduced, while if a 1girl is there somewhere suddenly the details are pretty sharp and make more sense. Without it the backgrounds almost always have a 1-point perspective and look like early SDXL generic crap.
I think I need better prompts too, but thanks for the tips.
Anonymous
9/14/2025, 3:09:09 AM
No.106579267
Anonymous
9/14/2025, 3:22:26 AM
No.106579356
Anonymous
9/14/2025, 3:23:16 AM
No.106579362
>>106579382
>>106579393
>>106579213
>>106579202
>>106579137
there's a demo
https://huggingface.co/spaces/sanaka87/BAGEL-RecA
so far, not seeing great style results. but the demo space might have bad configs
Anonymous
9/14/2025, 3:25:29 AM
No.106579373
>>106578386
Shit gen
Shit asuka style
Grow up
Anonymous
9/14/2025, 3:26:35 AM
No.106579382
>>106579362
>but the demo space might have bad configs
I've seen this cope enough times to know the model is dead on arrival.
Anonymous
9/14/2025, 3:27:22 AM
No.106579388
>>106579398
>>106579453
My fellow /ldg/entlemen, I recently tried out NAI v4.5 to see how proprietary models are doing and the vibe transfer feature is actually pretty good. Now I want to replicate something similar using local models. I've a few questions for the sages of this thread
1. Is IPAdapter the same thing / good enough? If so, is there a good write-up anywhere? Haven't been able to find a lot
2. If not, does local even have anything similar?
3. If so, does anyone have a comfy workflow I can look at?
Yeah that's pretty much it, I just want vibe transfer at home
Anonymous
9/14/2025, 3:28:06 AM
No.106579393
>>106579417
>>106579470
>>106579362
>https://huggingface.co/spaces/sanaka87/BAGEL-RecA
bagel is such a shit model though, if they do this shit on QIE then maybe that can be interesting
Anonymous
9/14/2025, 3:28:29 AM
No.106579395
>>106579423
Anonymous
9/14/2025, 3:29:05 AM
No.106579398
>>106579459
>>106579459
>>106579388
all local one-shot style transfer options are shit, however style loras absolutely destroy vibe transfer.
Anonymous
9/14/2025, 3:30:56 AM
No.106579417
Anonymous
9/14/2025, 3:31:27 AM
No.106579421
>>106579441
I hate Chroma
I hate Comfy
I hate Asuka fag
I hate SeeDream
Anonymous
9/14/2025, 3:31:42 AM
No.106579423
>>106579429
>>106579395
Yes, I'm posting in it currently.
Anonymous
9/14/2025, 3:32:31 AM
No.106579429
>>106579443
>>106579423
Landscape diffusion general
Anonymous
9/14/2025, 3:33:55 AM
No.106579441
>>106579421
>made the list
wow, I am going to finally download chroma and then spin up some api nodes in celebration. cheers.
Anonymous
9/14/2025, 3:34:12 AM
No.106579443
>>106579464
>>106579429
Long dead and they never posted any tips or helpful comments (I read the threads in full).
Anonymous
9/14/2025, 3:35:18 AM
No.106579453
>>106579459
>>106579388
IPAdapter is superior but you have to dial in the settings. The right settings for you, only you can find.
Anonymous
9/14/2025, 3:37:04 AM
No.106579459
>>106579480
>>106579398
True, but vibe transfer doesn't just copy style. I'm just curious if someone has tried training a model pipeline similar to the one NAI is using with the information we've been given
>>106579453
Does IPA only copy the style or also the "essence" like VT does? That's what I'd like to see as you can just grab style loras otherwise like
>>106579398 said
Anonymous
9/14/2025, 3:37:48 AM
No.106579464
>>106579443
Didn't they post some nice hudson river school stuff ?
Anonymous
9/14/2025, 3:38:30 AM
No.106579470
>>106579393
desu, qwen edit might not benefit as much because it's already so bloated and powerful. I'm more interested in this being a way to power up smaller/debloated models
Anonymous
9/14/2025, 3:40:02 AM
No.106579478
I've just installed kohya ss but when I try to run it a cmd prompt just opens and then closes immediately. Anyone know the fix?
Anonymous
9/14/2025, 3:40:25 AM
No.106579480
>>106579512
>>106579459
You'd have to define "essence". I used to throw a dozen or so images into it and it'd work quite well, really well in fact. The other anon is right in that no one really gives a shit about it anymore and will train a lora instead.
Desu 1.5's IPA is miles better than XL's the last time I used either.
Anonymous
9/14/2025, 3:41:16 AM
No.106579486
Anonymous
9/14/2025, 3:41:23 AM
No.106579487
>>106579508
Anonymous
9/14/2025, 3:44:49 AM
No.106579501
>>106579511
trying to make some new desktop backgrounds
Anonymous
9/14/2025, 3:45:31 AM
No.106579508
>>106579487
Thanks, Ran...
Anonymous
9/14/2025, 3:45:52 AM
No.106579511
>>106579694
Anonymous
9/14/2025, 3:45:57 AM
No.106579512
>>106579531
>>106579480
>1.5 IPA is miles better than XL
Damn, well I'll see if I can find some workflows online and just try it out, thanks. Loras are great but if you have 3 or 4 of them, they have a tendency of deep-frying your image, not to mention that you (or someone else) has to spend a few hours cooking up a good one. Maybe NAI will open source their VT pipeline/models in a year or two...
Anonymous
9/14/2025, 3:46:27 AM
No.106579515
>>106579649
Guys how do the models just KNOW asuka?
Anonymous
9/14/2025, 3:48:09 AM
No.106579531
>>106579512
>Maybe NAI will open source their VT pipeline/models in a year or two...
its not really good enough to really care noob clears nu nai anyway other than text
Anonymous
9/14/2025, 3:48:27 AM
No.106579533
>ran took everything from me
Anonymous
9/14/2025, 3:58:11 AM
No.106579607
seedream stole my will to prompt locally...
Anonymous
9/14/2025, 4:00:26 AM
No.106579621
>>106579724
I honest to god have no idea why you're all shitting yourselves over seed dream. What even makes it good?
Anonymous
9/14/2025, 4:00:33 AM
No.106579622
Ehemmmmmm
Anonymous
9/14/2025, 4:01:28 AM
No.106579628
>>106579642
*yawn*
Anonymous
9/14/2025, 4:04:34 AM
No.106579641
>>106579257
> action
have "drawing (action)", one of the few *action* tags in the boorus as far as I can tell
>artist
I literally just said these are some of the tags it has trouble with.
The implication of it having learned many booru tags from the multiple boorus that went into this also is that indeed it hasn't learned all booru tags. Would be nice to have such a model. It didn't learn AS much though, just probably most of all models so far.
Anonymous
9/14/2025, 4:04:36 AM
No.106579642
>>106579628
SaaS is evil but (you) download their leaked models as fast as (you) can and
(you) despise cloud services but (you) download their models from hugging face, which is literally a cloud service.
(you) are a joke, this is the real /sdg/
Anonymous
9/14/2025, 4:05:27 AM
No.106579645
>schizos out
Anonymous
9/14/2025, 4:06:23 AM
No.106579649
>>106579652
>>106579515
like miku and a few others she is all over the internets even if you don't ingest *booru or anime-leaning parts of social networks or whatever as training data specifically
Anonymous
9/14/2025, 4:07:08 AM
No.106579652
>>106579649
>tranny hands
Anonymous
9/14/2025, 4:13:31 AM
No.106579690
>>106579703
>>106579708
Best way to train a lora locally that isn't kohya? Kohya ss just crashes on start up for me
Anonymous
9/14/2025, 4:14:10 AM
No.106579694
>>106579706
>>106579511
thank you anon
>>106579656
give her a face mask
Anonymous
9/14/2025, 4:14:51 AM
No.106579700
>>106579656
prompt more about facial expression
Anonymous
9/14/2025, 4:15:14 AM
No.106579703
>>106579718
>>106579690
OneTrainer, literally just used it to train a lora
https://github.com/Nerogar/OneTrainer
Anonymous
9/14/2025, 4:15:26 AM
No.106579704
>>106579712
Anonymous
9/14/2025, 4:16:01 AM
No.106579706
>>106579694
forgive the crap video it was just a quick example
Anonymous
9/14/2025, 4:16:28 AM
No.106579708
>>106579690
>Kohya ss just crashes on start up for me
I'd look into that.
Anonymous
9/14/2025, 4:17:27 AM
No.106579712
>>106579714
>>106579704
Whoa momma, very nice. Model?
Anonymous
9/14/2025, 4:18:05 AM
No.106579714
>>106579728
>>106579811
>>106579712
ChromaHD with a self made lora
Anonymous
9/14/2025, 4:18:45 AM
No.106579718
Anonymous
9/14/2025, 4:19:39 AM
No.106579724
>>106579621
>What even makes it good?
you can gen 4k quality natively it seems, no upscaling. prompt adherence is also very good, they actually wrote a paper on their algorithm and why its so good
the aesthetic superiority is subjective as always
Anonymous
9/14/2025, 4:20:17 AM
No.106579728
>>106579714
>Chroma model
Anonymous
9/14/2025, 4:28:07 AM
No.106579789
>>106580082
Anonymous
9/14/2025, 4:28:55 AM
No.106579793
some day those signs will actually say something, hopefully by 2030
Anonymous
9/14/2025, 4:31:44 AM
No.106579811
>>106579858
>>106579714
Is chroma "finished" already? I was under the impression that it's still training. Also, is it a style or character lora? How much vram/time does it take to train one?
Anonymous
9/14/2025, 4:32:39 AM
No.106579818
>>106579995
>>106578036
i tried the model and hated it. it's just a broken pos model that was obviously trained wrong and is very finicky with its settings. I'll just stick to using sdxl finetunes.
Anonymous
9/14/2025, 4:32:54 AM
No.106579820
Anonymous
9/14/2025, 4:34:50 AM
No.106579826
>>106579656
have her suck something
Anonymous
9/14/2025, 4:36:17 AM
No.106579831
Anonymous
9/14/2025, 4:40:55 AM
No.106579858
>>106579868
>>106579875
>>106579811
HD is the finished version but they are still making different versions. I'm using a 5090 and it takes me 8 hours to train
Anonymous
9/14/2025, 4:41:22 AM
No.106579860
Anonymous
9/14/2025, 4:43:38 AM
No.106579868
>>106579901
>>106579858
8 hours on a 5090 is fucked, training an SDXL lora takes like 1-2 hours on my 3090, I shudder to think about how long Chroma would take. That's probably why there are only a handful of loras out on civitai
Anonymous
9/14/2025, 4:45:15 AM
No.106579875
>>106579858
hey wait a second i didnt make this post. where did my post about --fast being still relevant for fp16 text encoders even when using Q8 go??
Anonymous
9/14/2025, 4:46:34 AM
No.106579881
>>106579885
>>106580121
It's just like, why use Chroma when SDXL does the exact same thing but faster?
Anonymous
9/14/2025, 4:47:18 AM
No.106579885
>>106579915
>>106579881
better prompt adherence and can do text
Anonymous
9/14/2025, 4:47:45 AM
No.106579890
ok weird i guess spam filter discarded it, first time i had that happen on the evasion site
anyways, --fast matters if you use Q8 with a fp16 encoder
video not using --fast on Q8: 302 seconds exactly every time
video using --fast on Q8: give or take 250 seconds. --fast introduces a much larger time range that can vary between 240 to 280 seconds but its always faster than not using it
unnoticeable difference to prompt adherence. nothing like going from fp16 to fp8 scaled for t5xxl
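For anyone who wants the numbers above as a ratio, here is a rough bit of arithmetic on the quoted timings only. The function names are made up for illustration, and the 240 to 280 second spread is taken at face value from the post rather than measured again.

```python
# Timings quoted in the post above, in seconds per video on Q8.
baseline = 302.0             # without --fast
fast_typical = 250.0         # with --fast, "give or take"
fast_range = (240.0, 280.0)  # reported spread with --fast


def speedup(before: float, after: float) -> float:
    """How many times faster the 'after' run is than the 'before' run."""
    return before / after


def time_saved_pct(before: float, after: float) -> float:
    """Wall-clock time saved, as a percentage of the 'before' run."""
    return 100.0 * (before - after) / before


typical = speedup(baseline, fast_typical)
best = speedup(baseline, fast_range[0])
worst = speedup(baseline, fast_range[1])

print(f"typical: {typical:.2f}x faster, {time_saved_pct(baseline, fast_typical):.1f}% less time")
print(f"range:   {worst:.2f}x to {best:.2f}x faster")
```

Even the slow end of the reported range still comes out ahead of the 302 second baseline, which matches the "always faster" claim.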
Anonymous
9/14/2025, 4:48:46 AM
No.106579896
Anonymous
9/14/2025, 4:49:36 AM
No.106579898
>>106579911
>>106580119
>Pruned Qwen Image in half (10B params). It needs a lot of training to be useful, so I decided to make it a pixel space model. Patching pixel space with 32x32 patches. Samples are the current 10B latent version and the pixel space version. Both will need a lot more training.
https://xcancel.com/ostrisai/status/1966987356357226612
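To make "patching pixel space with 32x32 patches" concrete, here is a minimal sketch of generic ViT-style patchifying in NumPy. This is not taken from the linked project; the function names, the 1024x1024 example size, and the channel layout are all assumptions for illustration.

```python
import numpy as np


def patchify(image: np.ndarray, patch: int = 32) -> np.ndarray:
    """Split an (H, W, C) image into flattened non-overlapping patches.

    Returns an array of shape (num_patches, patch * patch * C).
    """
    h, w, c = image.shape
    assert h % patch == 0 and w % patch == 0, "image must divide evenly into patches"
    gh, gw = h // patch, w // patch
    x = image.reshape(gh, patch, gw, patch, c)
    x = x.transpose(0, 2, 1, 3, 4)  # (gh, gw, patch, patch, c)
    return x.reshape(gh * gw, patch * patch * c)


def unpatchify(tokens: np.ndarray, h: int, w: int, c: int = 3, patch: int = 32) -> np.ndarray:
    """Inverse of patchify: rebuild the (H, W, C) image from patch tokens."""
    gh, gw = h // patch, w // patch
    x = tokens.reshape(gh, gw, patch, patch, c)
    x = x.transpose(0, 2, 1, 3, 4)  # (gh, patch, gw, patch, c)
    return x.reshape(h, w, c)


# Hypothetical 1024x1024 RGB image filled with random values.
image = np.random.default_rng(0).random((1024, 1024, 3), dtype=np.float32)
tokens = patchify(image)
print(tokens.shape)  # one row per patch, one column per pixel value in the patch
```

With these assumed sizes, each 32x32 RGB patch carries 32 * 32 * 3 = 3072 raw values and a 1024x1024 image becomes 1024 such tokens, so the model works on pixels directly instead of on VAE latents.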
Anonymous
9/14/2025, 4:51:31 AM
No.106579901
>>106580088
>>106579868
>8 hours on a 5090 is fucked
Totally depends on the number of images and resolution
Have you ever trained at all ?
Anonymous
9/14/2025, 4:52:40 AM
No.106579911
Anonymous
9/14/2025, 4:54:01 AM
No.106579915
>>106579885
>better prompt adherence
Control nets exist and are extremely good. Most of the actual practical uses of Chroma do not even demonstrate the need for its prompt adherence over SDXL either. It's all just 1 girl stuff.
>can do text
Benchmaxxing useless shit.
Anonymous
9/14/2025, 4:54:20 AM
No.106579919
>>106579925
Anonymous
9/14/2025, 4:55:24 AM
No.106579925
>>106579993
>>106579919
you should go back to doing lolis with bulges
Anonymous
9/14/2025, 5:05:59 AM
No.106579990
>>106580004
>>>/b/939770328
Anonymous
9/14/2025, 5:06:25 AM
No.106579993
>>106580004
>>106580009
>>106579925
not in the mood to get another 3 day vacation again. Even posting on /trash/ got me banned and /b/ is full of compressed low res slop gen. Not risking it. just got banned on /v/ and warning the other day on /aco/ for posting young lunafreya.
Anonymous
9/14/2025, 5:06:36 AM
No.106579995
Anonymous
9/14/2025, 5:07:54 AM
No.106580004
>>106579990
you should be banned for linking to /b/ in an AI thread and showing me adults instead of a sexy kid
>>106579993
>not in the mood to get another 3 day vacation again.
ok so just replace the boards 4chan org with k1w1 dot st in your url address bar but with an i instead of a 1 so its the name of the fruit/new zealander
Anonymous
9/14/2025, 5:08:39 AM
No.106580009
>>106579993
fair enough bro cool gens and style tho
Anonymous
9/14/2025, 5:19:52 AM
No.106580071
Anonymous
9/14/2025, 5:22:10 AM
No.106580082
>>106580141
Anonymous
9/14/2025, 5:22:41 AM
No.106580088
>>106579901
>training an SDXL lora takes like 1-2 hours on my 3090
Yes. Yes I have. I don't know how many images other people use, but I haven't gone above 100 yet. I train at 1024 pixels and rank 32-64
Anonymous
9/14/2025, 5:26:10 AM
No.106580108
>>106580195
i regret replying to 106579993 because its obvious he was trolling :/ talking about compressed slop when he posted a sub 200kb jpeg and when i gave him a solution to his only problem he didn't take it. so obviously he could have just left it at "not in the mood" but because he's a left winger he needs to invent problems :/
Anonymous
9/14/2025, 5:27:13 AM
No.106580113
I like Chroma.
Anonymous
9/14/2025, 5:29:27 AM
No.106580119
>>106579898
interesting attempt but i wonder if he can train it enough to confirm his setup is working or not (or retrain if it isn't)
Anonymous
9/14/2025, 5:29:55 AM
No.106580121
>>106580136
>>106580190
>>106579881
i can do 2girls interacting with each other with chroma instead of just 1girl with sdxl+snake oil
Anonymous
9/14/2025, 5:32:21 AM
No.106580136
>>106580147
>>106580121
it works better.
but wan (1 frame for images), hidream, qwen and maybe others are probably actually quite a bit stronger yet. they unfortunately also have less interesting 2girls overall
Anonymous
9/14/2025, 5:33:07 AM
No.106580141
>>106580160
>>106580251
Anonymous
9/14/2025, 5:34:03 AM
No.106580147
>>106580228
>>106580136
i wish someone would give qwen the chroma treatment
Anonymous
9/14/2025, 5:36:27 AM
No.106580160
>>106580205
>>106580215
Anonymous
9/14/2025, 5:38:28 AM
No.106580177
So I want to finally try Qwen out: is Qwen + lightx2v 8step the go-to if I want to use LoRAs?
Anonymous
9/14/2025, 5:39:04 AM
No.106580181
>>106578997
yeah, there is a reason for that anon. iChuds won
Anonymous
9/14/2025, 5:39:41 AM
No.106580187
>>106580200
>>106580206
Alright, I really want to get something good out of Chroma, since Qwen is so slopped for photoreal nsfw at the moment, but goddamnit I just can't get a decent gen to save my life.
Yes it's a skill issue, yes I'm a fag. Can someone share a box of a decent Chroma-HD-Flash image or recommend some settings?
Anonymous
9/14/2025, 5:40:03 AM
No.106580190
>>106580203
>>106580121
pure skill issue if you can't do 2girl with noob
Anonymous
9/14/2025, 5:40:57 AM
No.106580195
>>106580222
Anonymous
9/14/2025, 5:42:00 AM
No.106580200
>>106580187
Whats the last prompt you used?
Anonymous
9/14/2025, 5:42:35 AM
No.106580203
>>106580190
i burnt myself out on anime a long time ago
Anonymous
9/14/2025, 5:43:17 AM
No.106580205
>>106580223
>>106580160
i haven't, though i have found a couple ways to scam infinite credits though such as LMArena and Yupp.
Anonymous
9/14/2025, 5:43:43 AM
No.106580206
>>106580187
Flash tends to suck ass on its own and I get better results with Hyper-low-step at 1.00 and flash lora at 0.4
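A rough sketch of what stacking two LoRAs at different strengths (1.00 and 0.4, as above) does to a single weight matrix, assuming the usual low-rank-delta formulation where each LoRA adds strength * (up @ down) to the base weight. The shapes, ranks, and the apply_loras helper are hypothetical, and real loaders also fold in an alpha/rank scale that is omitted here.

```python
import numpy as np


def apply_loras(weight: np.ndarray, loras) -> np.ndarray:
    """Merge several LoRA deltas into one base weight matrix.

    `loras` is a list of (down, up, strength) tuples, where
    down has shape (rank, d_in) and up has shape (d_out, rank).
    """
    merged = weight.copy()
    for down, up, strength in loras:
        merged += strength * (up @ down)
    return merged


rng = np.random.default_rng(0)
d_out, d_in = 64, 64
base = rng.normal(size=(d_out, d_in))

# Two made-up LoRAs: rank 16 at strength 1.00 and rank 8 at strength 0.4.
hyper = (rng.normal(size=(16, d_in)) * 0.01, rng.normal(size=(d_out, 16)) * 0.01, 1.00)
flash = (rng.normal(size=(8, d_in)) * 0.01, rng.normal(size=(d_out, 8)) * 0.01, 0.40)

merged = apply_loras(base, [hyper, flash])
print("relative change:", np.linalg.norm(merged - base) / np.linalg.norm(base))
```

Because the deltas simply add up, lowering one strength is the obvious knob for keeping the combined shift away from the base weights small.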
Anonymous
9/14/2025, 5:44:05 AM
No.106580207
go back to the cloud threads and leave this one to the grown ups kek
Anonymous
9/14/2025, 5:44:50 AM
No.106580210
>grown ups
>512x512
little baby boy soiled his diaper~~
Anonymous
9/14/2025, 5:45:48 AM
No.106580215
>>106580160
uncensored chinese api sounds mighty tempting.. hard to resist the siren's song
Anonymous
9/14/2025, 5:46:26 AM
No.106580220
Anonymous
9/14/2025, 5:46:27 AM
No.106580221
Anonymous
9/14/2025, 5:46:38 AM
No.106580222
>>106580238
>>106580252
>>106580195
>here is the catbox if it makes you happy
well i was trolling with my post so now i feel bad for making you put in that effort. i hope the other anon appreciates what you shared and replies thanks to you as well
Anonymous
9/14/2025, 5:47:02 AM
No.106580223
>>106580205
>LMArena
yeah that's what i've been using tried yupp yesterday but didn't get too far into it
Anonymous
9/14/2025, 5:47:14 AM
No.106580228
>>106580237
>>106580147
would be nice but you need someone/some entity with even more money to spare than lodestone (who mostly funded chroma, donations only covered the smaller part of expenses until now)
Anonymous
9/14/2025, 5:48:04 AM
No.106580233
christ
Anonymous
9/14/2025, 5:48:28 AM
No.106580237
>>106580241
>>106580277
>>106580228
>who mostly funded chroma, donations only covered the smaller part of expenses until now
Is lodestones just ultrarich?
Anonymous
9/14/2025, 5:48:32 AM
No.106580238
>>106580222
you started it not me i said what i wanted to say already lol
Anonymous
9/14/2025, 5:49:36 AM
No.106580241
>>106580383
>>106580237
Furries seem to be rich, for some weird reason
Anonymous
9/14/2025, 5:50:42 AM
No.106580251
>>106580278
>>106580141
Not local though.
Anonymous
9/14/2025, 5:51:03 AM
No.106580252
>>106580222
it's alright, tonight i have the right amount of energy to slop up some gens and effort post. going to keep cooking.
Anonymous
9/14/2025, 5:55:57 AM
No.106580277
>>106580237
the wealth of the ultrarich is far more insane even if you just look at liquid assets they could easily expend on a sustained basis.
unfortunately we're not getting them to drop good uncensored NSFW models on the public so far.
but he clearly isn't poor
Anonymous
9/14/2025, 5:56:10 AM
No.106580278
>>106580251
He will attempt to discuss it here regardless because he is a faggot
Anonymous
9/14/2025, 6:15:12 AM
No.106580383
>>106580241
>Furries seem to be rich, for some weird reason
there's just a lot of rich people out there bro. some of them are furries. 3.5% of men are pedos. are you a golem who thinks the forbes list of billionaires is all the billionaires on the planet?