
Thread 106585705

369 posts 208 images /g/
Anonymous No.106585705 [Report] >>106585724 >>106587207
/ldg/ - Local Diffusion General
Sometime It's Too Much Edition

Discussion and development of local image and video models and UI

Prev: >>106580216

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106585724 [Report] >>106585823
>>106585705 (OP)
finally, a good collage untainted
Anonymous No.106585727 [Report] >>106585960
WHERE NUNCHAKU QWEN LORA FIX
WHERE NUNCHAKU WAN2.2
Anonymous No.106585767 [Report] >>106585785 >>106586194 >>106588910
So, which Chroma Model is the go-to model now? 2k? Base? HD? Flash?
Anonymous No.106585770 [Report]
>>106585620
Chroma
prodigy
batch 1
150 epochs
~60 pics dataset
A second lora I trained this way but I bumped up the resolution from 512 to 640 this time.
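For scale, a rough step count for a run like this. This is a minimal sketch, assuming exactly 60 images, no dataset repeats, and no gradient accumulation (the post states none of these); the helper name is made up for illustration and does not come from any trainer.

```python
import math

def total_steps(num_images, epochs, batch_size, repeats=1, grad_accum=1):
    """Optimizer steps for a LoRA run (illustrative helper, not a trainer API)."""
    steps_per_epoch = math.ceil(num_images * repeats / batch_size)
    return math.ceil(steps_per_epoch * epochs / grad_accum)

# ~60 pics, batch 1, 150 epochs (repeats and grad_accum are assumptions)
print(total_steps(60, 150, 1))  # 9000

# Going from 512 to 640 per side multiplies the pixel count per image by:
print((640 / 512) ** 2)  # 1.5625
```

So the resolution bump costs roughly 56% more pixels per step, before any bucketing the trainer may apply.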
Anonymous No.106585785 [Report]
>>106585767
Base has better fine detail but 2K handles higher mpx count better. I use 2K for gens and Base for second pass.
Anonymous No.106585812 [Report]
Blessed thread of frenship
Anonymous No.106585823 [Report]
>>106585724
Previous was untainted at least
Anonymous No.106585842 [Report] >>106585960
Qwen SRPO waiting room
Anonymous No.106585877 [Report] >>106588904
Anonymous No.106585912 [Report] >>106585934 >>106585953
why are wanchads abandoning /ldg/?
Anonymous No.106585934 [Report] >>106585960
>>106585912
They'll be back once the nunchaku version is released
Anonymous No.106585953 [Report]
>>106585912
I'm out of ideas.
Anonymous No.106585957 [Report]
she nun on my chaku
Anonymous No.106585960 [Report]
>>106585727
this so much this
>>106585842
this so much this
>>106585934
this so much this
Anonymous No.106586038 [Report]
I hate the muttjeets shilling all their bananas or sneedeam shit in local generals, both are not different of I get with a good workflow for Chroma or Qwen
Anonymous No.106586194 [Report] >>106586220 >>106586277 >>106586410
>>106585767
2k is not done yet so I go to Flash all the time (this is a mix I use for 2k)
Anonymous No.106586203 [Report]
>>106584840
>>106584921
Rescale is more difficult for anon to identify as snake oil because it's aesthetic not performance based. There's a reason it's not discussed anymore.
Anonymous No.106586220 [Report] >>106586273
>>106586194
>(this is a mix I use for 2k)
Is this the 2k weights with the Flash delta applied?
Anonymous No.106586269 [Report] >>106586293 >>106587363
Anonymous No.106586273 [Report]
>>106586220
HD (v50) with Flash delta weights applied
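One way to read "Flash delta applied": take the per-tensor difference between a Flash checkpoint and the checkpoint it was derived from, then add that difference to the HD weights. Below is a toy sketch with plain Python lists standing in for tensors. The key names, the `strength` parameter, and the assumption that the delta is defined as a simple difference are all illustrative and not confirmed in the thread.

```python
def make_delta(tuned, base):
    """Per-key difference tuned - base (toy 'delta weights')."""
    return {k: [t - b for t, b in zip(tuned[k], base[k])] for k in tuned if k in base}

def apply_delta(target, delta, strength=1.0):
    """Return target + strength * delta; keys missing from delta are copied unchanged."""
    merged = {}
    for key, weights in target.items():
        if key in delta:
            merged[key] = [w + strength * d for w, d in zip(weights, delta[key])]
        else:
            merged[key] = list(weights)
    return merged

# Toy values, purely illustrative
base = {"blk.weight": [1.0, 2.0]}
flash = {"blk.weight": [1.5, 1.0]}
hd = {"blk.weight": [2.0, 4.0], "extra.bias": [0.25]}

merged = apply_delta(hd, make_delta(flash, base))
print(merged)  # {'blk.weight': [2.5, 3.0], 'extra.bias': [0.25]}
```

Lowering `strength` below 1.0 would blend in only part of the delta, which is a common knob in merges of this kind.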
Anonymous No.106586277 [Report] >>106586316 >>106586417
>>106586194
Do you run second pass/upscale or are you rawdogging this resolution?
Anonymous No.106586293 [Report] >>106586341 >>106587770
>>106586269
ads used to look good
Anonymous No.106586302 [Report]
>>106585470
what are your gen times?
Anonymous No.106586316 [Report] >>106586358
>>106586277
Anyone have a workflow or even a screenshot of how they set up a second pass ?

Not sure what it means, you run the same seed on the generated image at a low denoise, or ?
Anonymous No.106586341 [Report] >>106587363
>>106586293
Indeed
Anonymous No.106586358 [Report] >>106586376
>>106586316
https://comfyanonymous.github.io/ComfyUI_examples/2_pass_txt2img/
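In short, a second pass means upscaling the first result and sampling it again at a denoise below 1, so only part of the image gets re-noised and redrawn. A small planning sketch follows; the function name, the multiple-of-8 snapping, and the idea that the sampler runs the last `steps` of a longer `steps / denoise` schedule are assumptions about typical behaviour, not something stated in the linked page.

```python
def second_pass_plan(width, height, scale, steps, denoise, multiple=8):
    """Size and rough step budget for an upscale + low-denoise pass (illustrative)."""
    # Snap the upscaled size down to a multiple of 8 (assumed latent-grid constraint)
    new_w = int(width * scale) // multiple * multiple
    new_h = int(height * scale) // multiple * multiple
    # Length of the full schedule if `steps` is only its tail (assumption)
    full_schedule = int(steps / denoise)
    return {"size": (new_w, new_h), "steps_run": steps, "full_schedule": full_schedule}

print(second_pass_plan(1024, 1024, 1.5, 20, 0.5))
# {'size': (1536, 1536), 'steps_run': 20, 'full_schedule': 40}
```

Lower denoise keeps more of the first-pass composition; higher denoise lets the model repaint more detail at the new resolution.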
Anonymous No.106586371 [Report] >>106586401 >>106586463
Anonymous No.106586376 [Report]
>>106586358
Thank you my man!
Anonymous No.106586401 [Report] >>106586463
>>106586371
very nice
Anonymous No.106586410 [Report] >>106586471
>>106586194
Mind sharing that image's catbox?
Anonymous No.106586417 [Report] >>106586592 >>106588952
>>106586277
>second pass/upscale
No, this is the workflow that I use to mix them
https://files.catbox.moe/xh9gv2.png

No second pass. It is quite fast (1:51 per gen, about as fast as a regular Chroma HD gen at 1024 for 30 steps).
Anonymous No.106586421 [Report] >>106586447
Can anyone share a working set of lightning i2v lora for wan2.2?
The default (2.2 i2v lightning at 1 strength for both high and low) is shit, and while I found t2v tricks like reusing wan2.1 loras alongside 2.2 ones to make it better and playing with weights, I don't know about i2v.
Anonymous No.106586447 [Report] >>106586544
>>106586421
I just use the AIO build, it's faster than base 2.2 5b and offers a much higher quality
https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne
Anonymous No.106586463 [Report]
>>106586371
>>106586401
>>106565868
Anonymous No.106586471 [Report]
>>106586410
Sure
https://files.catbox.moe/dwwfc5.png
Keep in mind I default to 8 steps and may take more sometimes to fix imperfections (same as previous Chromas). Though I have had to do that way less with HD, with the mix at 2k it does mess up fingers a bit more but is easy to fix.
Anonymous No.106586493 [Report] >>106586515
>>106586479
Anonymous No.106586515 [Report]
>>106586493
no multiple choice?
Anonymous No.106586544 [Report]
>>106586447
I use the 14B, so not for me.
Thanks anyway anon.
Anonymous No.106586592 [Report] >>106586744 >>106587234
>>106586417
Any reason you use SD clip with padding removal?
Anonymous No.106586642 [Report] >>106586672 >>106586687 >>106586790 >>106586858 >>106586899 >>106587209
Cooking a pepe jonasan lora for Chroma. Attempted to do both natural language description and booru tags within each text file to see if it works better with Chroma.
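A caption file that mixes both styles could be assembled like this. It is a minimal sketch: the function name, the sentence-then-tags ordering, and the underscore-to-space replacement are stylistic assumptions, not anything the post specifies.

```python
def build_caption(description, tags):
    """Join a natural-language description and booru-style tags into one caption."""
    desc = description.strip()
    if desc and not desc.endswith("."):
        desc += "."
    # Replace underscores with spaces (assumed preference, not a requirement)
    clean = [t.strip().replace("_", " ") for t in tags if t.strip()]
    return f"{desc} {', '.join(clean)}".strip()

print(build_caption("A green frog sits at a desk",
                    ["solo", "looking_at_viewer", "simple_background"]))
# A green frog sits at a desk. solo, looking at viewer, simple background
```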
Anonymous No.106586650 [Report]
Anonymous No.106586672 [Report] >>106586753
>>106586642
>and booru tags within each text file to see if it works better with Chroma
Spoiler alert: it won't.
Don't try training Chroma to make images with booru tags in non-natural language, it ain't gonna happen
Anonymous No.106586687 [Report] >>106586753
>>106586642
I haven't done both in the same training, but booru style tags work just as well as natural language for Chroma in my tests, despite the model being trained exclusively on Gemini captions AFAIK
Anonymous No.106586742 [Report] >>106586809
Anonymous No.106586744 [Report] >>106587234
>>106586592
No idea, for some reason the Chroma one gives me an error "ChromaPaddingRemoval
'attention_mask' " which I have never bothered to fix
Anonymous No.106586753 [Report] >>106586762
>>106586687
I agree mine worked well which is why I want to mix it now
>>106586672
It worked well for me so I'm mixing both it was easy to setup with joycaption
Anonymous No.106586762 [Report] >>106586767 >>106586773 >>106586776
>>106586753
>joycaption
nigga, why using that shit at all when a free gemini api exists?
Anonymous No.106586767 [Report] >>106586780 >>106587152
>>106586762
Why the fuck would I use non local to caption porn when I can easily run the model at max performance?
Anonymous No.106586773 [Report] >>106586780
>>106586762
>free gemini api
try to make it caption nsfw stuff
Anonymous No.106586776 [Report]
>>106586762
>>106565868
Anonymous No.106586780 [Report] >>106586784 >>106586786 >>106587137
>>106586767
because joycaption is subpar compared to the cloud models?
And gemini can caption images featuring nudity as long as it isn't hardcore porn and doesn't feature lolis

>>106586773
Works fine on my end
Anonymous No.106586784 [Report]
>>106586780
booru tagging is not a fine art and it has done a great job describing the images. You would only do that if your system couldn't run joycaption at max performance.
Anonymous No.106586786 [Report]
>>106586780
>isn't hardcore porn and doesn't feature lolis
So it's useless
Anonymous No.106586790 [Report] >>106586824 >>106587138
>>106586642
Of all the gens you made, there isn't a single one that caught my attention, much less that I couldn't do with SDXL and a text editor in a third of the time it takes you (5min according to previous threads). But at the same time you're a damn namefag so I wish you the worst. Therefore keep training Chroma, keep wasting time on that, and experiment with Booru tags.
Anonymous No.106586809 [Report] >>106586827
>>106586742
Anonymous No.106586824 [Report] >>106586893
>>106586790
I'm sorry that my existence and learning how to work with loras while learning chroma hurts you
Anonymous No.106586827 [Report] >>106586839
>>106586809
Seedream?
Anonymous No.106586839 [Report] >>106586887
>>106586827
Good old Flux
Anonymous No.106586856 [Report] >>106587151 >>106587184
Anonymous No.106586858 [Report] >>106586867 >>106586869 >>106586938
>>106586642
why are you always bloging posting you dirty avatarfag, you're like 3 years behind lora traning, stfu
Anonymous No.106586861 [Report]
API nodes status?
Anonymous No.106586867 [Report]
>>106586858
He wants to be like Ani
Anonymous No.106586869 [Report] >>106586944
>>106586858
Can you define an avatar, because I don't avatar post. Shit, I leave this general for months. Also your hands are mangled.
Anonymous No.106586873 [Report] >>106586902 >>106586962
https://files.catbox.moe/mjfg3t.png
Anonymous No.106586887 [Report] >>106586931
>>106586839
Anonymous No.106586888 [Report] >>106590231
https://files.catbox.moe/baueij.png
Anonymous No.106586893 [Report] >>106586907
>>106586824
Trying things out for yourself and experimenting is illegal, anon
Anonymous No.106586898 [Report]
Charlie kirk assassination video but big knockers
Anonymous No.106586899 [Report] >>106586907
>>106586642
Imagine being such a namefag that you train garbage loras in a garbage model just to avoid admitting you failed at making anything decent. Peak attention seeking behavior, absolutely retarded.
Anonymous No.106586902 [Report] >>106586911 >>106586925 >>106586959
>>106586873
What crimes has this roastie committed?
Anonymous No.106586905 [Report] >>106586962
https://files.catbox.moe/ulqww0.png
Anonymous No.106586907 [Report] >>106586935
>>106586893
It seems that way,
I want to deprive the schizo so I'm going to post again once the lora is done
>>106586899
Anonymous No.106586911 [Report]
>>106586902
Playing league
Anonymous No.106586915 [Report]
>106586869
KYS nobody needs you here, you are trash
Anonymous No.106586925 [Report]
>>106586902
slopposting
Anonymous No.106586931 [Report]
>>106586887
Anonymous No.106586935 [Report]
>>106586907
You will never be Ani
Anonymous No.106586938 [Report] >>106586951
>>106586858
prompt?
Anonymous No.106586944 [Report]
>>106586869
everybody knows you're ran, aka another annoying avatarfag, you keep posting the same gens over and over, now you finally learn how to train a lora and for some reason you wanna share that info with us, you're always blog posting whatever new stuff you're into, like when you bought a 5090 and you said you were going to be a menace, what happened with that? I remember.
If you wanna blog post so much, you better create a pixiv/X account and share your progress with your followers, not here, we don't care and it's annoying
Anonymous No.106586951 [Report] >>106586998
>>106586938
1girl, pink leotard, large breasts, on one leg, backrooms, vhs style
Anonymous No.106586959 [Report] >>106586972 >>106586984
>>106586902
homicide with a 3d printed gun (this gen is a homage to the Luigi perp walk photos)

https://files.catbox.moe/mrhq12.png
Anonymous No.106586962 [Report] >>106586978
>>106586873
>>106586905
Man, how could they have shat the bed so hard with Qwen-img when Wan (from the same company and possibly the same team) is so good?

We were so fucking close to making it to paradise if only they didn't completely destroy the model with 4o slop during post-training...
Anonymous No.106586972 [Report] >>106586980 >>106586988
>>106586959
damn cool!
I'm a sucker for retro anime. Is this chroma?
Anonymous No.106586978 [Report] >>106587009
>>106586962

I agree with you there. I use Wan for its realism and Chroma for porn, when the Wan porn loras can't follow my prompt correctly

Have you seen tencent/SRPO btw? It unslops Flux, basically

https://files.catbox.moe/jtrxzh.png
Anonymous No.106586980 [Report] >>106586984
>>106586972
Nigga, he literally posted the catbox...
Anonymous No.106586984 [Report]
>>106586959
>>106586980
I just saw it! Wan huh, nice! Thanks
Anonymous No.106586987 [Report] >>106586996
>the perfect UI doesn't exi-
Anonymous No.106586988 [Report] >>106587184
>>106586972
Wan, using the Goldenboy lora
https://files.catbox.moe/eijfo1.png
Anonymous No.106586996 [Report]
>>106586987
nice cars nigga
Anonymous No.106586998 [Report]
>>106586951
thx
Anonymous No.106587009 [Report] >>106587059
>>106586978
>Have you seen tencent/SRPO btw? It unslops Flux, basically
I have, and I am waiting for a hero to train that on Qwen
Anonymous No.106587059 [Report] >>106587072
>>106587009
me too
----
henlo 'puter frens, i need assistance on prompt engineering.
Try as I might, I can't replicate the leftmost image, my best successes are the middle and rightmost images of the collage.
The catboxes with the configuration and prompts are below. Any tips or hints here? tks in advance!
https://files.catbox.moe/ug3nlx.png
https://files.catbox.moe/xjqcyv.png
Anonymous No.106587071 [Report] >>106587095
hey all, anyone have any info on a decent nudify workflow? i've been using qwen and flux kontext but im still getting pretty shitty results. also looking for any discord's or decent forums where i can get help/feedback?
Anonymous No.106587072 [Report] >>106587120
>>106587059
Feed the youtube screenshot to gemini?
Anonymous No.106587095 [Report] >>106587163
>>106587071
just use Wan and any of the clothes stripping loras
Anonymous No.106587120 [Report]
>>106587072
Hmm, good idea. I'll try that out. But I prefer using FOSS models for that, and using text-to-image. I want to pick the ... how should I put this? Chaotic? Improvised? Unstaged? composition/blocking/mood of the original image for other pictures as well. If it could be reliably generated via a text prompt, that would be ideal
Anonymous No.106587137 [Report]
>>106586780
It can caption hardcore porn
Anonymous No.106587138 [Report]
>>106586790
This level of seething...

Mental illness doesn't even begin to describe it
Anonymous No.106587151 [Report]
>>106586856
>punished maid / godzilla crossover
Has this never been done ? Wasted opportunity if not
Anonymous No.106587152 [Report] >>106587174
>>106586767
Because who the fuck cares?? Google already knows you’re a degenerate. You’re being paranoid
Anonymous No.106587163 [Report] >>106587184 >>106587186
>>106587095
im trying to do images specifically, does wan do images? what are the best loras/where can i find them?
Anonymous No.106587174 [Report]
>>106587152
Go away, jew

Seek your shekels elsewhere
Anonymous No.106587184 [Report] >>106587224
>>106586856
Love this image, good stuff!

>>106587163
Ask Wan to do a one-frame "video", and you turned it into an img genner. Pick any of the catboxes I posted (like this one >>106586988) for a Wan T2I flow. It includes a 2x upscaler pass as well
Anonymous No.106587186 [Report] >>106587235
>>106587163
I was talking about videos. Base Wan can do nipples and butts just fine, and nudifying with a video model since it has temporal+spatial awareness and things like breast sizes etc will be more accurate
Anonymous No.106587207 [Report]
>>106583457
>>106585705 (OP)
Based
Anonymous No.106587209 [Report]
>>106586642
Please share your good quality gens in /adt/ too. We're trying to clean up the general from trolls.
We also need more people who don’t use SDXL.

>>106587093 See?
We’re trying to start fresh and keep things good here. Please give us another opportunity.
Anonymous No.106587224 [Report] >>106587289
>>106587184
awesome! thank you for the info, is there anywhere i can get more info on this workflow? i'm sort of a hack and pretty new to this
Anonymous No.106587234 [Report] >>106587301 >>106587319 >>106587369 >>106587410 >>106587625
>>106586744
>>106586592
So I tried the default Comfy workflow that has the padding stuff. Turns out that it messes up the output (introduces fuzziness). To me, this retardation explains why Chroma hasn't taken off yet to civit/Plebbit normies. They are using a broken official workflow. So they conclude the model can only make shitty pics.

This is what comfy says on the workflow
>min_padding 1 is supposed to be the official way to inference chroma but I think the results are better with min_padding 0

But even that has fuzziness.

I can reproduce the fuzziness across every picture on official comfy workflow E.G.
https://files.catbox.moe/txieok.png

Here are the two comparison workflows
https://files.catbox.moe/obeqbv.png
https://files.catbox.moe/z9txlj.png

Not sure if anyone here has reported on this issue before, or maybe it's a Flash only issue.
Anonymous No.106587235 [Report]
>>106587186
nudify a video model? is there anywhere i can get more info on this as well?
Anonymous No.106587237 [Report]
Seedream 4 local when
Anonymous No.106587241 [Report] >>106590381
Anonymous No.106587242 [Report]
whats the best nodepack for joycaption? I just want to run it on a dir and produce txt files to go along with my dataset :)
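Not a nodepack, but if a plain script is acceptable, the directory loop itself is short. This is a sketch under assumptions: `placeholder_caption` is a hypothetical stand-in for whatever captioner call you end up using, and the extension list is a guess at a typical dataset.

```python
from pathlib import Path

IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}

def caption_directory(image_dir, caption_fn, overwrite=False):
    """Write <image stem>.txt next to every image in image_dir; return files written."""
    written = []
    for path in sorted(Path(image_dir).iterdir()):
        if path.suffix.lower() not in IMAGE_EXTS:
            continue
        txt_path = path.with_suffix(".txt")
        if txt_path.exists() and not overwrite:
            continue  # keep existing (possibly hand-edited) captions
        txt_path.write_text(caption_fn(path), encoding="utf-8")
        written.append(txt_path)
    return written

def placeholder_caption(path):
    """Hypothetical stand-in; replace with a real captioning model call."""
    return f"caption for {path.name}"

# Example call (the directory name is illustrative):
# caption_directory("dataset/", placeholder_caption)
```

Skipping existing `.txt` files by default means a rerun only fills in captions for newly added images.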
Anonymous No.106587246 [Report]
>All this negativity
Good vibes only, Anon. Good vibes only.
Anonymous No.106587289 [Report]
>>106587224
I used a workflow from the Mystic NSFW Lora

It's a bit hard to wrap your head around it at first due to the subgraphs and the fact that ComfyUI does not signal where the error is if it comes from a node inside a graph

For example: changing the address of the Lora gives an error on the image subgraph (can't post a screenshot since the machine I'm on doesn't have CUI installed), in the LoraStackerAdv node. You need to click the circle to expand and then correct the lora's address. I set its strength to zero so I wouldn't need to bother changing Loras in two places at once, and it doesn't seem to affect final img quality, prompt following, etc at all, not sure why it's there

Also added a img preview node so that the workflow submenu would display the generated imgs thumbnails, and have a more convenient way to save imgs

https://civitai.com/models/1295758?modelVersionId=2149217
Anonymous No.106587301 [Report] >>106587414
>>106587234
Why no neg prompt?
Anonymous No.106587314 [Report] >>106587752
Anonymous No.106587316 [Report] >>106587626
Anonymous No.106587319 [Report] >>106587369 >>106587482
>>106587234
Hmm, I thought it was a me issue when I stumbled upon this exact same issue, using ChromaHDV10. Got it a touch better using heun + beta scheduler. res_2s + bong_tangent works decently as well

https://files.catbox.moe/eem0ca.png
Anonymous No.106587335 [Report] >>106587353 >>106587580
I am doing some tests with Flux SRPO + NAG (for negative prompts). It does unslop Flux a lot. Might completely replace Chroma for my non-nsfw use cases, especially if I can train Loras on it.

I sure hope someone does SRPO training on Qwen, then we'll be in heaven.
Anonymous No.106587340 [Report]
Does anyone knows how to bypass the safety guidelines in Gemini when it comes to creating the 3D figurines pictures that I've seen going around?
Anonymous No.106587341 [Report] >>106587493
Do I just wait until it starts to increase again or what? Is it supposed to flatline?
Anonymous No.106587353 [Report] >>106587395
>>106587335
Could you kindly share your workflow? Or is it just replacing Flux with SRPO on the CheckpointLoader and fingers crossed your GPU has the VRAM to take it?

https://files.catbox.moe/fj6vs4.png
Anonymous No.106587363 [Report]
>>106586269
>>106586341
hot. box?
Anonymous No.106587369 [Report]
>>106587234
>>106587319
Yeah, nvm, seems to have been a beta scheduler issue
Anonymous No.106587395 [Report] >>106587399 >>106587835
>>106587353
Here:
https://files.catbox.moe/ljzpd9.png
Anonymous No.106587399 [Report]
>>106587395
thanks, fren
Anonymous No.106587410 [Report]
>>106587234
Anonymous No.106587414 [Report] >>106587448
>>106587301
HD Flash doesn't accept one
Anonymous No.106587420 [Report]
fuck kohya ss
Anonymous No.106587422 [Report]
Anonymous No.106587428 [Report]
Anonymous No.106587432 [Report] >>106587689
Anonymous No.106587447 [Report] >>106587483 >>106587497 >>106587518 >>106587547
What is the current state of NSFW generation from the big companies? Iirc Open AI said they would allow some nsfw stuff, has that happened? Is stable diffusion the only good nsfw still?

Have any of these companies realized all people want to do is make porn with it?
Anonymous No.106587448 [Report] >>106587511
>>106587414
Prompt:
>A beautiful Korean idol woman is taking a selfie, flashing a peace sign and a bright smile. She has shoulder-length brown hair and is wearing a white t-shirt. In the background, a screen shows her performing on stage, wearing a crop top and a skirt. The crowd behind her is filled with fans holding up their phones, capturing the moment. The atmosphere is lively and celebratory, with her joyful expression reflecting the excitement of the event.

Pic rel:
>A beautiful Korean idol woman is taking a selfie, flashing a peace sign and a bright smile. She has shoulder-length brown hair and is wearing a white t-shirt. In the background, a screen shows her performing on stage, wearing a crop top and a skirt. The crowd behind her is facing away from the woman, only their backs visible, filled with fans holding up their phones, capturing the moment. The atmosphere is lively and celebratory, with her joyful expression reflecting the excitement of the event.

Chroma just needs some prompt engineering to fix itself kek
Anonymous No.106587456 [Report]
Anonymous No.106587461 [Report]
I am not sure how to handle multiview character sheets in a dataset. should I split them all up?
what about expression sheets, that show head shots of a character making like 10 different expressions?
Anonymous No.106587480 [Report] >>106587548
Anonymous No.106587482 [Report] >>106587545
>>106587319
why chroma?
Anonymous No.106587483 [Report]
>>106587447
>What is the current state of NSFW generation from the big companies?
lol
Anonymous No.106587486 [Report]
I enjoy Chroma.
Anonymous No.106587493 [Report] >>106588034
>>106587341
It has nothing left to learn
Anonymous No.106587495 [Report]
Prompt:
Anonymous No.106587497 [Report] >>106588914
>>106587447
WAN + Porn Loras have great anatomy, but it has that "stock photography" look, which can be corrected with creative lighting loras, but those can sometimes break WAN's otherwise perfect human anatomy

Otherwise, Chroma is best for porn, no need for Loras, and better prompt adherence too

old flux workflow
https://files.catbox.moe/r7p9oz.png
Anonymous No.106587511 [Report]
>>106587448
Pic rel:
Anonymous No.106587518 [Report]
>>106587447
As for paid companies, I don't think they'll go through it, too risky
Anonymous No.106587520 [Report]
im gunna traaaaaiiinnn
Anonymous No.106587534 [Report]
is qwen still broken with sage attention? i haven't pulled in a long time
Anonymous No.106587545 [Report] >>106587561
>>106587482
Porn out of the box, basically. Also, It's the only one that can do a penis correctly
Anonymous No.106587547 [Report] >>106587564
>>106587447
>Iirc Open AI said they would allow some nsfw stuff, has that happened?

It's just a grift to attract local users to their API slop. That will never happen as long as there are ways to prompt for someone that resembles a real person or a child. It'd be impossible because celebs are used in their dataset. 4o is very hard to jailbreak. Jailbroken Dalle is unironically still more uncensored than 4o.

>Is stable diffusion the only good nsfw still?

No, Chroma happened anon (pic rel). It's uncensored Dalle.
Anonymous No.106587548 [Report]
>>106587480
Nine 海楼石, anon
Anonymous No.106587558 [Report]
Anonymous No.106587560 [Report] >>106587571
Anonymous No.106587561 [Report]
>>106587545
>Porn out of the box, basically. Also, It's the only one that can do a penis correctly

valid point
Anonymous No.106587564 [Report] >>106587673
>>106587547
flash? what settings are you using?
Anonymous No.106587570 [Report]
Anonymous No.106587571 [Report] >>106587673
>>106587560
box onegai?
Anonymous No.106587580 [Report]
>>106587335
Anonymous No.106587611 [Report] >>106587938
Anonymous No.106587616 [Report] >>106587647 >>106587701 >>106588315
Man... This shit (Flux SRPO) is basically Chroma minus mangled anatomy (and minus NSFW). If loras have a good effect on it, then Chroma is officially dead to me. And this is coming from one of Chroma's biggest shills ITT

Someone must apply SRPO to Qwen ASAP
Anonymous No.106587625 [Report] >>106588827
>>106587234
>So I tried the default Comfy workflow that has the padding stuff. Turns out that it messes up the output (introduces fuzziness). To me, this retardation explains why Chroma hasn't taken off yet to civit/Plebbit normies. They are using a broken official workflow. So they conclude the model can only make shitty pics.
I don't know why lodestone doesn't care about that, the fix is here, all Comfy has to do is to merge this shit
https://github.com/comfyanonymous/ComfyUI/pull/7965
Anonymous No.106587626 [Report]
>>106587316
Dinner is served
Anonymous No.106587647 [Report] >>106587714
>>106587616
Redpill me on qwen, I've only used it (the edit model) once and told it "see this image? add some text right THERE"
It didn't really work too well, is the base model better for pure t2i? What about lora variety for characters? 2D?
Anonymous No.106587673 [Report] >>106587885
>>106587564
Yeah, Flash
>>106587571
https://files.catbox.moe/g6iq7z.png
Anonymous No.106587689 [Report]
>>106587432
Anonymous No.106587701 [Report] >>106587716 >>106587797 >>106587856
>>106587616
Isn't the entire point of Chroma the NSFW?
Anonymous No.106587706 [Report]
Anonymous No.106587714 [Report] >>106587741
>>106587647
Qwen is the largest and theoretically the most powerful open T2I model, and its prompt alignment is really good.
The problem is that Alibaba completely shat the bed by fine-tuning the model on gpt4o slop and they obliterated the model's ability to make diverse images and photorealism.
Anonymous No.106587716 [Report]
>>106587701
NSFW is the only reason mankind hasn't gone extinct yet
Anonymous No.106587735 [Report] >>106587745
I want to make a lora of a realistic person, and then use it with illustrious.

I've created images of anime characters with style loras and I can get the character in another style, and it's cool. But I don't know how to do the same with people (for example, one of the JD Vance memes in Akira Toriyama style). Whenever I try to make a lora I get a generic brown-haired anime guy, or the character doesn't translate into the style.
Anonymous No.106587741 [Report]
>>106587714
OpenAI should have never released GPT, you're telling me that in addition to slopping the shit out of text gen, it's also ruined image gen?
Anonymous No.106587745 [Report]
>>106587735
I could use flux but I've heard that it is not compatible with sdxl models.
Anonymous No.106587752 [Report]
>>106587314
this is the stuff I'd like to visit in 20 years when you will be able to get into generated images like a game
Anonymous No.106587765 [Report]
>>106585470
was looking forward to these, thanks!
Anonymous No.106587770 [Report] >>106587857
>>106586293
it's not just ads, office approved wardrobe was that before everyone decided suits and skirts were bad in the west for some reason
sad but at least I can gen hot OLs locally
Anonymous No.106587797 [Report] >>106587933
>>106587701
For many people in this thread (myself included), it was mostly about its ability to do photorealism and "unslopped" stuff without any particular bias. Once I can do that with other models without the mangled anatomy, I am never looking back.

(once again, picrel is Flux SRPO with NAG and a bunch of negative prompts)
Anonymous No.106587813 [Report] >>106587835 >>106587922
after using both qwen and chroma i can confidently say that i like neither of them
Anonymous No.106587835 [Report] >>106587845 >>106587850
>>106587813
Try Flux SRPO (and use NAG to add negative prompts)

Workflow here:
>>106587395
Anonymous No.106587845 [Report] >>106587869 >>106587913
>>106587835
can you do nsfw with it
Anonymous No.106587850 [Report] >>106587913
>>106587835
How does it compare to Flux Krea
Anonymous No.106587856 [Report] >>106588564
>>106587701
For me it's the different styles it knows without a LoRA already. That's miles ahead of Qwen, for example.
Anonymous No.106587857 [Report]
>>106587770
Probably a thing until the early 2000s.
Nowadays only specific companies and Asia in general do it properly.
Anonymous No.106587869 [Report] >>106587886
>>106587845
No, it's still Flux.
Anonymous No.106587879 [Report] >>106587938
Anonymous No.106587885 [Report]
>>106587673
posted a wrong pic previously
Anonymous No.106587886 [Report]
>>106587869
i'll keep using sd1.5 then
Anonymous No.106587913 [Report] >>106587918
>>106587845
No. It's just Flux, but unslopped.
>>106587850
Flux Krea is just a glorified lora
Anonymous No.106587918 [Report] >>106588980
>>106587913
>Flux Krea is just a glorified lora
somehow even worse at nsfw
Anonymous No.106587922 [Report]
>>106587813
If you don't like it you can always tune the model to whatever aesthetic you're going for.
Anonymous No.106587924 [Report] >>106587938 >>106587949 >>106588074
Anonymous No.106587933 [Report]
>>106587797
Have you tried using dpm_adaptive with flash? Since it's low step and cfg 1 it shouldn't take forever.
Anonymous No.106587938 [Report]
>>106587611
>>106587879
>>106587924
>>106565868
Anonymous No.106587946 [Report]
rip charlie kek
Anonymous No.106587949 [Report]
>>106587924
model?
Anonymous No.106587961 [Report] >>106588011 >>106588114
can Seedream do non-asians?
Anonymous No.106588011 [Report]
>>106587961
no you need overseedream lora
Anonymous No.106588022 [Report] >>106588030 >>106588042 >>106588045 >>106588053 >>106588058 >>106588114
I know there are zoomers here, so I need to ask, how do you feel when you see unironic posting like this? I'm an ancient millennial and this is probably the first time in my life where I really feel like an old man.
Anonymous No.106588030 [Report]
>>106588022
I don't feel at all.
Anonymous No.106588034 [Report]
>>106587493
Yes, I noticed after testing the epochs, the 1000 step one was the best.
So anyway, after noticing that training a lora on actual comic book panels is retarded, I trained it on the cover art instead, dataset of 27 (2 for validation), 512, rank 8 for this test.
Anonymous No.106588042 [Report]
>>106588022
this would be perfect in facebook, the right mix of cheesy and awful
Anonymous No.106588045 [Report] >>106588079
>>106588022
I don't know who those people are, so I really can't give a shit
Anonymous No.106588053 [Report] >>106588079
>>106588022
who's it?
Anonymous No.106588058 [Report]
>>106588022
I feel like it's kind of fucked to use a picture of someone's dying moments to make a heckin wholesome meme like dawg she was bleeding uncontrollably out her neck right there
Anonymous No.106588074 [Report] >>106588166 >>106588173
>>106587924
Which model was this?
Picrel is Flux SRPO
Anonymous No.106588079 [Report] >>106588115
>>106588045
>>106588053
Ur so lucky. Keep it that way. No one's going to remember this shit in a few weeks anyway.
Anonymous No.106588114 [Report]
>>106588022
the charlie kirk and jesus vid was funny

>>106587961
https://www.reddit.com/r/Bard/comments/1ndul0x/
https://bytedance.larkoffice.com/docx/PBvldM6Xlo5OHKxsRNVcyAq4nFe
can try it out on lmarena like other anons have mentioned, it's alright with some art mediums. i haven't got anything great tho mainly due to throwing schizo prompts at it to see what it shits out.
Anonymous No.106588115 [Report] >>106588126 >>106588195 >>106588281
>>106588079
That's why the only social media platform I'm using is discord, mostly sticking to a few select servers with friends. xitter, instagram, reddit, tiktok and the rest are just brainrot farms at this point and I need my last few brain cells to make it through college
Anonymous No.106588123 [Report] >>106588344
Man... Just imagine if lodestone, ostris & co weren't spending money on useless garbage and just gave us an SRPO version of Qwen. They would likely spend way less money and would deliver us a much better model
Anonymous No.106588126 [Report]
>>106588115
Good lad. You've got your head on straight.
Anonymous No.106588130 [Report]
Anonymous No.106588166 [Report] >>106588225
>>106588074
>actual metal nails on the fingers
lmao
Anonymous No.106588173 [Report] >>106588225
>>106588074
we could tell by the chin
Anonymous No.106588195 [Report] >>106588929
>>106588115
yet here you are
Anonymous No.106588200 [Report] >>106588281
Anyone use Chroma Flash and can tell me how to avoid my gens randomly turning into anime? I'm using silveroxide's Flash lora, and I've tried it with both GGUF and FP8. Prompting for a photo doesn't seem to matter.
Anonymous No.106588218 [Report] >>106588255
Is chroma that good? I constantly see you masturbating about it
Anonymous No.106588225 [Report]
>>106588166
yep, no idea why that happened.
It is otherwise a very good model, at least it produces good hands unlike Chroma
>>106588173
None of the other pics I posted has the Flux chin
Anonymous No.106588255 [Report] >>106588360
>>106588218
People like to speak about it in hyperbole (whether good or bad) but it's okay. It has issues with anatomy sometimes but it trains well, has good prompt adherence, and is uncensored out of the box.
Anonymous No.106588256 [Report]
Someone please go train a Lora on Flux SRPO as the base and report back the results
Anonymous No.106588276 [Report]
Anonymous No.106588281 [Report] >>106588929
>>106588200
Ran into that issue with one of my prompts so far, but not regular Flash, with my mix.
Adjust the prompt? Some ideas: reorder it so that "photograph" is the first token, and mention it multiple times throughout the prompt.
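The reordering idea can be sketched as a small helper. This is only an illustration of the suggestion, not a known fix: the function name `lead_with_medium`, its parameters, and the choice to put the extra mention at the end of the prompt are all assumptions, and whether it actually stops Chroma Flash from drifting into anime is untested.

```python
import re


def lead_with_medium(prompt: str, medium: str = "photograph", repeats: int = 2) -> str:
    """Return the prompt with `medium` as its first word, mentioned `repeats` times in total.

    Hypothetical helper: any existing standalone mention is stripped first,
    then the word is placed at the front and the remaining mentions are
    appended as trailing tags.
    """
    # Drop existing standalone mentions (case-insensitive) with their trailing comma/space.
    stripped = re.sub(rf"\b{re.escape(medium)}\b,?\s*", "", prompt, flags=re.IGNORECASE)
    stripped = stripped.strip(" ,")
    # One mention leads the prompt; the rest go at the end.
    trailing = f", {medium}" * max(repeats - 1, 0)
    return f"{medium}, {stripped}{trailing}"


result = lead_with_medium("a woman on a beach, photograph, golden hour")
print(result)
```

Here the medium word ends up first and once more at the tail; raising `repeats` adds further trailing mentions.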

>>106588115
Youtube is okay as long as you selectively watch types of videos you want and keep away from "Reels".
Anonymous No.106588289 [Report] >>106588305 >>106588949
Is anyone else tired of plastic koreans crowding out all other realistic porn generation
What is even the appea
Anonymous No.106588293 [Report] >>106588317
>china defeated the west
>by making the best and cheapest api model
loooool what a tweest
Anonymous No.106588305 [Report]
>>106588289
just ignore them. i always scroll past them
Anonymous No.106588315 [Report]
>>106587616
Yeah, SRPO can take a prompt vanilla Flux would always fail at and make it usable. Been trying to find a good all-rounder setup for a sampler/scheduler as I'm going through old, ambitious prompts to see what SRPO can do for it. Right now I'm using rk_beta/beta.
Anonymous No.106588317 [Report] >>106588414
>>106588293
some of the gens on their page are p good
Anonymous No.106588330 [Report]
Anonymous No.106588333 [Report] >>106588348
How long before I can take a story from ao3 or literotica and feed it to an ai and it creates videos from it?
Anonymous No.106588336 [Report] >>106588403
When are we going to get an uncensored Qwen finetune?
Anonymous No.106588344 [Report] >>106588467
>>106588123
Uncensored prompt following is just as important as SRPO anon
Anonymous No.106588348 [Report]
>>106588333
This is arguably already somewhat doable.
Anonymous No.106588360 [Report] >>106588453 >>106588498
>>106588255
What about anime? I know about sdxl and illustrious, how does chroma work?
Anonymous No.106588403 [Report]
>>106588336
never because it would cost $500,000 for 3 epochs
Anonymous No.106588414 [Report] >>106588452 >>106588454 >>106588804
>>106588317
Pic rel is Chroma v38 anon
Anonymous No.106588452 [Report]
>>106588414
i can tell lol
Anonymous No.106588453 [Report] >>106588468 >>106588490
>>106588360
It just werks. People keep sperging about muh artists, but I prefer to just do my own stuff.
Anonymous No.106588454 [Report] >>106588804
>>106588414
no, do the beam split jump
Anonymous No.106588467 [Report] >>106588515 >>106588628
>>106588344
Sorry anon, but Chroma is dead. SRPO can output that shit just as good.
We have a new king for photorealism, one that doesn't do mangled limbs as often.
Anonymous No.106588468 [Report]
>>106588453
May hide muh artists with pig latin or rot13
Anonymous No.106588473 [Report] >>106588486
so many reslets itt
2k is the bare minimum
3k is good
4k is ideal
Anonymous No.106588479 [Report]
Hey all,

Since MrDeepFakes and similar sites are toast, and CivitAI etc... got cucked by VISA, where are the communities discussing adult/fringe uses of local and hosted AI?

I'm pretty savvy, have 48GB local vram and access to paid cloud GPUs, and I have workflows built up that I like. But I don't know where to discuss this shit anymore that isn't censored to all fuck. Feels lonely, man.
Anonymous No.106588486 [Report] >>106588506
>>106588473
that's a big space shuttle
Anonymous No.106588490 [Report] >>106588502
>>106588453
What I mean is if it has an equivalent anime model like sdxl>illustrious or is it not necessary for loras?
Anonymous No.106588498 [Report]
>>106588360
It does okay. It's not spectacular at it and it can be hard to prompt for artists and characters. From what I understand Chroma saw a lot of danbooru/e621 stuff during training, so if you want to train your own LoRAs on a particular artist or character then it would probably work well. I'd still say that Noob/Illustrious are a better option though.
Anonymous No.106588502 [Report]
>>106588490
It's a generalist model. I could change the template prompt starting word from "anime illustration" to "photo" or "statue" or whatever I want
Anonymous No.106588506 [Report]
>>106588486
It's a Buran.
Anonymous No.106588515 [Report] >>106588563
>>106588467
Not that anon but until SRPO lets people make those two girls fuck then I don't see the point.
Anonymous No.106588563 [Report] >>106589025
>>106588515
We are on a diffusion general for a blue board. It just doesn't make sense to shill a model that the only thing going for it is porn (which is not going to get posted anyway) when there are better models.
Anonymous No.106588564 [Report] >>106588813
I'm a beginner that has been playing with a few models and flows in ComfyUI (mostly sdxl and flux), so far it has not been great
>>106587856
Is it even possible to generate highly detailed, large and clear images like this on 16GB VRAM, or is there no way around having more?
Anonymous No.106588570 [Report]
>day 3 of trying to train a flux lora
>only thing that "works" is the comfyui flux trainer nodes
>be 16gb vramlet
>had to delete 512 and 1024 data input nodes then set validation settings to 768 or it OOMs
>ITS FINALLY FUCKING WORKING
>come back after 4 hours
>all outputs are black
Anonymous No.106588581 [Report] >>106588585 >>106588694
>he thinks that people are here for sfw because we're on a blue board
Anonymous No.106588585 [Report] >>106589046
>>106588581
well, you should be using the best model you can for the kind of output you are going for. Why bother posting garbage from a subpar model?
Anonymous No.106588590 [Report]
coomerboomers belong on /aco/. you can tell they post here because they're the ones who shill subpar shit like chroma and bigasp
Anonymous No.106588596 [Report]
Anonymous No.106588599 [Report] >>106588676 >>106589030
Why would anyone use Flux trash when you could be using Seedream?
Anonymous No.106588628 [Report] >>106588636
>>106588467
>Sorry anon, but Chroma is dead. SRPO can output that shit just as good.
Good skin texture, but hands are fucked, water is too simplistic looking and it's not uncensored. Prompt flexibility wouldn't be on par with what I can do with Chroma.
Anonymous No.106588636 [Report] >>106589056
>>106588628
>but hands are fucked
are you implying Chroma hands are any better? lmao
Anonymous No.106588653 [Report] >>106588746
>so.... we decided to finetune flux
why do they keep doing this?
Anonymous No.106588676 [Report] >>106588709
>>106588599
Can Seedream gen horse dicks?
Anonymous No.106588694 [Report]
>>106588581
i do, but less than i used to, since a lot of other places play/do interesting things with the new releases
Anonymous No.106588709 [Report]
>>106588676
Calm down Vaush
Anonymous No.106588746 [Report] >>106588765
>>106588653
What do you expect?
1 - Qwen wasn't out
2 - No one cares about Lumina
3 - Every other model (ie Hidream, Wan) would provide diminishing returns
4 - No one is going to do serious work on Chroma since, while it's Apache, it's a model designed around porn and is a clearly unfinished model that produces mangled limbs every so often since lodestone doesn't seem to know what the fuck he is doing
5 - Fine-tuning on ostris models (ie Flex) would not make a lot of difference since they are doing non-commercial work anyway, and using Flex as base would deliver "worse" results
Anonymous No.106588764 [Report] >>106589028
>he forgot about SD3?
>we all did
Anonymous No.106588765 [Report]
>>106588746
>No one cares about Lumina
And that's a bloody shame
Anonymous No.106588804 [Report] >>106588826
>>106588414
Found a way to prompt it for free (LMArena)

Here's Seedream slop with the same prompt. Fingerprinted slopped gen. Looking at its toe count, its coherence is about on par with Chroma, and that's a paid SaaS model that's supposed to be superior. Chroma looks much better (actually, it also looks better than what 4o gives you with similar prompts, and it's not just the natural look).

>>106588454
I know Chroma would nail this, while the Seedream result is just a cherry-picked image. The model isn't as good as they advertise.
Anonymous No.106588813 [Report]
>>106588564
Yeah, just gotta have enough DRAM to offload.
Anonymous No.106588814 [Report]
>1280x
thattsssss notttt seeedreammmmm
Anonymous No.106588826 [Report] >>106588849
>>106588804
Lmao, try to guess which image this is
Anonymous No.106588827 [Report]
>>106587625
The padding setting is there right in the 'official' Comfy workflow, so anyone can just set it to whatever they want

I agree that the default should be what the model creator says, but comfyanon will be comfyanon

Either way it is largely a nothingburger
Anonymous No.106588849 [Report] >>106588863
>>106588826
>Amateur photograph, a Japanese idol woman, performing an advanced contortion pose indoors, likely in a studio setting. She is sitting on a surface with her legs bent backward and extended over her shoulders, so that her feet are positioned and touching over her head, displaying an impressive level of flexibility.

>A white towel is draped over her front for modesty. She has straight black hair with bangs, and she wears a black wristband or watch on one wrist

This is even worse than the Qwen result I got
Anonymous No.106588863 [Report] >>106588875 >>106588877
>>106588849
API models are wasting millions on censorship, while Chroma intractably learns the human body and you can do as you please with it, that's why Chroma will always be superior to API slop.
Anonymous No.106588875 [Report]
>>106588863
I can't wait until the first chink company trains another slop model on additional layer of inbred synthetic data from qwen and creates the first habsburg model.
Anonymous No.106588877 [Report] >>106589031
>>106588863
Are these a pair of tracks in different gauges?
Anonymous No.106588886 [Report] >>106588908
>chroma has anatomy issues and takes eons to gen a single image, any attempt to speed it up nukes prompt adherence
>qwen can do consistent text and anatomy at the cost of literally almost everything else, is too big for anyone to finetune
>even worse, some people spent thousands of dollars on gpus to use models that fight them if they try to do anything that violates the Black Forest Labs Acceptable Use Policy™, meaning they might as well be using saas models
1,000,000 more years of sdxl
1,000,000,000 more years of sd1.5
Anonymous No.106588904 [Report]
>>106585877
What am I doing wrong?
I installed onnx and it downloaded v 19
Anonymous No.106588908 [Report] >>106588915
>>106588886
>chroma has anatomy issues and takes eons to gen a single image

Most of which are fixed with Chroma Flash HD, other than that you're just a vramlet.
Anonymous No.106588910 [Report] >>106588919
>>106585767
2k is promising. For now I'd stick to Base since HD tends to slop prompts that work fine on the other models and randomly decides to blur outputs.
Anonymous No.106588914 [Report]
>>106587497
>but those can sometimes break WAN's otherwise perfect human anatomy
I noticed this as well back when I was training on Flux, closest thing to a remedy was to make sure that the images I trained didn't have complex anatomy like hands, but then you still basically have the Flux 'stock hands' which is unfortunate
Anonymous No.106588915 [Report] >>106588952
>>106588908
>any attempt to speed it up nukes prompt adherence
Anonymous No.106588919 [Report]
>>106588910
2k + base for second pass is a good combo.
Anonymous No.106588929 [Report]
>>106588195
Yep! I browse threads when I'm actively engaging with the content discussed therein. A few days ago I started messing with video gen and ip adapters, so I scoured a few of the threads for information. Once I've figured everything out, I'll probably fuck off again for a while
>>106588281
>Youtube is okay as long as you selectively watch types of videos you want and keep away from "Reels"
I don't really consider youtube a real social media platform in the way that the others are, since the only kind of communication that exists is the comment section. Luckily, it has a way of keeping me away from it as whenever I read more than 3 comments, my brain just melts from the most retarded take I've had the displeasure of reading
Anonymous No.106588935 [Report]
>he finetuned flux schnell
what was he thinking?????????????????????????????
Anonymous No.106588947 [Report]
when will sd1.5 get the chroma treatment?
Anonymous No.106588949 [Report] >>106588956
>>106588289
I like Korean/Japanese girls. Plastic skin? Not so much
Still, I don't exclusively generate them, I don't want to end up like those weirdos on civitai that post the same character in the same pose doing the same thing for DAYS
Anonymous No.106588951 [Report]
>512x512
it already did in 2022!
Anonymous No.106588952 [Report] >>106588969
>>106588915
You can try my workflow, mix of v50 with HD Flash to improve the adherence
>>106586417
Without it Flash can't really follow prompts that well.
Though it has the drawback of being most stable at 2k for deslopping certain prompts, many prompts still work fine at lower res.
Anonymous No.106588956 [Report]
>>106588949
>the same character in the same pose doing the same thing for DAYS
Kek. I had to hide AI from pixiv even though I post my own slop there since it was absolutely unbearable.
Anonymous No.106588969 [Report] >>106589092
>>106588952
Also we got betrayed by nunchaku Chinks
https://github.com/nunchaku-tech/nunchaku/issues/167
Anonymous No.106588980 [Report]
>>106587918
>somehow even worse at nsfw
That should be impossible
Anonymous No.106589025 [Report]
>>106588563
>We are on a diffusion general for a blue board
For LOCAL gen, are you literally retarded, anti-Chroma fag ?
Anonymous No.106589028 [Report]
>>106588764
sd3.5 medium was good and in some ways better than flux.
Anonymous No.106589030 [Report]
>>106588599
>API
Gay.
Anonymous No.106589031 [Report] >>106589048 >>106589067
>>106588877
It's an AI pic
Anonymous No.106589046 [Report]
>>106588585
Because this is a place where people discuss the best models for subject X, even if you can't post it in this thread

I know you are just playing stupid since it's all about people not using Chroma when it comes to you and nothing else, for some insane reason, but still the amount of time and effort you put into that means you must be mentally ill
Anonymous No.106589048 [Report]
>>106589031
>It's an AI pic
chroma*
Anonymous No.106589056 [Report]
>>106588636
>accept my censored model because hands are better
>posts image showing hands are no better
What an absolute failure you are
Anonymous No.106589067 [Report]
>>106589031
I know and it's slopped
Anonymous No.106589078 [Report]
where the fuck are the Q8 ggufs for WAN2.2 VACE?

https://huggingface.co/QuantStack/Wan2.2-VACE-Fun-A14B-gguf/tree/main
Anonymous No.106589092 [Report] >>106589109
>>106588969
Chroma is still getting a Nunchaku version, but it was bumped down when Lodestone started retraining Chroma1-HD, so now it will come after Wan support is done
Anonymous No.106589102 [Report] >>106589111
Poll

https://poal.me/q16fid
Anonymous No.106589109 [Report] >>106589137
>>106589092
>He thinks anyone of those Chinks will work that hard for free

There's a reason we still don't have Chroma after months anon... If miraculously they get Wan running they will just stop development for a year or two and call it a day.
Anonymous No.106589111 [Report] >>106589127 >>106589139
>>106589102
>all that garbage
add seedream, dall-e 3, and gpt-image instead please
Anonymous No.106589127 [Report] >>106589129 >>106589139
>>106589111
what about midjourney
Anonymous No.106589129 [Report] >>106589145
>>106589127
outdated with poor prompt comprehension
Anonymous No.106589137 [Report]
>>106589109
>He thinks anyone of those Chinks will work that hard for free
They're in academia, this is their research
Anonymous No.106589139 [Report] >>106589167 >>106589216
>>106589111
>>106589127
fuck off to >>>/g/DE3/ cloudshitter
Anonymous No.106589145 [Report]
>>106589129
i like a lot of the horror leaning stuff that comes out of there, never used it myself tho
Anonymous No.106589146 [Report] >>106589172
I want to make a lora of an old flashgame character that only has 2 images, both in the same pose. Could someone give me some advice on how to train?
So far it turns out really bad
Anonymous No.106589167 [Report]
>>106589139
you don't need to seethe so hard over models available in comfyui
Anonymous No.106589172 [Report] >>106589182
>>106589146
Use nano banana or seedream to make new images out of the two reference images, pick the best ones, and build your dataset
Anonymous No.106589182 [Report] >>106589199
>>106589172
>nano banana or seedream
I've been out of the game since sdxl dropped, what are those?
Anonymous No.106589199 [Report] >>106589244
>>106589182
jesus christ anon, just ask chatgpt
Anonymous No.106589216 [Report] >>106589250 >>106589450 >>106589563
>>106589139
SAAS fags are desperate

Big tech will drop non-commercial image generation since there's no money in people prompting taylor swift as a pixar character and then quickly growing tired of it

Midjourney is being sued by Disney for generating ~1:1 copies of movie film stills

Meanwhile people are increasingly moving to local to avoid the ever increasing censorship, practically nobody is interested in generating sterile advertising stock photos
Anonymous No.106589244 [Report]
>>106589199
But I wanted to ask g friends :(
Anonymous No.106589250 [Report] >>106589265 >>106589617
>>106589216
Seedream 4 is like the biggest cope that I've ever seen. It's hilarious. I refuse to believe anyone who's tried it actually thought it was a good model. They knew it was a massive flop and thought it would be a funny bait.

Anyways, it does prove one thing, BFL's base models are insane and we are lucky that they gave us Flux, because years later it still shits on every slopped API model that is put out, though nano banana is the closest thing to it yet (not as good as Chroma, but okay).
Anonymous No.106589252 [Report] >>106589257 >>106589273 >>106589297
Tried chroma for realistic cosplay gens (mixed natural language + booru tags) and it's... not really better than illustrious tunes (optionally with a realism refiner model). The text shit is cool but ultimately just a gimmick you can add using a traditional inpainting pipeline. It followed my complex prompt about as well as illustrious tunes, so I have no idea what all the extra 7B parameters are doing. Mangled hands and a lot of plastic skin galore, my face shows signs of great bore
Idk it's probably good for something, but it doesn't really do what I need it to
Anonymous No.106589257 [Report] >>106589313
>>106589252
debo likes to shill for chroma here you can just ignore him
Anonymous No.106589265 [Report]
>>106589250
>BFL
The one caveat is that they are not going all out. Imagine if they were. We wouldn't even be talking about QIE because it would've been laughed off as a meme.
Anonymous No.106589273 [Report] >>106589313
>>106589252
lel, you are so mentally ill

do you think people don't recognize you at this point, anti chroma schizo ?
Anonymous No.106589291 [Report]
Anonymous No.106589297 [Report] >>106589313
>>106589252
>no gen
ok
Anonymous No.106589313 [Report] >>106589337 >>106589340
>>106589257
Other people might have more success with it than me, but after an hour or so of trying a bunch of different sampler combinations and whatnot, I'm not really convinced. Also, trying to use ip adapters with flux is a mess right now, which is a shame
>>106589273
oh no, you got me, the legendary poopdickschizo
>>106589297
Are visible areolae against blue board rules? I can post the last image I genned
Anonymous No.106589337 [Report] >>106589385
>>106589313
>Other people might have more success with it than me
it's not just you going off the chroma gens here
Anonymous No.106589340 [Report] >>106589385
>>106589313
>blue board rules
you know damn well you can catbox it
Anonymous No.106589385 [Report] >>106589463 >>106589534
>>106589337
People rarely post about things working well
>>106589340
I do now. Here: https://files.catbox.moe/tiwg9p.png
Anonymous No.106589415 [Report] >>106589434 >>106589440 >>106589467 >>106589500 >>106589526 >>106589527 >>106589544
https://github.com/FizzleDorf/AniStudio/releases/tag/pre-release

I added new windows binaries. linux ones soonish, just have to fix a couple of things for the build
Anonymous No.106589427 [Report]
Wan 2.2 firstframe-lastframe is amazing, some of you guys should try it
Now that nano banana and seedream can generate image variations accurately, it should help a lot making consistent videos with Wan
>waaaa not local!
Wan videogen is local
Anonymous No.106589434 [Report]
>>106589415
finally some news
Anonymous No.106589440 [Report] >>106589456
>>106589415
what is opencl?
Anonymous No.106589450 [Report]
>>106589216
It’s a shame. If we could somehow take Sora/Gemini's prompt interpretation and apply it to local, it would be the coining endgame. Sora seems to just fucking understand everything I throw at it for stills, even like implied context and shit. Bring it to local chinks I beg you
Anonymous No.106589456 [Report]
>>106589440
pre-cuda tensor library. cuda is technically based off of v2 opencl but they veered off into a different direction. it's good for old stuff that doesn't support cuda or vulkan
Anonymous No.106589463 [Report] >>106589580
>>106589385
Lmao what is this ponyllustrious realism tier shit, is this the power of “modern” local models?
Anonymous No.106589467 [Report]
>>106589415
Go away schizo
Anonymous No.106589485 [Report]
Anonymous No.106589500 [Report]
>>106589415
>the perfect UI doesn't exis-
Anonymous No.106589515 [Report]
>Still not API nodes
Comfy won
Anonymous No.106589526 [Report]
>>106589415
PUT TELEMETRY AND API ACCESS IN IMMEDIATELY!!! I DEMAND IT!!!!!!!!!!
Anonymous No.106589527 [Report] >>106589557
>>106589415
Can I use this over the network? Program on laptop, GPU in server rack
Anonymous No.106589534 [Report] >>106589580
>>106589385
>https://files.catbox.moe/tiwg9p.png
Was this a joke ?
Anonymous No.106589544 [Report]
>>106589415
100% LOCAL TELEMETRY
100% LOCAL API ACCESS

FUCK YEAHHHHH NO MORE SNAKES!!!
Anonymous No.106589557 [Report] >>106589573
>>106589527
not yet. I plan on having a build for webgl and the backend for sdcpp has a server but I need to set up all the logistics. it's more important that I separate sdcpp, opencv and whatever else into addon shared libs and use the already built sdcpp binaries. someone can just do it as a PR, especially the webgl stuff since there is a tut in the imgui example, but the backend needs a bit more organization before that happens
Anonymous No.106589563 [Report] >>106589570
>>106589216
>Meanwhile people are increasingly moving to local to avoid the ever increasing censorship, practically nobody is interested in generating sterile advertising stock photos

Projecting again, grandpa? Zoomers love SaaS
Anonymous No.106589570 [Report]
>>106589563
it’s the redditspacing chromatard. he’s been off his rocker for months
Anonymous No.106589573 [Report] >>106589615
>>106589557
I'm banned from github but cool
Anonymous No.106589580 [Report]
>>106589463
I put the same amount of effort in it as my regular sdxl gens. Actually a little bit more as I had to use 3-4 sentences of natural language instead of a few tags. Maybe I overdid it with the booru tags at the tail end of my prompt, but like, I'm not going to spend 10 hours minmaxxing a prompt just to get a decent image
>>106589534
I wish it was. I'm not sure how well chroma handles non 1024x1024 resolutions, but no one gens at that exact resolution, anyway. If you've any tips to share / pitfalls to avoid, I'd love to hear them. Otherwise I'm going back to my sdxl 1girl slop machine that cranks out more than one image per minute
Anonymous No.106589592 [Report] >>106589597
https://files.catbox.moe/hf5oge.png
Anonymous No.106589593 [Report] >>106589612 >>106589823
Zoomers are all about Steam, Netflix, Disney+, Spotify,
SaaS is here to stay.
If you want to stay relevant in /ldg/ you need to adapt or gtfo boomers.
Anonymous No.106589597 [Report]
>>106589592
Younger please
Anonymous No.106589599 [Report]
reminder that you can use SaaS locally with ComfyUI Api Nodes
Anonymous No.106589602 [Report]
not very organic, get better bait or request higher payment
Anonymous No.106589609 [Report]
Anonymous No.106589612 [Report]
>>106589593
Steam is not like the others though. You buy a product and you get to keep it. Steam's terms even say that if they go under, you'll continue to have access to the content you bought. The rest is subscription slop and I hate it
Anonymous No.106589615 [Report] >>106589675
>>106589573
lol what did you do?
Anonymous No.106589617 [Report] >>106589627
>>106589250
Left a tranny, right an actual woman
You losted Lodestones, fuck off
Anonymous No.106589627 [Report]
>>106589617
Poor bait
Anonymous No.106589675 [Report]
>>106589615
I used it as a login for other services only
Anonymous No.106589748 [Report] >>106589757 >>106589761 >>106589787 >>106589790 >>106589822
I don't know jack shit about ML and diffusion models.
But I asked GPT5 about the SRPO paper and the GPU-hours it used, and to also consider model size, asking how much it would cost to train Qwen-Image on the same pipeline.
It gave me an estimation that it would take about 9 GPU-hours on a cluster with 32 H20s, with a training time of 16 minutes, costing about $70 USD total.

So at first glance, it seems to be something the average anon, myself included, could afford to train (an SRPO version of Qwen). What is the catch?
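A quick sanity check of those figures, assuming the three numbers quoted in the post (32 H20s, 16 minutes, $70) are taken at face value. They are an LLM's estimate relayed by the poster, not measured values, and the helper names below are made up for illustration.

```python
# All inputs are the unverified figures quoted in the post above.
NUM_GPUS = 32             # H20s in the hypothetical cluster
WALL_CLOCK_MINUTES = 16   # quoted training time
TOTAL_COST_USD = 70.0     # quoted total cost


def gpu_hours(num_gpus: int, minutes: float) -> float:
    """GPU-hours = number of GPUs x wall-clock hours."""
    return num_gpus * minutes / 60.0


def implied_rate(total_cost: float, hours: float) -> float:
    """Price per GPU-hour implied by a total cost."""
    return total_cost / hours


hours = gpu_hours(NUM_GPUS, WALL_CLOCK_MINUTES)
rate = implied_rate(TOTAL_COST_USD, hours)
print(f"{hours:.2f} GPU-hours, ${rate:.2f} per GPU-hour implied")
# prints "8.53 GPU-hours, $8.20 per GPU-hour implied"
```

So the "about 9 GPU-hours" figure is at least consistent with 32 GPUs running for 16 minutes, and the $70 total implies roughly $8.20 per GPU-hour. This only checks the arithmetic; it says nothing about whether the 16-minute estimate itself is realistic for Qwen-Image.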
Anonymous No.106589757 [Report]
>>106589748
bold of you to assume chatgpt is correct
Anonymous No.106589761 [Report]
>>106589748
>But I asked GPT5
Ask a real model
Anonymous No.106589763 [Report]
bake bake bake bake b-bake iiiiitttt
Anonymous No.106589787 [Report]
>>106589748
Gemini says something similar
Anonymous No.106589790 [Report] >>106589805
>>106589748
The catch is that if you fuck up, you need to restart the run
Also I sure hope you fact-checked whatever sources gippity gave you
Anonymous No.106589805 [Report]
>>106589790
>you need to restart the run
Which is less of a pain with this drastically reduced time.
Anonymous No.106589815 [Report]
I asked both gpt and gemini locally through comfyapi nodes and they said similar. so both saas and local llms are in agreement
Anonymous No.106589822 [Report]
>>106589748
If GPT could really read papers then implementing nunchaku or Wan for Chroma would be a breeze. If it can't, then why trust it with training settings?
Anonymous No.106589823 [Report]
>>106589593
Yes, that's why you have to go to a local diffusion general on 4chan to shill SAAS

lel
Anonymous No.106589840 [Report]
now triplecheck the paper with grok
Anonymous No.106589843 [Report]
new
>>106589837
>>106589837
>>106586888
nice
Anonymous No.106590381 [Report]
>>106587241
noice