/ldg/ - Local Diffusion General
Anonymous
8/21/2025, 4:28:10 AM
No.106331100
>>106331122
>>106331447
Cursed thread of techlets
elf-hugger
8/21/2025, 4:29:44 AM
No.106331111
Anonymous
8/21/2025, 4:30:40 AM
No.106331122
>>106331158
>>106331100
It's mostly vramletism that's the issue. Can you imagine how many thick 720p mommies would be in this thread everyday otherwise? You'd be able to live off the cellulite alone
Anonymous
8/21/2025, 4:35:23 AM
No.106331158
>>106332265
>>106331122
I wish the guy who made the cakeful latinas wasn’t into the weird pedoshit, his gens would be amazing otherwise.
Anonymous
8/21/2025, 4:38:57 AM
No.106331186
>>106331195
Anonymous
8/21/2025, 4:40:03 AM
No.106331194
>>106331205
What is the meta for upscaling image gens in comfy?
I want fast + reliable + next to no chance of ghosting some deformed denoising nightmare into base image.
>>106331186
Back to /sdg/ please. You've been genning the same woman for two years or more now.
Anonymous
8/21/2025, 4:41:17 AM
No.106331205
>>106331194
either ultimate sd upscale or tile diffusion + tile controlnet
Anonymous
8/21/2025, 4:54:05 AM
No.106331268
>>106331195
at least he posted a gen unlike your schizo ass shitting up the thread daily
Anonymous
8/21/2025, 4:55:46 AM
No.106331272
>>106331288
>>106331444
>>106331195
>You've been genning the same woman for two years or more now.
what gettin no pussy ever does to a mf lmao kek
Anonymous
8/21/2025, 4:56:04 AM
No.106331275
>>106331317
>>106331383
i'm not sure why saying booru tag system is limited is controversial. even something as simple as being able to associate tags would be a major improvement like:
character:a - short hair, smile
character:b - long hair, frown
character:a hugs character:b
background: character:c
my point is how can you develop a system of prompting that allows more control than booru style tags
yes i know you can inpaint and regional prompt, but these are bandaids around an inherently limited system, but regional prompting and inpainting does not solve the problems that a more complex prompting system would fix. regional prompting just applies different prompts to different regions, but doesn't change how you prompt
Anonymous
8/21/2025, 4:57:26 AM
No.106331282
>what is concat conditioning
Anonymous
8/21/2025, 4:57:38 AM
No.106331287
Anonymous
8/21/2025, 4:57:54 AM
No.106331288
>>106331272
yeah, it's not even a good image or cute.
Anonymous
8/21/2025, 4:58:55 AM
No.106331298
Anonymous
8/21/2025, 5:01:32 AM
No.106331317
>>106331275
It’s probably the classic case of “you are criticizing something I like or consider myself good at therefore reeeeeeeeeeeee” and ensuing lack of objectivity.
Anonymous
8/21/2025, 5:01:33 AM
No.106331318
>>106331195
Iinteresting how you never said that to the mikutroon? Hmmm
Anonymous
8/21/2025, 5:09:38 AM
No.106331369
>simply create more spaghetti noodles on your screen instead of making better models
I genuinely do not understand people who find comfyui difficult. Like if you don't like the UI, that's fine, but if your reason is because it's difficult I can only assume you have a double digit IQ or something.
Anonymous
8/21/2025, 5:11:09 AM
No.106331383
>>106331436
>>106331275
Basic masking with regional prompt is much better than polluting everything in a single text input box with a custom, naturally unintuitive prompting format of "character:a - short hair, smile" or anything like that which also requires retraining of all image gen models, captioning models, and prompting guides
You will always need to use regional prompting if you want specific things in a very specific place, and the rest can be taken cared for with basic language with you speaking to an LLM that can help prompt the model for now until we get natively multimodal image gen.
Anonymous
8/21/2025, 5:11:46 AM
No.106331387
>>106331425
Booru tags are MUCH better suited to AI than natural language.
When trying to do natural language you are converting from real text to associations between words and phrases, this is very abstract and hard.
Tags reduce it back to being single words/tokens associating with each other.
When you see the tag "red hair" the model can work with red hair, but working from the tag "The women has green eyes, red hair and is waving to her friend with orange hair" is just convoluted. Simple single token tags are working with the technology not forcing it to work harder.
Anonymous
8/21/2025, 5:12:54 AM
No.106331393
>>106331379
a large portion of the people interested in imagegen, maybe even most people, are abject retards who don't understand how computers work or how to use them in the most basic sense
Anonymous
8/21/2025, 5:13:17 AM
No.106331397
Anonymous
8/21/2025, 5:14:11 AM
No.106331406
>>106331416
>>106331422
booru tags vs natural language is a retarded debate and ideally a model would be able to work with both. you shouldn't have to write an essay to gen basic shit but you should also be able to be descriptive and specific about what you want
>>106331406
natural language is entirely a compromise to get retards able to prompt and damages the concept of an image model
Anonymous
8/21/2025, 5:16:29 AM
No.106331422
>>106331406
Agreed. The simplest test, prompt an ethnicity, fails with boorutags. But they’re still good for lots of other things. I don’t know why autismos here always have to align hard in one direction or another. But they’re like this for everything on this board I guess.
Anonymous
8/21/2025, 5:16:37 AM
No.106331425
>>106331387
Booru tags make sense when the model providing the embeddings is absolute dogshit at understanding the world, but doesn't natural language become better when you consider the fact that modern image gen models use an LLM to create the image embeddings and have a better understanding of the relationship between objects compared to clip models?
Anonymous
8/21/2025, 5:16:47 AM
No.106331429
>>106331416
can't wait to see your next gen tag only model
Anonymous
8/21/2025, 5:17:13 AM
No.106331431
>>106331416
that might genuinely be the single most retarded thing i have ever read in this thread
Anonymous
8/21/2025, 5:17:39 AM
No.106331433
>>106331195
>knowing this at all
>noticing this at all
what no boing boing on cocky do to a mfer
Anonymous
8/21/2025, 5:17:43 AM
No.106331436
>>106331470
>>106331383
how is "character:a - short hair, smile, character:b - long hair, frown" anymore pollution than "short hair, long hair, smile, frown" except that I've associated tags with each other?
no seriously this is mind boggingly stupid thing to say that adding character tags (which most people already do if they're prompting specific characters) is somehow tag pollution?
>regional prompting if you want specific things in a very specific place, and the rest can be taken cared for with basic language with you speaking to an LLM
yeah, specific down to the pixel location. what if i want to have relative descriptions between tags that aren't mask region specific, like saying one character is taller than another? sure LLMs can help, but then how is "character:a - short hair" worse than "Character A has short hair"
the point is to mix some form of natural language, with more technical non-natural language syntax to give the greatest control. i don't see how you can disagree with this except if you just want to criticize my suggestion for merely being a suggestion
it seriously seems like some people can only imagine a natural language or tag list prompting system
Anonymous
8/21/2025, 5:18:32 AM
No.106331444
>>106331460
>>106331272
>>106331434
Did you really need to post this twice?
WALD0RFF
8/21/2025, 5:18:43 AM
No.106331447
Anonymous
8/21/2025, 5:18:52 AM
No.106331450
@comfyanon comfyui
Do you think it might be beneficial to not only encourage websites but also to you yourself have on your comfyui websites a "Copy Workflow" button which just copies to clipboard the relevant workflow that is being talked about just like they have on civitai and then just ctrl+v into comfy instead of having people click to download an image/json, select where to download it, click save, then find it in that folder, hold it and then drag, alt tab and drop it into a prepared comfyui tab, retard?
Anonymous
8/21/2025, 5:19:03 AM
No.106331452
>>106331481
>milk, fridge, table, counter
GEE ANON THIS IS SO DESCRIPTIVE! WHERE DOES THE MILK GO?
Anonymous
8/21/2025, 5:19:47 AM
No.106331458
QIE output wants to apply a Samsung skin smoothing filter on everything it generates, but I know it's capable of making realistic textures. Anyone know the way?
Statler AND WALD0RF
8/21/2025, 5:20:07 AM
No.106331460
>>106331444
triples witnessed, but alas it wasnt me
>great minds think alike
BEAHAGAHAH
Anonymous
8/21/2025, 5:20:08 AM
No.106331461
If someone showed me an /sdg/ thread from two years ago and told me it was from yesterday I would 100% believe them with zero suspicion.
It's literally the same avatars for the last two years.
Anonymous
8/21/2025, 5:21:18 AM
No.106331470
>>106331482
>>106331490
>>106331436
>didnt understand how his "solution" of a custom arbitrary special prompting spec is not a good
based retard nocoder
Anonymous
8/21/2025, 5:21:58 AM
No.106331481
>>106331493
>>106331452
Ah but you see there is another booru tag "Table_Milk". Booru tags win again. In fact. I will only speak in booru tags from now on.
man, typing, captcha, mouse, computer, post
Anonymous
8/21/2025, 5:22:19 AM
No.106331482
>>106331470
>a good
*a good solution
Anonymous
8/21/2025, 5:23:34 AM
No.106331490
>>106331470
sorry my adhoc example is an "arbitrary special prompting spec " do you want me to write an RFC? as if booru style tags wasn't an arbitrary prompting spec selected due to the vast quantity of pre-labled data instead of it being a good spec for image generation
Anonymous
8/21/2025, 5:23:37 AM
No.106331491
i WILL NOT let the past two years be for waste. i know every booru tag. i put more effort into memorizing tags than i did my institutional education.
Anonymous
8/21/2025, 5:23:47 AM
No.106331493
>>106331481
>a man fills out a captcha on a tablet computer, touching on an on-screen keyboard, behind the dialog box is his post
Anonymous
8/21/2025, 5:24:10 AM
No.106331498
>>106331501
>>106331505
>"You should use all the tools available depending on the job, but generally booru tags with simple specific sentences when needed are best today"
>REEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE
Anonymous
8/21/2025, 5:25:27 AM
No.106331501
>>106331498
The ree is someone saying that booru tags are the be-all-end-all of prompting and you can't do better with sentences.
Anonymous
8/21/2025, 5:26:13 AM
No.106331505
>>106331498
Seriously feels like anons spent all this time memorizing tags and are now feel actively attacked at the mere suggestion of anything else lol. Even just supplementation and not replacement is offensive, truly bizarre
Anonymous
8/21/2025, 5:26:58 AM
No.106331507
>>106331521
>>106333603
Anonymous
8/21/2025, 5:28:34 AM
No.106331521
>>106331507
>when dat shy ytboi says da hard r for da first time
nupony is going to hit this bread like a freight truck no one is ready
Anonymous
8/21/2025, 5:32:21 AM
No.106331542
>>106331581
>>106331536
It looks quite underwhelming and it'll feel like it's a year old.
Anonymous
8/21/2025, 5:32:42 AM
No.106331548
>>106331536
is it finally releasing? 2 more weeks?
Anonymous
8/21/2025, 5:33:47 AM
No.106331561
>save us dogfucker!
>save us horsefucker!
Anonymous
8/21/2025, 5:36:22 AM
No.106331581
>>106331583
>>106331734
>>106331536
you wouldnt believe the outputs ive seen
>>106331542
as if the target audience isnt still using a model from 2024
Anonymous
8/21/2025, 5:36:46 AM
No.106331583
>>106331581
The target audience won't move from SDXL for obvious reasons.
Anonymous
8/21/2025, 5:38:53 AM
No.106331589
Anonymous
8/21/2025, 5:39:18 AM
No.106331591
>>106331608
>>106331618
Anonymous
8/21/2025, 5:39:55 AM
No.106331596
target audience is so stupid amirite lol
Anonymous
8/21/2025, 5:41:31 AM
No.106331606
>>106331617
>>106331619
If i use the stuff on perchance can i post it here?
Anonymous
8/21/2025, 5:41:51 AM
No.106331608
>>106331591
i was literally just looking for a 2.2 bj lora
Anonymous
8/21/2025, 5:44:20 AM
No.106331617
>>106331686
>>106331606
Kill yourself.
Anonymous
8/21/2025, 5:44:29 AM
No.106331618
>>106332663
>>106331591
I can't believe people would rather beg for buzz than just train their own.
Anonymous
8/21/2025, 5:44:29 AM
No.106331619
>>106331686
>>106331606
There should be DALL-E or API diffusion general in catalog.
Anonymous
8/21/2025, 5:44:31 AM
No.106331620
>>106331636
>>106330053
What's the point of this over IL?
Anonymous
8/21/2025, 5:44:48 AM
No.106331622
Is there an AI that can change this to white guys?
Anonymous
8/21/2025, 5:46:14 AM
No.106331636
>>106331681
>>106331620
It has gemma so maybe you can do natural language prompts?
Might have some value if it can do nsfw.
Anonymous
8/21/2025, 5:49:31 AM
No.106331656
>>106331778
>>106330194
https://civitai.com/images/95506830
>generated by the author of the Zoot models
Ah. That's why it's horrific, kek.
Anonymous
8/21/2025, 5:49:59 AM
No.106331662
>>106331673
>>106331677
>Hey guys did you hear the random furry discord guy who vibe trains models made a perfect vae free model?
>Also did you hear the pony guy is going to release their next model and it wlll be the best model yet?
How do people keep falling for grifters?
Anonymous
8/21/2025, 5:52:08 AM
No.106331673
>>106331662
most people are weak to hype and media manipulation and immediately submit to the hivemind if there's any positivity around a topic
Anonymous
8/21/2025, 5:52:24 AM
No.106331677
>>106331662
the vae free thing isn't a joke, flux was so poorly designed that it was running that the at actual-pixel-dimensions making it completely redundant to have a vae
Anonymous
8/21/2025, 5:53:20 AM
No.106331681
>>106331703
>>106331636
It can do nsfw as good base illustrious but also with nlp support. My only gripe with it is the quality if the art is kinda meh.
Anonymous
8/21/2025, 5:54:10 AM
No.106331686
>>106331617
Shut up sissy.
>>106331619
Ah i see, thanks for clarifying
Anonymous
8/21/2025, 5:55:14 AM
No.106331695
>>106331699
>>106331757
>update Comfy
>now have redundant row of tabs right on top of each other
Gee, thanks... I guess.
Anonymous
8/21/2025, 5:55:59 AM
No.106331698
Anonymous
8/21/2025, 5:56:05 AM
No.106331699
Anonymous
8/21/2025, 5:57:14 AM
No.106331703
>>106331763
>>106331681
How do speed of gens compare? Closer to SDXL/pony/illustrious or flux/chroma?
I have a fetish that needs NLP. Maybe I should check it out.
>quality if the art is kinda meh
Meh as in dull and boring or is it constantly making deformed slop? I can work around the first. Second makes it worthless.
Anonymous
8/21/2025, 6:02:53 AM
No.106331734
>>106331581
I've been trying it on the Fictional.ai website, at least there it's not particularly impressive at all when I do same-seed / same-prompt comparisons to Chroma (since they have that too). I'm reserving judgement for when I can test it local with proper sampler options and stuff though.
Main problem with it is the 4-channel VAE, it's just very noticeable. It also cannot do text, at all.
Anonymous
8/21/2025, 6:06:56 AM
No.106331757
>>106331767
>>106331847
>>106331695
Can I get metadata for this?
Anonymous
8/21/2025, 6:07:52 AM
No.106331763
>>106331783
>>106331808
>>106331703
Nah composition wise it looks what you would expect out of anime just styles are underbaked and hands are pretty iffy (go in thinking base illustrious 0.1)
>How do speed of gens compare
This is the kicker unfortunately almost double xl gen time so closer to chroma and I doubt anyone is gonna be making a nunchuk of this kek
Anonymous
8/21/2025, 6:08:36 AM
No.106331767
>>106331787
>>106331847
>>106331757
1jenny, sparkly red dress, nighttime, boat
Anonymous
8/21/2025, 6:09:33 AM
No.106331774
>>106331800
>>106330053
its at least more coherent but still lacks heavily
Anonymous
8/21/2025, 6:10:33 AM
No.106331778
>>106331810
>>106331656
>"this objectively fine image is bad because i'm an autistic faggot /hdg/ reject"
kek, if you want to pretend like it wasn't a sensible reply to the guy I was replying to and pretend like the version from the finetune isn't quite clearly overall better for a quality-tags only prompt with no artists used, go right ahead.
Beyond that the only way this is going to go is like it always does when people who apparently heavily dislike me for totally unclear reasons pop up, you'll insist I'm Indian and or Trans or some retarded shit over and over and over and over again while I relentlessly mock you, so why bother?
Anonymous
8/21/2025, 6:11:59 AM
No.106331783
>>106331808
>>106331763 (me)
Also has more knowledge of recent characters but less of older ones. Honestly not sure how that works since it has bigger dataset than noob did I think.
Anonymous
8/21/2025, 6:12:14 AM
No.106331787
Anonymous
8/21/2025, 6:12:55 AM
No.106331794
>>106331803
>>106331805
Any tips on prompts to get the women in my Wan2.2 Kijai workflow to take their shirts off? They just keep dancing
Also, if I dont mind the wait for higher quality, what can I do/mute in the workflow? (tried disabling the lightx2v nodes but that made things noisy/blurry)
Anonymous
8/21/2025, 6:14:46 AM
No.106331800
>>106331774
well I assume 2.0 Plus is probably not the last version the guy is gonna do. He's come a long way just by moving over from the Neta Lumina Beta to 1.0 as a training base, so seems promising I think. It's good he's still working on it IMO, given the Illustrious people for example AFAIK aren't working on their Lumina 2 model anymore I don't think. And I don't think we're gonna get any other not-overly-gigantic 16-channel VAE / good text encoder finetune in the near future.
Anonymous
8/21/2025, 6:14:55 AM
No.106331803
>>106332054
>>106331794
the woman pulls up/down her shirt to reveal her nude breasts
if you disable the light lora you have to increase the steps to like 20 at least
Anonymous
8/21/2025, 6:15:06 AM
No.106331805
>>106332054
>>106331794
How can we help you if we don't know what you have tried and if you have used any LoRAs?
Anonymous
8/21/2025, 6:15:40 AM
No.106331808
>>106331763
Hmmm. Doesn't seem too promising but still worth a shot probably. I am done for today though.
I will check it out tomorrow.
>>106331783
Strange, but this can be lora'd if the base model functions worthwhile.
But probably no one has any idea how to train for it yet (if ever)
Anonymous
8/21/2025, 6:15:46 AM
No.106331810
>>106331856
>>106331778
Kek, no anon I just disagree with you on an aesthetic level. So. Much. But I'm sure you also feel my gens are an abomination of God.
>pretend like the version from the finetune isn't quite clearly overall
I have yet to decide.
Anonymous
8/21/2025, 6:18:19 AM
No.106331822
>>106331841
>>106331849
>(@by atsumi yoshioka:1.22), (@by je o mo:1.15), (@by tonomiya68:1.1), (@by annin cha:1.2), (@by quasarcake:1.2),
kobayashi-san chi no maidragon, #kanna_kamui, flat chest, 1girl, beads, blue eyes, dragon girl, dragon horns, gradient hair, grey hair, hair beads, hair ornament, hairband, horns, long hair, low twintails, multicolored hair, purple hair, solo, sphere hair ornament, tail, twintails, capelet, dress, frilled dress, frilled sleeves, frilled thighhighs, frills, fur-trimmed collar, fur trim, long sleeves, puffy sleeves, thighhighs, two-tone dress, white thighhighs, A young woman seated at a white desk in a data center. She is focused on typing on a black keyboard in front of a glowing computer monitor. The background features multiple server racks with bright orange flames and explosions erupting from them, emitting sparks and smoke. The explosions are vivid, with intense orange and yellow hues, contrasting sharply with the dark, metallic server racks and the cool, blue-tinted lighting of the data center. The woman's calm demeanor contrasts with the chaotic explosion, creating a striking juxtaposition between order and chaos. She is on the left side of the picture.
(masterpiece, best quality, absurdres, highres, 疯狂精致且高质量的画作, 杰作, 最佳质量, 超高分辨率:1.1), (正确的解剖学:1.2), 不是畸形,
so do i have to learn a whole new syntax or what
Anonymous
8/21/2025, 6:23:13 AM
No.106331841
>>106331822
It's nothing crazy @ for styles and # for characters and the weird llm quality prompt thing in the beginning. Other than that prompt how you would prompt illustrious or flux/chroma
Anonymous
8/21/2025, 6:24:04 AM
No.106331847
>>106331757
It's mostly a LoRA and a prompt generated with Gemma 3 27B (and my own custom Sys. Prompt) made from a Titanic screencap.
>>106331767
>1jenny
lol
Anonymous
8/21/2025, 6:24:25 AM
No.106331849
>>106331822
boomerprompt or booruprompt not excessively use both
Anonymous
8/21/2025, 6:25:24 AM
No.106331856
>>106331962
>>106331810
the only significant criticism of my Illustrious model that I ever understood really was of the red-shift oversaturation in the first version, which was caused by some rogue captions. I did fix that in V2 though. V2 isn't perfect either, it can definitely lean slightly too realistic for some prompts, but I overall think I did a good job of balancing it out and improving over V1. And if I ever do a V3 (most likely would be against Illustrious V2.0 Stable as a base, if anything, to take advantage of the fact I already hybrid caption everything with both NLP and tags), I'll keep tweaking that sort of thing more.
Putting my model aside entirely though I don't understand how anyone thinks WAI looks better than this one, for example, that's what I'd use for Illustrious if mine didn't exist:
https://civitai.com/models/131986/cat-citron-anime-treasure-illustrious-and-noobai
Anonymous
8/21/2025, 6:26:09 AM
No.106331862
>>106331910
Reposting from old thread cause I didn't notice it was over already:
Differences between forge/reforge? The comparison link in the guide is dead. Looks like reforge is more active, but the dev is also swapping to a new branch for main development so idk which to pick.
Anonymous
8/21/2025, 6:27:22 AM
No.106331865
>>106331879
>>106331963
Anonymous
8/21/2025, 6:29:01 AM
No.106331879
>>106331950
Anonymous
8/21/2025, 6:33:32 AM
No.106331910
>>106331959
>>106331966
>>106331862
You pick comfy.
Anonymous
8/21/2025, 6:33:36 AM
No.106331911
>>106332204
Anonymous
8/21/2025, 6:35:44 AM
No.106331931
>>106331416
Please don't post anymore.
Anonymous
8/21/2025, 6:37:45 AM
No.106331950
>>106331968
Anonymous
8/21/2025, 6:38:21 AM
No.106331959
>>106331994
>>106331910
nobody consciously picks comfy because it's good. people pick comfy because it's alive
Anonymous
8/21/2025, 6:38:34 AM
No.106331962
>>106332077
>>106331856
Illu 2.0 can also go well above 1024px even better than some modern models IMO. The only reason I don't use it more often is because naked NoobAI is too good.
Anonymous
8/21/2025, 6:38:49 AM
No.106331963
Anonymous
8/21/2025, 6:39:02 AM
No.106331965
Good LORA to force realistic gens on chroma (v48)?
It has a tendency to push for cartoonish styles for some of my prompts.
Anonymous
8/21/2025, 6:39:18 AM
No.106331966
>>106331999
>>106331910
why don't A1111niggers just use SD.Next, which is actually actively maintained?
Anonymous
8/21/2025, 6:39:24 AM
No.106331968
>>106331950
because i said so
Anonymous
8/21/2025, 6:44:19 AM
No.106331994
>>106332009
>>106331959
I don't think I could go back to gradio now. I feel like a retard pushing a bit orange generate button.
Anonymous
8/21/2025, 6:44:47 AM
No.106331999
>>106332020
>>106331966
it's buggy and slow. comfy is buggy and ux hostile. a1111 is just ded. forge works but is pretty much ded so no new models. reforge just werks but the dev randomly goes on hiatus
Anonymous
8/21/2025, 6:46:15 AM
No.106332009
>>106332013
>>106331994
the comfy microcuck button is bad too. why is it a floating widget anyways? It's completely retarded
Anonymous
8/21/2025, 6:46:52 AM
No.106332013
>>106332027
>>106332009
you can pin it to the top
Anonymous
8/21/2025, 6:47:37 AM
No.106332020
>>106331999
webshit is gonna webshit. why isn't there a fucking desktop app for this shit?
Anonymous
8/21/2025, 6:48:41 AM
No.106332027
>>106332013
I know, who doesn't? chink vibe coding is something else however
Anonymous
8/21/2025, 6:52:29 AM
No.106332054
>>106332070
>>106332125
>>106331803
that prompt worked thanks! i'm using i2v and it seems to struggle with multiple ppl in the initial image
>if you disable the light lora you have to increase the steps to like 20 at least
the generations took way longer than i expected lol I upped it to like 22 steps. quality gain was sorta minimal, this light lora is impressive actually
>>106331805
I used Kijai's workflow from the reentry, only lora is lightx2v. (although are there loras that work with wan2.2? can i plug em into the same place the lightx2v lora's were?)
>>106332054
The 2.2 LoRAs are a waste of time. It will kill your movement, make anime 2.5D and blow out your colors., but the 2.1 light LoRA works well.
You can do a few steps without the LoRA then feed it to a sampler with the LoRA.
But most importantly, what are you prompting when trying to get them to remove their shirt?
Anonymous
8/21/2025, 6:56:31 AM
No.106332077
>>106331962
Yeah that's another benefit for sure
Anonymous
8/21/2025, 6:57:31 AM
No.106332084
>>106332121
>>106332207
Why the fuck are you all suddenly talking about SDXL again?
Anonymous
8/21/2025, 6:58:50 AM
No.106332098
>>106332279
Anyone have known-good settings for training Qwen Image loras?
Anonymous
8/21/2025, 7:02:32 AM
No.106332121
>>106332084
Qwen and Chroma are dead, anon.
Anonymous
8/21/2025, 7:03:00 AM
No.106332125
>>106332255
>>106332054
there is a saggy breasts lora for 2.2 on civit that will give you better nipples. right click and clone the lora selectors and chain new loras between the light loras and the set lora node. if you're adding loras made for 2.2 set the strength to 1.0 or lower. if you're adding loras made for 2.1 you might need to set it higher than 1.0 especially on the high model because those loras weren't trained for 2.2 but they still kinda work
>>106332070
anon is talking about the lightx2v loras for 2.2 which are garbage
Anonymous
8/21/2025, 7:14:28 AM
No.106332204
>>106331911
What do you do to make it not turn into 3D? Is it just based on the starting image aesthetic being clearly flat shading?
Anonymous
8/21/2025, 7:14:33 AM
No.106332206
Anonymous
8/21/2025, 7:14:38 AM
No.106332207
>>106332209
>>106332084
I can't run anything else at normal speeds
Anonymous
8/21/2025, 7:14:58 AM
No.106332209
>>106332207
I think you'd fit in better in /adg/
Anonymous
8/21/2025, 7:19:39 AM
No.106332239
>>106332380
>>106334883
comparison of Chroma to Pony V7, boomer prompting with absolutely no negative. Did these both on the Fictional.ai website, so Chroma version is unknown, and sampler / scheduler / etc is unknown. Same seed though, 5431.
Prompt (which was generated by Pony V7's prompt enhancer):
`A cute, happy, anthropomorphic, bipedal waffle man stands holding a bottle of maple syrup. The waffle man has a light golden-brown, textured body with visible square indentations across his entire form. His arms and legs are made of the same waffle material. He has large, round, dark brown eyes with small white highlights, a wide, open-mouthed smile revealing no teeth, and rosy cheeks. He holds a clear glass bottle with a dark brown liquid inside, labeled "Maple Syrup" in white text, in his right hand. The bottle is positioned at his side. The background is a plain, light blue wall. Full Shot. Digital 3D render with soft shading and hyper-realistic details. Warm and Cool color scheme. Spotlight directly above the character, casting soft highlights on the waffle texture and subtle rim lighting around the edges. Subsurface scattering on the waffle material for a slightly translucent effect. Chromatic aberration and slight film grain for a cinematic touch.`
Anonymous
8/21/2025, 7:22:36 AM
No.106332255
>>106332448
>>106332070
>what are you prompting when trying to get them to remove their shirt?
was literally one liners like 'three women standing in a boat take off their shirts revealing their naked breasts"
it was just more reliable when there's just one person in the starting image
>>106332125
much appreciated anon thanks! i didnt know you can chain them im on to downloading a bunch of wan2.1 lora from civit now
Anonymous
8/21/2025, 7:23:50 AM
No.106332265
>>106331158
>the guy who made the cakeful latinas
Who do you think you replied to kek
>pedoshit
I'm the self-insert type of /ss/ enthusiast which I believe is more common than the gay pedophile type because what's the point of the hag being there if you're a gay pedo
You have to remember that "the medium is the message". Remember this and goyim will think you have a gift for making impactful art. What's the point of generating a big tits Latina solo? Moreover, what's the point of sharing one? I believe that if you're not using AI to make things that couldn't exist otherwise you're wasting the uniqueness of the medium
For example, I have done solo brown women, but they've been doing things that also don't exist in real life like shitting on a flag of Israel. If you can think of solo brown women things that would fit on a blue board, keeping in mind that the medium is the message I would definitely be interested in generating them (people not doing anything interesting with solo gens is how we arrived at 1girl standing doing nothing general)
Anonymous
8/21/2025, 7:25:59 AM
No.106332279
>>106332423
>>106332098
I was trying it a bit on TensorArt. Dim 16 (which is in Kohya scale there) gives ~290 MB or so file, so a bit bigger than SDXL. Running no text encoder learning rate obviously, with 0.0005 model learning rate, and using Cosine With Restarts / 3 restarts, with 2 gradient accumulation steps, gave really good results in between 10 to 30 epochs for a couple of datasets I tried. (With repeats at 1, I never use multiple repeats, since according to Kohya a long time ago their actual purpose is not at all how people use them or seem to think they work).
Anonymous
8/21/2025, 7:27:07 AM
No.106332287
>>106332307
>>106332454
>>106332070
>The 2.2 LoRAs are a waste of time.
The 2.2 lora should be used with the 2.1 Lora at different ratios to control the amount of movement you get out of a gen/prompt
I have been happy with a 0.4/0.6 split
Anonymous
8/21/2025, 7:30:20 AM
No.106332307
>>106332287
What does this accomplish vs using only 2.1?
Anonymous
8/21/2025, 7:44:03 AM
No.106332375
>>106332412
>>106332424
Is it possible to add an element of randomness to the prompt? Like
>[red hair|blue hair|green hair]
To create a sort of 33% chance for any of the hair colors within the brackets?
Anonymous
8/21/2025, 7:44:33 AM
No.106332380
>>106332403
>>106332407
>>106332239
>Pony V7
Is that abomination ever leaving its (((beta test)))?
Anonymous
8/21/2025, 7:45:12 AM
No.106332383
>>106332433
Anonymous
8/21/2025, 7:47:43 AM
No.106332403
>>106332407
>>106332380
yeah weights coming out in september according to the discord
Anonymous
8/21/2025, 7:48:44 AM
No.106332407
>>106332403
>>106332380
Two more weeks then...
Anonymous
8/21/2025, 7:49:16 AM
No.106332412
>>106332375
yes that is called a wild card. i use a1111 so i have a text file with a bunch of words then i tell it to pick one of those words from the text file, not sure how other ui do it
Anonymous
8/21/2025, 7:50:50 AM
No.106332423
>>106332519
>>106332279
Thanks, how big were your datasets and did you use Qwen VLM for captioning?
Anonymous
8/21/2025, 7:50:53 AM
No.106332424
Anonymous
8/21/2025, 7:52:13 AM
No.106332433
>>106332444
>>106332505
>>106332383
I felt inspired and tried to make something similar, but an orange photobombed my gen
Anonymous
8/21/2025, 7:53:58 AM
No.106332444
>>106332433
i wish i was that orange
Anonymous
8/21/2025, 7:54:47 AM
No.106332448
>>106332255
>take off their shirts revealing their naked breasts"
It won't like that.
>Peel off their shirt revealing their bare chests.
>Remove their clothing, revealing their breasts
etc.
Anonymous
8/21/2025, 7:55:26 AM
No.106332451
wan2gp chads, were eating good tonight.
Anonymous
8/21/2025, 7:55:47 AM
No.106332454
>>106332287
>The 2.2 lora should be used with the 2.1 Lora at different ratios to control the amount of movement you get out of a gen/prompt
DO. NOT. USE. 2.2. LORA
Anonymous
8/21/2025, 7:56:51 AM
No.106332460
Anonymous
8/21/2025, 8:02:36 AM
No.106332505
>>106332524
Anonymous
8/21/2025, 8:04:13 AM
No.106332519
>>106332801
>>106332423
one was 32 images
that other was 105 images
they weren't captioned with Qwen, I used a custom jailbroken Gemini 2.5 Pro setup I have that outputs natural language captions in a particular way.
Anonymous
8/21/2025, 8:05:00 AM
No.106332524
>>106332505
Pretty!
Expected her to nom on the orange, but still really nice.
Anonymous
8/21/2025, 8:06:45 AM
No.106332534
>>106332553
>>106332555
What does this mean?
Prompt outputs failed validation:
WanVideoModelLoader:
- Value not in list: model: 'Wan2_2-I2V-A14B-HIGH_fp8_e4m3fn_scaled_KJ.safetensors' not in []
LoadImage:
- Custom validation failed for node: image - Invalid image file: 1.png
WanVideoModelLoader:
- Value not in list: model: 'Wan2_2-I2V-A14B-LOW_fp8_e4m3fn_scaled_KJ.safetensors' not in []
Which version of Illustrious do you guys use?
For a while I was using Hassaku (go ahead and laugh) but now I'm curious to know if there's an objectively "best" one that I should upgrade to.
Anonymous
8/21/2025, 8:08:59 AM
No.106332550
Anyone have a good workflow recommendation for a 4090?
Anonymous
8/21/2025, 8:09:27 AM
No.106332553
>>106332573
>>106332534
it didn't find the model files. if you downloaded them and put them in the right folders press R so the node can scan the folder
Anonymous
8/21/2025, 8:09:45 AM
No.106332555
>>106332573
>>106332534
>Value not in list
This thing wasn't in the list. What list?
>model
The models folder
>What model?
Wan2_2-I2V-A14B-Wan2_2-I2V-A14B-HIGH_fp8_e4m3fn_scaled_KJ.safetensors'
Anonymous
8/21/2025, 8:10:05 AM
No.106332558
>>106332545
i love boobies
Anonymous
8/21/2025, 8:11:16 AM
No.106332565
Anonymous
8/21/2025, 8:13:04 AM
No.106332573
>>106332579
>>106332553
I tried that but that also didn't work
>>106332555
I figured it'd be something like that, but as far as I can tell this is correct
Anonymous
8/21/2025, 8:13:42 AM
No.106332578
Anonymous
8/21/2025, 8:13:53 AM
No.106332579
>>106332580
>>106332573
diffusion_models folder
Anonymous
8/21/2025, 8:14:22 AM
No.106332580
>>106332579
Oh... I'm dumb, thanks
Anonymous
8/21/2025, 8:16:19 AM
No.106332592
>>106332545
I've been enjoying diving flat anime checkpoint that some anon recommended.
Anonymous
8/21/2025, 8:16:25 AM
No.106332593
Anonymous
8/21/2025, 8:21:35 AM
No.106332613
>>106332751
>>106332777
>>106332545
>(go ahead and laugh)
I use hassaku regularly, what's funny about it?
Anonymous
8/21/2025, 8:23:46 AM
No.106332622
Any autistic mumblings from Lode recently?
Anonymous
8/21/2025, 8:24:25 AM
No.106332628
>>106332739
>>106331379
ever tried to use the interface on an iphone?
Anonymous
8/21/2025, 8:27:30 AM
No.106332649
>>106332545
I did a simple 50/50 mix of
https://civitai.com/models/1313975/291-sih-quantum-merge and
https://civitai.com/models/1330192?modelVersionId=1501841 (V1 one) like 5 months ago and haven't even touched any other checkpoint since then. Somehow it doesn't have any heavy biases towards any particular style, so any loras that I tried work flawlessly with it
Anonymous
8/21/2025, 8:31:13 AM
No.106332663
>>106332682
>>106331618
I can't believe people use wan while they can train their own video model.
Anonymous
8/21/2025, 8:31:26 AM
No.106332664
>>106332699
I did a fresh install of comfy with Wan2.2, it now almost immediately crashes when trying to run the image to video test
[ComfyUI-Manager] default cache updated:
https://api.comfy.org/nodes
FETCH DATA from:
https://raw.githubusercontent.com/ltdrdata/ComfyUI-Manager/main/custom-node-list.json [DONE]
[ComfyUI-Manager] All startup tasks have been completed.
got prompt
T5Encoder: 100%|| 24/24 [00:00<00:00, 96.82it/s]
T5Encoder: 100%|| 24/24 [00:00<00:00, 606.03it/s]
CUDA Compute Capability: 8.9
Detected model in_channels: 36
Model cross attention type: t2v, num_heads: 40, num_layers: 40
Model variant detected: 14B
model_type FLOW
Using accelerate to load and assign model weights to device...
Loading transformer parameters to cpu: 100%|| 1095/1095 [00:00<00:00, 14178.91it/s]
Moving diffusion model from cuda:0 to cpu
Loading LoRA: lightx2v_I2V_14B_480p_cfg_step_distill_rank64_bf16 with strength: 3.0
Using 1053 LoRA weight patches for WanVideo model
sigmas: tensor([1.0000, 0.9756, 0.9412, 0.8889, 0.8000, 0.6153, 0.0000])
Sampling until step 3, timestep: 941
timesteps: tensor([999, 975, 941], device='cuda:0')
image_cond shape: torch.Size([20, 21, 104, 60])
Swapping 30 transformer blocks
Initializing block swap: 100%|| 40/40 [00:00<00:00, 40.23it/s]
----------------------
Block swap memory summary:
Transformer blocks on cpu: 10056.94MB
Transformer blocks on cuda:0: 3352.31MB
Total memory used by transformer blocks: 13409.26MB
Non-blocking memory transfer: False
----------------------
Input sequence length: 32760
Sampling 81 frames at 480x832 with 6 steps
0%| | 0/3 [00:00, ?it/s]
E:\ComfyUI\ComfyUI_wan>pause
Press any key to continue . . .
Anonymous
8/21/2025, 8:31:49 AM
No.106332666
>>106332545
cyberfalafel
CatTowernoob
NTR
some time WaiNSFW
Anonymous
8/21/2025, 8:33:39 AM
No.106332682
>>106332717
>>106332663
false equivalency.
You're begging people to fork out currency to fund model you have no idea works as advertised or not. And when pointed out that you can train your own LoRA of the subject matter you equate it to being the same as training an entire model.
Anonymous
8/21/2025, 8:33:56 AM
No.106332685
Anonymous
8/21/2025, 8:35:28 AM
No.106332699
>>106332702
>>106332664
sageattention mismatch with your hardware?
Anonymous
8/21/2025, 8:36:21 AM
No.106332702
>>106332699
It was working yesterday (kinda) on a different install, same hardware of course
qwen image edit is so fucking good wtf
Anonymous
8/21/2025, 8:38:35 AM
No.106332717
>>106332730
>>106332682
It's really not, some people don't have the compute, the dataset or the knowledge to train their own video lora.
Just like people don't have the compute, the dataset or the knowledge to train their own video model.
And I'm not the one who was begging, I'm another anon, but I want people to share their lora.
>Took 2 hours to make shit work the same as reforged hires fix with face adetailer.
May a thousand flees infest the couch of the person who came up with nodes UI. I understand the need for modularity, but this method is not repeatable or reteachable for newfags unless you're austically inclined.
Anonymous
8/21/2025, 8:40:30 AM
No.106332730
>>106332717
Dataset and knowledge can be fixed in the space of a few hours.
The hardware is a legitimate argument, but if you don't have enough to train at least a LoRA for the model you use, is it really worth using the model?
ps. I know you'll reply yes ty rhetorical question, but the answer is no. It's not worth it.
Anonymous
8/21/2025, 8:41:32 AM
No.106332738
>>106334350
>>106332707
Yeah it's really good. Still turns nipples into candies though.
Anonymous
8/21/2025, 8:42:27 AM
No.106332739
>>106332628
>double digit IQ
exhibit A
Anonymous
8/21/2025, 8:43:45 AM
No.106332747
>>106332720
Are there any good non-node based alternatives?
Anonymous
8/21/2025, 8:44:10 AM
No.106332751
>>106332777
>>106332800
>>106332613
To me, there's nothing wrong with it at all. But the last time I mentioned it here, I got bullied.
Anonymous
8/21/2025, 8:47:07 AM
No.106332767
>>106332720
While setting up a workflow is a pain I love that I can easily save a bunch of workflows and have them just work. When doing shit in forge yes I can load the info from an old image but I still have to do each step in the GUI but with comfy you can have a workflow that does every step automatically.
Anonymous
8/21/2025, 8:47:28 AM
No.106332770
>>106332720
>second pass with a detailer
i have it down to maybe 70 seconds tops
Anonymous
8/21/2025, 8:47:40 AM
No.106332772
>>106332707
can you post an example?
Anonymous
8/21/2025, 8:48:41 AM
No.106332777
>>106332797
>>106332800
>>106332545
>>106332613
>>106332751
It's so widely used that its style isn't interesting to me anymore. Noob is the only tune of Illustrious that matters desu.
Anonymous
8/21/2025, 8:48:49 AM
No.106332779
Anonymous
8/21/2025, 8:51:16 AM
No.106332797
>>106332815
>>106332777
>It's so widely used that its style isn't interesting to me anymore.
It's more accurate to say the checkpoint is too opinionated.
>>106332751
Bullied bullied or just took shitposting too seriously?
People here shit on all "ancient VRAMlet cope" models like SDXL (part jokingly, part seriously) but if you enjoy the output you shouldn't have too many reasons to care either way.
>>106332777
Just prompt some artist names or use a style lora.
Anonymous
8/21/2025, 8:51:40 AM
No.106332801
Anonymous
8/21/2025, 8:54:04 AM
No.106332815
>>106332992
>>106332800
>Just prompt some artist names or use a style lora.
See
>>106332797. Even with artists, it's own style is too overt. It shouldn't be noticeable at all.
Anonymous
8/21/2025, 8:55:00 AM
No.106332820
>>106332918
>>106332707
it has a lot of potential, far better than flux kontext.
Anonymous
8/21/2025, 8:55:13 AM
No.106332823
>>106332800
>part seriously
SDXL, particularly Illustrious/Noobai are still the best option for anime gens, both sfw and nsfw. Chroma can only go so far and PonyV7 is DoA (if it ever releases even). Maybe Qwen will have some great finetunes in the future but we're talking months ahead
Anonymous
8/21/2025, 8:59:26 AM
No.106332844
Is swarm more user friendly than comfy?
Anonymous
8/21/2025, 9:07:06 AM
No.106332893
>>106332912
Are there any benchmarks comparing DDR4 vs DDR5 RAM offloading performance?
Anonymous
8/21/2025, 9:09:35 AM
No.106332912
>>106332939
>>106332893
If you already ddr4 there is basically no value in upgrading to ddr5 for the speed boost. It's negligible.. The only thing that matters is GPU
Anonymous
8/21/2025, 9:10:53 AM
No.106332918
>>106332820
The model misspelled BWC
Anonymous
8/21/2025, 9:12:07 AM
No.106332929
Is there a good online lora trainer?
>>106332912
that seems unlikely since memory bandwidth is normally the major bottleneck. I would expect a straight 2x speed increase.
Anonymous
8/21/2025, 9:19:38 AM
No.106332983
>>106332939
What do expect with python?
Anonymous
8/21/2025, 9:20:43 AM
No.106332992
>>106333010
>>106332815
I am not doubting that. It can't do some 3d or western styles.
I still kinda like it, but I guess I respect it if that's a deal breaker.
Do you know any similar Illustrious finetunes that can play along with any style?
>>106332939
That depends entirely on how much you are offloading.
But I would recommend not planning to rely on any offloading for diffusion. Leaves some optimizations like sage out of the table.
Anonymous
8/21/2025, 9:21:59 AM
No.106333001
>>106332939
>memory bandwidth is normally the major bottleneck
yes, the memory inside your gpu, not ram
Anonymous
8/21/2025, 9:23:20 AM
No.106333010
>>106333033
>>106332992
>But I would recommend not planning to rely on any offloading for diffusion.
Don't most people offload for video itt?
Anonymous
8/21/2025, 9:25:17 AM
No.106333023
I finally figured out how kohya deep shrink works.
The downscale factor should be the exact multiple that how much empty latent resolution is higher than the model target.
I feel like an imbecile for being unable to figure this out the last time I tried it.
Anonymous
8/21/2025, 9:27:25 AM
No.106333033
>>106333060
>>106333010
Vramlet here although not super big on video. Nope.
GGUF quants let you partially cycle through different chunks of the model inside each step.
Slowing it down a lot of course, but still faster than offloading large amount of data to system ram in most cases.
Also you can use sage this way.
Anonymous
8/21/2025, 9:29:05 AM
No.106333042
>>106333072
>>106331091 (OP)
>government proceeds to make it mandatory for all GPUs to not allow anything but official censored models
Local models aren't safe either.
The only solution is to disempower government and public lobbies, period.
Otherwise, your government will kill itself and you along with it with this crap soon.
Anonymous
8/21/2025, 9:30:47 AM
No.106333054
>>106333099
Governments are bad enough without the need to mentally masturbate to stuff that won't happen.
Anonymous
8/21/2025, 9:31:56 AM
No.106333060
>>106333095
>>106333033
Everything I've seen posted here indicates that even Q8 is substantially worse than the full model for video, in contrast to imagegen where you can go down to Q5-Q6 and get reasonably consistent results
Anonymous
8/21/2025, 9:33:16 AM
No.106333072
>>106333131
>>106333042
>government proceeds to make it mandatory for all GPUs to not allow anything but official censored models
I don't think that's super likely in the near future. Sure they don't care about us, but large corporations have their own private models that they are not super keen on sending info about to Nvidia/Amd/whatever's databases so that it can be whitelisted in the driver.
They would lobby against such, actually.
Anonymous
8/21/2025, 9:36:25 AM
No.106333095
>>106333060
Depending on how much VRAM you have and how much you are offloading you are looking at possibly HOURS per single video when run under fp16.
You have been warned.
Also I believe you can do fp16 in gguf format, actually. I am not sure if cycling through works when run that way, but I don't see why it wouldn't.
Anonymous
8/21/2025, 9:36:29 AM
No.106333097
>>106332800
They posted a snickering pepe at me
I never fully recovered
Anonymous
8/21/2025, 9:36:37 AM
No.106333099
>>106333140
>>106333054
>imagine is not necessary for good governance
This is written by a true mouthbreather, I see.
>nothing but bland generic bureaucracy everywhere!
Ironically, degeneracy is what sparks ideas.
Anonymous
8/21/2025, 9:40:59 AM
No.106333131
>>106333072
>but large corporations have their own private models
Which they cannot fix because nobody knows how to use them on account of preventing people outside of government or their firms from using them. No general public use or know-how. Smaller supply of workers capable of fixing/using shit.
This is also why monopolies destroy countries. Nvidia/Intel/Amd monopoly was a mistake. So is government.
Anonymous
8/21/2025, 9:42:00 AM
No.106333140
>>106333339
>>106333099
>imagine
imagination*
derp
Time for coffee.
Anonymous
8/21/2025, 9:51:19 AM
No.106333215
What is the expected video generation time for something at 5 seconds long, on a 4090?
Anonymous
8/21/2025, 9:51:37 AM
No.106333218
>>106332939
holy delusion.
Anonymous
8/21/2025, 9:53:07 AM
No.106333229
>>106333483
Is there a diffusion for VR?
Anonymous
8/21/2025, 10:01:31 AM
No.106333300
>>106332939
Until the code becomes bloated so much that it doesn't have a 2x speed increase at all.
AI is becoming just bloated conventional code at this rate btw, this will soon be an issue when the "neural network" becomes less applicable to these things
A lobotomised AI is just a normal boring old program with none of the optimization or efficiency.
Anonymous
8/21/2025, 10:02:41 AM
No.106333312
Then again, python was always bloat.
ポストカード
!!FH+LSJVkIY9
8/21/2025, 10:05:49 AM
No.106333339
>>106333352
>>106333457
>>106334086
>>106331091 (OP)
well, i'll just come out and say it...
>GM! B-B0T STATUS??!
the collage is so bad
i just cant even bear
bear, to look at it..
:c
>>106333140
>coffee up
a l w a y s
ポストカード
!!FH+LSJVkIY9
8/21/2025, 10:06:53 AM
No.106333352
>>106333457
>>106334086
>>106333339
i did try for digits <\3
Anonymous
8/21/2025, 10:11:57 AM
No.106333397
>>106333453
>qwen edit input : 604x900
>output : 600x896
Should qwen input be at least divisible by 8?
Anonymous
8/21/2025, 10:20:40 AM
No.106333448
>>106333464
Anonymous
8/21/2025, 10:21:32 AM
No.106333453
>>106333545
>>106333397
Not sure about Qwen but most nodes will generally just cut the excess pixels. (Maybe some resize I dunno.)
Also I think it is divisible with 16 rather than 8 for these models.
Anonymous
8/21/2025, 10:21:47 AM
No.106333457
>>106333791
Anonymous
8/21/2025, 10:22:19 AM
No.106333464
>>106333575
>>106333448
>local open source models is better than ??? source online models
Anonymous
8/21/2025, 10:23:58 AM
No.106333483
>>106333487
>>106333229
360 loras, hunyuan world
Anonymous
8/21/2025, 10:24:22 AM
No.106333487
>>106333483
>hunyuan world
>slop world.
Anonymous
8/21/2025, 10:30:10 AM
No.106333545
Anonymous
8/21/2025, 10:32:44 AM
No.106333569
>>106333580
Hi guys, yesterday I finally installed WAN 2.2 on a rendering farm that you rent by hours, and obviously I would like to make the most of it.
Would you recommend to me some state of the art models and worflows to create good adult videos? I have seen amazing results over here and would like to become a master like you.
Thank you and regards
PS: Forgot to say I am looking for photorealistic results.
Anonymous
8/21/2025, 10:33:20 AM
No.106333575
>>106333464
posted the wrong gen.
Anonymous
8/21/2025, 10:34:07 AM
No.106333580
Anonymous
8/21/2025, 10:37:25 AM
No.106333603
>>106331507
A-bu-the, a-bu-the, a-bu-the, that's all folks!
Anonymous
8/21/2025, 10:45:27 AM
No.106333679
>>106333724
NoobAI or Flux?
Anonymous
8/21/2025, 10:46:01 AM
No.106333689
Peepee or poopoo?
Anonymous
8/21/2025, 10:47:53 AM
No.106333702
>>106333753
Seed or Feed?
Anonymous
8/21/2025, 10:50:24 AM
No.106333724
>>106333679
NoobAi. Great effort by the guy who made it.
Anonymous
8/21/2025, 10:51:49 AM
No.106333736
>use qwen edit on huggingface
>prompt add a naked woman in the scene
>generates woman in bikini
ok
Anonymous
8/21/2025, 10:52:31 AM
No.106333747
>>106333769
>>106333830
why the hell are my images output always zoomed in with qwen edit? and it's not just landscape, it can be characters having their head cut off, and so on
see picrel, left is source, and right is zoomed in while all I asked was to change it to winter time
ポストカード
!!FH+LSJVkIY9
8/21/2025, 10:53:40 AM
No.106333753
>>106333702
s n e e d
e
e
t
h
e
Anonymous
8/21/2025, 10:55:13 AM
No.106333769
>>106334011
>>106333747
I haven't touched qwen yet, but with Kontext I usually add "keep perspective the same" to the prompt
ポストカード
!!FH+LSJVkIY9
8/21/2025, 10:58:40 AM
No.106333791
>>106333843
>>106333843
Anonymous
8/21/2025, 10:59:59 AM
No.106333800
>>106334788
can I delete Flux Kontext now that Qwen Image Edit is out? any reason to keep it around?
Anonymous
8/21/2025, 11:04:57 AM
No.106333830
>>106334011
>>106333747
is this in comfy? would need to see the node setup, the workflows ive seen were resizing and cropping the source
Anonymous
8/21/2025, 11:07:04 AM
No.106333843
>>106333897
Anonymous
8/21/2025, 11:09:17 AM
No.106333858
>>106334788
how can I use qwen edit for inpainting?
ポストカード
!!FH+LSJVkIY9
8/21/2025, 11:09:56 AM
No.106333867
ポストカード
!!FH+LSJVkIY9
8/21/2025, 11:13:38 AM
No.106333897
>>106333968
>>106333843
>>106333882
see ya space cowboy, someway somehow
From Lodestone in discord:
***Just want to share the news
I've just created the first-ever large-scale pixel space model 9.5B (Chroma1-Radiance) with the same computational cost of a latent space model (Chroma1-HD).
This is huge! we 're no longer bounded by the VAE, the model is an end to end model now.
the model itself is still training for like 1/5th epoch (less than 1 epoch) and already converging fast!
P.S thanks <@1190003199696969790> for testing it
P.P.S credit to the paper author of this method
https://arxiv.org/abs/2507.23268***
What did he mean by this?
Anonymous
8/21/2025, 11:18:50 AM
No.106333935
>>106333921
It means you should go back to your cult discord and stop spamming us with his pseudo scientific bullshit.
Anonymous
8/21/2025, 11:19:40 AM
No.106333940
>>106333957
>>106333921
From my understanding, computing in latent space saves an order of magnitude in time. So I don't see how this will really work.
The model will probably be better but training it like that now is a bit late, not sure how much benefit would it get, and even with fast loras or things like that, generation will still take a few minutes on top hardware, no?
ポストカード
!!FH+LSJVkIY9
8/21/2025, 11:20:56 AM
No.106333947
>>106333999
>>106331091 (OP)
smell ya later ;3
Anonymous
8/21/2025, 11:21:56 AM
No.106333954
>>106333965
>>106333998
>>106331091 (OP)
Genuinely asking.
What you people do in this general and what is the use-case of AI image generation except for generating porn and slop? Yeah, it's entertaining to easily and quickly create anything you want if you know how to proompt, but it will always be just a mediocre incomplete soulless slop.
Anonymous
8/21/2025, 11:22:44 AM
No.106333957
>>106333940
>with the same computational cost of a latent space model
it's magic
Anonymous
8/21/2025, 11:23:26 AM
No.106333962
>>106331091 (OP)
That physics demo with miku and the crossover is impressive.
Anonymous
8/21/2025, 11:23:36 AM
No.106333965
>>106333954
>easily and quickly create anything you want
>it will always be just a mediocre incomplete soulless slop
Uhhh... saar...
Anonymous
8/21/2025, 11:23:50 AM
No.106333967
>>106333921
Interesting stuff for sure. I just hope he finishes the current Chroma model no matter what new tech appears.
Anonymous
8/21/2025, 11:23:53 AM
No.106333968
>>106333999
>>106333897
>That last one with black hair wearing an orange bikini
Anonymous
8/21/2025, 11:26:52 AM
No.106333989
Lodestone is /ourguy/ and anyone not supporting him is a copro shiller.
Anonymous
8/21/2025, 11:28:17 AM
No.106333998
>>106333954
I had to build another pc just to keep up with the demand for assets.
Anonymous
8/21/2025, 11:28:19 AM
No.106333999
>>106334047
Anonymous
8/21/2025, 11:28:37 AM
No.106334001
am I dumb or is there no independent seed node WITH the "control after generate" setting?
>>106333769
no, it does that for literally every image
I tried adding your sentence but it doesn't help, it literally looks like it works with a zoomed in version of the input and I don't know why
picrel is another example
>>106333830
it's the official comfy one slightly modified
it resizes and crop but I've added a preview of it resized and that's the thing I post
see the json below :
https://files.catbox.moe/4d48r7.json
Anonymous
8/21/2025, 11:32:06 AM
No.106334022
>>106334099
>>106334011
I wonder if it requires something similar to controlnet tile to keep the composition
Anonymous
8/21/2025, 11:35:36 AM
No.106334047
>>106333999
digits witnessed you cheeky cunt
Anonymous
8/21/2025, 11:36:16 AM
No.106334052
>>106334063
>>106334068
>Setting qwen_image_edit_bf16.safetensors weight dtype to torch.bfloat16
>model weight dtype torch.bfloat16, manual cast: None
>model_type FLUX
Flux?
Anonymous
8/21/2025, 11:37:07 AM
No.106334063
>>106334052
>delete system 32folder
Anonymous
8/21/2025, 11:37:30 AM
No.106334068
>>106334052
he knows too much
Anonymous
8/21/2025, 11:38:25 AM
No.106334081
>>106334358
Anonymous
8/21/2025, 11:39:01 AM
No.106334086
>>106334107
>>106333339
>>106333352
Finally. You came back.
The thread almost died.
After testing, it seems to me like Qwen Image Edit is to Kontext like what Qwen Image is to other T2I models.
It's a huge amount better for more complex prompt following, but it has slight tunnel vision on what it wants to do and in what way, and the main problem: it's only good for destructive edits.
For a lot of use cases where you want the image to stay mostly the same aside from the slight VAE quality loss, it looks like it's quite noticably worse.
For example, no matter what, I can't reproduce anything similar like in picrel that was possible with Kontext, none of the below prompts work, and they also make her hair brown and change her eyebrows completely by default
"elongate her hair"
"make her hair longer"
"make her hair long without changing anything else in the image"
"change her hair to be very long while keeping the rest of the image as is, keep her eyebrows the same"
It's fun to play around with given the much, much better complex prompt following capabilities but it's not good enough for iterative work over existing images that should keep most of their look. Granted, Kontext isn't that great either given the censoring and limited things it can change, but when it works, it works very well.
Anonymous
8/21/2025, 11:40:03 AM
No.106334099
>>106334022
I don't see others having the same problem, so not sure
Anonymous
8/21/2025, 11:41:09 AM
No.106334107
>>106334086
he's leaving again he is "becoming a normie"
Anonymous
8/21/2025, 11:42:57 AM
No.106334125
>>106334161
>>106334116
requestin pastel rainbow platinum blonde hair ;3
Anonymous
8/21/2025, 11:44:34 AM
No.106334135
Using the comfy ui kontext work flow
I'm trying to get something from the second image on the first
But it refuses to do anything but just put them side by side as one picture
How do I work this?
Anonymous
8/21/2025, 11:51:04 AM
No.106334183
>>106334199
>>106334208
>>106334098
>>106334116
I dunno about anyone else, but I can't stand that doorknocker in her nose.
Anonymous
8/21/2025, 11:51:26 AM
No.106334184
Anonymous
8/21/2025, 11:52:21 AM
No.106334191
>>106334161 (me)
Hair came out OK there and without many changes to her face now. I guess it's a hit or miss depending on the prompt then.
The problem is, unlike Kontext, you can't just reroll Qwen, given the lack of seed variety.
Anonymous
8/21/2025, 11:53:10 AM
No.106334199
>>106334183
i wouldn't ask her to get one
but its an acquired taste
i like industrial piercings
when you kiss you bump into it, cute CUTE! ;3
Anonymous
8/21/2025, 11:54:17 AM
No.106334208
>>106334219
>>106334183
me and you both anon, I never understood the appeal, it looks like she's having metal boogers
Anonymous
8/21/2025, 11:56:07 AM
No.106334219
>>106334208
they make ornate gold ones etc
dont knock it till ya suck on\ lick one ;3
Anonymous
8/21/2025, 11:56:15 AM
No.106334221
>>106334235
>>106334260
>>106334011
>https://files.catbox.moe/4d48r7.json
ok yeah i see its cropping the sides a bit here
i think its something with the aspect ratio of the actual latent image, i switch to square and everything is aligned perfectly, the more non-uniform the aspect ratio the more the "long" side of the image is mis-aligned
Anonymous
8/21/2025, 11:57:10 AM
No.106334235
>>106334259
>>106334387
>>106334011
>>106334221
another more obvious example
>Change her dress to be red. Rest is same.
Anonymous
8/21/2025, 12:00:43 PM
No.106334259
>>106334323
>>106334235
What if you start with: a full body photo of a woman
Anonymous
8/21/2025, 12:00:43 PM
No.106334260
>>106334295
>>106334221
>i think its something with the aspect ratio of the actual latent image, i switch to square and everything is aligned perfectly, the more non-uniform the aspect ratio the more the "long" side of the image is mis-aligned
ok but why would
>>106334098 &
>>106334116
work perfectly fine?
they are in portrait modes and they don't have that weird zoomed in property
Anonymous
8/21/2025, 12:06:29 PM
No.106334295
>>106334323
>>106334260
The first linked image is kontext and it was using a workflow that was fixed later so it doesnt matter, kontext can do it without zooming in now.
But the second linked image does actually zoom in a little
Anonymous
8/21/2025, 12:10:58 PM
No.106334323
>>106334387
>>106334259
no change, I don't believe it has anything to do with prompting, anon
>>106334295
oh yeah I can see it, but it's quite small
Anonymous
8/21/2025, 12:11:03 PM
No.106334325
>>106334329
>mfw he isn't a giraffemaxxer
Anonymous
8/21/2025, 12:12:04 PM
No.106334329
>>106334325
neck too fat
not long enough
comfy really fucked the memory management huh
Anonymous
8/21/2025, 12:17:22 PM
No.106334350
>>106334363
>>106332738
just give it a second image with the tits you want
Anonymous
8/21/2025, 12:17:25 PM
No.106334351
>>106334373
>>106334341
Using 1.5 with 2 loras?
Anonymous
8/21/2025, 12:17:26 PM
No.106334352
>>106334341
its the anti-comfy guy again!
HE SIMPLY WILL NOT GET COMFY!
IN ANYWAY WHATSOEVER HE CANT GET CONFORTABLE GUISE!!!!
Anonymous
8/21/2025, 12:18:09 PM
No.106334358
>>106334081
looks pretty neat. is there comfy support?
Anonymous
8/21/2025, 12:18:45 PM
No.106334363
>>106334395
>>106334350
>second image
?
Anonymous
8/21/2025, 12:20:01 PM
No.106334373
>>106334351
Had the same problem with wan 2.2 but now I'm just using QIE
Anonymous
8/21/2025, 12:21:19 PM
No.106334379
>>106334341
Had to set dynamic allocation for system pagefile to not crash while genning now, 72gb pagefile lmao.
Anonymous
8/21/2025, 12:22:46 PM
No.106334386
SAAAAAAAARS WHERE COMFYUI QWEN NUNCHAKU
Anonymous
8/21/2025, 12:23:12 PM
No.106334387
>>106334441
>>106334235 (You)
>>106334323 (You)
I think I get why it's zooming in, here is an example that works, need to test the change of dress color again
>Change her to wear a police uniform with a skirt, and no police cap.
Anonymous
8/21/2025, 12:24:05 PM
No.106334395
>>106334408
>>106334997
>>106334363
you stitch two images together as the input and then reference them in the prompt
Anonymous
8/21/2025, 12:26:47 PM
No.106334408
>>106334395
If that works that's pretty cool, and a great way to bypass any censored thing.
Anonymous
8/21/2025, 12:33:27 PM
No.106334441
>>106334459
>>106334387 (You)
ok I found out why, or at least a way to not get the problem: the size of any dimension needs to be >1024 (or maybe below), or at least it can't be too low, or it will zoom in in the latent for some reason
Anonymous
8/21/2025, 12:36:35 PM
No.106334459
>>106334466
>>106334441
i like it
as vramlets?
how fucked are we?
Anonymous
8/21/2025, 12:37:29 PM
No.106334466
>>106334476
>>106334459
dunno but works on my 3090
Anonymous
8/21/2025, 12:38:50 PM
No.106334476
>>106334499
>>106334466
how much vram though
i have a 3060 oc
Anonymous
8/21/2025, 12:42:29 PM
No.106334498
QIE Q8 takes 50s on 270w 3090 light lora 8 steps
Anonymous
8/21/2025, 12:42:33 PM
No.106334499
>>106334476
no matter the size it seems to always use 23GB for me, so try it to see I'd say
Anonymous
8/21/2025, 12:54:53 PM
No.106334566
>>106334737
whoever came up with the idea of using "sks woman" and shit to train loras should be shot
>comfyui tab being shown anywhere on the monitors steals a crap ton of gpu power if the vrma is already at near max usage
is there really no way to make the tab not take any resources if nothing is happening on it, including no videos playing even?
Anonymous
8/21/2025, 12:58:32 PM
No.106334580
Anonymous
8/21/2025, 1:01:20 PM
No.106334595
>>106334576
run it headless and access the ui from another computer
Anonymous
8/21/2025, 1:07:07 PM
No.106334640
>>106334657
can qwen edit nudify?
Anonymous
8/21/2025, 1:09:50 PM
No.106334657
>>106334640
It can undress, and generate some shit nipples on cartoony characters
Anonymous
8/21/2025, 1:10:41 PM
No.106334665
>>106334671
>>106331379
Because it needs 1000 configurations and shit that then breaks because X node isn't compatible with Y node. Its retarded, a time wasters and there is nothing you can't do in forge and easier.
What you tech autists don't understand is that people don't want to waste hours and hours to make things work, when easier options are available.
Anonymous
8/21/2025, 1:11:25 PM
No.106334671
>>106334678
>>106334665
>there is nothing you can't do in forge and easier
except the entirety of video generation as just 1 thing you cant do
Anonymous
8/21/2025, 1:12:37 PM
No.106334678
>>106334684
>>106334693
>>106334671
Video generation is shit and I have yet to see one video that doesn't look like crap, you are better off gen the frames individuals and animating yourself.
Anonymous
8/21/2025, 1:13:14 PM
No.106334684
>>106334695
>>106334678
lmaoing at vramlet cope
STATLER + (or) Waldorf & Company.
8/21/2025, 1:14:05 PM
No.106334693
>>106334678
have you tried opening your eyes?
BEAHAGHAHAHA
Anonymous
8/21/2025, 1:14:17 PM
No.106334695
>>106334684
You say that but then all of the gens here suck ass. You are indians posting slop
Anonymous
8/21/2025, 1:19:47 PM
No.106334737
Anonymous
8/21/2025, 1:23:22 PM
No.106334759
>>106334790
>>106320836
what a slut, these gens are awesome
Anonymous
8/21/2025, 1:23:36 PM
No.106334762
>>106334771
>>106334576
play around settings to make them less heavy
or launch a dedicated browser in software only mode without hardware acceleration
Anonymous
8/21/2025, 1:24:43 PM
No.106334771
>>106334762
>play around settings to make them less heavy
i did, its limited to 60fps, doesnt fix it, it is what it is
Anonymous
8/21/2025, 1:26:53 PM
No.106334788
>>106333800
to me it seems like it changes character details unprompted more often and so on, I'm not even sure qwen is actually better overall
>>106333858
what do you mean by that? it's meant to do edits and you can add objects or other stuff you might do with inpainting.
STATLER + (or) Waldorf & Company.
8/21/2025, 1:27:51 PM
No.106334790
>>106334859
>>106334759
beahaghahahah
why is no one talking about this
Anonymous
8/21/2025, 1:32:56 PM
No.106334830
>>106334816
it was already mentioned
there is nothing to test
people are testing QIE
Anonymous
8/21/2025, 1:33:13 PM
No.106334831
Anonymous
8/21/2025, 1:35:21 PM
No.106334845
>>106334971
>>106331091 (OP)
Who made that hatsune miku vs car video ? How ?
Anonymous
8/21/2025, 1:35:24 PM
No.106334847
>>106334816
It's not ready yet and it's too technical for most posters to test. It's neat though.
Anonymous
8/21/2025, 1:36:32 PM
No.106334859
>>106335056
Anonymous
8/21/2025, 1:38:55 PM
No.106334883
>>106334913
>>106334929
>>106332239
but pony is not going to be local
Anonymous
8/21/2025, 1:42:42 PM
No.106334913
>>106334883
that doesn't say anything about it not being local?
they're discussing what entity the nsfw saas model will be attached to with regards to google play nuking apps (smartphone user audience pretty much unrelated to the average ldg)
Anonymous
8/21/2025, 1:42:43 PM
No.106334915
qwen nunchaku comfyui bros... WHERE?????
Anonymous
8/21/2025, 1:43:05 PM
No.106334920
>>106334816
idk why you give this hack any of your time.
Anonymous
8/21/2025, 1:44:31 PM
No.106334929
>>106334883
holy fucking shit those avatars, imagine going in a discord with mentally ill faggots
Does Qwen keep the artstyle for weeb stuff? I keep seeing praises for Qwen but I don't really get why it would be praised if it can't even do basic weeb stuff like this
All exemples I've seen about keeping style were shit so far
Anonymous
8/21/2025, 1:46:05 PM
No.106334942
>>106334932
Qwen the image mode? Meh. It's smarter than flux out of the box but slopped.
Qwen edit? Really good.
Anonymous
8/21/2025, 1:47:05 PM
No.106334946
wan nunchaku comfyui bros... WHERE?????
Anonymous
8/21/2025, 1:47:13 PM
No.106334948
>>106335125
>>106334932
one of my qwen edit slops
Anonymous
8/21/2025, 1:49:37 PM
No.106334971
Anonymous
8/21/2025, 1:51:15 PM
No.106334977
Bakermen!
Anonymous
8/21/2025, 1:53:44 PM
No.106334997
>>106335023
>>106334395
Can you show a (sfw) example? I get only noise if I use a stitched image as input.
Anonymous
8/21/2025, 1:54:41 PM
No.106335006
>adamW
>Scheduler: constant
>Loss: L2
>Rank/alfa 64/64
>Lr 0.000001
>40 epochs
>Batch size 4
>Gradient check pointing
>Gradient accumulation: 4
>40images 1 repetition, flip augmentation
>Min SNR gamma:5
>Full bf16
>Noise offset 0.0357
>Pyramid noise iterations: 5
>Discount: 0.25
>Training illustriousxl
What do you think?
Anonymous
8/21/2025, 1:56:31 PM
No.106335023
>>106335045
>>106334997
that makes no sense, a stitched image is just like any image. you messed up bad somehow
Anonymous
8/21/2025, 1:56:51 PM
No.106335026
>>106335029
qwen nunchaku comfyui bros... WHERE????
Anonymous
8/21/2025, 1:57:42 PM
No.106335029
>>106335037
>>106335026
better to find a better job to not need copechaku anymore instead of waiting
Anonymous
8/21/2025, 1:58:54 PM
No.106335037
>>106335029
i dont have a job : (
Anonymous
8/21/2025, 1:59:41 PM
No.106335045
>>106335023
I think I did yeah, image was too big.
Not that it worked better, I'll retry with other prompts.
Statler/Waldorf
8/21/2025, 2:00:43 PM
No.106335056
>>106334859
looks like someone finally has a clear shot of our heads...
hopefully they dont miss!
BEAHAHAHGHFHAh
Anonymous
8/21/2025, 2:11:02 PM
No.106335125
>>106334948
>>106334932
Qwen edit yeah
What's the best workflow for this? I guess I will try it myself
ポストカード
!!FH+LSJVkIY9
8/21/2025, 2:14:29 PM
No.106335160
>>106335187
>>106335205
So I'm trying to train a chroma lora of my cousin for science, and while it does look like her, the prompt adherence becomes kind of shit. I was using 18 images for like 3K steps at rank 64 with adamw and a sigmoid timestep adjustment... does anyone have any recommendations on a change of settings?
Anonymous
8/21/2025, 2:15:57 PM
No.106335182
R anon said he would fuck off for good, go back to /adt/
Anonymous
8/21/2025, 2:16:17 PM
No.106335187
>>106335235
>>106335160
Here's the proper resolution of the image, although it did also get zoomed in a little since QIE workflow has that same problem kontext did early on.
Anonymous
8/21/2025, 2:16:52 PM
No.106335189
>>106335178
lower the strength
Anonymous
8/21/2025, 2:17:44 PM
No.106335205
>>106335219
>>106335160
Hey man, can you make a funny animation with this.
Anonymous
8/21/2025, 2:19:00 PM
No.106335216
>>106335250
>>106335178
Will you please share the Lora with us?
ポストカード
!!FH+LSJVkIY9
8/21/2025, 2:19:16 PM
No.106335219
>>106335244
>>106335205
i will try
the heatwave is still strong this week
tonight was my only day off ;_;
24-48 hours is my normal workflow these days
Anonymous
8/21/2025, 2:20:21 PM
No.106335234
thank god he's fucking off
ポストカード
!!FH+LSJVkIY9
8/21/2025, 2:20:22 PM
No.106335235
>>106335187
saved.
she looks quite a bit like my ex ;c
Anonymous
8/21/2025, 2:21:40 PM
No.106335244
>>106335219
>i will try
Thanks. I look forward to them.
Anonymous
8/21/2025, 2:22:03 PM
No.106335250
>>106335359
>>106335216
if someone wants to help me make a better one I'll drop it in a gofile for the gang
Anonymous
8/21/2025, 2:33:32 PM
No.106335349
>>106335178
>>106335178
Good settings for training a person on Chroma
Epochs: ~100
LR: 1e-4
Scheduler: constant
Optimizer: adamw
Rank / Alpha: 16 / 16
Batch size 1: for best quality, but it will be slower
I never use flip for people since everyone is assymetrical, it hurts likeness
Anonymous
8/21/2025, 2:35:22 PM
No.106335359
>>106335407
>>106335250
you would willingly sacrifice your female cousin?
>>106335359
It would just be for funny wholesome pictures, nobody would use it for NSFW!
Anonymous
8/21/2025, 2:42:02 PM
No.106335413
>>106335407
Upload it, surely nothing can go wrong.
Anonymous
8/21/2025, 2:43:38 PM
No.106335429
Statler/Waldorf
8/21/2025, 2:46:48 PM
No.106335453
>>106335407
ya show us your hot cousins panties!
beahgahahah
Anonymous
8/21/2025, 2:56:46 PM
No.106335542