Anonymous
8/20/2025, 5:01:46 PM
No.106324617
/ldg/ - Local Diffusion General
Anonymous
8/20/2025, 5:03:15 PM
No.106324635
>>106329061
Blessed thread of frenship
Anonymous
8/20/2025, 5:03:21 PM
No.106324637
Anonymous
8/20/2025, 5:03:58 PM
No.106324642
what node can I use to load fp8 scaled diffusion models ?
Anonymous
8/20/2025, 5:09:10 PM
No.106324689
>>106329061
dead thread
So adamW and constant is the way to go and everything else is gay and retarded?
Anonymous
8/20/2025, 5:21:31 PM
No.106324798
The wan/qwen enhancement system prompts are seriously the worst shit I have ever seen. They do jack shit to enhance the input at best and actively harm it at worst.
Dumb fucks.
Anonymous
8/20/2025, 5:22:37 PM
No.106324805
>>106324770
Don't know about adamw, but I use adafactor and it works well.
Anonymous
8/20/2025, 5:25:30 PM
No.106324821
>>106324770
no, of course prodigy schedulefree, prodigy, adafactor, came and others were also successfully used quite a lot, often after adamw didn't do all that well
and cosine instead of constant. and other choices.
Anonymous
8/20/2025, 5:27:13 PM
No.106324834
>>106324539
If it learns stupid things that aren't what you want, your dataset sucks.
Anonymous
8/20/2025, 5:29:10 PM
No.106324848
>>106324960
>>106324770
Anyone who says not to use AdamW or Adafactor are talking out of their ass. All the magic optimizers are ass and in my experience will burn your outputs. AdamW may sometimes be conservative but fucking hell it's not like an extra hour training is a big deal. And given they only talk about optimizers and learning rates and not about datasets, you can tell they're fucking morons.
Anonymous
8/20/2025, 5:29:31 PM
No.106324853
>>106324993
Anonymous
8/20/2025, 5:35:38 PM
No.106324896
My dataset is so kino it matters not what """optimizer""" I use.
Anonymous
8/20/2025, 5:43:17 PM
No.106324960
>>106325002
>>106324848
Came is very good
Anonymous
8/20/2025, 5:44:02 PM
No.106324964
Anonymous
8/20/2025, 5:47:31 PM
No.106324993
>>106325098
Anonymous
8/20/2025, 5:48:14 PM
No.106325002
>>106325046
>>106325575
>>106324960
CAME can burn out easier, I think AdamW is simply the most straightforward, beginner friendly optimizer and it doesn't even need to change from model to model, the same settings work everywhere.
Anonymous
8/20/2025, 5:49:01 PM
No.106325012
>>106325138
Anonymous
8/20/2025, 5:52:13 PM
No.106325046
>>106325002
Came + cosine + huber loss, always works for me. That being said you are right and most should stick to adamw
Anonymous
8/20/2025, 5:56:55 PM
No.106325088
>>106326037
comfy should be dragged out on the street and shot
Anonymous
8/20/2025, 5:58:29 PM
No.106325098
>>106324993
She just has a (very) meaty vagina.
Anonymous
8/20/2025, 6:03:28 PM
No.106325138
>>106325012
honestly though that was gonna whack him in the dick or something
Anonymous
8/20/2025, 6:14:13 PM
No.106325233
>>106325300
>>106328210
What's better for wan2.1 anyway, 14b lightx2v or 1.3b regular?
Anonymous
8/20/2025, 6:18:40 PM
No.106325275
>>106325399
>>106325486
anon talking about and pursuing photoreal is like incel talking about sex
Anonymous
8/20/2025, 6:21:08 PM
No.106325300
>>106325319
Anonymous
8/20/2025, 6:23:21 PM
No.106325319
>>106325300
the lightx2v version with 14b parameters
or
the regular alibaba version with 1.3b parameters
Anonymous
8/20/2025, 6:30:23 PM
No.106325399
Anonymous
8/20/2025, 6:38:56 PM
No.106325486
>>106325275
you should know
Anonymous
8/20/2025, 6:47:10 PM
No.106325575
>>106325595
>>106325002
What settings do you use?
Anonymous
8/20/2025, 6:49:11 PM
No.106325595
>>106325575
For Wan:
[optimizer]
type = 'AdamW8bit'
lr = 3e-5
betas = [0.9, 0.99]
weight_decay = 0.01
Anonymous
8/20/2025, 6:52:05 PM
No.106325624
I have been trying to make qwen work for anime characters but it is just worse than illustrious based models by far, it needs a full booru fine tune I suspect
Anonymous
8/20/2025, 6:54:57 PM
No.106325654
>>106326006
Anonymous
8/20/2025, 7:12:52 PM
No.106325823
>>106325854
>>106325870
probably for the best that I can't gen any quicker.
Anonymous
8/20/2025, 7:15:16 PM
No.106325854
>>106326126
>>106325823
>%RandomNoise.noise_seed%
Anonymous
8/20/2025, 7:16:50 PM
No.106325870
>>106326027
>>106325823
the only way this could get more autistic is if a train was somehow involved
Anonymous
8/20/2025, 7:31:11 PM
No.106326006
Anonymous
8/20/2025, 7:33:49 PM
No.106326027
>>106325870
I like trains
Anonymous
8/20/2025, 7:35:35 PM
No.106326037
Anonymous
8/20/2025, 7:46:17 PM
No.106326126
>>106325854
this happened to me when i wrangled my rentry op wan workflow and forgot to change filenames
>>106326110
visiting europe is crazy because you realize that the only reason most american thots are popular on social media at all is because the actually hot girls are all wearing beige coats instead of wearing bikinis and posting videos of it
Anonymous
8/20/2025, 7:47:07 PM
No.106326138
>>106326309
converge these nuts
Anonymous
8/20/2025, 7:56:44 PM
No.106326259
Anonymous
8/20/2025, 7:59:08 PM
No.106326287
>>106326316
>>106326110
>Decent resolution, 8 seconds, seemingly alright step count
What's your specs, man?
Anonymous
8/20/2025, 8:00:04 PM
No.106326309
Anonymous
8/20/2025, 8:01:03 PM
No.106326316
>>106326600
>>106326287
3090, 128gb ram
Anonymous
8/20/2025, 8:08:35 PM
No.106326399
>>106326110
looks cute but the fingernails kinda creep me out, they look like they got pulled out
Anonymous
8/20/2025, 8:13:41 PM
No.106326460
>>106326600
>>106330440
Anonymous
8/20/2025, 8:14:48 PM
No.106326475
>>106326110
Got same picture saved
Anonymous
8/20/2025, 8:22:46 PM
No.106326573
Anonymous
8/20/2025, 8:25:06 PM
No.106326600
>>106327273
>>106326316
howd you get 8 seconds without burn in at the beginning
>>106326460
i can tell that you don't have a foot fetish at all if you think this video was worth sharing
or maybe you just want to hurt footfags by sharing feet like those idk
Anonymous
8/20/2025, 8:34:57 PM
No.106326700
what do I have to prompt if I want a tile floor where all the tiles are just square and have the same size AAAAAAA
Anonymous
8/20/2025, 8:36:02 PM
No.106326712
>>106330440
Anonymous
8/20/2025, 8:48:07 PM
No.106326834
That wasn't actually what I meant.
Anonymous
8/20/2025, 9:05:40 PM
No.106327024
>>106327190
so has qwen edit been forgotten?
Anonymous
8/20/2025, 9:17:36 PM
No.106327166
>>106327273
What's the correct way to tell Wan to move the camera in a particular direction? I'm getting inconsistent results saying things like "zoom in towards x" "pan left" "move the camera to x" "change the viewpoint"
Sometimes telling to to "move the camera" is interpreted as moving some object in the scene rather than the PoV.
Anonymous
8/20/2025, 9:18:34 PM
No.106327178
that's it, I'm out
Anonymous
8/20/2025, 9:20:20 PM
No.106327190
>>106327243
>>106327024
Don't you worry. The turbo-autist will be back to do more "testing".
Anonymous
8/20/2025, 9:25:07 PM
No.106327243
>>106327276
>>106327190
But can it do Hatsune Miku without a lora? Nice!
Anonymous
8/20/2025, 9:28:14 PM
No.106327273
>>106327166
it helps to describe a new object that isn't in the image. "the camera pans to reveal a X"
>>106326600
>howd you get 8 seconds without burn in at the beginning
I use kijai's 2.2 workflow and it just werks. sometimes wan doesn't like the input image and will do weird shit.
Anonymous
8/20/2025, 9:28:34 PM
No.106327276
>>106327335
>>106327243
NO! I MUST SEE TODD HOWARD ON A MOUNTAIN BIKE!
Anonymous
8/20/2025, 9:32:40 PM
No.106327317
Anonymous
8/20/2025, 9:33:36 PM
No.106327325
>>106327416
anyone having just a black screen when using qwen edit?
Anonymous
8/20/2025, 9:34:57 PM
No.106327335
>>106327276
*change this man into hatsune miku driving bicycle* Oh my, nice! Did not use a lora!
Anonymous
8/20/2025, 9:41:42 PM
No.106327399
>>106327475
wasn't worth the time to train
Anonymous
8/20/2025, 9:42:50 PM
No.106327414
I feel like it was easier to train loras for Flux than Chroma. Or at least it was more forgiving.
Anonymous
8/20/2025, 9:43:09 PM
No.106327416
>>106327787
>>106327325
sage attention issue I think. Removing --use-sage-attention from the bat fixed it for me
Anonymous
8/20/2025, 9:48:36 PM
No.106327475
>>106327903
>>106327399
Qwen is good at creating real cosplayers, but the characters it knows are limited.
If you train characters with anime images, it also has the ability to create real cosplayer of the character with this lora?
Need to know anon :3
Anonymous
8/20/2025, 9:55:38 PM
No.106327540
>>106328475
with this "precision" setting, should you always try to match it to what the model's precision is?
(you can set precision with text encoder, vae, and the main model)
if I set it to fp32 it takes longer and I get different outputs but I'm having trouble figuring out if the quality is better.
Anonymous
8/20/2025, 9:59:49 PM
No.106327580
me so hony
Anonymous
8/20/2025, 10:09:30 PM
No.106327676
>>106327687
What's the best pytorch and cuda version for speed right now? Anyone tried investigating that rabbit hole?
Anonymous
8/20/2025, 10:10:31 PM
No.106327687
>>106327676
depends on hardware and python version too. too many variables to be worth investigating
Anonymous
8/20/2025, 10:19:37 PM
No.106327787
>>106327416
I found out that it was the model I was using (fp8 e4) that made it like that.
Switching to the bf16 solved the problem.
Anonymous
8/20/2025, 10:25:54 PM
No.106327858
>>106328271
Anonymous
8/20/2025, 10:30:07 PM
No.106327903
>>106327942
>>106327475
you are basically looking at the result of doing that, the prompt is. Photo of a girl eating an ice cream at the beach holding up a sign saying "Qwen Lora test". The image is of the character red hood.
That last sentence was an activation phrase.
The anime style seems to take over
Anonymous
8/20/2025, 10:33:16 PM
No.106327942
>>106327950
>>106327903
You overcooked it then or your captions are very poor. If you want to transfer a character you need to make sure art/anime tokens are in every caption. These models are more than capable of doing:
Anime (dataset) <- Character -> Photo (inferred)
Anonymous
8/20/2025, 10:34:04 PM
No.106327950
>>106327959
>>106327942
I used joycaption on ~150 images, if I would need to go above that to hand caption level I am not going to bother.
Anonymous
8/20/2025, 10:34:42 PM
No.106327959
>>106327976
>>106327950
Then you overcooked it from generalization
Anonymous
8/20/2025, 10:35:59 PM
No.106327976
>>106327985
>>106327959
I had it on half the default LR and kept all intermediate steps, anything that looked more "real" also started to lose character knowledge hard.
This is all compared to trivially working with illustrious or SDXL anime models
Anonymous
8/20/2025, 10:36:34 PM
No.106327985
>>106327996
>>106327976
Okay I can see where we're going, you're just a retard. Yes anon, can you believe it, an electric stove and a gas stove are different!
Anonymous
8/20/2025, 10:37:36 PM
No.106327995
very inorganic
Anonymous
8/20/2025, 10:37:42 PM
No.106327996
>>106328082
>>106327985
>bro my Chinese stovetop cooks this way, this French $10,000 in a $500 copper pan burned my eggs
Anonymous
8/20/2025, 10:44:29 PM
No.106328082
>>106328110
>>106328230
>>106327996
NTA but I assume you weren't here for the "muh 1.5 LoRAs learn so much better styles/character than SDXL or SDXL burns shit so easily" lol, not saying qwen learns better cause I haven't tried yet but we go through this hubbaloo everytime we get a new model and until someone figures out a way to train it better.
Anonymous
8/20/2025, 10:47:06 PM
No.106328110
>>106328230
>>106328082
The fact someone tried to train it like they do SDXL shows where the problem is if the "if I use this one token it completely railroads the output to anime" demonstrating they're not particularly good at training or determining what is a correct output. It reminds me of the guy who spammed Hyvid/Wan LoRAs of celebrities that had zero generalization capabilities, it would just do the obviously 10 clips they trained on and nothing else.
Anonymous
8/20/2025, 10:49:27 PM
No.106328140
>>106328278
>>106328500
whats the best way to split wan 2.2 lightx2v steps?
Anonymous
8/20/2025, 10:54:46 PM
No.106328190
qwen comfyui nunchaku status???????
Anonymous
8/20/2025, 10:56:41 PM
No.106328210
>>106325233
>14b lightx2v
absolutely this one
Anonymous
8/20/2025, 10:56:57 PM
No.106328213
>>106328272
https://github.com/spacepxl/demystifying-sd-finetuning
reminder that a larger dataset is always better and that artificially enlarging it by cropping etc. is even better
also use AdamW
Anonymous
8/20/2025, 10:58:14 PM
No.106328230
>>106328257
>>106328082
>>106328110
Ok frens
you have differences of opinion, it happens.
but could we skip that and get to the point where we can solve the problem and enjoy real anime girls?
Anonymous
8/20/2025, 11:00:26 PM
No.106328257
>>106328317
>>106328230
Train better you fucking retard, the problem isn't the model, the problem is you.
Anonymous
8/20/2025, 11:00:27 PM
No.106328258
>yet another failed style lora training
Fuck my useless ass
Anonymous
8/20/2025, 11:00:56 PM
No.106328268
Anonymous
8/20/2025, 11:01:25 PM
No.106328271
>>106328575
>>106329393
>>106327858
I feel like it's not gonna take long for real anime to get animated with AI
Anonymous
8/20/2025, 11:01:31 PM
No.106328272
>>106328213
The dataset increasing is the only thing I understand from that article, I'm too stupid to understand everything else. What the fuck is he even saying, my brain hurts.
Anonymous
8/20/2025, 11:01:54 PM
No.106328278
>>106328140
50/50 if you're not coping or 30/70 if you're coping
so try 10 steps 3/7 split if you're interested in meming around
Anonymous
8/20/2025, 11:05:56 PM
No.106328317
>>106328257
i'm just the leecher here and would never waste time on this - its your job, go eveb more incel tryhard and let me profit
Anonymous
8/20/2025, 11:19:25 PM
No.106328424
Is it worth exploring the various T5 encoder versions?
Anonymous
8/20/2025, 11:25:38 PM
No.106328475
>>106327540
fp32 is for compatibility for older cards afaik, your model is fp16, use fp16 if your card supports it
Anonymous
8/20/2025, 11:25:58 PM
No.106328480
>>106328504
>>106328869
>playing around with stable diffusion
>make some good looking pictures, jerk off to them
>check artists on pixiv for inspiration
There's something different about an artist's drawn picture that I'm reminded of after looking at slop for a while. I can look at the top rated slop on civitai, and the pose will be dynamic, the lighting amazing, the rendering superb, the environment detailed, and then I check an artwork on pixiv (not AI-generated) and even if the picture isn't as detailed or well-drawn, there's a lot more personality to it.
Also a lot of slop is designed to be slop, and takes inspiration from other slop. There are slop loras trained to look like your favorite slop creators. So I'm not sure if it's the technology itself, or just that sloppers have a certain preference.
Anonymous
8/20/2025, 11:27:29 PM
No.106328500
>>106328140
Use ComfyUI-WanMoeKSampler and let it do that dynamically like recommended by the devs, or use 30/70 which is roughly what I've seen happen most of the time using the dynamic node.
Anonymous
8/20/2025, 11:27:58 PM
No.106328504
>>106328548
>>106328591
>>106328480
the slop comes from the inbreeding and mixes
Anonymous
8/20/2025, 11:32:18 PM
No.106328548
>>106328673
>>106328504
i think its more that pixiv is for things you want to be judged on, while civit is just for sharing and dumping usually which is why the vibe is fundamentally different
Anonymous
8/20/2025, 11:32:56 PM
No.106328562
>checkpoint merged
instant shit signifier
>>106328271
a brave new world of QUALITY
Anonymous
8/20/2025, 11:35:14 PM
No.106328591
>>106328645
>>106328869
>>106328504
Yeah there's certainly a feedback loop of slop training more slop. Not even in the sense of machine learning training models, but even the viewer is trained to consume slop.
I noticed the range of expression seems a lot more limited in a very subtle way. Which seems counterintuitive because you can instantly generate a picture of almost anything you want. But the subjects, styles, and themes seem to converge to similar ideas and concepts, especially for the highest rated stuff.
I look at some of the top images on civitai and I think they look genuinely good. I take one look at a picture on pixiv, and not only do I think it looks good, but it makes me feel and think things that the slop didn't.
Anonymous
8/20/2025, 11:35:35 PM
No.106328597
>>106328575
I feel like the Chinese won't do it because they keep slopping up the datasets
Anonymous
8/20/2025, 11:35:40 PM
No.106328598
>>106328575
if it opens up creative expression I'm all for it
Anonymous
8/20/2025, 11:40:19 PM
No.106328645
>>106328673
>>106328591
The slop mixes and inbreeding does create a huge problem and a loss of personality in the model and it's a compounding issue especially with extreme filtering newer models, including SDXL, went through. The filters in a way are slopifiers because they remove a huge portion of quality images because they aren't professional stock photo enough (soulless).
>>106328548
The other thing I was thinking was that an artist hand drawing an image is spending magnitudes more time and effort on the image than most AI generated images.
>>106328645
Are there ways around this? Could one put more effort into their image generation and get an unslopped result? Or am I just overthinking it? Maybe AI art doesn't need to unslopped.
Anonymous
8/20/2025, 11:45:43 PM
No.106328684
>>106328718
>>106328673
>The other thing I was thinking
if you actually had reading comprehension you would have realized I said the same thing. pixiv is higher effort in general, AI assisted or not. if you just spam AI slop on pixiv you get banned really fast
Anonymous
8/20/2025, 11:47:57 PM
No.106328713
>>106328786
are the adetailer models on huggingface actually viruses or is it just their scanner sperging out
Anonymous
8/20/2025, 11:48:30 PM
No.106328718
>>106328736
>>106328684
You said pixiv is for things you want judged, and civitai is for dumping, and I was adding on that the method of creation requires more effort itself because hand drawing takes time while AI generation just takes prompting and clicking a button.
I wasn't disagreeing with what you're saying or saying you didn't think that, I was adding some context about how the method of drawing vs prompting itself inherently requires more effort, not just the difference between two websites. No need to attack my reading comprehension.
Anonymous
8/20/2025, 11:48:52 PM
No.106328722
>qwen edit doesn't know any panties outside of the general idea of underwear. So no thongs.
My day is ruined.
Anonymous
8/20/2025, 11:49:11 PM
No.106328726
>>106328673
It's deep in the model, you need to do a full unfiltered finetune of millions of images to undo it and you can never truly undo the inbreeding and slop once it's in. You can put more effort and that will help with "slop" but it can't make the model have a personality or express emotion because SDXl, Flux, etc are lobotomized from the start by removing candid, amateur, grungy "low quality" (they're not) images and bias towards staged, fake images. The only model that has some soul I've seen is Wan.
Anonymous
8/20/2025, 11:49:49 PM
No.106328736
>>106328752
>>106328673
>Could one put more effort into their image generation and get an unslopped result?
no because then you get a different kind of slop, the "trying to hard" kind of slop
>>106328718
sure, I'll attack you for being a blogposting redditspacing faggot instead lmao
Anonymous
8/20/2025, 11:51:06 PM
No.106328752
>>106328767
>>106328736
>redditspacing faggot
Paragraphs isn't redditspacing. Meanwhile you put a linebreak between quotes while I didn't and that would've gotten you called a redditfaggot 15 years ago on this site.
Anonymous
8/20/2025, 11:52:15 PM
No.106328767
>>106328811
>>106329118
>>106328752
>redditspacing isnt redditspacing
ok nigger
if you're bored we can keep (You)ing eachother for a bit i guess while GLM 4.5 does my work for me
Anonymous
8/20/2025, 11:53:53 PM
No.106328786
>>106328713
probably not but they do the bad pickle thing (run code) so you can't convert them to safetensors
Anonymous
8/20/2025, 11:57:14 PM
No.106328811
>>106328904
>>106328767
Nah I'm not interested in faggot namecalling but I am curious what this "trying too hard" slop is, because I'm not sure if I've seen it before, or could even recognize how different it is from low effort slop.
elf-hugger
8/20/2025, 11:57:19 PM
No.106328812
Don't forget to degauss you are hard drive.
Anonymous
8/21/2025, 12:00:31 AM
No.106328837
>>106328877
>>106328997
>>106326110
Does Wan always invariable return to a similar position after the 5 second mark?
Anonymous
8/21/2025, 12:03:15 AM
No.106328869
>>106328942
>>106328480
>>106328591
I'm going to be an elitist douche and say the main problem is that the easy accessibility of the technology enables people with no aesthetic sensibility to mass produce adequate content.
Generally, being a a passable visual artist requires tremendous effort. Besides countless hours of manual practice, you need to study hundreds of years worth of historical examples and techniques. You need to familiarize yourself with the physical properites of a wide range of media. And so on. This forces you to develop a sense not simply for what looks visually pleasing overall, but what minutiae of an image result in what specific perceptual effects.
Meanwhile some rando on civitai can type "1girl, booba" and wow!!! updoots to the left, everyone! There's nothing inherently wrong with this, but most people are easily pleased with 1girl, booba and not much more thought or elaboration is required. A nagging sense that something is missing may persist, but booba is delivered so whatever.
Professional artists using the technology will make more interesting outputs as they learn to use the tools more effectively. The bulk population will keep making booba and be happy with that.
Anonymous
8/21/2025, 12:04:01 AM
No.106328877
>>106328837
its suppose to, its only trained for 5 second videos. but i like to use this to my advantage to do penis reveal futa videos
Anonymous
8/21/2025, 12:07:11 AM
No.106328904
>>106328942
>>106328811
>I am curious what this "trying too hard" slop is, because I'm not sure if I've seen it before, or could even recognize how different it is from low effort slop
you know it when you see it. it's more difficult for visual mediums (how do you know when a painting is done), but you can easily tell when e.g. a song has "too much" in it
Anonymous
8/21/2025, 12:11:23 AM
No.106328942
>>106329130
>>106329195
>>106328869
>the easy accessibility of the technology enables people with no aesthetic sensibility to mass produce adequate content
This seems to be the main problem. What you've got is consumers now being producers, but without going through the process of actually learning to produce. It's this weird consumption/production hybrid. I was thinking about how image and video generation using AI is somewhat analogous to going to a media sharing website and looking for images. Prompting to me feels equivalent to typing into a search box, but instead of looking for pre-existing content, the AI model generates a new one to match your query. That's not creation to me.
>Professional artists using the technology will make more interesting outputs as they learn to use the tools more effectively
This is the part that I'm curious about. Using AI as a tool to augment the creative process.
>>106328904
You know this is an imageboard.
Anonymous
8/21/2025, 12:12:00 AM
No.106328950
>>106329085
>>106331081
>>106328673
most of the slop look comes from using optimizations. for img2video you can always take the time to inpaint and prepare the base image. and also through experience you'll learn what the limits are, like for a vertical 720x1280 video the most you can do is three girls full body standing together, and you'll learn pitfalls to avoid. you have to deal with a lot of compromise with ai
Anonymous
8/21/2025, 12:17:53 AM
No.106328997
>>106329021
>>106328837
i guess technically no but usually yes
Anonymous
8/21/2025, 12:18:02 AM
No.106329000
>>106329007
How do you know if your Lora is undercooked or overcooked?
Anonymous
8/21/2025, 12:18:45 AM
No.106329007
>>106329064
>>106329000
Undercooked = doesn't do what you want
Overcooked = only does your dataset
Anonymous
8/21/2025, 12:19:15 AM
No.106329012
>>106329440
>>106328575
Some poor guy is going to have the job of cleaning up all the mistakes AI makes over and over.
Anonymous
8/21/2025, 12:20:17 AM
No.106329018
Anonymous
8/21/2025, 12:20:31 AM
No.106329021
>>106329091
>>106328997
In my experience pushing beyond 5 seconds produces little movement, or the character moves somewhat according to the prompt before returning to the original position to start the movement again. For 8 seconds this is pretty good, but I wonder if pushing it to 10 would've made the camera return to its original position, or not produce the camera movement at all.
STATLER + (or) Waldorf & Company.
8/21/2025, 12:26:18 AM
No.106329061
>>106324635
>>106324689
>the duality of man!
beahgahgha
Anonymous
8/21/2025, 12:26:27 AM
No.106329064
>>106329310
>>106329007
What are all of these graphics with loss functions and how do I see mine when I'm training?
Anonymous
8/21/2025, 12:29:14 AM
No.106329085
>>106329309
Anonymous
8/21/2025, 12:29:47 AM
No.106329091
>>106329104
>>106329021
i'm goona teeeeest
Anonymous
8/21/2025, 12:30:51 AM
No.106329104
>>106329091
Yeah looks like it repeats. Was it the same prompt? As before?
Anonymous
8/21/2025, 12:31:53 AM
No.106329118
>>106329130
>>106328767
>GLM 4.5 does my work for me
Wait how do you do this, because I've been spending all my time making AI slop and not doing my work.
Anonymous
8/21/2025, 12:33:26 AM
No.106329130
>>106329158
>>106328942
>You know this is an imageboard.
and you should know that "slop" is qualitative, and therefore subjective so an image wouldnt help. so we're slinging (You)s at eachother yeah?
>>106329118
just install a vs code extension like Cline and turn on auto mode
Anonymous
8/21/2025, 12:33:50 AM
No.106329134
>>106329140
>>106329280
why is seedance number 1? Who the fuck even uses this paid tool?
Anonymous
8/21/2025, 12:34:24 AM
No.106329140
>>106329134
>seedance
literally who?
Anonymous
8/21/2025, 12:36:28 AM
No.106329158
>>106329206
>>106329242
>>106329130
I don't need you to prove what is or isn't tryhard slop, just give an example of what you think is one. You know, to discuss, on this imageboard.
>so we're slinging (You)s at eachother yeah?
I mean I'm about to take a nap for half an hour so if you want to wait I can give you some sweet (You)s.
Anonymous
8/21/2025, 12:37:00 AM
No.106329167
>>106329229
>>106329236
Anonymous
8/21/2025, 12:39:54 AM
No.106329195
>>106328942
>It's this weird consumption/production hybrid. I was thinking about how image and video generation using AI is somewhat analogous to going to a media sharing website and looking for images. Prompting to me feels equivalent to typing into a search box, but instead of looking for pre-existing content, the AI model generates a new one to match your query. That's not creation to me.
Yeah, I think that's exactly it
Anonymous
8/21/2025, 12:40:55 AM
No.106329206
>>106329158
>I'm about to take a nap for half an hour
i wish i could take short naps. i never fall asleep fast enough
Anonymous
8/21/2025, 12:43:13 AM
No.106329229
>>106329167
I have a sudden urge to read cool HFY military scifi
Anonymous
8/21/2025, 12:44:05 AM
No.106329236
>>106329167
Where's the greebles? It's not a sci fi spaceship without greebles.
Anonymous
8/21/2025, 12:44:26 AM
No.106329242
>>106329486
>>106329553
>>106329158
>what is or isn't tryhard slop, just give an example of what you think is one
oh i can give an example without an image actually.
the works of bob ross are objectively slop, that's the entire point of the show. usually at the end of each painting he ruins it with a big ass tree right down the middle
another example is something like running euler for 60 steps when you're doing something like jewellery or coatings and you get "too much", like to the point where the shinyness overstimulates your brain
Anonymous
8/21/2025, 12:45:46 AM
No.106329259
I hate the word slop so goddamn much, it's overused by snobs
Anonymous
8/21/2025, 12:48:25 AM
No.106329280
>>106329134
>Who the fuck even uses this paid tool?
Retards who can't into local. NB4 leaderboard stats.
Anonymous
8/21/2025, 12:50:37 AM
No.106329309
Anonymous
8/21/2025, 12:50:39 AM
No.106329310
>>106329342
>>106329064
use something like tensorboard
but loss is generally useless as it doesn't measure generalization, the best way to actually watch your training progress is using validation prompts that focus on generalization, there's basically three images you should do:
- something unrelated to your dataset
- a generalization test (e.g. a cosplay photo of your anime character)
- one of the captions you're training with
Anonymous
8/21/2025, 12:53:26 AM
No.106329342
>>106329349
>>106329310
Can you give me some examples please? I would still want to know how to setup tensor board, I wanna follow the demystifying fine-tuning article.
I'm having problems knowing if I overtrained my model or if it's undercooked.
If you have some samples please post them.
Anonymous
8/21/2025, 12:54:18 AM
No.106329349
>>106329371
>>106329342
loss is generally useless as it doesn't measure generalization, the best way to actually watch your training progress is using validation prompts that focus on generalization, there's basically three images you should do:
- something unrelated to your dataset
- a generalization test (e.g. a cosplay photo of your anime character)
- one of the captions you're training with
Anonymous
8/21/2025, 12:55:21 AM
No.106329360
>>106330561
Anonymous
8/21/2025, 12:56:36 AM
No.106329371
>>106329494
Anonymous
8/21/2025, 12:59:30 AM
No.106329389
Are we ever going to have a local nano banana? I've been testing it, it seems very capable.
Anonymous
8/21/2025, 1:00:24 AM
No.106329393
>>106329418
>>106329422
>>106328271
All they really need to do is spend more manual human effort on key frames and then do all the inbetweens with AI. It just seems like such a no-brainer that I'm surprised they're not already rushing to do this.
Anonymous
8/21/2025, 1:02:34 AM
No.106329418
>>106329438
>>106329393
Anyone doing it won't be announcing it for years.
Anonymous
8/21/2025, 1:03:09 AM
No.106329422
>>106329440
>>106329446
>>106329393
erm... what about the heckin' artists, sweaty?
Anonymous
8/21/2025, 1:05:07 AM
No.106329438
>>106329418
they're 100% doing it because money but also because the East is more AI-positive
but also because money but also because if Netflix are already doing it for real life stuff then the anime studios are definitely doing it and also because money
Anonymous
8/21/2025, 1:05:10 AM
No.106329440
>>106329422
they'll have new jobs
>>106329012
Anonymous
8/21/2025, 1:05:37 AM
No.106329446
>>106329422
all promoted to keyframers and becoming detailers for keyframes.
Anonymous
8/21/2025, 1:10:55 AM
No.106329486
>>106329242
Sure Bob Ross is slop but not AI slop. But to be fair, his artistry lied in the act of painting itself rather than the final result. Also we were talking about tryhard slop, while Bob Ross is intentionally low effort.
Anonymous
8/21/2025, 1:11:31 AM
No.106329494
>>106329539
>>106329371
Thanks for nothing anon
Anonymous
8/21/2025, 1:13:05 AM
No.106329510
>>106329557
now this is testing
jc denton is trapped in hell and is now a mystery-meat twink, which is also hell
Anonymous
8/21/2025, 1:15:35 AM
No.106329539
>>106329494
Sorry if you're actually retarded and don't understand what I said
If it's an consolation: you don't have high enough IQ and abstract thinking to make a LoRA or train anything, you'll only produce trash and never understand why.
Anonymous
8/21/2025, 1:17:05 AM
No.106329553
>>106329242
If you think plastic skin 1girl Ponys are the same level of slop as a hand-painted generic landscape painting you're absolutely delusional.
The white cloud painted with a fan brush has infinitely more soul and expression than a million AI 1girls.
Anonymous
8/21/2025, 1:18:05 AM
No.106329557
>>106329510
Is he gonna listen to the cow or what?
Anonymous
8/21/2025, 1:22:09 AM
No.106329607
>>106329625
>>106329627
why the fuck cant i find the qwen edit text encode node anywhere. i updated comfy but it still cant find the nodes. comfy is broken as fuck
Anonymous
8/21/2025, 1:22:28 AM
No.106329609
https://civitai.com/models/1782437
This project is surprisingly not dead. New rouwei gemma version has released. Alongside a new t5 gemma 2b.
Still, neither work practically well enough to be useful for much. Both are essentially bunch of pre-alpha experiments.
But the fact that one dude can train text encoders that can output coherent-ish (although not really following the prompt) images with 3 5090s is nothing to scoff at imo.
I don't think either of these models have too much potential. (Maybe the t5 has more potentially, being 2 times larger and t5 models are naturally geared towards this task. Though at this moment it is less trained than the other gemma so similarly useless for now.) Hopefully the author eventually switches to qwen-vl or similar as they claim so that we can get something useful one day.
Anonymous
8/21/2025, 1:23:43 AM
No.106329625
Anonymous
8/21/2025, 1:23:56 AM
No.106329627
>>106329607
"TextEncodeQwenImageEdit"
This?
Works for me, updated everything, refreshed the page, and it works.
Anonymous
8/21/2025, 1:25:47 AM
No.106329643
Anonymous
8/21/2025, 1:26:41 AM
No.106329651
me in the black car
How reliable is ChatGPT when I ask it technical questions about stable diffusion? I don't have a lot of background knowledge so I can't check if it's explanations are truthful, or hallucinations.
Anonymous
8/21/2025, 1:30:15 AM
No.106329685
>>106329750
>>106329665
The thinking model is reliable as long as you always start with: "Think and verify online xxx" and you actually properly say what system (gpu, ram, os, launch parameters for comfy)
Anonymous
8/21/2025, 1:31:37 AM
No.106329697
>>106329750
>>106329665
It can be really bad. Asked it the syntax for comments in comfyui today and it got it wrong and confidently said it was certain. One quick google told me what the actual syntax was.
Anonymous
8/21/2025, 1:33:03 AM
No.106329717
>>106329750
>>106329665
How technical are your questions? It can answer basic normie questions fine but actual technical stuff it will hallucinate a lot. They all will.
If you want to learn what a technique or optimization precisely does upload its arxiv paper and ask about it.
Or attach the relevant code snippet from a github implementation.
Anonymous
8/21/2025, 1:35:14 AM
No.106329744
>>106329665
so-so. Generally best to either repeat the query or ask chatgpt+grok/claude the same quest, then when they disagree cross reference. You can also paste your question + answer into a fresh chat and ask it to verify. Just turn off memory.
Anonymous
8/21/2025, 1:35:38 AM
No.106329750
>>106329784
>>106329685
>>106329697
>>106329717
I meant in a more foundational way. Like explain how cross-attention works between the latent image and the text embedding. I assume it should get most of this stuff right since it must be trained on the research papers, but I find it's explanations hard to grasp, and its metaphors mostly useless, while checking other resources like math youtube channels get the concepts across more straightforwardly.
Anonymous
8/21/2025, 1:39:05 AM
No.106329784
>>106329808
>>106329750
The more general and "known" the knowledge, the better it is it with. But like other anons said, your best bet is always :
- use the thinking model and tell it to search.
- feed it arxiv and github links.
- feed it documents if you have.
I got it to write me cool scripts for comfy and they work pretty nicely.
>checking other resources like math youtube channels get the concepts across more straightforwardly
Then what's the issue lol, use youtube channels then. These are all relevant tools.
what's the best nsfw anime model right now? is it still wai v11?
Anonymous
8/21/2025, 1:42:34 AM
No.106329808
>>106329824
>>106329855
>>106329784
>Then what's the issue lol, use youtube channels then. These are all relevant tools.
I do, but I like the conversational nature of chatgpt to go back and forth with my understanding. ChatGPT also helps with calling to attention things I might not have knoen before just from a simple query like "how does text embedding work" gave me a lead towards query, key, and value vectors, but it's explanation was hard to understand while a math channel on youtube explained it much better.
Anonymous
8/21/2025, 1:42:58 AM
No.106329814
>>106329665
I only use Grok but the more I use it, the more I realize how criminally overrated LLMs are. It's a tool to help you do things faster but I think its hitting a wall no matter what "AGI Sam" says
Anonymous
8/21/2025, 1:44:10 AM
No.106329824
>>106329855
>>106329808
Ah I forgot to say, the conversational part is what gets me worried, as I'm sure it's general knowledge is good, but when I dig deeper and ask for clearer explanations, or when I take a train of thought into a different direction based on my own knowledge, that's when I'm not sure if it starts making things up when it starts saying things like "You got it!" or "That's an interesting point!"
Anonymous
8/21/2025, 1:46:24 AM
No.106329839
>>106329803
>wai v11?
Nobody knows what that is. Ask in /adg/
Anonymous
8/21/2025, 1:46:36 AM
No.106329842
>>106329803
The T5 Rouwei looks interesting but it can break loras that need the clip hook.
Anonymous
8/21/2025, 1:47:49 AM
No.106329855
>>106329897
>>106329808
>>106329824
What model of chatgpt are you using?
Also go with robot style to avoid too much glazing.
I have the paid version so I don't know if this is available for free, but the base non thinking is just for very simple stuff.
Anonymous
8/21/2025, 1:49:22 AM
No.106329870
>>106329900
>>106329912
RTX 6000 has shipped bros
Anonymous
8/21/2025, 1:51:42 AM
No.106329897
>>106329982
>>106329855
Whatever the version is when you login on the website. In my current chat it's said
>Ah β this is a deep one, and youβre exactly zeroing in on the non-obvious part
or
>Exactly β youβre thinking along the right lines.
but then what I read next makes it seem like it just spitting out sycophantic remarks since the explanation that follows seems to relate more to the previous response than my follow up question that triggered the remark.
Anonymous
8/21/2025, 1:52:07 AM
No.106329900
>>106329922
>>106329870
>4090 users are called vramlet now
Anonymous
8/21/2025, 1:53:10 AM
No.106329912
>>106329922
>>106329870
It's never enough
Anonymous
8/21/2025, 1:53:33 AM
No.106329919
Is there a workflow to turn basic stickman drawings into reliable controlnets? Especially to combine them with regional prompting. Especially for Illustrious or SDXL in general.
Anonymous
8/21/2025, 1:53:50 AM
No.106329922
>>106330016
>>106330151
>>106329900
Now it's the 5090 owners. For the record I did not pay for it.
>>106329912
I'm also working with a vendor on a 3RU server build that can hold 6 for the server cards I have a large budget this year :)
Anonymous
8/21/2025, 1:58:07 AM
No.106329966
>>106329986
>>106330000
QIE really can't remaster an image? I've tried all variations: remove blur, enhance, sharpen, upscale, remaster, turn into a high-definition photo, restore, etc. I've even tried it in engrish. Is there something fundamental I'm missing here, or does it really not understand this concept?
Anonymous
8/21/2025, 1:59:47 AM
No.106329982
>>106329897
You are probably getting their cheapest (and dumbest) model.
As for glazing, just tell it to stop showering you with praises, to be no nonsense and to the point.
And simply switch to robot personality in the option if you can do that.
Anonymous
8/21/2025, 2:00:34 AM
No.106329986
>>106330020
>>106329966
Might help to explain what you're trying to achieve. I use restyle to X and it works pretty well.
Anonymous
8/21/2025, 2:02:07 AM
No.106330000
>>106329966
Yeah, it has a very limited vocabulary and doesn't seem to be able to do things outside of that. It may be incapable, or you might need to find a very specific trigger word. Try translating the prompt into moon runes too.
Anonymous
8/21/2025, 2:03:40 AM
No.106330016
>>106330038
>>106330162
>>106329922
>I'm also working with a vendor on a 3RU server build that can hold 6 for the server cards I have a large budget this year :)
If only image generation models could be split across multiple GPUs effectively...
Anonymous
8/21/2025, 2:03:46 AM
No.106330020
>>106330104
>>106329986
I'm trying to take a shitty low-resolution image with artifacts and blur and turn it into a higher-quality image. Actually, it's a frame captured from a wan video. I'm hoping to use this to generate novel views of a subject to use for training data.
Anonymous
8/21/2025, 2:05:58 AM
No.106330038
>>106330016
You can at lest make the same request to multiple gpus.
Anonymous
8/21/2025, 2:06:13 AM
No.106330041
>>106329803
If you enjoy the default AI look then yes
Anonymous
8/21/2025, 2:06:49 AM
No.106330047
>>106330168
>>106330241
The most recent version of this Neta Lumina finetune is a pretty solid improvement over the base Neta Lumina 1.0, IMO:
https://civitai.com/models/1790792?modelVersionId=2122326
Positive:
`masterpiece, best quality, 1other, rimuru tempest, dim lighting, yellow eyes, long hair, blue hair, hair between eyes, (smile:0.8), (glowing eyes:0.4), closed mouth, looking at viewer, solo, reaching towards viewer, fur trim, long sleeves, reaching, chromatic aberration, (best quality, very awa:0.8)`
Negative:
`ai generated image, blurry, worst quality, low quality, bad quality, normal quality, sketch, simple background`
Seed:
1214320592
and Euler Ancestral Linear Quadratic @ CFG 4.5 for both
Anonymous
8/21/2025, 2:10:34 AM
No.106330075
>>106330053
this is quite the improvement. nice ill have to check it out.
Anonymous
8/21/2025, 2:11:36 AM
No.106330084
>>106330194
>>106330053
Is this cherrypicked or it actually moved beyond generating 2006 deviantart pics?
Anonymous
8/21/2025, 2:14:22 AM
No.106330104
>>106330020
Hmm, I'll poke around and see if I can get it to work.
Anonymous
8/21/2025, 2:14:44 AM
No.106330105
>1girl, solo, black eyelashes, red eyes, voluminous ponytail, headband, long hair, orange hair, blush stickers, high collar, girl scout, pink scout badge strap, short sleeved blouse, squared skirt, boots, purple hoodie, messed hair, green pants, boots
when did this idiot trend start that lora creators fill their trigger word fields with whole prompts?
>>106329922
>For the record I did not pay for it.
how does one get rtx 6000 for free?
Anonymous
8/21/2025, 2:23:15 AM
No.106330159
>>106330186
>>106331278
>>106330151
>glurp glurp glurp
Anonymous
8/21/2025, 2:23:28 AM
No.106330162
>>106330016
yeah it won't do much for image gen but I'll use that for training.
Anonymous
8/21/2025, 2:24:24 AM
No.106330168
>>106330047
that's pretty close to the dark fantasy art classics
Anonymous
8/21/2025, 2:24:39 AM
No.106330171
>>106330196
>>106330199
Is there much of a difference going from a 4090 to a 5090?
Anonymous
8/21/2025, 2:25:49 AM
No.106330186
>>106330159
what is glurp?
Anonymous
8/21/2025, 2:25:52 AM
No.106330187
>>106330212
It would be like going from a 3090 to a 4090 but you also have an extra 8gb of vram to play around with.
Anonymous
8/21/2025, 2:27:03 AM
No.106330194
>>106331021
>>106330084
the original was a good model IMO but it was HEAVILY reliant on named artist tags for aesthetics, this finetune (at least in the latest version, like I said), generates good looking images even just with the standard NovelAI-style quality tags.
Here's another comparison attached.
Positive:
`masterpiece, best quality, 1girl, blonde hair, solo, blue eyes, breasts, shirt, looking at viewer, red lips, belt, rock, red shirt, ocean, sky, large breasts, day, outdoors, makeup, bracelet, red nails, parted lips, blue sky, skirt, cloud, black belt, selfie, jewelry, dutch angle, lipstick, denim, smile, blue skirt`
Negative:
`ai generated image, blurry, worst quality, low quality, bad quality, normal quality, sketch, simple background`
Seed:
2701528463
and Euler Ancestral Linear Quadratic @ CFG 4.5 for both like before
Anonymous
8/21/2025, 2:27:06 AM
No.106330196
>>106330171
one is a fire hazard and the other is a big fire hazard
Anonymous
8/21/2025, 2:27:51 AM
No.106330199
>>106330212
>>106330171
yeah, I have both and the 5090 is way better, though not as much a leap as the 3090 -> 4090
Anonymous
8/21/2025, 2:29:06 AM
No.106330207
>>106330656
>>106330686
booru style tags are great for image cataloging and search, but are terrible for image generation and the longer we're stuck with models that require it the longer this method will poison people's minds
Anonymous
8/21/2025, 2:29:52 AM
No.106330210
>>106330267
>>106330355
Anonymous
8/21/2025, 2:30:11 AM
No.106330212
>>106330226
>>106330187
>>106330199
I've been considering getting one if I can find one for a good price/swap my 4090 for it. I could also upgrade from 64 to 96gb of ram
Anonymous
8/21/2025, 2:32:27 AM
No.106330226
>>106330254
>>106330212
Not worth it unless you sell the 4090 at really high price, and you should do that soon because the value of the the 3090/4090 will probably plummet with the new Super 5080 and its 24GB vram.
Anonymous
8/21/2025, 2:33:53 AM
No.106330236
>>106330151
I requested it from my company. I have a lot of simulations I'm supposed to run, but my coworker underestimated how long he would take to generate the initial data so I'm free to produce 1girls indefinitely.
If anyone wants me to train loras for them, I'll do it if you provide instructions and data.
>>106330047
Is that AI art?
Anonymous
8/21/2025, 2:36:44 AM
No.106330254
>>106330226
>the value of the the 3090/4090 will probably plummet with the new Super 5080 and its 24GB vram.
It's been like 10 years of solid non-stop price rises for GPUs. The only reasoning people ever give for a decline in prices is that a new product will be released soon and that price drop in older products never arrives.
The moment a new product arrives demand for the old product simply increases to meet any potential price drop.
Anonymous
8/21/2025, 2:36:46 AM
No.106330255
>>106330241
Yeah, it's very good but not perfect, look at the inverted feet of the woman.
Anonymous
8/21/2025, 2:38:00 AM
No.106330262
>>106330241
AI can never be art.
Anonymous
8/21/2025, 2:39:42 AM
No.106330267
Anonymous
8/21/2025, 2:40:01 AM
No.106330268
>>106330241
First pass chroma output with deepshrink, just inpainted the girl a bit. She's still not perfect.
Anonymous
8/21/2025, 2:45:57 AM
No.106330300
>>106324770
I'm in the Cosine with Restarts/Prodigy camp. It's always been extremely consistent.
Anonymous
8/21/2025, 2:45:57 AM
No.106330301
>>106330516
Anonymous
8/21/2025, 2:48:56 AM
No.106330314
>>106330358
>2028
>alibaba goes dei
>dei hires create deQwan 50b and deQwan 13b video models
>local users report black screen output and stolen information
Anonymous
8/21/2025, 2:54:26 AM
No.106330339
>>106330344
is Topaz still SOTA for frame interpolation?
Anonymous
8/21/2025, 2:55:00 AM
No.106330344
Anonymous
8/21/2025, 2:56:29 AM
No.106330355
>>106330210
I could have saved her
Anonymous
8/21/2025, 2:56:33 AM
No.106330358
>>106330314
I doubt it, but I'm wondering how many more good releases we can expect before Qwen expects to make a return on their investment.
Anonymous
8/21/2025, 3:01:40 AM
No.106330388
>>106330401
wan fried one of my SSDs :(
Anonymous
8/21/2025, 3:01:45 AM
No.106330389
I tried to experiment but I am seeing zero change after going with v1.1 of t5 xxl. Even after zooming in to pixel by pixel level.
So the question I am asking is, is there no difference between v1 and v1.1 of t5, or does the bog standard t5xxl_fp16.safetensors download (e.g.
https://huggingface.co/comfyanonymous/flux_text_encoders/blob/main/t5xxl_fp16.safetensors) is already v1.1 version so I just wasted time and bandwidth?
>checksum
No point since v1.1 is gguf (though still fp16 precision)
Anonymous
8/21/2025, 3:03:12 AM
No.106330401
>>106330420
>>106330388
That SDD was going to fry with or without wan.
Anonymous
8/21/2025, 3:05:36 AM
No.106330420
>>106330431
>>106330401
yeah but it still sucks that i have to replace it
Anonymous
8/21/2025, 3:06:53 AM
No.106330431
>>106330420
Sorry for your loss. But at least it wasn't a GPU.
Anonymous
8/21/2025, 3:07:18 AM
No.106330437
>>106330482
Everything I do with qwen image edit turns out blurry
Anonymous
8/21/2025, 3:07:39 AM
No.106330440
Anonymous
8/21/2025, 3:12:22 AM
No.106330482
>>106330437
Every time I peepee I poopoo
Anonymous
8/21/2025, 3:17:04 AM
No.106330515
>>106330926
>>106330053
>still has aliasing
nah still shit
Anonymous
8/21/2025, 3:17:19 AM
No.106330516
>>106330301
>moments before the orc rape orgy
Anonymous
8/21/2025, 3:23:55 AM
No.106330561
Anonymous
8/21/2025, 3:29:05 AM
No.106330599
Anonymous
8/21/2025, 3:29:42 AM
No.106330607
>>106330637
>>106330865
maybe?
Anonymous
8/21/2025, 3:32:25 AM
No.106330632
>>106329803
no, ew.
https://civitai.com/models/1217645/sih for ease of use and slight slop look, naiXLVpred102d_custom for styles
>>106330053
cool, will have to try this one. neta was 80% of the way to being really usable
elf-hugger
8/21/2025, 3:32:43 AM
No.106330637
Anonymous
8/21/2025, 3:34:19 AM
No.106330652
>>106330788
Whenever a model comes out like wan2.2 but for 30 sec videos, non-AI porn will die.
Anonymous
8/21/2025, 3:34:33 AM
No.106330656
>>106330671
>>106330726
>>106330207
FALSE, booru tags are god's greatest gift to proompters
Anonymous
8/21/2025, 3:36:41 AM
No.106330671
>>106330656
here is an example of a poisoned mind
Anonymous
8/21/2025, 3:38:09 AM
No.106330686
>>106330706
>>106330207
Yeah i love having to write three paragraphs to get an image that i want instead of 15 tags with a couple of basic sentences and then gen 1000 different images
>>106330686
have you ever tried making something more complicated than 1girl images? i wanna know how you can create complex compositions with a system designed to have nothing to do with composition?
Anonymous
8/21/2025, 3:40:30 AM
No.106330711
>>106330706
slot machines! roll till I score!
Anonymous
8/21/2025, 3:42:19 AM
No.106330726
>>106330791
>>106330656
There are areas where booru tags fall short though and some models are so cooked on them that you literally canβt prompt the thing you want if thereβs no tag for it(one that I always get annoyed by is danbooru basedboys refusing to add race/ethnicity tags, for example.)
>>106330706
>i wanna know how you can create complex compositions with a system designed to have nothing to do with composition?
By not being a retard and using proper tools like regional prompting and inpainting
Anonymous
8/21/2025, 3:44:53 AM
No.106330749
>>106330761
i think i'm afflicted with retardbrain because i don't think i would have gotten into image genning without the slot machine mechanics of older models that used tags. being able to precisely prompt for what i want is nice but sometimes it's more fun to write a basic nondescript prompt, queue up an hour's worth of images, and be surprised by what i get
Anonymous
8/21/2025, 3:46:35 AM
No.106330761
>>106330793
>>106330749
I remember when we were all still playing with sd1.4 and one anon told the threads about βboomer promptingβ kek, just prompt something like an old boomer would caption their photos on facebook, and it usually led to fun stuff. I think k he used it for uohhh tho
Anonymous
8/21/2025, 3:47:50 AM
No.106330769
>>106330829
>>106330742
i used to do this a lot and the reality is that you run into hard limits eventually where the model simply doesn't understand what you're trying to do, no matter how much you use either of those
Anonymous
8/21/2025, 3:48:09 AM
No.106330771
>>106330829
>>106330742
so you use extra tools to do something you can't do with booru style tags because they aren't designed for composition? can you not even conceive of another style of prompting that lets you describe compositional information along with picture attributes and subject information?
no because your mind is fucking poisoned like I said so you add tools and processes to your workflow to make up for shitty prompting system
Anonymous
8/21/2025, 3:50:08 AM
No.106330788
>>106330652
This is not true because a real woman being degraded/hurt is part of the goon. Sex is about power and power is about [the ability to inflict] violence, after all
It will give us good data on how many pedos are actually not child molesters based on how much it disrupts the CP trade. I already don't care about the real thing at all even with 5 second clips. I just need maybe 20 seconds + audio (dialogue and sound effects) and I'm happy forever. No you don't get it I might not even make an Instagram for my future daughter's gymnastics journey that's how happy I'll be forever
Anonymous
8/21/2025, 3:50:17 AM
No.106330791
>>106330814
>>106330815
can your model do this?
>>106330726
sure, ideally models shouldn't be limited to tags, but booru tags are very powerful and concise. they also remove ambiguity around various words and phrases, and are excellent for describing poses and all kinds of stuff that is just missing from training data otherwise.
Anonymous
8/21/2025, 3:50:21 AM
No.106330792
>>106330829
>>106330742
The fuck if you use region you are basically writing a paragraph worth of tags + having to wrangle with it to actually work. Also good luck getting any sort of spacial prompting without rng gods smiling at you.
Anonymous
8/21/2025, 3:50:26 AM
No.106330793
>>106330817
>>106330761
Also I remember sd1.4 would generate ethnic tits aplenty with just βkhazar milkersβ as a prompt lol.
Anonymous
8/21/2025, 3:52:48 AM
No.106330811
>>106330871
Name a single model that outputs better gens due to retagging *booru images (hint: you can't)
>>106330706
>designed to have nothing to do with composition?
Erm... anon?
https://danbooru.donmai.us/wiki_pages/tag_group:image_composition
No need to reply
Anonymous
8/21/2025, 3:53:03 AM
No.106330814
>>106330791
probably the best system would be the use of natural language prompting, plus some sort of tag hierarchy, which booru style tags do not support because they were never meant to describe compositions but index images based on features
Anonymous
8/21/2025, 3:53:05 AM
No.106330815
>>106330791
An ideal would be a model that fully supports and understands both boorutags and natural language. Itβs practically impossible to get something like βIndian womanβ with current booru tag only models for example, because danbooru again doesnβt have race/ethnicity as a concept. Though funnily enough just prompt symmetra (overwatch) and there you go, but itβs only the one lol
Anonymous
8/21/2025, 3:53:21 AM
No.106330817
>>106330793
Tell us more about the pre novel ai leak days anon
>model that understands both tags and natural language
illustrious 2.0 does this but no one cares because theyre all promptlets
Anonymous
8/21/2025, 3:54:55 AM
No.106330829
>>106330848
>>106330769
sure but its always even worse for trying to get EVERYTHING from the first, single prompt
>>106330771
no base model is as good as basic regional prompting when it comes to control
if you actually had any complex images you wanted to gen you would not be a techlet who doesnt even use these basic tools while at the same time larping as if you are making something complex
also i never said you have to only use the most basic tags, i specifically said
>15 tags with a couple of basic sentences
>>106330792
there is no silver bullet of perfection, all tools ultimately rely on the same model, the point is it will always be easier to use regional prompting to get exactly what you want if you are actually making complex images
Anonymous
8/21/2025, 3:57:20 AM
No.106330848
>>106330829
I'm not even going to bother responding. stupid people have a lack of imagination and that shows in your inability to conceive of a prompt system more complex that a list of tags, and maybe a natual sentence or two
Anonymous
8/21/2025, 3:57:31 AM
No.106330852
>>106330823
*Sucks in air through teeth*
Tone, incredulous: "Are you still talking about SDXL? That model's been talked about for years. We have chroma, we have qwen, we have all the things."
Anonymous
8/21/2025, 3:57:43 AM
No.106330853
>>106330876
>>106330823
local models that can handle tags+NL:
>rouwei (a bit, could improve a lot with rouwei gemma)
>neta lumina
>chroma
>qwen (probably not even trained on tags, it's just powerful enough to get it)
Anonymous
8/21/2025, 3:58:19 AM
No.106330861
>>106330823
>because theyre all promptlets
This
Anonymous
8/21/2025, 3:58:44 AM
No.106330865
>>106330607
I want to tell you where you can get them way cheaper, but I also don't because they'll probably jack up the prices before I try to get another one
Anonymous
8/21/2025, 3:59:57 AM
No.106330871
>>106331035
>>106330811
>No need to reply
ok
but seriously because I knew someone would pull this shit but go ahead and look at the list of tags you just shared. notice one of them is "negative space" now tell me, how to use that tag in a compositionally useful way in the current booru style tagging system used in popular SD models
Anonymous
8/21/2025, 4:00:17 AM
No.106330874
>>106330917
I hope someone is working on a booru fine tune of qwen, natural language is just shit tags are a better design
Anonymous
8/21/2025, 4:00:21 AM
No.106330875
>>106330916
>>106330954
>>106330823
CLIP is horseshit at nlp, it's not even worth anyone's time. I swear everytime someone says that, they tried one nlp prompt and after a bazillion regens it gets one right and goes "oh my gud it wurks"
Anonymous
8/21/2025, 4:00:55 AM
No.106330876
>>106330961
>>106330853
only one of those models can produce even close to serviceable anime and its just a "retraining" of illustrious kek
Anonymous
8/21/2025, 4:04:51 AM
No.106330908
Can someone write a Qwen training guide? I am having a hard time training loras even after lots of epochs, the model still outputs slopped images with plastic skin or slopped gpt4o piss filter illustrations
Anonymous
8/21/2025, 4:05:23 AM
No.106330916
>>106331006
>>106330875
the fact that we're still talking about SDXL is the horseshit. I would've been totally slopped out of generative ai this year if it weren't for the video revolution. There are no SDXL outputs that look good anymore in 2025Q3. The vae is just too garbage
Anonymous
8/21/2025, 4:05:28 AM
No.106330917
>>106330937
>>106330999
>>106330874
i doubt anyone is working on finetuning qwen yet since it would cost a shit ton of money
Anonymous
8/21/2025, 4:06:23 AM
No.106330926
>>106330515
looks pretty standard for that kind of style to me
Anonymous
8/21/2025, 4:06:30 AM
No.106330927
*sniffs*
hm... newfaggotry...
Anonymous
8/21/2025, 4:07:36 AM
No.106330937
>>106330999
>>106330917
maybe alibaba will just do it themselves
sdxl will continue to be the model that everyone outside of this thread uses for the next few years unless we get some magic widely adopted inference optimization or hardware gets exponentially better
Anonymous
8/21/2025, 4:09:02 AM
No.106330953
>>106330962
How do I run the Radiance Chroma?
Anonymous
8/21/2025, 4:09:05 AM
No.106330954
>>106330875
Indeed. Only t5 family or theoretically some other repurposed LLMs can do natural language text encoding good.
CLIP is a tiny model with very limited fundamental understanding of the intricacies of human language. All attempts to beat these complexities with mere finetunes is futile.
>>106330939
normies will use saas
people who care will get better hardware
sdxl is like a rotting zombie grasping at the heels of progress
Anonymous
8/21/2025, 4:10:27 AM
No.106330961
>>106330876
wat? even base Neta Lumina is quite good if you use artist tags. The finetune I mentioned earlier doesn't need them so much though and is also just way more coherent in general. Unless you're just saying you expect everything to look like generic shiny SDXL ambiguous anime style automatically forever.
Anonymous
8/21/2025, 4:10:27 AM
No.106330962
>>106330953
you need to get the workflow from lodestones' shitcord and you need to use a custom comfyui fork
Anonymous
8/21/2025, 4:11:10 AM
No.106330968
>>106330977
>>106331003
>>106330957
You guys are like people getting mad at screwdrivers still existing once the power drill was invented. Like, itβs not a zero sum game lol why
Anonymous
8/21/2025, 4:11:31 AM
No.106330972
>>106331016
>>106330706
you can photoshop something together
Anonymous
8/21/2025, 4:12:26 AM
No.106330977
>>106330968
>why
boredom due to a lull in New Thing
they dont mean it really
Anonymous
8/21/2025, 4:12:58 AM
No.106330981
>>106330957
the people involved in local imagegen are mostly poors and thirdies that gen 1girl hentai out of their shacks and they will continue to use sdxl for that
Anonymous
8/21/2025, 4:15:31 AM
No.106330999
>>106331011
>>106330917
>>106330937
with TREAD and EQ-VAE it would actually be cheaper than any of the major SDXL finetunes we've had.
Anonymous
8/21/2025, 4:16:01 AM
No.106331003
>>106330968
vramlets get pissed off when someone wants to build something that needs too many screws because they won't be able to tighten them all by hand with their flabby little arms
Anonymous
8/21/2025, 4:16:22 AM
No.106331006
>>106331019
>>106331030
>>106330916
>>106330939
are there any non sdxl checkpoints that can do photorealistic shemales?
Anonymous
8/21/2025, 4:17:01 AM
No.106331011
>>106330999
as long as those are tested, proven to work well, and widely adopted. hopefully that all happens
Anonymous
8/21/2025, 4:17:30 AM
No.106331016
>>106331035
>>106331052
>>106330972
what does that have to do with booru style tags prompts being unable to describe composition, or really it's inability to do anything other than list features that appear in an image? you can't even describe left or right side of an image with booru tags, or associate tags with one another. the system is extremely limited in doing anything other than tell you that something shows up in an image. which for compositionally simple images like 1girl images is perfectly fine but extremely limited in all other cases, and no, being able to introduce other processes in the workflow to make up for this prompt inability does not mean that there aren't better ways of handling this
Anonymous
8/21/2025, 4:18:03 AM
No.106331019
>>106331006
chroma or qwen+loras
Anonymous
8/21/2025, 4:18:16 AM
No.106331021
>>106331267
>>106330194
unironically soul vs souless. did they train on 1.5 outputs?
>Negative:
>`ai generated image
fwiw the tag is "ai-generated" (dont forget midjourney, stable diffusion, and many other models are also tags) but for some reason i dont think that would really help
Anonymous
8/21/2025, 4:19:38 AM
No.106331030
>>106331081
>>106330939
This is true but only because everyone outside this thread has a 2gb vram gpu
>>106330957
The "people who care" number around 50,000 and most of them have probably opened the OP wan rentry multiple times kek
>>106331006
There are no sdxl checkpoints that can do photorealistic shemales either unless you're being disingenuous with your definition of "photorealistic" and you mean "obviously fake and not even slightly photorealistic at all but that's what the attempt is"
elf-hugger
8/21/2025, 4:19:59 AM
No.106331033
would you run a suite of diagnostic tests on an archive that was recovered from an automated organics nutrient biosphere parked at Lagrange Point 2?
Anonymous
8/21/2025, 4:20:14 AM
No.106331035
>>106331048
>>106331076
>>106330871
>>106331016
anon really outing himself as a promptlet this hard huh
Anonymous
8/21/2025, 4:21:29 AM
No.106331043
>>106331060
>ldg is the last place on the internet you can talk about local imagegen that isn't an advertising vector for spambots to shill paid workflows and closed models
grim
Anonymous
8/21/2025, 4:22:31 AM
No.106331048
>>106331035
Iβm not that guy but acting like booru tags donβt have any limitations and βyouβre just doing it wrongβ is just as retarded.
Anonymous
8/21/2025, 4:22:35 AM
No.106331052
>>106331076
>>106331016
i thought you were asking how to do it. if you have a specific vision in mind its better to draw a base image to img2img, if you know basic perspective you can create a room and if you have genned things you want in it you can shop them in and img2img. img2img to get basic shapes and the img2img details. its a lot like an actual art process
Anonymous
8/21/2025, 4:23:14 AM
No.106331058
Anonymous
8/21/2025, 4:23:28 AM
No.106331059
>draw a base image
if i wanted to pick up a pen i wouldn't be using ai
Anonymous
8/21/2025, 4:23:30 AM
No.106331060
>>106331068
>>106331075
>>106331043
But it is full of schizos.
Anonymous
8/21/2025, 4:24:29 AM
No.106331068
>>106331060
as long as the schizos aren't trying to sell me something that's fine
Anonymous
8/21/2025, 4:25:30 AM
No.106331075
>>106331060
Nuh uh, the voice in my head told me so. we cool
Anonymous
8/21/2025, 4:25:35 AM
No.106331076
>>106331035
the booru tag system was created to index images, not assist in their creation
even as a method of search it's a limited system because you cannot define tag associations
>>106331052
sure, or you could have a method of describing an image that lets you do something more useful than say these are the things that appear in the image
Anonymous
8/21/2025, 4:26:22 AM
No.106331081
>>106331097
>>106331030
>There are no sdxl checkpoints that can do photorealistic shemales either
this is a shemale sdxl checkpoint for the base image
>>106328950
Anonymous
8/21/2025, 4:27:22 AM
No.106331093
Anonymous
8/21/2025, 4:27:37 AM
No.106331097
>>106331207
>>106331081
Where is the photorealistic image? How indian are you??
Anonymous
8/21/2025, 4:41:39 AM
No.106331207
>>106331097
now your losing all credibility. just admit you never learned how to use sdxl properly. you have to prompt properly and inpaint, its not going to hold your hand like newer checkpoints. even 1.5 can make photo real images
Anonymous
8/21/2025, 4:54:01 AM
No.106331267
>>106331021
i mean you're entitled to your opinion I guess lmao. The guy I was responding to specifically referred to the base Lumina aesthetic as "2006 deviantart" and I do agree that was the basically case if you didn't use artist tags at all.
Anonymous
8/21/2025, 4:56:33 AM
No.106331278
>>106330151
>>106330159
Be a director or VP with a large IT budget in your area.
Anonymous
8/21/2025, 5:21:48 AM
No.106331476
Differences between forge/reforge? The comparison link in the guide is dead. Looks like reforge is more active, but the dev is also swapping to a new branch for main development so idk which to pick.