← Home ← Back to /g/

Thread 105689724

314 posts 200 images /g/
Anonymous No.105689724 >>105690148 >>105691577 >>105694754
/ldg/ - Local Diffusion General
Rugpulled Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105685453

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105689794 >>105695439
Blessed thread of frenship
Anonymous No.105689823 >>105689842 >>105689846 >>105694418
I always get so thirsty for milk when I see booba in the collage. my god
Anonymous No.105689842
>>105689823
my god
Anonymous No.105689846 >>105689854
>>105689823
sounds like you have mommy issues
Anonymous No.105689852
>>105664855
I'm trying out WAN 2.1 VACE I2V with self-forcing LoRA and I just can't get it to make a character from reference move in accordance with control video while staying true to reference. I tried extracting both the pose and depth map and combining them into one embedding, but it just warps everything to look kind of like the control video with a reskin. Setting the strength lower makes it ignore the controls. Using just start or start and end frames doesn't help either with some motions - particularly those involving Z axis. It's like it can't portray anything that has to do with things changing relative positions when it comes to depth.
Anonymous No.105689854
>>105689846
I have a NEED MOMMY issue desu
Anonymous No.105689873
Anonymous No.105689887 >>105689897 >>105689933 >>105689977
>ditch reforge, finally learn comfy
>learned how to make my own workflows
>gen'd hundreds of images I want to upscale
>want to batch automate it
>would need an extension that reads the metadata for the prompts and reuses them for upscaling
>find SD Prompt Reader
>yay!
>it doesn't work on normal metadata
>it requires you gen with the special SD Prompt Saver node
WHY. WHY THE FUCK ARE YOU LIKE THIS. YOU SHOULDN'T NEED A SPECIAL FUCKING NODE TO JUST READ A POSITIVE AND NEGATIVE PROMPT, MOTHERFUCKER.
FUCK YOU.
Anonymous No.105689890 >>105690986
Anonymous No.105689897
>>105689887
it's comfy, you must do everything in a node otherwise why bother
Anonymous No.105689933 >>105689977
>>105689887
>SD Prompt Reader
I think it works on the default text encoder nodes without needing the metadata saved by the node that comes with it, but if you're using custom nodes like ImpactWildcard, yeah you're fucked. It probably wouldn't be hard to get an LLM to edit the extension for you and make it compatible with prompt reader, fyi
Anonymous No.105689977
>>105689887
yeah welcome to the pain. modular is the way to go tho. WLSH has a save image with data node too but for whatever fucking reason the maker decided to implement his own custom syntax for the filename which doesn't work for me.
>>105689933
you can just route the final output of your prompt stuff into the sd prompt saver node, that's what I do. it would be nice to have two fields tho, t5xxl/clip_l
Anonymous No.105689991 >>105690028
i installed nunchaku but i can't use it. sir where is the flux dit loader node.
https://github.com/mit-han-lab/ComfyUI-nunchaku/issues/301
Anonymous No.105690006 >>105690040 >>105690170
Anonymous No.105690028 >>105690090
>>105689991
check your log? had some issues with insightface not being installed properly. nuke the nunchaku folder, do a git pull of the dev branch and install insightface manually
Anonymous No.105690040 >>105690086 >>105690160
>>105690006
very nice! it was made normally or with self forcing?
Anonymous No.105690086 >>105690170
>>105690040
with self forcing. genning them normally would take ages at this resolution
Anonymous No.105690090
>>105690028
thanks, they show up now. had to install visual studio's build tools because insightface doesn't build without them and gets ignored when the manager pulls in dependencies.
Anonymous No.105690095 >>105690219 >>105690223
Are regularization images a help or just a meme?
Anonymous No.105690148
>>105689724 (OP)
>Rugpulled
what was the rugpull
Anonymous No.105690160 >>105690198
>>105690040
self forcing obviously. look how fucking slow motion it is.
Anonymous No.105690170
>>105690006
>>105690086
impressive jiggles, very nice.
Anonymous No.105690190
sdxl does backgrounds and landscapes far better than pony or illustrious, right?
Anonymous No.105690194 >>105690210
when genning futa on female, how do i specify who is the female?
illustrious btw
Anonymous No.105690198 >>105690252
>>105690160
>look how fucking slow motion it is.
I know, but there's some anons who have the patience to wait more than 40 minutes for a 5 sec video, desu I have a lot of respect to them I would NEVER
Anonymous No.105690210 >>105690278
>>105690194
concat the other character in a separate prompt chunk. BREAK keyword in forge
Anonymous No.105690219 >>105690306
>>105690095
One use is to prevent training from taking over existing concepts, like if you train a specific female as 'foo woman', there is a very high likelyhood that whenever you generate a woman, the way the specific woman looks will 'bleed' into the concept of woman and her features will show up in every 'woman'. If when you train this specific female as 'foo woman', and also have regularisation images of other women as just 'woman', then the bleed will be a lot less if not entirely non-existant.

A good starting point for regularisation is 1:1, as in the same number of training images as regularization images being trained every epoch, preferably the regularization images should come from a larger pool of images.
Anonymous No.105690223
>>105690095
it's a meme. nobody uses it
Anonymous No.105690252 >>105690310
>>105690198
I wouldn't wait 40 minutes for some disposable meme that'll be forgotten after being posted either. But for exclusive porn catered to me? I'm willing to wait an entire fucking day no problem. The stuff I'm making you'd never find on PornHub or whatever. 40 minutes is nothing for the absolute GOLD tier content I generate.
Anonymous No.105690278 >>105690315
>>105690210
doesnt work, i keep getting the character with a pussy when it should be futa... ugh.
Anonymous No.105690306 >>105690355
>>105690219
>also have regularisation images of other women as just 'woman'
do these reg images need to be in the same style as the character lora im training? if not, then why doesnt someone just upload a generic reg dataset for everyone to use?
Anonymous No.105690310 >>105693105
>>105690252
"the absolute gold tier content I create" u wot m8
Anonymous No.105690315
>>105690278
paint penor and inpaint, regional prompting or both then
Anonymous No.105690328
Anonymous No.105690332 >>105690363 >>105690379 >>105690429 >>105690486
is it true noobai base is better than the finetunes like wai and janku?
Anonymous No.105690355 >>105690391
>>105690306
It's much better if they're not, just the same concept, like if it's photography, have photos, if it's an artstyle, have art.

>then why doesnt someone just upload a generic reg dataset for everyone to use?
huggingface is probably full of these, that said it's not hard to just download a bunch of images from any *booru site and quickly make your own 'pool' of regularization.

To be clear, if you don't care about 'concept bleed', like you just want to train a lora of a hot chick you want to wank to, don't bother with regularization. Same if you train an artstyle and you don't care if it bleeds into all other artstyles the base model knows.
Anonymous No.105690356 >>105690601 >>105690766 >>105691089
new version of neta lumina dropped 16 minutes ago
https://civitai.com/models/1612109?modelVersionId=1938135
Anonymous No.105690363
>>105690332
Yes. A huge issue with finetunes is that they significantly dumb the model down, it loses a huge chunk of its creativity and available styles in exchange for minor improvement for backgrounds. Wai also sloppifies the absolute fuck of the outputs, all of them look like plastic abyssorangemix shit in higher resolution.
Anonymous No.105690379 >>105690392
>>105690332
It's fun to play with. I spent a little bit of time trying different prompt and settings with it. I still like miruku way more. But I've been gaslit into thinking it's a skill issue, so I'll try again another day.
Anonymous No.105690391 >>105690468
>>105690355
>huggingface is probably full of these, that said it's not hard to just download a bunch of images from any *booru site and quickly make your own 'pool' of regularization.
I imagine someone probably has a better selection of ideal reg images than me, which is why I asked. I will check though. So far I only see regularisation datasets for SD 1.5.

>To be clear, if you don't care about 'concept bleed',
I care about concept bleed. It's a sign of a poorly made lora. It becomes problematic when you want to do more complex scenes or use multiple loras.
Anonymous No.105690392 >>105690436
>>105690379
>miruku
>look at the gens
>plastic shiny sloppa
>close the page
Anonymous No.105690429 >>105690488
>>105690332
never ask this question. you'll never get a straight answer. the noobai shill will always say noob is good despite not having remotely the popularity and will then proceed to argue 'because everyone is too dumb to use it'.

try both and see for yourself
Anonymous No.105690436 >>105690465 >>105690488
>>105690392
Great gen. I guess this is noob? Pretty impressive. I feel like CivitAI galleries are always quite poor, even the model uploaders'.
Anonymous No.105690438
What additional shit in Comfy did the individual node progress bars? I was uninstalling some bloat and I might have removed something.
Anonymous No.105690443
Anonymous No.105690465 >>105690472
>>105690436
Yeah this is vpred. Although i'd argue both epsilon and vpred have their uses.
Anonymous No.105690468 >>105690482
>>105690391
>I care about concept bleed.
Then you should use regularization, it also helps the lora be more flexible when used together with other loras.

Sometimes you don't care though, or you actually want a style to bleed into everything, then don't use regularization.

Again I want to stress that you should have a pool of regularization images that is much larger than the number of regularization you train per epoch, the reason is that you don't want the regularization images to bleed either.

Since I typically train on a set to 30-200 training images, I like to have a 'pool' of at least 1000-2000 regularization images, then I set it so that I have 1:1 training/reg images per epoch, but the regularization will randomly pick from a large pool so that those change every epoch.
Anonymous No.105690472
>>105690465
Yeah, I'll say your gens make a good point.
Anonymous No.105690476 >>105690489
>Found SDXL regularisation dataset
>all realism

Has no one uploaded an anime SDXL dataset? I don't want to spend an hour cherry picking them only to find out it doesn't improve my lora.
Anonymous No.105690482 >>105690523
>>105690468
if you're doing anime can you just upload that to google drive or something please
Anonymous No.105690486
>>105690332
nice
Anonymous No.105690488 >>105690507 >>105690510 >>105690588
>>105690436
noob vpred can do amazing shit but you need to learn how to operate the thing.
>>105690429
its not good because its not popular. awesome argument.
Anonymous No.105690489
>>105690476
Not personally aware of any such thing, but if you do make one and it does work, please share it, I'd appreciate it.
Anonymous No.105690507 >>105690632 >>105690702
>>105690488
Not the guy you're replying to, but can you give some examples of things to do to get the most out of Noob? A couple of examples of prompts that worked particularly well with it?
Anonymous No.105690510
>>105690488
if it were much better than other models then people would be using said better model. You don't need noob and I'm getting sick of that shill.
Anonymous No.105690523 >>105690538
>>105690482
Sorry, no can do, but really just use gallery-dl: https://github.com/mikf/gallery-dl

Download images from practically anywhere, the *booru, porn image sites, basically anything

Then use images of the same concept with what you're training on, but preferably not the exact concept, like if you're training on some anime character, don't have that character in your regularization data, but have lots of other characters.
Anonymous No.105690538 >>105690576 >>105690616
>>105690523
>Sorry, no can do
Why not? If practice what you preach you should be willing to share the data so others can replicate the alleged success you've had.
Anonymous No.105690556 >>105690601 >>105690766
the first beta version of neta lumina dropped for those who are interested
>https://huggingface.co/neta-art/Neta-Lumina
Anonymous No.105690559 >>105690568 >>105690571 >>105690579 >>105691125
another beauty that is NOT noob shit.
Anonymous No.105690568 >>105690586 >>105690588
>>105690559
yeah i could tell it's WAI shit
Anonymous No.105690571 >>105690586
>>105690559
has that typical sdxl sepia on it so i wouldn't say beauty
Anonymous No.105690576
>>105690538
>Share information on how to fish
>Hey, now you need to provide me with bait and a fishing rod!
Fuck you
Anonymous No.105690579
>>105690559
Is her hair made of plastic?
Anonymous No.105690586 >>105690599 >>105690626 >>105690764
another beauty that is NOT noob shit.#2

>>105690568
>>105690571
>butthurt noob shills will just claim its bad despite every image im posting being the top rated/commented
LOL
Anonymous No.105690588
>>105690488
>its not good because its not popular. awesome argument.
the biggest troll argument, i loathe it. of course more people are going to download the model that lets you type two words in and get a slopped 1girl as opposed to the one where you actually have to give it more tags to get something coherent and nonslopped. and >>105690568 yes you can ALWAYS tell its a slopped model
Anonymous No.105690599 >>105690634
>>105690586
Come on, I can sympathize with your point but your CivitAI clout isn't really an own.
Anonymous No.105690601
>>105690356
>>105690556
we know
Anonymous No.105690605
>its good because its popular
Don't you have some arena stats to post to try and get anon hyped for the next focused grouped to death model that'll end up SaaS?
Anonymous No.105690616
>>105690538
not the same anon, but dude, try to understand that there's a reason you don't see datasets in the wild.
compute is relatively cheap, but the time and effort it takes to curate a really good dataset does not scale. it's a massive labor of love, and ones made by anons like us can be personally identifying. I'm not combing through 1000s of images to make sure exif and metadata are scrubbed, a photo of my car didn't get mixed in, etc.

you can use a vllm to try to auto-tag, and you can pull tags and metadata from certain places, but you'll still need to do a ton of work to make a really good base for training. expect to work on your first one for at least a few months if you have a full time job.
Anonymous No.105690626 >>105690652 >>105690661
>>105690586
>NOT noob shit.#2
WAI literally has noob in it genius. it's not a finetune it's a shitmix. that's why wai can understand e621 tags even though illustrious models are only trained on danbooru. go give it a try, this is the case with most illustrious """finetunes""""
Anonymous No.105690632 >>105690741 >>105690774 >>105690794 >>105690862
>>105690507
need to tell it exactly what you want to see and follow the correct syntax. here found some older shit, basegen
"1girl, nehelenia \(sailor moon\) , sailor moon, watercolor \(medium\), pc-98 \(style\), masterpiece, best quality"
neg
"worst quality, early, low quality, lowres, signature, username, logo, bad hands, mutated hands, mammal, anthro, furry, ambiguous form, feral, semi-anthro"
Steps: 28, Sampler: Euler Ancestral CFG++, Schedule type: SGM Uniform, CFG scale: 1.5
there is a stabilizer lora you can use at low strength to get started
Anonymous No.105690634
>>105690599
I'm obviously not going to post random highly rated AI images from Twitter and such because they almost never share any information of how they created them. They could not even be generated locally. Atleast on Civitai I know what resources they used.
Anonymous No.105690646 >>105690676
>using "top rated" gens from Civitai to prove his point
BWAHAHAHAHA good one
Anonymous No.105690652 >>105690675 >>105691096
>>105690626
>WAI literally has noob in it genius
Why do you keep saying WAI? Not a single image I've posted has been WAI. You do realize just because someone doesn't use NoobAI doesn't automatically mean they use WAI NSFW right?
Anonymous No.105690661
>>105690626
>it's not a finetune it's a shitmix
I don't think anon understands the difference unironically. I've encountered many people who, when told it was mixed with Noob or Illustrious is the base model they'll say "huh?".
Anonymous No.105690675 >>105691251
>>105690652
Tell anon which model it is and if it's not a shitmix you win (it's a shitmix tho we all know)
Anonymous No.105690676
>>105690646
retard, i've already said why I use civitai. yeah it's shit for metrics but it's practically the only platform where uploaders tell you what model they used.
Anonymous No.105690702 >>105690741
>>105690507
>but can you give some examples of things to do to get the most out of Noob
Learn artist tags. Find a wildcard with a list of booru artists and prompt 30-40 gens, pick the styles you like.
Learn medium, palette, lighting and style prompts.
In case of vpred models, add as many details to the composition as you can because vpred is more rigid and less prone to randomness. Epsilon is easier to prompt with and more random, so use it if you want to utilize smaller prompts effectively.
That's basically everything you need to start working with noob models. The rest is just experimenting with particular artist tags and how they interact with style prompts. Don't be afraid of complete schizoprompting, it leads to some fun results.
Anonymous No.105690741
>>105690702
I already do all that and had more success with my lucky shitmix. But maybe I need to experiment more instead of sticking to my favorite tags. I've only tried vpred. I'll definitely give epsilon a try.
>>105690632
Thank you for sharing.
Anonymous No.105690762
anyone got the true(chinese translated) manual for noobai?
i remember the one in civitai is trash
Anonymous No.105690764
>>105690586
who cares about sameshit gens.
Noobai is way superior and without loras.
Anonymous No.105690766 >>105691118
>>105690356
>>105690556
Is there any way to make this model quicker?
Anonymous No.105690774
>>105690632
I think you are missing some quality tags my felow.
Anonymous No.105690786 >>105690793
mornin'
Anonymous No.105690793 >>105690820
>>105690786
nips are showing anony
Anonymous No.105690794 >>105690862
>>105690632
>Steps: 28, Sampler: Euler Ancestral CFG++, Schedule type: SGM Uniform, CFG scale: 1.5
desu 50 steps beta scheduler maybe a bit lower CFG
Anonymous No.105690820
>>105690793
I told her to cover up and she did
Anonymous No.105690862
>>105690632
>>105690794
also regular euler ++
Anonymous No.105690911 >>105690935 >>105690966
>>105689442
>but still seemingly doesn't know the artist tags.
It's becoming or has become clear that it will never know specific artists or characters outside of what regular flux knows. What a fucking letdown.
Anonymous No.105690924 >>105691779 >>105691795
How can I make the krita inpaint stay as close to the selection border as possible? It's sprawling out way too much and fucking everything up.
Anonymous No.105690935 >>105690966 >>105691002
>>105690911
yep, all the hype I had about that model was because I was expecting it to get the artist tags at some point :(
Anonymous No.105690966 >>105691002
>>105690911
>>105690935
dont worry it'll be just like pony and well have a million loras and jeetmixes to choose from :)
Anonymous No.105690976 >>105690987 >>105691013
I've been following along with recent model developments in both NAI and the finetunes, and I just want to ask:

Has any research or work gone into making characters and backgrounds more consistent across separate img gens?
Are the only solutions training a lora, reference only controlnet, and inpainting like a manual painter to fix inconsistencies?
Something as simple as a comic with the same character across multiple panels and pages easily exposes even the "best" models as "slop".
The image sets I've seen that are consistent are all the most normie popular characters and lack distinct details so are consistent because there are no finer details to be inconsistent on.
Above mentioned current solutions that I know of all kinda fail when it comes to detailed characters too.

With the advent of image generation and multi-frame consistency, do we still not have something for image gen that takes in one reference image and consistently generates the same character in different poses or same background with different characters?
In theory, should be similar to img2vid.
Some examples of this inconsistency are like a guy in a suit, sure I can prompt for a blue tie every time, but upon closer inspection, one image will have a tie clip and the next won't. One image will have 2 buttons next will have 3 buttons. Small stuff like this that immediately shows it's generated and not drawn by a human who would take into consideration these simple details.

Is there a newer better base model or plugin or workflow or whatever that solves this?
Anonymous No.105690986 >>105691360
>>105689890
Anonymous No.105690987
>>105690976
so apparently you haven't been following a long because we had controlnet and loras since the nai era
Anonymous No.105690991 >>105691027 >>105691064 >>105691080 >>105691124 >>105691160 >>105691174 >>105691194 >>105691238 >>105691277
how do we feel about this
Anonymous No.105691002 >>105691075
>>105690935
I have datasets nearly done for loras, just need to remove all futa + other crap from latest anime booru batch

>>105690966
Make own loras, problem solved
Anonymous No.105691013 >>105691057
>>105690976
it just falls apart when trying to install between angles it's never seen so you get what you described, extra buttons, laces, invented details, extra fingers, etc. imagen is much easier but world models is what is necessary for consistency and even then it degrades into nonsense eventually
Anonymous No.105691027
>>105690991
>he's distilling his shit even more
omgod bruh, STAPPPPPPPP
Anonymous No.105691057 >>105691101
>>105691013
I've seen the chinks and 3d modelers like autodesk maya are onto something in that direction
they're doing 2d img to 3d model or 3d model from text prompt
might be nice to have something like that but latent and behind the scenes so I can put one img in as master reference and then get lots of subsequent images consistent in detail
Anonymous No.105691064 >>105691072
>>105690991
the increase in inference speed sounds very nice. side effects? how many peeps are in his discord anyways? I imagine it being quite the cesspool of enjoyers of uh hairy things.
Anonymous No.105691072
>>105691064
>how many peeps are in his discord anyways? I imagine it being quite the cesspool of enjoyers of uh hairy things.
it's filled with troons, unironically, furries and LGBT goes hand in hand after all
Anonymous No.105691075 >>105691083
>>105691002
id rather use a base model that has as much in it as possible desu
Anonymous No.105691080 >>105691089
>>105690991
>how do we feel about this
I gave up on that model, it peaked at v27/v29 and it won't be better than that, KREA SAVE US
Anonymous No.105691083
>>105691075
me too, but beggars can't be choosers
Anonymous No.105691089 >>105691104
>>105691080
>KREA SAVE US
the lumina 2 anime models >>105690356 are by far more promising than anything else right now
Anonymous No.105691096 >>105691119 >>105691236
>>105690652
>this is the case with most illustrious """finetunes""""

>You do realize just because someone doesn't use NoobAI doesn't automatically mean they use WAI NSFW right?
use what you want dude, i don't care. just don't go spouting bullshit about stuff you don't understand.
Anonymous No.105691101 >>105691165
>>105691057
that's what a world model is anon. a 3d representation of a scene in latents
Anonymous No.105691104 >>105691132
>>105691089
>anime
I'm more interested in realistic stuff, that's why I was hyped by chroma, it's the only model that's able to make realistic skin natively
Anonymous No.105691113 >>105691139 >>105691158
Are the VAE and CLIP already baked in netalumi? Well, with an almost 10gb model, I would guess they are already in
Anonymous No.105691118
>>105690766
i know you can use teacache with it but it changes the output quite a bit
Anonymous No.105691119
>>105691096
>spouting bullshit about stuff you don't understand.
How most anon operate if I'm being honest.
Anonymous No.105691124 >>105691142
>>105690991
Sounds like a decent plan, assuming it performs well with all concepts at the targeted CFG, 4 is what I use for Chroma 99% of the time, but that might just be something that works well for my output.

Being able to cut generation time by half is hard to ignore though, hope he can do this in a separate branch while continuing normal CFG training for a while to see how things pan out.
Anonymous No.105691125
>>105690559
Anonymous No.105691132 >>105691154
>>105691104
muh chroma skin. every model can do that. might need some serious tinkering but at the end of the day only the result matters.
Anonymous No.105691139 >>105691158
>>105691113
>Are the VAE and CLIP already baked in netalumi
nop. you need to get them separately
Anonymous No.105691142
>>105691124
>hope he can do this in a separate branch while continuing normal CFG training for a while to see how things pan out.
I doubt he'll do something like that, there' already 2 branchs, it means it'll end up with 4 branchs, I suspect he'll do like the distilled low step stuff and merge everything into one model
Anonymous No.105691154 >>105691163
>>105691132
>every model can do that.
lol
Anonymous No.105691158
>>105691113
>>105691139
Seems like you can pick?
Anonymous No.105691160 >>105691310
>>105690991
>how do we feel about this
I can't wait for this model to be finished so that we can stop talking about it, ngl.
Anonymous No.105691163
>>105691154
promptlet say what?
Anonymous No.105691165 >>105691193
>>105691101
indeed
do we have something like that for img gen? anyone working on something like that? I would love to hear about any research in that direction or even just striving for more consistency through other methods.

we need something to break the single image barrier and break into other media content formats like comics
some complaints I've seen is that even the best image gens are just good wallpapers, nothing more beyond that in complexity of content and story

with modern tools, I can barely shitpost out a 4-koma meme comic ala stonetoss

consistency is absolutely necessary to go beyond single image gacha character cards and phone screen wallpapers
Anonymous No.105691167
Anonymous No.105691174 >>105691182
>>105690991
why doesn't he just do that distillation stuff at the end extract a lora out of it or something? like dmd2 for sdxl
Anonymous No.105691182
>>105691174
>why doesn't he just do that distillation stuff at the end extract a lora out of it or something?
because he's a retard, he prefers his model to have fucked up hands and lower quality but the generation time is slightly lower, the chroma hater was right, this project fucking stinks
Anonymous No.105691193
>>105691165
cosmos is a world model but uh i dont think its very good right now i dunno
Anonymous No.105691194
>>105690991
Honestly it's unbelievable that his concern is speeding up the model as opposed to getting the model to learn as much as possible.
Anonymous No.105691235
Chroma hasn't stopped rendering photoreal skin. It hasn't gotten worse. I still don't get the retarded complaints on this regard.
Anonymous No.105691236 >>105691251
>>105691096
>assumed I used WAI
>doesn't use WAI
>durr stop saying bullshit

>use what you want dude
Already do that. Didn't need you to tell me that.

>i dont care
Then you would not have responded, dumb fuck.

>just don't go spouting bullshit about stuff you don't understand.
Anon, you thought I was using WAI. You're stupid as hell. Whatever 'understanding' you think you have is pure dunning–kruger.

It's a shame this space is filled with so many idiots.
Anonymous No.105691238
>>105690991
I don’t get this guy, his model hasn’t improved since the v20s (it’s actually gotten more slopped), and his reaction to that is to distill it even further and lower the quality even more?
Anonymous No.105691249
Anyone got the lumina_workflow.json from their huggingface and could upload it?
I'd appreciate it.
Anonymous No.105691251
>>105691236
so... >>105690675 ? a shitmix is a shitmix anonie hahahah
Anonymous No.105691277
>>105690991
>hEy Guyzzz, dO yOu WanT cHrOmA tO nOt HaVe NeGaTivE pRomPting AnYmOrE??
jesus, if I wanted to use a model on cfg 1 I would use flux dev
Anonymous No.105691289
sdxl>flux nunchaku upscale, 8+35s, gonna stick to that for now.
Anonymous No.105691310 >>105691321
>>105691160
Then we will talk about the best loras for Chroma, or the best anime/porn finetune of Chroma

Hahaha
Anonymous No.105691321 >>105691352
>>105691310
>the best anime/porn finetune of Chroma
I really wonder what version people will use to further train the model? I can see some people having preferences on one version other the other, it's gonna be such a shit show kek
Anonymous No.105691324
Anonymous No.105691352 >>105691365
>>105691321
I think it will be the final version, I can only assume that all things considered, it will be the best overall, which is what you will want as a base for further finetuning.

Ponyv7 is never coming out, perhaps he will use Chroma as a base for Ponyv8, he is paying for the Chroma training after all.
Anonymous No.105691360
>>105690986
top kek
Anonymous No.105691365 >>105691448
>>105691352
>perhaps he will use Chroma as a base for Ponyv8
yaayyy a finetune without artist tags let's goooo
Anonymous No.105691375 >>105691395 >>105691402 >>105691408 >>105691409 >>105691438
where are we in the chroma cope timeline
Anonymous No.105691395
>>105691375
No one is stopping you from generating plastic Flux chins anon, you're not a victim.
Anonymous No.105691400 >>105694580
These images were as recent as of V38
>>105655640
>>105666737
>>105666804
>>105666868

Anon even thought I was faking my images due to fooling an AI detector on that last one, and yet you guys still think the model is slopped? I have been genning images like this consistently from the start (though I started mostly with feet pic images), the model has not gotten worse at this. I would know. The only thing I have consistently genned with this model is the amateur photoreal look. When I first tried the model I too had fears that it was getting worse over time. But then I realized that it was a quirk of trying the same seeds and not adapting the images that I got, in other words there were small changes which cause a bit of instability on that same particular seed. The model is not finished yet, so we subconciously nitpick the images that look best to us. Many times, we get lucky and get good seeds first try, but that's just what it was, so when there's a new epoch, and the seed shifts again, all hell seems to break loose with anons that don't understand this.
Anonymous No.105691402
>>105691375
depression
Anonymous No.105691408
>>105691375
chroma? اغرب عن وجهي
Anonymous No.105691409
>>105691375
>where are we in the chroma cope timeline
it's over
Anonymous No.105691412 >>105691492
I wish I could bet money on the model figuring out or not figuring out hands
Anonymous No.105691438
>>105691375
I never believed in that project in the first place and I'm glad that I was right.
Anonymous No.105691448
>>105691365
well its not like the model its based on had them either KEKE
Anonymous No.105691461 >>105691489
Anonymous No.105691489 >>105691535
>>105691461
Looks like a really cheap sci-fi b-movie, I like it!
Anonymous No.105691492 >>105691526 >>105691555 >>105691622
>>105691412
Flux was never good with hands. Try to do a middle finger. Or "OK" sign, or any other hand sign that is not a peace sign. You shouldn't need a LoRA for this. Into the garbabe bin it goes.
Anonymous No.105691526 >>105691559 >>105691567 >>105691602
>>105691492
But chroma can only barely handle regular hands unless that changed recently
Anonymous No.105691535 >>105691562
>>105691489
retro sci-fi is great
Anonymous No.105691544 >>105691568
Are you fucking me? Do I really have to tardwrangle it this much?
Anonymous No.105691555
>>105691492
Because Flux was overtrained on a small number of hand gestures / poses, so it will do those very well, but if you want anything outside of those, it won't happen.
Anonymous No.105691559 >>105691621
>>105691526
>But chroma can only barely handle regular hands unless that changed recently
this took 7 tries and it's still wrong, I guess I need Dio lora :^)
Anonymous No.105691562 >>105691646
>>105691535
Did you use a specific artist or just something like 50s retro sci-fi illustration ?
Anonymous No.105691567 >>105691582
>>105691526
>But chroma can only barely handle regular hands unless that changed recently

Higher rate of bad seed (that can even be fixed) =/= can't handle hands.
Anonymous No.105691568
>>105691544
unironically the least amount of wrangling from any anime model thats not a default style mix
Anonymous No.105691577
>>105689724 (OP)
Can a RX 580 run SDXL models in decent speeds?
Anonymous No.105691582 >>105691640
>>105691567
>bad seed
nice folk wisdom, next youll tell me that we should only use even numbered seeds kek
Anonymous No.105691602 >>105691622 >>105691631 >>105691665
>>105691526
Skill issue. I get more hits than misses
>A photo of a woman wearing a silver one piece bathingsuit, hands up and open with palms out, smiling, looking at the camera
>Prompt executed in 22.95 seconds
Anonymous No.105691608 >>105691707
>he doesn't use prime number seeds
Anonymous No.105691613
At last, the illu killer is here
But when his training will be done, in 6 months or so
Anonymous No.105691618 >>105691817
please touch grass
Anonymous No.105691621 >>105691646
>>105691559
Prompt for only the pinky and index finger to be extended, the model understands those words.
Anonymous No.105691622 >>105691657
>>105691602
>>105691492
>Try to do a middle finger. Or "OK" sign, or any other hand sign that is not a peace sign
Anonymous No.105691631 >>105691657
>>105691602
damn, bitch got the chroma nails

i've been putting "claws" in negatives to try to combat
Anonymous No.105691640
>>105691582
Okay so you are a troll.
Anonymous No.105691646 >>105691718
>>105691562
in the beginning put
>vibrant detailed illustration in a retro-futuristic fantasy style depicting
in the end put
>The illustration carries a 1960s science fiction aesthetic with its bright colors detailed textures and playful yet eerie elements.

>>105691621
exactly what I did, just got unlucky/lucky
Anonymous No.105691657
>>105691622
Okay?
>A photo of a woman wearing a silver one piece bathingsuit, hands up and making an okay shape, smiling, looking at the camera
>Prompt executed in 23.44 seconds

>>105691631
Details are still rough until final
Anonymous No.105691665
>>105691602
>Skill issue. I get more hits than misses
Understatement. You can do more complex shit with Chroma and still get a pretty good accuracy with hands, yet there are still naysayers.
Anonymous No.105691674 >>105691705
the biggest disconnect is a number of anons started with flux or chroma instead of sd1.5 and have little to base their opinions on
Anonymous No.105691705 >>105691796
>>105691674
Been here since SD 1.5 days. Since NAI leak. Since those slop mixes. Since XL 0.9. Still vouch for Chroma. Because I know how far behind we were from Dalle 3. And because I know Chroma has closed so much of the gap.
Anonymous No.105691707
>>105691608
>he doesn't use palindromic primes seeds
Anonymous No.105691718
>>105691646
Thanks my man
Anonymous No.105691739
Anonymous No.105691762
sdxl doesn't know the ok gesture. ok.
Anonymous No.105691771 >>105691834
Anonymous No.105691779 >>105692002
>>105690924
i use IOPaint instead of inpainting to remove stuff desu
Anonymous No.105691786 >>105691808
Lumina seems promising but I don't think I'll do much with it with the way they expect prompts to be structured. It's nice that it easily gens at 1920x1080, however.
Anonymous No.105691795
>>105690924
in Settings > Diffusion reduce your Selection Feather percentage. it's the same as mask blur
Anonymous No.105691796
>>105691705
>Because I know how far behind we were from Dalle 3
Anons don't know how easy we have it now. I still remember getting mogged hard from those early Dalle threads and the endless cope on SD threads (it's impossible for a smaller model that runs on consumer hardware to be as good as Dalle - wrong), and also from Emad (just use a LoRA or controlnet bro, SD is a tool) but even if you use a LoRA, stacking the concepts would not give you coherence on par with Dalle. Good times.
Anonymous No.105691808
>>105691786
there's nothing promising about the image you posted.
Anonymous No.105691817 >>105692143
>>105691618
Did you accidentally set cfg to 10 instead of 1?
Anonymous No.105691834 >>105691919 >>105691991
>>105691771
Was there ever a crossover ?
Anonymous No.105691838 >>105692121
Yeah I'm thinking the Lumina tune will finally be a replacement for NAI v4.5. We now have a good photoreal model and another really good anime model. It feels good to be a local Chad.
Anonymous No.105691861 >>105691873 >>105691929
Am I missing something or is it funny that the model designed to generate pony cartoons is actually SOTA for realism
I know he did some weird training thing with multiple different models, one on realism one on cartoon etc, but still it seems like no one uses it for the latter
Anonymous No.105691873
>>105691861
chroma beats it but you's probably find it funny it's a furry model
Anonymous No.105691919
>>105691834
https://myanimelist.net/stacks/18962
Anonymous No.105691929
>>105691861
What the base is strongest at will shine. That's why a good base model is very important.
Anonymous No.105691991
>>105691834
Don't think so,the only crossover i remember is a Lum plushie when they go to the mall in the Maison Ikkoku anime (not in the manga)
Anonymous No.105692002 >>105692073 >>105692107
>>105691779
Is this gonna install another 30Gb of python shit?
Anonymous No.105692073
>>105692002
this is just what happens for every python "app" you want to use
Anonymous No.105692107
>>105692002
>another 30Gb
If you're lucky
Anonymous No.105692121 >>105692445 >>105693724
>>105691838
>replacement for NAI v4.5
Can it do two characters cosplaying as another character from the other's series like this raw gen?
https://files.catbox.moe/jbsvf1.png
Anonymous No.105692143
>>105691817
not by accident silly
Anonymous No.105692161
Anonymous No.105692180 >>105692193
Someone post some acestep. It's fun. It doesn't have to be what spreadsheet nerds jizz over.
Anonymous No.105692193 >>105692238 >>105692253 >>105692292
>>105692180
it's honestly nowhere near suno or audio
Anonymous No.105692238
>>105692193
The authors have balls though, they say they will eventually have the quality of current day Suno / Udio

They are currently training Ace-Step 1.5, will be interesting to see how much of an improvement it will be
Anonymous No.105692248 >>105692286
الله يساعدني
Anonymous No.105692249 >>105692263 >>105692350
Anonymous No.105692253
>>105692193
Audio isn't like images. If an image of a woman shows her having 6 fingers, it's jarring.

Audio is totally different. Completely different.
Anonymous No.105692263 >>105692740 >>105692872
>>105692249
door's too small. Call the repairman at once.
Anonymous No.105692284
How tf do I prompt drifting cars? These niggcars refuse to turn even if include Initial D into the prompt.
Anonymous No.105692286
>>105692248
multistep/res_2m
exponential/res_2m

these cover 90% of what you'll ever need
اذهب مع الله
Anonymous No.105692292 >>105692310
>>105692193
This is the Ace-Step roadmap:

>X Release training code
>X Release LoRA training code
>X Release RapMachine LoRA
>X Release evaluation performance and technical report
>Train and Release ACE-Step V1.5
>Release ControlNet training code
>Release Singing2Accompaniment ControlNet
So next up is ACE-Step v1.5 model, the current 1.0 is easily the best local music model to date.
Anonymous No.105692302 >>105693109 >>105693190
Discuss
Anonymous No.105692304 >>105692327 >>105692335 >>105692351
what do /we/ think about this?
Anonymous No.105692310 >>105692321 >>105692338
>>105692292
Audio is utterly unlike images. Audio can sound very crazy and still remain perfectly listenable.
Anonymous No.105692321 >>105692364
>>105692310
yes but the tinny resonance that it's notorious for is very grating and annoying
Anonymous No.105692326 >>105693633 >>105694429
I will never* stop genning artsy fartsy redheads.

*in the next few days
Anonymous No.105692327
>>105692304
what a shitty slop thumbnail
Anonymous No.105692331
Anonymous No.105692335 >>105692353
>>105692304
pony
Anonymous No.105692338
>>105692310
art can look very crazy and still appear perfectly aesthetic(abstract). audio 100%, utterly shares likeness to images.
Anonymous No.105692350 >>105692368
>>105692249
kinda want a blt rn desu
Anonymous No.105692351
>>105692304
>too lazy to fix raiden's finger, triple eyelids, & watermark
not a good look.
Anonymous No.105692353 >>105692362
>>105692335
nah its illustrious nigga
Anonymous No.105692362
>>105692353
its a joke nigga
Anonymous No.105692364
>>105692321
You've probably never heard of an electric guitar.
Anonymous No.105692368 >>105692386 >>105692413
>>105692350
lol that has a tail, how did you not notice?
Anonymous No.105692373
Anonymous No.105692386
>>105692368
wat?
Anonymous No.105692388 >>105692419 >>105692433 >>105694845
The longer I keep Comfy open, the worse the gens get. I left it to generate 70 videos with Wan, and towards the end they start looking like this, with a lot of noise. If I regenerate with the same seed after restarting, it looks perfectly fine.
Anonymous No.105692409 >>105692428
seriously man is there any way to define who is going to be doing what in sdxl*illustrious*
?
cus i am trying to make two girls have sex but the dominant one is always the one i want to be a sub
Anonymous No.105692413 >>105692496
>>105692368
>how did you not notice?
Because he is heterosexual
Anonymous No.105692419 >>105692433 >>105692479
>>105692388
known problem. I mentioned the same problem before. You need to add the clean Vram node to the workflow. No one has figured out why it happens yet.
Anonymous No.105692428 >>105692679
>>105692409
No. It's a problem as old as time. You need to use regional prompting and/or controlnet.
Anonymous No.105692433
>>105692388
>>105692419
it's a virus called WANker
Anonymous No.105692445 >>105692590
>>105692121
Share the prompt you want to test
Anonymous No.105692479
>>105692419
The GPU gets tired and stops putting in effort
Anonymous No.105692496
>>105692413
Wow, that's supposed to be against the LAW!
Anonymous No.105692590
>>105692445
Basically what's in the metadata of the image I posted. I'm not sure how to best prompt a model I've never touched. The basics would be something with these two characters in these cosplays interacting:
takamachi nanoha, mahou shoujo lyrical nanoha strikers, ponytail, frieren (cosplay), holding staff, kneeling, staff with red jewel, facing another, kneeling
fern (sousou no frieren), fate testarossa (lightning form) (cosplay), cosplay, cape, holding, bardiche (scythe form) (nanoha), standing split, holding staff, facing another
Anonymous No.105692679 >>105692800
>>105692428
any guides?
Anonymous No.105692740 >>105692877
>>105692263
Anonymous No.105692800
>>105692679
Try this.
https://rentry.org/comfyui_guide_1girl#controlnet-pose-transfer
https://rentry.org/comfyui_guide_1girl#2girls-and-regional-prompting
Anonymous No.105692872 >>105693392
>>105692263
Sure
Anonymous No.105692877
>>105692740
this contractor had second thoughts. at least he didn't drive over the lawn
Anonymous No.105692889
sorry for the stupid question, but.. let's say I want to do a tiled upscale in comfyui, via ult sd upscale or tiled diffusion, with a tile controlnet as a guidance. the 'apply controlnet' node has an image input. how does that work internally when the image is split up into tiles and processed in the ult sd upscale node (or via tiled diffusion). do I need some form of preprocessor? a/b testing does show that it 'just' works, still.
Anonymous No.105692968
Anonymous No.105693004 >>105693055
>still no spinning wheels
sigh
Anonymous No.105693055 >>105693194 >>105693735
>>105693004
vroom
Anonymous No.105693084
Anonymous No.105693105
>>105690310
everyone likes the smell of their own farts
Anonymous No.105693109
>>105692302
sexo
Anonymous No.105693132
Anonymous No.105693167
dope lighting on this model
Anonymous No.105693190
>>105692302
>not calling it NSFWan
>empty model cards

totally legit
Anonymous No.105693194 >>105693288
>>105693055
Guess SDXL can't into motion.
Anonymous No.105693231
Official OmniGen 2 support when
Anonymous No.105693273 >>105693296
Anonymous No.105693286 >>105693296
Anonymous No.105693288
>>105693194
I take it you tried spinning wheels and stuff like that? 'in motion' maybe. worst case you'd need to inpaint. with flux lol
Anonymous No.105693296
>>105693273
>>105693286
nice
Anonymous No.105693392
>>105692872
awww
Anonymous No.105693497 >>105693546
Anonymous No.105693546
>>105693497
aerodiscs don't count lol
Anonymous No.105693618
Anonymous No.105693619
Anonymous No.105693633
>>105692326
I really like Chroma, as long as you don't need hands in the image.
Anonymous No.105693655 >>105693679
>finished genning 20 images
>comfyui has now finished FETCHING the 'ComfyRegistry Data' for nodes that this version doesn't even use. awesome!
Anonymous No.105693671
>>105686291
is a very nice gen
>105686222
extremely sovl gen
>105686245
this one is slop
>105686601
very nice
>>105689151
>105688945
>105688287
>105688138
hngggg
Anonymous No.105693679
>>105693655
I really hate the webshitters that think this shit is helpful
Anonymous No.105693724 >>105693762
>>105692121
Yes.
Anonymous No.105693735
>>105693055
Anonymous No.105693751 >>105693826
Let's settle the Chroma debate:

https://poal.me/a2xdi4
https://poal.me/a2xdi4
https://poal.me/a2xdi4
https://poal.me/a2xdi4
Anonymous No.105693762
>>105693724
Fern isn't cosplaying as Fate though.
Anonymous No.105693826
>>105693751
sisters... we are losing... time to spin up the bots... chromakeks cant win...
Anonymous No.105693929
Anonymous No.105693953 >>105693983
What hardware do you need for making local videos in acceptable time?
Anonymous No.105693983 >>105693996
>>105693953
What is 'acceptable time' to you ?
Anonymous No.105693996 >>105694092 >>105694303 >>105694809
>>105693983
I don't know.

How long does creating a 5 sec vid at 144p/480p take on a 4090/4060?
Anonymous No.105694092 >>105694170
>>105693996
With the light lora on my 4070S and 48Gb ram it takes 3 minutes for 480p gens.
Anonymous No.105694170 >>105694234
>>105694092
>With the light lora on my 4070S and 48Gb ram it takes 3 minutes for 480p gens.

>the light lora
Is the quality and prompt adherence acceptable?
>48Gb ram
For offloading?

For perfect linear dependence, it would take 18s for 144p then.
480p: 854 x 480 = 409.920 Pixel
360p: 640 x 360 = 230.400 Pixel
240p: 426 x 240 = 102.240 Pixel
144p: 256 x 144 = 36.864 Pixel
Anonymous No.105694234 >>105694310
>>105694170
https://rentry.org/wan21kjguide#lightx2v-nag-huge-speed-increase
It's a tradeoff
Anonymous No.105694281
>>105686424
I am once again asking for guidance on this issue
Anonymous No.105694303 >>105694333
>>105693996
>4090/64GB RAM
>14B, 20-30 steps @ 480p
Self Attention: ~110s and "Default": ~220s. With 1.3B there's really not much difference in speed, so there's no need for SAG. I have one of the faster 4090s though (with a fire hazard of a 700W per cap - which I run much lower), so ymmv.
Anonymous No.105694310
>>105694234
>You get: much faster gen speed
>Viewers get: worse quality
Offloading externalities on the bottomfeeders, very based

Are video generation models hardwired on a specific resolution or can you go as low as you like?
Anonymous No.105694332
nigga read the rentry
Anonymous No.105694333 >>105694441
>>105694303
Thanks, sounds really usable.
Anonymous No.105694418
>>105689823
Me 2.
Anonymous No.105694426 >>105694428 >>105694461
Anonymous No.105694428 >>105694530
>>105694426
the saar reveals himself
Anonymous No.105694429 >>105694572
>>105692326
I met a doctor yesterday with beautiful natural red hair. She was pretty on the profile pic but then I remembered what kind of doctor she was, absolutely monotone, fish eye deadpan gaze, kept repeating "ok" in the exact same tone over and over she wasn't even fat either. Imagine going outside.
Anonymous No.105694441 >>105694497
>>105694333
Oh, I didn't recall times for 1.3B, sorry. But it's stupid fast with SAG and marginally slower without. At lower step counts (around 4-8) it'll spend more time setting up the inference than it does actually rendering/outputting the final video. SAG puts it on par with image generation. The motion gets crippled though.
Anonymous No.105694461 >>105694530
>>105694426
good evening may i redeem the catbox
Anonymous No.105694497
>>105694441
Good to hear. I've seen AI vids with durations of 1min+, so I guess there are ways to concatenate them without losing visual coherence.
Anonymous No.105694530 >>105694586
>>105694461
>>105694428
usually I share, but because you accused me of being indian, I won't catbox these.
Anonymous No.105694572 >>105694691
>>105694429
Met as in date? Sounds terrible. Better stay inside and knock out some gens, man.
Anonymous No.105694580
>>105691400
these are all terrible
Anonymous No.105694586 >>105694917
>>105694530
nobody wants poo on their nodes, shitskin
Anonymous No.105694691 >>105694955
>>105694572
sick upscale
Anonymous No.105694754 >>105694820 >>105694841 >>105694846 >>105694919 >>105695053
>>105689724 (OP)
So what are we waiting for exactly in terms of the tech improving? Better hardware, better models, better data? When a new model version comes out on civitai what are they updating?

And has it overall been more consistent in the recent past? Like less prompts with dozens of key words and fine tuning and more what you type is what you get?
Anonymous No.105694809
>>105693996
14B with the new speed lora is less than 1 minute for 81 frames at 640x480. A motion Lora will restore any slowed motion, at least for my own training tests.
Anonymous No.105694820
>>105694754
Well everyone thinks Chroma will be good I think the next game changer will be a proper Flex finetune which is feasible with a 5090 and doesn't have the inference speed problems of Chroma.
Anonymous No.105694841
>>105694754
better software that isn't jeetech
Anonymous No.105694845
>>105692388
Pytorch issue. Downgrade to 2.7.1 or 2.7.0.
Anonymous No.105694846
>>105694754
i dont know about tech but im waiting for anon to post kinosoul
Anonymous No.105694917 >>105694974
>>105694586
I didn't even prompt for an indian guy lmao.
Anonymous No.105694919
>>105694754
>better data
From where, at least for image gen? What hasn't been scraped?
Anonymous No.105694955 >>105694988 >>105695005
>>105694691
Please don't call me a redditor, I'm just having fun man.
Anonymous No.105694974
>>105694917
Even if you didn't prompt any race the moment you got a brown and didn't throw it away let alone you posting it online it was already redeemed and too late
Anonymous No.105694988 >>105695024
>>105694955
by no means was I trying to accuse you of being a redditor, I just happened to be genning anti reddit pics at the moment. you posted a good upscale.
Anonymous No.105695005 >>105695024
>>105694955
i 'ave to say those are some good gens
nta btw
Anonymous No.105695024
>>105694988
>>105695005
Appreciate it, Anons. I'll head to bed now and continue my 'weird-ass redhead portrait' psychosis tomorrow.
Cheers.
Anonymous No.105695047
https://arxiv.org/abs/2505.23325

These guys confirmed what I have long suspected, video models are secretly the best "image editing" models if trained for it

Are there any Wan loras that forces "motionless" videos that just converts image A input A into image B output?
Anonymous No.105695053
>>105694754
>in terms of the tech improving?
Most of the focus is on video now, which makes sense since as a medium it has a much wider range than images, although I'm personally more interested in images.

Wan is really shockingly good for local video, so good (and relatively uncensored) that Black Forest Lab (who made Flux) scrapped the video model they were about to release and likely began from scratch.

I'm hoping we'll see more strong image models be released, Chroma has the most promise out of those in the pipeline, but as always we won't know what it can really do until it has a lot of lora and perhaps further finetunes, just like with SDXL and Flux.
Anonymous No.105695068
>>105695065
>>105695065
>>105695065
>>105695065
>>105695065
Anonymous No.105695439 >>105695534
>>105689794
:c
Anonymous No.105695534
>>105695439
shhhhh