← Home ← Back to /g/

Thread 105664855

316 posts 176 images /g/
Anonymous No.105664855 >>105664898 >>105665874 >>105666859
/ldg/ - Local Diffusion General
Wishful Thinking Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105662481

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>Chroma
Training: https://rentry.org/mvu52t46

>WanX (video)
https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Misc
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Archive: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
Local Model Meta: https://rentry.org/localmodelsmeta

>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105664861 >>105665074
Blessed thread of frenship
Anonymous No.105664869
https://www.youtube.com/shorts/cHhvEiYgkqA
Anonymous No.105664880 >>105664884 >>105666185
If sdxl models are limited to 75 tokens then how do you train with more than 75 tokens?

It has to be possible, I've seen it
Anonymous No.105664884 >>105664943
>>105664880
>needs another 50 replies of technical support
don't help him
I can't believe it took you more than an hour to figure this out.
Anonymous No.105664889 >>105664922 >>105665097
What the fuck is happening
Anonymous No.105664898
>>105662959
>>105664855 (OP)
me and my wife
Anonymous No.105664917 >>105665074
Blessed thread of frenship
Anonymous No.105664922
>>105664889
Art
Anonymous No.105664929
Anonymous No.105664943 >>105664956 >>105665221
>>105664884
No, fuck off, 90% of the threads is AI slop and navigating through forums and nowhere places because this informations isn't widely available is a pain in the ass.

Sdxl models only admit 75 tokens at training for some reason (that I don't know about) but everyone and their mother is training on illustrious, so are they all tagging less than 75 tokens? Those are no tokens at all. If they add chunks of 75 tokens, how does that work exactly?
Anonymous No.105664947 >>105664961 >>105665892
let's go brandon
Anonymous No.105664956 >>105664966
>>105664943
I can't wait to see the 200 token tag cloud you think is crucial for training.
Anonymous No.105664961 >>105664977
>>105664947
kek, who's this guy?
Anonymous No.105664966 >>105664978
>>105664956
It is crucial, maybe if you learned to tag, 90% of gens in this piece of shit forum wouldn't be complete garbage
Anonymous No.105664977 >>105664985
>>105664961
>he doesn't know
bro that's St. Tarrant
Anonymous No.105664978 >>105665010
>>105664966
I'm just going to let this roadblock you and let you seethe.
Anonymous No.105664985
>>105664977
oh ok, thanks for the info
Anonymous No.105665003 >>105665015 >>105665025 >>105665038 >>105666143
Anonymous No.105665010 >>105665040
>>105664978
You are a retard, I'm reading their GitHub and they say they simply split the 225 tokens into 75 and then concatenate the results.

I prefer that rather than using less than 75.
What I'm not sure is if Lora easy trainer scripts uses that and automatically take all tokens or if it discards the extra tokens, it is not really SD scripts, it's another repository so I dunno.

If anyone can confirm
Anonymous No.105665015
>>105665003
sexo
Anonymous No.105665025
>>105665003
So that's the amazing anatomy people have been telling me about on Chroma, really impressive.
Anonymous No.105665028
Do you interpolate your videos? If you do, what do you use? I've tried RIFE, FILM and GIMM and I think I like FILM the most, especially when there's slower motion, but it can freak out with fast motion. RIFE is pretty meh and GIMM is just slow as fuck, and still seems less accurate than FILM
Anonymous No.105665038 >>105665060 >>105665120
>>105665003
>SD3 tier anatomy level
wtf happened to this model? Is it getting worse and worse through epochs now?
Anonymous No.105665040
>>105665010
>Lora easy trainer scripts
they both use the same backend you stupid retard. lora easy scripts is a frontend for sdscripts. just ignore the warning and click train
Anonymous No.105665054 >>105665058 >>105665075
Took a few tries to make it stop giving the flamingo arms.
Anonymous No.105665058 >>105665068
>>105665054
>4o piss filter
Anonymous No.105665060 >>105665478
>>105665038
>wtf happened to this model? Is it getting worse and worse through epochs now?
what do you mean? just feed me more furry datasets please
Anonymous No.105665061 >>105665078
>>105665027
>settings that in my experience produce basically perfect single-subject results on every single UNET-based model ever released (including Kolors).
how 2 use doras in comfy?
Anonymous No.105665067
i generate for she
Anonymous No.105665068
>>105665058
Looks good on flamingos. That's all that matters.
Anonymous No.105665074
>>105664861
>>105664917
blessed
Anonymous No.105665075
>>105665054
>glove on leg
why is this so kino
Anonymous No.105665078
>>105665061
you just load them like a normal lora
Anonymous No.105665080
I actually got it to do the sketch thing up prompt. A shame the final result is trashy (3 legs, and what are those wings?).
Anonymous No.105665097
>>105664889
sometimes it's clearly trying to tell us something
Anonymous No.105665120
>>105665038
I truly hope Astrakek did not interfere with the datasets.
Anonymous No.105665150
Anonymous No.105665206 >>105665249 >>105665478
It is me, The Thing, from John Carpenters the The Thing.
Anonymous No.105665211 >>105665218 >>105665243 >>105665276 >>105665301 >>105665330 >>105665435 >>105665619 >>105666664
https://www.reddit.com/r/StableDiffusion/comments/1lh62qb/who_s_ready_for_dev/
>A comfy employee hyping up kontext dev
this is it, the release will arrive soonTM
Anonymous No.105665218
>>105665211
this, just give us the model already
Anonymous No.105665221
>>105664943
You can ask chatgpt most questions related to lora training. It's free and you don't need an account.
Anonymous No.105665228
Anonymous No.105665243
>>105665211
Thank god someone put a muzzle on cumfart and let someone neurotypical handle the shilling
Anonymous No.105665249 >>105665255
>>105665206
why not just use pony again?
Anonymous No.105665255
>>105665249
because Chroma has the best skin texture of them all
Anonymous No.105665276 >>105665285
>>105665211
>deleted
I don't think anyone actually likes the shill fag that actually runs the company
Anonymous No.105665285 >>105665497
>>105665276
>the shill fag that actually runs the company
Comfy?
Anonymous No.105665301
>>105665211
Anonymous No.105665304
https://xcancel.com/GreenFrogLabs/status/1936480136414781863#m
kek
Anonymous No.105665330
>>105665211
>Sorry, this post was deleted by the person who originally posted it.
bruh
Anonymous No.105665376 >>105665378
check em
Anonymous No.105665378
>>105665376
I think I've already seen that one
Anonymous No.105665435
>>105665211
Poor guy...
Anonymous No.105665466 >>105665471 >>105665511 >>105665512 >>105665517 >>105665522 >>105665617
what am I doing wrong?
Anonymous No.105665471
>>105665466
believing this copium snakeoil actually works
Anonymous No.105665474 >>105665480 >>105665544
Is this chart accurate?
Anonymous No.105665478
>>105665206
>>105665060
This is precisely why I always say to take more steps (at least 30+). No idea how anons even such poor results in the first place. It's a non-issue for me regardless of how dynamic I make the pose.
Anonymous No.105665480
>>105665474
wrong thread
Anonymous No.105665497 >>105665500 >>105665510
>>105665285
comfy isn't the ceo
Anonymous No.105665500 >>105665510
>>105665497
comfy doesn't have any executive role kek. he is a little bitch for not having any power over what happens
Anonymous No.105665510 >>105665791
>>105665497
>>105665500
this makes so much sense as to why everything went to shit after org
Anonymous No.105665511 >>105665883
>>105665466
Try the tags "princess carry, carrying person" in both regions, and increasing ControlNet strength. I'm about to pass out so I hope it works.
Anonymous No.105665512 >>105665883
>>105665466
I'd try to pass the clip only from a relevant lora to the region's text encode, that's a little rewiring. But since the model goes into a single node that might be useless.
Anonymous No.105665517 >>105665883
>>105665466
you forgot the mask properly (background doesnt have a mask), also the weights of the CN node are too low
Anonymous No.105665522 >>105665883
>>105665466
Last suggestion, Canny might be better-suited than Depth for this job.
Anonymous No.105665533 >>105665565 >>105665631
Anonymous No.105665544
>>105665474
it might be
Anonymous No.105665565
>>105665533
needs another foundational support block labeled "civitAI"
and right next to it a guy with a sledgehammer smashing it apart also labeled "civitAI"
Anonymous No.105665576
>still not sageattention update
I weep
Anonymous No.105665617 >>105665883 >>105666068
>>105665466
Anonymous No.105665619 >>105665637
>>105665211
> deleted
what did it say and who posted it?
Anonymous No.105665621
flying doro
Anonymous No.105665631
>>105665533
missing snake oil slipping blocks off
Anonymous No.105665637 >>105665644 >>105665675 >>105665693
>>105665619
>who is ready for dev
That slug tells you everything he said, some Comfy Org guy implying there's a new model coming. But unless it's a model that can be trained on 24 GB of VRAM no one will care just like every other model since Flux.
Anonymous No.105665644 >>105665655
>>105665637
that is literally the majority shareholder and CEO of comfyorg. a fucking sf grift faghot
Anonymous No.105665655 >>105665675
>>105665644
Yeah unless it's an image model as good and big as Wan or Hyvid (especially for censorship) no one is going to give a fuck.
Anonymous No.105665674
Anonymous No.105665675 >>105665691
>>105665637
>>105665655
i looked it up, apparently he was asking who is hyped for Flux Kontext Dev.

no idea what you tards are talking about.
Anonymous No.105665691 >>105665810
>>105665675
There is another model trained on BFL hardware. If it's Kontext it's even more retarded to be hyped because it'll be even more censored and likely has active countermeasures against naughty prompts.
Anonymous No.105665693 >>105665731
>>105665637
remember that comfy hyped SD35 for months
Anonymous No.105665731
>>105665693
worse, he hyped 3.0
Anonymous No.105665741 >>105665755
glad to see people are losing their patience with apple style hyping
Anonymous No.105665755 >>105665770 >>105665791
>>105665741
the amount of times the comfy owner hyped API nodes over model releases should say a lot how things are going to go. honestly our fault for making comfy the only option now
Anonymous No.105665770 >>105665782 >>105665788 >>105665799
>>105665755
are there any other options worth trying?
Anonymous No.105665782 >>105665831
>>105665770
this is by far the best alternative
https://github.com/FizzleDorf/AniStudio
Anonymous No.105665788 >>105665798
>>105665770
there is the easy as shit wan GUI in the OP. forge is dedicated, auto is ancient history. the actual last hope is anistudio but that is still surrounded by scaffolding
Anonymous No.105665791
>>105665755
See >>105665510 Everything was well until some point.
Anonymous No.105665798
>>105665788
>dedicated
*deprecated
Anonymous No.105665799 >>105665817 >>105666883
>>105665770
anistudio is the only promising alternative but it already is better than cumfart
Anonymous No.105665804 >>105665815
so what's actually wrong with comfyui? it works on my machine..
Anonymous No.105665810
>>105665691
Also you can do 1 frame Wan @ 1024x1024 with lightx2v and Loras in <5 seconds. All someone has to do is do a high resolution aesthetics lora to push it to 2K and beyond.
Anonymous No.105665815 >>105665918
>>105665804
it's webshit made by autistic devs, anistudio is the only non-webshit solid frontend
Anonymous No.105665817 >>105665829
>>105665799
listen, it can gen images and it's very fast to crack open and gen but it needs more work. this video game engine design is something that's actually exciting. if it's the same design philosophy as clay or QT, it can be the one and true GOAT
Anonymous No.105665829 >>105665859 >>105666883
>>105665817
i agree, don't know why anyone still uses cumfart
Anonymous No.105665831 >>105665847 >>105665850 >>105665854
>>105665782
LOL this is the retard who spent all of 2023 shilling how comfyUI was the future of anime. what caused him to meltdown this time?
Anonymous No.105665839 >>105665848
pink hair anime dog flies into the night sky
Anonymous No.105665847
>>105665831
This is what happens when you court and are friends with autistic, mentally ill, unstable people. They are fake and will say anything without principle.
Anonymous No.105665848
>>105665839
>I must go my people need me
Anonymous No.105665850
>>105665831
retards sit around and complain every day about hard times. a man actually rolls up his sleeve and gets to work. ani can be a faggot sometimes but I support what he's doing
Anonymous No.105665854 >>105665971
>>105665831
ani is allowed to change his opinion, at least this time he is right. comfy must die. it's basically already dead considering how advanced anistudio is right now
Anonymous No.105665855
Anonymous No.105665859 >>105665868 >>105665869
>>105665829
because wan and gorillion custom nodes for many optimizations etc..
anistudio looks very comfy but im used to comfy and ^^
Anonymous No.105665867 >>105665899
THE KING OF KONG COLLECTS
Anonymous No.105665868 >>105665887
>>105665859
he said he would add interop with python to make a comfy plugin. beats using faglectron
Anonymous No.105665869 >>105665877
>>105665859
it's snake oil that doesn't work. anistudio is the best frontend and it actually works
Anonymous No.105665874 >>105665891
>>105664855 (OP)
why isn't this in the OP?
https://github.com/FizzleDorf/AniStudio
Anonymous No.105665877
>>105665869
..but it does work
Anonymous No.105665883
>>105665511
I will try that tmr.
>>105665512
hmm
>>105665517
>you forgot the mask properly
How do you mask properly?
I thought I was pretty accurate.
>>105665522
Will try that tmr.
>>105665617
damn nice genn.
Anonymous No.105665887 >>105665892
>>105665868
https://github.com/FizzleDorf/AniStudio/pull/80/commits/bd201340f9992821fad2a8d80bba61da6f886089
it's actually already in dev which is probably why he's been genning wan vids
Anonymous No.105665891 >>105665901
>>105665874
It's unfinished and completely barebones.
Anonymous No.105665892
>>105665887
me too, i made this video in anistudio >>105664947
it works like a charm
Anonymous No.105665899
>>105665867
more money version:
Anonymous No.105665900
ah the very organic shilling arrived again
Anonymous No.105665901 >>105665908 >>105665978
>>105665891
what an ignorant comment, anistudio has lora support and comfy interop now. if you don't add it to the OP you actively want this hobby to die
Anonymous No.105665905
Anonymous No.105665908
>>105665901
i'll take my chances
Anonymous No.105665913 >>105665927
>comfy obsessed with courting the favor of API services
>trani obsessed with clearing his name after schizoanon raped him in the hot summer of 2023
grim state of local
Anonymous No.105665918 >>105665932
>>105665815
the UIs are all just some buttons and widgets to run python code, using the browser makes the most sense in terms of flexibility and extensibility
Anonymous No.105665919
>julien
Anonymous No.105665926
放屁声
Anonymous No.105665927
>>105665913
if ani wins, saas companies using comfy will die. that would be fucking hilarious
Anonymous No.105665932 >>105665959
>>105665918
so what's the difference between Photoshop and photopea? they are the same right? which would you like to use?
Anonymous No.105665959 >>105665984
>>105665932
that's a very disingenuous comparison because photoshop does all of its processing and image manipulation in C++
all these local AI UIs are just very thin layers that do nothing but execute "some_python_file.py" with a load of parameters, there's nothing of value actually happening outside of the python code other than displaying images and providing widgets to set parameters
Anonymous No.105665971 >>105665991 >>105666065
>>105665854
> advanced
> can't do 10% of what comfy can do
welcome to the blocklist retard
Anonymous No.105665978
>>105665901
please. get some help. remove yourself from ai. it's getting ridiculous.
Anonymous No.105665984
>>105665959
it's the same thing for anistudio. there isn't an excuse for shitting a bunch of packages that one or two C/C++ libs can take care of. it's immediate mode as well so things like opencv just update immediately instead of having to run a single execution every time. anistudio also has a better licence
Anonymous No.105665991
>>105665971
i find it pretty damn funny
Anonymous No.105666006
pathetic kek
Anonymous No.105666014
Anonymous No.105666022 >>105666038
i still fail to see why i should care about JulienStudio at all
Anonymous No.105666038
>>105666022
user experience since cumfart just doesn't care at all about the front end
Anonymous No.105666040 >>105666135
this might be too ambitious
Anonymous No.105666065 >>105666073
>>105665971
if we can just run comfy with it why would that matter?
Anonymous No.105666068 >>105666316
>>105665617
why does your genn looks like ultra 4k high quality while mine looks like shit?
Anonymous No.105666073 >>105666089
>>105666065
why would i care about a wrapper tho
Anonymous No.105666089 >>105666102
>>105666073
runs lighter than a browser and you can make vidya out of the components. also vr
Anonymous No.105666102 >>105666113
>>105666089
i unironically never had a problem with "the browser being heavy"
also show some examples for "make vidya out of components" and "vr" i am really curios what you mean anon
Anonymous No.105666113 >>105666132
>>105666102
https://github.com/ocornut/imgui/wiki/Useful-Extensions#virtual-reality-vr--reprojected-ui-plane
you are a complete imgui virgin and I feel sorry for you
Anonymous No.105666114
Anonymous No.105666122 >>105666129
someone was asking about a UI for Sana in the last thread that "supported safetensors", and like, I'm 100% sure that ComfyUI always supported it? Not sure what they were talking about
Anonymous No.105666125
anime girl takes off her shirt to reveal a red bikini.

not quite but getting closer! more of a wet tshirt.
Anonymous No.105666129 >>105666154
>>105666122
native comfy doesn't but I think there is a node somewhere
Anonymous No.105666132 >>105666145
>>105666113
no idea why you're so hostile, i'm just curious
so you linked an imgui extension, how do i run this in JulienStudio? please provide a full example anon
Anonymous No.105666135
>>105666040
with VACE you can just do canny or openpose on a video, should work
Anonymous No.105666143
>>105665003
I mean this is extremely typical of Flux in general in my experience, it takes a LOT of brute force to teach it complex photographic NSFW.
Anonymous No.105666145 >>105666160
>>105666132
well, you just link it like ani is doing with a bunch of other extensions. seems pretty shit simple to implement this stuff
Anonymous No.105666154
>>105666129
yeah it might not have been native but I'm 100% sure there was a robust node for it almost immediately after release, I remember using it
Anonymous No.105666160 >>105666172
>>105666145
so i'm not supposed to tun this locally at all? because you didn't give a single hint on how i could run this at all
promise you i will try it if you give a shiver of a hint of how to run it like you described, but so far you you dodge the question anon
Anonymous No.105666170
Anonymous No.105666172 >>105666189
>>105666160
I ain't writing shit until ani has things ship shape. I think you just don't know C make or c programing for that matter
Anonymous No.105666185 >>105666284
>>105664880
bro just fucking set your training software (i think you said it was something other than Kohya earlier) to a longer max length, if it doesn't have such a feature it's dogshit

yes *inference* on more than 75 involves concatenation and such but that is not relevant to training, don't overcomplicate the issue lmao
Anonymous No.105666189 >>105666196 >>105666200
>>105666172
so there is literally nothing to show from your side and nothing to run from my side
Concession accepted (again), anon.
Anonymous No.105666196
>>105666189
>ask how it's supposed to work with anistudio
>gave example how
>I no understand so you fag
ok
Anonymous No.105666200 >>105666207
>>105666189
stop feeding the shill, that broken program is literally targeted advertising for this general, evidenced by the image in the repo itself
no one uses that shit it's broken and supports nothing
Anonymous No.105666207 >>105666217 >>105666218
>>105666200
why are you replying to yourself?
Anonymous No.105666217 >>105666238 >>105666243
>>105666207
shut the fuck up you dumb retard
Anonymous No.105666218
>>105666207
because anyone with a brain can see he doesn't know what he's talking about. c++ desktop app > some faggy slop webapp
Anonymous No.105666238 >>105666251
>>105666217
this means nothing in the age of smartphones and inspect element faggotry
Anonymous No.105666240 >>105666249
hypertits force 3d mode
Anonymous No.105666243
>>105666217
i'd honestly just add "ani", "ran", "concession", "c++" and all the other common words he uses to the filter. it really does unshit the thread nicely. out of all generals that guy is the most obnoxious straight up shizo shiller i've seen. at this point it's probably not even the actual dev and just a handful of random anons who find it funny and are continuing the joke.
Anonymous No.105666249
>>105666240
> double hip bone
even some sdxl models do this, guess it just can't be avoided
Anonymous No.105666251 >>105666269
>>105666238
yeah i really have nothing better to do
Anonymous No.105666269 >>105666312
>>105666251
considering you do this every time somebody mentions anistudio you really don't
Anonymous No.105666284 >>105666518
>>105666185
That's the first thing I did, but when it's training, console says the prompt is too long. Dunno if it refers to the sampling prompt or the training prompts.
Anonymous No.105666292
Anonymous No.105666304
Anonymous No.105666308 >>105666470
so we can conclude there is not a single reason to try out AniStudio, ok
Anonymous No.105666312 >>105666325
>>105666269
nigga i just came into this thread after you two started arguing and you were the one shilling broken jeetware shitting up this whole thread so i called you out
i won't grace you with a second more of my time or any more (you)s
Anonymous No.105666316
>>105666068
this is why
Anonymous No.105666325 >>105666366
>>105666312
they are both jeetware but one was around longer and the designated shitting street. regardless, I'm not that anon and you do have some sort of melty every time
Anonymous No.105666330
Anonymous No.105666337
i think julien is a homosexual pedo desu
Anonymous No.105666338
Anonymous No.105666349 >>105666398
getting the shirt fully off is hard, but this is better:
Anonymous No.105666366
>>105666325
>you do have some sort of melty every time
you probably won't believe me but it's the first time i've commented on any of this shit
nature of anonymous online discussion i guess, everyone calling each other schizos. still better than the alternative though otherwise this would just be reddit
Anonymous No.105666374
Anonymous No.105666398 >>105666435
>>105666349
i believe this is an extreme case of skill issue
t. genned gorillions of gens of taking off shirt without problems at 640x480
Anonymous No.105666407
480x640*
Anonymous No.105666424
Anonymous No.105666435
>>105666398
okay, now it worked
Anonymous No.105666437
Anonymous No.105666470 >>105666492
>>105666308
if anything it means people should be helping him out but you are just a lazy nigger that accomplished nothing in life but complain
Anonymous No.105666483
I think the comfy site is gathering telemetry
Anonymous No.105666492 >>105666500
>>105666470
why would i help a homosexual pedo anon?
Anonymous No.105666500 >>105666512
>>105666492
you seem to help yourself often
Anonymous No.105666510
Anonymous No.105666512 >>105666517
>>105666500
that somehow doesn't answer my question
why would i ever want to help a pedophile? it would ruin my karma
Anonymous No.105666517 >>105666523
>>105666512
you seem to ruin your karma regularly
Anonymous No.105666518 >>105666530 >>105666554 >>105666560
>>105666284
what software is this? if it explicitly has the option for token length extension it's either referring to the sample prompt (which would make sense as implementing concat for sample is sort of unnecessary) or it's just written by a dumbass
Anonymous No.105666522 >>105666537
lmao, I got a real chudjak
Anonymous No.105666523 >>105666534
>>105666517
i just think that julien is literal human garbage
Anonymous No.105666530
>>105666518
you could test this directly also by just giving a very very short sample prompt and seeing if it still shows that message in the console, I should have noted
Anonymous No.105666534 >>105666536
>>105666523
you seem to be human garbage
Anonymous No.105666536
>>105666534
why would you say something like this?
Anonymous No.105666537 >>105666541 >>105666543
>>105666522
it's good but slightly too Asiatic I think. Pretty sure the Chudjak illustration is meant to be moreso like "significantly nerdier Rowan Atkinson with glasses" moreso in appearance.
Anonymous No.105666541
>>105666537
woops i said "moreso" two times, i hate when i unintentionally do shit like that kek
Anonymous No.105666543 >>105666551
>>105666537
I didnt specify cartoon or anime man so wan thinks I want a real person.
Anonymous No.105666551
>>105666543
yeah fair, and like don't get me wrong, it makes sense that WAN would be slightly biased to making him look more Asian than not, given where it's from
Anonymous No.105666554 >>105666572
>>105666518
I'm using Lora easy training scripts
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
I was getting so confused because I selected 225 tokens and console started saying that the prompt is too long and needed to be truncated despite being lower than 225 tokens. Also my gens were looking like crap and anons here were saying that sdxl based models like illustrious can only work with 75 tokens.

If you have some general training presets for 40-50 images I would be glad if someone posts it, I just need something general that doesn't make my generations look like poopoo.

I'm using kohya and seems to be giving better results but I don't wanna be constantly switching.
Anonymous No.105666560 >>105666592
>>105666518
Console says:
>prompt was truncated. Try to shorten the prompt or increase max_embeddings_multiples.

And I have no idea on how to do that, I see no option for that in any of the trainers, only one for max tokens, which is already at 225.
Anonymous No.105666567 >>105666574 >>105666597 >>105666607
now I did cartoon man, and got a schizo wojak
Anonymous No.105666572
>>105666554
anyone talking about the token length limit in the context of normal person inference in Current Year is likely fucking with you on purpose for the sake of being a contrarian, it's really a nonissue in ComfyUI and all variations of A1111.

If you're trying actual Kohya now though I'd stick with that, it's a step up from Derrian's wrapper in basically all ways IMO
Anonymous No.105666574
>>105666567
trellis + facerig
Anonymous No.105666577
Anonymous No.105666592 >>105666741
>>105666560
is this Kohya or Easy Scripts? in regular Kohya at least `max_token_length` the training setting being at 225 or some other number is 100% for sure the only thing you need to care about. You'll want to test the Lora properly when it's done anyways so I wouldn't put all that much stock in the sample outputs.
Anonymous No.105666597
>>105666567
lmao this is oddly fitting IMO
Anonymous No.105666607 >>105666616
>>105666567
>Did nothing happen... or is it over?
Anonymous No.105666616
>>105666607
is this the birth of Schizochud?
Anonymous No.105666621 >>105666626
real women look like this
Anonymous No.105666625 >>105666632
a cartoon man with glasses takes his glasses off and smiles.
Anonymous No.105666626 >>105666656
>>105666621
>1104x1416
why are all your gens this weird-ass resolution though
Anonymous No.105666630 >>105666641 >>105666697
why are DORAs so unpopular? is it because QDORA doesnt exist? if theyre so good surely people would use them more, and i remember seeing "faster performance" in the paper too
Anonymous No.105666632
>>105666625
pivoting his head like the fucking Terminator or something lmao. Getting there though
Anonymous No.105666637 >>105666657 >>105666740 >>105667035
how do i stop the video from dimming on wan?
Anonymous No.105666641 >>105666656
>>105666630
IDK, people just forget to turn on the Dora setting? There's almost no difference or extra thought required in training a Dora so it really doesn't make sense not to do them over regular Loras
Anonymous No.105666647 >>105666658 >>105666678
lmao this is a good one

a cartoon man with glasses walks to a fridge and grabs a bottle of water.
Anonymous No.105666656 >>105666670
>>105666626
upscale

>>105666641
idk man it gives absolutely terrible results when I tried
Anonymous No.105666657
>>105666637
just undim it with some video editor.. and stop using such a weird resolution holy shit man
atleast use 720xsomethgin or 480xsomethignnb
Anonymous No.105666658
>>105666647
honestly it deciding to use that pivot to put him in place is pretty good given the starting material
Anonymous No.105666664
>>105665211
the flux kontext they will open source will be dogshit, Sesame 1b vs 8b scam all over again
Anonymous No.105666670 >>105666691 >>105666705
>>105666656
what factor were you using? I get better results from Dim 64 / Alpha 32 / Factor 5 Dora than I do Dim 32 / Alpha 32 Lora, with all other settings the same as ever, and the Dora is also smaller in terms of file size, for example.
Anonymous No.105666678
>>105666647
last chud for now, gonna try other stuff too
Anonymous No.105666691
>>105666670
>what factor were you using?
can't remember. I just couldn't make it work way back.
Anonymous No.105666697
>>105666630
Doras increase training time and can result in worse results than a standard lora with convolutional layers. Seems like snake oil to me like most other advanced lora types.
Anonymous No.105666705 >>105666722
>>105666670
Doras train additional modules on the lora. They shouldn't be smaller unless you're also doing something else.
Anonymous No.105666717 >>105666721 >>105666871 >>105666961 >>105667030
I maintain that the local community is utterly retarded for putting no effort into Kolors, it really did / does train like "SDXL with blowjobs and hookers" due to the significantly better prompt adherence. Does real person loras better than basically any SDXL model I can think of too and learns NSFW extremely easily.

And no, you don't have to "prompt it in Chinese", whoever the one guy who claimed that was is a retard. Nor is it necessary to train any text encoders, you just train a UNET-only Lora basically the same way you would an XL one, but captioned as though you were doing a Flux one (JoyCaption is a safe bet, as is jailbroken Gemini 2.5), and it's basically gold everytime.

Also there was nothing wrong with the Kolors license if you actually speak and read English natively, I'm 2000% sure that all of the people who thought there was something off with it were just ESL.
Anonymous No.105666721 >>105666742
>>105666717
whats kolors again? wasnt it based on sdxl?
Anonymous No.105666722
>>105666705
it depends directly on the Dora-specific factor setting relative to Dim / Alpha. Everything I said was in "Kohya scaling" too to be clear, which can be super different from how stuff like e.g. AI Toolkit does it.
Anonymous No.105666737 >>105666765 >>105666805
Anonymous No.105666740
>>105666637
does anyone have a real suggestion?
Anonymous No.105666741 >>105666798
>>105666592
It was on easy scripts, but it's giving me the same message now on kohya. It seems the prompt example sampler is too long but it shouldn't be even a thing, max tokens is 225.

Gens are better in kohya as and the training seems to go better. But I don't know what the fuck im doing, I'm just blind guessing and snake oiling.
50 images. 10 epochs, 4000 steps, learning rate 0.0003. I don't know where should I aim the values. I always get confused with epochs, steps and max steps relative to batch and n of repeats. Like, there are 2 kinds of steps...
Anonymous No.105666742
>>105666721
it's a complete ground-up-trained base model that uses the same architecture as SDXL except attached to ChatGLM3-8B as a text encoder. So the advantages were / are basically twofold, it was just a MUCH nicer looking model than SDXL in terms of the baseline dataset and also it had the benefit of properly supporting natural lanuage captioning / prompting.
Anonymous No.105666749 >>105666869
Anonymous No.105666765 >>105666775 >>105666783 >>105666804 >>105666805 >>105667082
>>105666737
fellow chinaman enjoyer
Anonymous No.105666775
>>105666765
>chinaman
Anonymous No.105666783
>>105666765
>shoulder to hips ratio
>bulge zone cropped
yep, that's a man
Anonymous No.105666790 >>105666804
Anonymous No.105666794 >>105666822 >>105666851
Contortionist prompt. First tries. This is not half bad.
Anonymous No.105666798
>>105666741
honestly I really think you're putting too much emphasis on the samples, just let the shit finish and test it properly I'd say
Anonymous No.105666804
>>105666765
>>105666790
Based
Anonymous No.105666805
>>105666737
>>105666765
>mfw asianposterchromaGODS won
Anonymous No.105666822 >>105666868
>>105666794
people would nitpick this image into oblivion (which you could in many ways) if the community had "decided" the model was "bad" already though
Anonymous No.105666848
Anonymous No.105666851
>>105666794
Is it some sm pose?
Anonymous No.105666859
>>105664855 (OP)
Ban google from offering internships
Anonymous No.105666868 >>105666879
>>105666822
Let them anon
Anonymous No.105666869
>>105666749
dumbass lol
Anonymous No.105666871 >>105666895 >>105666961
>>105666717
another example of lora inference just on the base model
Anonymous No.105666879 >>105666902 >>105666911
>>105666868
wtf are you doing to these images to trick Hive to this extent saar lmao
Anonymous No.105666883
>>105665799
>>105665829
I see we are using the prophetic perfect tense to shill now.
Anonymous No.105666895 >>105666908 >>105666961
>>105666871
last one. in conclusion everyone is retarded, thanks for attending my Ted Talk
Anonymous No.105666902 >>105666912 >>105666923
>>105666879
"hive" or whatever has yet to even be trained to detect chroma. its a community tune created by a furry that's not even "officially" complete yet kek
Anonymous No.105666908 >>105666934
>>105666895
ok but i have a 12gb vram gpu
thanks for sharing, maybe someone will pick up the model and make a coomtune for ME
Anonymous No.105666911
>>105666879
That's just the power of Chroma in action.
Anonymous No.105666912
>>105666902
it generally detects Chroma images as being 99 to 100 percent Flux, as you'd expect. Yours are the first exception I've ever see n
Anonymous No.105666923 >>105666953
>>105666902
The detector would be faulty anyways even if they train on those images. I can just add some noise or a filter and then the detection is gone.
Anonymous No.105666934 >>105666948
>>105666908
????? why would that matter? it's a 4.8 GB model file:
https://civitai.com/models/566526/kolors
Plus a 6.78 GB single-file text encode if you use Kijai's 8bit (very similar output to the 16bit in my experience):
https://huggingface.co/Kijai/ChatGLM3-safetensors/tree/main

And that's it. It's not clear why you think Kolors is / was particularly hard to run. All the pics I just posted I genned on a GTX 1660 Ti (6GB VRAM) + 16GB system RAM.
Anonymous No.105666935 >>105666940
aw man i am so disappointed. i have this beautiful girl dancing, but there are weird artifacts on her penis. ai is like 957% of the way there, but the last 3% is so important. I cant make anything that has a lot of movement without it getting fucked up even if i turn off all optimizations
Anonymous No.105666940 >>105666974
>>105666935
>>>/gif/vdg has many futa posters they can help you
Anonymous No.105666948 >>105666972
>>105666934
i meant finetuning, but i guess i could finetune a 4.8gb model on a 12gb card hmmm especially since you said unet only can work
very nice
Anonymous No.105666953 >>105666971
>>105666923
no, that doesn't really work at all, it'll go down to like 90 from 100% at best. TLDR gymnast anon definitely did something specific and intentional lol
Anonymous No.105666961 >>105667016
>>105666717
>>105666871
>>105666895
slopped skin desu
Anonymous No.105666971
>>105666953
>TLDR gymnast anon definitely did something specific and intentional lol

Nah, kek.
https://files.catbox.moe/c9mu9q.png
There's not even an edit to this image. They have yet to train on Chroma.
Anonymous No.105666972 >>105666987
>>105666948
ah I see what you mean. Yeah, like from a Lora standpoint it's not actually even possible to have a ChatGLM "clip component" of a Lora safetensors that would be loadable in ComfyUI or any other software AFAIK, even if that was necessary lol (which it's not at all, simply conditioning the UNET like I said has worked like a charm in all my Kolors ventures).
Anonymous No.105666974
>>105666940
i dont think its something anyone can fix, its just the limits of this model
Anonymous No.105666977 >>105667047
Any updates on the Mayli anon?
Anonymous No.105666978 >>105666998
My eyes burn from looking at the screen for too long they hurt really bad
Anonymous No.105666987 >>105667031
>>105666972
so you finetuned kolors on a 1660ti? can you post some parameters or configs?
t. never made a lora besides a crappy 7b llama tune in 2024
Anonymous No.105666998
>>105666978
Turn down screen brightness.
Anonymous No.105667016
>>105666961
I mean did I say that one of the advantages was a 16-channel VAE? No, I didn't, mor would anyone ostensibly satisfied with SDXL care about that. The point was that's lora inference directly ON the base model, not a lora trained on the base model but running on some significantly tweaked variant of the base model.
Anonymous No.105667030
>>105666717
>Does real person loras better than basically any SDXL model
>Kolors
I felt the same way about HunyuanDiT. Anyways, community has corrected itself with Chroma so I'm not too concerned about it.
Anonymous No.105667031 >>105667042
>>105666987
Huh? I said nothing about anything other than how I did the *inference* for those gens in particular because I thought you were under the impression Kolors was hard to *run*, not to *train*.
Anonymous No.105667035
>>105666637
that might be the effect of certain loras
Anonymous No.105667042 >>105667064
>>105667031
so you didnt make the madison beer lora?rr
Anonymous No.105667046 >>105667061
>>10566703
I experimented heavily with Hunyuan too, both it and Pixart were always very very very clearly worse from basically any perspective than Kolors in my opinion. Kolors was / is EXACTLY the "SDXL with Ella" (but much better than an actual SDXL with Ella would be just due to Kolors having a way better baseline dataset as I've mentioned) that a zillion people swore up and down they wanted desperately.
Anonymous No.105667047 >>105667171
>>105666977
He's dead
Anonymous No.105667050
Anonymous No.105667061
>>105667046
GT 640
Anonymous No.105667063 >>105667067
it's actually amazing how the light2x lora works for wan, usually "turbo loras" result in a big quality drop or artifacting, but this is a lot better than say...causvid or whatever.

does it work with VACE and other wan models?
Anonymous No.105667064 >>105667076
>>105667042
What? I did make the Lora, I trained it on a runpod setup though. I did not at any point "finetune" Kolors the overall model nor did anything I've said imply that at all lmao. Why is everyone in this fucking thread either aggressively ESL or just some weirdo contrarian doing it on purpose? Stop interpreting straightforward English comments in the strangest fucking ways imaginable
Anonymous No.105667067
>>105667063
also have a cute pauline, 88 seconds to gen.
Anonymous No.105667076 >>105667161
>>105667064
man i meant lora okay, accidentally used the word finetune instead, how do you call the process of making a lora
loratune?? train a lora? whatever, im kinda sleepy
but i see you made it on runpod, what gpu did you rent?
Anonymous No.105667082 >>105667097
>>105666765
tits too small
Anonymous No.105667092 >>105667098 >>105667111 >>105667121
>you set hundreds of gens to generate overnight
>you wake up to this
how do you respond without sounding mad?
Anonymous No.105667097 >>105667117 >>105667218
>>105667082
>>>/vp/napt
belongs on your updated neighbors list
byeee
Anonymous No.105667098 >>105667111
>>105667092
kek happened to me the first day when self forcing lora dropped, i just turned off my computer and continued sleeping
Anonymous No.105667111 >>105667116 >>105667136
>>105667092
>>105667098
>leaving your toaster pc and expensive gpu unattended at 100% usage for hours on end
i like having my house NOT on fire kek
Anonymous No.105667116 >>105667150
>>105667111
i powerlimited my 3060 to 100w :)
Anonymous No.105667117 >>105667137
>>105667097
>he doesnt generate himself kissing her
literally ngmi
Anonymous No.105667121
>>105667092
At least depending on the nodes you are using that concludes fast without rerunning anything.
Anonymous No.105667136 >>105667179 >>105667199
>>105667111
>what is tpd limiting, running fans on full blast and if one wants, undervolting
Anonymous No.105667137 >>105667153 >>105667206
>>105667117
if you look closely it falls apart
background is a bad dream
her tattoo is just scribbles :c
Anonymous No.105667142
Anonymous No.105667150
>>105667116
I put my 3090 to max power limit of 405 watts, get fucked loser
Anonymous No.105667153
>>105667137
assuming you used wan, its cuz you used shit resolution, and probably some shit optimization that fucks the quality away from 720p q8 wan
Anonymous No.105667161
>>105667076
ah my bad. It wasn't anything special, a while ago so don't remember exactly but I think it was consumer grade, just a 3090 or maybe 4090, for sure not a 5090
Anonymous No.105667171 >>105667178
>>105667047
the real "where did they go" is the anon who dropped an SD 1.5 model called "Retropixe" like a year ago (which I still have and is still the best one I ever tried) and then was never ever ever seen again
Anonymous No.105667178
>>105667171
woops this was supposed to say "SD 1.5 anime model", not just "SD 1.5 model"
Anonymous No.105667179 >>105667260
>>105667136
if i do all of that i can render wan videos longer than 5 seconds?
what the longest video you cooked that you actually liked?
Anonymous No.105667188 >>105667192
>>105662793
>360p and 121 frames is all I can eek out of my 3090
u wot
Anonymous No.105667192
>>105667188
maybe a large finger poking instead of an object?
Anonymous No.105667199 >>105667260
>>105667136
>3k rpm fan
jesus christ anon. just get a custom loop if you are that obsessed with temps, they are easy to setup these days.
Anonymous No.105667205
i'm concerned that there are a lot of vramlets in /ldg/.
Anonymous No.105667206 >>105667218
>>105667137
Use this image as input for 720p instead, I'll generate a vid for it too later
Anonymous No.105667218 >>105667262
>>105667206
>>105667097
STOP POSTING PICTURES OF MY WIFE
Anonymous No.105667260 >>105667265
>>105667179
doing all what i outlined is just so i increase efficiency to lower the electric bill and keep the temps low to keep the gpu longevity, it doesnt affect anything else, you can gen the same things regardless and there is no risk of your gpu burning down you house unless you have a meme 12VHPWR connector and you dont test it out by letting the gpu work for a few hours while watching the temps first
>>105667199
the fans arent loud, are easy to tune out, and you wont hear them if you use headphones anyway, a custom loop is an insane hassle, i wont ever fuck with having to fill water into my pc every X months, let alone drain it every time i want to move it around and prepare for the inevitable shitshow or replacing it all 5-7 years later, i place the cooler, blast it at what i dont mind hearing (100%) and it just works forever without thinking about it ever again, let alone thinking about a potential leak dripping onto a multi thousand dollar pc gpu psu etc
Anonymous No.105667262
>>105667218
you married a camgirl??? shieeet
>>105667241
Anonymous No.105667265 >>105667271
>>105667260
my temps never get above 70c that seems reasonable right?
maybe im just overthinking it
i overspent on my PSU when i built my rig for this reason too (future proofing\heat)
Anonymous No.105667271 >>105667295
>>105667265
depends on which sensor but sure, and its not really easily possible to cook a gpu anyway
Anonymous No.105667272 >>105667322 >>105667337 >>105668708
POV: You are a Japanese emperor
Anonymous No.105667279
>>105667276 >>105667276
fresh
>>105667276 >>105667276
Anonymous No.105667295
>>105667271
cpu is around 40c gets closer to 50c when cranked
my transparent display is complained about by reviews as being a "hot box" so i try to be extra careful

all renders micromanaged
Anonymous No.105667322
>>105667272
imagine the smell
Anonymous No.105667337
>>105667272
Skidmark banzai!
Anonymous No.105668708
>>105667272
ohayo ketsu !