Thread 105774047

336 posts 96 images /g/
Anonymous No.105774047 [Report]
/ldg/ - Local Diffusion General
Pinnacle of God's Beauty Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105769424

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105774062 [Report]
Cream
Anonymous No.105774069 [Report]
>>105774058
BTFO
Pay up son
Anonymous No.105774082 [Report] >>105774091
we finna fight the entire thread again??
Anonymous No.105774087 [Report] >>105774120 >>105774178
all my gens look like shitty western cartoons now
Anonymous No.105774091 [Report] >>105774093 >>105774095 >>105774144 >>105774177
>>105774082
i figured out that the schizophrenic baker puts his own images in the collages
he never puts text with his images
Anonymous No.105774093 [Report] >>105774144
>>105774091
>citation needed
Anonymous No.105774095 [Report] >>105774144
>>105774091
source: it came to me in a dream
Anonymous No.105774112 [Report] >>105774164 >>105776386
>>105770040
> ======PSA NVIDIA FUCKED UP THEIR DRIVERS AGAIN======
> minor wan2.1 image to video performance regression coming from 570.133.07 with cuda 12.6 to 570.86.10 (with cuda 12.8 and 12.6)
> I tried 570.86.10 with cuda 12.6, the performance regression was still the same. Additionally I tried different sageattn versions (2++ and the one before 2++)
> reverted back to 560.35.03 with cuda 12.6 for good measure and the performance issue was fixed
> picrel is same workflow with same venv. the speeds on 560.35.03 match my memory of how fast i genned on 570.133.07
> t. on debian 12 with an RTX 3060 12GB

Could you share the workflow and the input picture please? Want to test it on Arc B580.

And what is your ram? I have to restart comfy after each 4th gen or it OOMs for ram during vae decoding of the 5th.
Anonymous No.105774120 [Report] >>105775594 >>105775657
>>105774087
you did this to yourself
this is your doomed future you created
>>105773833
>>105773848
WELL?!?
Anonymous No.105774129 [Report] >>105774137
>>105773897
If she chew Bubblegum (Crisis)
Anonymous No.105774134 [Report] >>105774174 >>105774600
Anonymous No.105774137 [Report] >>105774165
>>105774129
Better.
Still ugly as shit though
Anonymous No.105774144 [Report]
>>105774091
>>105774093
>>105774095
The source = open your fucking eyes bitch
Anonymous No.105774153 [Report] >>105774161 >>105774335
>generate images on tensor
>shidpost them in /ldg/
>??????
>profit
I have been doing this for months and no one even knows
Anonymous No.105774161 [Report] >>105774335
>>105774153
no one cares that you're generating a local model render through an API, as long as it's from a local model and not 4o it's all right lol
Anonymous No.105774164 [Report] >>105776840
>>105774112
>I have to restart my firehazard pc repeatedly while genning
Uh oh
Anonymous No.105774165 [Report] >>105774178
>>105774137
>still ugly as shit though
Anonymous No.105774174 [Report] >>105774208 >>105774236
>>105774134
>tfw Lora is trained on your ex gf
Anonymous No.105774177 [Report] >>105774600
>>105774091
>he never puts text with his images
why put text
Anonymous No.105774178 [Report] >>105774246
>>105774165
>>105774087
Anonymous No.105774182 [Report] >>105774187
Often the controlnet preview looks better than the final gen
Anonymous No.105774185 [Report] >>105774261 >>105774273 >>105774559 >>105774569 >>105774975 >>105775424
Can this be combined with SageAttention though?
https://www.reddit.com/r/StableDiffusion/comments/1lpfhfk/radial_attention_onlogn_sparse_attention_with/
Anonymous No.105774187 [Report]
>>105774182
Screenshot it, I do it often
It could just all be in your head,
It’s like western vs Japanese videogame boxart
Neither are bad they’re just different
The brain wants the different one also
Anonymous No.105774198 [Report]
anime girl is holding a book with the text "how to gen 1girls" in scribbled font.
Anonymous No.105774204 [Report]
pool tags for illustrious?
also need a way to sort artists by nationality
Anonymous No.105774208 [Report] >>105774230 >>105774243 >>105774354 >>105774600 >>105775555
>>105774174
>>tfw Lora is trained on your ex gf
damn dude I just interrogated some old playboy model images
Anonymous No.105774224 [Report]
>>105773922
>MY PC!
>anon, no! it's too late!
Anonymous No.105774230 [Report] >>105774265
>>105774208
It's the eye shape and cheekbones kek
>>105773834
>>105773843
>>105773867
>spoon feeds you
https://civitai.com/models/1620140/matrix-bullet-time-camera-effect-wan21-i2v-lora
Anonymous No.105774236 [Report] >>105774249 >>105774335 >>105774507 >>105776072
>>105774174
no need for that anymore, just one image is enough to get what you want with kontext
Anonymous No.105774243 [Report] >>105774265
>>105774208
If you put a Covid mask on her it’s 1:1 </3
Anonymous No.105774246 [Report] >>105774259
>>105774178
Trying backgrounds to see if that's any better
Anonymous No.105774249 [Report]
>>105774236
Years of academy training wasted manually editing/blending multiple layers in gimp kek
Anonymous No.105774259 [Report] >>105774291
>>105774246
dont bend the knee to these bastards make some aeon flux / voltron 1girls
Anonymous No.105774261 [Report] >>105774273
>>105774185
>Can this be combined with SageAttention though?
it will
https://www.reddit.com/r/StableDiffusion/comments/1lpfhfk/comment/n0vguv0/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>Radial attention is orthogonal to Sage. They should be able to work together. We will try to make this happen in the ComfyUI integration.
god bless those chinks, they can't stop deliver good shit
Anonymous No.105774265 [Report]
>>105774243
>>105774230
>wahh i dated a hot girl wahh
fucking normie
Anonymous No.105774273 [Report]
>>105774185
>>105774261
Can RingAttention be used on image models as well?
Anonymous No.105774291 [Report] >>105774295 >>105774330
>>105774259
The schizophrenic baker will get his way no matter what
Anonymous No.105774295 [Report] >>105774330
>>105774291
he'll just say that he's watching you until you get paranoid enough to leave forever and stop posting kek
Anonymous No.105774330 [Report]
>>105774295
>>105774291
Local diffusion?
Anonymous No.105774335 [Report] >>105774340
>>105774236
>>105774161
>>105774153
LOCAL diffusion??
Anonymous No.105774340 [Report]
>>105774335
why did you quote me? when I said "Kontext" I was talking about Kontext Dev
Anonymous No.105774345 [Report] >>105774366
when i generate videos it uses my 16gb ram. do you think i can still play old emulation games in the meanwhile or it will break everything? keep in mind i'm technologically dumb
Anonymous No.105774354 [Report] >>105774391
>>105774208
this is a good one, nice.
Anonymous No.105774363 [Report] >>105774375
wan fun inp vs flf2v, which one is better?
also
wan fun control vs vace, which one is better?
Anonymous No.105774366 [Report] >>105774939
>>105774345
only one way to find out. emus usually hog the cpu and if the game has a small mem footprint you *should* be fine. you can also instruct comfyui to leave a bit of room vram wise
Anonymous No.105774375 [Report]
>>105774363
>wan fun control vs vace, which one is better?
vace is the goat
Anonymous No.105774391 [Report] >>105774433 >>105774600
>>105774354
it's the real one lol
Anonymous No.105774422 [Report] >>105774646 >>105774948 >>105774955
how do I generate two photos of the same person and same background but with different poses?
Anonymous No.105774433 [Report] >>105774449
>>105774391
o I see so that is the scan you used. good shit, me like. found her, hah. Katariina Souri. https://www.listal.com/viewimage/1267543
Anonymous No.105774449 [Report] >>105774493 >>105774544
>>105774433
grok is really nice for making prompts out of those. much more accurate than joy-caption, just doesn't accept nudity
Anonymous No.105774493 [Report] >>105774507 >>105774522
>>105774449
we had this discussion a while ago. I went through a number of vision models via ollama/comfy but the results were pretty meh, esp. with nsfw content. joy is alright but it's also pretty large. someone recommended I try gemini via api (semi-free with a generous daily quota) and it's amazing and zero censorship. also perfect for using it w/ large models because nothing needs to be loaded into the vram. (I haven't fed it real hc or other extreme content tho because I don't gen that shit). supposedly the large minicpm-v barely fits into 24gb vram and is pretty good.
Anonymous No.105774507 [Report] >>105774516 >>105774544 >>105774548
>>105774493
just use kontext dev to get the same exact character bro >>105774236
Anonymous No.105774516 [Report] >>105774525
>>105774507
don't tell me what to do bitch
Anonymous No.105774517 [Report] >>105774528 >>105774531
>>105773210
>anons who post here go on to make money from their gens
based
Anonymous No.105774522 [Report]
>>105774493
>gemini via api
thanks for the tip, gotta try it next
Anonymous No.105774525 [Report]
>>105774516
>seething this hard over an advice
take some pills you mentally ill retard
Anonymous No.105774528 [Report]
>>105774517
and on and on it spins
Anonymous No.105774531 [Report] >>105774537
>>105774517
that sounds like a funny fan-fiction, but I have yet to see such a thing happen in real life, even once kek
Anonymous No.105774537 [Report] >>105774545 >>105774580
>>105774531
>no one is making money using Ai
uh oh stinky uh oh melty
Anonymous No.105774544 [Report]
>>105774449
ignore the part about gemini being uncensored, just threw a nude gen at it and got a refusal. weird, had it accurately describe various sex toys without issues but a mumu and tits is where it draws the line.
>>105774507
I haven't even installed it yet. next week when this fucking heatwave is over I'll git to it
Anonymous No.105774545 [Report] >>105774556 >>105774580
>>105774537
strawman, read the post again, it says "an anon leaves /ldg/ and then makes a career out of his gens" >>105773210
Anonymous No.105774548 [Report]
>>105774507
Kontext is still quite hit or miss with likeness and it often adds the dreaded plastic skin since it's trained over base Flux
Plus it cannot do NSFW/porn like Chroma does.
But ideally, for likeness you'd actually train a Lora with Kontext as the base
Anonymous No.105774556 [Report] >>105774561
>>105774545
>likely
you might want to learn how to read
Anonymous No.105774559 [Report] >>105774569
>>105774185
>Radial Attention enables 4× longer video generation with LoRA tuning, outperforming dense attention in vision rewards, while achieving 3.7× speedup and 4.4× lower tuning costs.
So you could do 20s video with Wan for the same amount it takes to do 5sec now? Yeah, sure
I'm tired of all these papers which end up being snake oil doing nothing
Anonymous No.105774561 [Report] >>105774578
>>105774556
>I admit it's a fan fiction
thank you
Anonymous No.105774567 [Report]
damn bro that troll nigga last thread trolled so hard his shidpoasts are still being linked
Anonymous No.105774569 [Report]
>>105774559
no, they said it's 4x faster for training, but for inference it's 1.9x faster, look at the video, it says it all >>105774185
Anonymous No.105774578 [Report] >>105774580 >>105774588
>>105774561
>I admit my reading comprehension is quite poor and I don't understand probability , assumptions , or likeliness whatsoever
Anonymous No.105774580 [Report]
>>105774578
>I admit my reading comprehension is quite poor
you sure do

>>105774537
>>no one is making money using Ai
>>105774545
>strawman, read the post again, it says "an anon leaves /ldg/ and then makes a career out of his gens
Anonymous No.105774588 [Report] >>105774632
>>105774578
>assumptions
a.k.a, a fan fiction
I accept your apology again.
Anonymous No.105774598 [Report] >>105774609 >>105774648 >>105774822
>hr when someone gets fired
Anonymous No.105774600 [Report] >>105774608 >>105774615 >>105774616
>>105774391
>>105774208
>>105774177
>>105774134
reported for avatar-posting :^)
Anonymous No.105774608 [Report] >>105774616
>>105774600
based, death to chromakeks
Anonymous No.105774609 [Report]
>>105774598
ngl i have been fuggin ROASTED by hr many times
Anonymous No.105774615 [Report] >>105774625 >>105774626 >>105774627 >>105774642
>>105774600
a bitter C U N T is what you are, rocketboy. and I was right, you are a psycho
Anonymous No.105774616 [Report] >>105774625 >>105774627
>>105774608
>>105774600
I hate you disgusting dickheads so much it's unreal
Anonymous No.105774625 [Report]
>>105774615
>>105774616
Sometimes I wonder who the most mentally ill is here
Anonymous No.105774626 [Report] >>105774632
>>105774615
nigga is the rocketgirl in the room wirh us right now?
Anonymous No.105774627 [Report]
>>105774615
>>105774616
>avatarfag seething noises
the best sound in the world
Anonymous No.105774632 [Report]
>>105774626
>>105774588
Speaking of assumptions…
Anonymous No.105774634 [Report] >>105774643
temperate today, perfect weather to bake a lora
Anonymous No.105774642 [Report] >>105774663
>>105774615
why would R-avatarfag report people for avatarfagging, he's all for this lol
Anonymous No.105774643 [Report] >>105774653
>>105774634
>tfw he is baking a Lora of my ex gf
Anonymous No.105774646 [Report] >>105774651
>>105774422
help
Anonymous No.105774648 [Report] >>105774698
>>105774598
I hate HRs so much it's unreal
https://www.youtube.com/watch?v=q6iGllJ-3aQ
Anonymous No.105774651 [Report]
>>105774646
by typing prompts??
Anonymous No.105774653 [Report] >>105774663
>>105774643
you got a problem with that?
Anonymous No.105774663 [Report]
>>105774653
you better hurry up it'll be illegal soon
>>105774642
rgal is rangebanned for cunnyposting
Anonymous No.105774698 [Report] >>105774707 >>105774822
>>105774648
Anonymous No.105774707 [Report] >>105774749
>>105774698
>hair on her tongue
ewww...
Anonymous No.105774749 [Report] >>105774755
>>105774707
happens if you smoke too many cigarettes
Anonymous No.105774755 [Report]
>>105774749
Low testosterone
Anonymous No.105774767 [Report] >>105774777 >>105774779
is chroma not shit yet?
Anonymous No.105774777 [Report] >>105774782
>>105774767
Judging by the last two threads, probably no
Anonymous No.105774779 [Report] >>105774785 >>105774821 >>105774921 >>105776846
>>105774767
it'll never happen, it's distilled now
Anonymous No.105774782 [Report] >>105774800
>>105774777
>probably
careful we don't understand Inference here or assumptions
Anonymous No.105774785 [Report] >>105774804
>>105774779
why does compute always land in the hands of the unqualified
Anonymous No.105774800 [Report] >>105774809
>>105774782
>assumptions
you mean fan fictions?
Anonymous No.105774804 [Report]
>>105774785
In a word? Outsourcing
Anonymous No.105774809 [Report] >>105774819
>>105774800
do you have to take the bait every single fucking time? ngl nigga annoying for real
Anonymous No.105774811 [Report] >>105774816 >>105774821 >>105774827 >>105774835
wan 2.2 will be a nothingburger btw
Anonymous No.105774816 [Report]
>>105774811
Pls no
Anonymous No.105774819 [Report] >>105774950
>>105774809
>I was pretending to be retarded
nah, you're naturally like that
Anonymous No.105774821 [Report] >>105774842
>>105774811
why would it? they perfectly nailed the previous version, they aren't a team of unqualified retard like that horse fucker >>105774779
Anonymous No.105774822 [Report]
>>105774598
>hr
Not ugly enough
>>105774698
Lore accurate
Anonymous No.105774827 [Report]
>>105774811
I would be very happy if they at the very least retrained the whole thing on better captions OR fine-tuned it to make 8/10 second long videos instead of 5
Anonymous No.105774835 [Report] >>105774846 >>105774874 >>105775620
>>105774811
I can feel this model will be unified, like Kontext, like it'll do t2v, i2v, r2v, all in one model, that's the future
Anonymous No.105774842 [Report]
>>105774821
it'll be fine as long as expectations are kept in check. go in expecting wan 2.1 with little tiny improvements and you'll be happy
Anonymous No.105774846 [Report] >>105774881
>>105774835
I'd be okay with that, I don't see where else they can take it without adding billions more parameters and abandoning local
Anonymous No.105774874 [Report]
>>105774835
It's a future, but will it be as efficient as specific different models for t2v, i2v, r2v would be?
The size of the model is a factor for local use. I'd be happier with 3 separate models I can run wholly in 24gb than 1 unified model I have to use gguf to fit in. But I get your point.
Anonymous No.105774879 [Report] >>105774915 >>105775628
Anonymous No.105774881 [Report]
>>105774846 or it's exactly the same but works faster, that'd be cool too
Anonymous No.105774909 [Report] >>105774914 >>105774960 >>105775656
You niggas have to always assume those guys are not doing things for people with gaming GPUs, they are training models meant to be available on their API services, releasing the weights is a mere afterthought
So I doubt they are training models to make them faster or some shit except if there is a fundamental arch change
Anonymous No.105774914 [Report] >>105774922
>>105774909
BFL? no shit
Anonymous No.105774915 [Report]
>>105774879
my wife
Anonymous No.105774921 [Report] >>105774936
>>105774779
genuinely curious how chroma would've gone if he just trained on dev at 1024x to start, no de-distillation or anything. i think that pixelflow finetune did it and it worked pretty well, though it was a stylistic finetune rather than a concept one. where is he getting the money for these experiments? he's constantly changing shit all the time as if he has unlimited budget yet the donation page hasn't even reached ~20 epochs worth of funding
Anonymous No.105774922 [Report]
>>105774914
Any major lab including Alibaba
Anonymous No.105774931 [Report]
If my second 3090 is running at x4, should I just train using only the 3090 that's running at x16 and ignore my second gpu?
Anonymous No.105774934 [Report] >>105774938 >>105775687
Ain't no way alibaba is going to keep releasing open source models forever, at some point they will just release a Wan 3.0 api only
Anonymous No.105774936 [Report]
>>105774921
>genuinely curious how chroma would've went if he just trained on dev at 1024x to start, no de-distillation or anything.
the issue is that going from 512x512 to 1024x1024 is 4x slower :(
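
Rough numbers behind the 4x figure, as a sketch only. It assumes a Flux-style setup where the VAE downsamples 8x and the transformer patchifies 2x2, so one token covers 16x16 pixels. `image_tokens` is a made-up helper for illustration, not anything from the Chroma training code.

```python
def image_tokens(width, height, vae_downsample=8, patch=2):
    """Number of image tokens the transformer sees (assumed Flux-style layout)."""
    stride = vae_downsample * patch  # pixels covered by one token along each side
    return (width // stride) * (height // stride)


low = image_tokens(512, 512)      # 32 * 32 tokens
high = image_tokens(1024, 1024)   # 64 * 64 tokens

token_ratio = high / low              # cost of the per-token layers scales with this
attention_ratio = token_ratio ** 2    # dense attention scales with tokens squared

print(low, high, token_ratio, attention_ratio)
```

Under those assumptions the token count goes up 4x, and the dense attention part alone goes up 16x, so 4x is more of a lower bound per step. The real slowdown depends on the hardware and the attention kernel.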
Anonymous No.105774938 [Report]
>>105774934
>keep releasing open source models forever
>he doesn't know about Wan 2.1 Pro
Anonymous No.105774939 [Report]
>>105774366
ty
Anonymous No.105774948 [Report] >>105774958
>>105774422
Not possible.
Give up.
Anonymous No.105774950 [Report] >>105774958 >>105774962
>>105774819
This is Ai? Wow
Anonymous No.105774955 [Report] >>105775033 >>105775577
>>105774422
Flux Kontext does that out of the box
Anonymous No.105774958 [Report]
>>105774948
kek>>105774950
Anonymous No.105774960 [Report] >>105774984 >>105774989 >>105775709
>>105774909
surely china is making progress on their own hardware, right? there is no way they're content paying for overpriced import-nerfed nvidia
Anonymous No.105774962 [Report]
>>105774950
>This is Ai?
yes
>Wow
Ikr
Anonymous No.105774975 [Report] >>105774987
>>105774185
https://github.com/mit-han-lab/radial-attention
>Wan2.1-14B, HunyuanVideo, and Mochi-1 are supported for fast video generation with high quality under 1-4⨉ video length
>1-4⨉ video length
>Release LoRA checkpoints for longer-video generation

While exciting, I'm a little bit concerned about OOM. So in my understanding, this allows for continued generation WITHOUT the degradation beyond the Wan 5 sec limitation. So you're saying, if I load up 324 frames (20 secs), this should generate without the errors? Currently, if I load anything past 230 frames, I OOM.
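
For the frame math, a minimal sketch assuming Wan 2.1's usual 16 fps output, frame counts of the form 4k+1 (81 frames for 5 seconds), and a temporal compression of 4 in the video VAE. Both helper names are hypothetical.

```python
def frames_for(seconds, fps=16):
    """Frame count for a clip length, using the 4k+1 convention (assumed)."""
    return seconds * fps + 1


def latent_frames(frames, temporal_stride=4):
    """Latent frames after temporal compression (assumed stride of 4)."""
    return (frames - 1) // temporal_stride + 1


for seconds in (5, 20):
    frames = frames_for(seconds)
    print(seconds, "s:", frames, "frames,", latent_frames(frames), "latent frames")
```

With those assumptions 20 seconds comes out at 321 frames and 81 latent frames, versus 21 latent frames for a 5 second clip, so the sequence the model attends over is nearly 4x longer. The speedup does not remove the extra activation memory that comes with it.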
Anonymous No.105774984 [Report]
>>105774960
They are just starting to make their own EUV machines and semiconductor foundries, it will take some time until mass production
Anonymous No.105774987 [Report] >>105775044 >>105775067
>>105774975
it means that this method will help the model not produce shit if you go for more than 5 seconds, but it also means that you have to be able to handle the additional memory if you want to go for something longer of course
Anonymous No.105774989 [Report]
>>105774960
>import-nerfed nvidia
aren't people on weibo bragging about importing grey h200s
Anonymous No.105774998 [Report] >>105775023
If I install Comfy portable, can I set it up so it opens in its own browser window and not just in a tab?
Anonymous No.105775017 [Report] >>105775049
https://www.youtube.com/watch?v=2PkMO3yVz7g&list=PLHRLDTelHSJpnC7ML-kNvPWhSyXH4nXzt
Anonymous No.105775023 [Report]
>>105774998
have two browsers
Anonymous No.105775033 [Report] >>105775043
>>105774955
>local
Anonymous No.105775043 [Report] >>105775059
>>105775033
?
Kontext Dev is local
Anonymous No.105775044 [Report] >>105775050 >>105775067
>>105774987
Ah, figured that would be the case. It's a good thing I'm upgrading, jej
Anonymous No.105775049 [Report]
>>105775017
Damn Rene
Anonymous No.105775050 [Report] >>105775067 >>105775117
>>105775044
>upgrading
unless you're getting an RTX PRO 6000, you won't have enough
Anonymous No.105775059 [Report] >>105775071
>>105775043
I'm waiting for nsfw lora for it
Anonymous No.105775067 [Report] >>105775274
>>105774987
>>105775044
>>105775050
won't the memory requirement be less of a pain if they went for O(n log n) instead of O(n^2)?
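
A back-of-the-envelope sketch of what the two growth rates mean, not anything taken from the radial attention repo. The token count `n` below is a hypothetical number picked so the logs come out exact, and the cost functions ignore constant factors.

```python
import math


def dense_cost(n):
    """Relative cost of dense attention over n tokens: O(n^2)."""
    return n * n


def sparse_cost(n):
    """Relative cost of an O(n log n) attention pattern over n tokens."""
    return n * math.log2(n)


def growth(cost, n, factor):
    """How much the cost grows when the sequence gets `factor` times longer."""
    return cost(n * factor) / cost(n)


n = 1024  # hypothetical baseline token count
print("dense growth at 4x length: ", growth(dense_cost, n, 4))
print("sparse growth at 4x length:", growth(sparse_cost, n, 4))
```

At 4x the length the dense cost grows 16x while the n log n cost grows a bit under 5x at this baseline. Note this is about attention compute: fused kernels in the FlashAttention family already avoid storing the full n x n score matrix, so the memory you OOM on is mostly activations and latents that still grow with the frame count.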
Anonymous No.105775071 [Report]
>>105775059
There is one to remove clothes, but the nipples look weird since the base model is censored and it's hard to train the entire concept of nudity
Anonymous No.105775073 [Report] >>105775266
Generating shit on tensor.art now costs more credits. Sucks.
Anonymous No.105775078 [Report] >>105775091
https://openart.ai/workflows/amadeusxr/change-any-image-to-anything/5tUBzmIH69TT0oqzY751

neat multi image workflow with some style presets you can enable/disable

pixel art option on: anime girl is holding a book with the text "how to gen 1girls" in scribbled font.
Anonymous No.105775083 [Report]
I am still waiting for a method of reliably generating shit that is NOT slow motion while using the lightx2v lora
Anonymous No.105775091 [Report] >>105775103
>>105775078
>style presets
nigger, it takes 2 seconds to write "pixel style" on the prompt box lol
Anonymous No.105775103 [Report] >>105775118
>>105775091
I know, it's not necessary, it's just there. I might even remove the node but I don't want to break the workflow. can just bypass anyway
Anonymous No.105775117 [Report]
>>105775050
And why wouldn't I? It OOMs on my 16gb card at 230 frames, I'm pretty sure even a 24gb can handle 320 frames. But I'm going for the 48gb card
Anonymous No.105775118 [Report] >>105775128
>>105775103
yeah fair enough, thanks for sharing the workflow anon
Anonymous No.105775120 [Report]
Anonymous No.105775122 [Report]
anime girl is sitting at a desk in a dimly lit office. she is wearing a pink tracksuit and holding a book with the text "how to hide a body" in scribbled font.
Anonymous No.105775128 [Report] >>105775138
>>105775118
it works well for combining stuff (ie pepe + character), I leave the reference images to 2 and just bypass the other image inputs if I want a single input. otherwise I enable it if I want to combine stuff. works decent enough, I just wanted a multi-image option for combining stuff if I want to.
Anonymous No.105775133 [Report] >>105775141 >>105775146
Why do models use different vae?
Wouldn't it be nice if all of them work with the same vae so that you can mix their results in the same latent space? Is there any reason not to do this?
Anonymous No.105775138 [Report] >>105775167
>>105775128
that workflow stitches images together right? do you prefer it over the reference conditioning cascade thing?
https://www.reddit.com/r/StableDiffusion/comments/1lo4lwx/here_are_some_tricks_you_can_use_to_unlock_the/
Anonymous No.105775141 [Report] >>105775153
>>105775133
not all vaes are created equal
https://huggingface.co/spaces/rizavelioglu/vae-comparison
Anonymous No.105775146 [Report]
>>105775133
>Why do models use different vae?
because each company wants to make their own vae and claim they made the best vae ever, it's a competition lol
Anonymous No.105775153 [Report] >>105775171 >>105775192
>>105775141
What does it have to do with my question?
Anonymous No.105775167 [Report] >>105775188
>>105775138
both work. I use both, I'm just testing different workflows to see how to get multiple images to interact. if you reference "green frog" it will understand. I think they are both considered separate in latent space or whatever, then when generating it combines them.

ie: anime girl is sitting at a desk in a dimly lit office with a green cartoon frog that is wearing a red tshirt and blue shorts. she is wearing a pink tracksuit and holding a book with the text "how to hide a body" in scribbled font. keep the frog's expression the same.

originally I just used an image stitch node but sometimes it messed things up, the separate image inputs seems to work much nicer. can take a couple gens to get the pepe right though (you get a frog but not the exact face), but it works:
Anonymous No.105775171 [Report] >>105775186
>>105775153
Anonymous No.105775186 [Report]
>>105775171
Do you understand the question?
Anonymous No.105775188 [Report] >>105775242
>>105775167
also "keep expression the same" works really well for maintaining a face, otherwise you can get random faces which can be funny but not pepe.
Anonymous No.105775192 [Report] >>105775200
>>105775153
you're literally asking why a gamecube disk doesn't work on a ps2 console, the consoles (models) have different architectures, they are not compatible at all
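
To make the incompatibility concrete, a small sketch that assumes the commonly cited layouts: 4 latent channels for the SD1.5/SDXL VAEs, 16 for the Flux VAE, and 8x spatial downsampling for both. `latent_shape` is an illustrative helper, not part of any library.

```python
def latent_shape(width, height, channels, downsample=8):
    """(channels, height, width) of the latent for a given image size."""
    return (channels, height // downsample, width // downsample)


sd_latent = latent_shape(1024, 1024, channels=4)     # assumed SD1.5/SDXL layout
flux_latent = latent_shape(1024, 1024, channels=16)  # assumed Flux layout

print(sd_latent, flux_latent, sd_latent == flux_latent)
```

The tensors don't even have the same shape, so they can't be blended directly. And matching shapes would not be enough either: VAEs trained separately assign different meanings to the same latent values, so mixing only works between models built on the same VAE.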
Anonymous No.105775200 [Report] >>105775210
>>105775192
I know it's not compatible. I'm asking why they don't collaborate.
Anonymous No.105775210 [Report] >>105775224
>>105775200
>I'm asking why they don't collaborate.
are you retarded? why would companies collaborate with each other? they are rivals not friends
Anonymous No.105775224 [Report] >>105775230
>>105775210
Thank you for finally trying to be on point and answer the original question.
Anonymous No.105775230 [Report] >>105775239
>>105775224
this is such a retarded question though
Anonymous No.105775237 [Report] >>105775256 >>105775276 >>105775293 >>105775306
Don't lie. If you guys were a company/lab, would you jew out the weights too? Or would you do like based Emad, scam VCs and give away all weights for free?
Anonymous No.105775239 [Report] >>105775247 >>105775254 >>105775258
>>105775230
People must find you to be hard to talk to irl.
Scattered mind of reasoning lol
Anonymous No.105775242 [Report] >>105775340
>>105775188
like so:
Anonymous No.105775247 [Report]
>>105775239
nah, I'm not surrounded by retards like you so it goes pretty smoothly
Anonymous No.105775254 [Report]
>>105775239
you're either extremely ESL, retarded, or both. your question is shit and i feel bad for the anons who wasted their time answering you.
Anonymous No.105775256 [Report] >>105775306
>>105775237
>Don't lie. If you guys were a company/lab, would you jew out the weights too? Or would you do like based Emad, scam VCs and give away all weights for free?
it really depends desu, if my company is big like Alibaba, I wouldn't mind sharing my best models
Anonymous No.105775258 [Report] >>105775291
>>105775239
Ignore that nigga he does this shit every thread
Anonymous No.105775266 [Report] >>105775298
>>105775073
NO PLS NO
Anonymous No.105775274 [Report]
>>105775067
>won't the memory requirement be less of a pain if they went for O(nlogn) instead of O(n2)?
chat is it true?
Anonymous No.105775276 [Report] >>105775740
>>105775237
>based Emad
didn't he screech at compvis \ runway?
Anonymous No.105775291 [Report]
>>105775258
I really wish there was a unified vae
Anonymous No.105775293 [Report]
>>105775237
>based Emad
your "based" Emad wanted to cuck SD1.5, but Runway (the guys who trained the model) released it on an uncucked form, and Emad did everything in his power to nuke the model out of the internet's existence lol
Anonymous No.105775298 [Report]
>>105775266
Go see for yourself. More resolution - more credits cost.
Anonymous No.105775306 [Report] >>105775319 >>105775339
>>105775237
If I were BFL, I would not release the weights for Kontext. It's just too useful and there is no other open source equivalent (omnigen 2 etc are not on the same level)

>>105775256
It pisses me off that Bytedance doesn't release their models. Their main business is social media, it makes no sense locking down their models behind API since it was never their thing anyway.
Anonymous No.105775319 [Report] >>105775328
>>105775306
>It's just too useful
Idk man, for a professional setting I doubt they'll be using a model like that, it adds jpg artifacts on each iteration edit
Anonymous No.105775328 [Report] >>105775358
>>105775319
Still miles ahead of the alternatives for image editing, including mainstream SaaS models like 4o and gemini
Anonymous No.105775339 [Report] >>105775364
>>105775306
>It pisses me off that Bytedance doesn't release their models.
they just released XVerse though
https://bytedance.github.io/XVerse/
Anonymous No.105775340 [Report] >>105775351 >>105775365 >>105775368
>>105775242
anime girl is wearing a white tshirt with an image of the green cartoon frog that is wearing a red tshirt and blue shorts. She is at the beach holding a bottle of water.
Anonymous No.105775349 [Report] >>105775388
how long before we have video 2 video kontext?
I need to nudify videos!
Anonymous No.105775351 [Report]
>>105775340
kek this is really good
Anonymous No.105775358 [Report]
>>105775328
if I were a company that edits images, why would I use kontext dev? I would use their kontext pro/max API, that's the SOTA model
Anonymous No.105775364 [Report] >>105775413 >>105775497
>>105775339
The good stuff, Seedream (image model), Seedance (video model), they keep to themselves, and both of them mog the local alternatives
Anonymous No.105775365 [Report]
>>105775340
one more

anime girl is wearing a white tshirt with an image of the green cartoon frog that is wearing a red tshirt and blue shorts. She is at the beach is waving hello. keep her blue and yellow hairclip the same.
Anonymous No.105775368 [Report] >>105775383
>>105775340
drop a catbox of the workflow please
Anonymous No.105775373 [Report]
Why would a company keep locking down a model which has been outclassed by other companies, for the same price?
Why not just open source your old model and get free advertisement, and keep your newest one api only? Well, I guess other big tech companies would just train on top of it to fuck you over
Where is illu 3.5
Anonymous No.105775383 [Report] >>105775400
>>105775368
it's this one:

https://openart.ai/workflows/amadeusxr/change-any-image-to-anything/5tUBzmIH69TT0oqzY751

just with the gguf q8 model + node instead of the default.
Anonymous No.105775388 [Report]
>>105775349
For topless I think you can just ask for undressing since WAN is good at taking clothes off, but for bottomless other than buttcheekage there's no point because you'll get weird scary pepperoni genitals
Anonymous No.105775389 [Report]
clip is wrong side but you get the idea:
Anonymous No.105775400 [Report] >>105775405
>>105775383
reference images is set to 2, if I wanna do a single image I just bypass the second or third image inputs, works fine.
Anonymous No.105775405 [Report]
>>105775400
*because the selector can bug out for some reason, going to 0, so I just bypass the second image if I dont want a second reference.
Anonymous No.105775413 [Report] >>105775426 >>105775429
>>105775364
>Seedance (video model), they keep to themselves, and both of them mog the local alternatives
This is true but from what it seems normies already don't care about AI video, and Seedance is ironically harder to set up than WAN if you've used AI before but haven't used tiktok/capcut. Also watermarked stuff is usually DOA at this point because if you cannot generate fake Gaza footage what's the point at all
Anonymous No.105775424 [Report] >>105775436
>>105774185
Fuckos should've had the comfy node ready to go, nobody is gonna infer wan via their gay ass script
Anonymous No.105775426 [Report] >>105775507
>>105775413
???
None of those problems would exist running local, my point was that they outright refuse to release the weights while I doubt this would hurt their business at all
Anonymous No.105775429 [Report] >>105775507
>>105775413
>This is true but from what it seems normies already don't care about AI video
if this was true, I wouldn't get my tiktok feed flooded by veo3's videos (don't get me wrong they are really funny)
Anonymous No.105775436 [Report]
>>105775424
yeah, I hate when they do that, first they announce their new method, then we have to wait for ComfyUI's node to appear, they should do it all in one shot, the wow effect would have a much bigger impact
Anonymous No.105775456 [Report] >>105775465 >>105775477 >>105775480 >>105775547
I made a nudey kontext LoRA that's slightly better than the old one. It's still pretty cooked given nude LoRA's for flux are generally shit no matter what, but it works better than the other one. Combine it with a Chroma inpaint on the nude bits afterwards and the results are pretty fucking great. Can upload it, but don't know where, catbox only takes like 200 and it's 350.
Anonymous No.105775465 [Report]
>>105775456
>Can upload it, but don't know where
huggingface?
Anonymous No.105775477 [Report] >>105775547
>>105775456
>catbox only takes like 200
https://litterbox.catbox.moe/
upload here temp, let's see it
Anonymous No.105775480 [Report] >>105775547
>>105775456
Compress the file splitting into two, then upload the catbox
Anonymous No.105775497 [Report] >>105775555
>>105775364
was using seedream today, it's such a fucking perfect model and it makes me so angry we don't have it. it really feels like SDXL 2.0: wide range of styles, 1.2k res, minimal slopped look, and not too heavily biased towards anything.
Anonymous No.105775507 [Report] >>105775523
>>105775429
Yeah anon, YOUR feed. Normies under 50 hate AI because "I hate the current thing" also a little bit of TDS

>>105775426
How would it help their business at all though? Especially if seedream isn't runnable on a 5090. How has releasing WAN helped Alibaba's business?
Anonymous No.105775523 [Report] >>105775659
>>105775507
>also a little bit of TDS
well, technically it would be called AIDS (literally KEEEEK)
Anonymous No.105775547 [Report] >>105775777
>>105775456
>>105775477
>>105775480
>https://files.catbox.moe/8s88kw.rar
>rename to kn_v1.part1.rar
https://files.catbox.moe/0b7der.rar
>rename to kn_v1.part2.rar
Was more of a test of my dataset and captioning to see if I could get it to work. It's undertrained, but it does indeed werk.
Password is three letters and the name of this general. I'll post any future versions here too.
Anonymous No.105775555 [Report] >>105775652
>>105775497
is it good tho? it turned a pretty solid description of this >>105774208 into this slop
Anonymous No.105775577 [Report] >>105775923
>>105774955
what can kontext do and why would people use it?
Anonymous No.105775594 [Report] >>105775624 >>105775722 >>105775760
>>105774120
On it boss, wait 3 minutes.
Anonymous No.105775620 [Report]
>>105774835
But Kontext unified into shit, please no Wan!
Anonymous No.105775624 [Report]
>>105775594
shiieet wan turned her into something decent, figures.
Anonymous No.105775628 [Report]
>>105774879
Very nice, but she has the hands of someone who has worked as a dishwasher for 30 years, surely she could afford a maid...
Anonymous No.105775634 [Report] >>105775648 >>105775659 >>105775660 >>105775677
I've read what feels like every manga and manhwa on earth at this point with this hobby. What do you all do while waiting?
Anonymous No.105775648 [Report]
>>105775634
collect real images and videos for future use
Anonymous No.105775652 [Report] >>105775722
>>105775555
what is the prompt and where are you using it?
Anonymous No.105775656 [Report]
>>105774909
Yes, obviously we understand that the watered down, heavily-censored, distilled models BFL releases to the public are crippled so as to make people gravitate towards their paid API versions.

This is also why they hate and want to destroy NSFW loras for their models, since that is something they will never offer on their API, and it makes the public versions more valuable.
Anonymous No.105775657 [Report] >>105775734 >>105775923
>>105774120
K here u go boss.
Anonymous No.105775659 [Report]
>>105775523
>technically it would be called AIDS
Compute pool's closed due to AIDS

>>105775634
>What do you all do while waiting?
Video takes two minutes to generate now so I just stroke my penis (goon) or craft the next prompt
Most of my waiting is for my wife to be asleep or leave the room
Anonymous No.105775660 [Report] >>105775673
>>105775634
How is the temp only 64?
Mine reaches 90
Anonymous No.105775673 [Report] >>105775698
>>105775660
Probably power limiting, which you should almost certainly do if you have a strong GPU as well
I lowered my power on my 5070ti from 300w to 250w and fan speeds and temperature went down at no cost to gen time
Anonymous No.105775677 [Report]
>>105775634
Work in parallel bro. Get another image/workflow ready.
Anonymous No.105775687 [Report]
>>105774934
As long as they do, I will gladly accept.

Wan 2.1 was already a massive revolution in local video generation. With further optimizations and more loras there's a lot of untapped potential there.

But they are at least releasing open Wan 2.2, so we will eat good once more at least.
Anonymous No.105775698 [Report] >>105775746
>>105775673
how to do it in a laptop?
MSI shows the option as greyed out
Anonymous No.105775709 [Report]
>>105774960
It's not an actual problem, as it stands, China can just get cards from countries where there is no embargo. For chinese citizens and smaller companies there, yes, it is a problem, but for the Chinese state, or huge chinese companies like Alibaba, it's no problem at all.

That said, they are obviously making their own chips, it will take time though, still they are moving faster than expected.
Anonymous No.105775722 [Report] >>105775743 >>105775770
>>105775594
dude on closer inspection.. this is amazing. are you willing to share the workflow+prompt?
>>105775652
something along the lines of
"A close-up portrait captures a young woman with captivating emerald green eyes and vibrant red lips, offering a gentle smile directly at the viewer. Her dark, straight hair features a short fringe, neatly framing her face. She wears an elaborate, traditional-style hat adorned with brown and white fur, featuring a richly embroidered band in reds, blues, and white, with a long blue fabric tail falling over her shoulder. A substantial silver necklace hangs around her neck, composed of numerous coin-like medallions and small dangling bells, reflecting the bright sunlight. Lush green leaves and clusters of glossy red berries from a tree branch artfully frame the upper left and right portions of the image, subtly diffusing the natural light. The background remains softly blurred, indicating an outdoor setting bathed in brilliant sunlight, which casts a gentle warmth and subtle highlights across the scene, imparting a serene and inviting atmosphere."
and I just used the free seedream image gen page at https://seedream.pro/
Anonymous No.105775723 [Report]
there are a decent amount of 32gb+ inference options, but almost all of them are for LLMs and would shit themselves trying to run image/video models.
Anonymous No.105775734 [Report]
>>105775657
damn that's pretty neat, technologically speaking of course
Anonymous No.105775740 [Report]
>>105775276
Yes, first Emad was saying he wanted uncensored models because he hated censorship.

Then he dragged his feet for months releasing SD1.5.

Then Runway, their partner in making SD1.5, realized that unless they released it themselves, it would take another year since Emad was busy censoring the model like a motherfucker.

When Runway released SD1.5, Emad got pissed and there were some really passive aggressive posts from him and his employees, ending all further partnership with Runway.

If not for Runway, if we ever got SD1.5 it would have been MUCH later, and a heavily censored version.
Anonymous No.105775743 [Report]
>>105775722
>https://seedream.pro/
you got saar'd, that's not the actual site. it's on dreamina
Anonymous No.105775746 [Report] >>105775844
>>105775698
use nvidia-smi
Anonymous No.105775760 [Report]
>>105775594
Encounters of the Double Kind
Anonymous No.105775770 [Report] >>105775784 >>105775818 >>105776119 >>105776469
>>105775722
It's just the NAG + lightx2v workflow from the rentry with some breast jiggle loras from civit. I'm running on a RTX 6000 pro so I'm loading the full WAN model, might be making a difference.
Anonymous No.105775777 [Report] >>105775799
>>105775547
That works pretty good, can you share your data set?
Anonymous No.105775784 [Report]
>>105775770
>6000 pro
is there a max resolution that the vid quality starts to degrade?
Anonymous No.105775799 [Report] >>105775850
>>105775777
No.
Anonymous No.105775818 [Report]
>>105775770
thanks, I'll figure it out. what would the prompt look like, just in general? sorry for the smoothbrain question, I've yet to gen a single video. ahem.
Anonymous No.105775844 [Report] >>105775882
>>105775746
Thanks
Running nvidia-smi -q -d POWER gives this output
GPU 00000000:01:00.0
GPU Power Readings
Average Power Draw : N/A
Instantaneous Power Draw : 76.15 W
Current Power Limit : 95.00 W
Requested Power Limit : 95.00 W
Default Power Limit : 80.00 W
Min Power Limit : 1.00 W
Max Power Limit : 95.00 W

What should I set it to? 75? 60?
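
A rough sketch of how the readings above could be turned into a power-limit command. It parses the pasted report, clamps a target wattage to the reported Min/Max range, and builds the nvidia-smi call. The helper names are made up for illustration. As far as I know `-pl` is the nvidia-smi power limit flag and it usually needs admin rights; whether a laptop GPU accepts the change at all is an assumption, not a given.

```python
import re

# Text copied from the `nvidia-smi -q -d POWER` report quoted above.
SAMPLE = """\
GPU 00000000:01:00.0
GPU Power Readings
Average Power Draw : N/A
Instantaneous Power Draw : 76.15 W
Current Power Limit : 95.00 W
Requested Power Limit : 95.00 W
Default Power Limit : 80.00 W
Min Power Limit : 1.00 W
Max Power Limit : 95.00 W
"""


def parse_power_fields(text):
    """Return {field name: watts} for every 'Name : 12.34 W' line."""
    fields = {}
    for line in text.splitlines():
        match = re.match(r"\s*(.+?)\s*:\s*([\d.]+)\s*W\s*$", line)
        if match:  # lines such as 'Average Power Draw : N/A' are skipped
            fields[match.group(1)] = float(match.group(2))
    return fields


def clamp_limit(target_watts, fields):
    """Keep the requested limit inside the range the driver reports."""
    low = fields["Min Power Limit"]
    high = fields["Max Power Limit"]
    return max(low, min(high, target_watts))


def power_limit_command(target_watts, fields):
    """Build (but do not run) the command that would apply the limit."""
    watts = clamp_limit(target_watts, fields)
    return ["nvidia-smi", "-pl", str(int(watts))]


if __name__ == "__main__":
    fields = parse_power_fields(SAMPLE)
    print(" ".join(power_limit_command(75, fields)))
```

The sketch only builds the command, so you can look at it before running it yourself. Which wattage is actually best is something to find by testing gen times at a few values.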
Anonymous No.105775850 [Report] >>105775889
>>105775799
why, is it based on your girlfriend's nudes?
Anonymous No.105775882 [Report]
>>105775844
what I did was overclock a bit and then cap the max clock
Anonymous No.105775889 [Report] >>105776021
>>105775850
The 400 image set is 800MB and the 800 set is twice that. I couldn't be arsed uploading it.
A monkey could make a nude dataset for Kontext though. It's easy.
>collect images of nude women from some artsy nude site, go for quality and variety over quantity, with different body types
>run them through kontext using nested wildcards that give the nude women various types of clothing
>it will glitch on some, rerun it on the ones that don't work correctly until it does
>you now have two datasets, the control (AI clothed women) and the target (the source nude images)
>caption each with "Remove their clothing and make them nude. They are blah blah blah" describing their body features
>can use a local vision model to caption them, then append the "remove" caption at the start of each caption file
Easy as that.
Anonymous No.105775923 [Report]
>>105775657
MY MAN
HELL YEAH
>>105775577
I just use it for funny things
Anonymous No.105775961 [Report] >>105775974 >>105775978 >>105775994 >>105776056 >>105776078 >>105776215 >>105776491
For any real anon reading, is there any good AI community where people are good with training and know what they are doing? I wanna discuss training settings and I'm not getting much help here, and based on uploaded images alone, I suspect half of the posters in this general are indians.
Anonymous No.105775970 [Report]
Is there a thorough guide i can read for kontext, I got comfy installed and used anons workflow for 2 images into one "shaking hands with the other woman" , that one, and it takes 7 minutes on my 4060ti.
Obv something is wrong or i misunderstand kontext and i need to figure out which is which.
Anonymous No.105775974 [Report]
>>105775961
I'd move to plan B
Anonymous No.105775978 [Report]
>>105775961
Pls sir think of my village please send btc for helping u
>el Dee gee is ass/garbage
Wow cool your eyes work great
Anonymous No.105775980 [Report] >>105776010
>and it takes 7 minutes on my 4060ti.
>Obv something is wrong
Anonymous No.105775984 [Report] >>105775999 >>105776003
>anon posts lora based on heavily censored model and knows how to uncensor it
>clearly knows what hes doing
>random retard "duuuuh where da trainers at you ppl are stooopid"
good job, tardlinger
Anonymous No.105775994 [Report] >>105776176 >>105776512
>>105775961
go to reddit then, faggot
Anonymous No.105775999 [Report] >>105776060
>>105775984
On other boards ragebaiting/insulting is the fastest way to get information (false-flagging) so…
Anonymous No.105776003 [Report] >>105776006 >>105776014
>>105775984
He won't share his dataset though
Anonymous No.105776006 [Report]
>>105776003
You can’t find naked photos of women on the internet??
Anonymous No.105776010 [Report] >>105776023
>>105775980
Sorry i'm not up on memes, I have no idea what that sog is specifically telling me?
Start from scratches?
That's the way the dog bone crumbles?
I just farted?

any other options?
Anonymous No.105776014 [Report]
>>105776003
>anon posts his exact recipe on how he did it
>random faggot: "duuuh lemme borrow your homework so I don't have to do a thing myself? I'm too busy jerkin my meatstick to make a dataset"
good job, cock snorkeler
Anonymous No.105776021 [Report] >>105776031 >>105776041 >>105776072
>>105775889
Who can help me make a Prophet Muhammad LoRA?
I can zip all the images I have. I don't know how to caption.
We must make one for free speech.
They removed the Jesus lora though
Anonymous No.105776023 [Report] >>105776086
>>105776010
Time, the time-knife
Is referencing your short amount of time You spent Comparatively to all the other toaster gpu here
Anonymous No.105776031 [Report]
>>105776021
>they removed the Jesus Lora
Good.
Anonymous No.105776041 [Report] >>105776044 >>105776051
>>105776021
In my experience, nobody is going to actually do it for you, especially niche shit like that. There's no collaboration here when it comes to LoRA's or training. At the most, anons will help you with advice or settings, and you'll need to learn from researching shit and trial and error.
Anonymous No.105776044 [Report] >>105776055
>>105776041
So like any other hobby?
Anonymous No.105776051 [Report]
>>105776041
I would have done it myself if I didn't have a 6 GB gpu
>anons will help you with advice or settings
okay tell me
is it even possible for me to create a SDXL lora?
Anonymous No.105776055 [Report]
>>105776044
Pretty much, yeah. Eventually local assistant AI's will be powerful enough to do stuff like that, ie drop a folder of images and say "Make me a LoRA dataset and train it up, bitch", and they will. We're not there yet though.
Anonymous No.105776056 [Report]
>>105775961
>I wanna discuss training settings and I'm not getting much help here,
I don't know either, I just copypaste training settings I see on civitai to see what works and what don't
Though quite a lot of them have batch >= 8
Anonymous No.105776060 [Report]
>>105775999
>satanic trips of (mostly) truth
Anonymous No.105776072 [Report] >>105776083 >>105776135
>>105776021
>Who can help me make a Prophet Muhammad LoRA?
>>105774236
Anonymous No.105776078 [Report] >>105776176
>>105775961
Ask good questions, get good answers

You are probably asking the most retarded shit
Anonymous No.105776083 [Report] >>105776148
>>105776072
Y tho
Anonymous No.105776086 [Report] >>105776104
>>105776023
>Time, the time-knife
If it's normal for that card then I apologise, I guess it was my lack of experience with kontext and the way other models are so comparatively quick in genning.
Anonymous No.105776104 [Report]
>>105776086
Hypothesis: most people here don’t even have a pc
Despite all the snobbery everyone here is crying about tensor rolling back the free gravy-train & now they have to spend their mctendies money to keep gooning
Anonymous No.105776112 [Report] >>105776120 >>105776177
ignore the age, this is your reminder that exercise balls are a thing and women jiggle barefoot when bouncing on them
Anonymous No.105776119 [Report]
>>105775770
Pika finna catch a SA court case
Anonymous No.105776120 [Report]
>>105776112
Anonymous No.105776135 [Report] >>105776374
>>105776072
Ask the guys at Charlie Hebdo, I'm sure they can help
Anonymous No.105776148 [Report]
>>105776083
Y not tho
Anonymous No.105776176 [Report] >>105776205 >>105776209
>>105776078
>Assuming this thread have smart anons with standards
Anon...
>>105775994
I'm thinking about this, not joking. What a shame it's probable I would get higher quality feedback in fucking reddit than 4chan. How low the mighty have fallen.
Anonymous No.105776177 [Report]
>>105776112
FED
Anonymous No.105776183 [Report] >>105776213 >>105776226 >>105776227
when 48GB drops onto mainstream will we finally see llm+t2i combos? So that the model talks to you, asking questions on how to refine the picture and what tags it understands
current black box approach is wasting a lot of time
Anonymous No.105776205 [Report] >>105776218 >>105776239
>>105776176
>muh mighty
Read the guide
Follow the instructions literally in the exact thread you’re whingeing in
Experiment
& most importantly, Have fun ;^)
Anonymous No.105776209 [Report]
>>105776176
I hope you overcome whatever is stopping you
Anonymous No.105776213 [Report]
>>105776183
this has been attempted multiple times on cloud models and never really went anywhere. something about refining back-and-forth just doesn't work. maybe related to that one paper from 2 years ago that talked about the issue with feeding AI itself and quickly collapsing its internal "world model" as a result
Anonymous No.105776215 [Report]
>>105775961
yes, https://arcenciel.io/
this is the only training community i found and they talk about mostly training illustrious loras. i stopped training months ago and don't care for realism so i'm not sure what else is out there.
Anonymous No.105776218 [Report] >>105776242
>>105776205
My question is can a 3060 train
And will it take more than 1 hour
Anonymous No.105776226 [Report]
>>105776183
>when 48GB drops onto mainstream
So, perhaps year 2032 then
Anonymous No.105776227 [Report] >>105776236 >>105776237
>>105776183
>when
if*
it has been 5 years of 24gb and there are no signs of it improving any time soon
Anonymous No.105776236 [Report] >>105776249
>>105776227
it has literally improved to 32gb this year
Anonymous No.105776237 [Report] >>105776238
>>105776227
There's an anon with RTX 6000 ITT
Anonymous No.105776238 [Report] >>105776243
>>105776237
Allegedly
Anonymous No.105776239 [Report]
>>105776205
What guide, it's completely shit. You don't know how to make a good tutorial. It's just snake oiling
Anonymous No.105776242 [Report] >>105776252 >>105776943
>>105776218
Train what ? What model, how many images, what resolution ?

For example, a 3060 12gb can train a Flux lora at 512x512 resolution with NF4 quantization, the results will be good enough, with 20-30 images it will take 4-6 hours to do 100 epochs which would transfer the style/likeness well enough.
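
The numbers above can be sanity-checked with a quick sketch. It assumes batch size 1 and no dataset repeats, neither of which is stated in the post, and the function names are just for illustration.

```python
import math


def total_steps(num_images, epochs, batch_size=1, repeats=1):
    """Optimizer steps for a run (batch size and repeats are assumed defaults)."""
    steps_per_epoch = math.ceil(num_images * repeats / batch_size)
    return steps_per_epoch * epochs


def seconds_per_step(hours, steps):
    """Average wall-clock time per step implied by a total run time."""
    return hours * 3600 / steps


if __name__ == "__main__":
    for images, hours in ((20, 4), (30, 6)):
        steps = total_steps(images, epochs=100)
        print(images, "images:", steps, "steps,",
              round(seconds_per_step(hours, steps), 1), "s/step")
```

Under those assumptions, 20-30 images at 100 epochs is 2,000-3,000 steps, and 4-6 hours works out to roughly 7 seconds per step in both cases.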
Anonymous No.105776243 [Report]
>>105776238
Careful, we don’t talk about things that COULD have allegedly happened, we cannot speculate whatsoever, we can not use inference. Period.
Anonymous No.105776249 [Report] >>105776254 >>105776257
>>105776236
woah, really??? sweet i can finally run SDXL slightly faster than 3 years ago!!
Anonymous No.105776252 [Report] >>105776272
>>105776242
>500x500
:(

I think the anon from a few threads ago was right, just train online with rented hardware
I break my render times up in groups to prevent heat-death
6 hours is never happening
Anonymous No.105776254 [Report]
>>105776249
that has nothing to do with vram lmao. lurk moar before you talk about things you do not understand
Anonymous No.105776257 [Report] >>105776295 >>105776298
>>105776249
Don’t belittle progress
Things you take for granted can evaporate overnight
Anonymous No.105776272 [Report] >>105776332
>>105776252
For all the problems with Flux, it's amazing at training at low resolution and still being able to generate high resolution images with the person/style without any artifacts or loss in detail. Unless you are training something like detailed full posters, 512-640 is often easily enough, and it obviously speeds up training a lot.
Anonymous No.105776295 [Report] >>105776317
>>105776257
>Things you take for granted can evaporate overnight
i can feel the bytes from my safetensors files slowly evaporating from radioactive decay as we speak
Anonymous No.105776298 [Report] >>105776313
>>105776257
nvidia drip-feeding gaymerslop to protect their already outdated a100s is not progress. AMD or intel developing an adequate rival to Cuda would be.
Anonymous No.105776313 [Report]
>>105776298
they both had projects, then scuttled them
then they both funded ZLUDA, then dropped it

they failed, and they lost. the only people who don't deserve the seethe are consumers left without a choice
Anonymous No.105776317 [Report]
>>105776295
Do you want ssd Failure?
Because this is how you get ssd Failure
Anonymous No.105776332 [Report]
>>105776272
The face being accurate is v important to me
Anonymous No.105776374 [Report] >>105776390
>>105776135
gonna need a ouija board
Anonymous No.105776386 [Report]
>>105774112
i have 64gb ram
https://files.catbox.moe/3aq184.mp4
exact workflow i use^^^
maybe increase your block swap size? if you dont have much ram you should use gguf and maybe load less loras? if you have plenty of ram then load loras in low mem mode
ps: fp8 is faster than q4 or q8
Anonymous No.105776390 [Report]
>>105776374
>open the portal! XD
Ya good idea
Anonymous No.105776416 [Report]
subgraphs when
Anonymous No.105776456 [Report] >>105776463 >>105776858
>bump limit reached
>44 images
Anonymous No.105776463 [Report]
>>105776456
holy sdxl frying
Anonymous No.105776469 [Report]
>>105775770
hello rtx 6000 pro enjoyer. please share your workflow
Anonymous No.105776491 [Report] >>105776501 >>105776762
>>105775961
you're limiting yourself very hard if you rely on /g/ for a decent ai community. There are multiple other diffusion threads on different boards and subreddits that have decent discussions and gens posted. /g/ is only a place of shills who only get excited to shill the next fancy FOTM meme model™ and comfyui workflow. You're better off going to /trash/ or /b/ for advice on lora training from other vramlets.
Anonymous No.105776501 [Report]
>>105776491
>/g/ is only place of shills
hello sir
Anonymous No.105776512 [Report]
>>105775994
reddit has plenty of good 18+ ai subreddits that i lurk around.
Anonymous No.105776527 [Report] >>105776545 >>105776586
Lets be honest, the only reason you should be on /g/ AI threads is if you're into celebs or kids or something else illegal so you can't post on reddit or discord

>n-no I'm just german so i just care about privacy and anonymity also I hate normies
that's just being into kids with extra steps
Anonymous No.105776540 [Report]
ok so why won't you go there and stay there?
Anonymous No.105776545 [Report]
>>105776527
Infected neovagina spotted
Anonymous No.105776586 [Report]
>>105776527
what the hell? what the helli?
https://www.tiktok.com/@fanduel/video/7503426695345573166
Anonymous No.105776740 [Report]
Anonymous No.105776762 [Report]
>>105776491
>bro just hang out in the communities using SDXL finetunes they're going to help you with training Wan
Anonymous No.105776824 [Report] >>105776846 >>105776858 >>105776859
What's the difference between let's say chroma-unlocked-v41.safetensors and chroma-unlocked-v41-detail-calibrated.safetensors?
Anonymous No.105776840 [Report]
>>105774164
>200W
>firehazard
Anonymous No.105776846 [Report]
>>105776824
>What's the difference between let's say chroma-unlocked-v41.safetensors and chroma-unlocked-v41-detail-calibrated.safetensors?
>>105774779
"detail calibrated" is "large", it means he's training the model at 1024x1024 instead of 512x512
Anonymous No.105776858 [Report]
>>105776456
really a bit thin eh. sorry I can't provide any new chroma material, working on sdxl set
>>105776824
'detail calibrated' has better detail, thus provides better detail on detail. insert graph here.
Anonymous No.105776859 [Report] >>105776869 >>105776890 >>105776962
>>105776824
unlocked 41 gives you vintage analog grain (noise artifacts) from training at 512x512 that make the model look so authentic to the early 2000s myspace era
detailed calibrated is trained at 1024x1024 so you get extra-smooth vaseline smear from the High Quality synthetic dall-e 3 dataset.
Anonymous No.105776869 [Report] >>105776910
>>105776859
Do you have anything better to use in mind?
Anonymous No.105776890 [Report] >>105777320
>>105776859
>detailed calibrated is trained at 1024x1024 so you get extra-smooth vaseline smear from the High Quality synthetic dall-e 3 dataset.
that's because he started to make the detail calibrated in v34, and since v34 calibrated is v1 for large, it means that the large model is really undertrained, for example on v41, he's basically mixing a v41 ""base"" model (that's still a part distilled with fast) with a v7 large model, this is such a shitshow lol
Anonymous No.105776910 [Report]
>>105776869
no it was a joke. but generally i still prefer non-detail because detail feels a bit slopped in comparison still.
Anonymous No.105776930 [Report]
For those who want to install the newest version of Sage2++, you can use this
https://www.youtube.com/watch?v=QCvrYjEqCh8
Anonymous No.105776943 [Report] >>105776994 >>105777320
>>105776242
> with 20-30 images it will take 4-6 hours to do 100 epochs
> 2000000-3000000 steps in 4-6 hours
Damn, 3060 is huge.
Anonymous No.105776962 [Report]
>>105776859
Bullshit. You will get less analog feel with 512x512 vs 1024x1024 because so much 'noisy' detail is lost when converting the images to latents to be trained.

You don't know anything about the subject yet you keep spouting off nonsense.

Seek help.
Anonymous No.105776976 [Report]
GET THE FUCK OUT
>>105776972
>>105776972
>>105776972
Anonymous No.105776994 [Report] >>105777111
>>105776943
It's a good budget card. Unlike the 4060 and 5060 models, which have a 128-bit bus, the 3060 has a 192-bit bus. The 5060 has GDDR7 VRAM though, which speeds things up. The 4060 series is absolute shit though, steer clear.

Best bang for buck still has to be the 3060 Ti, I doubt it will ever be beaten in value.
Anonymous No.105777111 [Report] >>105777148
>>105776994
what about a 4090 card?
Anonymous No.105777148 [Report]
>>105777111
4070 and upwards have 192-bit bus or larger, 4090 has 384-bit bus
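
To put numbers on why bus width matters, here is a small sketch of the usual peak-bandwidth formula: bus width in bytes times the per-pin data rate. The bus widths are the ones mentioned in this thread. The per-pin data rates are my assumption from memory of the spec sheets and should be double-checked.

```python
def bandwidth_gb_s(bus_width_bits, data_rate_gbps):
    """Theoretical peak memory bandwidth in GB/s."""
    return bus_width_bits / 8 * data_rate_gbps


# (bus width in bits, assumed per-pin data rate in Gbps)
CARDS = {
    "RTX 3060": (192, 15),  # GDDR6, rate is an assumption
    "RTX 4060": (128, 17),  # GDDR6, rate is an assumption
    "RTX 5060": (128, 28),  # GDDR7, rate is an assumption
    "RTX 4090": (384, 21),  # GDDR6X, rate is an assumption
}

if __name__ == "__main__":
    for name, (bits, rate) in CARDS.items():
        print(f"{name}: {bandwidth_gb_s(bits, rate):.0f} GB/s")
```

With those assumed rates, the narrower 128-bit bus only catches up when the memory itself is much faster, which is the point about the 5060's GDDR7.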
Anonymous No.105777320 [Report]
>>105776890
Pretraining on a lower resolution is how every model is initially trained... the difference here is the smaller dataset size and the weird mixing method. Who knows what their basis for the latter is. They aren't dumb so I'll have some faith in the anthro wolf shaggers
>>105776943
Your maths might be a little off
Anonymous No.105777357 [Report]
>RifleXRoPE extends WAN video output by an additional 3 seconds, increasing the frame count from 81 to 129. However, it comes with some limitations: Higher VRAM usage, Longer generation time, A tendency to revert/loop the scene to its original state by the end of the video
How is this any different than just increasing the length?
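
For reference, the frame counts in the quote line up with the "3 seconds" figure if you assume 16 fps output. That frame rate is my assumption, it is not stated in the quote.

```python
def duration_seconds(frames, fps=16):
    """Clip length in seconds; 16 fps is an assumed output rate."""
    return frames / fps


if __name__ == "__main__":
    base = duration_seconds(81)
    extended = duration_seconds(129)
    print(f"{base:.2f}s -> {extended:.2f}s (+{extended - base:.2f}s)")
```

This only checks the arithmetic. It says nothing about how the extension behaves compared to simply raising the frame count.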