← Home ← Back to /g/

Thread 106851472

338 posts 262 images /g/
Anonymous No.106851472 >>106851479 >>106852428
/ldg/ - Local Diffusion General
Some Models Next Week Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106848716

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Nick No.106851479 >>106851497 >>106851734
>>106851472 (OP)
thanks this is my life's work
Anonymous No.106851482
Anonymous No.106851488
debo world
Anonymous No.106851497
>>106851479
thanks for the Korn lady Nicholas
Anonymous No.106851498 >>106851549
>lightx2v/Wan2.1-T2V-14B-CausVid
wtf are they doing, they release everything except i2v
Anonymous No.106851503 >>106851552
Blessed thread of frenship
Anonymous No.106851505 >>106851552
Blessed thread of frenship
Anonymous No.106851507
Anonymous No.106851533 >>106851542 >>106851564 >>106851605 >>106851615 >>106851907
New to this. First time training a chroma Lora and it came out pretty good in the sampler (ai-toolkit), and I’ve been experimenting with a simple workflow based on the default Comfy chroma template, adding only my Lora to the graph (picrel). Results have been good but inconsistent. I’ve been using a fixed seed so that I have reproducibility between all experiments. The settings in picrel repeatably make a VERY good representation of the subject. Like so good I think I could use it to train with… HOWEVER:

>If I change only the seed, results are often inconsistent in both facial retention, body shape, and other details.

>Even with the β€œperfect seed” if I make what I think are minor changes to the prompt (eg to change her position etc) I get similarly inconsistent results.

>If I remove the inconsistent typo (β€œShe's in a relaxed pose with her right arm on her hip. Her She”) I lose some facial retention and get weird arm artifacts.

I noticed during my training that by the final epoch some samples were spot on and some were meh. Does all this point to my LORA being shit? Should I go back and retrain until all 10 training samples are perfect? I’m using the default training prompts from ai-toolkit. Is my prompting shit? It’s just cobbled together crap I found.

Given the workflow I’ve made, if my LORA is actually good what consistency should I be expecting? What other variables are at play in terms of locking in a body shape? The body from the current settings is something I’d like to bake in somehow (re-training with the generated images?) I don’t know yet how to explain the wild inconsistency. Any advice appreciated, sorry I can’t post the actual pix or lora. Using a runpod L40S.
Anonymous No.106851542
>>106851533
maybe end you're life
Anonymous No.106851549 >>106851560
>>106851498
>our model excels at producing coherent long-form videos

Wait, so is this actually long video? Is it finally here? Is this their aim? I'm interested to know their choice of 2.1 and causvid
γƒγ‚Ήγƒˆγ‚«γƒΌγƒ‰ !!FH+LSJVkIY9 No.106851552 >>106851561
>>106851503
>>106851505
blessed threads of frenly zones ;D
Anonymous No.106851560
>>106851549
Forgot link https://huggingface.co/lightx2v/Wan2.1-T2V-14B-CausVid
Anonymous No.106851561 >>106851704
>>106851552
go back
Anonymous No.106851564 >>106851588 >>106851588
>>106851533
from what I found chroma in general tends to be very inconsistent and you need to reroll many times to get the right result. doesnt matter whether you use a lora or not.
Anonymous No.106851570 >>106851593
did that sadamoto yoshiyuki lora ever get posted somewhere?
Anonymous No.106851571 >>106851607
what is the difference between DC-2k and chroma-HD?
Anonymous No.106851580
Is it me or does Img2Img just not work with Chroma? I always get really bad results. Is there a trick?
Anonymous No.106851584
Anonymous No.106851588 >>106851603
>>106851564
>>106851564
Interesting... have you tried qwen-image? I trained a qwen lora with the same dataset and steps, and pictures and consistency and detail is pretty good but I haven't figured out how to get it to do the nsfw stuff well at all (which I understand is absent from its training?)
Anonymous No.106851593 >>106851641
>>106851570
wasn't good enough, have to re-tag and re-train
Anonymous No.106851603
>>106851588
no, because qwen is censored therefore you will need many NSFW loras to make explicit content where as with chroma you only need a lora for your character/person.
Anonymous No.106851605 >>106851656
>>106851533
There is no way in hell that Chroma or any model for that matter understand hip and breast measurement sizes.
Anonymous No.106851607 >>106851682
>>106851571
Everything with HD in its name is ass.
Anonymous No.106851615 >>106851618 >>106851622 >>106851648 >>106851670 >>106851834
>>106851533
Why do you fucks keep genning basic fat bitches? Can you rotate an apple in your head?
Anonymous No.106851618
>>106851615
>Can you rotate an apple in your head?
use case?
Anonymous No.106851620 >>106851627
Anonymous No.106851622 >>106851639
>>106851615
>Can you rotate an apple in your head
what does this mean
Anonymous No.106851627 >>106851734
>>106851620
Anonymous No.106851637 >>106851659
I just imagined anons 1girl in my head, rotated her 180 degrees, took her clothes off, raped her, and cut off all her limbs.
Anonymous No.106851639 >>106851722
>>106851622
if you cant it means you're an npc with no imagination.
Anonymous No.106851641 >>106851738
>>106851593
i really liked the look of the example pics though.
if the new version ends up looking quite a bit different, would you consider uploading the older version as well?
Anonymous No.106851648
>>106851615
maybe that's anon fat wife and he wants to see her in sexual poses though i dont know why he doesnt just ask her i mean she's right there wtf are you doing
Anonymous No.106851656 >>106851661
>>106851605

lol yeah, if I use this

>breast size A, medium waist size, large hips.

I get similar proportions but a totally different stance and a decent face

If I actually remove that line above entirely, I get a literal alien body horror show. WTF?
Anonymous No.106851659
>>106851637
>and cut off all her limbs.
Speaking of it.
Are there any remarkable guro/gore Qwen Edit loras, or Wan Loras?
Anonymous No.106851661
>>106851656
how about you go back to /b/? hmmm?
Anonymous No.106851670
>>106851615
Post a SINGLE gen you've made for the class to see your superior taste.
Anonymous No.106851682 >>106851692
>>106851607
so DC-2k is good?
Anonymous No.106851692
>>106851682
yeah
γƒγ‚Ήγƒˆγ‚«γƒΌγƒ‰ !!FH+LSJVkIY9 No.106851704
>>106851561
>"We couldn't detect valid metadata in this image.
>Outputs based on this image must be PG, PG-13, or they will be blocked and you will not be refunded...!"
Anonymous No.106851717
Anonymous No.106851722 >>106852489
>>106851639
I can't visualize shit in my head
wish I had this superpower, but it has nothing to do with imagination
γƒγ‚Ήγƒˆγ‚«γƒΌγƒ‰ !!FH+LSJVkIY9 No.106851734 >>106851845 >>106852489
>>106851479
>>106851627
who IS this man??? ;o
Anonymous No.106851738
>>106851641
it had like 70% fail rate, overcooked with too low resolution. next version will be same but better so no worries
Anonymous No.106851797
Anonymous No.106851803 >>106851914 >>106852159
Anonymous No.106851827 >>106851852
I'm enjoying qwen image edit, but is it possible to use it with a mask ?
I had big trouble inpainting this picture, qwen kept messing up with the characters wings, i had to use flux with mask inpaint back again.
Anonymous No.106851834
>>106851615
I have aphantasia, cool it with the anti p-zombie remarks
Anonymous No.106851845
>>106851734
its ani
Anonymous No.106851852 >>106851926
>>106851827
https://github.com/scraed/LanPaint?tab=readme-ov-file#example-qwen-edit-2509-inpaint
Anonymous No.106851875
Anonymous No.106851907
>>106851533
jesus christ, im gonna have to get into AI image gen now. There simply aren't enough smooth plapable bitches to crank my hog to, Ill have to generate them.
Anonymous No.106851914
>>106851803
cool style. catbox?
Anonymous No.106851925
Catpiss-anon is wreaking havoc again.
Anonymous No.106851926 >>106851983
>>106851852
aside from the water coming from the ceiling light, this is a really aesthetic gen
Anonymous No.106851938 >>106851974 >>106851984 >>106851996 >>106852030 >>106852069 >>106852174
The difference between NetaYume 3.0 and 3.5 is a bit harder to define than 2.0 Plus vs 3.0, but I do think 3.5 is another modest improvement overall. Main thing I've noticed is eye proportions for both male and female characters make a bit more sense in 3.5, and it adds some nice relevant details in appropriate contexts where 3.0 didn't, like the sword here. Prompt (sans boilerplate / neg) was just `masterpiece, best quality, very aesthetic, a 2d digital anime illustration of a samurai warrior in traditional armor, standing in a cherry blossom garden.`
Anonymous No.106851974 >>106852063
>>106851938
desu artists are better in 3.5 IMO as well
Anonymous No.106851983 >>106851996
>>106851926
ceiling mounted "rain showers" with some or many integrated lights - it could literally be this way IRL

sure: it maybe just dreamt this up, hard to tell
Anonymous No.106851984 >>106851997 >>106852071
>>106851938
I don't think you understand how these models work. Your scientific comparison is useless.
Anonymous No.106851996 >>106852024
>>106851938
looks quite a bit better in my subjective opinion. not just the eyes but the whole face looks way less sloppy
sword also breaks some of the obnoxious symmetry, but still way too much of that in this pic imo
>>106851983
sure but shes still wasting water with the other shower head in that case, unless maybe theres a dwarf standing under it just out of sight
Anonymous No.106851997
>>106851984
>I don't think
Of course you don't.
Anonymous No.106852001
Anonymous No.106852020
Anonymous No.106852024
>>106851996
>sure but shes still wasting water with the other shower head in that case
seen that IRL too. quite a few women even apply soap/shampoo without turning their shower off at all, yes.
Anonymous No.106852027 >>106852029 >>106852032
Gemini or JoyCaption for wan captions?
Anonymous No.106852029
>>106852027
janus
Anonymous No.106852030
>>106851938
>boilerplate
do you prepend your positive and negative prompts with "You are an assistant designed to generate anime images based on textual prompts. " like the examples?
Anonymous No.106852032
>>106852027
grok
Anonymous No.106852040
Anonymous No.106852051 >>106852074
Anonymous No.106852060
Anonymous No.106852063 >>106852113
>>106851974
yeah I haven't tried a ton yet. I still don't really understand his explanation of what 3.5 actually is on the Civit page lol, but it doesn't seem to be that important, whatever it is its not worse than 3.0 so whatever
Anonymous No.106852069 >>106852098
>>106851938
Testing now, but your pic like you said is a small improvement. I think Yume being honest that it is 3.5 is fair
Anonymous No.106852071
>>106851984
what? it was the same seed and same prompt and same sampler / scheduler settings, how else would you compare two versions of the same model lol
Anonymous No.106852074 >>106852091 >>106852112
>>106852051
aesthetically pleasing
Anonymous No.106852082
Anonymous No.106852091 >>106852175
>>106852074
nice eyes
Anonymous No.106852098 >>106852148
>>106852069
I'd rather he be careful and make small improvements like this than just YOLO train like some people do and wind up with like enormous seed variance between versions, meaning there'd be more likely noticeable new deficiencies in some area
Anonymous No.106852112 >>106852129 >>106852154 >>106852175
>>106852074
ty radiance chad. I'm not having a ton of luck with radiance atm

>Prompt executed in 14.46 seconds
Chroma1-HD-Flash again. I think aura flow shift 3 makes it slightly better
Anonymous No.106852113
>>106852063
i think hes saying 3.5 isnt DPO'd which IIRC is something anon really wants in a model. no dumbass "human preference".
Anonymous No.106852129 >>106852134
>>106852112
Not a dig, but man these images look cooked broski. As if the CFG is set high.
Anonymous No.106852134
>>106852129
MFW CFG 1
Anonymous No.106852142
Anonymous No.106852148
>>106852098
Yeah I like it, at least he is active on discord and we are getting model updates pretty fast. Did you prompt the sword?
Anonymous No.106852154 >>106852257
>>106852112
>"Computer! Add a ring to the creature's middle finger. The ring is of a shiny silver material and should have a noticeably large blue engraving of the Star of Remphan"
Anonymous No.106852159 >>106852215
>>106851803
niceu
Anonymous No.106852174
>>106851938
Artist styles are more pronounced, anatomy is better (not perfect, but better), seems to retain the creativity and prompt adherence.
Also tried quite a few more erotic prompts and it seems to grasp NSFW better too.
Anonymous No.106852175 >>106852235
>>106852091
ty

>>106852112
>ty radiance chad. I'm not having a ton of luck with radiance atm
2d/3dcg 1girls (often still with flawed hands) are clearly among the best trained concepts right now in case you're trying to do something else. it's not a direct continuation of chroma-1 base, it doesn't know some other stuff AS well as it worked on pre-radiance chroma

cool demon hand on a globe!
Anonymous No.106852190
Anonymous No.106852207 >>106853034
Anonymous No.106852215
>>106852159
sank yew
Anonymous No.106852216
Yume doesn't matter because NovelAI exists
>But saas
The animeland is Illustrious or novelAI. Everything else is a failed experiment.
Anonymous No.106852224
You can come up with something better than that, anon.
Anonymous No.106852235
>>106852175
>2d/3dcg 1girls (often still with flawed hands) are clearly among the best trained concepts right now
It works with a LoRA trained on HD pretty well. Need to do a sampler/scheduler sweep on radiance. Wish it was native in Comfy to queue all the options in a menu
Anonymous No.106852236
Not worth the effort since only one person uses it
Anonymous No.106852243
Anonymous No.106852254
Anonymous No.106852257 >>106852282
>>106852154
how does a star of remphan look? i guess this is a transformed israeli instead.
Anonymous No.106852279 >>106852418
Anonymous No.106852282 >>106852445
>>106852257
holy shit its perfect
Anonymous No.106852298
Anonymous No.106852353
Anonymous No.106852354
Anonymous No.106852367
Anonymous No.106852387 >>106852422
is pony v7 as good as everyone expected it to be?
Anonymous No.106852399
Anonymous No.106852417
Anonymous No.106852418 >>106852445
>>106852279
a masterpiece.
Anonymous No.106852422
>>106852387
That fact that it got token discussion and wasn't mentioned until you brought it up should say everything about it.
Anonymous No.106852428
>>106851472 (OP)
Anonymous No.106852435
Anonymous No.106852441
badger badger badger badger
Anonymous No.106852445
>>106852282
good

>>106852418
yes, it used neural
Anonymous No.106852446 >>106852463 >>106852539 >>106852726 >>106853016
Here is your favourite elf!
Make sexy animations with her and you will be rewarded with more spicy pics!
Anonymous No.106852454
Anonymous No.106852463
>>106852446
I don't want to.
Anonymous No.106852467 >>106853016
Anonymous No.106852489
>>106851734
>who IS this man??? ;o
he is a namefag who goes by tenta who was doxxed on /b/ by his crazy ex gf for being too much of a pedo (serious)
he left /b/ and has hung out in /g/ ever since
he's probably one of the best schizogenners of all time if we're being honest. recognizable style over the years but always some weird liminal madness that felt like you had to be mentally ill to conjure it up

>>106851722
>I can't visualize shit in my head
>wish I had this superpower, but it has nothing to do with imagination
It's not a superpower, its what 95% of the world is able to do. I have aphantasia too. Is there 3 anons itt with aphantasia right now??
Anonymous No.106852502
Anonymous No.106852530
Anonymous No.106852533
Anonymous No.106852539 >>106852997 >>106853016
>>106852446
Anonymous No.106852659
cozy
Anonymous No.106852669 >>106852691 >>106852742 >>106852842 >>106852857 >>106853162 >>106853229
>took the time to learn comfyai
Ngl its not as bad as the people said it would be to learn. Is it one of those things where it gets annoying when you dig into it more
Anonymous No.106852691
>>106852669
It makes sense to programmers but for anyone who wants simple and easy to understand, it sucks because it's a regression from A1111 in that regard for their smooth brains which I am not insulting, it's just fact.
Most people haven't gotten click to install easy to work because the tools need to be build fundamentally so that a normal person on Windows can click an .exe, install, and then off to the races with buttons. Even A1111 was not like this but people got quite close, the main issue is the complexity doesn't scale well, which ComfyUI once you understand does well with workflows and how it does management of models and LoRAs.
Anonymous No.106852703
Anonymous No.106852726 >>106853004 >>106853016
>>106852446
Anonymous No.106852742
>>106852669
same feeling for me. hated it at first, but once I got the hang of it so many things were unlocked. Some custom node stuff is annoying but overall it is fun being able to experiment. Doesn't hurt that gpt5-thinking and grok4 are good at looking at workflows and giving feedback lol
Anonymous No.106852771 >>106853133 >>106853501
MMAudio is criminally underrated.
It's a nice copium while we don't get a true local Veo 3 or Sora 2.

https://files.catbox.moe/ls6f60.mp4
https://files.catbox.moe/742l9a.mp4


Reminder that Wan S2V is also a thing (but it is its own dedicated model unfortunately)
Anonymous No.106852842 >>106852862
>>106852669
you learned for nothing, it's over. no more new local models. the api era is now
Anonymous No.106852856
stableslop > uncomfy noodle
Anonymous No.106852857
>>106852669
>Is it one of those things where it gets annoying when you dig into it more
Yes, slightly. Once you realize how powerful it is, the things that do not work suddenly become maddening. Not in the way the trolls portray it, but definitely in a
>oh my goodness this is so close to working, what is this node bug/UI quirk/python weirdness etc
I would much rather engage in spaghetti weaving than going back to A1111 style gens, but there is genuine frustration in the noodles.
Anonymous No.106852862
>>106852842
>pircel
I have the same expression when I look out my bedroom window and see those fucking NPCs crossing the street.
Anonymous No.106852887
I often wonder if NPCs are even capable of having dreams.
Or if maybe they're just living in ours.
Anonymous No.106852919
https://huggingface.co/spaces/wcy1122/DreamOmni2-Edit
>another snakeoil
it won't stop these days, can't stop taking those Ls
Anonymous No.106852997 >>106853016
>>106852539
Nice! Here is another pic.
Anonymous No.106853004 >>106853016 >>106853950
>>106852726
Sweet! Here is another one.
Anonymous No.106853016 >>106853022
>>106852446
>>106852467
>>106852539
>>106852726
>>106852997
>>106853004
>another avatarfag
jesus, it never stops in this place, fortunately that one is easy enough to filter out
Anonymous No.106853021 >>106853237
is there a solution for color correcting two clips so you can seamlessly edit them together? i have tried the comfy color match nodes and multiple video editor but none can do it. there is no way i can do it manually
Anonymous No.106853022 >>106853058 >>106853097
>>106853016
are you pretending to be retarded?
Anonymous No.106853034 >>106853041 >>106853043
>>106852207
Is it radiance or your prompting that always leaves a noticeable texture to everything?
Anonymous No.106853041
>>106853034
It's radiance.
Anonymous No.106853043 >>106853068
>>106853034
that's just the iconic Chroma noise
Anonymous No.106853058 >>106853097
>>106853022
He doesn't need to pretend.
Anonymous No.106853068
>>106853043
Damn, he always gens good 1girls but the texture makes it a bit off.
Anonymous No.106853097 >>106853118
>>106853022
>>106853058
>another avatarfag protecting his fellow avatarfag
color me shocked
Anonymous No.106853118 >>106853128
>>106853097
>another no-gen response
We're generating what we feel like generating.
What would you prefer to see, anon?
Maybe some clowns so you won't feel so out of place?
Anonymous No.106853128 >>106853135
>>106853118
>no-gen
that answer is /sdg/ coded, you need to go back
Anonymous No.106853133 >>106853143
>>106852771
https://litter.catbox.moe/ov1h3rgen4mhlysd.webm
Anonymous No.106853135 >>106853139
>>106853128
>implying
and you need to tongue my anus
Anonymous No.106853139 >>106853144
>>106853135
>you need to tongue my anus
I'm not a faggot like you so I'll kindly refuse
Anonymous No.106853143
>>106853133
true, MMAudio is criminally undorighted
Anonymous No.106853144 >>106853151 >>106853521
>>106853139
Cool, let us know when you want to contribute something besides salt.
In the meantime I will avatarfag as a green frog.
Anonymous No.106853151 >>106853165
>>106853144
what are you contributing? broken rules?
Anonymous No.106853162 >>106853208 >>106853229 >>106853318
>>106852669
ngl, anistudio is the best of both world when it's out. easy to install and components open up a lot of unexplored implementations. game engine design is the future because it can give you an abstract of something simple and give you a deeper autism than nodes can provide. you can even make nodes out of components. comfy can't beat that
Anonymous No.106853165 >>106853197
>>106853151
>literally threatening anon
Yes, please report me for avatarfagging as a green frog, go ahead.
Then scurry back to plebbit.
Anonymous No.106853180 >>106853206
What is it about this thread that attracts such buttmad troons?
Anonymous No.106853197
>>106853165
>he missed
pepe is just too pure for this world
Anonymous No.106853206 >>106853221 >>106853817 >>106853841
>>106853180
/sdg/ avatarfags (those "peopl" are the main reason /sdg/ died) want to kill this general too, they're like viruses, if you let them spread, it's over
Anonymous No.106853208
>>106853162
lol
Anonymous No.106853221 >>106853227
>>106853206
How many similar gens am I allowed to post before it's 'breaking the rules'?
Pray tell.
Anonymous No.106853227 >>106853231 >>106853301 >>106853381
>>106853221
just stop avatarfagging nigger. you dont have to post the same image 100s of times.
stop shitting up the thread or are you too retarded to understand this?
just dont be a faggot.
Anonymous No.106853229
>>106853162
>>106852669
>ngl
Anonymous No.106853231
>>106853227
this
Anonymous No.106853237
>>106853021
>multiple video editors
This doesn't tell anything especially if they are freeware shit.
You need to learn the correct workflow. Match the blacks and whites first.
Anonymous No.106853273 >>106853288
Anonymous No.106853288
>>106853273
Now animate it.
Anonymous No.106853301
>>106853227
>post 3 gens of an anime character
>react to a retard with 5 amphibious gens
So a maximum of 4 similar gens before the avatar police are called, got it.
Anonymous No.106853318
>>106853162
a game engine with this stuff built in sounds infinitely better than whatever webslop we are using
Anonymous No.106853360
Anonymous No.106853376
Anonymous No.106853381 >>106853464
>>106853227
What's wrong with Avatar? You have inreresting psychosis xD
Anonymous No.106853457 >>106853475
https://litter.catbox.moe/zpak3566ec5dhkd6.webm
Anonymous No.106853464
>>106853381
>pixai
Your non-localfag gen has been reported to the authorities.
Enjoy your b&
Anonymous No.106853472
Anonymous No.106853475 >>106853743
>>106853457
VibeVoice + Wan S2V?
Anonymous No.106853501 >>106853516 >>106853575
>>106852771
No, it's trash even for just a copium.
Anonymous No.106853516
>>106853501
It does an okay job for stuff that doesn't have voice/vocals in my opinion
https://files.catbox.moe/uqc7o4.mp4
Anonymous No.106853521 >>106853527
>>106853144
https://litter.catbox.moe/0zkw5it7owl4h3f0.webm
Anonymous No.106853527
>>106853521
KEK
Anonymous No.106853537
someone wake me up the day local can do fucking MAD videos like this >>>/wsg/5995145
>inb4 what is "MAD"
newfag!
https://www.youtube.com/watch?v=fz_KNTsP0cQ
Anonymous No.106853553
Anonymous No.106853575
>>106853501
whoever said that it works for nsfw must've been smoking crack
Anonymous No.106853637
Anonymous No.106853639
comfy being decidedly uncomfy again
Anonymous No.106853646
Anonymous No.106853652 >>106853675
Anonymous No.106853675
>>106853652
moar cyborgs plox
Anonymous No.106853743 >>106853765
>>106853475
https://litter.catbox.moe/z4kcqhpchizxa77m.webm
Anonymous No.106853765
>>106853743
Can it do "normal" sounds, like a person starting a car and driving it in high speeds?
Anonymous No.106853806 >>106853812
What's the SOTA local for masked-region prompting?
Anonymous No.106853812 >>106853896
>>106853806
I've seen Qwen Edit inpaint loras (where it replaces what is masked in a particular color), look it up
Anonymous No.106853814 >>106853825
Anonymous No.106853817 >>106853837 >>106853863
>>106853206
Are those people in the same room with you right now?
Anonymous No.106853825 >>106853832
>>106853814
Neta anon, do you prompt for artists?
Anonymous No.106853832
>>106853825
yes
Anonymous No.106853837 >>106853843 >>106853872
>>106853817
you are in the room so yes
Lumi No.106853841
>>106853206
boo!
also, i disavow all of this. we just want peaceful coexistence.
Anonymous No.106853843
>>106853837
What do you mean?
Anonymous No.106853863
>>106853817
what about me :3
Anonymous No.106853872 >>106853880 >>106853909
>>106853837
>still bickering about avatarfaggotry
Not everyone is a terminal /g/fag concurrent with the entire history of board drama...
Might I suggest putting a disclaimer in an easy to read place?
Something like a maximum of 4 similar images, as we discussed.
>pic unrelated
Anonymous No.106853880 >>106853893
>>106853872
>jeez, I wonder why that anon is so weary about avatarfaggotery, it's not like those guys killed a general (/sdg/) or someth... oh...
Anonymous No.106853885
Anonymous No.106853893 >>106853898 >>106853909
>>106853880
At least tell me how avatarfags 'killed it'
because last I checked it was just less quality than this thread
...and I just checked again, and it's still there?
Anonymous No.106853896
>>106853812
Thanks, it looks neat but not quite what I'm looking for. I want text to image, but where the prompt changes slightly in different parts of the canvas
Anonymous No.106853898 >>106853919
>>106853893
try to make your best guess on why avatarfagging is against the rules
Anonymous No.106853909 >>106853919
>>106853872
>>106853893
cooked garbage gens
Anonymous No.106853919
>>106853898
Because we must remain trapped in a maze of confusion and never knowing who we're speaking to or else we might escape the matrix?
I seriously don't know, I'm also not avatarfagging either.

>>106853909
Why yes, I'm running SD1.5 on a smart refrigerator.
It's all I have.
Thanks for noticing.
Anonymous No.106853930
Anonymous No.106853933 >>106853951
Anonymous No.106853943
glowie melty?
Anonymous No.106853950
>>106853004
https://litter.catbox.moe/yfs54isq57bbo9nz.mp4
Anonymous No.106853951 >>106853973
>>106853933
omg its mi-.. wait this isnt migu
Anonymous No.106853973
>>106853951
Sorry to disappoint lol
Anonymous No.106853992
>click on /sdg/
>58 posts containing variations of approximately 5 different images.
Okay, I see what you mean now.
Newfag standing down...
Have a blessed evening!
Anonymous No.106854001 >>106854025
https://www.reddit.com/r/StableDiffusion/comments/1o3o1ax/rcm_sota_diffusion_distillation_fewstep_video/
>RCM : SOTA Diffusion Distillation & Few-Step Video Generation
is it better than the self forcing method (lightvx)?
Anonymous No.106854025 >>106854028
>>106854001
Nobody knows until we get an implementation because all bechmarks are lies.
Anonymous No.106854028
>>106854025
>implementation
it's just a lora you can just run it just like that?
Anonymous No.106854060 >>106854069 >>106854077
https://rectified-cfgpp.github.io/
trust me bro this time we'll replace cfg bro, I know there's like hundreds of attempts that turned out to be snakeoils but this one is the good one bro
Anonymous No.106854069
>>106854060
>no comparison with original cfg++
Anonymous No.106854077 >>106854112
>>106854060
You logged my IP. What's the catch?
Anonymous No.106854099
Anonymous No.106854112 >>106854133
>>106854077
>127.0.0.1
bro ... you are in MY HOUSE?!?!?
Anonymous No.106854129 >>106857065
Anonymous No.106854133
>>106854112
WHAT?! MY IP IS 127.0.0.2 WHAT HAVE YOU DONE?!
Anonymous No.106854194 >>106854206 >>106854222 >>106854223 >>106855674
man, ive also read that this vietnamese fag decide to just train tags instead of NL because 'uwaaah, i didnt have anyone to review nl captions... so I just dropped them lol!'. I mean having gemini/gpt or even joycaption caption your shit would be better than to caption at all.
I fucking hate these subhumans
Anonymous No.106854198
Anonymous No.106854206 >>106854215
>>106854194
>I mean having gemini/gpt or even joycaption caption your shit would be better than to caption at all.
You think those don't need QC?
Anonymous No.106854215
>>106854206
you can have another LLM do the QC. Point is, even without QC, don't you think that adding non QC'd natural language captions would be better than no NL captions at all? Modern models have very good performance, man even fucking GEMMA 27b is good at it (except it doesnt do NSFW), so you're telling me that 95% good/passable NL captions are worse than 0%?
Anonymous No.106854219
I will never prompt as if I'm a VLM. You can't make me.
Anonymous No.106854222 >>106854232 >>106854233
>>106854194
>train tags instead of NL
good. neta works just fine with tags + minimal nl. anybody who thinks having to write a novel of gpt slop for a prompt is a good idea is subhuman
Anonymous No.106854223
>>106854194
Considering I have yet to get a decent answer for my question, I can see why.
Anonymous No.106854232 >>106854243
>>106854222
the point being you want the model to be able to generalize, if your dataset isn't evenly captioned like in this case, NL becomes less effective, while tags will be more effective.
retard
Anonymous No.106854233
>>106854222
trips of truth
Anonymous No.106854234
1boy, anonymous, fellatio
Anonymous No.106854243 >>106854268
>>106854232
>NL becomes less effective, while tags will be more effective.
now go ahead and explain why this is bad
Anonymous No.106854248 >>106854256 >>106854261 >>106854262
give me new prompt ideas
Anonymous No.106854256 >>106854266
>>106854248
get her on her knees and make her suck a dick
Anonymous No.106854258
I'm okay with 2 more years of SDXL until the saas replacement is created
Anonymous No.106854259
>lobotomizing a language to a short list of words is actually good
Anonymous No.106854261
>>106854248
Okay, hear me out...
2girls
Anonymous No.106854262
>>106854248
white skin, glowing eyes, halo
Anonymous No.106854266
>>106854256
thats easy with oral insertion lora, next
Anonymous No.106854268
>>106854243
>model losing the ability to actually follow NL is good!
kys, it's one of the selling features of neta lumine, I'll just go back to ill/noob
Anonymous No.106854272
>neta
Failed model, I accept your concession
Anonymous No.106854273
>noo sir you need to write a 10 page novel describing the texture girth, and smell of her penis instead of just writing large penis, veiny penis, futanari
Anonymous No.106854282
>her
Anonymous No.106854286 >>106854288 >>106854290 >>106854293
redpill me on netayume
Anonymous No.106854288
>>106854286
no
Anonymous No.106854290
>>106854286
Finetune of a model that wasn't finished baking
Anonymous No.106854293
>>106854286
It's alright.
Anonymous No.106854311 >>106854654
the fuck is peft and NaDiT and how do you update it?
Could not import 'NaDiT' from any of the paths: ['custom_nodes.ComfyUI-SeedVR2_VideoUpscaler.src.models.dit_v2.nadit', 'ComfyUI.custom_nodes.ComfyUI-SeedVR2_VideoUpscaler.src.models.dit_v2.nadit', 'src.models.dit_v2.nadit']. Last error: peft>=0.17.0 is required for a normal functioning of this module, but found peft==0.15.2.
Anonymous No.106854334 >>106854346
Anonymous No.106854343 >>106854363 >>106855706
I accidentally updated my comfy and now I see a really shitty looking new ui.
how fucked am I?
Anonymous No.106854346
>>106854334
make her do some sit ups and push ups.
Anonymous No.106854355
Anonymous No.106854363
>>106854343
its over, uninstall, low level format your drives, sell the pc and just buy a comfy cloud subscription
Anonymous No.106854385 >>106854395 >>106855713
Is SATA ssd good enough for models? I don't want to waste my last M2 slot until 4TB drives drop in price.
Anonymous No.106854395
>>106854385
unless you're swapping your ssd speed should only affect your model load times
Anonymous No.106854419 >>106854663 >>106854713 >>106854798 >>106854840
I might be too fucking dumb for this, I cant get an image to generate
I install stable diffusion
dl wai and put it in my models/stable-diffusion folder
I run stable difusion in cmd prompt window, it opens in my web browser
I choose the model in the drop down menu and then type my prompts and click start
the bar moves but no progress is ever made
AMD gpu

what is wrong?
Anonymous No.106854654
>>106854311
Peft is a hugging face lora lib. The most foolproof way is probably to update the requirements.txt of the last node you've installed with the relevant version number and then do requirements reinstall.
Anonymous No.106854661 >>106854669
Anonymous No.106854663
>>106854419
>amd gpu
bro....
Anonymous No.106854669 >>106854683
>>106854661
asuka a shit.
Anonymous No.106854683
>>106854669
no u
Anonymous No.106854713
>>106854419
>I install stable diffusion
wut? do you mean stable diffusion webui by automatic1111? because that's a outdated piece of shiet. switch to either comfyui or whatever fork of reforge people use now
>AMD gpu
uhh for amd you'll have to do some googling to figure out how to get your backend of choice working
Anonymous No.106854798
>>106854419
>he buy boughted the amd graphics
Anonymous No.106854803 >>106854813
Anonymous No.106854812 >>106854817 >>106854865
Anonymous No.106854813 >>106854818
>>106854803
>>106854788
embarassing
Anonymous No.106854817
>>106854812
nsfw lora fucks it up, turn it off
Anonymous No.106854818 >>106854823 >>106854830
>>106854813
They won't give me even an inch!
Anonymous No.106854823
>>106854818
does conan hit this bitch or not
Anonymous No.106854827 >>106854854
Anonymous No.106854830
>>106854818
>the pole

KEK
Anonymous No.106854835 >>106854854
Anonymous No.106854839 >>106854854
Anonymous No.106854840
>>106854419
Follow some guide if necessary and download comfyui-zluda:
https://github.com/patientx/ComfyUI-Zluda
Anonymous No.106854844 >>106854854
Anonymous No.106854849 >>106854854 >>106854891
Anonymous No.106854854
>>106854827
>>106854835
>>106854839
>>106854844
>>106854849
literally better than all Sora 2 threads combined
bravo
Anonymous No.106854865
>>106854812
nice
Anonymous No.106854868
Anonymous No.106854891 >>106854911
>>106854849
wtf why did lodestone do this
Anonymous No.106854911
>>106854891
She'll be pulverized and then sprinkled across next radiance weights randomly. He found that 1girl sacrifice is the best way to steer his checkpoints.
Anonymous No.106855016 >>106855109 >>106855413 >>106855429 >>106855628 >>106855643
Anonymous No.106855109
>>106855016
nice finally some squats
Anonymous No.106855126 >>106855294 >>106855639
Anonymous No.106855294 >>106855473
>>106855126
Anonymous No.106855413
>>106855016
never skip boob day
Anonymous No.106855429
>>106855016
Finally, wan 5B
Anonymous No.106855473
>>106855294
Anonymous No.106855476 >>106855506
Anonymous No.106855506
>>106855476
>where do you think you're going geek boy?
Anonymous No.106855512
Pony status?
Anonymous No.106855599 >>106855621
Is there a chance of doing something with 8GB vram and 16gb ram? I've been using some pony model about a year ago, and it was somehow working. Did the technology advance?
Anonymous No.106855621 >>106855643
>>106855599
>can I do something with shit hardware
no, youre stack with XL derived models
Anonymous No.106855628 >>106855633
>>106855016
FTFY
Anonymous No.106855633 >>106855666 >>106855708
>>106855628
can we have a middleground? i dont like tumors girls
Anonymous No.106855639 >>106855644 >>106856273
>>106855126
Anonymous No.106855642 >>106855650
What are you guys doing videos with? Still just WAN?
barely works on my 2070S...
Anonymous No.106855643 >>106855659
>>106855621
What are the baseline requirements for doing something like that?>>106855016
Anonymous No.106855644
>>106855639
damn I wonder when they will find this poor being's bones... if they will be able to identify her?
Anonymous No.106855645
>>106850096
> Those who tried using Qwen-Omni and uploaded real songs for it to describe know what I am talking about.
2.5 or 3?
Anonymous No.106855650 >>106855657
>>106855642
>2070
the vramlets thread is that way >>>/sdg/
Anonymous No.106855657 >>106855726
>>106855650
I mean it does work. It just takes half an hour for 5 seconds
Anonymous No.106855659
>>106855643
16gb vram and 64gb ram minimum if you dont want to kill yourself waiting for gens to finish. it's around 3 minutes for 5s video with this setup
Anonymous No.106855666 >>106855673
>>106855633
Anonymous No.106855673
>>106855666
holy balloony
Anonymous No.106855674
>>106854194
Qwen/Wan do capturing with LLM anyway.
Anonymous No.106855706
>>106854343
pip install -U comfyui-frontend==1.23.4
Anonymous No.106855708 >>106855722
>>106855633
Anonymous No.106855713
>>106854385
Good, but not enough. Try to move the hottest models/loras to nvme.
Anonymous No.106855721 >>106855916
Desperately need the GTA VI chad to post a catbox of one of his gens for that workflow!

Captcha: PAWGX
Anonymous No.106855722
>>106855708
O_o ( . Y . )
Anonymous No.106855726
>>106855657
Use light2x loras.
Anonymous No.106855809
Anonymous No.106855820
what is this /adt/ leak?
Anonymous No.106855898
Anonymous No.106855916 >>106855950 >>106855963 >>106856029
>>106855721
I'll clean the workflow first, it's filled with embarrassing custom nodes
Anonymous No.106855937 >>106857226
Anonymous No.106855950
>>106855916

Thank you, I appreciate you!
Anonymous No.106855963 >>106856006 >>106856018
>>106855916
please a lot of anons want to use your workflow
Anonymous No.106856006
>>106855963
i don't
Anonymous No.106856018
>>106855963
chroma is broken, gta6 anon cherrypicks his gens
Anonymous No.106856023
Taken me 4 days of genning to finally get the results I wanted, and even then it's not close to perfect. I can 3d animate the result I wanted in a day.
I give it another year before it can do what I want with just a few minutes spent on it.
Anonymous No.106856029
>>106855916
basterd bicth redeem the workflow
Anonymous No.106856130 >>106856140
Is there any benefit in changing these values? I'm noticing a lot of performance drop when this is working on higher res gens.
Anonymous No.106856140 >>106856147
>>106856130
tiling can help reduce peak VRAM usage
Anonymous No.106856147 >>106856155
>>106856140
During the entire process of the gen? I guess it will produce visible lines from the tiles?
Anonymous No.106856151
new
>>106856149
>>106856149
>>106856149
Anonymous No.106856155 >>106856175
>>106856147
no, vae encode/decode only happen when transforming pixels to/from latents, meaning only at the beginning and end
Anonymous No.106856175
>>106856155
Ah, alright, thanks.
Anonymous No.106856217
>crying because your slop wasn't chosen for the completely arbitrary fagollage
Anonymous No.106856273
>>106855639
holy shit haha
good one
Anonymous No.106857065
>>106854129
waow
Anonymous No.106857226
>>106855937
i like it