Thread 106150095

343 posts 140 images /g/
Anonymous No.106150095 >>106150104 >>106151889 >>106152622
/ldg/ - Local Diffusion General
Discussion of Free and Open Source Text-to-Image/Video Models

GenJam3: https://forms.gle/hWs19H4vTGTdwARq8

Prev: >>106146763

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.2 Guide: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106150104 >>106152245
>>106150095 (OP)
>including disgusting footfag garbage in the collage
Embarrassing.
Anonymous No.106150106 >>106150120
best wan 2.2 workflow for rtx 3060 + 64gb ram?
Anonymous No.106150108
Blessed thread of frenship
Anonymous No.106150112
pasta needs an update methinks
Anonymous No.106150118 >>106150279
>12 collage pics.
Not my record, but not a bad performance at all.
Anonymous No.106150120 >>106152114
>>106150106
https://files.catbox.moe/6lp32g.json, you might need to use lower gguf quants
Anonymous No.106150126 >>106150154 >>106150221 >>106150258
chromagods won
24+gb vramchads won
xi won
chuds won

antitorrenting fags lost
vramlet fudders lost
mikutroons lost
Anonymous No.106150134
When using wan i2v and the input image includes a character very close to the camera that's blurred by the focus, how can you instruct wan to retain that focal blur for the video?
Anonymous No.106150148 >>106150248 >>106150741
so where are the torrents? i wanna seed
i have a lot of wan 2.1 loras archived on my hdd
im too lazy to make a torrent and shits on hf anyways
let me SEEEEEEED
Anonymous No.106150154
>>106150126
xi is out of favor now. firing squads need to be summoned to cull the bugs that screwed the datasets. until that happens, China is cringe again
Anonymous No.106150155 >>106150170 >>106150181 >>106150186 >>106150195
Sup fags, you you guys all use this website to generate your shit?

https://stabledifffusion.com/
or this one?
https://stablediffusionweb.com/

which is better and why?
Anonymous No.106150170 >>106150176
>>106150155
kill yourself
Anonymous No.106150176 >>106150188
>>106150170
???
Anonymous No.106150181
>>106150155
>local diffusion general
>they must use websites to gen images!
ohhh nononono
Anonymous No.106150186 >>106150194
>>106150155
Yes
Anonymous No.106150188 >>106150194
>>106150176
LOCAL diffusion general
Anonymous No.106150194
>>106150186
>>106150188
i aint using "local" just tell me which one is better
Anonymous No.106150195
>>106150155
Nah lil bro we are using

http://127.0.0.1:8188/
or
http://localhost:8188/
Anonymous No.106150200 >>106150229
Torrentmong melty
Anonymous No.106150221
>>106150126
whats up? I missed something?
Anonymous No.106150229 >>106150242
>>106150200
>lose the argument in the previous thread
>get exposed as a poorfag retard
>cope by trying to suggest anon was "melty" in the next thread
Lol you are a loser bro.
Anonymous No.106150231 >>106150238 >>106150249
did I sleep through something big
Anonymous No.106150238
>>106150231
qwen_image but it's mostly a nothing burger until finetunes exist
Anonymous No.106150242 >>106150255
>>106150229
Let us know when you've got the torrents ready!
Anonymous No.106150248 >>106150280
>>106150148
> i have a lot of wan 2.1 loras archived on my hdd
do you update them?
Anonymous No.106150249 >>106150295 >>106150405
>>106150231
fp8 qwen image model, you need 32GB+ vram to use the fp8 model
Anonymous No.106150255
>>106150242
>anon asks why the community doesn't just move to p2p filesharing
>"ERR TORRENTING COSTS MONEYYY"
>yes, and?
>"HEY DO YOU HAVE YOUR TORRENTS READY? WELL WHERE ARE THEY ANON?!"
This is why you are the village idiot of /ldg/.
Anonymous No.106150256
feels a lot like flux
Anonymous No.106150258 >>106150271
>>106150126
stop attack vramlet. i'm cool, young, and i already generated millions nsfw vids, kek
Anonymous No.106150271
>>106150258
this. qwen is too big for what it does and researchers need to think of the little guys. cumfart just makes wasteful bloat so everyone suffers
Anonymous No.106150275 >>106150284 >>106150305
No, seems like it doesn't know what openpose or skeletal pose estimation overlay are.
Anonymous No.106150278 >>106150303 >>106150306
can someone explain to me why the wan chinks can't stop characters from talking?
Anonymous No.106150279
>>106150118
why is the gooner on there twice tho
Anonymous No.106150280
>>106150248
i dont, one day i felt like archiving them and i didnt even archive them all
the folder count is wrong because i only saved the pointer files to all the loras, and i made a script to download the pointer files for each folder
i dont even use the loras kek
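
A rough sketch of what reading those pointer files could look like, assuming they are Git LFS pointers (the small text stubs a repo clone leaves behind when the large files are skipped). The post does not say which format they are, so that is an assumption, and the digest below is a made-up placeholder:

```python
# Assumption: the "pointer files" are Git LFS pointers, i.e. small text files
# with "version", "oid" and "size" lines. Nothing here downloads anything.

SAMPLE_POINTER = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:" + "0" * 64 + "\n"  # placeholder digest, not a real file
    "size 306000000\n"               # hypothetical size in bytes
)


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer and return its hash algorithm, digest and size."""
    fields = {}
    for line in text.splitlines():
        key, _, value = line.partition(" ")
        if key:
            fields[key] = value
    if not fields.get("version", "").startswith("https://git-lfs.github.com/spec/"):
        raise ValueError("not a Git LFS pointer")
    algo, _, digest = fields["oid"].partition(":")
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


if __name__ == "__main__":
    info = parse_lfs_pointer(SAMPLE_POINTER)
    print(info["algo"], info["size"] / 1e6, "MB")
```

Summing `size` over every pointer in a folder gives the disk space the real files would need before committing to a full download.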
Anonymous No.106150284 >>106150325
>>106150275
you preprocess openpose the video after you retard
Anonymous No.106150295
>>106150249
20B is a hefty boy, but we've been burned before
dunno why they still use these flat param numbers, it's the quality and diversity that matters
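
For a sense of scale, here is a back-of-the-envelope estimate of weight memory for a 20B model at different precisions. It counts the diffusion weights only; the text encoder, VAE and activations are not included, which is presumably part of why a figure like 32GB+ gets quoted upthread. The helper name and dtype table are just for illustration:

```python
# Weights-only estimate: parameter count times bytes per parameter.
# Real usage is higher (text encoder, VAE, activations, framework overhead).

BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "fp16": 2.0, "fp8": 1.0}


def weight_gib(params, dtype):
    """Return the size of the raw weights in GiB for a given dtype."""
    return params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    for dtype in ("bf16", "fp8"):
        print(f"20B @ {dtype}: {weight_gib(20e9, dtype):.1f} GiB")
```

At fp8 the weights alone come to roughly 18.6 GiB, so a 24GB card has little headroom left for everything else.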
Anonymous No.106150303 >>106150320
>>106150278
every video model has this problem. it's just hard to find videos where girls just shut up
Anonymous No.106150305
>>106150275
lmao
Anonymous No.106150306
>>106150278
because you touch yourself at night
Anonymous No.106150315
>wan still sucks at making anime characters kiss
One of the major problems here is there's so little good training data. Out of all the anime kiss scenes ever animated where you actually see the lips connect, 99.9% of the time the composition will show it in extreme close up. It is extremely rare to see two characters kiss with both of their entire heads visible.
Anonymous No.106150320 >>106150326 >>106150327 >>106150348
>>106150303
Literally how difficult can it be to just engineer a trigger word that disables all mouth movement? Wan knows what a mouth is, so why can't they include conditioning to prevent a mouth from opening?
Anonymous No.106150325 >>106150336
>>106150284
It is not accurate enough.
Anonymous No.106150326
>>106150320
not a problem if you are i2v a blowjob scene
Anonymous No.106150327
>>106150320
you are just referring to Chinese caption incompetence in general with slop captioning
Anonymous No.106150336
>>106150325
then just use a 4d model because it does exactly what you want to accomplish and you aren't doing anything new
Anonymous No.106150348 >>106150381
>>106150320
it gets even worse when wan creates mouths for characters that clearly shouldn't have them.
Anonymous No.106150381 >>106150440
>>106150348
>understands piplup has a yellow beak
>animates the yellow beak
>still generates a talking human anime mouth
do the chinks have a talking fetish?
Anonymous No.106150388 >>106150393
Chroma gave me a child when I asked for a young woman. Am I going to jail?
Anonymous No.106150393
>>106150388
pics or didnt happen
Anonymous No.106150405
>>106150249
Just wait for nunchaku if you’re a vramlet
Anonymous No.106150406 >>106150425 >>106150520 >>106151558 >>106151593
why does she get elf ears? I didn't specify that
https://files.catbox.moe/0s8s8e.mp4
Anonymous No.106150418 >>106150442 >>106150501
where can a young brother such as myself use joycaption without running into the ZeroGPU quota exceeded
nb4 local
Anonymous No.106150425 >>106150436 >>106150453
>>106150406
qwen generalized into an understanding of knowing that elf females are the perfect match for human men
Anonymous No.106150436 >>106150453 >>106150473
>>106150425
it's wan. qwen doesn't gen videos so I don't get why people care at all about it
Anonymous No.106150440
>>106150381
I'm sure the dataset just has a gigantic bias of people talking / moving their mouths
Anonymous No.106150442
>>106150418
https://github.com/jhc13/taggui
Anonymous No.106150446
is sage fucking with qwen? that's the only arg I have active
Anonymous No.106150453
>>106150436
right
>>106150425
*wan
Anonymous No.106150473 >>106150500 >>106150523 >>106150542 >>106150544 >>106150564
>>106150436
one of the most interesting things I've discovered is that there are groups of people who love static images who dislike video, and groups of people who love video and dislike images, and neither group can understand the other's obsession.
Anonymous No.106150480
Anonymous No.106150500
>>106150473
it isn't that, wan makes images too and people were seriously considering training its image capability. having just an image model isn't really efficient but neither is faking MoE with 2.2.
Anonymous No.106150501
>>106150418
on your cpu
Anonymous No.106150507
Anonymous No.106150520 >>106150527
>>106150406
i want to have sex with her
Anonymous No.106150523 >>106150905
>>106150473
It's just vramlets who cry about videos, and most of the people who mostly focus on videos like me are doing it because we had years of image gen and just recently can generate good videos
Anonymous No.106150527
>>106150520
me too anon. me too
Anonymous No.106150542 >>106150559 >>106150591 >>106150622
>>106150473
Maybe it's time to split the thread between image and video gen.
Anonymous No.106150544
>>106150473
no I completely understand someone with autism doing mentally ill things
Anonymous No.106150559 >>106150578
>>106150542
or if you use sdxl you can go to sdg which is where the poorfags with autism post
sdxl is great because it's basically a timecapsule, no need to go to the archives because it's groundhog day in that thread
Anonymous No.106150562
I'm noticing wan seems to love putting watches on characters wrists.
Anonymous No.106150564
>>106150473
i like both and im a vramlet (12gb)
Anonymous No.106150578 >>106150603
>>106150559
illustrious is the last holdout of trained styles and artists. that is why it's still popular. you keep saying it's vramlet but 8gb is enough to make videos.
Anonymous No.106150579
the funny thing about xl is its still unmatched for anime
Anonymous No.106150591 >>106150617
>>106150542
+1 for this
/vdg/ - Video Diffusion General
Anonymous No.106150603
>>106150578
if you wait 20 minutes per video generation you have autism
Anonymous No.106150617 >>106150680
>>106150591
okay you can split the thread, get your ban for trolling outside of /b/ and I'll keep posting in the local diffusion general thread
every day we march towards God's plan of digital IDs removing you completely, proxy evade that
Anonymous No.106150622
>>106150542
Absolutely. I have no interest in videos but static images are my jam
Anonymous No.106150628
Anonymous No.106150632
Anonymous No.106150680 >>106150706
>>106150617
Hey idiot didn't you insist the guy baking /adt/ would get banned for trolling too?
Anonymous No.106150694 >>106150716
Anonymous No.106150695 >>106150709
krea is really good for surveillance cam gens
Anonymous No.106150706 >>106150730
>>106150680
you know what you're doing right now is trolling, don't play stupid
also good luck with your split, adg is surviving on sdxl fumes from dedicated trolls, good luck astroturfing a video thread with your 3060
Anonymous No.106150708
>benchmaxxed chinkslop
I'm getting flashbacks to March of last year
Anonymous No.106150709 >>106150727
>>106150695
Anonymous No.106150716
>>106150694
why are you posting it again?
Anonymous No.106150721
looks like there's some interest in /vdg/.
Personally I like video gen a lot more than image gen too. My interest in image gen is just generating inputs for i2v.

I keep seeing this thread filled with chroma/flux discussion. Not interested.
Anonymous No.106150724
>>106149907
>>106150545
Typical Chinese training practice is just to use synthetic slop. I am now more convinced than before that even all their LLMs including Deepseek and Kimi K2 are the same. That's why their models will always be inferior. Why a nation that has billions is unwilling to just channel into their cheap labor to properly label datasets is beyond me.
Anonymous No.106150727
>>106150709
Anonymous No.106150729 >>106150759
Sirs, how do I fix this? Google says comfyui is missing triton or needs its older version, but I've installed 3.1 through pip install triton-windows and nothing changed
Anonymous No.106150730 >>106150748
>>106150706
>anyone keeping alternative threads alive must be dedicated trolls!
You have schizophrenia.
Anonymous No.106150737
i thought chang was supposed to save us...
Anonymous No.106150741 >>106150752
>>106150148
Give some torrent for me to SEEEEEEED
Anonymous No.106150748 >>106150763
>>106150730
>schizofag posting
You're obvious because you troll and then call everyone schizos
Anonymous No.106150752 >>106153483
>>106150741
no give it TO ME
Anonymous No.106150756
>You grey fucks are taking too many people! That wasn't part of the deal!
Anonymous No.106150759 >>106151082
>>106150729
>I've installed 3.1 through pip install triton-windows and nothing changed
You're sure you're making those changes to ComfyUI/Lib/site-packages/triton and not your system Lib/site-packages/triton?
Go to the update folder in your comfyui portable and look at update_dependencies.bat; it has the correct commands to invoke comfyui portable's own pip and update its packages instead of your system's
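
A quick way to confirm which environment actually has triton is to run a small check with the same interpreter that launches ComfyUI. This is only a sketch using the standard library; the helper name is made up, and the path it prints will depend on your install:

```python
import importlib.util
import sys


def locate_package(name):
    """Return where `name` is installed for the running interpreter, or None."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return None
    # Packages expose their directories; plain modules only have `origin`.
    if spec.submodule_search_locations:
        return list(spec.submodule_search_locations)[0]
    return spec.origin


if __name__ == "__main__":
    print("interpreter:", sys.executable)
    print("triton:", locate_package("triton") or "not installed for this interpreter")
```

If the printed interpreter is your system Python rather than the one bundled with the portable build, the pip install went to the wrong place.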
Anonymous No.106150763 >>106150777
>>106150748
I'm just calling you a schizo, schizo.
It takes some serious mental illness to believe /adt/ is only being kept alive by "dedicated trolls".
It also demonstrates that you are quite stupid.
Anonymous No.106150777 >>106150792
>>106150763
no, you simply are obvious because schizo and mental illness insults are your go to projections
Anonymous No.106150792
>>106150777
>no! i'm not a schizo! you are just projecting! /adt/ is definitely just trolls and the mods will definitely delete it one day! just you w-watch!
Anonymous No.106150864 >>106150958
Anonymous No.106150905 >>106150917
>>106150523
youre literally using a quant and speed lora you poorjeet. you don't have a B200
Anonymous No.106150912 >>106151085 >>106151657
When I use loras with wan2.2, should I use the same weight as I was with 2.1?
Also between HIGH and LOW, is there any recommended weight change? For example, loras should be half HIGH in LOW?
Anonymous No.106150917
>>106150905
the difference is Q8 is basically max quality, Q4 or even svdq is quite noticeably shittier
Anonymous No.106150928 >>106151335
>please hand hold me, I expect you to experiment with all these things and then just tell me
Anonymous No.106150944
Anonymous No.106150949
Anonymous No.106150958
>>106150864
cool
Anonymous No.106150991 >>106151044 >>106151078 >>106151164
i know this is ldg but have you guys seen google genie 3? insane, anon last night was making retro shooter games playing with instructions, and today its realtime and temporally consistent - insane
Anonymous No.106150993 >>106151011 >>106151064 >>106151164 >>106151184 >>106151307
WAAAAAA WHY ARE CHINKS RELEASING MODELS THAT ARENT THAT SPECIAL HUNYUAN WORLD IS A TOY IT CANT-AAAACK

It forces all other players to compete and publish their own research so it's not obsoleted and forgotten, retard

https://www.youtube.com/watch?v=PDKhUknuQDg
Anonymous No.106151011 >>106153063
>>106150993
I view this in the same way as Pixar talking about some new 3D rendering technique.
Anonymous No.106151044 >>106151059 >>106151122
>>106150991
Wtf. did they just make game devs and game engines obsolete? should I short unity and unreal stock?
Anonymous No.106151059 >>106151081 >>106151174
>>106151044
gacha random videos != creating something with purpose
AI is like going to a booru site and seeing some random images based on some tags you selected, it is divorced from purpose
Anonymous No.106151064 >>106151090 >>106151150 >>106151174
>>106150993
>real time 24fps 720p video that's fully interactive using video game controls
local fucking lost
Anonymous No.106151078
>>106150991

If not local, google saars will censor it to irrelevance.
Anonymous No.106151081 >>106151095
>>106151059
Yeah but [whatever]booru doesnt have my extremely niche fetishes
Anonymous No.106151082 >>106151102
>>106150759
Thanks, downgrading it for comfy's internal pip solved the problem.
Now I have to do something about generation speed because 53.09s/it on 4070ti is apparently not ok
Anonymous No.106151085 >>106151130
>>106150912
>When I use loras with wan2.2, should I use the same weight as I was with 2.1?
Yes, but they will always look worse than for wan2.1 unless they were trained for 2.2.

>Also between HIGH and LOW, is there any recommended weight change? For example, loras should be half HIGH in LOW?
I use the same, but the bigger impact should be on high, so if you have one that's too much frying the video, you can technically lower it in low.
Anonymous No.106151090
>>106151064
>trains on your outputs to get 90% of the quality + porn
nothing personnel, thanks for your research
Anonymous No.106151095
>>106151081
yes but that ultimately has extremely limited value, being a passive observer is different from someone wanting something specific they see in their imagination
Anonymous No.106151102 >>106151123 >>106151133
>>106151082
What resolution are you genning? 480p is around 25 seconds/it on my 3090.
Anonymous No.106151112
I just upgraded to sageattention2++ and it got me a small but free speedup, something like 5% on wan, nice
Anonymous No.106151116
Anonymous No.106151122 >>106151142
>>106151044
Not just yet, but it's a super exciting development imo. It all depends on compute power, which I doubt is cheap. Looks like they gen'd those examples for use with SIMA, which I think is how it's going to end up working: not all in one gen, but connected to a web of models with explicit use cases for each behind the scenes, inferencing with each other

that being said... yeah it might be over in a decade desu
Anonymous No.106151123 >>106151134
>>106151102
did you try 720p?
I also have a 3090 and I get 145s/it on it
Anonymous No.106151130
>>106151085
OK thanks.
Anonymous No.106151133
>>106151102
480p as well, every option is left unchanged from the thread's pastebin workflow.
Not surprising since 3090 has twice as much vram as 4070ti, I'll just try lower quants for now
Anonymous No.106151134 >>106151140
>>106151123
Yeah 720 was way slower for me
I'll just cope and be happy im remotely close to genning what i want
Anonymous No.106151140 >>106151148
>>106151134
>Yeah 720 was way slower for me
do you remember the speed, I'd like to compare to what I get
Anonymous No.106151142 >>106151166
>>106151122
It's also very simple in the examples and smells of OpenAI cherrypicking.
Anonymous No.106151148 >>106151391
>>106151140
Like 90s/it or some shit
Anonymous No.106151150 >>106151172
>>106151064
local needs to get its shit together and rush the cyberpunk aesthetic in a city with a large power grid in a small apartment with a rack of h100s and industrial cooling in your living room right next to your kitchen and bed training ML 24/7
Anonymous No.106151164
>>106150991
>>106150993
I feel nothing
Anonymous No.106151166
>>106151142
agree fully, ai is the only thing i can try to be hopeful/excited about these days so a small bit of suspension of disbelief is default
Anonymous No.106151172 >>106151189
>>106151150
I think people would be buying $10k GPUs if that bought actual real-time AI games. I think theoretically people would pay as much as for a car if it meant the end-game of AI like full games, Hollywood movies, porn, etc.
Anonymous No.106151174 >>106151190
>>106151064
You just need to achieve 60fps 720p and then nvidia DLSS 6.66 will produce crisp clear 240FPS 4k in realtime. It's joever.

>>106151059
Yeah yeah blah blah, i'll give it 2 weeks and they'll have images to 3d world. You'll feed it the front cover and backside of a 2000's classic, and it will spit out the game with canon artstyle and gameplay. Then someone generates supermario sunshine 2 and the greatest tendie lawyer shitstorm ever will commence, resulting in the prompting anon getting 6 million years in jail.
Anonymous No.106151184
>>106150993
Local tripped so SaaS could run. Thank you, Flux, Hidream, and Qwen image!
Anonymous No.106151185 >>106151403
remember this guy?
Anonymous No.106151189
>>106151172
i can personally vouch saying I would unironically buy h100s if this was local and opensourced
Anonymous No.106151190 >>106151512
>>106151174
You described 1,000x more complex inference done in 1/1000th of the compute time.
Anonymous No.106151238 >>106151264 >>106151289 >>106151307 >>106151311 >>106151364 >>106151405 >>106151439 >>106151506
>https://x.com/jparkerholder/status/1952732999193096392

actually fucking nuts, even keeps consistency after looking away
Anonymous No.106151264 >>106151296
>>106151238
I'll be impressed when the dog goes through a portal, explores a completely different environment for 30 seconds, and then goes back through the portal and everything is unchanged
Anonymous No.106151289
>>106151238
where is the demo i want to be a dog on the beach
Anonymous No.106151296
>>106151264
valid desu
Anonymous No.106151307 >>106151321
>>106151238
>>106150993
Does it generate compilable code for the game or how does this thing work?
Anonymous No.106151311 >>106151346
>>106151238
>dog walks forward
>doesn't get closer to water
>brain begins to hurt
Neat tech, curious to see where it goes, but that video is fucking headache inducing.
Anonymous No.106151321 >>106151367
>>106151307
It's basically the hallucinated Minecraft AI model scaled up
Anonymous No.106151335
>>106150928
it's a 40gb model. ofc I'd let idiot bring bing wahoo redditors test the shit for me before bothering. so far, I'm sticking with wan
Anonymous No.106151346 >>106151388
>>106151311
>headache inducing.
Look at how the waves appear to be a different frame rate than the dog around 8 seconds onward
Anonymous No.106151364
>>106151238
>download the local quant equivalent
>dog turns around and materializes sunglasses
thanks emad
Anonymous No.106151367 >>106151401
>>106151321
this. they probably made it using footage from a dev taking a dog out for a walk
Anonymous No.106151388
>>106151346
yeah I think that is what is getting me as well, everything in the scene is wrong
Anonymous No.106151391
>>106151148
thanks anon
I'm definitely slower then lol
Anonymous No.106151401 >>106151461
>>106151367
I would bet their training is like this:
- get a video clip
- take frame 1
- use AI to caption the "action" the player did on frame 1 to do the motion in the clip
Anonymous No.106151403
>>106151185
fake, he should have sunglasses
Anonymous No.106151405 >>106151439 >>106151466 >>106151853
>>106151238
what the fuck is jparker smoking? what the fuck does a world model have to do with AGI? why do they feel the need to insert AGI meme into everything?
Anonymous No.106151434
Anonymous No.106151439 >>106151462
>>106151238
It's cool tech, but early tech, until they can get some sense of permanence.

>>106151405
It's a bit annoying agreed.
Anonymous No.106151452
Anonymous No.106151461
>>106151401
sort of. it's a behavior tree that tokenizes the inputs from the controls. it's why there is a delay when you see an input. they should just be precached
Anonymous No.106151462
>>106151439
Just gives us 60 second video clips, that's the white whale.
Anonymous No.106151466
>>106151405
its good for company funding!! business men are just professional shills
Anonymous No.106151476
pixar miku walks in:
Anonymous No.106151506 >>106151557
>>106151238
comfy will never add world models
Anonymous No.106151507
>FUCK HIM UP!
Anonymous No.106151512
>>106151190
no? or maybe yes, I guess? I don't know how their world gen exactly works. do you? we both know that doom example. and apparently there's some ex google guys who created mirage. I doubt they explored many performance optimization techniques yet, let alone a mixture of generative AI and present techniques. I suppose the realtime visual generation is the bottleneck. so what would happen if you rip all the textures from a game and make AI use these textures instead of generating new ones? obviously you'd be limited to the game's world context and art style, but performance would shoot up at least 2x-5x and you have your supermario sunshine v2 in 1080p 60fps, native.
Anonymous No.106151522
where the fuck is i2v light?
Anonymous No.106151527 >>106151536 >>106151543 >>106151550 >>106151564 >>106151580 >>106151724
why isn't meta doing anything with ai img or vid? shouldn't they care more, given ar and vr?
Anonymous No.106151533 >>106151613
a man is listening to music, an anime version of Miku Hatsune walks beside him and waves hello.

again pixarized, maybe cardboard cutout would work.
Anonymous No.106151536
>>106151527
its not safe
Anonymous No.106151543
>>106151527
they sometimes do but it's usually shit or toy projects
Anonymous No.106151550 >>106151559
>>106151527
probably training as we speak, wonder if the meta glasses phone home telemetry and data scans like the quest 3 does

theyll prob go all in on mega data farming training and then making the metaverse just be a clone of the real world
Anonymous No.106151557 >>106151615
>>106151506
duh, the nodetard didn't see this coming and didn't bootlick tencent when theirs came out
Anonymous No.106151558
>>106150406
ADVENTURERS RESTAURANT BROS WE EATIN GOOD TONIGHT
Anonymous No.106151559
>>106151550
Metaverse will never take off, no one wants to pay $10 for a png on a shirt.
Anonymous No.106151564
>>106151527
They do, they just won't release it anytime soon
Anonymous No.106151580 >>106151594
>>106151527
Does SAM count?
Anonymous No.106151593
>>106150406
ToT holy shit
Anonymous No.106151594
>>106151580
yes
Anonymous No.106151609 >>106151638
>Silly hum-ACK
Anonymous No.106151613 >>106151664
>>106151533
kek

a man is listening to music, a cardboard cutout of an anime version of Miku Hatsune walks beside him and waves hello.

well, it worked!
Anonymous No.106151615 >>106151728
>>106151557
>50 nodes
>each action input is a different node
>displays each generated frame in a tiny preview node
Anonymous No.106151638 >>106151745
>>106151609
he missed his spark punch ult, sad
Anonymous No.106151657
>>106150912
you should reconsider your homo-erotic obsession with todd howard and cia
Anonymous No.106151663 >>106151745
a man is listening to music puts on a miku hatsune mask.

pretty good.
Anonymous No.106151664
>>106151613
LOL what the fuck, make the cardboard cutout hump the chair
Anonymous No.106151718 >>106152024
kojima contacting tendie lawyers as predicted earlier
Anonymous No.106151724
>>106151527
they have the best model anon
Anonymous No.106151728 >>106151752 >>106151819 >>106151877
>>106151615
yeah, comfyui is showing its age. the node naming is all fucked too because of model shilling. it disgusts me seeing the "sd3" guidance node used in everything
Anonymous No.106151745
>>106151638
Yeah, his second punch was clean, wan 2.2 knows how to connect'em

>>106151663
Kek'd
Anonymous No.106151752
>>106151728
yeah
emptysd3 latent in a flux workflow is humiliation
Anonymous No.106151819
>>106151728
Almost as if the design just doesn't work
Anonymous No.106151847
Anonymous No.106151852
is genjam dead?
Anonymous No.106151853 >>106151893 >>106151910
>>106151405
>what the fuck does a world model have to do with AGI
it's extremely obvious why it's significant. do you really not understand?
Anonymous No.106151875 >>106151900
VRAM requirement for openshit?
Anonymous No.106151877 >>106151979 >>106151995
>>106151728
>it disgusts me seeing the "sd3" guidance node used in everything
Couldn't that just be renamed? I doubt its name is locked in.
I think you're just looking at the nature of open software here - you're the first person to bring up this annoyance and so no one's bothered to change it.
Anonymous No.106151889 >>106151961 >>106152015
>>106150095 (OP)
Is an M4 pro macbook 48GB of ram good enough for decent results in generating video with ld?
Anonymous No.106151893 >>106151913
>>106151853
>next token autocomplete is AGI
Anonymous No.106151900
>>106151875
24GB
Anonymous No.106151910
>>106151853
4
Anonymous No.106151913
>>106151893
Your natural general intelligence is really showing. Machines will never compete.
Anonymous No.106151955 >>106152002 >>106152004 >>106152032
kek

wan 2.2 is great

"a man opens a pack of onions chips and eats some."
Anonymous No.106151961
>>106151889
I haven't messed with it so I can't say for sure, but my understanding is that although a mac has the memory and can do decently well with LLMs, the type of inference required for image/video gen makes it super slow on a mac.
Anonymous No.106151979 >>106151995
>>106151877
it would break all the workflows
Anonymous No.106151995
>>106151877
Forgot your fox girl image.
The problem isn't the name. The problem is that it doesn't need to be a node at all, anything model specific like that, or the different prompt encode nodes, should simply be handled internally/automatically. If you can't actually use a different node there's literally no point in having them.
>>106151979
Also this, comfy doesn't understand backwards compatibility or compatibility of any kind really.
Anonymous No.106152002
>>106151955
Anonymous No.106152004
>>106151955
the fucking ending frames killed me lmfao, saved
Anonymous No.106152015
>>106151889
In most cases (most models, most settings) for the higher end of local video or imagegen so far it has sucked to use anything but nvidia.

I haven't tried it recently tho, especially not with a mac.
Anonymous No.106152024
>>106151718
I'll be impressed when we get VR porn we can interact with.
Anonymous No.106152032
>>106151955
alright this one is pretty kek
Anonymous No.106152056 >>106152288
Anonymous No.106152060 >>106152592
ITS TIME.jpg
Anonymous No.106152101 >>106152592
Anonymous No.106152114 >>106152119 >>106152194
I don't understand this: I had been using a Lightx2v T2V LoRA for I2V generation, and it was working fine. After I noticed, I switched to an I2V LoRA and everything sucks: the video "glows", there's "dropping sand" and all sorts of artifacts. I am not using Kijai; my workflow is more like >>106150120
Anonymous No.106152119 >>106152194 >>106152277
>>106152114
T2V lora absolutely kills more complicated prompt following
Anonymous No.106152124
Diffusers team smoking crack. What the fuck is this bullshit?
Anonymous No.106152127 >>106152150 >>106152162 >>106152187 >>106152191 >>106152196 >>106152224 >>106152235
Good god Stability AI. You really ran your brand into the ground.
Anonymous No.106152135 >>106152147 >>106152152
Anonymous No.106152147
>>106152135
Actually my perspective fellow anons who I am also.
Anonymous No.106152150
>>106152127
did he run out of cocaine money?
Anonymous No.106152152
>>106152135
kek
Anonymous No.106152162
>>106152127
>professional
>3 buttons undone
Anonymous No.106152187
>>106152127
kek why do they they always get jeets to fill in the ceo spots when other people back down
Anonymous No.106152191 >>106152225 >>106152288
>>106152127
All because they were unwilling to do porn or train unrestricted.
Anonymous No.106152194 >>106152205
>>106152114
>>106152119
So the recommended lightx lora for wan2.2 isn't i2v 2.1 but t2v 2.1?
Anonymous No.106152196 >>106152210
>>106152127
custom ai workflow to help him snort more coke quicker
Anonymous No.106152205 >>106152267
>>106152194
use this: https://files.catbox.moe/6lp32g.json
Anonymous No.106152210
>>106152196
Coke tube made out of silver
Anonymous No.106152224 >>106152238
>>106152127
can xi just nuke india ffs
Anonymous No.106152225 >>106152288
>>106152191
the crazy thing is that Chinese models literally show it doesn't matter
all their fears about their brand being impacted by allowing nsfw amount to nothing if people used their latest models, and nsfw is a big booster
Anonymous No.106152235
>>106152127
what did this idiot actually do for the company? James Cameron joined in for what exactly? cocaine parties?
Anonymous No.106152238 >>106152262
>>106152224
xi lost power months ago, he's not the top dog in China anymore
Anonymous No.106152245
>>106150104
You must be new
Anonymous No.106152262 >>106152303
>>106152238
no wonder why the good times seemed to stop. whatever happened to chairman for life?
Anonymous No.106152267 >>106152543 >>106152562
>>106152205
either this is some voodoo schizo stuff or super brilliant, I have no idea which one
I'll try it
Anonymous No.106152277 >>106153114
>>106152119
You gave me an idea: using I2V for high noise and T2V for low noise.
Anonymous No.106152288 >>106152467
>>106152056
nice

>>106152191 >>106152225
we did try to tell them what the people need, sad that they didn't listen
Anonymous No.106152303
>>106152262
internal coup probably because the party didn't appreciate the whole return to mao and close down the country rhetoric
Anonymous No.106152359
damn. every company decided to dump all the AI news on the same day. What a coincidence.
Anonymous No.106152360 >>106152376
how did the weakest, limp twisted faggots become the leaders of all these corps and countries? bad times ahead
Anonymous No.106152376 >>106152395
>>106152360
weak men create bad slop
bad slop creates strong men
strong men create good slop
Anonymous No.106152383
https://huggingface.co/city96/Qwen-Image-gguf/tree/main

but if it's slower than flux why bother
Anonymous No.106152395
>>106152376
>pose and style wow will never do again
Anonymous No.106152406
Anonymous No.106152408
>Qwen Image
>default workflow
>notes
>1st generation 94s
>2nd generation 71s
I got 88s/63s (half that at CFG 1.0) on my 4090 that's limited to 85%... and here I was expecting it to be slow.
Anonymous No.106152453
latest test :
2x3090 (each having its own separate comfy session) + 128GB ram + 5950x
lightx 4+5 steps
CUDA128 / Sageattention 2
113 frames (7s@16fps)
720p
-> 25min
So basically I get a new video every 13min taking into account the two gpus.
I went more steps on LOW because it gives better details for some reason.
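Rough arithmetic for the numbers above, as a sketch. The function names are made up for illustration, and it assumes both GPUs finish a 25-minute gen at the same steady rate:

```python
def clip_seconds(frames: int, fps: int) -> float:
    """Length of the clip in seconds for a given frame count and frame rate."""
    return frames / fps


def minutes_per_video(minutes_per_gen: float, num_gpus: int) -> float:
    """Average wait between finished videos when each GPU runs its own session."""
    return minutes_per_gen / num_gpus


print(round(clip_seconds(113, 16), 2))  # 7.06
print(minutes_per_video(25, 2))         # 12.5
```

So "7s" and "every 13min" are both rounded, the exact figures are about 7.06s and 12.5min.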
Anonymous No.106152467
>>106152288
Shame no one had the balls to tell the investors that porn is the only differentiating factor in the success of an open sourced AI. They should've been funding platforms, not division.
Anonymous No.106152500
Anonymous No.106152543 >>106152650
>>106152267
its the 2nd one
Anonymous No.106152544 >>106152723 >>106152764
how we feeling about comfy's vram optimization?
Anonymous No.106152560 >>106152580 >>106152785
pc barely coping
24gb vram and 64gb ram is not even enough anymore
Anonymous No.106152562 >>106152650
>>106152267
heres another with it
Anonymous No.106152580 >>106152667
>>106152560
consumer models were never enough. you need h100 or h200 to do real work. you should be grateful you can even do 10% of what they're capable of locally
Anonymous No.106152592
>>106152101
>>106152060
change keep_proportion to resize in the image resize node
Anonymous No.106152610 >>106152687 >>106153114
Why is every video trying to loop now? My first two gens weren't like this and actually ended with different frames compared to input images, did I accidentally toggle some option?
Anonymous No.106152622 >>106152657 >>106152719
>>106150095 (OP)
the rentry for i2v wan2.2 is wrong, the resolution shouldn't be divisible by 32 but by 16 (or 8) in i2v so you get 720x1280 and not 704x1280
704 is only recommended for t2v
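Quick sketch of where 704 vs 720 comes from. `snap_down` is a made-up helper, and the claim that i2v only needs multiples of 16 (or 8) is the one from this post, not something checked against the Wan code:

```python
def snap_down(value: int, multiple: int) -> int:
    """Largest multiple of `multiple` that is less than or equal to `value`."""
    return (value // multiple) * multiple


# Compare what a 720x1280 target turns into under each divisibility rule.
for multiple in (32, 16, 8):
    print(multiple, snap_down(720, multiple), snap_down(1280, multiple))
# 32 704 1280
# 16 720 1280
# 8 720 1280
```

1280 is divisible by all three, so only the short side changes: rounding to 32 drops 720 down to 704, while 16 or 8 keeps it at 720.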
Anonymous No.106152650 >>106152817
>>106152543
>>106152562
ok I'll bite, can you share the links for "fusionx" and "pusa"?
Anonymous No.106152657
>>106152622
*in the shared workflows
Anonymous No.106152667 >>106152704 >>106152718
>>106152580
Oy vey, so true my fellow anon! It's not like nvidia slaps 10x price tag on a product after attaching more vram chips to the same gpu, you're just a poor consumer who was never meant to run neural networks locally.
Anonymous No.106152687 >>106152738
>>106152610
I hope things get worse for you. take care
Anonymous No.106152704
>>106152667
>It's not like nvidia slaps 10x price tag on a product
Right, it's an entirely different chip with more CUDA cores, more Tensor cores, more SMs, larger cache, etc.
Anonymous No.106152718
>>106152667
>It's not like nvidia slaps 10x price tag on a product after attaching more vram chips to the same gpu
yes, they can do that because they have no competition. they charge whatever they want and (you) will have to buy it. Nvidia is the richest company in the world right now.
Anonymous No.106152719 >>106152743
>>106152622
What if the resolution is shifted a couple of pixels in one direction or the other? Is there a big difference in gen quality?
Anonymous No.106152723 >>106152750 >>106152753
>>106152544
>Veo3 above Qwen image and VRAM optimizations
local lost, comfyui is adware
Anonymous No.106152738
>>106152687
Sorry but we're browsing an AI thread on the anime pedo website right now, our lives can't really get much worse than this.
Anonymous No.106152743
>>106152719
no
Anonymous No.106152750 >>106152773
>>106152723
comfy never said it was going to be local only. api cloud shit is how the money is made.
Anonymous No.106152753
>>106152723
>affiliate monetization
Confybros...
Anonymous No.106152764
>>106152544
he still uses jeetthon so I don't really care. I hope this shitware crashes and burns hard next year
Anonymous No.106152773
>>106152750
>comfy never said it was going to be local only
Rug status?
Anonymous No.106152785
>>106152560
if the app didn't choke out a gig of memory you'd have more wiggle room
Anonymous No.106152792 >>106152813 >>106152821 >>106152838 >>106152852 >>106153265
are you actually 24/7 looking at the thread to see when comfyui is talked about to rage about it
you are legit insane dude
Anonymous No.106152813 >>106152841
>>106152792
it's honestly not hard to see many people are sick of this shit. you are schizo if you think everyone should be happy local is getting fucked by garbage software
Anonymous No.106152817
>>106152650
its in the workflow
Anonymous No.106152821
>>106152792
yes, yes he is. i thought they were just being an ironic shitposter at first but they are serious
Anonymous No.106152838
>>106152792
>you are legit insane dude
Took you long enough to notice.
Anonymous No.106152841 >>106152878
>>106152813
>happy local is getting fucked by garbage software
you do realize comfy isn't the only local software?? there's nothing stopping you from using the alternatives. why do you have tunnel vision for comfy? if people were sick of it as you claim, they obviously would not be using or talking about it.
Anonymous No.106152852
>>106152792
Comfy has always been actively hostile towards every other dev, only fair that his work is criticized too.
Anonymous No.106152868 >>106152895 >>106152914 >>106152930 >>106153035 >>106153065
bros, openai actually open??
> https://x.com/sama/status/1952777539052814448
Anonymous No.106152878
>>106152841
why does the community always shove comfy in front of their face and force them to fix every fucking problem cumfart didn't bother with for three years? who seriously wants to try and make other options when nobody bothers to help contribute?
Anonymous No.106152895
>>106152868
so i can gen yellow-puke film-grain images locally?
Anonymous No.106152912
Anonymous No.106152914
>>106152868
>>106152344
Anonymous No.106152930
>>106152868
it will be so safe it will be unusable
also this is just a response to deepseek and kimi
Anonymous No.106153033 >>106153060 >>106153074 >>106153108 >>106153189 >>106153220 >>106153550 >>106153685
The crackdown on adult content continues.
Anonymous No.106153035
>>106152868
Always has been.
Anonymous No.106153053
Given normalized_shape=[3584], expected input with shape [*, 3584], but got input of size[1, 171, 4096]
guhffuf'd
Anonymous No.106153060
>>106153033
>"big breasts"
Anonymous No.106153063
>>106151011
honestly best way to describe it.
Anonymous No.106153065
>>106152868
They should release Dall-E 3 so we can stop pretending fluxslop is somehow superior
Anonymous No.106153074 >>106153106
>>106153033
how long until all ancient statues and texts are destroyed
Anonymous No.106153106
>>106153074
Less than 100 years when the retard white population gets replaced fully without anyone else even being able to realize what is happening to the boiling frog, then a perfect brown mass will be created to be ruled forever.
Anonymous No.106153108 >>106153220 >>106153564
>>106153033
ISIS in Palmyra tier retardation
Anonymous No.106153114 >>106153154 >>106153397
>>106152277
I need to test it more, but it worked better at movement than T2V and less artifacts like I2V
>>106152610
Did you add frames? I tested 121 and it tended to loop.
Anonymous No.106153143 >>106153161 >>106153167 >>106153330
this is the cutting edge in cgi food.
can /ldg/ compete?
Anonymous No.106153154 >>106153397
>>106153114
don't help the wanschizo. he'll just get banned anyways since he spams on /a/
Anonymous No.106153161 >>106153168 >>106153184
>>106153143
yes but why would i want to gen that retarded shit
Anonymous No.106153167
>>106153143
absurd amount of creme
Anonymous No.106153168
>>106153161
you can make a naked lady do it and btfo him
Anonymous No.106153184
>>106153161
for the sport of it.
it's a challenging test case.
i'm not even baiting, i'm curious how close the models can get right now.
Anonymous No.106153189
>>106153033
The reign of the karens.
I blame people listening to them.
Anonymous No.106153220 >>106153238 >>106153564
>>106153108
you are why >>106153033 happens
Anonymous No.106153238
>>106153220
This is some galaxybrain logic
Anonymous No.106153240 >>106153445
Is there a magic word, positive or negative (possibly in Chinese) that'll make Qwen produce a sharp image?
Anonymous No.106153259
Does any program use ZLUDA (AMD) that can do WAN?
Anonymous No.106153265 >>106153294 >>106153384 >>106153421
>>106152792
I wouldn't put it past that competitive, narcissistic fucker ComfyAnon to have actually paid off Ilyasviel and Panchovix to just drop Forge and ReForge and leave them for dead.

The way he was shilling about Qwen a few hours ago really sealed it for me.

It's crystal clear his priority isn't actually stabilizing or improving his spaghetti code interface in any meaningful way.
It's all about chasing the next shiny toy to bolt on so the local consoomers can play with their new toys.

And you have to remember, ComfyAnon is the guy who's been samefagging in these very generals for years, shilling his own UI and shitting on everything else.

It's fucking rich how people in this circle throw the word 'schizo' around at everyone else, when Comfy himself is the final boss of all the schizos here.
Anonymous No.106153290 >>106153306 >>106153315 >>106153321
qwen is much, much better than flux and wan for the subject matter I'm interested in. And it runs on my 4070 with fp8, I don't know why everyone is complaining about the size.

I really hope there will be a lot of lora / tuning support.
Anonymous No.106153294
>>106153265
the open source community has been compromised by a greedy narcissistic fuck that can't even generate a good looking image
Anonymous No.106153306 >>106153412
>>106153290
> And it runs on my 4070 with fp8
how long does it take for a single gen? post workflow or fuck off XD
Anonymous No.106153315
>>106153290
>I don't know why everyone is complaining about the size.
IT DOESNT RUN ON MY LAPTOP 1050 SO ITS NOT A LOCAL MODEL CHUD
Anonymous No.106153321 >>106153412
>>106153290
Didn't you forget Chroma on your list? It's a thousand times better than Chroma and it came out yesterday.
>inb4 muh penis, muh vagina, muh NSFW
Local coomers would make Loras and Basemodels in a second.
Anonymous No.106153330 >>106153338 >>106153433
>>106153143
wow, looks like they won in a niche no one is competing in.

what can their hot 1girl [nsfw] do?
Anonymous No.106153338 >>106153392
>>106153330
prompt?
Anonymous No.106153361 >>106153401 >>106153403
So I'll make /vdg/? Everyone happy?
Anonymous No.106153384
>>106153265
You know the Dev is crap when all he does is be a namefag here and cares more about appearing on YouTube streams than improving his interface.
Anonymous No.106153392
>>106153338
wan ahegao lora "she crosses her eyes, sticks her tongue out and makes the ahegao face."

plus base wan "they are jumping up and down in a disco." which with my settings seems to usually make them do this more than actually jumping

the women were prompted like "multiple women partying in a club wearing glossy outfits collars and "fuck me" top"
Anonymous No.106153394
change the grassy racetrack to a sunny beach. keep the proportions of the anime girls the same.

kontext is fast and fun, wan is good but I really wanna see extended clips without OOM.
Anonymous No.106153397 >>106153432
>>106153154
I think you have to take your medication because I haven't posted a single AI gen on /a/ yet. It's actually the other way around - I saw a guy posting AI videos and then went here to learn how to make these.
>>106153114
113 and 162 did loop with this picture, I'll try other values but doubt it'll help. It probably depends more on the input image and prompt.
Anonymous No.106153401
>>106153361
no no, this is /ldg/, house of manchilds with their big GPUs, big nodes and big cooms
For images it's /sdg/
Anonymous No.106153403 >>106153419
>>106153361
no, people will come with pitchforks. /adt/ already pisses people off
Anonymous No.106153411
>>106153400
>>106153400
>>106153400
Anonymous No.106153412
>>106153306
template workflow, 20 steps, 1328x1328, 149 seconds.

But I think 20 steps isn't necessary a lot of the times. And you can lower the resolution to iterate on the prompt.

>>106153321
I tried it a few times and it creates incoherent stuff for me most of the time.
Anonymous No.106153419
>>106153403
It pisses schizo off*
Anonymous No.106153421
>>106153265
>Ilyasviel and Panchovix to just drop Forge and ReForge and leave them for dead.
More like incessant shilling and toxicity from Comfy and his squad of goons is off-putting. Similar thing happens to all the custom node devs, they eventually get fed up of fighting breaking changes, or the code is copied without credit as a (((native node))).
Anonymous No.106153432 >>106153478
>>106153397
>it's totally not me samefagging a link to the thread
Anonymous No.106153433 >>106153827
>>106153330
houdinigods do not concern themselves with women. we're only interested in men with their skin removed.
Anonymous No.106153445
>>106153240
Maybe?
"Ultra HD, 4K, cinematic composition.", # for english prompt,
"超清,4K,电影级构图" # for chinese prompt,

Found in the python code from https://huggingface.co/DFloat11/Qwen-Image-DF11
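A minimal sketch of how those strings could be tacked onto a prompt. The dict name, the `add_magic` helper and the space used to join the suffix are all assumptions for illustration, not the code from that repo, and whether it actually sharpens the output is untested:

```python
# Suffix strings as quoted above; the "en"/"zh" keys are an assumed layout.
POSITIVE_MAGIC = {
    "en": "Ultra HD, 4K, cinematic composition.",
    "zh": "超清,4K,电影级构图",
}


def add_magic(prompt: str, lang: str = "en") -> str:
    """Append the quality suffix for the prompt's language (hypothetical helper)."""
    suffix = POSITIVE_MAGIC[lang]
    return f"{prompt.rstrip()} {suffix}"


print(add_magic("a red fox in snow"))
# a red fox in snow Ultra HD, 4K, cinematic composition.
```

Pick the suffix that matches the language of the prompt itself.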
Anonymous No.106153463 >>106153542
is there a recommended size to keep Flux gens at? I've just been genning in 1024x1024 because I'm gonna turn the outputs into a texture map (as paintings on a wall) but is Flux one of those models that only works well at a specific res/aspect ratio or is it really just a matter of 'how long do I want my gacha rolls to take' and 'do I really need the extra res to give it more space to fill in detail'
Anonymous No.106153478
>>106153432
Yep you're a schizo alright. I'll go and create a thread on /a/ just for (You) once I learn to generate good enough videos.
Anonymous No.106153483
>>106150752
God, 1.6tb of cyberslop? An older version nonetheless. That's weird fren.
Anonymous No.106153542 >>106153665
>>106153463
no, it can definitely generate higher resolutions and has decent flexibility with resolutions/aspect ratios, I don't think it will do unlimited resolution well tho (even if you had the hardware), its not that kind of a model.
Anonymous No.106153550
>>106153033
not even hiding their intentions any more
>men like it, so it's gotta go
Anonymous No.106153564
>>106153108
kek
>>106153220
retard
Anonymous No.106153665 >>106153832
>>106153542
oh okay, was wondering since Forge defaulted to a 16:9 res but I changed it since a texture map ideally should be power of 2
maybe genning in 16:9 will help avoid making some of my scenery shots feel too cramped
Anonymous No.106153685 >>106153712
>>106153033
Can I have that statue?
Anonymous No.106153712
>>106153685
It's going to be destroyed.
Anonymous No.106153827
>>106153433
all completely obsolete now, amazing
Anonymous No.106153832
>>106153665
no experience with texture maps but for other subjects it's fine, you should try it
Anonymous No.106154153 >>106154240
What's the source on that feet clip? Was it actually made with AI?
Anonymous No.106154240
>>106154153
no that's actual footage of my daughter. i just felt like sharing it
Anonymous No.106155299
>>106148661
>>106148712
did they use gpt-4o to pad the training data? Isn't that illegal?