← Home ← Back to /g/

Thread 105703501

312 posts 270 images /g/
Anonymous No.105703501 [Report] >>105704778 >>105706158 >>105706504
/ldg/ - Local Diffusion General
No Model Card Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105695065

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.105703533 [Report] >>105706158 >>105706504
Blessed thread of frenship
Anonymous No.105703675 [Report] >>105703688 >>105703717
I just checked the wan workflow from the rentry anon https://rentry.org/wan21kjguide and noticed the UnetLoaderGGUFDisTorchMultiGPU now has the option to use_other_vram

Does this mean that basically multi-gpu is solved? can we launch 40-80GB models into 2,3 or 4 3090's?

Also I set the virtual_vram at 0.0 and it's working, with a single 3090 and wanI2V720p model, it used to require up to 10GB extra with loras etc. Does this now have "automatic" fallback into system RAM like the comfy's native workflows always had?
Anonymous No.105703688 [Report] >>105703823
>>105703675
>Does this mean that basically multi-gpu is solved?
always was, this shit has been there for month at this point
Anonymous No.105703717 [Report]
>>105703675
with wan q8 I use 10gb of virtual vram, I have a 4080 with 16 physical, so you can use larger models with it. If you have enough vram (you have 24) then there is no need.

oddly enough the virtual vram doesnt slow it down much if at all for me.
Anonymous No.105703735 [Report] >>105703751
wan 720p isnt that much slower than 480p with the lora, which is neat

using a smaller res to test/for speed but it's a little over a minute...pretty cool imo
Anonymous No.105703751 [Report] >>105703800
>>105703735
>wan 720p isnt that much slower than 480p with the lora, which is neat
which is normal, you haven't changed the resolution, only the resolution and the number of frames changes the speed of inference
Anonymous No.105703800 [Report]
>>105703751
usually with 480p wan 600x480 works fine, im just testing diff resolutions now. but it's still super fast with the new lora.
Anonymous No.105703823 [Report]
>>105703688
>always was
>been there for month[s]
Anonymous No.105703850 [Report]
does someone else have problems with the wan workflow from the rentry not saving the videos in the output folder? i always have to change the prefix to something else, the default prefix doesn't work
Anonymous No.105703930 [Report] >>105703966 >>105704024
idk I think 480p wan works faster and the quality is good at the default preset
Anonymous No.105703931 [Report] >>105703941 >>105704160
>>105703577
I think Intel already have FlashAttention or some kind of optimized kernel built in IPEX, because if not, I think I cannot run Chroma BF16 on A770 at 7it/s
At some point when I upgraded to new beta driver, the speed and VRAM usage suddenly became worse, so I reverted back to an old driver. So I think Intel already had some optimization built in their drivers already
Anonymous No.105703941 [Report]
>>105703931
7s/it, typo lol
Anonymous No.105703966 [Report] >>105704056
>>105703930
that's because the 720p is not intended to run on a 480p res, you have to get 720 minimum on either the height or width
Anonymous No.105703975 [Report] >>105704024 >>105704084 >>105704240
>>105703517
F U C K O F F
crawl back to whatever kindergarden bush you came from
Anonymous No.105704024 [Report] >>105704048 >>105704084
>>105703930
the quality looks good at the intended resolution, whoa no way
>>105703975
why would you link that here, retard?
Anonymous No.105704048 [Report]
>>105704024
what is your problem? hm? fucker
Anonymous No.105704056 [Report] >>105704075
>>105703966
how much memory does 720p wan need at 1280x720? got oom even with 10gb virtual ram (using a 4080/16gb)
Anonymous No.105704075 [Report] >>105704093
>>105704056
>how much memory does 720p wan need at 1280x720?
a lot, that's why I go for 720x720
Anonymous No.105704084 [Report]
>>105703975
Based

>>105704024
Nta because that little freak and his nonce defenders have been posting in these generals now.
Anonymous No.105704089 [Report] >>105704187 >>105704440
its the 25th

where is SageAttention2+

https://github.com/thu-ml/SpargeAttn
>SpargeAttn based on SageAttention2++ will be released around June 25.
>SpargeAttn based on SageAttention2++ will be released around June 25.
>SpargeAttn based on SageAttention2++ will be released around June 25.
Anonymous No.105704090 [Report] >>105704136 >>105704154 >>105705006
been out of the loop, what is chroma? Is it based on flux? is it on civitAI? Will it work with webui forge?
Anonymous No.105704093 [Report]
>>105704075
I changed it from 10 to 16 virtual and now no oom, so we'll see how this works. Yeah with 480p wan I like 600x480 for speed with minimal quality loss.
Anonymous No.105704136 [Report] >>105704185 >>105705006
>>105704090
chroma is a model based on flux
https://huggingface.co/lodestones/Chroma

yes it's on civitai
https://civitai.com/models/1338204

yes forge supports it as of this commit
https://github.com/lllyasviel/stable-diffusion-webui-forge/commit/963e7643f07dca155de5de2f617cc17adc2aee4d
Anonymous No.105704150 [Report] >>105704192
Anonymous No.105704154 [Report] >>105704185
>>105704090
A large finetune of Flux Schnell, it is currently still training, at epoch 39 of an estimated 50.

Every trained epoch is released, so you can try it out while it is unfinished.

Support for Forge was merged ~two days ago.

https://huggingface.co/lodestones/Chroma
Anonymous No.105704160 [Report] >>105706264
>>105703931
So I simplified what they actually did but I have done research on this. They put in an optimized version of standard SDPA into standard stock Pytorch 2.7 which piggybacks off their oneDNN library for SYCL and etc. It uses a very generic version of SDPA built with OpenCL.
This is where the speed is coming from on top of BF16 support in the Alchemist architecture where they can get it fairly fast. However, that is not Flash Attention and it could go even faster if they had that. They did have a pull request in the main Flash attention repo, but they closed it.
https://github.com/Dao-AILab/flash-attention/pull/1528
And that only supports Battlemage and Ponche Vecchio since it relies on some lower level primitives Alchemist is missing. And obviously as I mentioned earlier, MIA on other attention types.
Anonymous No.105704166 [Report] >>105704186 >>105704209
slightly smaller width, seems to work decent
Anonymous No.105704185 [Report]
>>105704136
>>105704154
thanks senpai
Anonymous No.105704186 [Report] >>105704197
>>105704166
I do notice the eyes in particular are clearer compared to the 480p model, among other things, with the interpolated output (this isnt) it's even smoother as well
Anonymous No.105704187 [Report]
>>105704089
what? I think you sent the wrong link, this is SpargeAttn, not SageAttention
Anonymous No.105704192 [Report]
>>105704150
So much better than Darth Plagueis
Anonymous No.105704197 [Report]
>>105704186
>I do notice the eyes in particular are clearer compared to the 480p model, among other things
of course, more pixels = better details
Anonymous No.105704198 [Report] >>105706805
Anonymous No.105704209 [Report]
>>105704166
interpolated with a new gen, smooth
Anonymous No.105704240 [Report]
>>105703975
thanks for the you
Anonymous No.105704290 [Report] >>105706805
Anonymous No.105704356 [Report] >>105704369
asian girl puts a can down on a table.

720p is fast with the lora, scaled down a bit I can go back to 10 virtual vram with a 4080 and no oom error, plus slightly faster.
Anonymous No.105704369 [Report]
>>105704356
er...it was at 16 still. but multigpu is flexible anyway. 10 seems fine now, no oom. so I have to bump it higher at full 720p default res.
Anonymous No.105704381 [Report]
I remember back when I used the 720p model, it had a slightly better prompt adherence than the 480p model. Can anyone confirm if that's still the case with the distillation lora?
Anonymous No.105704397 [Report]
Anonymous No.105704409 [Report]
Anonymous No.105704440 [Report]
>>105704089
https://github.com/thu-ml/SpargeAttn?tab=readme-ov-file#project-updates
>SpargeAttn based on SageAttention2++ will be released around June 25.
So basically if you choose a sparsity of 0 you get the OG SA2++?
Anonymous No.105704442 [Report]
girl holding can dives into a pool
Anonymous No.105704463 [Report] >>105704543
I'm trying to just use the fp16 version of wan instead of the gguf but it breaks in the rentry workflow. All I did was add a unetloader node and connect it where the unetgguf node was connected. However, the gens get all jacked up. Is there something else I need to do?
Anonymous No.105704471 [Report] >>105704976 >>105707372
>>105703169
>e621 and furry into negative
>>105703378
LOL, I tried this and it actually works. So far in my testing, it almost always makes significant improvements to anatomy and background details.

PUT E621 AND FURRY IN NEGATIVE. the furry has been training on too much furry data, chroma is literally inserting furry mutations everywhere.
Anonymous No.105704504 [Report]
Anonymous No.105704543 [Report] >>105704578
>>105704463
grab the gguf q8

https://huggingface.co/city96/Wan2.1-I2V-14B-720P-gguf
Anonymous No.105704578 [Report]
>>105704543
I was intentionally trying to use the fp16, but your link helped me since there's also fp16 ggufs too, thanks!
Anonymous No.105704588 [Report]
Anonymous No.105704615 [Report]
anime dog flies high in the air

wan gave doro wings:
Anonymous No.105704669 [Report] >>105704898 >>105704969
You wouldn't... you know... a mushroom, would you?
Anonymous No.105704672 [Report] >>105704705
Anonymous No.105704678 [Report]
doro mcdonalds:
Anonymous No.105704693 [Report] >>105704732
AI boomer here. Is Waifu2X still considered a relatively "modern" upscaler for anime-style images or is it stone age tech today ?
Anonymous No.105704705 [Report] >>105704756
>>105704672
nice, is that VACE with force enforcing lora?
Anonymous No.105704732 [Report]
>>105704693
definitely not.
couldn't remember the names but i've seen a ton of shit that completly mogs it since.
Anonymous No.105704736 [Report] >>105705540
a girl jumps into a swimming pool

not bad, interpolation on the 720p gens takes much longer I notice, this is the regular output.
Anonymous No.105704756 [Report]
>>105704705
no
Anonymous No.105704778 [Report] >>105704796
>>105703501 (OP)
totally missed that thomas jefferson webm last thread
Anonymous No.105704796 [Report] >>105705074
>>105704778
Sir that's Benjamin Franklin.
Anonymous No.105704807 [Report]
the ball necklace lora cant handle my waifu's big neck
Anonymous No.105704863 [Report] >>105705104
If the depth controlnet has trouble properly transfering the pose, should I cut out the background or raise brightness/contrast?
Anonymous No.105704889 [Report] >>105705005
a girl sits at a desk with a computer.

amazing, it's a stellar blade image and she seems to be playing a gacha like nikke (made by the devs)
Anonymous No.105704898 [Report] >>105705173 >>105705394
>>105704669
WOULD.
would you a big titty succubus?
Anonymous No.105704969 [Report]
>>105704669
would
Anonymous No.105704975 [Report] >>105705005
a girl dives off a cliff into a swimming pool.

AND out
Anonymous No.105704976 [Report] >>105704987
>>105704471
what's this art style again?
Anonymous No.105704987 [Report] >>105705060 >>105705642
>>105704976
She's fat as fuck so maybe Baroque?
Anonymous No.105705005 [Report]
>>105704889
>>105704975
damn these are really cool
Anonymous No.105705006 [Report] >>105705014 >>105705073 >>105705593 >>105705698
>>105704090
>>105704136
I used this gguf with forge and it works but my gen looks like this on a simple prompt

I'm clearly doing something wrong
Anonymous No.105705014 [Report]
>>105705006
this is chroma btw
Anonymous No.105705023 [Report] >>105705033
a girl dives off a cliff into a swimming pool.

pretty good!
Anonymous No.105705033 [Report] >>105705540
>>105705023
interpolated output:
Anonymous No.105705060 [Report]
when you think about it, AI waifus are a real life succubus.

>>105704987
kek. no, but that reminds me to do some Rubens gens.

>Masterpiece very dark French Neoclassical orientalist oil painting on canvas by Jean-Auguste-Dominique Ingres with visible brushstrokes and Craquelure.
Anonymous No.105705062 [Report]
a girl gets into a red sports car and drives away in the desert.
Anonymous No.105705073 [Report] >>105705593
>>105705006
Yeah it should look a bit better than that. Let me know your settings or catbox the gen so I can take a look and see if I can spot what's missing
Anonymous No.105705074 [Report]
>>105704796
making out with sally hemmings??
Anonymous No.105705104 [Report] >>105705119
>>105704863
shouldnt you be using open pose or something, not depth?
Anonymous No.105705119 [Report] >>105705207
>>105705104
Openpose is for the premade stick figures
Anonymous No.105705150 [Report]
pouring one out into an invisible glass
Anonymous No.105705173 [Report] >>105705204 >>105705379
>>105704898
Depends on my survival chances. Ah, fuck it, WOULD.
Would you a marble statue, however?
Anonymous No.105705204 [Report] >>105705507
>>105705173
what's better, regular or the detailed chroma version? i'd assume detailed but maybe not
Anonymous No.105705207 [Report] >>105705242
>>105705119
open pose creates a stick figure which can be used to pose the character. you said you want the pose transferred, and that's what it does?
Anonymous No.105705242 [Report] >>105705443
>>105705207
Yes, but if I already have an image, I can just use depth or canny
Anonymous No.105705271 [Report] >>105705324 >>105705337
node based interfaces will age like milk
Anonymous No.105705324 [Report]
>>105705271
houdini is doing fine anon
Anonymous No.105705337 [Report]
>>105705271
it's honestly the complete mismanagement of community nodes that poison the well of workflows. they refuse to put redundant and outdated shit to rest
Anonymous No.105705376 [Report] >>105705425 >>105706599
For my fellow 80's/90's kids.
Anonymous No.105705379 [Report]
>>105705173
nope. hard, cold, and dry, sounds awful.
here's Rubens.
Anonymous No.105705394 [Report]
>>105704898
don't they steal your soul if you fuck them
I'm horny but not eternal damnation horny
Anonymous No.105705405 [Report]
I nothing to add to this conversation so here's this
Anonymous No.105705425 [Report]
>>105705376
just for that, the game over of banjo kazooie was worth it kek
Anonymous No.105705443 [Report]
>>105705242
I really don't know what you want. Are you saying you only want the pose or everything else in the image, hence the depth CN?
Anonymous No.105705465 [Report]
anime girl miku hatsune dives into a swimming pool.
Anonymous No.105705497 [Report] >>105705517
anime girl with blue eyes punches a hole in the wall to her left.
Anonymous No.105705507 [Report]
>>105705204
I'm using the detail calibrated version, but I haven't done any testing whatsoever. I just know that it werks.
Anonymous No.105705517 [Report]
>>105705497
Very kino of it to hallucinate an instant hole drilling guitar
Anonymous No.105705540 [Report]
>>105704736
>>105705033
Looks like she gave up on life once she hit the water
Anonymous No.105705548 [Report]
Ah yeas, four leaf clover
Anonymous No.105705551 [Report]
Anonymous No.105705562 [Report] >>105705578
finally, death stranding honest edition.
Anonymous No.105705578 [Report] >>105705617 >>105706361
would you a squid?

>>105705562
kek
Anonymous No.105705593 [Report] >>105705700 >>105705746
>>105705006
>>105705073
ah shit had to afk, heres the gen though

https://files.catbox.moe/p2mhpe.png

If anyone can chroma this and post the result that would nice.
Anonymous No.105705599 [Report]
bumped res a bit, amazon man wont be stopped:
Anonymous No.105705606 [Report]
Anonymous No.105705611 [Report] >>105705631 >>105705636
its so hard to make women with large hooked noses
Anonymous No.105705617 [Report]
>>105705578
100%
Anonymous No.105705625 [Report]
okay, NOW it's a success with a sorta proper logo.
Anonymous No.105705631 [Report]
>>105705611
small hooked nose enjoyer here I hate liking things that will never be tagged, or arent tagged in english like seiza and wariza even though they are common poses
Anonymous No.105705636 [Report]
>>105705611
make a lora for it? find a bunch of images of women with large hooked noses.
Anonymous No.105705642 [Report]
>>105704987
The only thing she baroque is the scale
Anonymous No.105705680 [Report] >>105705698
forge, reforge, or classic for chroma?
Anonymous No.105705698 [Report]
>>105705680
Judging by this guy's pics, you need to use comfy: >>105705006
Anonymous No.105705700 [Report] >>105705709 >>105705753
>>105705593
>cfg scale 1.0
Go to 4.
This is the gen with cfg set at 1 and my negatives.
Anonymous No.105705709 [Report] >>105705753 >>105706019
>>105705700
Same seed, prompt etc with CFG at 4 (also upscaled because I forgot to disable it, lol!)

Also, your prompt is... not great for chroma. You need to kinda tell it a bedtime story for a decent gen most of the time.
Anonymous No.105705713 [Report]
Of course the silicone demon gives a robot the correct hand.
Anonymous No.105705746 [Report]
>>105705593
I set the cfg to 3.5 (recommended for chroma, iirc)
I also set steps to 26 (same)
Negatives: "aesthetic 0, aesthetic 1, e621, furry, low quality, 3D, render, drawing,"
I also used comfyui, just because I'm used to that with chroma at this point.

Not sure if adetailer is messing with it either. You'll also want to experiment with a more detailed prompt.
Anonymous No.105705748 [Report] >>105705754 >>105705765 >>105705817 >>105705968 >>105705984 >>105708224
https://github.com/comfyanonymous/ComfyUI/pull/8669

>get omnigen2
>put image of girl
>use prompt "make her naked"
Enjoy gooners, this one is for you.
Anonymous No.105705753 [Report]
>>105705700
>>105705709
Oh thanks anon. I got old advice that flux used distilled cfg over cfg and didn't like negative prompts.
Anonymous No.105705754 [Report] >>105705757
>>105705748
Is this gen or just inpainting model?
Anonymous No.105705757 [Report]
>>105705754
It's like GPT 4o without censorship.
Anonymous No.105705765 [Report] >>105705790 >>105705803
>>105705748
but the model looks like shit from a butt desu
Anonymous No.105705790 [Report]
>>105705765
The model looks extremely good, you are most likely not using it correctly.
Anonymous No.105705803 [Report]
>>105705765
nta but yeah, I tested omnigen pretty extensively on my 3090 with their official repo and it was shit

constantly doesn't do what you ask until you try 5 or 6 different seeds, alters faces, changes random stuff etc

it's barely any less shit than the first omnigen. and txt2img is as slopped as Cosmos2B, while being much slower
Anonymous No.105705806 [Report]
how is NIGGIDY NAG for chroma?
Anonymous No.105705817 [Report] >>105705918
>>105705748
When can i expect this in AniStudio?
Anonymous No.105705819 [Report]
Anonymous No.105705918 [Report] >>105706057
>>105705817
two more weeks
Anonymous No.105705968 [Report] >>105706003
>>105705748
it's working for me
https://files.catbox.moe/fkf3y5.png
Anonymous No.105705984 [Report] >>105706004
>>105705748
I never tried the official implementation but I'm using this one now. It appears to be completely uncensored? Can do breasts, nipples, pussy, dick (kind of). Cocks look a little bit weird but it's clearly like 90% of the way there. It can even almost do POV missionary sex. Downside is that the textures and lighting are completely slopped but that can be finetuned away.
Anonymous No.105706003 [Report] >>105706960
>>105705968
actually, though it's working for me, it does have a plastic skin problem. and tendency to change faces a bit. surprisingly good anatomy, nudity, sex organs, even seems like you can add in sex with other people with enough prompt engineering. It struggles a bit with oral.
Anonymous No.105706004 [Report] >>105706021
>>105705984
Strange that you neglected to include a catbox.
Anonymous No.105706019 [Report] >>105706265
>>105705709
Anonymous No.105706021 [Report]
>>105706004
It's in ComfyUI now, just download the model and try it yourself you lazy fuck
Anonymous No.105706057 [Report]
>>105705918
Are you mocking ani?
Anonymous No.105706066 [Report]
Anonymous No.105706080 [Report]
Anonymous No.105706132 [Report]
Anonymous No.105706154 [Report]
Anonymous No.105706158 [Report] >>105706186 >>105706251 >>105706301
>>105703533
:/
>>105703501 (OP)
>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/vp/napt


v disappointed w\ the 'community' here
v disappointed w\the 'innocent bakers' here
v disappointed w\ the 'harmless thread blesser'
v disappointed w\ the thread quality in the last 4 months (possibly longer)
Anonymous No.105706164 [Report]
where's a good vace workflow
Anonymous No.105706178 [Report]
>rocketgurlp***
Anonymous No.105706186 [Report] >>105706199 >>105706277
>>105706158
she looks SO sad wow
Anonymous No.105706194 [Report]
pretty good
Anonymous No.105706199 [Report] >>105706277
>>105706186
she does look like she regrets something
but what?
Anonymous No.105706217 [Report]
Anonymous No.105706231 [Report] >>105706612 >>105707700
Anonymous No.105706236 [Report]
stop shitting up /napt/ asshole ;c
Anonymous No.105706243 [Report]
two men get in a chopper and fly away

interesting.
Anonymous No.105706251 [Report] >>105706268
>>105706158
>>v disappointed w\ the 'community' here
>v disappointed w\the 'innocent bakers' here
>v disappointed w\ the 'harmless thread blesser'
>v disappointed w\ the thread quality in the last 4 months (possibly longer)
Why?
Anonymous No.105706264 [Report]
>>105704160
Then it's pretty hard, optimized attentions such as xformers or SageAttention are very catered to NVIDIA specifics. I think FlashAttention is fairly platform indepedent though?
Anyway we shouldn't hope for community efforts for such things as Sage, because most of them only think about NVIDIA NVIDIA.
And Intel is pretty slow on implement things not related to LLM, well...
Anonymous No.105706265 [Report]
>>105706019
please take care of my daughter she likes milkshakes and pringles
Anonymous No.105706268 [Report] >>105706290 >>105706297
>>105706251
(self) appointed thread authority, you gain this ability (delusion) by posting the same image for a year
Anonymous No.105706277 [Report] >>105706299
>>105706199
>>105706186
im posting sadgirls until you update your neighbors list
NO FUN ALLOWED!
Anonymous No.105706290 [Report]
>>105706268
ask me for my workflow again
get mad again that you dont know how to use comfy kek
get mad at the lora stack i chose again
get mad at rgal-posting (yet again)
saltmining is tough work, but i must continue
Anonymous No.105706297 [Report]
>>105706268
what is authoritarian about asking to be on the neighbors list kek
Anonymous No.105706299 [Report] >>105707589
>>105706277
Can you make her fart?
Anonymous No.105706301 [Report] >>105706303
>>105706158
SAD, PATHETIC
Anonymous No.105706303 [Report]
>>105706301
v nice img2img :3
Anonymous No.105706358 [Report] >>105706380 >>105706401
Watch out for tight underwear
Anonymous No.105706361 [Report] >>105706398
chroma is kino.

>>105705578
any video frends want to try animating this one, pretty please?
Anonymous No.105706380 [Report] >>105706398
>>105706358
Anonymous No.105706398 [Report] >>105706412
>>105706380
Yes most of my gens will be a femboy or chick's with dicks. I mean I have regular stuff.
>>105706361
Some times you just know
Anonymous No.105706401 [Report]
>>105706358
Not local, post this garbage in the /aco/ thread or something.
Anonymous No.105706412 [Report]
>>105706398
>Yes most of my gens will be a femboy or chick's with dicks. I mean I have regular stuff.
that's pretty gay
Anonymous No.105706424 [Report] >>105706836
>push me
Anonymous No.105706426 [Report] >>105706435 >>105706436 >>105706453 >>105706455 >>105706481
a man with sunglasses walks to a computer and starts typing.

well, I guess it works.
Anonymous No.105706435 [Report]
>>105706426
kek
Anonymous No.105706436 [Report]
>>105706426
it's like those youtube channels from the 00s with the fancy graphic then the video starts and it's a goofy looking dweeby mofo
Anonymous No.105706438 [Report] >>105706473
What is the secret to stop the slow motion shit?
Anonymous No.105706442 [Report] >>105706692
Anonymous No.105706453 [Report] >>105706460 >>105706481 >>105706836
>>105706426
second try: JC does laptop tutorials
Anonymous No.105706455 [Report]
>>105706426
if you add "stable camera" you wont have faggy jumpcuts in your wanvideos
bye.
Anonymous No.105706460 [Report] >>105706493
>>105706453
>4:3 ROG laptop
Anonymous No.105706473 [Report] >>105706488
>>105706438
Try adding "slow motion" to negative and "movement is fast" to positive?
Anonymous No.105706481 [Report]
>>105706453 >>105706426
I want a deus ex review from these guys like the old days
Anonymous No.105706488 [Report] >>105706537 >>105706539 >>105706575
>>105706473
I already tried that genius, not working
Anonymous No.105706490 [Report]
now this is a transition.
Anonymous No.105706493 [Report]
>>105706460
thats the only R0G aspect ratio i would consider though
>120+ fps with black-frame insertion
>4:3 retro vidya mobile \bst\
W O U L D
Anonymous No.105706504 [Report]
>>105703501 (OP)
Thank you for baking this thread, anon.
>>105703533
Thank you for blessing this thread, anon.
Anonymous No.105706509 [Report]
As a pokemon fan, I used to like Team Rocket fanart a lot, I thought it was cool.
This faggot is not only shitting up the threads with his terrible and slopped 1girls, but is also misrepresenting what team rocket grunts are supposed to be
Anonymous No.105706537 [Report] >>105706558 >>105706692
>>105706488
Anonymous No.105706539 [Report] >>105706575
>>105706488
Lower lora strength?
Anonymous No.105706543 [Report]
imagine unironically complaining about "slop" on \LDG\
aka local diarrhea general

>not supposed to be that!
clearly unfamiliar with true canon
educate yourself.
not my job to give lectures to fill in the gaps in your tiktok\doomscroll fried zero attentionspan zoomer brain


>drunk\obsessed\schizoid no-genposter continues shidding\farding as per usual
ew.
Anonymous No.105706558 [Report] >>105706629 >>105706692
>>105706537
Milk
Anonymous No.105706575 [Report]
>>105706488
>>105706539
>"neg: slow motion, static, still, incomplete, frozen, freeze-frame"
Anonymous No.105706576 [Report] >>105706604
As opposed to every other gen in this thread, yours look like shit that could be made on fucking SD1.5. That's the clearest definition of slop.
You also NEVER, EVER, even post the slightest effort in your avatarfagging, like making the Rocket girls take any action outside of just posing, or making the scene interesting, adding pokemon and pokeballs in it etc.
Anonymous No.105706599 [Report]
>>105705376
basado
Anonymous No.105706604 [Report] >>105706648
>>105706576
>yours look like shit that could be made on fucking SD1.5
well guess why? local models are still shit
Anonymous No.105706608 [Report]
if you could read you would see that i reply\help people every thread anonkun
wanvid can be finnicky
Anonymous No.105706612 [Report]
>>105706231
looks like a screenshot from youtube spot on
Anonymous No.105706629 [Report] >>105706705 >>105706744
>>105706558
Anonymous No.105706644 [Report] >>105706660
hey don't be sad
Anonymous No.105706648 [Report]
>>105706604
As long as you are not a vramlet or a mentally handicapped gooner like some of the users and the rocketnigger, that statement is thoroughly incorrect. You can do complex and interesting scenes with Chroma and Flux just fine, as shown time and again in several of the threads.
Anonymous No.105706660 [Report]
>>105706644
v yucky fetish
Anonymous No.105706692 [Report]
>>105706537
>>105706558
>>105706442
not good anon, NOT GOOD.
Anonymous No.105706705 [Report] >>105706713 >>105706858
>>105706629
So the anon screaming about the sloppa being bad do i indulge them or just ignore?
Anonymous No.105706713 [Report] >>105706799
>>105706705
ignore OBV
Anonymous No.105706718 [Report] >>105706830
where are the pixartsexuals nowadays
Anonymous No.105706744 [Report]
>>105706629
;3
Anonymous No.105706799 [Report] >>105706837
>>105706713
Have a vampire
Anonymous No.105706805 [Report]
>>105704198
>>105704290
What is its name?
Anonymous No.105706830 [Report] >>105706967
>>105706718
what's the tldr of pixart models? and hunyuan dit for that matter? why no adoption?
Anonymous No.105706836 [Report]
>>105706424
>satisfaction

>>105706453
can it emulate the old unreal engine style?
Anonymous No.105706837 [Report]
>>105706799
cute
Anonymous No.105706858 [Report] >>105707128 >>105707239
>>105706705
>screaming
how do you scream on 4chan
Anonymous No.105706960 [Report]
>>105706003
>it does have a plastic skin problem

Have you tried upscaling using Chroma?
Anonymous No.105706967 [Report]
>>105706830
tencent fucked us on licencing and Alibaba basically got them btfo. pixart was always a meme
Anonymous No.105706968 [Report] >>105706981
bugged animations in this game
Anonymous No.105706981 [Report]
>>105706968
He wanted lemon lime
Anonymous No.105707013 [Report] >>105707123
Want can't seem to make sense of the background and keeps wanting to change it.
Anonymous No.105707061 [Report]
Anonymous No.105707103 [Report]
Anonymous No.105707123 [Report]
>>105707013
add description of bg to your prompt+ "stable camera"
or cutout in photoshop & move her to a by of your own :3
Anonymous No.105707128 [Report] >>105707231
>>105706858
>le scramz
shidpoast daily saying the same crap? ;3
Anonymous No.105707231 [Report] >>105707350
>>105707128
like the guy who asks op to add veepee?
Anonymous No.105707239 [Report]
>>105706858
i2i a face using the goatse pic
Anonymous No.105707305 [Report] >>105707340 >>105707746
Anonymous No.105707313 [Report] >>105707322 >>105707329 >>105707755
In A1111 I remember in img2img there was a control for how much noise was injected into the source image essentially to control how much the generated image can diverge from the original - how do I set that up in comfy?
Anonymous No.105707322 [Report]
>>105707313
just look for a denoise setting
Anonymous No.105707329 [Report]
>>105707313
You can't. Comfy is pos software and its dev is basically a money hungry jew.
Anonymous No.105707340 [Report]
>>105707305
nice gag clip for a youtube video
Anonymous No.105707350 [Report]
>>105707231
no, like no-gen complainer/saltmine shitposter schizoids
Anonymous No.105707364 [Report] >>105707372 >>105707400 >>105707425 >>105707480
What am I doing wrong with chroma?

Photorealism: check
Aesthetics: check
Detailed backgrounds: check
anatomy: kick rocks
Anonymous No.105707368 [Report] >>105707586
anyone know wtf this is - (*bias): last dimension must be contiguous
Anonymous No.105707372 [Report] >>105707388
>>105707364
Maybe >>105704471
Anonymous No.105707388 [Report]
>>105707372
already in my negative prompt
Anonymous No.105707400 [Report] >>105707425 >>105707456 >>105707495
>>105707364
try increasing cfg and steps
Anonymous No.105707401 [Report] >>105707418
>im outta here
Anonymous No.105707418 [Report]
>>105707401
Wow the first coherent one.
Anonymous No.105707425 [Report] >>105707441
>>105707364
>>105707400
Might also want to try an ancestral sampler if you're not doing so already. I often see euler ancestral being able to fix bad anatomy during generation.
Anonymous No.105707441 [Report] >>105707484
>>105707425
I use euler, does it matter the sample type?
Anonymous No.105707456 [Report] >>105707495
>>105707400
maybe the spaghetti girls are slightly better
Anonymous No.105707474 [Report] >>105707525
man with sunglasses gets onto a black helicopter.

well, it's still true.
Anonymous No.105707480 [Report]
>>105707364
Try using res multistep for higher quality than Euler or for highest quality heun or dpm 2. Take at least 30 steps as opposed to 20, the more, the better. Remember the model is not done training yet, and it attempts to be very dynamic which is good, but that comes at cost of mistakes. Turn image previews on to cancel ahead of time obviously bad gens.
Anonymous No.105707484 [Report] >>105707519 >>105707581
>>105707441
Yes. See here for example (from OP): https://stable-diffusion-art.com/samplers/#Ancestral_samplers

It's not explained too well, admittedly, but basically the added randomness at each step gives the model more opportunity to fix weirdness during generation - at least in my experience specifically with Chroma and photorealistic generations.
If you have a disembodied leg somewhere, other samplers might be forced to just lock in and go with it, whereas with an ancestral sampler, there is more of a chance it can turn into a towel or something. Do some experiments with and without ancestral would be my recommendation.
Anonymous No.105707495 [Report] >>105707506
>>105707400
>>105707456
Increasing cfg is bad advice. Results in overfitted slopped Flux look, not sure that is "better" anatomy.
Anonymous No.105707506 [Report]
>>105707495
>Increasing cfg is bad advice
experimenting with 0.1 increases/decreases works pretty often, you dont have to go overboard
Anonymous No.105707519 [Report]
>>105707484
ah fuck me I forgot euler a exists
Anonymous No.105707523 [Report] >>105707539
My CFG is 4.5, default from Chroma workflow, never felt need to experiment at least for photorealism cause it's perfect, but feel free to play with it
Anonymous No.105707524 [Report]
Anonymous No.105707525 [Report]
>>105707474
kek
Anonymous No.105707537 [Report] >>105707549
is using --listen argument and prompting with my phone over the internet safe?
Anonymous No.105707539 [Report] >>105707569
>>105707523
Oh wow ldg sure has all this technical knowledge...
Anonymous No.105707549 [Report] >>105707562
>>105707537
If you need to ask: no
But you have already compromised your system because you're a tard normie, so please go ahead.
Anonymous No.105707562 [Report] >>105707569
>>105707549
Oh wow ldg sure has all this technical knowledge...
Anonymous No.105707569 [Report]
>>105707539
>>105707562
Oh wow ldg sure has all this technical knowledge...
Anonymous No.105707580 [Report]
bout to make a big stinky
Anonymous No.105707581 [Report] >>105707694
>>105707484
I just don't use Euler to avoid mistakes. In my case I've found it to be inferior to res multistep or dpmpp 2m. If swap to Euler I always find entire image quality to be significantly worse and never better.
Anonymous No.105707586 [Report]
>>105707368
NVM i found the problem had to downgrade pytorch
Anonymous No.105707589 [Report]
>>105706299
If you need to ask: no
But you have already compromised your system because you're a tard normie, so please go ahead.
Anonymous No.105707616 [Report] >>105707631 >>105707952
https://ace-step.github.io/
Looking unlikely these guys will open source their next improved model. Looks like we will never get a local Udio tier model because no one wants to mess with record companies.
Anonymous No.105707631 [Report] >>105707655
>>105707616
>next improved model
Where? And is it actually Udio tier or just another generic chinese pop rap dataset?
Anonymous No.105707636 [Report] >>105707694
Anonymous No.105707653 [Report]
Why did you tell anon to increase CFG :(
Anonymous No.105707655 [Report] >>105707662
>>105707631
>Where? And is it actually Udio tier or just another generic chinese pop rap dataset?

It's not really out. But those guys can easily make the model Udio tier, and I don't think it'd be local because they wouldn't have an incentive to release it. They are a small Chinese lab and there are millions to be made from such a model behind API.
Anonymous No.105707662 [Report]
>>105707655
That's a shame. Basically the only hope for a true competitor to Udio is some random autist with a training rig and nothing to lose, which I doubt there are any out there.
Anonymous No.105707668 [Report]
>have one parameter which can go up or down
Normies are crying about it.
Anonymous No.105707694 [Report]
>>105707636
INCREASE CFG. HIGHER. not under 6.9
>>105707581
euler is ok, just needs more steps. but yeah dpmpp2m and other samplers seem to be better. really depends on the content tho
Anonymous No.105707696 [Report]
what distilled cfg do you guys use for chroma
Anonymous No.105707700 [Report]
>>105706231
reminds me of a carmen san diego cutscene
Anonymous No.105707731 [Report] >>105707755 >>105707968 >>105708162
>OmniGen2
>Finally getting better results than Closed AI 4o at home.

Sweet. Finally I can generate my obscure waifus from 1 single picture then train LoRas some more from synthetic data.

https://github.com/VectorSpaceLab/OmniGen2
Anonymous No.105707746 [Report]
>>105707305
if you follow aoe2, this actually looks like something that would appear on memb's channel kek
Anonymous No.105707755 [Report] >>105707787
>>105707313
there are about 100 different ways to inject noise into an img2img process. need to be more specific.
>>105707731
how fast is it?
Anonymous No.105707757 [Report]
done for the night catboxing this one just in case someone wants the prompt

https://files.catbox.moe/bw99dn.png
Anonymous No.105707787 [Report] >>105707804 >>105707833 >>105707842 >>105707939
>>105707755

~16 seconds, 4090
Anonymous No.105707804 [Report]
>>105707787
>Harpuea
What kind of a username is this?
Anonymous No.105707821 [Report]
Anonymous No.105707823 [Report]
lightx2v is only good for animated wallpapers and fake live2d
Anonymous No.105707833 [Report] >>105707843
>>105707787
is it censored?
Anonymous No.105707840 [Report] >>105707848
Anonymous No.105707842 [Report]
>>105707787
nice. so, according to my precise calculations, around 25 secs on my african space force 3090
Anonymous No.105707843 [Report]
>>105707833

Not cencored. Nipples and pussy shows.
Anonymous No.105707848 [Report] >>105707899
>>105707840
What model is this? Box please?
Anonymous No.105707892 [Report]
Anonymous No.105707899 [Report] >>105707914
>>105707848
https://files.catbox.moe/7ddgvl.png
Anonymous No.105707911 [Report]
a man with sunglasses turns around and starts running far away.
Anonymous No.105707914 [Report]
>>105707899
I saved it.
Anonymous No.105707939 [Report] >>105707942
>>105707787
>~16 seconds, 4090
wtf, takes me 4 mn on my 3090, how??
Anonymous No.105707942 [Report] >>105707945
>>105707939

Flash attention and Triton installed?
Anonymous No.105707945 [Report]
>>105707942
yes, I'll test the native Comfy one soon and see how much faster it is with sage
Anonymous No.105707951 [Report]
UM GUNNA OOOM
Anonymous No.105707952 [Report]
>>105707616
>Looks like we will never get a local Udio tier model because no one wants to mess with record companies.
yeah, the music industry is worse than the mafia, even the chinks are terrified about copyright when it's about music
Anonymous No.105707967 [Report]
too realistic.
Anonymous No.105707968 [Report] >>105707975 >>105708000
>>105707731
Can omnigen for instance fatten up a skinny girl? Asking for a friend
Anonymous No.105707973 [Report] >>105708002 >>105708192 >>105708206 >>105708217
https://github.com/mit-han-lab/radial-attention
>We present Radial Attention, a sparse attention mechanism with O(nlogn) computational complexity. Radial Attention accelerates pre-trained HunyuanVideo by 1.9× at its default video length while maintaining comparable video quality. When generating 4× longer videos, it reduces tuning costs by up to 4.4× and speeds up inference by up to 3.7× versus dense attention.
ok now we're talking
Anonymous No.105707975 [Report]
>>105707968
CANNOT and WILL NOT
Anonymous No.105708000 [Report] >>105708104
>>105707968
just gen something tasty, why bother. seriously.
t. dipping into chubby after having exploited.. everything else.
Anonymous No.105708002 [Report] >>105708010
>>105707973
>accelerates pre-trained HunyuanVideo
It seems lots of researchers turned their heads to Hunyuan when it first released and spent the last few months with it, huh? Too bad that model was only relevant for like 3 months

On a side note, going forward, we will probably be receiving papers in the coming months using Wan as the base
Anonymous No.105708010 [Report]
>>105708002
yep, it takes time to make such a project so I'm not surprised one bit, at least they're not shilling fucking Sora mini you know what I mean kek
Anonymous No.105708016 [Report] >>105708034 >>105708098
To this day, I haven't seen any Wan gen nearly as soulful as this. This was Skyreels I think. Higher base fps makes quite a difference at times
Anonymous No.105708034 [Report] >>105708090 >>105708098
>>105708016
Wan as comparison
Anonymous No.105708044 [Report] >>105708084 >>105708119
https://github.com/thu-ml/SageAttention/issues/190
I'm glad that those fuckers are getting shitstormed lol
Anonymous No.105708084 [Report]
>>105708044
>Why are you all locking it down pretending like this is Flux Kontext with risk to public safety????
Meanwhile in chink's world:
>Here, have our Omnigen 2, it can do nudity have fun!
Anonymous No.105708090 [Report] >>105708160
>>105708034
y aint it the same starting frame tho
Anonymous No.105708098 [Report]
>>105708016
>>105708034
I prefer Wan's one, the structure is better it has less AI weirdness
Anonymous No.105708104 [Report] >>105708159
>>105708000
Based trips
Anonymous No.105708119 [Report]
>>105708044
https://github.com/thu-ml/SageAttention/issues/190#issuecomment-3003693626
>the license in the gated HF repo seems to be unchanged. Weird.
https://huggingface.co/jt-zhang/SageAttention2_plus
>apache 2.0 licence
you know what this means? the first mf who gets the hands on the code can post i on github without much issues
Anonymous No.105708152 [Report]
Anonymous No.105708159 [Report]
>>105708104
a sign of THE MAN to dive deeper into chubby huh. aw man. chubby koreans are tasty.
Anonymous No.105708160 [Report]
>>105708090
Yeah, I forced a frame trying to get results as close as possible to the other one (also because I liked that smile better, heh)

But here is one with the same starting frame as the other
Anonymous No.105708162 [Report] >>105708183
>>105707731
>better results than Closed AI 4o
obviously no pissfilter and vagene is amazing but does it also surpass it in other areas?
Anonymous No.105708183 [Report] >>105708218
>>105708162
comparing it to 4o is useless, the new SOTA model to compare with is Kontext pro
Anonymous No.105708192 [Report] >>105708202
>>105707973
Wait so youre saying, longer videos + faster gen times all in one? Oh, keeping a fat eye on this one. This is the news I come here for
Anonymous No.105708202 [Report]
>>105708192
>Wait so youre saying, longer videos + faster gen times all in one?
yeah, it gets faster and faster when it's longer (and has more resolution) because the attention layer is not quadratic anymore, if they managed to keep good quality on it that's a big deal
Anonymous No.105708206 [Report]
>>105707973
The code's already there, right? Can't kijai or comfy integrate it into a node themselves?
Anonymous No.105708217 [Report]
>>105707973
zased, that'll fix the issue with riflexrope being a piece of shit that likes to loop
Anonymous No.105708218 [Report] >>105708220
>>105708183
4o is still ranked above gpt-image. this is like saying base flux beats dall-e kek.
Anonymous No.105708220 [Report]
>>105708218
>4o is still ranked above gpt-image.
how? it changes the image (instead of editing it like kontext pro) and that piss filter is awful
Anonymous No.105708224 [Report]
>>105705748
>https://github.com/comfyanonymous/ComfyUI/pull/8669
how can we make multiple image input comfy? your workflow doesn't have that
Anonymous No.105708225 [Report]
Fresh

>>105708222
>>105708222
>>105708222

Fresh