/ldg/ - Local Diffusion General - /g/ (#105669256) [Archived: 801 hours ago]

Anonymous
6/22/2025, 10:58:04 AM No.105669256
highlights_g_105667276_1750581775_thumb.jpg
highlights_g_105667276_1750581775_thumb.jpg
md5: ef708f8e0ba868cd921a4fd4b7f4c240🔍
Good Luck with the Experimenting Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>105667276

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Models, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Cook
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>Chroma
Training: https://rentry.org/mvu52t46

>WanX (video)
https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Misc
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Archive: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
Local Model Meta: https://rentry.org/localmodelsmeta

>Neighbors
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Replies: >>105669353 >>105670099 >>105670118 >>105673344
Anonymous
6/22/2025, 11:03:32 AM No.105669291
You're not entitled to my frenship.
Replies: >>105669329
Anonymous
6/22/2025, 11:08:14 AM No.105669329
>>105669291
:(
Anonymous
6/22/2025, 11:10:43 AM No.105669350
ComfyUI_temp_zybdi_00051_
ComfyUI_temp_zybdi_00051_
md5: 940cdc3985f1e07c7350a44a2dd9c67d🔍
Replies: >>105669361
Anonymous
6/22/2025, 11:11:00 AM No.105669353
IMG_2670
IMG_2670
md5: b5e6dd6a487a59cf7dd180f665b8cbb2🔍
>>105669256 (OP)
neighbors: >>>/vp/napt/

:3
Anonymous
6/22/2025, 11:11:58 AM No.105669361
>>105669350
>High CFG - The animation
Replies: >>105669386
Anonymous
6/22/2025, 11:14:06 AM No.105669373
Is a trigger word tag necessary for a lora? Isn't SD supposed to automatically apply the Lora if you... use the Lora?

I don't get why you have to train a lora with one extra tag.
Replies: >>105669420 >>105669535 >>105669761
Anonymous
6/22/2025, 11:15:28 AM No.105669386
ComfyUI_temp_zybdi_00064_
ComfyUI_temp_zybdi_00064_
md5: 26e865cfe369a7c3bb39e29181917a06🔍
>>105669361
No, it's the shader noise sampler. Actual mental illness generator.
Anonymous
6/22/2025, 11:22:57 AM No.105669420
>>105669373
Shit like "pu55ytw3rk1n" is not necessary and utterly retarded, yes. Models learn by association and tag frequency in the set. If you're training a person LoRA, you always want their name in the caption for every image they're in, and then it learns to associate said person in the images with that name/tag. For styles, you literally just describe the scene in tags/natural language, depending on the model. You can add stuff like "pixar style, 3d" if it's a Pixar LoRA to reinforce the style though, but you still want to caption it just like you'd actually prompt it
Replies: >>105669429 >>105669441
Anonymous
6/22/2025, 11:25:21 AM No.105669429
>>105669420
>not necessary
It's my lucky charm, okay? I don't feel good unless I put it in.
Anonymous
6/22/2025, 11:25:39 AM No.105669431
>>105669270
/LDG/ ladies & gentlemen
Anonymous
6/22/2025, 11:29:15 AM No.105669441
>>105669420
But isn't stupid to use the same tag for all of the images if you are copying an artstyle? You are going to use the Lora for that artstyle, why would you tag it as if you wanted it out of the Lora if you would not tag it.
Replies: >>105669447 >>105669493 >>105669923
Anonymous
6/22/2025, 11:30:15 AM No.105669447
>>105669441
The Lora learns what you don't tag as a core trait for the Lora. Or so I'm told.
Anonymous
6/22/2025, 11:34:03 AM No.105669466
00070-3757744791
00070-3757744791
md5: c3d4a80b1deb576a154a286b271d2443🔍
>>105669315
gooning is vital daily exercise.
Anonymous
6/22/2025, 11:40:32 AM No.105669493
>>105669441
I made a 3D style LoRA based on a videogame, then trained it for illustrious. I only used normal tags, no same tag for the style (game cg, 3d). The end result was weak. Then I trained it again with those two tags, and it was a LOT stronger. For styles, I'd say train two loras, one with 'trigger' style tags and one without and see what works best. Most image models don't take long to train on
Replies: >>105669923
Anonymous
6/22/2025, 11:48:10 AM No.105669535
>>105669373
I remember reading that you want to attach and associate styles to tags and words and concepts that the model already knows. So if you're training an oil painting lora, you wouldn't use the artists name, you'd use 'oil painting', and it'd sort of replace the generalized knowledge and images it was trained on for 'oil painting' with the new style you teach it, making the effect stronger than if you used no trigger word at all

Is it true? I have no idea. There's a lot of misinfo and misinterpretations when it comes to training loras

Either way, I don't see how it could hurt to have them though, unless someone has evidence to the contrary
Replies: >>105669599
Anonymous
6/22/2025, 11:52:28 AM No.105669557
00076-2368627032
00076-2368627032
md5: 568fd05fbad829dae6162f7ac97ff255🔍
Anonymous
6/22/2025, 11:53:43 AM No.105669564
cursed thread of hostility
Replies: >>105669577
Anonymous
6/22/2025, 11:55:19 AM No.105669577
>>105669564
Fear not friend, I'm here now!
Anonymous
6/22/2025, 11:58:04 AM No.105669599
>>105669535
yeah that's how it works. if you're training a lora and 3d stuff without the 3d tag you're training the 3d style from scratch. with the 3d tag you're training what the model already knows to be like what you want. you're basically telling it "this is what 1girl, 3d, hatsune miku, huge breasts, sagging breasts, futanari, huge cock, veiny cock, huge testicles, sagging testicles, excessive pubic hair, armpit hair, armpit hair peek, projectile cum, projectile lactation very sweaty, shiny skin, smell, steaming body, scat, cat" looks like now
Replies: >>105669605
Anonymous
6/22/2025, 12:00:11 PM No.105669605
1540283453588
1540283453588
md5: 499456078d5134ca2c6dbf0c2908c0f8🔍
>>105669599
Anonymous
6/22/2025, 12:01:09 PM No.105669612
1505741832521
1505741832521
md5: e45353b778ffb4f1d14391e092b3c026🔍
Replies: >>105673373
Anonymous
6/22/2025, 12:20:43 PM No.105669706
so ermmmm what happened to that flux release comfykek was hype-raising about?
Replies: >>105669764
Anonymous
6/22/2025, 12:20:58 PM No.105669707
217
217
md5: cf6497493e3dd2dee870246476b5aaa7🔍
I've seen this slapped as a note in several workflow. Do multiples of 2 of these also work fine?
Replies: >>105669760 >>105669781
Anonymous
6/22/2025, 12:26:35 PM No.105669735
Blessed thread of frenship
Replies: >>105669852
Anonymous
6/22/2025, 12:29:44 PM No.105669755
it cant be blessed if my beautiful creations are never in the collage but these disgusting pics are
Anonymous
6/22/2025, 12:30:31 PM No.105669760
>>105669707
The goal is get an image with roughly 1 million pixels total in it, because that's what sdxl was trained on. With some optimizations and side nodes like kohya's upscale you could get away with 1.5M or even 2M pixel base resolution images. Flux also can handle 1.5M just fine on its own too, not sure about going higher than that.
Anonymous
6/22/2025, 12:30:45 PM No.105669761
>>105669373
It's not strictly necessary, but it helps a lot with training. The model needs to know what it's learning. Without a main keyword, the model won't have a main concept to associate its new knowledge with and will take a longer time to learn or might not even learn it at all. Some people compensate for this by using very high learning rates, which would easily lead to overcooking and poor generalization. However, if you use the same main keyword too often, it will also lead to overcooking.
Anonymous
6/22/2025, 12:30:55 PM No.105669764
>>105669706
cumfy org has an deal for early exclusive access to the pro model through api nodes once the dev model is out
that's why they have a financial interest in shilling it
Anonymous
6/22/2025, 12:32:37 PM No.105669772
why no Loras to make video game HUDs? all i find is a shitty dark souls health bar. sad
Anonymous
6/22/2025, 12:34:15 PM No.105669781
>>105669707
Do you mean like, say, 2048x2048? If so, no, you want to do an upscaling process where you decode the image, upscale it with an upscale model, then encode it and use it as the latent for a separate KSampler process with low denoise (Ultimate SD Upscale consolidates this and has tiling options, though tiling is best avoided if you have VRAM)

Optionally use a Controlnet with the upscaled image into the second KSampler, this will make the output more consistent to the original since inference can change details

The reason those resolutions are listed is because those are the resolutions trained at. It can handle a little variation, but if you go too high you'll start seeing weird aberrations, humans merged into second humans with excessive features and limbs, things like that.
Replies: >>105669791 >>105670185
Anonymous
6/22/2025, 12:35:52 PM No.105669791
>>105669781
*i mean you want to use the output of the first KSampler this way, to be clear
Anonymous
6/22/2025, 12:44:03 PM No.105669834
why does batch generating fucks with regional prompter in Forge? such a strange phenomenon
Anonymous
6/22/2025, 12:47:56 PM No.105669852
>>105669735
My dude!
Anonymous
6/22/2025, 12:48:20 PM No.105669855
1735941793121036
1735941793121036
md5: 6c428ca989f74f9cb1506b1d40394270🔍
I'm doing the /ldg/ Comfy I2V 480p workflow from the wan rentry. Is it normal, that without offloading (unless there is offloading by default) it takes up 43 GB of my RAM?
I got the same error as this guy https://www.reddit.com/r/comfyui/comments/1lgxzym/last_dimension_must_be_contiguous/ and had to select CPU for the CLIPLoader. Could this be why so much RAM is used? I thought, or hoped, after the initial CLIPLoader node only the GPU would be used for the rest
Replies: >>105669896
Anonymous
6/22/2025, 12:53:58 PM No.105669886
>>105668302
noice
Anonymous
6/22/2025, 12:56:00 PM No.105669896
>>105669855
offloading is enabled by default
if you have 24gb vram just replace the distorch node with plain unet load gguf
Replies: >>105669937
Anonymous
6/22/2025, 12:58:42 PM No.105669909
>>105668302
Do Kat too please
Anonymous
6/22/2025, 1:03:24 PM No.105669923
>>105669493
>>105669441
for art styles i usually leave the captions completely empty lmao.
granted this is semi retarded because it takes on certain aspects like eye colors if for instance you have an overwhelming amount of red eyes in the dataset. best to tag "red eyes" then, so it doesn't learn that as part of the lora.
Anonymous
6/22/2025, 1:06:33 PM No.105669937
>>105669896
Nah I'm a 16GBlet, so I guess I need offloading. I mean I was prepared for it but I didn't think it would need that much
Anonymous
6/22/2025, 1:32:12 PM No.105670099
1744820149424066
1744820149424066
md5: 31e44abb5c251b42e33706e955233401🔍
>>105669256 (OP)
I made a tag explorer with Illustrious gens for all face/hair/style/composition tags as well as 18000+ artist tags. Could be useful.

https://tagexplorer.github.io/
Replies: >>105670126 >>105670226 >>105670402 >>105670424 >>105670608 >>105670870 >>105671323 >>105671804 >>105672062
Anonymous
6/22/2025, 1:37:26 PM No.105670118
file
file
md5: 45c513926a6e1b6ed96a7c4382f2ad67🔍
>>105669256 (OP)
anyone able to prompt this without img2img i tried for like half hour but couldnt kek
Replies: >>105670870
Anonymous
6/22/2025, 1:39:13 PM No.105670126
>>105670099
thats neat bro, I was looking for something like this.
Anonymous
6/22/2025, 1:45:47 PM No.105670161
what are the next things to get excited for?
-chroma v50+ unfurried fixed anatomy version
-kontext dev finetuned unslopped
-some new sage attention extra turbo meme edition
anything i'm missing?
Replies: >>105670238 >>105671142 >>105671200
Anonymous
6/22/2025, 1:50:38 PM No.105670185
ogQSCiKcuXk
ogQSCiKcuXk
md5: 31a2a85720ebe067a43e3efc885897b9🔍
>>105669781
>Ultimate SD Upscale consolidates this and has tiling options, though tiling is best avoided if you have VRAM)
Oh found that. Also do I run SEGS for hands and faces before or after upscale?
Anonymous
6/22/2025, 2:00:41 PM No.105670226
>>105670099
Cool, bookmarked
Anonymous
6/22/2025, 2:03:11 PM No.105670238
>>105670161
netani lumina
illust vpred 3.5
illust lumina
Wan 3
Replies: >>105670992
Anonymous
6/22/2025, 2:31:35 PM No.105670402
>>105670099
that is awesome... thanks and never delete pls
Anonymous
6/22/2025, 2:34:18 PM No.105670414
What do you use your gens for?
Are you working on some project or generating just for the heck of it
Replies: >>105670431 >>105670447
Anonymous
6/22/2025, 2:35:40 PM No.105670424
>>105670099
Based beyond belief, fucking hate having to lurk through extracted booru tags without image examples
Anonymous
6/22/2025, 2:36:13 PM No.105670431
>>105670414
Both. For work and for shits and giggles
Anonymous
6/22/2025, 2:39:28 PM No.105670447
Strip 1
Strip 1
md5: 8556b755cf9950dea04bfc90e2380f1f🔍
So I guess it is segs after upscale. I don't want to prompt so it doesn't change their expressions randomly baka.
>>105670414
Prompt goonstuff for pixiv>goon>use postnut clarity to work on sfw art
20Loras
6/22/2025, 2:47:43 PM No.105670481
I want to give image2video a try.
If I already have forge up and running without issues, do I have to mess about with pytorch? I don't want to brick my usual genning.
Replies: >>105670532
Anonymous
6/22/2025, 2:56:20 PM No.105670532
>>105670481
install comfyUI
Replies: >>105670545
Anonymous
6/22/2025, 2:58:18 PM No.105670540
drank002_thumb.jpg
drank002_thumb.jpg
md5: 262777fb16bc30ecef8f1d1c496615f8🔍
Replies: >>105670561 >>105670621 >>105673337
20Loras
6/22/2025, 2:59:00 PM No.105670545
>>105670532
Sure, I know WAN requires that. But does the pytorch version being higher than what forge needs by default, ruin it for me? Right now it's at 2.3 iirc. You can't have multiple pytorch versions running I'm guessing.
Replies: >>105670552 >>105670726
Anonymous
6/22/2025, 3:00:05 PM No.105670552
>>105670545
why bother? just use comfyUI for both, whatever the forge crap can, comfyUI can do better.
Replies: >>105670682
Anonymous
6/22/2025, 3:01:33 PM No.105670561
>>105670540
The beer starts spilling way after the beer reaches the edge looks kinda silly
Anonymous
6/22/2025, 3:07:26 PM No.105670608
WVI2V_CC_INT_294375696559235_00002_thumb.jpg
WVI2V_CC_INT_294375696559235_00002_thumb.jpg
md5: 069800952cb3ae50436cf242a6b1472f🔍
>>105670099
very nice
Replies: >>105673351 >>105673364
Anonymous
6/22/2025, 3:07:41 PM No.105670609
WVI2V_CC_INT_178517362021639_00001_thumb.jpg
WVI2V_CC_INT_178517362021639_00001_thumb.jpg
md5: e03b46365b4e76dae620e867caeea156🔍
Anonymous
6/22/2025, 3:08:45 PM No.105670621
>>105670540
She has a drinking problem
Anonymous
6/22/2025, 3:09:06 PM No.105670624
WVI2V_CC_INT_365179606385175_00001_thumb.jpg
WVI2V_CC_INT_365179606385175_00001_thumb.jpg
md5: aae0d4962f8f3486c163f770bebe3443🔍
20Loras
6/22/2025, 3:18:57 PM No.105670682
35bb8aae2ae1d6f02afe337895bfe440
35bb8aae2ae1d6f02afe337895bfe440
md5: 35bb8aae2ae1d6f02afe337895bfe440🔍
VROOM

>>105670552
Because forge is comfy, comfyui is not.
Doesn't comfyui have a really convoluted way to tile upscale? Everything is just extra steps, not comfy.
Replies: >>105670793 >>105671304
Anonymous
6/22/2025, 3:20:41 PM No.105670692
1480061930691
1480061930691
md5: 4d44dd044a59b9e5aace755f6cf52dc6🔍
How do I improve SEGS detection of hands that are fists?
Replies: >>105670696 >>105670705 >>105670794 >>105670870
Anonymous
6/22/2025, 3:21:27 PM No.105670696
>>105670692
wash your mouth out
Anonymous
6/22/2025, 3:22:05 PM No.105670705
>>105670692
Test other ultralytics detectors from Civitai trained on anime, test a SAM2 setup, test a Florence2 setup.
Anonymous
6/22/2025, 3:24:33 PM No.105670726
>>105670545
i am seconding this question. i also use forge and don't want to break anything as i'm technologically inept. if you find any good answer that doesn't involve quitting forge, i'm interested
Replies: >>105670793
Anonymous
6/22/2025, 3:33:34 PM No.105670793
>>105670682
it's just a different way. a bit ironic that having a zoomable UI on PC is what's giving zoomers conniptions. that, and a total unfamiliarity with package management.
>>105670726
just so you know, once you get comfortable with this part, you'll be past the hurdle that filters 90% of comfy complainers. setup conda/venv, install dependencies, start service. it's basically the same for every manual install section of every python project across github.
Replies: >>105671349 >>105672544
Anonymous
6/22/2025, 3:33:34 PM No.105670794
>>105670692
Just mask them manually, it takes no time at all and you don't need to be precise.
Anonymous
6/22/2025, 3:35:00 PM No.105670802
I'm getting ~12-15% speed improvement on Wan when using sage attention (in Comfy), is that what you'd expect ?

I'm testing it on a 3060 ti.
Replies: >>105670809
Anonymous
6/22/2025, 3:36:13 PM No.105670809
>>105670802
I don't remember, it's been a while and it was a different GPU but I recall it being significant
Replies: >>105670819
Anonymous
6/22/2025, 3:37:32 PM No.105670815
is there a way to use ai to edit a video? i want to get rid of weird artifacts but i dont want to edit every frame. i want to select the area with the artifacts, have it automatically track it and fix it
Replies: >>105670962
Anonymous
6/22/2025, 3:38:05 PM No.105670819
>>105670809
Hmmm... I've just installed sage attention and launched comfy with --use-sage-attention, do I need to change something in my workflow ?
Replies: >>105670827
Anonymous
6/22/2025, 3:39:07 PM No.105670827
>>105670819
on native no, on kijai's wrapper it won't work if using the flag
Replies: >>105670873
Anonymous
6/22/2025, 3:45:08 PM No.105670870
chroma-unlocked-v38-detail-calibrated-Q8-2025-06_040719-gen-146854730493866-euler-sigmoid_offset-00416
>>105670118
throw it into joy caption and go from there.
>>105670099
neat.
>>105670692
manual is the way. no detection is foolproof
Anonymous
6/22/2025, 3:45:42 PM No.105670873
>>105670827
Ok, thanks, maybe that's all you get on a 3060, I am testing at 512x512, perhaps it gives larger gains at higher resolutions ?
Replies: >>105670898
Anonymous
6/22/2025, 3:46:52 PM No.105670885
what was the name of that song generator model lads?
Replies: >>105670891
Anonymous
6/22/2025, 3:47:17 PM No.105670891
>>105670885
Ace-Step ?
Anonymous
6/22/2025, 3:48:22 PM No.105670898
>>105670873
stretching my memory, I think I was looking at 15 minutes for 40 frames of 480 on fp8
and that was the best I could get out of it, anything higher was exponentially longer, like extra 16 frames an extra 10 minutes
this was just with teacache, if you don't have that get it
Replies: >>105670906
Anonymous
6/22/2025, 3:49:54 PM No.105670906
>>105670898
I was avoiding teacache because I heard it had a noticeable degradation on the results, but I should probably give it a go
Anonymous
6/22/2025, 3:53:12 PM No.105670937
Did anyone find a way around the lightx2v making the motion slower in gens? I know there's the RIFLEx workaround, but it makes my gen times longer and I'm inpatient.

I've tried the :2 strength trick in the prompt, and it seems to burn the frames.
Replies: >>105671014
Anonymous
6/22/2025, 3:56:19 PM No.105670962
elf-magic-water
elf-magic-water
md5: 1f1bf327ce2ef86d360dce3fcdb8657b🔍
>>105670815
You mean inpainting a video? I don't think it is possible. Or at least I've not seen any discussion of it around here.
Anonymous
6/22/2025, 3:59:40 PM No.105670992
>>105670238
Wtf is netani?
Anonymous
6/22/2025, 3:59:59 PM No.105670997
What's should I use if I literally just want to creat porn videos featuring my coworkers?
Replies: >>105671021 >>105671025 >>105671039 >>105671043 >>105671046 >>105671129
Anonymous
6/22/2025, 4:01:22 PM No.105671014
>>105670937
you can increase the frame rate but that makes the video shorter obviously. maybe we need a motion lora or some shit.
Anonymous
6/22/2025, 4:02:31 PM No.105671021
>>105670997
Use a camera.
Anonymous
6/22/2025, 4:02:59 PM No.105671025
>>105670997
A good lawyer
Anonymous
6/22/2025, 4:05:28 PM No.105671039
ComfyUI_00422_ (1)
ComfyUI_00422_ (1)
md5: 5e6cdcda61eae78611a12405bed87fc2🔍
>>105670997

Hypothetically if anyone wanted to do this, there are two routes. Keep in mind this is HYPOTHETICAL because it is EEL LEGAL!!!!

Rookie:
>Get I2V setup
>DL average titty drop/cumshot loras
>Gen and Goon


Level 99 Mafia Boss Gooner:
>Get a bunch of images of this person together
>Crop and Caption them
>Train a lora
>Repeat 1-3 in the rookie setup
Replies: >>105671373
Anonymous
6/22/2025, 4:05:58 PM No.105671043
file
file
md5: 40e19e979bded718879ad8cdfbc93fef🔍
>>105670997
>What's should I use if I literally just want to creat porn videos featuring my coworkers?
Anonymous
6/22/2025, 4:06:16 PM No.105671046
>>105670997
At least 20-30 images of each person, train a lora of them, generate a start image, use Wan img2video with a lora which is trained on the sexual activity you want them to perform.

Do not post the results online, it is illegal unless you have their permission.
Anonymous
6/22/2025, 4:09:40 PM No.105671070
Where do I get celebrity LORAs now?
Replies: >>105671085 >>105671144
Anonymous
6/22/2025, 4:11:25 PM No.105671085
>>105671070
duckduckgo and musubi
Anonymous
6/22/2025, 4:16:37 PM No.105671121
00164-4225490145
00164-4225490145
md5: 6ca086872491b772e4266f4c0a43c685🔍
AI is just like humans sometimes, can't draw a straight line if it goes behind a character
Replies: >>105671178 >>105671233 >>105671326
Anonymous
6/22/2025, 4:16:59 PM No.105671123
retard here,

I'm trying to use image2image in Forge to get a desired basic outline of the image (a person standing behind a desk). The outputs always look way too much like my reference image though (more like what I remember controlnet canny stuff doing). What setting should I be tweaking so that the image2image weighs the prompt more and the reference image less?
Replies: >>105671128 >>105671153
Anonymous
6/22/2025, 4:17:45 PM No.105671128
>>105671123

Have you tried adjusting the denoise?
Replies: >>105671170
Anonymous
6/22/2025, 4:17:51 PM No.105671129
>>105670997
VACE
Anonymous
6/22/2025, 4:19:25 PM No.105671142
>>105670161
>things to get excited for
AMD catching up to Nvidia
Replies: >>105671173 >>105671190
Anonymous
6/22/2025, 4:19:34 PM No.105671144
>>105671070
Train your own or use the MANY already existing, most should be available at places like https://civitaiarchive.com
Anonymous
6/22/2025, 4:20:47 PM No.105671153
>>105671123
try using something like a scribble controlnet instead if you want more variety
Replies: >>105671170
Anonymous
6/22/2025, 4:23:03 PM No.105671170
>>105671128
I have and it kind of works but not to the extent I will need.
>>105671153
Noted. I'll look into that.
Anonymous
6/22/2025, 4:23:22 PM No.105671173
>>105671142
I wish
Anonymous
6/22/2025, 4:23:47 PM No.105671178
chroma-unlocked-v38-detail-calibrated-Q8-2025-06_020402-gen-352661040046828-euler-beta-00322
>>105671121
sdxl does that. it's fun!
Anonymous
6/22/2025, 4:25:10 PM No.105671190
>>105671142
Just 2 more weeks
Anonymous
6/22/2025, 4:26:38 PM No.105671200
>>105670161
>anything i'm missing?
krea I guess, but at this point it's really unlikely it'll happen
Replies: >>105671229
Anonymous
6/22/2025, 4:30:09 PM No.105671229
>>105671200
I think it will release, I doubt the post would have been made unless the decision was at least already 90% made.
Anonymous
6/22/2025, 4:30:43 PM No.105671233
>>105671121
Yes background discontinuities frequently trip up the models. Though I found flux is noticeably better at that (not perfect).
Anonymous
6/22/2025, 4:31:20 PM No.105671240
why am i getting OOM with chroma safetensors on 3090? it's only 17.4gb
Replies: >>105671262 >>105671307
Anonymous
6/22/2025, 4:33:52 PM No.105671262
>>105671240
check nvtop/nvidiasmi and look at the output of the service.
Anonymous
6/22/2025, 4:40:01 PM No.105671304
>>105670682
https://github.com/deepbeepmeep/Wan2GP
people are retarded and don't just give people this
Replies: >>105671357
Anonymous
6/22/2025, 4:40:31 PM No.105671307
>>105671240
try:
>offloading t5 to cpu
>tiled VAE decode
Anonymous
6/22/2025, 4:43:33 PM No.105671323
>>105670099
>no bedroom eyes/half-lidded eyes
ngmi
Replies: >>105671789
Anonymous
6/22/2025, 4:44:05 PM No.105671326
>>105671121
That's actually one of the more inhuman of the common AI mistakes. One of those little things that reminds you the image was made by something that doesn't "see" it as a 3D scene like you do, because even though the problem is objectively subtle, human visual reasoning immediately notices something wrong. A human artist of that skill level could freehand both ends of the staff better than that, but they wouldn't, they'd draw the full staff in the sketch, or they'd use a ruler.
Anonymous
6/22/2025, 4:46:13 PM No.105671349
>>105670793
it's bad software design to expect people to fix their own app all the time which is why comfy needs to die
Replies: >>105672520
Anonymous
6/22/2025, 4:47:51 PM No.105671357
>>105671304
cuz it wasnt using good optimizations and looked like shit, does it even use SLG now?
no reason to use it, if you got 24gb vram, you shouldnt use it and should go for quality, if you got anything below, you shouldnt use it since with comfyui ldg workflow you can finetune optimizations a lot better to get the most out of everything

you gotta learn the basics of what the optimization do either way, might as well just do it through the main workflow then instead of cope
Replies: >>105671365
Anonymous
6/22/2025, 4:48:49 PM No.105671365
file
file
md5: 4ebc3157d87af2310c7bf4ae0a5e48ce🔍
>>105671357
you are actually fucking retarded
Replies: >>105671400
Anonymous
6/22/2025, 4:50:10 PM No.105671373
>>105671039
Why the fuck can't I just make a video to jack off?
What's next? Needing their permission to have a wank?
Replies: >>105671407 >>105671420 >>105671694
Anonymous
6/22/2025, 4:54:42 PM No.105671400
>>105671365
fp16 accumulation?
sageattention?
torch compile?
latest pytorch?
can you easily swap quants of wan and clip?
virtual vram settings?
does it support all the different things you can do with vace? easy looping videos?
will it implement sageattention2++ when it comes out soon as fast as comfy which will give a big boost?
just no point, its not like current video tools are like images where you need to mask things all the time which is better done with a mouse in a big UI, you just quickly set up ldg wan workflow and thats it, then just select lora and prompt away until something big drops when you update the workflow
Replies: >>105671459
Anonymous
6/22/2025, 4:55:42 PM No.105671407
Commander-in-chief
Commander-in-chief
md5: 830482e7086d83586688ecf2b5fa0334🔍
>>105671373
>Why the fuck can't I just make a video to jack off?
You have to ask that to your Commander-in-chief Sergeant Johnson.
Anonymous
6/22/2025, 4:56:58 PM No.105671420
>>105671373
Because to get the resources you have to basically stalk them, retard
Replies: >>105671477 >>105671484
Anonymous
6/22/2025, 5:00:01 PM No.105671447
ComfyUI_temp_fbfsq_00337cen_
ComfyUI_temp_fbfsq_00337cen_
md5: f426506bf9c00edb1c06d4dc21fd29a4🔍
Anonymous
6/22/2025, 5:00:58 PM No.105671459
>>105671400
fp16 acc is in, safe attention is a requirement retard, pytorch version matters very little, torch compile memory leaks, it automatically selects the quant, don't really care about virtual vram, yes there is a whole vace interface, looping videos don't really loop properly so no UI can really do it unless you are talking about ping pong which looks like shit. why not ask the author about sage++. just no point playing around with noodle garbage instead of just having what you want to see right in front of you. it's actually easier to extend the clips with new input images than comfy. you can be a noodle faggot but it doesn't make you better than anybody. stop sniffing your own farts. not to mention I think a lot of people are comfy fatigued
Replies: >>105671536
Anonymous
6/22/2025, 5:01:13 PM No.105671461
>still no sage2 update
>i weep
Replies: >>105671471
Anonymous
6/22/2025, 5:02:23 PM No.105671471
oopsies
oopsies
md5: b8ab45f35e7e78bd1748200c669a37ba🔍
>>105671461
They sayd "around 20th of june" they didn't specify the year
Anonymous
6/22/2025, 5:03:10 PM No.105671477
ComfyUI_00418_ (1)
ComfyUI_00418_ (1)
md5: 44ad712b410ff9c2f21faa5048f8f1c9🔍
>>105671420

If you don't cyberstalk your crush, can you even say you truly love her?
Replies: >>105671484
Anonymous
6/22/2025, 5:04:05 PM No.105671484
>>105671420
If you post pictures of yourself on social media they are free to use for everyone.

>>105671477
based
Anonymous
6/22/2025, 5:06:59 PM No.105671506
B03854D7
B03854D7
md5: 15e80995f131ae94ac11cf7b65dfcafc🔍
Anonymous
6/22/2025, 5:10:55 PM No.105671536
1747692441866015_thumb.jpg
1747692441866015_thumb.jpg
md5: 486300d94267db2ad1fa1621fd0fb238🔍
>>105671459
>it automatically selects the quant, don't really care about virtual vram
The problem is there is a big difference when going lower than Q8, and for some things surely people want to actually have a high quality output despite needing to wait an hour or two, so offloading like this is a requirement.
>looping videos don't really loop properly so no UI can really do it unless you are talking about ping pong which looks like shit
Picrel
>>105571625

There will definitely be better UI at some point that will dominate but it's hard to compete with the flexibility of comfy given the speed of developments.
Replies: >>105671632 >>105671633
Anonymous
6/22/2025, 5:14:31 PM No.105671560
>>>105663104
>chroma svdquant
where do i find this though?
Replies: >>105671601 >>105671607
Anonymous
6/22/2025, 5:15:55 PM No.105671574
>>105668859
great gen anon, can you please post the catbox?
Anonymous
6/22/2025, 5:18:36 PM No.105671601
>>105671560
https://huggingface.co/rocca/chroma-nunchaku-test/tree/main
So far only experimental v29, with v38 coming in 2 more hours. Or days. Or weeks https://huggingface.co/rocca/chroma-nunchaku-test/discussions/1#68557c81961b7e57afe5f902
Replies: >>105671607
Anonymous
6/22/2025, 5:19:05 PM No.105671607
>>105671560
https://huggingface.co/rocca/chroma-nunchaku-test
>>105671601
*kicks you in the ass*
Anonymous
6/22/2025, 5:22:35 PM No.105671632
>>105671536
>offloading like this is a requirement.
I make vids in ~70 seconds. you are dramatically overthinking things. if speed is a concern for making videos, just buy a better GPU instead of gobbling snake oil
Replies: >>105671679
Anonymous
6/22/2025, 5:22:36 PM No.105671633
>>105671536
>big titty girl, boob physics like shes on the moon. sloshing bags of water. remove safeties. execute
Replies: >>105671748
Anonymous
6/22/2025, 5:24:10 PM No.105671647
>finally figured out the workflow
>been gooning for 16 hours straight
help when do i get bored of infinite fully customized pornography
Replies: >>105671658 >>105671670 >>105671714
Anonymous
6/22/2025, 5:25:51 PM No.105671658
>>105671647
>fully customized pornography
doesn't exist yet. the model is great but it's still limited what concept motions it knows. like seriously, why is it so obsessed with blowjobs?
Anonymous
6/22/2025, 5:27:33 PM No.105671670
>>105671647
when you realize it's limited by your imagination and tastes, and all your niche fetishes rapidly become dull and boring when you have an unlimited supply
Replies: >>105671680
Anonymous
6/22/2025, 5:28:38 PM No.105671676
Chinkmodded 4090D 48GB... yes, no? I'm really sick of losing the 5090 FE lottery, and honestly, 32GB isn't enough to run the best local video models. It's $3K. What else is there? Drop $4K on a M4 Max 40-core GPU 128GB Mac? That's going to be slow at imagegen and utterly shit at video, right? DGX Spark? Seems like a shitty, expensive cloud-service upsell box to me.
Replies: >>105671716 >>105672217 >>105673118
Anonymous
6/22/2025, 5:28:52 PM No.105671679
>>105671632
>I make vids in ~70 seconds.
Do post those 70s made videos that blow out the colors, have stiff motion and warping the fuck out of anything moving.
>if speed is a concern for making videos, just buy a better GPU instead of gobbling snake oil
You are the one using cope projects instead of having a 24gb card to just go for max quality workflows.
Replies: >>105671765
Anonymous
6/22/2025, 5:28:52 PM No.105671680
>>105671670
reduction is the key. I never go all out and that's a strict rule.
Anonymous
6/22/2025, 5:30:41 PM No.105671694
>>105671373
>Why the fuck can't I just make a video to jack off?
You literally can, the law only has any effect if you generate porn (sexually explicit) images or video of REAL PEOPLE and then POST THEM ONLINE without their permission.

If you don't do either of these things, you can wank off as much as you want.
Anonymous
6/22/2025, 5:33:14 PM No.105671714
>>105671647
post workflow
Anonymous
6/22/2025, 5:33:46 PM No.105671716
>>105671676
https://www.newegg.com/amd-100-300000075-radeon-pro-w7800-32gb-graphics-card/p/N82E16814105115?Item=9SIA24GKAP2001
this has drawbacks but it's a better deal on VRAM and can actually be purchased
Replies: >>105671811 >>105671925 >>105671933 >>105671954
Anonymous
6/22/2025, 5:37:32 PM No.105671748
file
file
md5: 12521b510066f22805bebbe620ab5c7d🔍
>>105671633
checked
also
>pic related
Anonymous
6/22/2025, 5:40:02 PM No.105671765
>>105671679
>max quality workflows.
that would be without the snake oils. pretty much raw fp16. do you not pay attention to what people say around here?
Replies: >>105671780
Anonymous
6/22/2025, 5:40:47 PM No.105671780
>>105671765
>no 70s made video posted
yawn
Replies: >>105671850
Anonymous
6/22/2025, 5:41:18 PM No.105671789
>>105671323
Thanks for the heads-up. I added two new groups for eyes and pupils tags.
https://tagexplorer.github.io/#/?tagGroupFilter=eyes
Replies: >>105671850
Anonymous
6/22/2025, 5:42:52 PM No.105671804
1115001-close up photograph of brown hair, curly-cuteMix1
>>105670099
>github
really cool project thankyou anon
Replies: >>105671851 >>105671942 >>105672042
Anonymous
6/22/2025, 5:43:18 PM No.105671811
>>105671716
Drawbacks like: If you even get it to run it will run much slower than the NVidia equivalent.

There is hope now that AMD actually (after 3 fucking years) are pulling their heads out of their asses and is starting to work on strong AI support, but it's not even close yet, so you'd be an idiot to buy an AMD GPU for AI use today.
Replies: >>105671969
Anonymous
6/22/2025, 5:47:01 PM No.105671850
AniStudio_InterOpTest-00015_thumb.jpg
AniStudio_InterOpTest-00015_thumb.jpg
md5: b16fc671a0316524672c96ceb38a3ce9🔍
>>105671780
i have one here that took 73 seconds. I just run a script in anistudio to send requests to the wan2gp backend. no quants since I have a 4090. dunno if he added magcache yet but I've been too busy.

>>105671789
this is really helpful anon! thanks!
Replies: >>105671870 >>105671879 >>105671923 >>105671930
Anonymous
6/22/2025, 5:47:01 PM No.105671851
>>105671804
F U C K O F F
Replies: >>105671861 >>105673466
Anonymous
6/22/2025, 5:48:11 PM No.105671861
>>105671851
someones moody
Replies: >>105671879
Anonymous
6/22/2025, 5:50:23 PM No.105671870
1747877166421495
1747877166421495
md5: 84d66afb9a3f16b3635d96cea273b3d8🔍
>>105671850
>mostly static sketch of 10 different colors in total of a simply sketched cartoony anime girl that barely moves and when she does picrel happens
ah, so this is the power of 70s generated videos... i now truly see
Replies: >>105671884
Anonymous
6/22/2025, 5:50:46 PM No.105671875
based ani proving my point
Anonymous
6/22/2025, 5:51:36 PM No.105671879
>>105671861
I do not accept the pedo creep. end of story.
>>105671850
vnice!
Replies: >>105671884
Anonymous
6/22/2025, 5:52:27 PM No.105671884
AniStudio_InterOpTest-00032_thumb.jpg
AniStudio_InterOpTest-00032_thumb.jpg
md5: 36699254434e35c6c2358e0cdc79a996🔍
>>105671870
that is teacache doing that and it's been in pretty much every video that moves a little quickly. wan really is made for 3dpd too

>>105671879
tyvm
Replies: >>105671929
Anonymous
6/22/2025, 5:57:16 PM No.105671923
>>105671850
Pedo
Anonymous
6/22/2025, 5:57:20 PM No.105671924
SNAKE OILED
Anonymous
6/22/2025, 5:57:23 PM No.105671925
>>105671716
32GB is not enough, most big video models give 40/48GB as the minimum, plus whatever small discount there is on the hardware you pay for it in wasted time trying to get shit to work with AMD.
No thank you.
Replies: >>105671969
Anonymous
6/22/2025, 5:58:16 PM No.105671929
>>105671884
the cope trannies have to tell themselves, lol
Anonymous
6/22/2025, 5:58:19 PM No.105671930
>>105671850
does your implementation support loras? does it support sage attention? does it support torch compile?
genuinely curious.
how much space does the venv take up in total?
does it support kijai models?
Replies: >>105671944
Anonymous
6/22/2025, 5:58:29 PM No.105671933
>>105671716
>2300 dollars for a 32gb vram card
might aswell buy 2x3090 with that money and get 48 gb of vram total
Replies: >>105671969 >>105671975
Anonymous
6/22/2025, 5:59:35 PM No.105671942
good shit WanVideoWrapper_I2V_00007_thumb.jpg
good shit WanVideoWrapper_I2V_00007_thumb.jpg
md5: 01b10a09c39e4f219e2c107367fdacc0🔍
>>105671804
Replies: >>105671950 >>105672042 >>105672063
Anonymous
6/22/2025, 5:59:46 PM No.105671944
>>105671930
>does your implementation support loras
it's literally a script that just runs the backend. just read what the author has to say. I use the kijai distill yeah

https://github.com/deepbeepmeep/Wan2GP
Anonymous
6/22/2025, 6:00:39 PM No.105671950
>>105671942
was rife used for interpolation here instead of film vfi?
Replies: >>105671968
Anonymous
6/22/2025, 6:01:03 PM No.105671954
>>105671716
ok but how big is it..
tell me the size, i need to know
my comfyui venv takes up 6.2GiB
Anonymous
6/22/2025, 6:01:21 PM No.105671957
Wan trolling me with nudity:
https://files.catbox.moe/le11vb.mp4

scrubbing nipples from your dataset vs tagging "rating:expicit"... which to choose...
Anonymous
6/22/2025, 6:02:16 PM No.105671968
WanVideoWrapper_I2V_00004_thumb.jpg
WanVideoWrapper_I2V_00004_thumb.jpg
md5: 4f97e56c2780bb8bd6f145cc84da5564🔍
>>105671950
there was no interpolation at all, just self forcing and movement reward lora
Replies: >>105671972 >>105672042
Anonymous
6/22/2025, 6:02:32 PM No.105671969
1726899054921694
1726899054921694
md5: b502cb340c64af03dff946f8859602cb🔍
>>105671811
>it's not even close yet, so you'd be an idiot to buy an AMD GPU for AI use today
it works for me ¯\_(ツ)_/¯

the AMD cards are a good VRAM value if you're on Linux. I use AMD for this reason and because AMD's Linux GPU drivers are open source and much more stable than Nvidia's.

currently, the 7900 XTX is around a 3090 in AI performance using chroma as a benchmark. If we're lucky and FineWine kicks in with newer ROCM versions, this card could get to 4090 perf.

>>105671925
>>105671933
totally fair. just mentioning we have options.
Replies: >>105671986 >>105671989
Anonymous
6/22/2025, 6:03:01 PM No.105671972
>>105671968
so thats why its bad
Replies: >>105671989 >>105672011
Anonymous
6/22/2025, 6:03:20 PM No.105671975
>>105671933
>might aswell buy 2x3090 with that money and get 48 gb of vram total
Well that's my setup right now. I even have the nvlink. The problem is diffuser stuff needs to have the whole model in memory, you really need more memory on a single GPU at the moment.
Replies: >>105671999 >>105672008
Anonymous
6/22/2025, 6:04:02 PM No.105671986
>>105671969
>the 7900 XTX is around a 3090 in AI performance using chroma as a benchmark
don't know if I can believe this
Replies: >>105672023 >>105672081
Anonymous
6/22/2025, 6:04:12 PM No.105671989
>>105671969
have you tried wan? what performance are you getting? do you have a single 7900 xtx?
>>105671972
resolution too, yeah
Replies: >>105672023
Anonymous
6/22/2025, 6:05:13 PM No.105671999
>>105671975
isnt there a multigpu node
Replies: >>105672017
Anonymous
6/22/2025, 6:06:58 PM No.105672008
>>105671975
>The problem is diffuser stuff needs to have the whole model in memory,
you can split the model into multiple gpus if that's a gguf though
https://rentry.org/wan21kjguide
Replies: >>105672039
Anonymous
6/22/2025, 6:07:33 PM No.105672011
ComfyUI_02143__8fbfff_thumb.jpg
ComfyUI_02143__8fbfff_thumb.jpg
md5: f9bbf8a8e5c348c9668bed6e76115f09🔍
>>105671972
Wan was trained at 16 FPS. Either you minterpolate the output in ffmpeg or use cosmos and deal with limbs falling off and characters growing a second head.
Plus, anime shit is like 6 fps, so there's that too. Maybe i2v in wan and then v2v back into cosmos to make it smoother?
Anonymous
6/22/2025, 6:08:16 PM No.105672017
file
file
md5: 826b381ace361e6aeb1835afcfb2744d🔍
>>105671999
I've seen someone say previously that multigpu setups are mostly for loading encoders separately, and that "use other vram" doesn't work properly but no proofs were posted
Replies: >>105672037
Anonymous
6/22/2025, 6:08:55 PM No.105672023
1743521200281394
1743521200281394
md5: b76887d92224b485a69fc95606e6d90e🔍
>>105671986
We were discussing this in an old thread, a 3090 user said he gets ~60 to 70 s per standard chroma gen at 20 steps. I get the same result.

>>105671989
Haven't tried Wan, because local video just doesn't look worth the effort to me yet. Single card. Would be interested to know if any AMD users have been using Wan.
Replies: >>105672090 >>105672139
Anonymous
6/22/2025, 6:09:01 PM No.105672024
What's this whole deal with "detected dubious ownership in repository at"?
Is it because it's installed on an external drive?
Replies: >>105672289
Anonymous
6/22/2025, 6:10:37 PM No.105672037
>>105672017
>"use other vram" doesn't work properly but no proofs were posted
it works, I'm currently splitting the model into 2 of my nvdia gpus
Anonymous
6/22/2025, 6:10:46 PM No.105672039
>>105672008
>https://rentry.org/wan21kjguide
Ah so "To manage VRAM limitations, offload to RAM/CPU using the virtual_vram_gb setting in the UnetLoaderGGUFDisTorchMultiGPU node, though this slows generation and you can only offload so much." OK. Eh, at the moment I just run two comfy instances on different ports and pinned to separate GPUs, and just tandem gen. It's like 30-50% success rate on gens, so more gens is preferable I think.
Anonymous
6/22/2025, 6:11:14 PM No.105672042
>>105671804
>>105671942
>>105671968
kys
Replies: >>105673466
Anonymous
6/22/2025, 6:15:30 PM No.105672062
>>105670099
nice but
>Illustrious 1.1.
why not 2.0 which used the largest dataset out of all versions currently available?
Replies: >>105672083 >>105672112
Anonymous
6/22/2025, 6:15:37 PM No.105672063
1240001-close up photograph of long hair, messy-cuteMix1-15
>>105671942
setting her on fire isnt very nice
Replies: >>105672074 >>105672123 >>105672140 >>105672158 >>105672180
Anonymous
6/22/2025, 6:16:44 PM No.105672074
>>105672063
remember, kys
Replies: >>105673466
Anonymous
6/22/2025, 6:17:33 PM No.105672081
>>105671986
>don't know if I can believe this
I know I don't believe this
Anonymous
6/22/2025, 6:17:40 PM No.105672083
>>105672062
it's all the same booru shit. basically nothing changed
Anonymous
6/22/2025, 6:18:35 PM No.105672090
>>105672023
weird, i get the same speed, albeit with SVDQuant (sadly still at v29 but its good enough)
100%|...| 20/20 [00:48<00:00, 2.44s/it] @100W
100%|...| 20/20 [00:37<00:00, 1.89s/it] @ 170W
t. 3060 vramlet
Replies: >>105672207
Anonymous
6/22/2025, 6:20:59 PM No.105672112
>>105672062
Honestly, I didn't think about it too hard, I just got the general impression that the image gen community overall didn't fully accept 2.0 as the rightful successor to 1.0. For example, WAI v14 chose 1.0 over 2.0 as a base. It probably makes little difference for the site's purpose.
Replies: >>105672145
Anonymous
6/22/2025, 6:21:56 PM No.105672120
Does controlnet, openpose, work with illustrious, noobai? I can't get it to work.
Replies: >>105672151
Anonymous
6/22/2025, 6:22:00 PM No.105672123
>>105672063
fuck off you disgusting piece of shit, you're not welcome here
Replies: >>105673466
Anonymous
6/22/2025, 6:23:40 PM No.105672139
chroma-unlocked-v38-detail-calibrated-Q8-2025-06_181952-gen-816291110242521-res_multistep-sigmoid_offset-00337
>>105672023
it's a bit faster. Prompt executed in 59.40 seconds. 3090@80%, 26 steps, 2.25s/it, sage attention & --fast, the comfy wan build from the rentry basically. sniff.
Replies: >>105672207
Anonymous
6/22/2025, 6:23:41 PM No.105672140
>>105672063
Back to trooncord, sis
Anonymous
6/22/2025, 6:24:09 PM No.105672145
>>105672112
>I just got the general impression that the image gen community overall didn't fully accept 2.0 as the rightful successor to 1.0. For example, WAI v14 chose 1.0 over 2.0 as a base
unfortunate considering its ability to handle larger initial latent sizes and NLP. i thought anon would be all over its NLP ability at least. maybe that wont be fully realized until/if/when 3.5vpred drops...
>It probably makes little difference for the site's purpose.
fair point
Anonymous
6/22/2025, 6:24:52 PM No.105672151
>>105672120
It does, use xinsir's promax ControlNet model. However, the OpenPose preprocessor works poorly with anime input images. I'd recommend using Depth or Canny if your input image is anime-style. If you set it up like in my guide or download the workflows it should work without much fuss. https://rentry.org/comfyui_guide_1girl#controlnet-pose-transfer
Replies: >>105672163
Anonymous
6/22/2025, 6:25:50 PM No.105672158
>>105672063
Uncanny
Anonymous
6/22/2025, 6:26:21 PM No.105672163
>>105672151
Thank you kindly. Haven't used it in a while since the checkpoints are so good at poses these days.
Anonymous
6/22/2025, 6:28:18 PM No.105672180
WanVideoWrapper_I2V_00058_thumb.jpg
WanVideoWrapper_I2V_00058_thumb.jpg
md5: e6c2fe699468d6c815ee31f1888e43c9🔍
>>105672063
cute, would
Replies: >>105672291
Anonymous
6/22/2025, 6:31:28 PM No.105672207
1739940052734975
1739940052734975
md5: 5c0e4ffbf0b0186232928deef4b62cd6🔍
>>105672139
I am not using sage attention or flash attention. Some other AMD anon figured out how to install these, but I haven't figured it out yet. this kind of issue is obviously the big drawback of AMD for the time being.

>>105672090
that makes sense because SVDQuant is a lot faster.
Replies: >>105672284 >>105672309
Anonymous
6/22/2025, 6:32:20 PM No.105672217
1725656622889377
1725656622889377
md5: 336437a3b99de2a7eb0d521b830520c5🔍
>>105671676
>Chinkmodded 4090D 48GB... yes, no?
Up to you but if you're wondering where to get one, I bought mine from
https://www.c2-computer.com/products/new-parallel-nvidia-rtx-4090d-48gb-gddr6-256-bit-gpu-blower-edition
Replies: >>105672257 >>105672330
Anonymous
6/22/2025, 6:36:07 PM No.105672257
>>105672217
Yep that's where I'm intending to purchase mine. Did they pack it well? I'm always nervous of them sending an item like that in a "speed pack" and having it get trashed.
Replies: >>105672395
Anonymous
6/22/2025, 6:36:58 PM No.105672269
I wish I could run Lumina 2 at a reasonable speed :(
Anonymous
6/22/2025, 6:38:40 PM No.105672284
chroma-unlocked-v38-detail-calibrated-Q8-2025-06_183300-gen-274325549277675-dpmpp_2m-sigmoid_offset-00343
>>105672207
60s/gen just kills the vibe. sick shit tho
Replies: >>105672352
Anonymous
6/22/2025, 6:39:32 PM No.105672289
blumfeld-elderly-bachelor-cartoon
blumfeld-elderly-bachelor-cartoon
md5: 2ec1de4a5d48bbf13408e58003483530🔍
>>105672024
> Is it because it's installed on an external drive?
I usually get that when I try to access a repo initially checked out by another user or checked out on another computer and accessed via a samba share. I would think an external drive would be ok as long as it isn't a network drive administered by another machine.
Anonymous
6/22/2025, 6:39:46 PM No.105672291
1737002-close up photograph of long hair, messy-cuteMix1
1737002-close up photograph of long hair, messy-cuteMix1
md5: e9cb943d9ad687909ae875c5d59c86b1🔍
>>105672180
regenned on newer model
Replies: >>105672350
Anonymous
6/22/2025, 6:41:10 PM No.105672304
1748881579887888
1748881579887888
md5: 68ee47d6d2390ed42f3f2c2616a6ab42🔍
Replies: >>105672324
Anonymous
6/22/2025, 6:42:16 PM No.105672309
>>105672207
did they say how they got sage working i didnt have any luck with that flash attention is part of rocm now and uses triton but pytorch also has a triton flash attention thing built in i think they might be the same thing you can enable the torch one with TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
Replies: >>105672352
Anonymous
6/22/2025, 6:44:52 PM No.105672324
>>105672304
>be me
>middle school teacher
>tell class the next assignment is to make a political cartoon since our current topic is world politics
>kid in the back asks if he can use "ai" to make his
>other kids chuckle
>"sure i dont care"
>ff to end of the week
>this is his submission
Replies: >>105672334
Anonymous
6/22/2025, 6:45:42 PM No.105672330
>>105672217
>https://www.c2-computer.com/products/new-parallel-nvidia-rtx-4090d-48gb-gddr6-256-bit-gpu-blower-edition

Interesting, slightly cheaper than a 5090 and 48gb? Is this the best place to buy these? I can only find ebay and aliexpress listings
Replies: >>105672395
Anonymous
6/22/2025, 6:46:01 PM No.105672334
>>105672324
>>this is his submission
he deserves an A+, he nailed that shit
Anonymous
6/22/2025, 6:47:48 PM No.105672350
>>105672291
the new gen is less cute, it looks more slopped and its close to the uncanny valley, the old gen looks cartoonish in comparison and has cute hearts and the old gen has a lewder outfit
Replies: >>105672379
Anonymous
6/22/2025, 6:47:57 PM No.105672352
1722746198007052
1722746198007052
md5: 4f366fbef3ef7e619e687331f0d953fd🔍
>>105672284
for complex prompts and styles, the ROI is worth it. chroma can do in a single 60s gen what SDXL and SD3.5 models can't do in six 10s gens.

>>105672309
no, sadly they didn't explain how.

>you can enable the torch one with TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
I use this command:
>TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 python main.py --use-pytorch-cross-attention --bf16-vae
I have no idea if I'm getting a perf boost from this.
Replies: >>105672416 >>105672586
Anonymous
6/22/2025, 6:51:12 PM No.105672379
>>105672350
its a wip im trying to get analog photography style but getting it working nice requires merging in more realism stuff which is hard to balance kek also the hearts werent prompted so theyre completely random
Anonymous
6/22/2025, 6:52:51 PM No.105672395
>>105672257
It comes with a 12VHPWR adaptor and I forget if it was styrofoam, bubble wrap or something else but there was cushioning between the inner and outer box, I think.
https://litter.catbox.moe/14v4hucayh4elbjh.jpg

>>105672330
>Is this the best place to buy these?
I'm not sure. Some other anon on /lmg/ bought one from there so I did the same thing. There are some listings on ebay that I can see but they ship from china too and they cost slightly more than what I paid. Depends on where you live probably.
Anonymous
6/22/2025, 6:53:12 PM No.105672397
1747005-close up photograph of long hair, messy-cuteMix1-2
Replies: >>105672513
Anonymous
6/22/2025, 6:55:11 PM No.105672416
>>105672352
might be worth trying this to see if you get an perf differences between the rocm implementation and the torch one i think in general theyll both be slower than nvidia flash attention because theyre both using triton https://www.reddit.com/r/LocalLLaMA/comments/1jh0n3q/psa_get_flash_attention_v2_on_amd_7900_gfx1100/
Replies: >>105672586
Anonymous
6/22/2025, 6:56:26 PM No.105672433
AE42C66D
AE42C66D
md5: 8495448def93a9055c1e5b3c9c7c0df2🔍
Replies: >>105672437
Anonymous
6/22/2025, 6:57:02 PM No.105672437
>>105672433
nice
Anonymous
6/22/2025, 6:58:19 PM No.105672448
chroma-unlocked-v38-detail-calibrated-Q8-2025-06_185656-gen-36047594955023-deis-sigmoid_offset-00353
"very pale skin", t. chroma
Anonymous
6/22/2025, 6:58:36 PM No.105672450
1738534726732010
1738534726732010
md5: b7353cbfb1562fc939035999b7ade52e🔍
>tfw Bethesda is too stupid to make Fallout 5 with NCR as the main antagonist and them having their own frank Horrigan that enforces their fascist end stage capitalism with corrupt oligarchs onto the populace of the wasteland.

feels bad
Replies: >>105673027
Anonymous
6/22/2025, 7:01:21 PM No.105672470
should i use
https://github.com/TTPlanetPig/Comfyui_JC2
https://github.com/StartHua/Comfyui_CXH_joy_caption
or something else for joycaption in comfy?
Replies: >>105672521 >>105672542
Anonymous
6/22/2025, 7:02:19 PM No.105672477
72301C58
72301C58
md5: d6f3e4356c2238d86c1f37d6e23d9cb3🔍
Anonymous
6/22/2025, 7:04:42 PM No.105672498
Could you anons post your illustrious training settings? I can't manage to get a goof safetensor. Do you have any tips?
Replies: >>105672531
Anonymous
6/22/2025, 7:06:32 PM No.105672513
WanVideoWrapper_I2V_00001_thumb.jpg
WanVideoWrapper_I2V_00001_thumb.jpg
md5: 0994c8b72b7567bb150973a29c93cdcc🔍
>>105672397
heh
Replies: >>105672581
Anonymous
6/22/2025, 7:07:39 PM No.105672520
>>105671349
It's incredible how many g tards don't see it
Anonymous
6/22/2025, 7:07:43 PM No.105672521
>>105672470
I use this as per recommendation of another friendly anon, works well
https://github.com/silveroxides/joycaption_comfyui
it's a big boy tho. there is a quant floating around on huggingface of the joycaption model but I have no idea how one would use that.
Replies: >>105672539
Anonymous
6/22/2025, 7:08:51 PM No.105672531
>>105672498
I don't know if anyone has finetuned it. I'm waiting for 3.5 before trying.
Anonymous
6/22/2025, 7:09:37 PM No.105672539
>>105672521
thank you anon, but why are you using that repo and not the main repo, that repo is a fork of a fork of the main repo, up to date with the main repo branch
it would probably be safer to use the main repo.. Thank You!!
Anonymous
6/22/2025, 7:10:00 PM No.105672542
>>105672470
https://github.com/EvilBT/ComfyUI_SLK_joy_caption_two/blob/main/readme_us.md works fine with nf4 quant but its requirements.txt doesn't have one additional requirement: pip install timm==1.0.13
Replies: >>105672566
Anonymous
6/22/2025, 7:10:07 PM No.105672544
>>105670793
>having a zoomable UI on PC is what's giving zoomers conniptions.
zoomies have no frame of reference, they dont care in the least
Anonymous
6/22/2025, 7:10:18 PM No.105672548
chroma-v38calibrated-Q8_00744_
chroma-v38calibrated-Q8_00744_
md5: 6e28078fea2943cc1ebc41676d54f0bf🔍
Replies: >>105672603 >>105672902
Anonymous
6/22/2025, 7:12:16 PM No.105672566
>>105672542
Thank You anon!!
Anonymous
6/22/2025, 7:13:48 PM No.105672581
>>105672513
lol nice
Replies: >>105672592
Anonymous
6/22/2025, 7:14:17 PM No.105672586
1745625800243557
1745625800243557
md5: 837c469b4f8dd9476caa9052ea7873ef🔍
>>105672352
>>105672416
from a quick test, there is zero or negligible difference from using TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 and/or main.py --use-pytorch-cross-attention

this kind of thing is where AMD really needs to get their shit together. instead the retards in their AI department are busy releasing "optimized" SDXL and SD35 versions that nobody will fucking use. they should instead be contributing to comfyui and underlying libraries to make sure consumer AI runs fast.
Anonymous
6/22/2025, 7:15:06 PM No.105672592
WanVideoWrapper_I2V_00003_thumb.jpg
WanVideoWrapper_I2V_00003_thumb.jpg
md5: 1bce96e688ce42c9e58917a04cc0fb6f🔍
>>105672581
Anonymous
6/22/2025, 7:15:47 PM No.105672603
>>105672548
you could fool every poster in >>>/b/irl
Anonymous
6/22/2025, 7:21:47 PM No.105672646
collage_1750612864_1
collage_1750612864_1
md5: 5c28b3d2bb6b166338a1251e285c41c1🔍
NetaAniLumina_Alpha examples
Replies: >>105672660 >>105672738 >>105672787 >>105672813 >>105672825
Anonymous
6/22/2025, 7:22:16 PM No.105672649
I often get anime characters when i prompt random gibberish like w0e9mfv-w0me9vf-0awm9v-0awmve with Chroma.
This makes me think Astracuck might have meddled in the dataset, and that the artist names are all like that.
Replies: >>105672700 >>105672713 >>105672739
Anonymous
6/22/2025, 7:23:17 PM No.105672654
ComfyUI_00005_
ComfyUI_00005_
md5: 79a805ed2aef7c339c08c84a0f5c9100🔍
This is what I get with the prompt
"w0e9mfv-w0me9vf-0awm9v-0awmve-a0wmve9-aw0mef9v-m0wae9vf-wa0me9v-e00-e33333333335555555555555"
Replies: >>105672729
Anonymous
6/22/2025, 7:24:02 PM No.105672660
>>105672646
nice.. ungated download?
Replies: >>105672685 >>105672738
Anonymous
6/22/2025, 7:26:49 PM No.105672685
>>105672660
Only the HF repo is gated for some reason. On Civitai it's open https://civitai.com/models/1612109
Replies: >>105672705
Anonymous
6/22/2025, 7:28:47 PM No.105672700
>>105672649
does the ponycuck not have a page where he posts training logs and the captions?
Anonymous
6/22/2025, 7:29:22 PM No.105672705
file
file
md5: 3e975c185c29324f83cd3728d87a74a6🔍
>>105672685
this is why, the civitai model is over a month old
Anonymous
6/22/2025, 7:30:05 PM No.105672713
>>105672649
it's more like the chroma dataset has way more anime images than real photos, so the model is more biased to render an image image if it has no idea what your prompt even mean
Replies: >>105672739
Anonymous
6/22/2025, 7:31:19 PM No.105672729
>>105672654
"98A0S9sudv0 8s9dv09ns8 oiuxouv 0s987dv7": https://files.catbox.moe/u0gi64.png
"sdfsajhgk 79873f 9x8v90 xvxdrve23": https://files.catbox.moe/7tu5jr.png
"ef9dgue9rue 948t9ut as898fg urgu9874g4" (nsfw): https://files.catbox.moe/dxu4am.png

Just mashing my keyboard and it always makes anime esque art. This doesn't make sense unless he hashed all the artist tokens and it's picking up the tokens. And thus explains why the model "is learning them slowly" after 38 epochs.
Replies: >>105672746 >>105672761 >>105672874 >>105672929
Anonymous
6/22/2025, 7:32:10 PM No.105672738
>>105672646
>>105672660
you have to request access and then beg for acceptance on their d server KEK
Replies: >>105672746
Anonymous
6/22/2025, 7:32:16 PM No.105672739
>>105672713
>>105672649
certain keywords also trigger anime in chroma. For example "Japanese"
Anonymous
6/22/2025, 7:32:45 PM No.105672741
AI killing real porn industry is the best part about it
Anonymous
6/22/2025, 7:33:28 PM No.105672746
>>105672729
>1
sovl
>2
cute
>3
would
>>105672738
thats my point, someone ITT should leak their new models on hf
DO IT!!!!
Replies: >>105672792
Anonymous
6/22/2025, 7:35:04 PM No.105672761
>>105672729
>This doesn't make sense unless he hashed all the artist tokens and it's picking up the tokens
FUCK

let's test. do any substrings of these random characters consistently reproduce a single style?
Replies: >>105672840
Anonymous
6/22/2025, 7:36:49 PM No.105672787
>>105672646
So is this the obsolete version from Civitai or the latest one from HF?
Anonymous
6/22/2025, 7:37:18 PM No.105672792
>>105672746
I have access to them and they aren't that different compared to the public version on CivitAI, they tend to have more artifacting. I'd just wait for a newer version to show up, i'm guessing they're trying different training settings (roundnnnn versions)
Replies: >>105672807
Anonymous
6/22/2025, 7:38:35 PM No.105672807
>>105672792
post them on hf or wherever and ill post them on hf for you
Anonymous
6/22/2025, 7:38:55 PM No.105672813
>>105672646
looks way more promising than chroma unironically desu
Anonymous
6/22/2025, 7:40:45 PM No.105672825
>>105672646
why did they go for lumina? what's so special about that model?
Replies: >>105672840 >>105672873
Anonymous
6/22/2025, 7:42:11 PM No.105672840
ComfyUI_00011_
ComfyUI_00011_
md5: 2c3770bf5ec1b163d71093bd66b9f8d3🔍
>>105672761
Yeah, it stays consitent for each string. Basically any sequence of alphanumerical characters that isn't a word will always generate anime with Chroma in my testing. This is exactly what happened with Pony, the artist tokens were turned into random strings.

Here's "dfvdfh8bh7df".

>>105672825
It's a DiT with 2.6b parameters so not too heavy, has a better text decoder and a 16 channel vae. It has way nicer backgrounds compared to sdxl, just better to work with, it's already this good with four epochs.
Replies: >>105672873 >>105672874 >>105672907 >>105672911 >>105672939
Anonymous
6/22/2025, 7:43:12 PM No.105672852
>astra actually cucked the model
Anonymous
6/22/2025, 7:44:50 PM No.105672873
>>105672825
Apache-2.0 license? The tooling https://github.com/Alpha-VLLM/Lumina-Accessory ?
>>105672840
>compared to sdxl
Compare it to SD3.5m but I'll take what I can get honestly with a 16ch and further enhanced prompt understanding.
Replies: >>105673087
Anonymous
6/22/2025, 7:44:50 PM No.105672874
>>105672729
>>105672840
Would you mind posting your findings to some shitty chink spreadsheet website pls
Replies: >>105672915
Anonymous
6/22/2025, 7:47:06 PM No.105672902
>>105672548
prompt?
Anonymous
6/22/2025, 7:47:42 PM No.105672907
>>105672840
It would make sense that there's something since they have "aesthetic 1" to "aesthetic 11" thing
Anonymous
6/22/2025, 7:47:52 PM No.105672911
1742272752932756
1742272752932756
md5: 1d7c48429229c904446054ce29e5e8b8🔍
>>105672840
>>Here's "dfvdfh8bh7df".
you have to test this with different prompts though. eg 1boy playing baseball, 1girl in kitchen, whatever.
so far I am not seeing conclusive evidence that random strings produce a consistent and distinct style when paired with a real prompt.
Replies: >>105672939
Anonymous
6/22/2025, 7:48:10 PM No.105672915
ComfyUI_00013_
ComfyUI_00013_
md5: 4d6a6acaf7b885513da58267e6a5b348🔍
>>105672874
Maybe I'll do more testing and see if I recognize any.
Sometimes it's not anime, but it's definitely "stylized" enough to be some artist.

"ahsdjkvhs0d9v09"
Anonymous
6/22/2025, 7:49:28 PM No.105672929
ComfyUI_00189_
ComfyUI_00189_
md5: d147a868fa7c44d1b1b8641b0828aeb4🔍
>>105672729
>prompt '4n0nd03sntkn0wwh4th4lluc1nat10n1s'
wow, i think you're really onto something here
Anonymous
6/22/2025, 7:49:58 PM No.105672937
big titty kitty
big titty kitty
md5: 4e400e7519c5f7e8ab5120e5fc00e630🔍
I'm a total newfag to this, but tested comfy ui on a 4060 Ti 8gb for a bit, and loved what I got. I'm planning on buying a new pc, what should I focus on in my GPU? Should I look at relative performance, or try to get as much VRAM as possible?
Also a poorfag, so budget is really important.
Asking this because there's cases like 3060 12gb apparently having worse performance than 4060 ti 8gb. Would this also hold true for image generation?
https://www.techpowerup.com/gpu-specs/geforce-rtx-4060-ti-8-gb.c3890
https://www.techpowerup.com/gpu-specs/geforce-rtx-3060-12-gb.c3682
I don't care about gaming on that pc btw, I only want the best performance and speed on image generation.
Replies: >>105672971 >>105672984 >>105672987 >>105673017 >>105673029 >>105673095
Anonymous
6/22/2025, 7:50:21 PM No.105672939
AniStudio-01250
AniStudio-01250
md5: 3bc29e812eb424e4e3255db6b5a32807🔍
>>105672840
oof

>>105672911
I agree. a grid would be better suited for this
Anonymous
6/22/2025, 7:53:46 PM No.105672971
>>105672937
vram vram vram
Anonymous
6/22/2025, 7:55:00 PM No.105672984
>>105672937
Anything less than 24gb and you will regret it, so 3090/4090 or a 5090 even better
if you are a poorfag, then just save more for one of those three, rather than getting a 12/16gb now and complaining later about shit results and being out of memory when you will do video stuff too
Anonymous
6/22/2025, 7:55:07 PM No.105672987
>>105672937
3060 12gb is infinitely better than any <12gb vram gpu
but maybe you should get a 4060 ti 16gb, maybe 5060 ti 16gb
depends on the prices
if you're feeling lucky check out intel and amd too they're cheaper and have fairer amounts of vram but keep in mind many optimizations are nvidia only
Replies: >>105673187
Anonymous
6/22/2025, 7:55:15 PM No.105672988
Can you give me feedback for my training settings?
https://pastebin.com/pdnQG4fj
Replies: >>105673155
Anonymous
6/22/2025, 7:57:52 PM No.105673012
2325001-You are an assistant designed to generat-Lumina2nietaaniluminaAlpha_round7Ep4S68
Anonymous
6/22/2025, 7:59:04 PM No.105673017
>>105672937
if you're not going for fast WAN or big LLMs, you'd still need around ex- 20GB for chroma. if you can get a 40/50 series 16GB used or for MSRP, you could run quants at full speed quickly, but I'd personally prefer a 3090 over either.
also, prompt?
Replies: >>105673187
Anonymous
6/22/2025, 7:59:36 PM No.105673027
>>105672450
looks like a gay man drawed this
Replies: >>105673086
Anonymous
6/22/2025, 7:59:52 PM No.105673029
>>105672937
>cases like 3060 12gb apparently having worse performance than 4060 ti 8gb
and which are those cases exactly? for ai you need 1. vram size 2. memory bandwidth

used 3090
Replies: >>105673187
Anonymous
6/22/2025, 8:03:14 PM No.105673058
Been a long time since I've proompted anything, has speed on 4gb lol vramlet machines improved? last time i messed around with this stuff auto1111 was still king
Replies: >>105673094 >>105673188
Anonymous
6/22/2025, 8:05:40 PM No.105673071
ComfyUI_00019_
ComfyUI_00019_
md5: 9b51a379f0759c5ec8d0a44db7e25506🔍
8273987298735
Anonymous
6/22/2025, 8:05:57 PM No.105673074
is chroma worth the time in a 3060?
Can it do more not realistic styles? Not anime but something more 3d-2.5d
Replies: >>105673094
Anonymous
6/22/2025, 8:06:48 PM No.105673086
>>105673027
>drawed
Replies: >>105673129
Anonymous
6/22/2025, 8:06:58 PM No.105673087
>>105672873
Lumina's stated Apache 2.0 license shouldn't be taken seriously given
1) requires Gemma 2 which has a very obviously incompatible license that makes reference to a Google URL for the terms and has extensive use restrictions, including use for generating sexually explicit content
2) the model weights were necessarily trained on Gemma which makes the model a "Model Derivative". The terms related to this are not compatible with Apache 2.0. Technically given the license and the fact they're violating some of the terms already, Google could actually tell them to delete the model at any time, if you take the terms at face value... that goes for any derivative model thereafter

Whether all that matters you to personally, as someone who can download the weights and give Google a big middle finger as you sail off into a sunset comprising of explicit pornographic material, is another matter. But I wouldn't take Lumina 2 license seriously and it surprises me anyone does. If you used it for commercial purposes you'd be taking a huge risk
Replies: >>105673167
Anonymous
6/22/2025, 8:07:57 PM No.105673094
>>105673074
with svdquant 20steps is 48 seconds on a 3060 @100W pl
>>105673058
gpu? rest of the specs?
Replies: >>105673134 >>105673172
Anonymous
6/22/2025, 8:08:06 PM No.105673095
>>105672937
Grab a 5060Ti 16gb, don't throw away your old gpu, you can use them both by partially offloading larger models to the second card. Not nearly as good as a single gpu with 24gb but many times faster than offloading to system RAM
Replies: >>105673147 >>105673187
Anonymous
6/22/2025, 8:10:17 PM No.105673118
>>105671676
What? With a $3k budget anything other than 5090 is asking for trouble. Chinks are very crafty scammers, don't do it anon.
Anonymous
6/22/2025, 8:11:15 PM No.105673129
>>105673086
am i wrong
Replies: >>105673144
Anonymous
6/22/2025, 8:11:44 PM No.105673134
>>105673094
is there any guide on making it work?
Replies: >>105673163
Anonymous
6/22/2025, 8:12:27 PM No.105673144
>>105673129
No, you're Indian.
Anonymous
6/22/2025, 8:12:37 PM No.105673147
>>105673095
>many times faster than offloading to system RAM
Don't most of the consumer hardware requires you to pass the data to ram first before going into another gpu aside from old nvlink tech? the main benefit from multigpu systems is if you can permanently keep layers in another gpu and pcie isnt the bottleneck
Anonymous
6/22/2025, 8:12:44 PM No.105673149
73026e081_cleanup
73026e081_cleanup
md5: ae4091547d1f7e1afb69114cab9383d8🔍
Replies: >>105673220
Anonymous
6/22/2025, 8:13:20 PM No.105673155
>>105672988
Please, it's for an illustrious training
Replies: >>105673194
Anonymous
6/22/2025, 8:14:01 PM No.105673163
>>105673134
https://files.catbox.moe/x4ev0o.png
heres a workflow, super ez to set it up on linux, the guide is on the nunchaku comfyui github repo
if you want 20 steps change res_multistep to euler
Replies: >>105673241
Anonymous
6/22/2025, 8:14:21 PM No.105673167
>>105673087
Interesting point, thanks for the explanation. I'm not open source brained enough yet to make sure to check the upstream licensing. I'll also add this to my list of pointless arguments why someone should do an SD3.5m animetune.

On that note, any news about Animaestro?
Anonymous
6/22/2025, 8:14:53 PM No.105673172
1728142673747902
1728142673747902
md5: 2bbff768a1321fb2eb455131195b35c5🔍
>>105673094
>gpu? rest of the specs?
It's an old 1650 super, not sure what gpu specs are even relevant besides vram
But I was more asking for a general direction in terms of whether low vram gens have gotten significantly faster for other anons
Replies: >>105673190
Anonymous
6/22/2025, 8:16:22 PM No.105673187
smol titty kitty
smol titty kitty
md5: b5a98d174779eb499f8beca1e68d4f6e🔍
>>105673017
What's WAN, quants and chroma?

prompt was something like Ghislaine, simple style, flat colours, loli, very chibi, huge breasts, etc, etc
>>105672987
>if you're feeling lucky check out intel and amd too they're cheaper and have fairer amounts of vram but keep in mind many optimizations are nvidia only
I'm definitely not feeling lucky after reading billion people saying "Nvidia or suffer".
>>105673029
>and which are those cases exactly?
I linked these 2 sites that compare relative performance.
>>105673095
>Grab a 5060Ti 16gb, don't throw away your old gpu
That's the funny part, I have no GPU right now. Would a 3090 work faster than a 5060 Ti? Anyone has experience with upgrading VRAM but using an older card?
Replies: >>105673207 >>105673217 >>105673261
Anonymous
6/22/2025, 8:16:33 PM No.105673188
>>105673058
comfy or reforge are the quickest for what youd be doing
you be able to at least run XL but it would be slow
Anonymous
6/22/2025, 8:16:39 PM No.105673190
>>105673172
i meant rest of the rig
theres ggufs now and svdquant
Replies: >>105673281
Anonymous
6/22/2025, 8:17:00 PM No.105673194
>>105673155
better do a test run and see how it goes. You could use some language model to interrogate those settings
Replies: >>105673248
Anonymous
6/22/2025, 8:17:31 PM No.105673198
2341001-You are an assistant designed to generat-Lumina2nietaaniluminaAlpha_round7Ep4S68
Replies: >>105674637
Anonymous
6/22/2025, 8:18:16 PM No.105673207
>>105673187
>Would a 3090 work faster than a 5060 Ti?
yes
wan is a video model (image=>video/text=>video)
quants are 4bit,8bit...
chroma is a new text to image model, very good
Anonymous
6/22/2025, 8:19:23 PM No.105673217
Is anything new?

>>105673187
>That's the funny part, I have no GPU right now. Would a 3090 work faster than a 5060 Ti? Anyone has experience with upgrading VRAM but using an older card?
loads of people use a 3090.
Anonymous
6/22/2025, 8:20:25 PM No.105673220
>>105673149
cool shoes
Anonymous
6/22/2025, 8:22:25 PM No.105673241
>>105673163
>quantization process, known as SVDQuant, involves compressing the model weights and activations to 4 bits, significantly reducing the memory footprint and computational load. This is achieved by absorbing outliers in the data using low-rank components, which helps maintain the model's performance and visual quality. The extension integrates seamlessly with ComfyUI, allowing users to set up and execute workflows that take advantage of these optimizations.
Wow am I getting filtered. It what?
Replies: >>105673255
Anonymous
6/22/2025, 8:23:10 PM No.105673246
00039-1315088222
00039-1315088222
md5: e87b6c8f6df969f306e8bc7688419005🔍
Anonymous
6/22/2025, 8:23:19 PM No.105673248
>>105673194
Already did, that's why I'm asking for some smart anons.
The generated images don't pass the bar, any ideas?

LLM invent stuff many times and sometimes they don't know what they are talking about.
Replies: >>105673259
Anonymous
6/22/2025, 8:23:51 PM No.105673255
>>105673241
reduces vram for flux or chroma by 4 times while keeping quality super good
Replies: >>105673327
Anonymous
6/22/2025, 8:24:24 PM No.105673259
>>105673248
>they don't know what they are talking about.
What?
Replies: >>105673503
Anonymous
6/22/2025, 8:24:28 PM No.105673261
>>105673187
3060Ti was the only good 60 series RTX card, easily being on par or even beating 2080. 4060Ti could barely hold out against 3070 and 5060Ti actually loses to 4070 in terms of speed. 16gb is nice of course but you'd still take your sweet time genning stuff, unless the only thing you're interested in is smaller models like sdxl
Anonymous
6/22/2025, 8:26:36 PM No.105673281
>>105673190
Thanks I'll check these out
the rest of the rig is a ryzen 5600x and 32gb ram
Replies: >>105673308
Anonymous
6/22/2025, 8:29:55 PM No.105673308
>>105673281
https://huggingface.co/city96/stable-diffusion-3.5-medium-gguf/blob/main/sd3.5_medium-Q4_K_S.gguf
try this with comfyui gguf node
https://huggingface.co/city96/stable-diffusion-3.5-medium-gguf
report back with results, try with 512x512 and make sure to use linux to reduce vram usage, you could also disable browser hardware acceleration to reduce vram usage on linux too
Replies: >>105673544
Anonymous
6/22/2025, 8:30:25 PM No.105673316
I've been out of the loop for a while.

Is Hunyuan not even worth considering as a video model or something? No guides in the op? was Wan equally uncensored?
Replies: >>105673344
Anonymous
6/22/2025, 8:30:36 PM No.105673318
defa74c21_cleanup
defa74c21_cleanup
md5: af4715bf1182d28dadca94af0ebd56f9🔍
Anonymous
6/22/2025, 8:31:53 PM No.105673327
>>105673255
Tradeoffs?
Replies: >>105673343 >>105673360
Anonymous
6/22/2025, 8:32:29 PM No.105673337
>>105670540

I did not manage to make beer spilled.

How do you prompt it?
Anonymous
6/22/2025, 8:33:09 PM No.105673343
>>105673327
none
it's black magic
Replies: >>105673360
Anonymous
6/22/2025, 8:33:10 PM No.105673344
>>105673316
Wan is the only good video model right now >>105669256 (OP)
>>WanX (video)
>https://rentry.org/wan21kjguide
Anonymous
6/22/2025, 8:33:29 PM No.105673351
>>105670608

kino
Anonymous
6/22/2025, 8:34:03 PM No.105673360
>>105673327
>Tradeoffs?
>>105673343
>none
of course there's a tradeoff, the quality isn't close to bf16 or Q8
Anonymous
6/22/2025, 8:34:13 PM No.105673364
>>105670608
yjk
Anonymous
6/22/2025, 8:34:20 PM No.105673365
Fresh

>>105673353
>>105673353
>>105673353

Fresh
Anonymous
6/22/2025, 8:35:01 PM No.105673373
>>105669612
Every day man, every fucking day!

SMASH
Anonymous
6/22/2025, 8:45:14 PM No.105673466
>>105672042
>>105672123
>>105672074
>>105671851
>please post more, i-i mean that is disgusting!
imagine trying this hard to convince others?
kys
Anonymous
6/22/2025, 8:49:48 PM No.105673503
>>105673259
Sometimes LLM makes stuff on the fly. It happened just to me today.

If you really know about a topic and the AI does not, sometimes instead of saying "I don't know" it invents stuff. You shouldn't trust everything a LLM model says.
Anonymous
6/22/2025, 8:55:11 PM No.105673544
>>105673308
I'll give it a try later today. Thanks anon!
Anonymous
6/22/2025, 9:16:17 PM No.105673723
1725786180015994
1725786180015994
md5: e41cfe697af7020bbf686463722a6cea🔍
This was the most drastic hires fix I've ever seen and the denoise is only at 0.3. What the fuck lol
Anonymous
6/22/2025, 10:57:27 PM No.105674637
ComfyUI_02559_
ComfyUI_02559_
md5: 36ff8faabbded5fa5c81d971b4af7220🔍
>>105673198

I'm trying the nietaani lumina found on civitai too
Anonymous
6/22/2025, 11:09:31 PM No.105674754
ComfyUI_02572_
ComfyUI_02572_
md5: 27c3ea5c220b1785de0738db19740925🔍
Netaanilumina is really promising, very satisfying in terms of prompt adherence and character knowledge(recent characters tho)
Can't find more artist tags other than the one declared in the example image found on civitai.
Really wonky anatomy ,but all in all this alpha is usable