/ldg/ - Local Diffusion General - /g/ (#105814447) [Archived: 501 hours ago]

Anonymous
7/6/2025, 7:40:01 AM No.105814447
WVI2V_CC_INT_05-07-25-22-24_00002_thumb.jpg
WVI2V_CC_INT_05-07-25-22-24_00002_thumb.jpg
md5: 7744e62b24d4a15e3a6b0c089cf6d2cc🔍
Prev: >>105810290

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1

>Chroma
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Replies: >>105814464 >>105814484 >>105814658 >>105814882 >>105814928 >>105816304 >>105816328 >>105817872 >>105821064
Anonymous
7/6/2025, 7:41:41 AM No.105814456
WVI2V_CC_INT_05-07-25-22-15_00002_thumb.jpg
WVI2V_CC_INT_05-07-25-22-15_00002_thumb.jpg
md5: 8d249035ff9997f9e677e62815381b89🔍
Replies: >>105814484
Anonymous
7/6/2025, 7:42:51 AM No.105814464
>>105814447 (OP)
>the scene outside is exactly the same
holy sloppa
Replies: >>105814469
Anonymous
7/6/2025, 7:45:48 AM No.105814469
>>105814464
its called distance bozo
Anonymous
7/6/2025, 7:47:32 AM No.105814477
1704953097135656
1704953097135656
md5: c8f9b77313a6bc7f899871dc5d5fdc9a🔍
dang, animu coombait yapping with no sound while literally nothing happens
epic as fuck, OP
Replies: >>105814483 >>105816817
Anonymous
7/6/2025, 7:48:39 AM No.105814482
Should I download the kontext dev in case (((they))) remove it?
You know what? I will
Anonymous
7/6/2025, 7:48:46 AM No.105814483
look op if you are too lazy to pick a decent gen or make a collage just don't bake. this is embarassing. thread's a rotten zombie anyways, all the good ppl left. just let it die.
>>105814477
we need a new place, frenman
Replies: >>105816817
Anonymous
7/6/2025, 7:49:13 AM No.105814484
>>105814447 (OP)
>>105814456
I just got a 5090, how do I make stuff like this?
Replies: >>105814493 >>105814516 >>105814536 >>105814807
Anonymous
7/6/2025, 7:50:34 AM No.105814493
>>105814484
read the OP
Anonymous
7/6/2025, 7:54:28 AM No.105814516
>>105814484
You can't. Sorry. You have to have 4 x 5090 in sli
Replies: >>105814545
Anonymous
7/6/2025, 7:57:04 AM No.105814536
1749202317687250_thumb.jpg
1749202317687250_thumb.jpg
md5: 6fda4ac6362673947010e5dd2d91948b🔍
>>105814484
You have to get (un)comfy
https://rentry.org/wan21kjguide
Replies: >>105814545 >>105814548
Anonymous
7/6/2025, 7:58:42 AM No.105814545
>>105814516
Can I just stack 4xSwitchs on top of each other and call it a day?

>>105814536
Thanks
Anonymous
7/6/2025, 7:59:03 AM No.105814548
>>105814536
pepe is floating away on an iceberg
Anonymous
7/6/2025, 8:03:19 AM No.105814564
1495191589565
1495191589565
md5: 79994c69f0db647d52185e17ff9ef3de🔍
>mfw chinese 3060 i got for $200
surprisingly good for the money i paid, but she isn't generating anything animated.
Replies: >>105814631
Anonymous
7/6/2025, 8:09:42 AM No.105814595
ComfyUI_temp_lotsy_00027_
ComfyUI_temp_lotsy_00027_
md5: 220ed203f99a130fcbfcae85af8c413b🔍
I totally forgot there is a anime wan finetune, will try it now

https://civitai.com/models/1626197?modelVersionId=1852433
Replies: >>105814631 >>105814670 >>105814847 >>105814882
Anonymous
7/6/2025, 8:15:29 AM No.105814631
>>105814564
1986 lada can't participate in f1 race, d'oh. anything below a 3090 is outdated at this point. wait can you even load an sdxl model?
>>105814595
that underboob/whateverthefuck half tit-out top just doesn't do it for me, sorry. it's what creepy 50 year old japanese men find sexy.
Replies: >>105814641 >>105814673
Anonymous
7/6/2025, 8:17:39 AM No.105814641
>>105814631
>wait can you even load an sdxl model?
nta I've 3060 6gb card, I can use flux dev pruned just fine
sdxl works great
Replies: >>105814673
Anonymous
7/6/2025, 8:18:35 AM No.105814648
I put off getting into Image Gen and Stable Diffusion for a while because it seemed like a whole rabbit hole I would fall into and I wasn't even sure if was worth it. But holy shit, it's insane how easily you can started with this stuff. I haven't done anything else for days. I feel like Quagmire discovering internet porn for the first time.
Replies: >>105814754
Anonymous
7/6/2025, 8:19:17 AM No.105814654
1730161516030395
1730161516030395
md5: 2498edcaacac854bc80af562e02657e8🔍
You sods call this art?
LOL, LMAO
Anonymous
7/6/2025, 8:19:45 AM No.105814658
eh
eh
md5: bfa9e59cd661c62738e088eaac8c967a🔍
>>105814447 (OP)
>cameltoe
thread destined to be deleted?
i don't want to post my shit if the thread is doomed
Replies: >>105816817
Anonymous
7/6/2025, 8:21:02 AM No.105814670
WVI2V_CC_INT_06-07-25-02-09_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-09_00002_thumb.jpg
md5: 89bfb0098f130bdaac7dcd6d150f58a9🔍
>>105814595
seems like its overfit
Replies: >>105814882
Anonymous
7/6/2025, 8:21:33 AM No.105814673
>>105814631
>>105814641
i have the 12 GB version of the 3060. this is a laptop chip ported to a desktop PCB, which is why it was so cheap. despite that, its only slightly worse performance than an actual 3060, and I've ran into no problems generating SDXL stuff.

the chinese brand surprisingly honors their warranty, too.
Replies: >>105814754
Anonymous
7/6/2025, 8:23:00 AM No.105814677
someone make a proper thread pls
Replies: >>105816817
Anonymous
7/6/2025, 8:32:24 AM No.105814726
I am new to local gen. Do you need to start with images before trying video?
Replies: >>105814732 >>105814754
Anonymous
7/6/2025, 8:33:52 AM No.105814732
>>105814726
You can generate a video without ever having generating an image before, but generating images is way faster than generating videos. Like, 10 seconds for an image but 5 minutes for a video. So it's probably smarter to gain experience with generating images first.
Replies: >>105814793 >>105815049
Anonymous
7/6/2025, 8:36:56 AM No.105814748
WVI2V_CC_INT_06-07-25-02-28_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-28_00002_thumb.jpg
md5: 0220790410894f5aaf3d77aae853f743🔍
meh
Replies: >>105814882
Anonymous
7/6/2025, 8:38:25 AM No.105814754
>>105814648
it only gets better.
>>105814673
that is pretty cool. I started with a 2080 but it just didn't cut it for sdxl lol
>>105814726
do w/e you want anon. imagegen is a good starting point tho. some ppl never switched to videogen because fuck videogen.
https://www.youtube.com/watch?v=pWBtEccqMTU&list=RDpWBtEccqMTU
Anonymous
7/6/2025, 8:40:24 AM No.105814767
WVI2V_CC_INT_06-07-25-02-32_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-32_00002_thumb.jpg
md5: 3cd22e3aa399f0c7f1fc876e43114c84🔍
a little better
Replies: >>105814882
Anonymous
7/6/2025, 8:43:33 AM No.105814793
>>105814732
Understood, thank you.

For the last few days I've been practicing with Sora and HailuoAI but I've decided I want more control over my gens and less censorship. Should I start with something easier like Forge or go straight to ComfyUI? Does the latter allow for significantly greater control?

Also what is the difference between Wan2.1 and Wan2GP?

For the record, I have an RTX 5070 Ti.
Replies: >>105814846 >>105815535
Anonymous
7/6/2025, 8:43:58 AM No.105814794
WVI2V_CC_INT_06-07-25-02-36_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-36_00002_thumb.jpg
md5: 95a99e87ffbc659d164ce6e04dedc6f1🔍
Anonymous
7/6/2025, 8:46:12 AM No.105814807
>>105814484
https://github.com/deepbeepmeep/Wan2GP
fuck the nodes. embrace simplicity
Replies: >>105814818 >>105814829
Anonymous
7/6/2025, 8:48:02 AM No.105814818
>>105814807
>embrace simplicity
not just that but runs way better
Replies: >>105814823
Anonymous
7/6/2025, 8:48:51 AM No.105814823
>>105814818
true. comfy just fucking sucks now. the org really enshitified everything
Replies: >>105814836
Anonymous
7/6/2025, 8:49:35 AM No.105814828
WVI2V_CC_INT_06-07-25-02-40_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-40_00002_thumb.jpg
md5: 9684a3708f4cca2a79083cff4e3f43db🔍
this model sadly can't maintain the consistency of the first frame, wish there was a VACE version to play with first and last frame
Anonymous
7/6/2025, 8:49:47 AM No.105814829
>>105814807
contains malicious code, dont downlod
Replies: >>105814835
Anonymous
7/6/2025, 8:50:52 AM No.105814835
>>105814829
you mean comfyui right? it already installed bitcoin miners, malware and interception points to rob you while you pay for api nodes
Anonymous
7/6/2025, 8:50:53 AM No.105814836
preview
preview
md5: c7aa8f41dc2b3a2f1a2540c120bc38bf🔍
>>105814823
>now
it was a good idea for about two weeks
Anonymous
7/6/2025, 8:53:00 AM No.105814846
00081-4143260728
00081-4143260728
md5: 5c654b144024250b1bca56fd0e35c907🔍
>>105814793
Careful, the people on this board get irrationally upset if they see spaces between your statements. They start screaming "gb2 leddit, faggot!" just because you pressed the enter key an extra time.
I use Forge for image generation. It seems that all video generators expect you to be on ComfyUI. Be prepared to spend multiple hours downloading and installing all the shit needed to make Comfy generate videos. You might try Pinokio since it comes with "install everything I need with one click" for noobs, and has a much more user-friendly interface than Comfy.
WanGP has lower hardware requirements than Wan 2.1. I think the P literally stands for "poor" since it's designed for people who can't afford nice things, like a 5070 ti for example.
In absolutely all situations, whether it's images or videos or forge or comfy, your bottleneck will be VRAM. The 5070 ti has 16GB of VRAM. You'd generate videos much faster if you had a 5090 with its 32GB of VRAM, but you'll still be able to make some cool stuff.
Replies: >>105817056
Anonymous
7/6/2025, 8:53:28 AM No.105814847
>>105814595
I really dislike the way this bunch of ilu tunes handle body shading on Sailor Moon. I dislike it in general since it leads to "shaded body/flat face" effect, but on Sailor Moon it's especially jarring.
Weirdly, this was a problem with AOM, but it doesn't seem like it was inherited from AOM directly.
Replies: >>105814875
Anonymous
7/6/2025, 8:58:50 AM No.105814875
WVI2V_CC_INT_06-07-25-02-50_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-50_00002_thumb.jpg
md5: 8e94e8170eb91b1dcebe04530a0d12c1🔍
>>105814847
what would be a proper body shading?
Replies: >>105815191
Anonymous
7/6/2025, 9:00:19 AM No.105814882
>>105814767
>>105814748
>>105814670
>>105814595
>>105814447 (OP)
nakaԁashi
Anonymous
7/6/2025, 9:01:47 AM No.105814888
WVI2V_CC_INT_06-07-25-02-53_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-53_00002_thumb.jpg
md5: 682fa7956c44aa0766e45dc4075d4631🔍
Anonymous
7/6/2025, 9:07:32 AM No.105814913
WVI2V_CC_INT_06-07-25-02-58_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-02-58_00002_thumb.jpg
md5: 8320f76df5e7633fb92d67074410e628🔍
Anonymous
7/6/2025, 9:09:40 AM No.105814928
>>105814447 (OP)
>no collage
lame, what is this, /sdg/?
Replies: >>105815598 >>105816817
Anonymous
7/6/2025, 9:10:37 AM No.105814933
1740485939136411
1740485939136411
md5: a37da1164a41a9536ed0aec51cce4f26🔍
>sailor mussy general
since we've abandoned any decency or morals I'm just going to post some slop
Replies: >>105814952 >>105815462
Anonymous
7/6/2025, 9:14:12 AM No.105814947
Man, wish IL / Noob was just a tiny bit smarter, for example if it was able to replace hair with strands of cardboard, and similarly weird stuff.
Replies: >>105814993
Anonymous
7/6/2025, 9:15:09 AM No.105814952
>>105814933
how much slop are we talking? i have some slop to offload too
Replies: >>105814975
Anonymous
7/6/2025, 9:16:45 AM No.105814960
Any good anime video gens that replicate the exact artstyle and animation style as their original anime?
Replies: >>105814975 >>105815028 >>105817157
Anonymous
7/6/2025, 9:21:28 AM No.105814975
1739767420507191
1739767420507191
md5: 18a668b859553235d01668358158a13e🔍
>>105814952
my sloppa supply is infinite and I have some for any occasion.
Just don't ask where I got it

>>105814960
No, but give it time and China will probably release it.
Anonymous
7/6/2025, 9:22:11 AM No.105814982
WVI2V_CC_INT_06-07-25-03-13_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-03-13_00002_thumb.jpg
md5: 823df161250e8fd9f30428a80f0b631d🔍
Anonymous
7/6/2025, 9:24:38 AM No.105814993
00002-3905250099
00002-3905250099
md5: 1f05b20e2488c333e81a53019ce001e4🔍
>>105814947
let me give it a try
>masterpiece, 1girl, woman with cardboard hair
well that didn't work out as planned
but i don't dislike it
Anonymous
7/6/2025, 9:35:18 AM No.105815028
homura3_thumb.jpg
homura3_thumb.jpg
md5: 78cc220fc9600aa0eebefcf17990f81c🔍
>>105814960
Replies: >>105815071 >>105815084
Anonymous
7/6/2025, 9:39:59 AM No.105815049
>>105814732
are they kissing Itou from School Days?
Replies: >>105815124
Anonymous
7/6/2025, 9:40:32 AM No.105815054
What an unbelievably shitty thread. Hope Sailor Moon anon get's raped by a gang of niggers.
Replies: >>105815063 >>105815084 >>105815291
Anonymous
7/6/2025, 9:41:24 AM No.105815063
>>105815054
in the ass and the mouth
Replies: >>105815084
Anonymous
7/6/2025, 9:42:28 AM No.105815071
>>105815028
are they kissing Itou from School Days?
Replies: >>105815124
Anonymous
7/6/2025, 9:43:17 AM No.105815079
chroma-v42calibrated-Q8_00055_
chroma-v42calibrated-Q8_00055_
md5: 65f3871d8d07ce06e6e1eec067b257a4🔍
Replies: >>105815480
Anonymous
7/6/2025, 9:43:51 AM No.105815084
1739429899740494
1739429899740494
md5: bb486da87bcda32fff479711d6f7d60f🔍
>>105815028
>posting kino in the gooner thread
this webm both confuses and enrages me

>>105815054
>>105815063
that would certainly be more interesting than what we've seen thus far
Anonymous
7/6/2025, 9:45:22 AM No.105815094
chroma-v42calibrated-Q8_00057_
chroma-v42calibrated-Q8_00057_
md5: d54c8a9de1099fb334076dfd05a0f989🔍
Anonymous
7/6/2025, 9:51:10 AM No.105815124
sayaka-3_thumb.jpg
sayaka-3_thumb.jpg
md5: 588db3f980a2757ee9f67d070a7f2075🔍
>>105815049
>>105815071
Yes, how could you tell?
Replies: >>105815143 >>105815144 >>105815544
Anonymous
7/6/2025, 9:54:23 AM No.105815143
>>105815124
god damn, the absolute giga chad. post more.
Anonymous
7/6/2025, 9:54:23 AM No.105815144
>>105815124
why did you blurred haulio watermark?
Replies: >>105815196
Anonymous
7/6/2025, 10:04:33 AM No.105815191
>>105814875
The actual period cel shading is nigh impossible for ilu tunes: they learn faces and accessories fine, but are adamant about shading bodies with a single tired template.
So unironically, the path of least resistance would be to simply force flat colors.
https://m.media-amazon.com/images/M/MV5BOTU2ZjAwYTEtYWM4NC00YzIyLTlmOTEtMzIxYjFlY2IzOGY5XkEyXkFqcGc@._V1_.jpg
Anonymous
7/6/2025, 10:05:01 AM No.105815196
>>105815144
because the text becomes increasingly broken on every prompt
Replies: >>105815206
Anonymous
7/6/2025, 10:06:38 AM No.105815206
>>105815196
but it adds the watermark after genning
Replies: >>105815221
Anonymous
7/6/2025, 10:08:31 AM No.105815221
>>105815206
You don't get it, I gen from frames of previous hailuo gens. I can't be bothered photoshopping the text out every time so I just blur it before using the next gen.
Anonymous
7/6/2025, 10:14:04 AM No.105815265
ComfyUI_00024_thumb.jpg
ComfyUI_00024_thumb.jpg
md5: c453d075cc3d3524a8df77b70de6ea5c🔍
Don't you hate it when you try to summon an anime girl, and it turns it into one instead?
Anonymous
7/6/2025, 10:17:58 AM No.105815291
cyberrealisticXL_catalystXLV11-2025-07_003129-gen-583034770846919-00174
>>105815054
called it.
Anonymous
7/6/2025, 10:25:51 AM No.105815340
ComfyUI_temp_cblxs_00080_
ComfyUI_temp_cblxs_00080_
md5: fdc5d7ecbf2dd082eb0b779d4600dd6a🔍
Here is my idea for a game: You use an llm to make a blue archive-like tactical shooter but with way more control. And you'd hook it up with a model that would generate the images for the game.
Replies: >>105815345
Anonymous
7/6/2025, 10:27:52 AM No.105815345
>>105815340
it won't be long before you can type and talk whatever you want with LLM-powered NPCs and other players in vidya
Replies: >>105815391
Anonymous
7/6/2025, 10:30:41 AM No.105815363
kontext dev pruned when?
Replies: >>105815388
Anonymous
7/6/2025, 10:34:12 AM No.105815377
1745674154305297_thumb.jpg
1745674154305297_thumb.jpg
md5: e55c08b8e31c545c3a5f5c20bd38f3bd🔍
>>105813581
That's a cool idea.
Anonymous
7/6/2025, 10:34:39 AM No.105815379
>can't download kontext with a download manager
gay
I'm not putting my login info in it
Anonymous
7/6/2025, 10:36:02 AM No.105815388
>>105815363
https://civitai.com/models/1719758/flux1-kontext-dev
Anonymous
7/6/2025, 10:37:04 AM No.105815391
>>105815345
comfyui is gpl3 so it will never be able to ship in a game
Replies: >>105815405
Anonymous
7/6/2025, 10:39:37 AM No.105815405
>>105815391
Just make the gui in the game. And just ship it with already needed packs.
Replies: >>105815418
Anonymous
7/6/2025, 10:41:22 AM No.105815418
>>105815405
YOU CAN'T RETARD. GPL3 FORBIDS SHIPPING THE CODE WITH THE FUCKING GAME.
Anonymous
7/6/2025, 10:44:40 AM No.105815442
Is cooooming the only true use case for ai?
Replies: >>105815446
Anonymous
7/6/2025, 10:45:35 AM No.105815446
>>105815442
propaganda and population control is
Anonymous
7/6/2025, 10:49:38 AM No.105815462
>>105814933
Oh how the tables have turned now you're the one destroying your own general
Replies: >>105815469
Anonymous
7/6/2025, 10:51:13 AM No.105815469
1720358149705515_thumb.jpg
1720358149705515_thumb.jpg
md5: c5138a9e78b9ae1f7f1119f474ffc48c🔍
>>105815462
I'm not the OP, I'm just a tourist.
Please tag him, he's the faggot you want to berate.
Anonymous
7/6/2025, 10:53:39 AM No.105815480
>>105815079
pov chinese
Anonymous
7/6/2025, 11:04:18 AM No.105815535
Screenshot 2025-07-06 050220
Screenshot 2025-07-06 050220
md5: 35ce190111f5453fc3734f356cab3a1e🔍
>>105814793
get wan2gp and framepack studio via pinokio.
Anonymous
7/6/2025, 11:05:16 AM No.105815544
>>105815124
>local will never have something this good

It'a over
Replies: >>105815557 >>105816063
Anonymous
7/6/2025, 11:07:17 AM No.105815557
>>105815544
you'll get a distilled encrypted untrainable version from BFL..
in 3 years
Replies: >>105815580
Anonymous
7/6/2025, 11:10:20 AM No.105815580
>>105815557
Wishful thinking. BFL has gone full closed source. They would've given us Kontext Pro or Flux. 2 which they are gatekeeping a long time ago if they wanted to.
Replies: >>105815600
Anonymous
7/6/2025, 11:13:26 AM No.105815598
00020-2824926600
00020-2824926600
md5: 587558f7e6d5825fd08aadfe31f1a793🔍
>>105814928
getting into the collage use to give the euphoric energy rush to keep genning images for 3+ hours. But lately i rarely ever get into the collage these days. The Op in these generals just prefers to put disgusting meme gens and boring basic bitch miku slop. If you want engagement go to /trash/ or other fast diffusion general threads.
Replies: >>105815611 >>105815808 >>105815815 >>105815845 >>105815990 >>105816355 >>105816817 >>105817063
Anonymous
7/6/2025, 11:13:37 AM No.105815600
>>105815580
they always were closed-source. flux dev and schnell were designed to be untrainable, which is why lodestone had to hack schnell apart and basically retrain it from scratch. their local releases are just glorified ads for their API. there is no model team out there with a 100% local-first approach except maybe deepseek.
Replies: >>105815639
Anonymous
7/6/2025, 11:16:18 AM No.105815611
>>105815598
It's not that deep. You shouldn't need external encouragement to enjoy making AI art, and if you want the thread to have a collage, you don't need to be the OP to make one.
Replies: >>105815701
Anonymous
7/6/2025, 11:21:56 AM No.105815639
>>105815600
Chroma still remains the only way out of this mess, but it's a shame, really. Local could've been so much more. I mean, it would buried on the ground if not for Chroma. SDXL is dead. Stability is dead. Flux dev suffers from fake skin. Chinks only release slop and gatekeep their best models.
Replies: >>105815845 >>105817094
Anonymous
7/6/2025, 11:29:03 AM No.105815682
I need a model that works well with boomer prompting (preferably SDXL) for using with AI Dungeon. What do you suggest?
I've been using rawcharm, which I had lying around, but it inserts pussy and cock everywhere just because.
Anonymous
7/6/2025, 11:34:42 AM No.105815701
00002-485347828
00002-485347828
md5: 7d478cfe5b7647a60792d3d231c8e5ec🔍
>>105815611
positive reinforcement from ai threads from this site or subreddits is the only positive attention I ever get these days so I've become addicted to it. My real life is boring as shit and i hate draining all my creativity brain energy at wageslaving.
Replies: >>105815711 >>105815728 >>105816355
Anonymous
7/6/2025, 11:37:49 AM No.105815711
>>105815701
I like your gen, anon. I know how soul-sucking wage slavery can be, I don't have much to say here.
Anonymous
7/6/2025, 11:40:41 AM No.105815728
>>105815701
2.5D look is pretty basic, it doesn't look like anything
Anonymous
7/6/2025, 11:51:24 AM No.105815808
00031-2983094756
00031-2983094756
md5: f62bf45261528afd521e912b401bd85f🔍
>>105815598
i would like to steal your waifu and fuck her
Anonymous
7/6/2025, 11:52:03 AM No.105815815
>>105815598
he uses a script that grabs random images from the thread
he's coy about it and pretends that's not what he does, but that's exactly what he does
Replies: >>105815962
Anonymous
7/6/2025, 11:53:04 AM No.105815820
Can anyone share an illustrious workflow with loras?
I downloaded one from civit but it has like 10 useless nodes missing and I don't need all the extra stuff
Thanks in advance
Replies: >>105815836 >>105815852
Anonymous
7/6/2025, 11:55:21 AM No.105815835
>Total VRAM 6144 MB, total RAM 15724 MB
Anonymous
7/6/2025, 11:55:24 AM No.105815836
>>105815820
This one doesn't use any custom nodes:
https://rentry.org/comfyui_guide_1girl#using-loras
Replies: >>105815843
Anonymous
7/6/2025, 11:56:31 AM No.105815843
>>105815836
damn
sorry
I should have checked the rentry first
Anonymous
7/6/2025, 11:56:46 AM No.105815845
Chroma_final_00037__resized
Chroma_final_00037__resized
md5: 8e7e947c41491d53aa9951640f2c1d14🔍
>>105815598
Getting into a collage is always a nice feeling, I agree. But getting a comment by an anon trumps that, if it ever happens.
But in the end, I'm genning for myself and come here for inspiration and news, hoping that some anons might get some inspiration from my slop as well.
>>105815639
Man, I always wanted to get hidream going but it's even slower than Chroma for me and the results haven't been amazing. Probably a skill issue, though.
Replies: >>105816001 >>105816100
Anonymous
7/6/2025, 11:57:18 AM No.105815852
00032-1690868374
00032-1690868374
md5: e7c0a929124850fdd971732eb0199265🔍
>>105815820
Well what are you trying to gen? What kind of effect do you want to get out of your loras?
I usually don't even use any loras unless I'm trying to gen a very specific character or style
Replies: >>105815871
Anonymous
7/6/2025, 11:59:32 AM No.105815871
>>105815852
anime 1girl
Replies: >>105815917
Anonymous
7/6/2025, 12:04:10 PM No.105815917
00034-1787893183
00034-1787893183
md5: 045813b48f1bba4b1979be62d0d3e01e🔍
>>105815871
Well, you don't really need any special workflow or loras for that
>masterpiece, 1girl, solo
will do just fine
Anonymous
7/6/2025, 12:11:26 PM No.105815962
>>105815815
nta but I don't think that's it. A few threads ago I got three of my images in, and statistically it makes no sense.
Anonymous
7/6/2025, 12:15:27 PM No.105815990
>>105815598
I just like to see what anons were up to and what other anons possibly enjoyed in particular. But yeah I'm not too keen on the cringeworthy ones either if that comforts you.
Anonymous
7/6/2025, 12:16:48 PM No.105816001
>>105815845
>I'm genning for myself and come here for inspiration and news, hoping that some anons might get some inspiration from my slop as well.
Based anon of truthposting.
Anonymous
7/6/2025, 12:23:46 PM No.105816053
Has anyone tried turning sketches to images with kontext?
Replies: >>105816232
Anonymous
7/6/2025, 12:24:42 PM No.105816063
>>105815544
>local will never have something this good
Is this true? That would be disappointing
Anonymous
7/6/2025, 12:31:24 PM No.105816100
00006-3811911574
00006-3811911574
md5: 0b5d6f0e42b723da6d44e0db91c08db4🔍
>>105815845
i loved the constant begging and trading of catboxes back in 2023. Learning new artists, prompt techniques, extensions, settings and models was awesome. Happy memories begging anon for their catboxes and discovering cardos anime and animated models. I get teary looking at my old a1111 output folder and folder of gens from other anons i saved.
Replies: >>105816236
Anonymous
7/6/2025, 12:43:39 PM No.105816197
6RcStHj5
6RcStHj5
md5: 6555de9458d132423cc610388b9a9d3f🔍
Anonymous
7/6/2025, 12:44:25 PM No.105816204
00022-2964827577
00022-2964827577
md5: a6e99c4bb6999a05ae554487e4206878🔍
Anonymous
7/6/2025, 12:48:26 PM No.105816232
>>105816053
coloring yeah but we have hundreds of models that do it faster. If you want it to stylize it's shit at it
Replies: >>105816283 >>105816287
Anonymous
7/6/2025, 12:48:35 PM No.105816236
chroma-v42calibrated-Q8_00006_
chroma-v42calibrated-Q8_00006_
md5: b5dcb53e08b900e39e5cd441685b82d2🔍
>>105816100
good times
Anonymous
7/6/2025, 12:58:07 PM No.105816271
>take photo of a random woman on the street
>undress with flux kontext
why does it get boring real quick?
Anonymous
7/6/2025, 1:00:02 PM No.105816283
ComfyUI_temp_aefrz_00001_
ComfyUI_temp_aefrz_00001_
md5: 95235655b6684a51c78af370a40f4b6e🔍
>>105816232
>we have hundreds of models that do it faster
which one would you suggest to make it realistic?
Replies: >>105816287 >>105816294 >>105816368
Anonymous
7/6/2025, 1:01:04 PM No.105816287
ComfyUI_00071_
ComfyUI_00071_
md5: e980a07005671acfb742a26825a26bc8🔍
>>105816232
>>105816283
because kontext created this kino
Anonymous
7/6/2025, 1:02:39 PM No.105816294
>>105816283
Can controlnets be used for chroma/flux? Because there was a controlnet for sketches.
Anonymous
7/6/2025, 1:06:25 PM No.105816304
>>105814447 (OP)
So I had a dream I was firing people last night and my coworker came up to me and said someone needs to talk to the boss about the firings, so I said you want someone to talk to him, you got it. I walked him right up to the bosses office and said. I need to talk to you about the firings that there is NOT enough of. You've only fired 9000 people when we need to dire 20000. Now im not saying 9000 is bad, that is fantastic, *looks at coworker* but we need 20000.
Replies: >>105816324
Anonymous
7/6/2025, 1:10:13 PM No.105816324
>>105816304
This is /ldg/, my friend. Nice dream though.
Anonymous
7/6/2025, 1:10:37 PM No.105816328
Akko sweating
Akko sweating
md5: a46f42a6b4bfa811ac56d34ef7843784🔍
>>105814447 (OP)
i've read that ComfyUI is not safe (has no built in security), is this true ?
i'm trying to get started with an RTX 3070 8GB but i have no idea what image generator i should use and what video generator i CAN use.
i think Wan2GP is for videos with shitty GPU's up to good GPU's but i have no idea about the image generator, like what generator is "safe" for use security wise, which one is used for non-anime/nude images, which one for art (i saw a chroma pic that looked very good tho) and which one is used for anime pics generation ?
i need HELP !
Anonymous
7/6/2025, 1:12:29 PM No.105816336
Untitled
Untitled
md5: e6e77e8220fc09d9b3f513db26b970e9🔍
lol
Replies: >>105816345 >>105816347 >>105816369 >>105816472 >>105816497
Anonymous
7/6/2025, 1:14:08 PM No.105816345
>>105816336
A little bit of wireless linking is acceptable.
Anonymous
7/6/2025, 1:14:12 PM No.105816347
df0
df0
md5: f18e9325221828b2b7f79d349b574eee🔍
>>105816336
Anonymous
7/6/2025, 1:15:32 PM No.105816354
ComfyUI_00072_
ComfyUI_00072_
md5: 7f3abbcba1a53025bf6b7194098bcd82🔍
Anonymous
7/6/2025, 1:15:36 PM No.105816355
>>105815598
>>105815701
i think its a little pathetic to seek validation through AI generated stuff, but i don't want to take that away from you considering you wagecuck and that sucks. i can't talk shit, either. since i use AI specifically to make goon material for fetishes most people wouldn't draw.
Anonymous
7/6/2025, 1:17:47 PM No.105816364
any way to easily compress loras?
i have like 800 gigs of them and its getting out of hand
Replies: >>105816373 >>105816554
Anonymous
7/6/2025, 1:18:02 PM No.105816368
chroma-v42calibrated-hrfix_00003_
chroma-v42calibrated-hrfix_00003_
md5: 326730b20fa0f3387d55717ad827c7b3🔍
>>105816283
interrogated with grok
Replies: >>105816377
Anonymous
7/6/2025, 1:18:11 PM No.105816369
ComfyUI_temp_jsrbr_00001_
ComfyUI_temp_jsrbr_00001_
md5: 478196fa5af67068df5e2df4f4b26e8e🔍
>>105816336
>people be like "this is my basic workflow"
I hate this community so much.
Anonymous
7/6/2025, 1:18:25 PM No.105816373
>>105816364
The answer is to disk space, I'm afraid.
Replies: >>105816380
Anonymous
7/6/2025, 1:19:08 PM No.105816377
>>105816368
share your chroma workflow please my fren
Replies: >>105816547
Anonymous
7/6/2025, 1:19:26 PM No.105816380
>>105816373
Derp. I meant buy storage.
Anonymous
7/6/2025, 1:28:23 PM No.105816440
AnimateDiff_00034_thumb.jpg
AnimateDiff_00034_thumb.jpg
md5: e62fae6d005e88bd2e1f0c1ba33681b2🔍
Replies: >>105816604 >>105816920 >>105816936
Anonymous
7/6/2025, 1:29:55 PM No.105816450
p7tx4Bbu
p7tx4Bbu
md5: a9c4493c35bdf6b8a0c575c16e6ed7a7🔍
Anonymous
7/6/2025, 1:33:45 PM No.105816472
>>105816336
>all that
>basically for nothingburger
Anonymous
7/6/2025, 1:38:00 PM No.105816497
00035-2957370306
00035-2957370306
md5: cc9048535d96f69ff40daf88d563517a🔍
>>105816336
what that absolute fuck. All that bullshit noodles just to gen a few images and fap. I'll stick a basic ass gradio webui like reforge
Replies: >>105816798
Anonymous
7/6/2025, 1:47:30 PM No.105816547
chroma-v42calibrated-Q8_00026_
chroma-v42calibrated-Q8_00026_
md5: f951db3ba6f256f3226dffcb76dd6000🔍
>>105816377
just normal workflow bloated to hell with experimental nodes and stuff, get one from here instead https://civitai.com/models/1330309?modelVersionId=1884429
Anonymous
7/6/2025, 1:48:27 PM No.105816554
15815030_3
15815030_3
md5: 069f4573f410fe2421a470b1fa771c69🔍
>>105816364
just transfer loras and models you hardly use into a hard drive safe keeping. Hard dives and docking stations are pretty affordable.
Replies: >>105816571
Anonymous
7/6/2025, 1:52:03 PM No.105816571
>>105816554
can I plug 4 or 5 hard drives to my computer using this?
Replies: >>105816627
Anonymous
7/6/2025, 1:59:33 PM No.105816604
>>105816440
Teach me how please.
Replies: >>105816619
Anonymous
7/6/2025, 2:02:59 PM No.105816619
>>105816604
do you have a 5090 and 128gb ram?
Replies: >>105816643 >>105817924 >>105819416
Anonymous
7/6/2025, 2:03:50 PM No.105816623
I can't cope with that
Anonymous
7/6/2025, 2:05:02 PM No.105816627
>>105816571
no, you can only up to two plug sata hard drives or ssd at a time. I use it store files of all kinds for long term safe keeping. Its always good to keep back ups of your stuff you value.
Anonymous
7/6/2025, 2:06:41 PM No.105816636
Ok, I propose extra guider for WAN video2video: it will calculate energy/motion vector/matrix for the first video and use it for the second, preserving movement, like in physics.
I'm not a chinese and I'm not working in some university, so I can't implement that.
Anonymous
7/6/2025, 2:07:31 PM No.105816640
Maybe I'll wait for 6090 laptops.
Anonymous
7/6/2025, 2:07:59 PM No.105816643
>>105816619
No, and you? Are they required?
Replies: >>105816864
Anonymous
7/6/2025, 2:09:14 PM No.105816647
ComfyUI_temp_cblxs_00067_
ComfyUI_temp_cblxs_00067_
md5: 2cd273af955e3abc8f71e35e846fdf9a🔍
Can I tell the AI to imagine the piece as if it was rendering it in 128x128 and to omit the finer details?
Replies: >>105816769
Anonymous
7/6/2025, 2:09:23 PM No.105816648
When training a lora, does having higher steps result in worse quality or diminishing returns?
Replies: >>105816684 >>105816688
Anonymous
7/6/2025, 2:10:41 PM No.105816655
Also can I speak to AI to draw "unicode code blah blah" and it will invoke the character?
Anonymous
7/6/2025, 2:17:25 PM No.105816684
>>105816648
it's best to monitor epochs, somtimes earlier ones can have better output, more steps will give diminishing results, eventually leading to either overcooked weights and / or lack of flexibility with the lora, which defeats it's purpose of being used with different prompts, styles, models, etc
Anonymous
7/6/2025, 2:17:55 PM No.105816688
>>105816648
if you use cosine with standard settings it's pretty hard to fry lora. You could try the generate image every x epoch function and stop training when it's gens the same image, however it's possible that lora just doesn't seem to converge, but functions just fine besides that
Anonymous
7/6/2025, 2:28:27 PM No.105816755
CIOdC0oG
CIOdC0oG
md5: e0b06291f3a8ebff298afe42617df5d4🔍
Replies: >>105816802
Anonymous
7/6/2025, 2:29:36 PM No.105816769
>>105816647
Probably better to either use something like flux, or a lora that minimalizes details, and manually downscale it in something like krita.
Anonymous
7/6/2025, 2:29:44 PM No.105816770
shorts__thumb.jpg
shorts__thumb.jpg
md5: fd07941043b91ef544642b76b437a9c3🔍
Anonymous
7/6/2025, 2:35:11 PM No.105816798
>>105816497
reForge my beloved
panchovitox my beloved
Anonymous
7/6/2025, 2:35:26 PM No.105816802
>>105816755
woah
Anonymous
7/6/2025, 2:38:14 PM No.105816817
>>105814477
>>105814483
>>105814658
>>105814677
>>105814928
>>105815598
None of you ever make collages tho
Anonymous
7/6/2025, 2:42:04 PM No.105816834
What is the main problem with local video gen compared to hailuo?
Replies: >>105816902
Anonymous
7/6/2025, 2:46:02 PM No.105816864
AnimateDiff_00036_thumb.jpg
AnimateDiff_00036_thumb.jpg
md5: 74a26aeb5db8381040916ba8619e407f🔍
>>105816643
nta but no, i have a 3090 and 64gb ram
Replies: >>105816900 >>105816936 >>105816962
Anonymous
7/6/2025, 2:52:36 PM No.105816900
Local Democratic Governance
Local Democratic Governance
md5: d6486c57af5d73d19dab2d316c082d28🔍
>>105816864
not exactly the vibe I was going for, but that makes it only funnier to see, smooth moves while at it
Anonymous
7/6/2025, 2:52:55 PM No.105816902
>>105816834
never used hailuo but any "why is local version so behind??" question can almost always be answered with
>bad datasets
>less parameters
>local hardware running quants
>sekret saas technology
Anonymous
7/6/2025, 2:54:03 PM No.105816914
gimm
gimm
md5: 357d99dcf36f21357c053e825c543d22🔍
>>105794731
>what's the best tool for frame interpolation and increasing fps now?
i've been using gimm-vfi lately
https://github.com/kijai/ComfyUI-GIMM-VFI
Anonymous
7/6/2025, 2:55:01 PM No.105816920
>>105816440
god damn
Anonymous
7/6/2025, 2:57:07 PM No.105816936
>>105816864
>>105816440
how are you keeping your video gens under 4mb. I want to post videos but the gens keep exceeding 4mb.
Replies: >>105816940 >>105817055
Anonymous
7/6/2025, 2:57:46 PM No.105816940
>>105816936
re-encode them to be under 4 mb?
Replies: >>105816950
Anonymous
7/6/2025, 3:00:07 PM No.105816950
>>105816940
i refuse to believe it's that simple, you can just compress a video?
Replies: >>105817055
Anonymous
7/6/2025, 3:02:07 PM No.105816962
>>105816864
Can you share workflow?
Replies: >>105817055
Anonymous
7/6/2025, 3:05:48 PM No.105816986
my first video gen
my first video gen
md5: 869e42942b50fd656d1f413ab2465e2d🔍
just started my first generation attempt ever with Wan 2.1, pic related, how do i know how long it will take with basic default settings ?
i'm running with a 3070 8GB.
Replies: >>105817081 >>105817193
Anonymous
7/6/2025, 3:07:01 PM No.105816995
WVI2V_CC_INT_636068018459649_00001_thumb.jpg
WVI2V_CC_INT_636068018459649_00001_thumb.jpg
md5: 4b7d89c13ffd033a8d78e15b61084de7🔍
heres a song I made with acestep:

https://vocaroo.com/142xQz7cdusw
Replies: >>105817113
Anonymous
7/6/2025, 3:13:36 PM No.105817055
Animatediff 00039_thumb.jpg
Animatediff 00039_thumb.jpg
md5: c4bc345bb63feb875b15acd70c15a1cb🔍
>>105816936
I just throw it in handbrake and re-encode with the fast 1080p preset if they exceed 4mb. takes two seconds
>>105816950
bruh are you for real
>>105816962
it's the rentry workflow pretty much
https://files.catbox.moe/kdrm1l.png
Replies: >>105817100
Anonymous
7/6/2025, 3:13:42 PM No.105817056
>>105814846
Well, because reddit is the place where you need to put an extra CR in for it to separate a quote from the text you write, so it is a sure tell.

Sounds more like you are butthurt because you were correctly called out.
Replies: >>105819421
Anonymous
7/6/2025, 3:14:53 PM No.105817063
>>105815598
I now remember Angel Blade for the first time in ages
Anonymous
7/6/2025, 3:17:12 PM No.105817079
slowmo realism = AI
it's so easy to spot
Anonymous
7/6/2025, 3:17:21 PM No.105817081
>>105816986
holy shit don't torture yourself using regular wan models. select the self forcing model or using fusionx with light2v lora. i have 4060ti 16gb with 64gb of ram setup, the self forcing models takes 4-5 minutes.
Replies: >>105817087 >>105817109 >>105817193
Anonymous
7/6/2025, 3:18:58 PM No.105817087
>>105817081
i was following this guide :
https://www.youtube.com/watch?v=tsphZ-2bqko&t
i'm new at this
Replies: >>105817175
Anonymous
7/6/2025, 3:20:07 PM No.105817094
>>105815639
>SDXL is dead
No it's not, and neither are the many SDXL finetunes, it's actually shocking how much they are still in use.

That said when it comes to furthering the capacity of local you are correct, Chroma IS the one with potential, the base model looks strong enough so it comes down to the community lora ecosystem and potential further finetuning.

Also there could be a new image model dropping at any moment which beats all the currently available, most likely would have to come from China though, and they seem to be all into video these days.
Anonymous
7/6/2025, 3:21:32 PM No.105817100
>>105817055
> it's the rentry workflow pretty much
> https://files.catbox.moe/kdrm1l.png
> Animatediff
> wan
I don't understand.
Replies: >>105817430
Anonymous
7/6/2025, 3:22:55 PM No.105817109
>>105817081
>self forcing model
I thought it was just a lora, is there a full model ?
Anonymous
7/6/2025, 3:23:36 PM No.105817113
1742288878801828
1742288878801828
md5: 07b81cae09bd4bb18bc43bff19b3baef🔍
>>105813114
yes ty
>>105816995
are 1others acceptable
Anonymous
7/6/2025, 3:24:48 PM No.105817118
>asking kontext to put pictures of a new born on the wall
>it's putting pictures of a 2 year old kid instead
why didn't they train it on babies?
I need it for a project!
Anonymous
7/6/2025, 3:32:05 PM No.105817157
>>105814960
I made this
https://civitai.com/models/1671285
Anonymous
7/6/2025, 3:32:42 PM No.105817167
1750635347552045
1750635347552045
md5: a8bf00f8aa3bdc66ac60d2f4927cd8ab🔍
>shitpost master 9000
Anonymous
7/6/2025, 3:32:50 PM No.105817168
Cuz training for kiddie port in illegal and sick. Please seek professional help.
Replies: >>105817267
Anonymous
7/6/2025, 3:34:05 PM No.105817172
Why would anyanon care to generate anything other than gargantuan booba
Replies: >>105817182
Anonymous
7/6/2025, 3:34:49 PM No.105817175
>>105817087
i remember that guide but its 3 months old, the new optimized self forcing models came out a month ago. The self forcing models only take 4 steps to get a quality at 480p under 350 seconds. You going to waste a lot of valuable time genning with regular wan models. Make sure wangp is updated to v6.5 and select the self forcing model. save your time anon.
Replies: >>105817193
Anonymous
7/6/2025, 3:35:27 PM No.105817182
>>105817172
Flat chest on an adult woman with well-shaped thighs and huge ass is god tier.
Anonymous
7/6/2025, 3:37:16 PM No.105817193
first videogen ever_thumb.jpg
first videogen ever_thumb.jpg
md5: 09d554ab1b24966454ff9af095052ab2🔍
>>105817081
>>105817175
i've tried it and indeed it is faster but it's not generating what i want based on the image i used.
here's the result based on that pic :
>>105816986

so what to do if you want a video that looks like the image you give to the AI ?
better prompt ? better model ?
Replies: >>105817316
Anonymous
7/6/2025, 3:37:43 PM No.105817197
>I have to assume there will come a day where AI generated outputs can be so diverse, artistic, and accurate that someone can make an entirely open copy left model including all the training datasets, etc. but I don’t think we’re there yet. At the moment I’m pretty excited about Chroma which is getting great results and is relentlessly training day after day, already up to v42 out of 50 for its base model. I really expect the community will unify around it as much as is possible while video models are distracting people away from image models. Chroma is built off of Flux Schnell but it can do so much better at following a prompt. I’m a little disappointed it doesn’t appear to have the rich collection of artists you could make use of in your prompts that SD 1.x did but it is training on more of an internet art dataset. What would people here think of getting more involved in Chroma? I know it’s not a public domain dataset and maybe that’s what some wanted here but it really hits a lot of the goals for me: top quality, easy to train for and customize, permissive license, and best of all updated versions come out as quickly as every 4 hours right now if you want to try the experimental cfg 1 versions.
Replies: >>105817201
Anonymous
7/6/2025, 3:39:10 PM No.105817201
>>105817197
Tldr
Anonymous
7/6/2025, 3:40:23 PM No.105817209
1742809750112276
1742809750112276
md5: 7973399ad2bc148dbd2a71114e4531be🔍
Anonymous
7/6/2025, 3:43:47 PM No.105817224
937
937
md5: d3b71a6845819670d68a3422c2f66bff🔍
more on OP topic pl0xe
Anonymous
7/6/2025, 3:45:07 PM No.105817228
i just goon genned over 40 clips of the same girl
Replies: >>105817244
Anonymous
7/6/2025, 3:48:11 PM No.105817242
We need more people developing wf for acestep.

https://files.catbox.moe/28we52.flac

I keep getting gains. I'm struggling against cfg. Raise cfg, it starts to sound bad.

If you load the audio files, you will notice that gain is too high in may gens, lowering cfg lowers gain. Very strange, there has to be a strategy to prevent that, but I don't know how to get a vae preview, for example (of audio)
Anonymous
7/6/2025, 3:49:11 PM No.105817244
>>105817228
Me, only it's an acestep parody gen.
Anonymous
7/6/2025, 3:54:06 PM No.105817267
>>105817168
sick fuck
it generated an infant in the arms of his mother just fine but won't generate photographs of him on the wall
Anonymous
7/6/2025, 3:59:04 PM No.105817298
What's a good starting point for learning video gen from scratch? Is there a good guide that isn't outdated?
Replies: >>105817374
Anonymous
7/6/2025, 4:02:32 PM No.105817316
2025-07-06-09h57m08s_seed364872667_This is a highly detailed, photorealistic CGI imag_thumb.jpg
>>105817193
enhance your promptings with thishttps://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
you need to use natural language prompts and not the usual tagging prompt style used to make sdxl images.
Replies: >>105817472
Anonymous
7/6/2025, 4:08:14 PM No.105817345
H5E_4A-W
H5E_4A-W
md5: e0955a38f72e69f121ea297b91caa4b1🔍
Anonymous
7/6/2025, 4:11:49 PM No.105817374
WANI2V_22220029187_00002_thumb.jpg
WANI2V_22220029187_00002_thumb.jpg
md5: a859e30d94b20bc8af28158aaa59ac26🔍
So is self-forcing better than fusion? In terms of quality, speed or both?
>>105817298
I also just started yesterday. the wan guide in the OP is a bit broken atm (needed to roll back a few packages) but doesn't seem too abysmal desu.
Replies: >>105817725
Anonymous
7/6/2025, 4:12:39 PM No.105817380
please make the ugly stop
Replies: >>105817388
Anonymous
7/6/2025, 4:13:19 PM No.105817387
how to copy paste in this field
how to copy paste in this field
md5: ecbc7c54d2d3016508eb7e08b6f2a146🔍
how do i paste a text on this field ?
using Wan 2.1 in Pinokio.
there's no right click option to do this, can i enable it somewhere ?
Replies: >>105817399 >>105817463
Anonymous
7/6/2025, 4:13:37 PM No.105817388
>>105817380
turn on your monitor, you're staring at your reflection
Anonymous
7/6/2025, 4:14:48 PM No.105817399
>>105817387
nvm, i found out you can drag and drop a copied text, but i'd still like to know if you can just paste it directly with right mouse button instead.
Anonymous
7/6/2025, 4:18:39 PM No.105817430
>>105817100
That was interpolated with a separate workflow
Anonymous
7/6/2025, 4:24:21 PM No.105817463
>>105817387
ctrl+c to copy
ctril+v to paste
Anonymous
7/6/2025, 4:25:22 PM No.105817472
cat 2_thumb.jpg
cat 2_thumb.jpg
md5: 89aec2d0d815ff502e8723b211ac4308🔍
>>105817316
thanks for that, it's way better now, still ugly but at least it's relevant to the original image.
Anonymous
7/6/2025, 4:27:00 PM No.105817485
Hey, if I'm looking to create a lora for an obscure anime character, what size should my image dataset be? Is there a guide? I need different angles, poses, outfits, and lighting variations. How many do I need, and what resolutions?
I'm an artist, so generate diferent images datasets is no problem.
Replies: >>105817525 >>105817693
Anonymous
7/6/2025, 4:34:25 PM No.105817525
>>105817485
Depends on what you will use it for, if you need a versatile model, draw it from as many angles as you can, as for the amount of images, 20-30 should be enough to capture any 'characteristics', actually fewer but then you run the risk of overtraining before the model picks up all the small intricate details.

Which brings me to another point, those 20-30 images need to be at least somewhat varied in pose etc, else the model will overtrain quickly. Overtraining means that the model essentially spits out the same images you trained on, that's not what you want.

As for resolutions, at least the resolution you plan on training at, but bigger is always better since the training program will downscale the images to your training resolution if needed.
Replies: >>105817564 >>105817589 >>105817599 >>105818127
Anonymous
7/6/2025, 4:38:45 PM No.105817560
i've clicked the "Let the Lora's festival start !" download button on Wan 2.1 (with Pinokio), is it normal that it takes a long ass time to install ??
i have a fast internet connection.
because right now i can't select any Lora's, there's the typical option that tells you to find some.
it tells me "Lora's have been completely downloaded" but the terminal is still working on something.
is this normal ?
Replies: >>105817653
Anonymous
7/6/2025, 4:39:03 PM No.105817564
>>105817525
>20-30
nta, that's insane, I thought it took thousands
Replies: >>105817599
Anonymous
7/6/2025, 4:40:14 PM No.105817574
is ollama relevant for image gen?
Replies: >>105817592 >>105817608
Anonymous
7/6/2025, 4:42:12 PM No.105817589
>>105817525
is Kohaya still the best tool to train loras?
Replies: >>105817623
Anonymous
7/6/2025, 4:42:18 PM No.105817592
>>105817574
You can use it to write prompts I guess
Anonymous
7/6/2025, 4:43:32 PM No.105817599
>>105817564
>>105817525
>>20-30
>nta, that's insane, I thought it took thousands
Another anon here. 20-30 is a minimum but sometimes less is more
Anonymous
7/6/2025, 4:43:42 PM No.105817602
bros how are you editing your 32 fps clips? i was going to use davinci resolve, but I have realized it only exports to certain preset fps such as 16, 24, 30, 60
Replies: >>105818477
Anonymous
7/6/2025, 4:43:43 PM No.105817603
Where is the old google docs link with GPU benchmarks? It used to be in OP, and the vladmandic site is terrible
Anonymous
7/6/2025, 4:44:07 PM No.105817608
>>105817574
do you own a macbook pro and love docker?
Anonymous
7/6/2025, 4:46:26 PM No.105817623
>>105817589
Kohya is fine, but I would recommend OneTrainer, it has a nice gui with good defaults and it supports pretty much everything, Flux, SDXL, etc. Much easier than Kohya for beginners I think.

In terms of capacity/quality of output there's no difference, it's just a matter of which you prefer.
Replies: >>105817639 >>105817644
Anonymous
7/6/2025, 4:48:06 PM No.105817636
file
file
md5: 464cb8f970b578e101ab2db9e7d5b3e2🔍
Replies: >>105817751
Anonymous
7/6/2025, 4:48:27 PM No.105817639
>>105817623
finna make a lora of an unsuspecting girl tonight
Anonymous
7/6/2025, 4:48:54 PM No.105817644
>>105817623
>OneTrainer
Trash
Replies: >>105818376
Anonymous
7/6/2025, 4:49:04 PM No.105817645
WANI2V_22220029187_00004_thumb.jpg
WANI2V_22220029187_00004_thumb.jpg
md5: 58386f11848013c6045b6ad744e45d7b🔍
Another question, still working mostly off of the i2v workflow in the wan rentry.
I am getting: "AdaptiveGuider: Cosine similarity 0.9997 exceeds threshold, setting CFG to 1.0" message in the terminal after a few steps.
Since negative prompts don't work out of the box in CFG=1, does that mean next steps are purely based on normal prompt?
So do I need to add a NAG node? Would it improve image quality? If these are true, is there a reason why it isn't included?
Replies: >>105817761
Anonymous
7/6/2025, 4:50:23 PM No.105817653
>>105817560
you may have fucked up and started downloading hell of unnecessary loras. Check the terminal, also careful because huggingface may throw you 429 error and block your ip for day if you keep download too much files with heavy bandwidth.
https://huggingface.co/collections/Remade-AI/wan21-14b-480p-i2v-loras-67d0e26f08092436b585919b
Replies: >>105817709
Anonymous
7/6/2025, 4:51:40 PM No.105817663
how do i interpolate long clips? is there an automated batching? i tried a 2 minute clip but i got oom because it was trying to allocate 49gb of vram. would i have to manually chop up the video then re edit them together?
Anonymous
7/6/2025, 4:56:13 PM No.105817693
>>105817485
Oh, and then there's the captioning.

If you are just training a single character with a single outfit, you could just capture it as 'blabla character' and the model training will do a good job since it seeks to learn patterns, and the same character drawn in the same style is a very easy pattern.

However if you have different outfits, hair styles etc, and want to fine control over that, you typically need to identify them in the image captions. Like 'blabla character wearing a red tunic', 'blabla character with white hair wearing dark armor and holding a large sword' etc.

You can get away with not doing this and hope the model will pick it up itself during training, as in identifying that the image you your character holding a sword is exactly that, but if you caption it, that removes any ambiguity.
Replies: >>105818127
Anonymous
7/6/2025, 4:56:50 PM No.105817698
ComfyUI_00003_
ComfyUI_00003_
md5: fb79635581b7eeb813186a32cec9b408🔍
Replies: >>105817731 >>105817745
Anonymous
7/6/2025, 4:57:36 PM No.105817709
>>105817653
i've closed Pinokio and restarted Wan 2.1, the Lora download button says all of them are downloaded but nothing appears on the video generator Lora list button (the one that says : "Enter here a Name for a Lora Preset or a Settings or Choose one").
what am i doing wrong ?
Replies: >>105817738
Anonymous
7/6/2025, 4:59:13 PM No.105817725
>>105817374
nice tits, but can she SING to me?
Anonymous
7/6/2025, 5:00:13 PM No.105817731
>>105817698
Handsome beast
Replies: >>105817753
Anonymous
7/6/2025, 5:01:17 PM No.105817738
>>105817709
there set of folders "loras_i2v" and "loras" folder which is t2v.
Replies: >>105817920
Anonymous
7/6/2025, 5:02:02 PM No.105817745
>>105817698
Where did you get this photograph of me
Replies: >>105817784
Anonymous
7/6/2025, 5:03:00 PM No.105817751
>>105817636
But what about stuff that isn't nature? At least cars maybe, unless your dataset is limited.
Replies: >>105817783
Anonymous
7/6/2025, 5:03:00 PM No.105817753
ComfyUI_00042_
ComfyUI_00042_
md5: 43f9b5f52c057d49e5dd2fece6711e5d🔍
>>105817731
you should meet his dad
Replies: >>105817830
Anonymous
7/6/2025, 5:03:26 PM No.105817761
>>105817645
The fast workflow includes the NAG node if you need an example. I personally don't use adaptive guider, doesn't seem worth it.
Anonymous
7/6/2025, 5:05:23 PM No.105817783
file
file
md5: cafe2f25b9c485692622d80193300918🔍
>>105817751
This is where cars are at
Replies: >>105817812 >>105818854
Anonymous
7/6/2025, 5:05:26 PM No.105817784
>>105817745
*that photograph

>quothe
Determiner
that (plural those)

(demonstrative) The (thing, person, idea, etc) indicated or understood from context, especially if more remote physically, temporally or mentally than one designated as "this", or if expressing distinction.
That book is a good read. This one isn't.
That battle was in 1450.
That cat of yours is evil.
Anonymous
7/6/2025, 5:08:05 PM No.105817812
>>105817783
Ah well. Still, the hope never dies.
Replies: >>105817839
Anonymous
7/6/2025, 5:09:30 PM No.105817830
>>105817753
Where did you get this photograph of my dad
Replies: >>105817930
Anonymous
7/6/2025, 5:10:22 PM No.105817838
2025-07-06-10h46m49s_seed364760858_This is a highly detailed, digital CGI artwork dep_thumb.jpg
Replies: >>105817895
Anonymous
7/6/2025, 5:10:42 PM No.105817839
>>105817812
From what I understand it's about in the middle point and it'll never be done training as I'll keep letting it polish as a personal project and once it's good enough I'll release the training code so that I can once and for all end the problem with people saying they can't have good local models. Someone with Ponyfags hardware could make a 4B model in less time and likely be SOTA in most categories.
Replies: >>105817862 >>105817925
Anonymous
7/6/2025, 5:11:42 PM No.105817848
1725955844913553
1725955844913553
md5: 2cbd4e12a75eed569ded1268f1a0c92c🔍
Anonymous
7/6/2025, 5:14:29 PM No.105817862
>>105817839
Also I'm training my model are hard mode:
- position free embeddings so the patches have to learn how to extrapolate
- complex captions, some of which are ambiguous, poetic, and chaotic including alt tags
- lots of video frames testing to see if it improves spatial reasoning
A simpler dataset with simple captions likely would train faster but it's possible if you want to hit the next level (like anti-hallucinations, anti-bias) you need to do more than that. I have lots of captions which are actually negatives.

Dog picture.jpg => This is not a picture of a cat.
Anonymous
7/6/2025, 5:15:51 PM No.105817872
riaHappy
riaHappy
md5: 0a3ea5716081a1e9fb2913a490dc2ed3🔍
>>105814447 (OP)
I appreciate you being on model, but use softer shading to capture that old low-resolution animation aesthetic and, honestly, she'd be hotter in her costume unmodified.
Anonymous
7/6/2025, 5:20:13 PM No.105817895
>>105817838
>no bounce
This is an absolute disgrace. Just wait until I get home
Anonymous
7/6/2025, 5:22:20 PM No.105817920
>>105817738
found it, i made it work by putting it in the /loras folder and selecting it in advanced settings.
Anonymous
7/6/2025, 5:22:43 PM No.105817924
>>105816619
Yes actually
I don't know anything about this but I want to make vids like this
Anonymous
7/6/2025, 5:22:46 PM No.105817925
>>105817839
Yeah I 'member. Already nice that you didn't quit after the setback.
Replies: >>105817955
Anonymous
7/6/2025, 5:23:12 PM No.105817930
ComfyUI_00043_
ComfyUI_00043_
md5: a275c6415e0fd2ee718953f8dfa460d0🔍
>>105817830
from your mom, of course
Replies: >>105818324
Anonymous
7/6/2025, 5:26:30 PM No.105817955
>>105817925
This is technically the second model because I changed the architecture when Sana came out so it's basically a SOTA Pixart model with a 16 channel VAE. The main setback is the grind required but I've now completed my main training rig when I got a 5090 so I can more easily just put it out of my sight and let it train unmolested.
Replies: >>105817972 >>105817976 >>105818000
Anonymous
7/6/2025, 5:27:47 PM No.105817972
>>105817955
You are the man
Anonymous
7/6/2025, 5:28:04 PM No.105817976
>>105817955
>but I've now completed my main training rig when I got a 5090 so I can more easily just put it out of my sight and let it train unmolested.
Based
Anonymous
7/6/2025, 5:30:38 PM No.105818000
>>105817955
Godspeed, anon.
Replies: >>105818011
Anonymous
7/6/2025, 5:31:45 PM No.105818010
Xena Shrug
Xena Shrug
md5: b1f376c777e868f7c48ca9fd62860941🔍
so help me there, the more inference steps i set, the more relevant/precise my video will be ?
i'm trying to make a near 1 on 1 video of Xena the warrior princess from pic related with her facial expressions changing to disgust.
any tips ?
Replies: >>105818085
Anonymous
7/6/2025, 5:31:47 PM No.105818011
file
file
md5: c0896b700937a00a981931b8857e5b61🔍
>>105818000
Here's some sexy training graphs.
Anonymous
7/6/2025, 5:42:01 PM No.105818074
WVI2V_CC_RAW_06-07-25-15-25_00001_thumb.jpg
WVI2V_CC_RAW_06-07-25-15-25_00001_thumb.jpg
md5: b300852bb7beb7402d8bd5d917723148🔍
Gonna use wan to make some alternate perspectives of SHODAN and then use images from the output to make a lora in chroma. Wish me luck!!!!
Replies: >>105818127
Anonymous
7/6/2025, 5:43:11 PM No.105818085
adae13830ac25447
adae13830ac25447
md5: c8e68e745cca286c865e822dbfc3535e🔍
>>105818010
Not necessarily, but overall yes, you need to experiment.

Also at least use a decent quality source image
Anonymous
7/6/2025, 5:46:00 PM No.105818120
rt
rt
md5: ed0102cddc00a11cbe92c6e99ecd53ea🔍
Everyone please, lend me some of your strength!!!
Replies: >>105818346
Anonymous
7/6/2025, 5:47:06 PM No.105818127
>>105817525
>>105817693
Thank you, I will take your information into account. Is there a good or famous Lora creator at Civit AI that I can ask for more in depth information, such as how many images and what poses does he recommend?
>>105818074
This is very interesting how did you do it? How many examples do you need to do that?

I want a plug and play waifu lora for SDXL, Flux or Chrona
Replies: >>105818148 >>105818154
Anonymous
7/6/2025, 5:49:12 PM No.105818148
>>105818127
The 3d rotation is just a lora for wan on civitai. You just need one image and it'll make a rotation for you. You can just use images from the resulting video to train a lora. I've never done it before but should be the same as any other lora.
Replies: >>105818330
Anonymous
7/6/2025, 5:50:01 PM No.105818154
>>105818127
Every concept is unique, the simple answer is you don't know until you try and dataset curation is an art in itself and you can't be taught. You will only see when you try and see why bad images ruin your Lora. There are many variables, for example if your character typically wears certain accessories the model WILL learn it as an intrinsic feature.
Anonymous
7/6/2025, 5:52:00 PM No.105818173
Is there a radial attention workflow for wan yet?
Replies: >>105818300 >>105818346
Anonymous
7/6/2025, 5:54:59 PM No.105818300
rt2
rt2
md5: c3140d6dabc543f81f31f9f36d77af1e🔍
>>105818173
https://github.com/mit-han-lab/radial-attention
Replies: >>105818346
Anonymous
7/6/2025, 5:58:14 PM No.105818324
ComfyUI_00061_
ComfyUI_00061_
md5: fbd5cd57a5402e24ae0c6b5130e0a72e🔍
>>105817930
I know we don't talk about your other brother, but here's what they're up to these days
Anonymous
7/6/2025, 5:58:55 PM No.105818330
>>105818148
Oh im short on buzz, didn't they refill each month?
Replies: >>105818333
Anonymous
7/6/2025, 5:59:16 PM No.105818333
>>105818330
Idk about any of that I'm running this stuff locally.
Anonymous
7/6/2025, 6:00:45 PM No.105818346
>>105818120
>>105818173
>>105818300
So is this actually better than sage?
Replies: >>105818351 >>105818353
Anonymous
7/6/2025, 6:01:20 PM No.105818351
>>105818346
Supposedly you can use them both together.
Replies: >>105818371
Anonymous
7/6/2025, 6:01:28 PM No.105818353
>>105818346
It can handle extremely long videos
Replies: >>105818371
Anonymous
7/6/2025, 6:03:17 PM No.105818367
WVI2V_CC_INT_06-07-25-15-58_00001_thumb.jpg
WVI2V_CC_INT_06-07-25-15-58_00001_thumb.jpg
md5: a425fd41daf41443b017d7ffe2da5766🔍
The girl sits down into a chair.
Replies: >>105818379
Anonymous
7/6/2025, 6:03:41 PM No.105818371
>>105818351
Interesting. Too much on my plate right now but will check it out soon then.
>>105818353
How "extreme" are we talking?
Replies: >>105818396
Anonymous
7/6/2025, 6:03:52 PM No.105818374
Gen something you would want to show your girlfriend.

Oh. yeah. ... right.
Anonymous
7/6/2025, 6:04:01 PM No.105818376
>>105817644
Why?
Anonymous
7/6/2025, 6:04:42 PM No.105818379
>>105818367
Looks... kinda cool? As if she wills it into existence with thought.
Anonymous
7/6/2025, 6:06:52 PM No.105818396
file
file
md5: 7f3197baa935707398461c8cc9332f5f🔍
>>105818371
It drastically reduces the memory requirements per frame. How well a model works with longer context/attention who knows, it'll likely need a Lora or finetune but I believe this means you can apply this to training as well which means we can train on longer samples.
Anonymous
7/6/2025, 6:07:39 PM No.105818408
1729922882296477
1729922882296477
md5: c62f437b75637c406b20b5a80a2adbd8🔍
Anonymous
7/6/2025, 6:17:11 PM No.105818477
>>105817602
I doubt it only exports at those framerates. You probably didn't set it properly in some settings
Anonymous
7/6/2025, 6:19:37 PM No.105818498
Untitled
Untitled
md5: b2d15e5cb61e45e66b1101e73e2ea3fd🔍
Anonymous
7/6/2025, 6:25:05 PM No.105818526
I see "Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors" in the self-forcing example workflow that is for 480p i2v. So despite the "T2V" in its name is this lora one size fits all for all Wan 2.1s? (t2v 1.3B, t2v 14B, i2v 480p and i2v 720p)
The answer is likely yes but I wanted to check before I get a nightmare fuel.
Replies: >>105818630
Anonymous
7/6/2025, 6:27:17 PM No.105818550
Xena 2_thumb.jpg
Xena 2_thumb.jpg
md5: 3f23eb675d8dcd9b39b26084dbfb0bfc🔍
too bad the AI can't do Xena's face as she should be.
and i've tried things like pregnant belly (which works) and hairy armpits (doesn't work, idk why).
Replies: >>105818609
Anonymous
7/6/2025, 6:29:47 PM No.105818567
1734942167393554
1734942167393554
md5: 0d12f40c380d1d0409475cf0461b48d0🔍
Anonymous
7/6/2025, 6:32:22 PM No.105818588
00175-3894115689
00175-3894115689
md5: e1e6768e810144ee040d82613e06f60c🔍
Anonymous
7/6/2025, 6:33:03 PM No.105818594
guys i got an idea, how you help me create better video creation, instead of radial-attention, we create mumbai-attention? any thoughts?
Anonymous
7/6/2025, 6:34:30 PM No.105818609
>>105818550
You can just train your own lora. Or is this your lora?
Replies: >>105818691
Anonymous
7/6/2025, 6:35:29 PM No.105818618
So pony v7 is basically confirmed dead at this point?
Replies: >>105818657 >>105818669
Anonymous
7/6/2025, 6:36:39 PM No.105818630
>>105818526
It's for all the 14b models (haven't tested it on 720p version but it should work), 1.3b model has a separate one.
As a rule of thumb, many 14b loras are compatible between t2v and i2v.
Replies: >>105818687
Anonymous
7/6/2025, 6:39:37 PM No.105818657
>>105818618
its been dead. the v7 architecture is terrible, 30+ seconds on a 4090 for 1024x1024 for a model with a 4ch vae. it's basically a worse SDXL that somehow takes even longer to generate. why he's still working on this shit is beyond me because he should've realized 6 months ago that it wouldn't ever be usable.
Replies: >>105818669 >>105818874
Anonymous
7/6/2025, 6:41:15 PM No.105818669
>>105818618
>>105818657
If ponyfag was so scared of flux dev's license why didn't he think of training on schnell like chroma?
Replies: >>105818694 >>105818710 >>105818737
Anonymous
7/6/2025, 6:43:59 PM No.105818687
>>105818630
>As a rule of thumb, many 14b loras are compatible between t2v and i2v.
Thanks for the tip
Anonymous
7/6/2025, 6:44:18 PM No.105818691
>>105818609
i'm just using the basic setting of Vace Fusionix 14B with lightx2v.
i have an 8GB 3070 so it might be why it's not as expected.
each attempts are around 6 minutes too.
i've tried generating nude but i think i'll need a model that allows NSFW because it doesn't work.
Anonymous
7/6/2025, 6:44:32 PM No.105818694
>>105818669
he doesn't know how. chroma isn't just schnell, it's modified architecture
Anonymous
7/6/2025, 6:46:37 PM No.105818710
>>105818669
Didn't he start training v7 before flux came out?
Replies: >>105818749
Anonymous
7/6/2025, 6:46:54 PM No.105818715
I wrote a still grabbing tool for creating wan t2v training datasets:
https://huggingface.co/quarterturn/facesaver

It uses GPU-accelerated ultralytics library with yolov11 face detection to detect scene changes, and save a still image with a certain-size face in it from each scene.

After you run that, you can use my captioning tool to caption your images. I change the prompt like so:
Provide an image caption which uses the following hierarchy: the kind of image, the kind or name of the subject, the subjects state of dress, their body type, their pose, what it is they are doing, their facial expression, the space they are within, and the style or atmosphere of the image. All of the images you see feature [character] from the anime [anime] as the main character. Limit your response to 100 words.
Replies: >>105818726
Anonymous
7/6/2025, 6:48:09 PM No.105818726
>>105818715
forgot the link to the captioner:
https://huggingface.co/quarterturn/molmo-flux-captioner
Anonymous
7/6/2025, 6:49:28 PM No.105818737
>>105818669
Because he's a dumbass that got lucky and doesn't even know how to leverage his position to get a proper new model. All people really wanted was a 16 channel VAE slightly smarter SDXL model. There's a big gap from SDXL's shitty unet to 12B Flux.
Replies: >>105818783
Anonymous
7/6/2025, 6:50:31 PM No.105818749
>>105818710
He's doing what the Summertime Saga dev does, completely lie about what's being worked on to grift for years without ever being accountable.
Replies: >>105818761 >>105818882
Anonymous
7/6/2025, 6:51:37 PM No.105818761
>>105818749
hes probably waiting for chroma to finish so he can use it as a baseline since he doesnt want to suicide his rep by releasing a shit model now
Replies: >>105818794
Anonymous
7/6/2025, 6:52:12 PM No.105818768
So how can I disable NAG in the self-forcing example workflow?
I want to see how much NAG is affecting speed but I can't just disable it. If I disable both it and the Negative Text Prompt, red circle appears on the WanImageToVideo box. If I just disable NAG, it gives "OOM" which I believe is erroneous generic message for a different error.
Replies: >>105818800 >>105818917
Anonymous
7/6/2025, 6:53:53 PM No.105818783
>>105818737
crazy how we don't have a simple 4-6B 16ch "SDXL 2" with a slightly improved dataset. all these new models are so shit. they're slow and half of them don't even support negative prompts. that krea model isn't particularly good but it's fun to prompt on because it has a lot of styles and the outputs actually look varied. just release something like that locally and i'd be fine, i don't care for GPT text shit
Anonymous
7/6/2025, 6:54:54 PM No.105818793
WVI2V_CC_INT_06-07-25-12-41_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-12-41_00002_thumb.jpg
md5: 6ed4d8c91acf9c02102cafea930e1ea7🔍
Replies: >>105818806
Anonymous
7/6/2025, 6:54:58 PM No.105818794
>>105818761
He's already suicided his rep by not making a new model. Chroma isn't even a good model it's never going to finish because the captions are a pile of shit. My point is he should have gotten some people together to help design a 2B-4B transformer model from scratch. He would've already been done now if all he did was take Pixart's base architecture and switched the VAE to a 16-channel one from Flux or Ostris and added some layers and hidden dims and just trained on the same dataset he used for SDXL Pony.
Replies: >>105818813 >>105818857 >>105818877 >>105818910
Anonymous
7/6/2025, 6:55:30 PM No.105818800
>>105818768
My bad it seems to be a different bug actually locking VRAM in idle. I needed to reset. You can actually just disable the NAG node.
I will see if it actually takes affect in a few minutes.
Replies: >>105818917
Anonymous
7/6/2025, 6:56:06 PM No.105818806
>>105818793
can you do a brap one
Anonymous
7/6/2025, 6:57:02 PM No.105818813
>>105818794
I look forward to your attempt.
Replies: >>105818869 >>105818894
Anonymous
7/6/2025, 7:00:23 PM No.105818853
WVI2V_CC_INT_06-07-25-12-51_00001_thumb.jpg
WVI2V_CC_INT_06-07-25-12-51_00001_thumb.jpg
md5: ea088c4bdfa974c81c661c842945cb18🔍
Replies: >>105818896
Anonymous
7/6/2025, 7:00:24 PM No.105818854
>>105817783
SOVL
Anonymous
7/6/2025, 7:00:36 PM No.105818857
>>105818794
>Chroma isn't even a good model
>he should have gotten some people together to help design a 2B-4B transformer model from scratch
lol...

Chroma got slopped a bit too much later on in training and has some problems but I'm convinced the only people hating on it are vramlets that simply can't run it fast. It's still the best model for realism and it's not even close.

If anything, Chroma now trying to cater to low step retards at the price of quality is the reason why it started to go down the hill.
We are held back by vramlet retards who piss and shit themselves at anything that needs more than 30s on their 8gb 250gb/s gpus.
Replies: >>105818909
Anonymous
7/6/2025, 7:01:24 PM No.105818869
>>105818813
Where's your acestep, kiddo?
Anonymous
7/6/2025, 7:01:48 PM No.105818874
>>105818657
Yeah, even if it is a good model eventually, it is just too slow, it's HiDream level slow, and worse quality.

Sunk cost fallacy, already spent tons of money while thinking 'surely I can get this to work', and then when it's clear it won't, it's like: 'well, I must continue and pray for a miracle since I've spent so much money'.

A shame, we need more great local models.
Anonymous
7/6/2025, 7:02:05 PM No.105818877
>>105818794
that would be fucking awful. pony's dataset is trash and he even admits he's not competent to do more than use finetuning scripts already available. there is no world in which pony creates a base model
Anonymous
7/6/2025, 7:02:32 PM No.105818882
ComfyUI_00014_
ComfyUI_00014_
md5: 130f22fc846a2f14fc8322d6939ca172🔍
>>105818749
>Summertime Saga
Replies: >>105818887
Anonymous
7/6/2025, 7:02:48 PM No.105818884
>inb4 niggu bake
Anonymous
7/6/2025, 7:03:01 PM No.105818887
>>105818882
kek
Anonymous
7/6/2025, 7:03:26 PM No.105818894
>>105818813
Making models isn't hard you retard especially when you have a personal GPU server cluster. These models aren't that complicated and the components to make a diffusion model are well documented and open source. As I already said, Ponyfag could get some data scientists to help him to put together a novel model architecture and Pixart is a proven working simple architecture to work from.
Replies: >>105818907 >>105818916 >>105818920
Anonymous
7/6/2025, 7:03:30 PM No.105818896
WVI2V_CC_INT_06-07-25-12-55_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-12-55_00002_thumb.jpg
md5: be75469140def017dc6cd09b28290de3🔍
>>105818853
NAG is a meme, or just that kijai's implementation sucks (what a surprise)

vid. related is without NAG, prompt is "wind blowing girl skirt"
Anonymous
7/6/2025, 7:04:28 PM No.105818907
>>105818894
holy dunnin kruger
Anonymous
7/6/2025, 7:04:35 PM No.105818909
>>105818857
The problem with the dataset are well evident:
- overrepresented tokens are burning out
- underrepresented tokens aren't learning
- the model isn't converging and appears to be spinning its wheels
Replies: >>105818942
Anonymous
7/6/2025, 7:04:40 PM No.105818910
>>105818794
>Chroma isn't even a good model it's never going to finish because the captions are a pile of shit.
lel at butthurt hater, you must be the retard who claimed it was distilled. seek help.
Replies: >>105818915
Anonymous
7/6/2025, 7:05:36 PM No.105818915
>>105818910
I'm sure in 50 more epochs it'll learn those artist tags
Replies: >>105818953
Anonymous
7/6/2025, 7:05:41 PM No.105818916
>>105818894
Get a life thread schizo
Anonymous
7/6/2025, 7:05:44 PM No.105818917
>>105818768
>>105818800
For posterity's sake it takes effect and makes genning I dunno 5% shorter?
Anonymous
7/6/2025, 7:05:58 PM No.105818920
>>105818894
pony doesnt have the hardware to train a base model. the few a100s he has are not enough. even SD1.4 was trained on 32x8xA100s. nobody in the local scene, outside of noob AI's limited run, ever had enough compute to train a base model.
Anonymous
7/6/2025, 7:07:56 PM No.105818936
>>105818934
>>105818934
>>105818934
>>105818934
move
Anonymous
7/6/2025, 7:08:28 PM No.105818942
>>105818909
Ok, show us the evidence, show us the overrepresented tokens that are 'burning out'

>the model isn't converging and appears to be spinning its wheels
You are clearly so stupid that you compare from one epoch to the next expecting large differences. There's no reason to expect the model converging at the 50 epochs stated goal, it could take 100 or more, you can't know.
Anonymous
7/6/2025, 7:09:08 PM No.105818948
WVI2V_CC_INT_06-07-25-13-00_00002_thumb.jpg
WVI2V_CC_INT_06-07-25-13-00_00002_thumb.jpg
md5: 61f26fbc5520368dfa04d1a38cd9f7f5🔍
Replies: >>105819314
Anonymous
7/6/2025, 7:09:50 PM No.105818953
>>105818915
What artist tags, it doesn't seem to be trained on contemporary artists with their names captioned.

Perhaps furry artists, but I don't really want to know.
Anonymous
7/6/2025, 7:50:56 PM No.105819314
>>105818948
I really like these gens. Were the original images made using a Lora, or is this artstyle built into Illustrious / etc.?
Anonymous
7/6/2025, 8:05:01 PM No.105819416
fgdgdfsgdfs
fgdgdfsgdfs
md5: a00f19e4d05517404de7202e962ef32f🔍
>>105816619
I have 128GB of RAM but only 12GB of VRAM
Can I convert my spare unused RAM into VRAM using some sort of black magic?
Anonymous
7/6/2025, 8:05:57 PM No.105819421
>>105817056
I've been separating lines with spaces since before reddit was a concept in anyone's mind

it looks neat and clean this way

packing your lines together with no spaces together is messy and hard to read

i advocate for spacing supremacy
Anonymous
7/6/2025, 9:37:25 PM No.105820175
>Ryzen 7 5700G
>RX 6650 XT
>32GB DDR4
I don't have a chance in hell to make it work, do I?
Replies: >>105820904
Anonymous
7/6/2025, 10:56:28 PM No.105820904
>>105820175
Make what work? You can run SDXL fine. Flux should work too although slower. Also post at alive thread if you want more answers.
Anonymous
7/6/2025, 11:12:42 PM No.105821064
>>105814447 (OP)
can someone make a video on how you get started doing this?