Discussion of Free and Open Source Text-to-Image/Video Models
Prev: >>105836648

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com
https://tensor.art
https://openmodeldb.info
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX (video)
Guide: https://rentry.org/wan21kjguide
https://github.com/Wan-Video/Wan2.1
>Chroma
Training: https://rentry.org/mvu52t46
>Illustrious
1girl and beyond: https://rentry.org/comfyui_guide_1girl
Tag explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage | https://rentry.org/ldgtemplate
>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/celeb+ai
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>105842620 (OP)
neighbors list seems v outdated
Ai technologies have proliferated to several other boards...
hmmmm
New SLG implementation is finally live.
https://github.com/comfyanonymous/ComfyUI/pull/8759
comfy should be dragged out on the street and shot
>radial attention waiting room
>>105842648
that's a bit harsh. I just hope someone btfo his app and it becomes irrelevant
Good Evening and Happy where the fuck is radial attention
I love ComfyUI so god damn much. Updated frequently, implements improvements from the community, it's fast, it's flexible, very modular, it's clean and easy to develop custom nodes for.
I couldn't ask for anything better. Thank you Comfy for allowing us peasants to seamlessly and effortlessly produce AI content!
>>105842651
i wouldn't expect anything until next year considering their current pace. on the bright side, that gives you plenty of time to save up for a 5090 or 6000
>>105842659
>that's a bit harsh.
nowhere near enough
>>105842667
>I love ComfyUI so god damn much. Updated frequently
this is b8
vace + miku + generic model runway video:
this is with causvid, gonna try the light2x lora as well.
>>105842659
>>105842648
rude.
i still use it for a bunch of autistic stuff
it's not my "favorite" interface by any means
but surely you can at least appreciate its use-case
>>105842698
this used the default canny processor, low 0.1, high 0.3 (otherwise it wasn't detecting the edges in the vid)
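For anyone wondering what those low/high values actually do: they feed Canny's double-threshold (hysteresis) step. This is just a minimal numpy sketch of that logic, not the ComfyUI preprocessor's actual code; lowering `low` like the anon did lets weaker edges survive as long as they touch a strong one.

```python
import numpy as np

def double_threshold(grad, low=0.1, high=0.3):
    """Classify normalized gradient magnitudes the way Canny's hysteresis
    thresholds do: strong edges (>= high) are always kept, weak edges
    (between low and high) survive only if they touch a strong edge,
    and everything below low is discarded."""
    strong = grad >= high
    weak = (grad >= low) & ~strong
    # one dilation pass of the strong mask; real Canny iterates to closure
    pad = np.pad(strong, 1)
    near_strong = np.zeros_like(strong)
    for dy in (-1, 0, 1):
        for dx in (-1, 0, 1):
            near_strong |= pad[1 + dy : pad.shape[0] - 1 + dy,
                               1 + dx : pad.shape[1] - 1 + dx]
    return strong | (weak & near_strong)
```

If a faint video frame produces gradients that all sit below `high`, nothing seeds the strong mask and you get no edges at all, which is why dropping the thresholds fixed detection here.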
>>105842689
No, it's not. I mean that from the bottom of my heart. I am sorry you are too low IQ to fully utilize the GOD like power of ComfyUI. Maybe read some books or something so that one day, you too, can become enlightened, my brainlet friend. I will be waiting for you at the Comfy Altar.
lightx2v lora instead of causvid at 1.0 strength:
works fine. didnt specify clothes so the clothes here are different. prompt is just "the girl is showing off her clothes."
Miku + Kiryu slamming a desk and walking away:
but the lora does work just fine at 1.0 str. need to test more though
>>105842727
>>105842698
try: https://tensor.art/models/872743460111704414
&
https://tensor.art/models/839853388687731926
>>105842768
When are we gonna get past the point where everything feels like it's underwater
>>105842646
city96 is a cool dude
Any word on local 3D model generation or UIs?
>>105842775
the output framerate is low. this is just testing outputs, that's like 12fps. higher fps or interpolation helps a lot.
>>105842775
adjust the negative prompt, use proper wan & those errors are (mostly) mitigated
>neg: slow movement, slow motion, freeze-frame, etc
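On the interpolation point: a naive illustration of what frame interpolation does to a low-fps gen. Real interpolators (RIFE, FILM, etc.) estimate motion between frames instead of blending pixels, so this crude averaging sketch is only the shape of the idea, not what those tools actually compute.

```python
import numpy as np

def double_fps(frames):
    """Insert the average of each consecutive pair of frames, roughly
    doubling the effective framerate (e.g. a choppy 12fps wan gen toward
    a smoother ~24fps). frames: list of same-shaped uint8 arrays."""
    out = []
    for a, b in zip(frames[:-1], frames[1:]):
        out.append(a)
        # blend in float to avoid uint8 overflow, then cast back
        out.append(((a.astype(np.float32) + b.astype(np.float32)) / 2)
                   .astype(a.dtype))
    out.append(frames[-1])
    return out
```

Pixel blending ghosts anything that moves fast, which is exactly why the flow-based interpolators exist.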
>>105842787
it exists, like hunyuan3d-2; the typical ui is comfyui, as nearly always
most people here don't do much or any actual 3d modeling at this point
how am I getting torch oom if I close comfy and reopen it, it worked fine a gen ago.
what are the settings for res_3m image 2 image? my shit looks a little cooked
>>105842812
I picked a diff clip with shorter length and now it's fine, but the frames are set to 81 so why does it matter?
oh...the canny node is trying to preview all 27 seconds, not the 81 frames (4 seconds)
explain comfy memory usage to me. I used to have 32gb and got a deal on 64gb of better latency RAM, yet sometimes 50gb is in use.
what is going on? does it try to populate as much memory as possible?
>>105842809
Tripo Studio is getting pretty good, wish I had something like that locally.
>>105842646
doesn't matter for us FusionXisters or lightx2virgins right?
>>105842792
>>105842801
>no examples
Yeah sure buddy. I have yet to see a local video where the character actually has a "pop" to their movements
>>105842677
well they've been updating it around every weekend so, I'd give it 2 months tops
T_T
>>105842839
Stop using custom nodes, update pytorch to the latest version.
>>105842894
>more underwater slop
Is this supposed to be a joke or something? Go outside, that's not how people move in real life. Even more so in animated film
>>105842646
Sell me on skip layer as a concept, I've never used it, am I being retarded?
>>105842919
It generates good hands. It generates non-blurry hands when using TeaCache with WAN.
>>105842890
annoying that "coffee" has such a strong bias towards a starbucks cup in wan t2v.
"milk" is heavily biased towards glass bottles and cartons, too.
>>105842915smoke and a pancake?
cigar and a waffle?
THEN THERE IS NO PLEASING U
kling will have "less water" but its queue system is annoying as fuck & no one should be supporting closed-source fag shit
>>105842646
>>105842930
based anon i'll look into it <3
Excuse me, with apologies to the anons I will have a meltie.
>>105842903start explaining shit instead of running away from the problem
>thinks its not obvious when he takes off his trip
>thinks bananas aren't fruits
>>105842996
that's a diff anon, I don't want to break torch so what's the ideal way to do so
or just get rid of some custom nodes?
>>105842646
could this fix sd3.5m? testing...
can anyone share a good workflow for the new chroma rl low steps?
>>105842990
1) The Core Problem Nobody's Addressing
Everyone's avoiding the elephant in the room: these models are fundamentally stupid. People keep vibe-merging, but nobody tackles the real cognitive limitations of these models. Yes, we have millions of LoRAs and checkpoints for art styles, copyrighted characters, fixing five fingers, preventing extra arms.
MILLIONS OF THEM! STOP MAKING MORE!
2) Basic Spatial Understanding is BROKEN
DON'T YOU REALIZE THAT IF I TELL SDXL TO HAVE MY CHARACTER LOOK AT HIS PALM HE DOESN'T UNDERSTAND WHERE HIS HAND IS OR HIS PALM?
WHY DO I WANT GOOD AESTHETICS OR 2025 ART STYLES? YES, IT'S OBVIOUSLY 1000 TIMES EASIER TO TAKE SCREENSHOTS AND TAG THEM IN A SLOP VLM THAN TACKLE THE REAL COGNITIVE PROBLEM!
3) The "SOVL" Problem
I can't bring my characters to life with these shitty models. Sure, I can generate images, but they lack SOVL. Having to micromanage everything manually just kills the creative process.
I had more fun generating images with NovelAI using temp emails for 30 free 1024x1024 images than with my 24GB VRAM PC. Why does NovelAI have SOVL? It's like it reads my subconscious. It's frustrating we can't get it locally or buy it like a Steam game.
4) Local Models Can't Handle Basic Scenes
Local models constantly ignore prompts:
Character staring at the sea? Nope, they'll be looking anywhere but there
Character spiking a volleyball mid-air, crashing through the net? NO WAY!
It doesn't matter which checkpoint or version - they're all the same stupid model with minor tweaks.
5) TLDR: What's Even the Point?
Local models just create 2D mannequins in random poses. No action, zero sense of motion or energy in the images.
Life's too short to break scenes into 500 tags and read hundreds of articles, only for the model to grasp maybe 15% of your vision and produce the usual slop.
>>105843115
I agree except for glazing saas. every model lacks sovl. I don't think data scientists have good taste when it comes to selecting data. it's slop all the way down
>>105843115
Yeah I've been playing the patience card for the past year since we saw such exponential growth before then but at this point it's just getting weird how bad prompt comprehension and model intelligence is for local. At least now we can have Kontext fix up mistakes for image gen, but I worry for video gen since it's facing the same issues with more difficult scaling laws
>>105843115
you are right about literally everything except novelai. novelai sucks, midjourney and dalle 3 were the only models with actual sovl
The true artist does not blame his tools.
>>105843151
a true artist has the right tools in the first place
>>105843115
If you want great control over the output, just do simple img2img generations like anyone who isn't retarded.
You can get away with VERY SIMPLE drawings / paintings as long as you add a bit of noise to the drawing before you do img2img, this is also the TRUE creativity with ai imagegen, since you are not just rolling the dice hoping for something cool, you are actively directing where things should go, what pose a person should have, the exact composition etc.
Don't blame the tool because you've never even tried to move past the most basic use.
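The add-noise trick that anon describes can be sketched like this. It operates on the raw uint8 image before it ever reaches the sampler; the `strength` default is my guess, tune it per model, and in practice you'd also rely on the denoise setting of the img2img pass itself.

```python
import numpy as np

def noisy_init(drawing, strength=0.15, seed=0):
    """Add gaussian noise to a rough drawing (uint8 HxWxC array) before
    img2img. Flat MS-Paint-style fills give the sampler nothing to latch
    onto; a bit of texture lets it hallucinate detail where you drew it.
    strength is a fraction of the 0-255 range."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, strength * 255.0, drawing.shape)
    # add in float, then clamp back into valid pixel range
    return np.clip(drawing.astype(np.float32) + noise, 0, 255).astype(np.uint8)
```

Load your sketch with whatever image library you use, run it through this, save, and feed the result to the img2img input.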
a man wearing dark black sunglasses looking up at the sky, eats a McDonalds cheeseburger.
720p q8 wan but at a smaller size is pretty fast for gens with the lora. (light2x)
>>105842903
I ran update_comfyui.bat, launches with pytorch version: 2.7.1+cu128 - is that right?
>>105843062
>>105842646
sd3.5m still has the bad hands, not sure if I see any improvement from this really.
a man wearing dark black sunglasses looking up at the sky, opens a pizza box and eats a pizza slice. (interpolated output)
>why food?
to test.
>>105843161
I will keep genning locally, seethe
a man wearing dark black sunglasses fires a rocket launcher at the black helicopter in the sky behind him, causing it to explode in fire and smoke.
he didnt do it, but you get some neat special fx anyway!
>>105843238
Muffin to see here.
>>105843238
It was cute until she defiled the blueberry muffin
>>105843238
based. can wan render a guy drinking coffee out of her head?
>>105843216
okay, ALMOST the desired result.
just deleted all of my models and 99.9% of gens
praying I escape for good this time
>>105843298
but why, AI is fun, my GPU isn't just for games
>>105843298
You are just going through a bit of summer depression, you'll be so pissed at what you did when it subsides.
>>105843298
nice ... freckles
>>105843298
you can't escape ai, goyim
>>105843328
thanks, it's all from a lora and 0 skill
>>105843308
I spend enough time behind the computer as it is
>>105843326
nah, i've plateaued in skill and don't feel like learning anymore
>>105843330
true, a lot of businesses use boomer slop AI images in their marketing nowadays
here's the catbox in case somebody cares, maybe sth interesting for 1girl aficionados:
https://files.catbox.moe/ktvraz.jpg
a man on a bicycle rides it off a ramp and flies high into the sky. he pumps his fist in the air.
Todd has Skyrim magic.
>>105843270
Closest I got that wasn't a dude popping a coffee cup into existence.
>>105843447
bruh imagine touching cappuccina ballerina's ass and kissing the rim of her head like that
what's best? hunyuan or wan? working with a 12GB 4070 and only really do T2V
>>105843493
wan is best, use the rentry workflow + the lora for way faster gens
multigpu node lets you use virtual vram so you can use larger models too.
"VHS-style" gens anon, can you kindly share your prompts? I've been replying to your posts in a couple of threads
Older versions of Chroma used to nail the aesthetic with ease, now it only produces cinematic slop
>>105843298
fuck you and see you tomorrow
>>105843598
don't use detailed
A man with a beard holds up a large bag of money with a dollar sign symbol on the bag. He smiles.
>>105843662
now make one with the jobst retard lmao
>>105843660
Are you that anon? Gib prompt pls
>>105843662
changed size, still got same type of result, gen time much faster (messing with 720p Q8 wan, and comparing to 480p)
>>105842839
>>105841774
>Is it just me or does ComfyUI freeze the pc every few WAN gens?
Try disabling "smart" memory.
>>105843662
Kinda needs Jobst crying, but not bad
>>105843669
success, picked a random google image result
"a blonde man sits at a desk and starts crying."
>>105843115
Enjoy your 200b models with 8xH100 requirements (and 8000xH100 for training).
>>105843738
there
poor guy can't even use a gun right...
>>105843688
box or style info por favor?
>>105834947
https://files.catbox.moe/u1aj0s.png
What if... llm agent, but for images? It will analyze a picture by itself and send to img2img models fixing hands and other artifacts iteratively? Or adding something new/changing colors/effects/etc.
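The loop that anon is imagining would look something like this. Everything here is hypothetical: `vlm_critique` and `img2img_fix` are stand-ins for whatever VLM/inpainting backend you'd actually wire up (none of these are real APIs), and the toy image is just a dict so the control flow can be shown end to end.

```python
def vlm_critique(image):
    """Stand-in for a VLM call that returns a list of defects it sees
    ("left hand has six fingers", etc). Here the toy image carries its
    own defect list so the loop is demonstrable."""
    return image.get("defects", [])

def img2img_fix(image, defect):
    """Stand-in for an inpaint/img2img call that repairs one defect;
    a real version would mask the region the VLM pointed at."""
    fixed = dict(image)
    fixed["defects"] = [d for d in image["defects"] if d != defect]
    return fixed

def refine(image, max_rounds=3):
    """Critique-and-fix agent loop: describe defects, repair the first
    one, repeat until the critic is satisfied or rounds run out."""
    for _ in range(max_rounds):
        defects = vlm_critique(image)
        if not defects:
            break
        image = img2img_fix(image, defects[0])
    return image
```

The hard part in practice is the middle step, turning a text critique into a reliable mask, which is why nobody has really shipped this yet.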
>>105842646
am I doing it right?
>>105843752
Have a female asian hand hold the gun.
'Goodbye husbando!'
>>105843764
https://files.catbox.moe/mexmlo.png
>>105843812
>antialiased latent upscale
how do you get this in comfy? this seems like it could solve the jaggies problem and make latent upscales viable
a man wearing black sunglasses picks up a large black bomb and throws it, causing a huge explosion of fire and smoke.
>>105843863
>no results found
Forgechads I kneel
a man jumps off a building into a swimming pool.
kek
been gone for a while. did chroma get official nunchaku support yet
a man drinks a bottle of beer in a dark room at night.
there we go, including a reference to the light levels made the sudden brightness go away.
>spend an eternity looking for wan extension workflows that don't burn or use "last frame"
>think of the loop nodes in comfy but too stupid to figure it out
>find workflow on youtube that does all of that with i2v including vace
>behind a patreon paywall/sign up
I hate youtubers
>>105844049
ai youtubers are the worst
>>105843115
>No action, zero sense of motion or energy in the images
At this point I can only hope video saves image gen somehow. Maybe if AI can do passable "two cars crashing" in video then we'll get models able to gen good static images of it.
>>105844049
anon just use logic, you gotta mask the frames you wanna extend with VACE and that's it
>>105844049farukan gogizur
a man opens a brown bag of McDonalds and grabs a McDonalds cheeseburger, and eats it.
JC must consume
>>105844049
have you tried this one?
https://www.reddit.com/r/StableDiffusion/comments/1llx9uq/
>>105844108
Seconding this one, it's the one I used to make this video.
3090gods... we won. Can anyone now send the official /ldg/ memo to lodestonesnigger to stop catering to low step vramlet shitters and stop cucking chroma before he ruins it permanently? Thanks.
https://strawpoll.com/XOgOVDj1Gn3/
>>105844155
>3060
you know not all of us have 12 gb cards, some of us gen on a laptop
>>105844155
>106 votes
LMAO yeah sure
is there a node that can extract the last frame of a video input? so I can stitch generated clips together, for example. Ideally I wouldn't need to use a web app to extract it every time.
>>105844195
nm, load video (vhsloader) does this
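If you'd rather do it outside Comfy, ffmpeg can grab the final frame directly. A small helper that only builds the command line (it assumes ffmpeg is on PATH when you actually run it via `subprocess.run(cmd, check=True)`):

```python
def last_frame_cmd(video_path, out_path="last.png"):
    """ffmpeg invocation that extracts just the final frame of a clip,
    for seeding the next i2v gen. -sseof -0.1 seeks to ~0.1s before the
    end so only the tail gets decoded; -update 1 keeps overwriting the
    single output image, so the last frame written wins."""
    return ["ffmpeg", "-y", "-sseof", "-0.1", "-i", str(video_path),
            "-update", "1", str(out_path)]
```

Feed the resulting image back in as the i2v start frame to chain clips.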
>>105844097
kek
>>105844108
>>105844129
saw this before, was put off by dicking around with picrel. however he did comment with an automated version, so this should do the trick: https://pastebin.com/TCs9J88i
I think it worked? two clips:
>chroma
>all that burnt training on distilled flux instead of training wan 1.3b/14b
OH NO NO NO
>choma
>all that burnt training on distilled flux instead of training sana
OH NO NO NO
All that burnt training when they could've done a custom model with a 16 channel VAE.
>>105844340
Why not Lumina?
>>105844351
Why not use someone else's already spent massive compute as a foundation?
>>105844366
I think you grossly overestimate the compute required to train a model. I also think you grossly underestimate how much compute is wasted undoing the lobotomy / redoing a model's understanding of anatomy.
a house at the top of a hill explodes with smoke and fire everywhere.
pretty cool
>>105844477alternatively,
a white house at the top of a hill launches into the air like a rocket, leaving a rocket trail and flames. The camera pans up to show the house in the sky.
not much elevation, but still pretty good!
Wan is seriously better than Flux/Chroma at generating still images. The video training translates into better still frames.
ComfyUI made me realize this hobby requires at least 120 IQ points.
>>105844533
okay, giving a distance made it move more.
a white house at the top of a hill launches into the air like a rocket, leaving a rocket trail and flames. The camera pans up to show the house in the sky.
>>105844626
er,
a white house at the top of a hill launches miles into the sky like a rocket, leaving a rocket trail and flames. The camera pans up to show the house in the sky.
miles is what did the trick.
>>105844583
but can it do bobs and vagene
>>105844652
Out of the box it's not great but it takes a bare minimum Lora to get it to A-tier.
>>105844381
NTA, but my opinion is that the quality of the base model very strongly influences the results of a finetune the size of Chroma. Models like Flux and Wan are trained on literally billions of images. LAION alone is like 5b and that's an older dataset. Chroma's 5 million training dataset is nothing in comparison. You absolutely cannot train a model from scratch on 5m images and have it be any decent.
I said to space, seems that is not possible quite yet
>>105844833
You really think there are billions of unique images? I think you grossly underestimate how much variety is in 5 million images. You do realize they pad "billions" because most of them are duplicates, resizes and crops right? How many thousands of variations of Harold exists do you think?
>>105844833
no way in hell flux trained on that many
novelai claims to have trained from scratch and their dataset is at most like 20m
>>105844872
I think they do this to intimidate people from trying to train models. It's important the plebs don't realize they can make their own printing press.
>>105844847
groq is this real
>>105844471
have her hold her sword in front of her hips and twerk
>>105843298
i'll miss you anon
Give the man black sunglasses, he is holding a large bag of money, overflowing with dollar bills. On the bag is the text "KARL" in scribbled font. He is wearing a black baseball cap that says "KING OF KONG" in white text.
kontext is so fun. it's like inpainting evolved, but does stuff inpainting can't.
>>105845130
give him an anime gf
>>105845158
anime girl Miku Hatsune is standing beside the man, wearing a black baseball cap saying "karl LOST" in white text.
and this is one image, if I want a better miku I just put a good miku picture in the second image input
workflow: https://openart.ai/workflows/amadeusxr/change-any-image-to-anything/5tUBzmIH69TT0oqzY751
>>105844872
Even pixart alpha, which was woefully undertrained, claimed to use at least 25M images. Obviously pixart alpha is too small to be a viable modern base. But if the number of parameters is passable and the vae and text encoder are modern, why throw away those 25M images already trained in, unless there's an architectural breakthrough? Lodestone's dataset is around 5 times smaller, if I remember correctly.
I'm sick and tired of 1girl effortless AI slop in this thread.
>>105845188make the anime girl smoke a pipe lol
>>105845197
>I'm sick
You can always commit suicide, that way all your problems go away (you are the problem)
>>105845188
and this is with 2 images (bypass the 2nd input if you just want a solo image for input)
anime girl with teal hair Miku Hatsune is standing beside the man, wearing a black baseball cap saying "karl LOST" in white text.
it just works.jpg
>>105845206and fixed the hat with a simple hat text prompt:
>>105845197
for every one kinosoul 1girl there are 5b effortless 1girls
>>105845192
Boy you quickly gave up billions of images huh? Maybe it's not much use to talk to someone who is ignorant.
>>105845214
>>105845200
one more! revised:
pink hair anime girl is standing beside the man in a black baseball cap. she is smoking a pipe. change the location to a bank. keep her blue and yellow hairclip the same. keep the man's pose the same.
>>105845197
>complainer
>nogen
quite literally, everystein.singleberg.timeowitz.
>>105845244
kek, double pipe this gen
>>105845252
bonus: anime billy
>>105842620 (OP)
Where did that dual clip thing come from?
>3090
>no fp8
>sage attention doesn't work
>torch compile does nothing
lol, lmao even
>>105845307
Also almost 5 years old.
The man is pointing and laughing at a blonde swedish man wearing a t-shirt that says "KARL JACOBS", who looks very upset.
kek, I don't know if he is swedish so I used that as a generic npc.
>>105845324
oops, can't forget to make sure he is still holding money.
>>105845345
interesting leg
change the location to a mcdonalds restaurant. the man is sitting at a table eating a McDonalds Big Mac. His table is surrounded with hundreds of cheeseburgers.
JC needs to eat so he can stop the illuminati
>>105845307
3090 shills deserve death
is ai art getting shittier by the day?
>>105843384
>nah, i've plateaued in skill
pic unrelated? your gens look like shit dude.
>>105845539
I like them, personally.
The man is wearing a hat saying "#1 illuminati fan". keep his pose and expression the same. the image is in a pixel art style.
neat
What would you prompt to get weird / unorthodox / asymmetrical lewd swimsuits?
>>105845643
and without pixel art
two image inputs:
The man is shaking hands with the pink hair anime girl. the background is black.
>>105845742
please share the catbox
thanks
this one turned out better:
>>105845762
same prompt I used in the post.
>>105845771
I know, I need the workflow
my two image workflow is broken
please fren
>>105845778
https://files.catbox.moe/tfnkvg.png
got it from here: https://openart.ai/workflows/amadeusxr/change-any-image-to-anything/5tUBzmIH69TT0oqzY751
there, bit better proportions:
>>105845645
Turn cfg low & let the prompt run for 20 iterations w/ "loose settings"
U sometimes get neat outfits this way
>>105845416
all the top posters are rangebanned by the baker again
grim
diff image
The man is sitting at a computer and is typing. the pink hair anime girl is waving hello. the background is black. keep the man's expression the same.
The man is sitting at a computer and is typing in a dimly lit office. A rectangular sign above says "glowie HQ" in yellow text.
comfy
md5: a20835885478ee90f245b386cf722004
๐
>show OCD friend my Comfy workflow
>he loses his mind
It's not that bad, right?
>>105844597
You just need the right motivation (genning the waifu, gooning, etc) to get your footing. It's not TOO horrible... except when everything breaks.
>>105845197
Be the change you want to see!
Survey
https://strawpoll.com/XOgOVDj1Gn3/results
>>105846194
would have been 17 for 3060 had I voted
>>105846214
Survey
https://strawpoll.com/XOgOVDj1Gn3
>>105846345
Correct small details and do memes, I guess
If it was uncensored AND could keep artstyle, it would have killed loras and this would have been big
Unfortunately it didn't
>>105846386
>If it was uncensored AND could keep artstyle, it would have killed loras and this would have been big
>Unfortunately it didn't
When will we get this?
2 more weeks or anything that's actually on the horizon?
>>105846345
fry your image in just 5 revisions!
>>105843115
>SDXL
Dude. You're using a 2-year-old, obsolete clip_l based model. It can at most understand one (1) character doing one (1) simple thing if you prompt it right. With NoobAI/Illustrious we maximized the fuck out of that architecture and did things that shouldn't be possible, but it's like giving a new coat of paint to a 1960s Ford. You won't win any racing competition with it.
Flux-dev 1.0 dual clip/t5 has a more recent architecture and prompt comprehension, which means it is only one year obsolete now. Still pretty bad, and Flux-dev has huge issues with its guidance distillation which makes it kinda retarded a lot of the time. But it's technically better than SDXL in prompt following, from 1/10 to 3/10.
If you want decent prompt following check HiDream (a solid 5/10 on prompt following), but for some reason people decided two months ago they didn't like HiDream.
>>105846345
I think it's useful if you need specifically the thing it does. For everything else it's just a deepfryer and shitty mememaker
>finally set up everything
>can now generate as many fatties as i wish
I will dehydrate from all the gooning holy shit
>>105846345
For me? Remove clothes, and change anime girls into realistic. Both are so-so and for now worse than doing manual inpaint and stuff, but you don't need to do manual inpaint.
Also for some reason my outpainting of characters is better with Kontext than with Fill.
>>105846641
>anon can generate anything
>wastes it on fatasses
>>105846735
I'm considering taking vacation desu
FUCK YES
wan is still the lightest i2v, right?
>>105846780
Isn't it the only good i2v? I mean there are lighter ones but they're more proof of concept, and Hunyuan Video is shit at i2v.
>>105846792
I don't know, I don't lurk here much. Just waiting for some simple model that can animate images, maybe even without conditioning. Wan is way too slow.
>>105846807
Slow generation times or slow to input what you want?
>>105846822
I mean waiting for 20+ minutes for a video that doesn't even follow a simple prompt most of the time
>>105846876
sage attention + teacache can help you halve the render time, but yeah it's slow. But it's the only one that really works beyond trivial stuff.
But if you want trivial stuff like people breathing or something, LTX video and CogVideoX are faster, and far more limited.
>>105846939
LTX Video claims to
>produce 30 FPS videos at a 1216x704 resolution faster than they can be watched
Sounds interesting if it's even remotely true
>>105846974
Well, LTX is blazing fast.
It's also shit at anything that isn't "camera immobile/slightly panning in/slightly panning out" and "character standing, breathing" or "character sitting, breathing", occasionally "character walking".
>another cool little wan speed boost we'll probably never see for comfy
https://github.com/madebyollin/taehv
Can you control an AI by inputting an image that is a screenshot of code, and have the AI execute it?
>>105847014
Isn't that already implemented in WanVideoWrapper?
>>105843685
Do you mind catboxing one of your I2V gens? Thanks in advance!
>>105847100
you're right, I was googling the wrong thing, disregard my tard comment
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/README.md
>>105845307>>105845389anything better for under $1000?
>>105847002
>It's also shit at anything that isn't "camera immobile/slightly panning in/slightly panning out" and "character standing, breathing" or "character sitting, breathing", occasionally "character walking".
Though this was their 2b model. I see they've published a 13b model. I know what I'll be testing tonight.
>>105844626
KCD IV looks wild.
>>105847145
No worries. It might be compatible with vanilla comfyui as well. You can just swap them in for the vae.
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/taew2_1.safetensors
I use the tiny encoder for sdxl previews, it's really good.
>>105847186
Thanks, I'll give it a try later after work
>>105847299
I don't get it, how are you supposed to inspect it with your shirt over your head?
>>105847347
more of a canny channel really
>>105843803
https://files.catbox.moe/icjkb3.png
There ya go
>>105845645
You could also erase by hand some part of a normal one, add extra lines and feed it through again
>>105845936
This is nothing yet, it will grow larger in time
>>105847463
anon mine hasn't grown larger in 15 years
Am i retarded or is there no way of moving a model from one device to another once it's loaded? I have this problem where I run 2 large models one after another and the second one is really slow since my vram can't fit both of them, but swapping them between devices would speed things up
>Am i retarded
ngl I stopped reading after that
>>105847550
Tug on it more, maybe something will happen
>>105847427
NTA but thank you, this is great.
>>105847577
Maximum neuron activation.
...catbox?
SDXL:
>2023 release
>1024x1024
>3.5b parameters
>learns styles in under 10 epochs
>understands complex sex positions
>outputs in 3 seconds without any copechaku quanting needed
Chroma:
>2025 release
>512x512
>8b parameters
>hasn't learned a single style in over 40 epochs
>no characters
>melted anatomy and duplicate limbs
>barely understands POV missionary
>takes 20 seconds per image on a 4090
i'm thinking 3 more years of SDXL
you're supposed to prompt the style in, retard
>>105847347
>>105847389
>canny channel
Are you kidding me? It's right there
A canny canyon
Fuck you ESLs
If you weren't a poorfag you'd use a video model.
>>105847657
>A canny canyon
so a cannyon?
>>105844155
>no M3/M4
nvidia nerds will never learn what 120GB VRAM feels like
>>105847690
>540GB/s
>$19,999.99
lol
so what upscaler or settings work best for anime/drawn hires fix?
can't figure or search out why it turns into smudge, compared to realistic checkpoints where it nicely enhanced details and fixed mistakes
which one should I buy for genning?
https://mdcomputers.in/catalog/graphics-card/nvidia/rtx-50-graphics-card/rtx-5090-graphics-card
anyone have experience with character consistency? i was thinking of genning a face then face swapping it onto the image. and using wan to gen a video, then use frames of that video to have the character standing vs sitting
>>105847913
Nta, but I couldn't make kontext *swap* faces in particular. It seems to treat this request as a deepfake threat.
>>105847427
Thanks based anon, also this image is good as well.
>>105847657
more of a canny chasm really
>>105843298
>in a time when models are being reported and deleted for no reason through false reports
what a stupid thing to do.
>>105847884
Astral is the only real choice because you can check the per-pin power/amps to make sure your connector isn't going to fucking melt. That said, I thought it was too loud (even in quiet mode) when genning so I put it in a custom loop.
>>105848136
Nice, Chroma looks promising for sure
>>105847090
4o can read text in images and can read and interpret code so I guess it can do that.
>>105847090
you typically control models through strings (which are converted to tokens blah blah), much more convenient
>>105848136
How long did the training take?
causvid and other speedup loras produce almost no motion. the "solution": create a dual sampler workflow that looks like cancer and shill it. both samplers use 8 steps, so that's 16 steps total; what's the fucking point then?
vramlet here
Can I use this to make little animations?
https://huggingface.co/CiaraRowles/TemporalDiff/blob/main/temporaldiff-v1-animatediff.safetensors
>>105847299
I made those original target pics around the end of May. can I ask how you came across them?
umm?
Why did the thread die all of a sudden?
>>105848653
>I made those original target pics around the end of May. can I ask how you came across them?
Can't remember, just another prompt in my wildcards. Perhaps you posted your prompt around then and I saved it
>>105848681
I stopped genning
>>105848749
women as sex robots is the lowest form of sci fi
>>105848768
cry more feminist
no matter how many times I download these and restart, it keeps showing missing again and again
Updated cumfy too
>>105848681
China went to bed.
>>105848681
because you showed up
>>105848790
Install them from their git pages
I noticed this yesterday with some nodes, installing from git fixed it.
it's a little ropey but that's the effect I wanted
>>105848853
That's really great. Wasn't there some 90's tv show with stuff like this?
>>105848872
funny you should say that, that's what I trained it on
>>105848880
Was it the Sabrina witch? Ancestral penis memory
>>105848901
yes I sat through 7 seasons of the shit. They really scaled back the effects after the first 2, that's where most of them are
and yeah I did get a chub at parts, but mostly it's horrible cringe and I'm embarrassed for everyone involved
>>105848749
awesome, I also posted a collection to civitai with full metadata. prompt sharing is fun
>>105848768
yeah, it's a classic
>>105848910
The talking cat looked like shit, that I remember
>>105848853
please tell me you did i2v too
>>105842620 (OP)
Is there a desktop application that I can use like notebooklm or something, but that also allows me to use API keys if necessary?
I usually jailbreak deepseek remotely via an "untrammeled" prompt and it has been great so far just as an erp bot, but I want something that helps me use it to learn and improve my notes.
I also want help getting better at prompting, or to understand it better, since deepseek either has a very disgusting and frustrating aneurysm or gets its ethical guard up as it tries to lecture me on topics I couldn't give a rat's ass about as I press the stop button.
>>105842651
Install linux while you wait; a dependency for it is flashinfer and that's linux only.