Thread 106248560

315 posts 204 images /g/
Anonymous No.106248560 >>106249600
/ldg/ - Local Diffusion General
"Fictional" Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106244742

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://github.com/Wan-Video
2.1: https://rentry.org/wan21kjguide
2.2: https://rentry.org/wan22ldgguide
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-HD/tree/main
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Anonymous No.106248569
Qwen image edit soon, right xibros...?
Anonymous No.106248590
>>106248439
>She waxes, nice
the meta right now is to just do permanent hair removal, especially if you have darker hair
which is nice because the hairiness is one of the worst parts of brown girls irl
Anonymous No.106248594 >>106248653
cumfartui v0.3.50 Latest 13 minutes ago
https://github.com/comfyanonymous/ComfyUI/releases/tag/v0.3.50

What's Changed

Add Qwen Image model to readme. by @comfyanonymous in #9191
Qwen Image model merging node. by @comfyanonymous in #9202
qwenLora cannot load properly by @flybirdxx in #9208
Update template to 0.1.52 by @comfyui-wiki in #9206
Update frontend to v1.24.4 by @christian-byrne in #9175
Fix RepeatLatentBatch not working on multi dim latents. by @comfyanonymous in #9227
_ui.py import torchaudio safety check by @Kosinkadink in #9234
async API nodes by @bigcat88 in #9129
Users report gfx1201 is buggy on flux with pytorch attention. by @comfyanonymous in #9244
Not sure if AMD actually support fp16 acc but it doesn't crash. by @comfyanonymous in #9258
Bump pytorch cuda and rocm versions in readme instructions. by @comfyanonymous in #9273
Only show feature flags log when verbose. by @comfyanonymous in #9281
remove creation of non-used asyncio_loop by @bigcat88 in #9284
Update template & embedded docs by @comfyui-wiki in #9283
Support SimpleTuner lycoris lora for Qwen-Image by @PsychoLogicAu in #9280
fix(Kling Image API Node): do not pass "image_type" when no image by @bigcat88 in #9271
Update template to 0.1.58 by @comfyui-wiki in #9302
Update test release package workflow with python 3.13 cu129. by @comfyanonymous in #9306
Wan2.2 fun control support. by @comfyanonymous in #9292
Make torchaudio exception catching less specific by @Kosinkadink in #9309
Update template to 0.1.59 by @comfyui-wiki in #9313
Update release workflow to python3.13 pytorch cu129 by @comfyanonymous in #9315
Downgrade frontend for release. by @comfyanonymous in #9316
Anonymous No.106248602
Blessed thread of frenship
Anonymous No.106248617 >>106248638 >>106248711 >>106249962
retro itasha for a vintage endurance simracing season /o/ is running
Anonymous No.106248638
>>106248617
Anonymous No.106248648 >>106248690 >>106248810 >>106248815
Is anon here right?

>>>/v/717999572

Is there some art that AI just cannot replicate no matter what? Will AI always be behind for stuff like this?
Anonymous No.106248653
>>106248594
>qwen shit
nothing of interest
Anonymous No.106248658 >>106248908 >>106252220
>>106248430
>Retard here, can you train loras on lightning/dmd models? There's the model ponyhofdmd4_v10 which gives me extremely good outputs especially regarding anatomy for an XL model, seems like it'd be a good base. But so far my attempts at training a lora either resulted in black pictures or blurry outputs with datasets that worked on other models. Do you need special settings, or do you need a version without the dmd lora baked in for training?
Anonymous No.106248666 >>106248679 >>106248691 >>106248934 >>106249987
Anonymous No.106248679 >>106248701
>>106248666
Can you make these kiss?
Anonymous No.106248690
>>106248648
are you retarded?
Anonymous No.106248691
>>106248666
happa sexo
Anonymous No.106248701
>>106248679
post bussy
Anonymous No.106248711 >>106248737 >>106248936
>>106248617
I don't understand what I'm supposed to be getting from these. Is a part of them generated?
Anonymous No.106248734
why not 4090 48gb?
Anonymous No.106248737 >>106248759
>>106248711
have you been sleeping under a rock the past few years?
Anonymous No.106248741
Anonymous No.106248759
>>106248737
Sounds like it. Can you clue me in? Is the 1girl generated?
Anonymous No.106248763 >>106249220
>Base Model: Pony
Anonymous No.106248810
>>106248648
post bussy
Anonymous No.106248811
the only thing lightx2v lora is good for is testing if a lora works.
Anonymous No.106248815 >>106248928 >>106249032
>>106248648
ai can't have passion
ai can't lust for misty or understand why, for example, it's hot that her shorts are just barely long enough to not be panties, ai can't tell that at that distance you should be close enough to smell her, ai doesn't understand embarrassment so it can't really make a face of mixed emotion like that where she is equal parts shocked someone is looking but also kind of aroused.

ai can try and do a fine job for a try, but it can never be perfect the same way an artist with a boner is perfect. it can get some parts of a picture right and massively fuck up elsewhere and it really doesn't know the difference between good and bad. a human works until they are happy with the product and can mentally do no more. an ai doesn't even know what happy is beyond training weights of smiles and squinted eyes. getting truly good art out of ai will require giving the machines a way to feel what happiness, sadness, love and pain are.
Anonymous No.106248857 >>106248892
>>106247846
>>106247856
>is there a way to save the samples from the 2.2 high noise model so you can gen a bunch with the high noise model, pick the best one, then feed it to the low noise model and reroll?

ok i got it. you just use these nodes. when you get a gen with satisfactory motion copy the latent to the input folder, the loadlatent node scans there. then bypass the high noise sampler and connect loadlatent to samples on the second sampler
Anonymous No.106248882
vramletbros, its about to get FUN

https://huggingface.co/QuantStack/Wan2.2-Fun-A14B-Control-GGUF/tree/main
Anonymous No.106248885 >>106248894 >>106248905 >>106248932 >>106248958
is wan 2.2 noticeably better than 2.1?
t. tourist
Anonymous No.106248892
>>106248857
now i gotta figure out how to view the latents and how to have the filenames of the latent and videos stay synchronized
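A minimal sketch of one way to keep latent and video names in sync. It assumes the latents were saved as `.latent` files and that the paths passed in (all hypothetical, adjust to your install) point at your ComfyUI output and input folders. It copies the chosen latent into the input folder under the same stem as the video it belongs to, so the load node can find it by name:

```python
import shutil
from pathlib import Path


def stage_latent(latent_path, video_path, input_dir):
    """Copy a latent into input_dir, renamed to share the video's stem.

    All three paths are supplied by the caller; nothing here is ComfyUI API.
    """
    latent = Path(latent_path)
    video = Path(video_path)
    target_dir = Path(input_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    # e.g. wan_00012.mp4 + ComfyUI_00007_.latent -> wan_00012.latent
    target = target_dir / (video.stem + latent.suffix)
    shutil.copy2(latent, target)
    return target
```

Viewing the latent itself still needs a decode step; this only handles the naming side.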
Anonymous No.106248894
>>106248885
yes, by exactly 0.1
Anonymous No.106248905
>>106248885
way better. i remember just trying to get a girl to dance was body-horror gen after gen after gen with 2.1
Anonymous No.106248908 >>106252220
>>106248658
wan refuses to let this motherfuckers hair grow without his neck expanding into some eldritch horror and the color changing. ive tried like 3 times now.
Anonymous No.106248928
>>106248815
No matter what an artist is feeling when making an artwork, it either ends up on said art or it has zero significance to said art.

The AI can mimic everything in the artwork, so there's nothing 'lost', and with several artworks from an artist, the AI will discern the inevitable patterns, which is what we call an 'artist style' and be able to convincingly mimic this pattern across endless creations when prompted.
Anonymous No.106248932
>>106248885
Short answer: Yes
Long answer: Maybe
Anonymous No.106248934
>>106248666
fitting trips, kek
Anonymous No.106248936 >>106248971 >>106249002
>>106248711
the robot girl mascot is generated from a controlnet of images from an old video game, so she is actually wearing the car's original livery and has the low-poly no-filter style of the game.
the car is intended to be a retro version of this, but in a roughly period correct 1970s bosozoku style plus modern itasha decals.
i tried putting the girl more toward the front of the car like WWII bomber nose art but it didn't look as good.
Anonymous No.106248958 >>106248994
>>106248885
Not for gooning yet, which is all I care about, but I expect great loras will eventually be released
Anonymous No.106248971 >>106249665
>>106248936
Ah yes, the Oval Astra
Anonymous No.106248994
>>106248958
loras for 2.2 are already coming out. its just lightx2v that's pooping the party by shitting the bed with its 2.2 release
Anonymous No.106249002 >>106249665
>>106248936
oh shit I remember this game but I don't remember what it's called
Anonymous No.106249016 >>106249031 >>106249212
>fp8
>q8
which one and why
Anonymous No.106249031
>>106249016
depends if you want to use multigpu nodes or kj nodes

id suggest fp8 because thats what kijais workflow is
Anonymous No.106249032
>>106248815
I've posted these kinds of copypasta coupled with mildly out-of-distribution gens on ai-hostile boards, and they always lapped it up. People wanna hear what they wanna hear.
Anonymous No.106249061 >>106249079
so for 2.2 t2v, are the 2.2 loras worth it at all? or 2.1 still

works for i2v fine, just curious.
Anonymous No.106249070 >>106249279 >>106250523
Anonymous No.106249079 >>106249106
>>106249061
huh?
Anonymous No.106249086
sorry for the hag example but I was impressed by the viewfinder being pretty damn accurate, even accounting for proc latency
Anonymous No.106249091 >>106249113 >>106249121 >>106249311
you ever look back and admire how far we've come?
Anonymous No.106249098 >>106249115 >>106249143 >>106250379
I have no imagination and spend more time tinkering with Comfy nodes and parameters than actually genning.
I'm thinking of getting some 200 (curated) random screenshots of the 90's Poirot, captioning them with Qwen 2.5 7b, and training a style lora on them. 90s TV and 30s fashion kinda deal. Workable idea?
Anonymous No.106249106
>>106249079
do the 2.2 lightning loras for t2v work fine for t2v
Anonymous No.106249113
>>106249091
Looking at your gen - not very far.
Anonymous No.106249115 >>106249180
>>106249098
just write scripts at that point. at least you'd learn how to script while doing all this while comfart has two years of being relevant at most
Anonymous No.106249121
>>106249091
I remember SD1.5 checkpoints being great and fast and why do I have to wait minutes for this shit now, but then I look at my old gens and they are absolute shit.
Anonymous No.106249126 >>106249135 >>106249157 >>106249170 >>106249200 >>106249278 >>106249677
Is there a reason anime gets so much less movement? If I use a real person for i2v it feels dynamic while anime just feels pretty static besides a couple parts of the video.
Anonymous No.106249135
>>106249126
censorship woes. investors won't give shit to the website because they know it's depraved
Anonymous No.106249143
>>106249098
>Workable idea?
Better one than at least 98% of civitai's pajeets. Good luck.
Anonymous No.106249149 >>106249714
Anonymous No.106249157 >>106249541
>>106249126
their mouths sure don't stop moving
Anonymous No.106249170 >>106249186 >>106249200 >>106249209 >>106249541
>>106249126
oh wait, you mean video gens. anime is animated at different frame rates constantly and sometimes different frame rates in the same scene. there is a lot of intent and exaggeration too which gets lost in the muck of 3dpd. the easiest stuff it can fathom is low frame rate scenes which are stiff to begin with
Anonymous No.106249180 >>106249205
>>106249115
I'm fully intending to do that. Just have to find out how to get ffmpeg to do that and I guess learn transformers to do the captioning in bulk.
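For the ffmpeg part, a rough sketch assuming you want one screenshot every N seconds and will curate by hand afterwards. The function name, output pattern and default interval are made up for illustration; `-vf fps=1/N` is the standard ffmpeg filter for sampling one frame every N seconds:

```python
from pathlib import Path


def frame_grab_cmd(video, out_dir, every_seconds=30):
    """Build an ffmpeg command that saves one PNG every `every_seconds` seconds."""
    if every_seconds <= 0:
        raise ValueError("every_seconds must be positive")
    return [
        "ffmpeg",
        "-i", str(video),
        # fps=1/N keeps one frame per N seconds of input
        "-vf", f"fps=1/{every_seconds}",
        # ffmpeg fills in %05d with a running frame counter
        str(Path(out_dir) / "frame_%05d.png"),
    ]
```

Run it with `subprocess.run(frame_grab_cmd("episode.mkv", "shots"), check=True)` after creating the output folder. The captioning step is separate and not covered here.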
Anonymous No.106249186 >>106249200 >>106249209 >>106249541
>>106249170
>low frame rate scenes
yeah it seems there is a lot of dialogue scenes trained considering how much these bitches yap
Anonymous No.106249200 >>106249218 >>106249541
>>106249170
>>106249186
>>106249126
What about trying to use aniwan as the low noise model?
Anonymous No.106249205
>>106249180
you can just look at the VHS nodes as an example and check out other captioning tools to just grab what works best. there isn't much point reinventing the wheel from scratch but it's worth making a better one with what worked before
Anonymous No.106249209 >>106249244
>>106249170
>>106249186
Isn't it quite normal for anime (and i guess animation in general) to have a mostly static image and just animate the mouth movement, etc.
Anonymous No.106249212
>>106249016
Anonymous No.106249218
>>106249200
aniwan is compromised. it still yaps and does low frame rate. it's impossible to point at the problems with the dataset when these niggers never post them
Anonymous No.106249220
>>106248763
kek
Anonymous No.106249223
>have animated previews on for wan
>decided to gen some sdxl slop
>use batch of 4 images at once like I usually do with smaller models
>preview keeps rapidly switching between the 4 latents it generates at the speed of light
Great stuff, working as intended
Anonymous No.106249235 >>106249269 >>106250069
QwenGODS... exactly as predicted... soon.
Anonymous No.106249244 >>106249258
>>106249209
yes. 60% of an episode will be the most basic shit since 40% is required to keep artists attention. this is moot for diffusion training since you can choose the datasets and these dumb Asians just threw it all into a pile. results would be completely different if the only thing that was trained was sakuga and dynamic scenes
Anonymous No.106249258
>>106249244
>artists
*autists*
Anonymous No.106249269
>>106249235
cool beans
Anonymous No.106249275
I repeat, is it possible to train a lora on an LCM model or not?
Anonymous No.106249278
>>106249126

Reduce the Lightx2v LoRA strength on wan2.2 for more movement. I settled on a strength ratio of 1 on high to 0.25 on low. Keep in mind, more movement will introduce more smearing and you need to increase steps to compensate.
Anonymous No.106249279
>>106249070
WTF

I can't fap to this, stop
Anonymous No.106249282 >>106249291
Qwen is just too big for mass appeal. DOA.
Anonymous No.106249291 >>106249308
>>106249282
I honestly don't give a shit about t2v but I might use the edit model from time to time
Anonymous No.106249308
>>106249291
>t2v
*t2i*
Anonymous No.106249311
>>106249091
Taken just before he went out to fight Corn Pop
Anonymous No.106249330 >>106249393
interesting results with t2v miku prompt + ghibli lora:
Anonymous No.106249393 >>106249421
>>106249330
what is interesting about it?
Anonymous No.106249421 >>106249447
>>106249393
im used to i2v, for whatever reason 2.2 t2v workflow takes ages so this is 2.1.
Anonymous No.106249447 >>106249467
>>106249421
>im used to i2v
because that is more interesting than a big model with no finetunes. you will probably have an easier time with noob to generate the start frame so you have complete control over the established scene
Anonymous No.106249467 >>106249474
>>106249447
yeah, i've had good success with using reforge (noobai/wai v14) for start frames if I wanna do anime gens.
Anonymous No.106249469 >>106249479
Anonymous No.106249474 >>106249485
>>106249467
just get a Ghibli Lora for noob then try the same scene again but i2v instead
Anonymous No.106249479 >>106249495
>>106249469
fried to hell but at least the movement is good
Anonymous No.106249485
>>106249474
yeah, plus it's super fast to make a 1024x1024 img with no hires fix/upscaling.
Anonymous No.106249488 >>106249496 >>106249506 >>106249508
I need kontext clothes changing workflow quick
the one on civitai is trash
Anonymous No.106249495 >>106249668
>>106249479
Yeah, it's only 6 steps on the 5B model, deepfries everything, but it's very fast.
Anonymous No.106249496 >>106249507 >>106250055
>>106249488
>kontext
it's an abortion model. just wait for qwen edit
Anonymous No.106249506
>>106249488
just use wan honestly
Anonymous No.106249507
>>106249496
I can't wait till then, please saar do the needful and share a workflow
Anonymous No.106249508
>>106249488
just use the clothes remover lora and prompt as usual

try this workflow, bypass the 2nd image stuff if using 1 image

https://www.reddit.com/r/StableDiffusion/comments/1m5wpmv/flux_kontext_psa_you_can_load_multiple_images/
Anonymous No.106249521 >>106249863
Anonymous No.106249528
pretty good smoking, wan 2.2 is neat
Anonymous No.106249541
>>106249170
>>106249157
>>106249186
>>106249200
the movement just feels really awkward in anime i2v gens and im not sure why
Anonymous No.106249571
>still no news on the long vid generation front

that's all thats needed then I'd never download another video model ever again
Anonymous No.106249578 >>106249593
in video nodes, crf is related to quality but what's the max quality? it's set to 20. idk what max is or what diminishing returns are for it.
Anonymous No.106249593 >>106249689
>>106249578
max quality is 0
Anonymous No.106249600 >>106249618
>>106248560 (OP)
Here's an interesting workflow: Initial image with Chroma, cleanup with Qwen, then back to Chroma.
Anonymous No.106249618 >>106249791
>>106249600
Anonymous No.106249665
>>106249002
it's Whiplash! A friend of mine is decompiling the game and made a track editor which is how I'm able to get clean screenshots of it. the AI in the game are all named after famous robots from movies, and it's strongly implied they are all actually robots since the game specifically calls you out as human, so I enjoy giving them a face to go with the name. This is Holly, named for the character from Red Dwarf (do NOT look up Holly's actress from the later seasons of the show yeeeeesh).


>>106248971
astra dont do 192 mph fulley fuckin sideways thoughbeit
Anonymous No.106249668
>>106249495
Forgive my ignorance as I have yet to ascend to videochad status but does lowering the CFG not alleviate the frying?
>6 steps
Nice desu
Anonymous No.106249677 >>106249749
>>106249126
Probably because a lot of anime has stiff motion. I haven't done video gen yet, is it possible to render a realistic video first for good movement and then re-render that video in an anime style?
Anonymous No.106249689
>>106249593
10 seems to be decent, 3.3 megs, 0 was 8 megs (cant post)
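For context, assuming the node is encoding H.264 with libx264 through ffmpeg (an assumption, other codecs use different scales): x264's CRF goes from 0 to 51 for 8-bit video, 0 is lossless, 23 is the encoder default, and lower numbers mean bigger files. A quick sketch for encoding the same clip at a few CRF values to compare sizes; the helper names and output file names are placeholders:

```python
def crf_encode_cmd(src, dst, crf=20):
    """Build an ffmpeg/libx264 command for a given CRF (0 = lossless, 51 = worst)."""
    if not 0 <= crf <= 51:
        raise ValueError("libx264 8-bit CRF must be between 0 and 51")
    return [
        "ffmpeg", "-y",
        "-i", str(src),
        "-c:v", "libx264",
        "-crf", str(crf),
        str(dst),
    ]


def crf_sweep(src, crfs=(0, 10, 20)):
    """One command per CRF value, writing crf00.mp4, crf10.mp4, ..."""
    return [crf_encode_cmd(src, f"crf{c:02d}.mp4", c) for c in crfs]
```

Run each command with `subprocess.run(cmd, check=True)` and compare the file sizes against what you can actually see.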
Anonymous No.106249714
>>106249149
damn, pretty cool
Anonymous No.106249749
>>106249677
anime does not move like real life which is why all these anime clips hit uncanny valley hard. anime is an imitation of reality with exaggeration which is why it breaks so many rules of anatomy for the purposes of intent
Anonymous No.106249791 >>106250120
>>106249618
Anonymous No.106249793 >>106249848
anime girls wave hello:
Anonymous No.106249809
Anonymous No.106249821 >>106249847 >>106249866
what was that huggingface model that analysed the image and converted it into a detailed prompt?
Anonymous No.106249838
Anonymous No.106249843
anime girls change clothes

neat how the coat even has a liner.
Anonymous No.106249847
>>106249821
https://huggingface.co/spaces/fancyfeast/joy-caption-alpha-two
https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one

I forget which one was thread favorite.
Anonymous No.106249848
>>106249793
Even without sound, they still keep talking. Girl on left is best girl.
Anonymous No.106249863 >>106249895
>>106249521
Anonymous No.106249866
>>106249821
There's like a dozen (base) models for T5 style.
Anonymous No.106249872
I have a 4090fe, fan flow is basically bottom to top.
Is there any 5090 model that would send the majority of the heat behind the case, blower style?
I'd like to get a second card below the 4090fe, but I can't go fe because the 4090fe would basically get all the heat from the 5090fe.
Anonymous No.106249895 >>106249946
>>106249863
very nice anon
Anonymous No.106249946
>>106249895
TAKE IT DOWN
Anonymous No.106249961
its okay anon you dont have to samefag
Anonymous No.106249962
>>106248617
>/ovg/ in /ldg/
worlds are colliding
Anonymous No.106249979 >>106250049 >>106251737
the girls shake hands

actually worked, neat
Anonymous No.106249987 >>106250036
>>106248666
that's a lot of cream
Anonymous No.106250036
>>106249987
for you
Anonymous No.106250049 >>106251737
>>106249979
and a hug.
Anonymous No.106250055 >>106250069
>>106249496
>qwen edit
was this even announced?
Anonymous No.106250069 >>106250093
>>106250055
>>106249235
Anonymous No.106250093 >>106250108 >>106250109
>>106250069
Nice!
Kontext obsession with "safety" drives me nuts.
Anonymous No.106250108 >>106250120 >>106250140
>>106250093
Qwen-edit will be even safer lol
Anonymous No.106250109
>>106250093
BFL is in the mud right now, BTFO by WAN
Anonymous No.106250120
>>106250108
This. If their base model is this
>>106249791
What do you expect their edit model to be? Kek
Anonymous No.106250140
>>106250108
That's actually impossible
Anonymous No.106250222 >>106250308 >>106250390
wansisters?
Anonymous No.106250237 >>106250254
Did lodestone give up on a Wan tune?
Anonymous No.106250254
>>106250237
he's still doing autism with chroma trying to save his failbake
Anonymous No.106250308 >>106250335 >>106250356 >>106250417
>>106250222
>processing wan
what does this even mean
Anonymous No.106250329
*sips*
Anonymous No.106250335
>>106250308
Who cares what it means as long as we get more good based stuff. All hail china.
Anonymous No.106250356 >>106250442
>>106250308
Qwen Video 120b.
Anonymous No.106250379 >>106250407
>>106249098
I recommend old Italian horror movies like Suspiria (1977) and Opera (1987). Dario Argento's stuff is pretty much timeless due to color and lighting and with Chroma I bet it would look great.
Anonymous No.106250390 >>106250434
>>106250222
>No Chroma
Anonymous No.106250407 >>106250602
>>106250379
Interesting to see you can get this level of likeness with Chroma LoRAs.
Anonymous No.106250417 >>106250623 >>106250725
>>106250308
>Muyang Li
>contributor for ComfyUI-nunchaku
>also contributor for radial attention
>radial attention requires ComfyUI-nunchaku

HAPPENING
Anonymous No.106250434 >>106250464
>>106250390
Guys are only paying attention to Chinkshit, it's insane
Anonymous No.106250442
>>106250356
oh, SaaS only then
Anonymous No.106250451
Qwen prompt following + Wan 2.2 realism + Chroma v48 realism + Noobai/Illustrious artist and tranime knowledge, all-in-one controlnets text/voice editing mixture of experts SSDmaxxing 671B A2B natively omnimodal with realtime 720p upscaled to 1080p video model runnable on laptop 1060 (3gb)-Q3_K_S.gguf

When?
Anonymous No.106250464 >>106250515
>>106250434
Call me the next time a western model doesn't have an unhealthy obsession over ""safety"".
Anonymous No.106250470
Someone make a workflow where you take any celeb's image and swap their clothes with this formal wear
Anonymous No.106250515 >>106250595 >>106250746 >>106251091
>>106250464
chroma is pretty damn hazardous. it looks like it's being retrained from v48 currently? idk, the misinformation here is nuts and their discord has barely any info because they don't know either.
Anonymous No.106250523
>>106249070
kek fuck the other guy this is great, gave me some funny "save the cows" ideas too
Anonymous No.106250524 >>106250601
Anonymous No.106250533 >>106250553 >>106250584 >>106250603 >>106251901
>UniPic2-Metaquery-9B is an unified multimodal model built on Qwen2.5-VL-Instruct and SD3.5-Medium. It delivers end-to-end image understanding, text-to-image (T2I) generation, and image editing. Requires approximately 40 GB VRAM. For NVIDIA RTX 40-series GPUs, we recommend using the Skywork/UniPic2-Metaquery-Flash

https://huggingface.co/Skywork/UniPic2-Metaquery-9B
https://github.com/SkyworkAI/UniPic

I assume Q8/6 fits into 24GB
Anonymous No.106250546 >>106250624
what's the best cfg and shift for wan22
separately for high and low noise
Anonymous No.106250553
>>106250533
16GB for the Flash (lightweight) (cope) version
Anonymous No.106250584
>>106250533
no fine-tune gold rush for qwen sized models so idrc
Anonymous No.106250595 >>106250746 >>106250766
>>106250515
lodestone has been quiet for a while, but apparently yes, he is doing retraining from v48, will be interesting to see what the results are
Anonymous No.106250601
>>106250524
kek they just won't stfu
Anonymous No.106250602 >>106250616
>>106250407
all the glory to anons who train and share
Anonymous No.106250603
>>106250533
>40gb
we should thank nvidia for charging people $5000 for 32gb.
Anonymous No.106250616
>>106250602
I knew he didn't die in that cell
Anonymous No.106250623 >>106250893
>>106250417
considering radial is a snake oil and took him forever to implement I have a right to be worried. he's great at math but shit at everything else like most Chinese in the field
Anonymous No.106250624 >>106250660 >>106250752 >>106250815 >>106250826
>>106250546
I don't know the best, and tbdesu some of it is purely subjective, but official examples use:

T2V 14B:
720p
16 fps
81 frames
40 total steps
shift 3 or 12 (depending if git or hf example)
unipc_bh2 / simple
High noise: cfg 3
Low noise: cfg 4

I2V 14B:
720p
16 fps
81 frames
40 total steps
shift 3 or 5 (depending if git or hf example)
unipc_bh2 / simple
High noise: cfg 3.5
Low noise: cfg 3.5
Anonymous No.106250660 >>106250752
>>106250624
To add to that, official examples use dynamic switching between the two models with a boundary of 0.900 for i2v and 0.875 for t2v.
It should always land you fewer steps in the High Noise model than in the Low Noise one; for 40 steps I get 14+26, and maybe it would be different if it's landscape vs portrait, or with different loras.
Maybe the rule of thumb is 35% of the steps in High then 65% in low, but I don't have enough examples to be sure of that, only that it's for sure not 50/50.
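A rough way to estimate the split without running a gen. This is a sketch under assumptions, not the official code: it assumes a plain linear sigma schedule from 1 toward 0, the usual flow shift formula sigma' = shift * sigma / (1 + (shift - 1) * sigma), and that the High Noise model takes every step whose shifted sigma is at or above the boundary. Real samplers build their schedules a bit differently, so expect to be off by a step or so:

```python
def split_steps(total_steps, shift, boundary):
    """Return (high_noise_steps, low_noise_steps) for a shifted linear schedule."""
    high = 0
    for i in range(total_steps):
        sigma = 1.0 - i / total_steps  # linear schedule, 1.0 at the first step
        shifted = shift * sigma / (1.0 + (shift - 1.0) * sigma)
        if shifted >= boundary:  # still above the switch point: High Noise model
            high += 1
    return high, total_steps - high
```

Under these assumptions `split_steps(40, shift=5, boundary=0.9)` gives 15 high + 25 low, close to the 14+26 above. The split depends on shift as well as the boundary, so a fixed percentage probably won't hold across settings.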
Anonymous No.106250685 >>106250736
anime girl walks out the door behind her.

have a moonwalk.
Anonymous No.106250721 >>106250760
Is there something magic you can put in the negative to get them to stop talking?
Anonymous No.106250725 >>106250758
>>106250417
What is nunchaku and why should I care?
Anonymous No.106250736
>>106250685
anime girl turns around and runs out the door behind her.

not bad
Anonymous No.106250746 >>106251398 >>106251773
>>106250595
>>106250515
I spoke against Chroma versions post v48 and showed some basic tests, especially when primarily prompted with just a bunch of basic tags (20-25+), even in 1152px, and it's not good
>>106229326
>>106229891

But asianfootposter anon talked about how, in his gens and with his prompts, the newer v50 versions are better in 1152px.
>>106230465
>>106230847

He should contact the people in the Discord to maybe help diagnose the problem. If they already really want to spend more money fixing something, they might as well do it as well as possible.

I'm doing some tests on his workflows that I'll post to compare the versions directly soon.
Anonymous No.106250752
>>106250624
>>106250660
thanks I'll try it later
Anonymous No.106250758 >>106250810 >>106250893
>>106250725
Less memory and faster inference. Nunchaku Wan could be like 30-40 seconds on a 4090 if it ever comes out.
Anonymous No.106250760
>>106250721
no. no eggs work without the distill so any scaling doesn't work either
Anonymous No.106250766
>>106250595

That's great to hear. Shitposting aside I'm glad the weirdness of the last two epochs was noticed.
Anonymous No.106250768 >>106251012
Anonymous No.106250779 >>106250786 >>106250815
so for wan 2.2, what scheduler is best/ideal?
Anonymous No.106250786
>>106250779
euler
Anonymous No.106250803 >>106250833 >>106250860 >>106250884
any of you faggots pay for novel ai? is it worth the price of admission?
Anonymous No.106250810
>>106250758
Is it like radial attention or sage 3, aka faster but looking way worse?
Anonymous No.106250815
>>106250779

>>106250624
Anonymous No.106250824
god I hate how close everything is to being fun again but absolute cocksuckers just keep shitting on everything just enough to be uncomfortable
Anonymous No.106250826 >>106250847
>>106250624
>40 total steps
nigger what i do like 6 normally then 3 skip and my videos turn out fine
Anonymous No.106250833 >>106250878
>>106250803
what for?
>anime: noob/illustrious
>realism: flux/qwen/whatever
>video: wan 2.2
>controlnets, free
>checkpoints, free
>gen as much as you like
open source is better.
Anonymous No.106250847
>>106250826
>official examples use
Anonymous No.106250855
>swapping chroma unified model loader with unet loader gguf and swapping ggufs around before returning to the original setup and seed makes the original seed not output anywhere close to the same image until you restart comfyui
Ahh... i assume the problem is the different models tested are all cached in ram but wrongly used/cast later after some bug corrupts the workflow

You really gotta nuke the process every time you want to be "safe" while genning
Anonymous No.106250860 >>106250887
>>106250803
only for the story generator
>local diffusion
Anonymous No.106250872 >>106250895
Who keeps spreading the "official settings" pasta?
Anonymous No.106250878 >>106250924
>>106250833
i played around with the free trial today and it generated pretty much any porn or joke images i wanted instantly and they seemed high quality.
didnt require a lot of prompt autism either i could just kinda organically type what i wanted. just wanted to know what anons thought about it.
Anonymous No.106250884
>>106250803
>pay for a cucked exp
lol
Anonymous No.106250887 >>106250934
>>106250860
>story generator
nigga it's on par or worse than local llms. nai is a joke when it comes to llms
Anonymous No.106250893 >>106250915
>>106250623
The comfyui implementation (based off of their todo list) was meant to be 1st but they kept adding things before it. Adding all the sages make sense though. One anon predicted it would be by the end of the year or next, seems about accurate.

>>106250758
Dont forget it holds context (well radial does) after 81 frames, meaning your gens wont slop out and you can do long vid
Anonymous No.106250895
>>106250872
How is it a pasta? It's literally in the examples wan devs use on github and hf.
Anonymous No.106250915
>>106250893
I don't really care. tired of chinks promising the world and delivering shitty restrictive optimizations nobody uses
Anonymous No.106250924 >>106251038
>>106250878
That's the whole point but the downside to this other than the obvious sending your prompts to some random server is that all NAI gens look the same or rather an experienced genner can spot them from a mile away. I prefer having more control over my outputs than what NAI offers.
Anonymous No.106250934 >>106250950
>>106250887
you got a guide or something so that i can set up an equivalent experience locally?
Anonymous No.106250945 >>106250993
of course
Anonymous No.106250950 >>106250966
>>106250934
right after we read your mind on what specs you have

https://huggingface.co/bartowski/Qwen_Qwen3-30B-A3B-Instruct-2507-GGUF/blob/main/Qwen_Qwen3-30B-A3B-Instruct-2507-Q4_K_S.gguf
https://github.com/LostRuins/koboldcpp/releases
Anonymous No.106250966
>>106250950
and https://github.com/SillyTavern/SillyTavern
Anonymous No.106250970 >>106251084
the camera zooms out on an anime girl wearing a white racing suit sitting on her red ducati motorcycle

pretty damn good considering the initial image:
Anonymous No.106250993
>>106250945
CHINK'D
Anonymous No.106250996
>rapid wan has a nsfw version in v7
Anyone know what it is made from? Just a lora merge?
Anonymous No.106251012 >>106251051
>>106250768
catbox?
Anonymous No.106251038 >>106251078 >>106251105
>>106250924
>some random server
nta but one of the best things about nai is nothing is stored except in the metadata of the image.
Anonymous No.106251051 >>106251109
>>106251012
too much of a mess but here's prompt
>A photo of a 50-60-year-old man posing alone with white hair, white beard stubble, and a devilish, evil grin, wearing a party hat. He stands outdoors in front of a rusty Ford Sprinter-type van. The van has 'FREE CANDY' written in thick marker on its side, with the text in clear focus. The scene includes a reflective surface.
previous thread has link to anon's lora
Anonymous No.106251078
>>106251038
They also use multiple 3rd party analytics scripts. I don't even know why they need more than one.
Anonymous No.106251084
>>106250970
Doesn't have the Golden Boy female body type obviously, but yes, it looks good
Anonymous No.106251091 >>106251112 >>106251126
>>106250515
Most Plebbitors, Civitai users, prompt Chroma like this "cinematic film still, close up, photo of redheaded girl near grasses, fictional landscapes, (intense sunlight:1.4), realist detail, brooding mood, ue5, detailed character expressions, light amber and red, amazing quality, wallpaper, analog film grain, jacket"

Just copying their prompt over from SDXL

Then they complain that the model is shit without even prompting it properly.
Anonymous No.106251105
>>106251038
Sentry and Posthog too, I think.
Also, they don't own their servers. They just rent from CoreWeave, and you don't know what they actually do with your data.
Anonymous No.106251109
>>106251051
thanks
Anonymous No.106251112
>>106251091
And this is not a meme btw. Go on some red boards here, you see most people doing the same whenever a Chroma gen is posted. Then you've got people who think Chroma sucks.
Anonymous No.106251116
an anime girl wearing a white racing suit gets off her red ducati motorcycle. the bike is parked in a parking lot in Tokyo.
Anonymous No.106251126 >>106251207 >>106251237 >>106251538
>>106251091
Is there any guide on proper prompting for chroma?
Anonymous No.106251140 >>106251145 >>106251760
>Git pull
>Wan2.2 preview broken

Wtf.
Anonymous No.106251145
>>106251140
feeling comfortable yet?
Anonymous No.106251146
>he pulled
Anonymous No.106251161
Feels like there should be a way to use multiple GPUs with Kijai's ComfyUI workflow for Wan2.2, yeah?

I got a 3060 and 5070ti. Are there custom nodes that perhaps let me load the image or text embedding models into my 3060 rather than offloading to the CPU? (Or the CLIP or text encoder models perhaps?)

It's using up all 16GB of VRAM on the 5070ti and almost all of my 64GB of RAM.
Anonymous No.106251164
>read update notes
>it's literally nothing
>shit is broken
yeah, I'm thinking comfyorg doesn't have long to live if nothing comes out in long stretches like this.
Anonymous No.106251172 >>106251202 >>106251223 >>106251247
https://rentry.org/wan22ldgguide#kijais-wan22-lightx2v-workflow

The link for "ldg_2_2_t2v_14b_480p.json" appears to be either empty or missing. Can anyone confirm?
Anonymous No.106251202 >>106251382
>>106251172
https://huggingface.co/bullerwins/Wan2.2-T2V-A14B-GGUF/blob/main/wan2_2_14B_t2v_example.png
Anonymous No.106251207 >>106251248
>>106251126
Literally just don't include all this booru crap on the prompt, do not describe the level of detail unless you are going for a 3D render, and assume you're describing a real photo rather than an AI gen. What can be so hard about that anon? Think of every way you can describe what a woman is doing, because most are just trying to prompt for 1girls.
Anonymous No.106251223 >>106251247
>>106251172
>he doesnt have the jiggling ass lora
Anonymous No.106251226
Anonymous No.106251237
>>106251126

That's another issue. There's zero documentation so when their 1girl giant feet prompt doesn't work they dismiss it. The whole process of getting it to cooperate is only through word of mouth right now.
Anonymous No.106251247
>>106251172
>>106251223

>he doesn't have the pregnant belly lora
Anonymous No.106251248
>>106251207
Got it.
Anonymous No.106251262 >>106251266 >>106251268 >>106251674
Hey, new genner here. How/where do i get this style of picture? I know the artist is probably ghibli but how do i get this 'warm' feel to them?
Anonymous No.106251264 >>106251345
tried to get their booba suit, with decent success:

an anime girl wearing a white form fitting racing suit gets off her red ducati motorcycle. the bike is parked in a parking lot in Tokyo. She has large breasts and is showing cleavage with her racing suit.
Anonymous No.106251266 >>106251922
>>106251262
use chatgpt
Anonymous No.106251268
>>106251262
>'warm' feel
Haha warm because it's piss
Anonymous No.106251302 >>106251322 >>106251339 >>106251820
wtfff
Anonymous No.106251322
>>106251302
I call these things "watchers." They are the emanations of the silicon demon
training wizza No.106251339
>>106251302
you fool, you tried dark magic didn't you?
Anonymous No.106251345 >>106251405
>>106251264
this time ill use a diff image and the same prompt, with a diff girl from the show.

i2v magic
Anonymous No.106251382
>>106251202
thanks, that's the workflow I used in my original post and this one.

I'm hoping someone has the wan2.2-lightx2v-t2v workflow that's apparently missing from the guide.
Anonymous No.106251398 >>106251432 >>106251773
>>106250746
>my workflows
I don't really have a special workflow, it's just the default Chroma workflow anon (posted to lodestone's HF page). And the reason I noticed about higher res is because the message anon posted here was also posted to Discord, so lodestone has been made aware of the issue. Though, I do not think he recognized original 1024px v50 as a downgrade over regular Chroma for whatever reason (according to him, background on v48 looked melted, which is retarded because you're losing the photoreal look of your model). So I just conceded to using v48 at the time, but that had more issues (multiple subjects not as good as v49 etc...)
Anonymous No.106251405 >>106251476
>>106251345
Anonymous No.106251432 >>106251663
>>106251398

What's been your technique for multiple subjects? If it makes any difference, I usually gen illustrations and not photos.
Anonymous No.106251438
Anonymous No.106251451
Is there any other website for flux and wan loras that isn't civitai or hf?
Anonymous No.106251476
>>106251405
and for fun, what if you use the prompt for a multi person image?
Anonymous No.106251484
I have to unload the models every time I add or change a lora, right?
Anonymous No.106251521 >>106251561
I've been seeing a lot of videos with zero warping lately, is there something special people are doing? My gens devolve into warping messes if I try anything a little more complex.
Anonymous No.106251538 >>106251556
>>106251126
Add aesthetic 11.
Anonymous No.106251556
>>106251538
OK.
Anonymous No.106251561 >>106251577
>>106251521
warping?
Anonymous No.106251577 >>106251649
>>106251561
Morphing would be a better word I guess, the front of their body suddenly turns into the back and vice versa.
Anonymous No.106251605
the Q5 gguf seems alright
Anonymous No.106251649 >>106251686
>>106251577
are you using a cope quant?
Anonymous No.106251663 >>106251708
>>106251432
No special technique for it. Though v50 is the best for it because it follows the prompt correctly. If you're unsure how to prompt, try using a VLM like Grok 4 or Gemini, but keep in mind those may slop the prompt if you do not change or remove certain words after you get the response. Do keep in mind that like Dalle, anything you describe is possible, because that is the power of Chroma.

As for bad gens, I just take an extra step or change sampler, change a single token like remove a period and add a comma, I try it 2-3 times before just swapping seed. I mean, prior to v49/50, it was normal to get plenty of bad gens, but I haven't really had as many with it. And if you are, then possibly a prompt issue. Just ask LLM to reword it.
Anonymous No.106251674
>>106251262
pee on the image
Anonymous No.106251686 >>106251721 >>106251728 >>106251779
>>106251649
fp8, should I be using fp16 only? I have 2 4090s but haven't been able to get fp16 to work well with offloading.
Anonymous No.106251693 >>106251707 >>106251742
Anonymous No.106251707 >>106251775
>>106251693
very nice, is that kijai or native wf?
Anonymous No.106251708 >>106251796
>>106251663

Interesting, thank you. What samplers and cfg do you switch between? I've seen Euler and bong tangent used here.
Anonymous No.106251721
>>106251686

Fp8 is kind of weird in my experience. Q8 is more similar to fp16 if you're willing to give it a try.
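
A rough way to see why that can happen, as a toy sketch and not a benchmark of any real checkpoint: Q8_0 stores int8 values with one fp16 scale per block of 32 weights, while fp8 e4m3 keeps only 3 mantissa bits per weight. The functions below are simplified simulations (hypothetical names, no e4m3 range clipping or subnormals, Gaussian fake weights), so treat the numbers as illustrative only.

```python
import numpy as np


def fake_fp8_e4m3(x: np.ndarray) -> np.ndarray:
    """Round to 3 explicit mantissa bits (simplified: ignores e4m3 range limits)."""
    m, e = np.frexp(x)            # x = m * 2**e with |m| in [0.5, 1)
    m = np.round(m * 16) / 16     # 1 implicit + 3 explicit mantissa bits
    return np.ldexp(m, e)


def fake_q8_0(x: np.ndarray, block: int = 32) -> np.ndarray:
    """Block-wise int8 with one fp16 scale per block, then dequantize."""
    blocks = x.reshape(-1, block)
    scale = np.abs(blocks).max(axis=1, keepdims=True) / 127.0
    scale = scale.astype(np.float16).astype(np.float32)  # scale is stored as fp16
    scale[scale == 0] = 1.0                               # avoid dividing by zero
    q = np.clip(np.round(blocks / scale), -127, 127)
    return (q * scale).reshape(x.shape)


def rms_error(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.sqrt(np.mean((a - b) ** 2)))


rng = np.random.default_rng(0)
weights = rng.normal(0.0, 0.02, 4096).astype(np.float32)  # assumed toy weights

fp8_err = rms_error(weights, fake_fp8_e4m3(weights))
q8_err = rms_error(weights, fake_q8_0(weights))
print(f"fp8-like rms error: {fp8_err:.2e}")
print(f"q8_0-like rms error: {q8_err:.2e}")
```

On this toy data the Q8_0-style rounding lands closer to the original values, which lines up with Q8 gens looking more like fp16.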
Anonymous No.106251728
>>106251686
i'm using kijai's workflow with fp8_e4m3fn and 2.1 loras and body horror is pretty rare
Anonymous No.106251737 >>106251774
>>106249979
>>106250049
I haven't checked on the goldenboy lora situation in a while, did they ever add this girl who closely resembles my wife
Anonymous No.106251742
>>106251693
SEX
Anonymous No.106251745
that ultrareal lora makes gens very unstable and distorts stuff
Anonymous No.106251760
>>106251140
my wan2.2 preview works fine
Anonymous No.106251773 >>106251814
>>106250746
>>106251398

Oh man, looking at the artifacts on the legs of the second seed girl, could this be a bad quant? I'm just using Q8 from huggingface.co/silveroxides/

The workflows are all the same as from: >>106231063

I just changed the loader to Unet Loader (GGUF) and that's it.

For those that don't want to open the workflows, it should be the chroma lodestone's HF page workflow.
All images are with sampler res_multistep with scheduler beta except the pool girl which had dpmpp_2m sampler.
And the vertical gen which was in 832x1488 instead of 1152x1152.

Full image: https://litter.catbox.moe/2iqj5nx50mhzd7iu.png
Anonymous No.106251774
>>106251737
idk, just using some google image results for test gens.
Anonymous No.106251775
>>106251707
kijai
Anonymous No.106251779
>>106251686
Use fp8_scaled or q8, they're better than fp8.
I rarely had the problem you get, so it's weird it's a big issue in your gens.
Anonymous No.106251796 >>106251854
>>106251708
I stick to res multistep around 35 steps, swap to dpmpp 2m first, and if that doesn't work then swap to heun or dpm 2. During my testing in prior versions I have found Euler to be significantly worse for limb accuracy in realism gens, it could've been terribly bad luck but I kept testing and my results were always the same.

The CFG is 4.5, I don't switch it often.
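
That routine (same seed, walk through samplers, only then bump the seed) can be written down as a small plan generator. This is only a sketch: `retry_plan` and the dict keys are made up for illustration, nothing here talks to ComfyUI, and the number of seeds to try is an assumption.

```python
def retry_plan(base_seed,
               samplers=("res_multistep", "dpmpp_2m", "heun", "dpm_2"),
               seeds_to_try=2, steps=35, cfg=4.5):
    """Yield settings to try in order: all samplers on one seed, then the next seed."""
    for offset in range(seeds_to_try):
        for sampler in samplers:
            yield {
                "seed": base_seed + offset,
                "sampler": sampler,
                "steps": steps,
                "cfg": cfg,
            }


plan = list(retry_plan(42))
for attempt in plan:
    print(attempt)
```

Stop at the first attempt that gives a clean gen; the point is just to keep the seed fixed while swapping samplers so you can tell what actually fixed it.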
Anonymous No.106251814 >>106251905
>>106251773

>V50 gives Cheeto feet

What did lode mean by this?
Anonymous No.106251816 >>106251860
girl climbs a ladder, first try: looking for climbing down so i'll have to be specific.
Anonymous No.106251820
>>106251302
best gen itt
Anonymous No.106251854
>>106251796

Limbs disappearing still shows up every ten or so gens with Euler, so I'll give that a try.
Anonymous No.106251857 >>106251886 >>106252752
Anonymous No.106251859
>Wan 2.2 I2V Footjob
Finally some good fucking food.
Anonymous No.106251860
>>106251816
getting better...
Anonymous No.106251886
>>106251857
Yes please
Anonymous No.106251897 >>106252144
civitai ded
Anonymous No.106251901
>>106250533
look at the girl and rabbit example, this model is fucked up and completely changes the patterns on the blue porcelain woman.
needs more training/tuning or it's DOA
Anonymous No.106251903
she insists on going up.
Anonymous No.106251905
>>106251814
https://e621.net/posts/5539288
Anonymous No.106251922 >>106252040 >>106252128
>>106251266
trying it now, thanks, it's cool (is it possible to jailbreak it for NSFW, or do i need to use this program on something else to get that? Fine if i can't, just curious.)
Anonymous No.106251936
What are the current top models for image Gen?
I haven't followed this stuff since stable diffusion was the best
Anonymous No.106251954
Anonymous No.106251965 >>106251998
Anonymous No.106251989 >>106252021 >>106252081
The drunk anime girl should start to laugh but then fall off her barstool and out of shot at the bottom of the frame.
Anonymous No.106251998
>>106251965
Pretty
Shame about the hands
Anonymous No.106252006
Anonymous No.106252021 >>106252055
>>106251989
so you didn't prompt for her to piss and barf on herself? wan is freaky
Anonymous No.106252040
>>106251922
>jailbreak it for NSFW
no
Anonymous No.106252055 >>106252120
>>106252021
Well I said she was drunk so wan just assumed she would do that.
Anonymous No.106252059 >>106252106
why is there no nunchaku svdquant of Wan?
Anonymous No.106252065 >>106252073 >>106252179
>all of these anime models, loras, images, videos, editing and interpolations

I can finally make my dream 90s anime

>inb4 usual contrarian takes
Anonymous No.106252073 >>106252120
>>106252065
I don't agree with you.
Anonymous No.106252081 >>106252158
>>106251989
interpolated? what do you use for that if so
Anonymous No.106252099 >>106252119
Anonymous No.106252106 >>106252170
>>106252059
Because they have shiny object syndrome, even radial attention made it compatible for wan fusionX lora before releasing a comfyui implementation....fusionSLOP of all things kek
Anonymous No.106252108
asian girl in a swimsuit spins around and does a 360. pretty good.
Anonymous No.106252119 >>106252214
>>106252099
you didnt ask but the details on these are not very good though the style is not particularly offensive
Anonymous No.106252120
>>106252055
Gunsmith Moons lookin sick

>>106252073
Ayy Ok
Anonymous No.106252124 >>106252131
Anonymous No.106252128
>>106251922
get that shit OUT OF HERE this is the LOCAL DIFFUSION general
Anonymous No.106252131 >>106252198
>>106252124
great hellsing character
Anonymous No.106252141
asian girl in a swimsuit does a karate kick

splash
Anonymous No.106252144
>>106251897
Don't get my hopes up.
Anonymous No.106252158
>>106252081
>interpolated
Yes film_net_fp32.pt at 2 multiplier.
Anonymous No.106252170
>>106252106
>he thumbed up that one too :DDD
lmfao pathetic
Anonymous No.106252175
>https://huggingface.co/Kijai/WanVideo_comfy/tree/main/FantasyPortrait
this could be fun (by fun I mean shit)
Anonymous No.106252179
>>106252065
Make sure to post it here when it's finished, fren
Anonymous No.106252183
Anonymous No.106252188 >>106252210
asian girl in a swimsuit flies into the sky like a rocket, with flames emitting from her shoes.

silly prompt on purpose to test but yep, it works.
Anonymous No.106252198
>>106252131

Oh yeah, Zorin is my favorite
Anonymous No.106252200
>>106252196
>>106252196
>>106252196
>>106252196
>>106252196
Anonymous No.106252210
>>106252188
Wan always has these goofy fire effects
I once I2V'd the elephant with a firehose of shit coming out of its ass and wan took the "fire" part very literally
Anonymous No.106252214 >>106252250
>>106252119
You are in a thread where people spam garbage videos and images with unoriginal styles, not on an art workshop, so nitpicking on AI image details does feel kinda pointless

And yeah, the base model is undertrained and goes to shit whenever a hand appears, but that doesn't make me stop appreciating the good things it can do
Statler/Waldorf No.106252220
>>106248658
>>106248908
BEAHAGHAHA
Anonymous No.106252250
>>106252214
>You are in a thread where people spam garbage videos and images with unoriginal styles,
unfortunate you see it that way
Anonymous No.106252752 >>106253045
>>106251857
CAN THIS BE ANY BLURRIER?????
Anonymous No.106253045
>>106252752
Subject, strawberry, is in focus
Anonymous No.106253597
>set crf to 0
>the videos no longer have thumbnails
why?